<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: yeag123</title><link>https://news.ycombinator.com/user?id=yeag123</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 16 Jun 2026 00:54:49 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=yeag123" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by yeag123 in "Ask HN: What are you working on? (May 2026)"]]></title><description><![CDATA[
<p>I've been working on this for about a year and a half, and decided to finally open source it.<p>I wanted an intelligent document processing SaaS (Document AI, Form Recognizer, the various PDF-to-JSON tools) that you could run on your own hardware.<p>The interesting bits:<p>- Three-tier extraction: PyMuPDF for digital PDFs (~50ms), Docling layout-only for scanned-but-readable, Docling+OCR for the rough stuff. Auto-fallback based on extracted character count.
 - Smart templates use vector similarity (Qdrant) to classify docs, then LLM extraction for fields — no regex, so layout drift doesn't break templates.
 - Local Ollama or Azure OpenAI, switchable per-user.<p>Built on top of Cole Medin's local-ai-packaged. Apache 2.0.<p><a href="https://github.com/nickyeager/fetchtext" rel="nofollow">https://github.com/nickyeager/fetchtext</a></p>
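<p>The auto-fallback between extraction tiers could look roughly like the sketch below. This is a minimal illustration, not the project's actual code: the function name <code>extract_with_fallback</code>, the <code>MIN_CHARS</code> threshold, and the idea of passing each tier in as a plain callable are all assumptions made for the example. In practice the three callables would wrap PyMuPDF, Docling layout-only, and Docling+OCR.

```python
from typing import Callable, List, Tuple

# An extractor takes a file path and returns the extracted text.
Extractor = Callable[[str], str]

# Hypothetical threshold: the real cutoff used by the project is not stated.
MIN_CHARS = 200


def extract_with_fallback(
    path: str,
    tiers: List[Tuple[str, Extractor]],
    min_chars: int = MIN_CHARS,
) -> Tuple[str, str]:
    """Try each (name, extractor) tier in order, cheapest first.

    Returns (tier_name, text) for the first tier whose output contains at
    least min_chars non-whitespace characters. If no tier reaches the
    threshold, returns the longest result seen.
    """
    if not tiers:
        raise ValueError("at least one tier is required")
    best_name, best_text = tiers[0][0], ""
    for name, extractor in tiers:
        text = extractor(path)
        # Count only non-whitespace characters so blank pages do not pass.
        if len("".join(text.split())) >= min_chars:
            return name, text
        if len(text) > len(best_text):
            best_name, best_text = name, text
    return best_name, best_text


if __name__ == "__main__":
    # Stub extractors standing in for the three real tiers.
    demo_tiers = [
        ("digital", lambda p: ""),          # e.g. a scanned PDF with no text layer
        ("layout", lambda p: "abc"),        # too little text, so fall through
        ("ocr", lambda p: "word " * 100),   # OCR finally yields enough text
    ]
    tier, _ = extract_with_fallback("invoice.pdf", demo_tiers)
    print(tier)
```

Ordering the tiers from cheapest to most expensive means digital PDFs exit at the first tier, and only documents that yield too little text pay for layout analysis or OCR.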
]]></description><pubDate>Mon, 11 May 2026 21:52:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48101150</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=48101150</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48101150</guid></item><item><title><![CDATA[New comment by yeag123 in "Ask HN: What Are You Working On? (March 2026)"]]></title><description><![CDATA[
<p>I’m working on a tool to automate manual document workflows, specifically for industries like manufacturing where accounting paperwork is still a manual burden.<p>The workflow: Upload doc → LLM extracts structured data → Generate new doc from template.<p>It’s API-first, includes webhooks, and is built to be self-hosted/self-provisioned for privacy. Still very much a WIP, but looking for feedback on the feature set and the extraction accuracy.<p>URL: <a href="https://fetchtext.io" rel="nofollow">https://fetchtext.io</a></p>
]]></description><pubDate>Mon, 09 Mar 2026 03:24:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47304544</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=47304544</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47304544</guid></item><item><title><![CDATA[New comment by yeag123 in "Ask HN: What are you working on? (February 2026)"]]></title><description><![CDATA[
<p>Working on either a self-hosted or self-"provisioned" document extraction platform. Trying to make it as flexible as possible, so businesses<p>I worked with manufacturing companies, where the amount of manual document extraction and manipulation, particularly from accounting documents, was always a large burden.<p>The goal is to upload a document → extract structured fields via LLM → generate new documents from templates. It has a dashboard, an API, and a webhook, and is still very much a WIP.<p><a href="https://fetchtext.io" rel="nofollow">https://fetchtext.io</a></p>
]]></description><pubDate>Mon, 09 Feb 2026 01:22:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46940484</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=46940484</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46940484</guid></item><item><title><![CDATA[New comment by yeag123 in "Ask HN: Who is hiring? (October 2019)"]]></title><description><![CDATA[
<p>Quoteapro Inc | Software Engineer | San Francisco | Onsite or Remote | Full Time | <a href="https://quoteapro.com" rel="nofollow">https://quoteapro.com</a><p>Quoteapro is helping increase the global recycling rate by automating the complex world of scrap metal brokerage. We work with scrap yards and end processors to safely broker containers of recycled material worldwide. We build tools that grow domestic scrap processors' networks of buyers and automate the tasks needed to sell in the export market.<p>Full job posting: <a href="https://angel.co/l/2iPxPG" rel="nofollow">https://angel.co/l/2iPxPG</a><p>Ideally you'd be:<p>-Detail- and process-oriented
-As excited about developing innovative software as we are
-Eager to be part of a team of creative, confident, thoughtful people who are enthusiastic about increasing the level of global recycling
-Comfortable working in an early-stage startup environment where things move extremely fast and requirements change frequently
-Comfortable and experienced working with distributed team members<p>If you're interested, please email nick@quoteapro.com.</p>
]]></description><pubDate>Tue, 01 Oct 2019 16:22:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=21127233</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=21127233</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=21127233</guid></item><item><title><![CDATA[New comment by yeag123 in "Ask HN: Once you have the skills, how do you start getting freelance jobs?"]]></title><description><![CDATA[
<p>oDesk sort of does this with feedback ratings:
<a href="https://www.odesk.com/" rel="nofollow">https://www.odesk.com/</a>
I'm sure there are other freelance sites that do as well.</p>
]]></description><pubDate>Sun, 03 Jul 2011 20:57:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=2724579</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=2724579</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=2724579</guid></item><item><title><![CDATA[New comment by yeag123 in "Twitter’s Shit Sandwich"]]></title><description><![CDATA[
<p>My understanding is that in order for a developer to receive an xAuth application key, they first have to be vetted by a representative from Twitter. This involves providing a summary of the app, how it will use the API, etc. So xAuth still has some measure of security, although not nearly as much as OAuth.</p>
]]></description><pubDate>Wed, 18 May 2011 22:49:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=2562161</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=2562161</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=2562161</guid></item><item><title><![CDATA[New comment by yeag123 in "Get-shit-done - Easy way to stop distractions"]]></title><description><![CDATA[
<p>A Chrome extension that I use pretty regularly for this sort of thing is Stay Focused: 
<a href="http://goo.gl/gHWFQ" rel="nofollow">http://goo.gl/gHWFQ</a></p>
]]></description><pubDate>Wed, 04 May 2011 17:32:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=2514560</link><dc:creator>yeag123</dc:creator><comments>https://news.ycombinator.com/item?id=2514560</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=2514560</guid></item></channel></rss>