<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sidmanchkanti21</title><link>https://news.ycombinator.com/user?id=sidmanchkanti21</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 23:24:13 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sidmanchkanti21" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Computational complexity of schema-guided document extraction]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.runpulse.com/blog/computational-complexity-of-schema">https://www.runpulse.com/blog/computational-complexity-of-schema</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46589576">https://news.ycombinator.com/item?id=46589576</a></p>
<p>Points: 13</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 12 Jan 2026 15:13:31 +0000</pubDate><link>https://www.runpulse.com/blog/computational-complexity-of-schema</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=46589576</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46589576</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Launch HN: Pulse (YC S24) – Production-grade unstructured document extraction"]]></title><description><![CDATA[
<p>Thanks for testing! Glad the results work well for you</p>
]]></description><pubDate>Thu, 18 Dec 2025 17:37:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46315903</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=46315903</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46315903</guid></item><item><title><![CDATA[Launch HN: Pulse (YC S24) – Production-grade unstructured document extraction]]></title><description><![CDATA[
<p>Hi HN, we’re Sid and Ritvik, co-founders of Pulse (<a href="https://www.runpulse.com/">https://www.runpulse.com/</a>). Pulse is a document extraction system to create LLM-ready text using hybrid VLM + OCR models.<p>Here’s a demo video: <a href="https://video.runpulse.com/video/pulse-platform-walkthrough-69f9">https://video.runpulse.com/video/pulse-platform-walkthrough-...</a>.<p>Later in this post, you’ll find links to before-and-after examples on particularly tricky cases. Check those out to see what Pulse can really do!
Modern vision language models are great at producing plausible text, but that makes them risky for OCR and data ingestion. Plausibility isn’t good enough when you need accuracy.<p>When we started working on document extraction, we assumed the same thing many teams do: foundation models are improving quickly, multi-modal systems appear to read documents well, what’s not to like? And indeed, for small or clean inputs, those assumptions mostly give good results.
However, limitations show up once you begin processing real documents in volume. Long PDFs, dense tables, mixed layouts, low-fidelity scans, and financial or operational data expose errors that are subtle, hard to detect, and expensive to correct. Outputs look reasonable even though they contain small but important mistakes, especially in tables and numeric fields.<p>Running into those challenges got us working. We ran controlled evaluations on complex documents, fine tuned vision models, and built labeled datasets where ground truth actually matters. There have been many nights where our team stayed up hand-annotating pages, drawing bounding boxes around tables, labeling charts point by point, or debating whether a number was unreadable or simply poorly scanned. That process shaped our intuition far more than benchmarks.<p>One thing became clear quickly. The core challenge is not extraction itself, but confidence. Vision language models embed document images into high-dimensional representations optimized for semantic understanding, not precise transcription. That process is inherently lossy. When uncertainty appears, models tend to resolve it using learned priors instead of surfacing ambiguity. This behavior can be helpful in consumer settings. In production pipelines, it creates verification problems that do not scale well.
Pulse grew out of our trying to address this gap through system design rather than prompting alone.<p>Instead of treating document understanding as a single generative step, our system separates layout analysis from language modeling. Documents are normalized into structured representations that preserve hierarchy and tables before schema mapping occurs. Extraction is constrained by schemas defined ahead of time, and extracted values are tied back to source locations so uncertainty can be inspected rather than guessed away. In practice, this results in a hybrid approach that combines traditional computer vision techniques, layout models, and vision language models, because no single approach handles these cases reliably on its own.<p>We are intentionally sharing a few documents that reflect the types of inputs that motivated this work. These are representative of cases where we saw generic OCR or VLM-based pipelines struggle.<p>Here is a financial 10K:
<a href="https://platform.runpulse.com/dashboard/examples/example1">https://platform.runpulse.com/dashboard/examples/example1</a><p>Here is a newspaper:
<a href="https://platform.runpulse.com/dashboard/examples/example2">https://platform.runpulse.com/dashboard/examples/example2</a><p>Here is a rent roll: <a href="https://platform.runpulse.com/dashboard/examples/example3">https://platform.runpulse.com/dashboard/examples/example3</a><p>Pulse is not perfect, particularly on highly degraded scans or uncommon handwriting, and we’re working on improvements. However, our goal is not to eliminate errors entirely, but to make them visible, auditable, and easier to reason about.<p>Pulse is available via usage-based access to the API and platform You can sign up to try it at <a href="https://platform.runpulse.com/login">https://platform.runpulse.com/login</a>. API docs are at <a href="https://docs.runpulse.com/introduction">https://docs.runpulse.com/introduction</a>.<p>We’d love to hear how others here evaluate correctness for document extraction, which failure modes you have seen in practice, and what signals you rely on to decide whether an output can be trusted.<p>We will be around to answer questions and are happy to run additional documents if people want to share examples. Put links in the comments and we’ll plug them in and get back to you.<p>Looking forward to your comments!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46313930">https://news.ycombinator.com/item?id=46313930</a></p>
<p>Points: 40</p>
<p># Comments: 37</p>
]]></description><pubDate>Thu, 18 Dec 2025 15:35:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=46313930</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=46313930</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46313930</guid></item><item><title><![CDATA[Why Tables Are the Hardest Problem in Document AI]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.runpulse.com/blog/the-geometry-problem-why-tables-are-the-hardest-problem-in-document-ai">https://www.runpulse.com/blog/the-geometry-problem-why-tables-are-the-hardest-problem-in-document-ai</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45332890">https://news.ycombinator.com/item?id=45332890</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 22 Sep 2025 13:06:05 +0000</pubDate><link>https://www.runpulse.com/blog/the-geometry-problem-why-tables-are-the-hardest-problem-in-document-ai</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=45332890</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45332890</guid></item><item><title><![CDATA[Evaluating Document Extraction Accuracy]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.runpulse.com/blog/evaluating-document-extraction">https://www.runpulse.com/blog/evaluating-document-extraction</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44879562">https://news.ycombinator.com/item?id=44879562</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 12 Aug 2025 17:42:45 +0000</pubDate><link>https://www.runpulse.com/blog/evaluating-document-extraction</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=44879562</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44879562</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Beyond the Hype: Real-World Tests of Mistral's OCR"]]></title><description><![CDATA[
<p>shoot us any questions you may have, we’ll be monitoring this closely</p>
]]></description><pubDate>Fri, 07 Mar 2025 00:10:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=43286347</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=43286347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43286347</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Putting Andrew Ng's OCR models to the test"]]></title><description><![CDATA[
<p>appreciate it!</p>
]]></description><pubDate>Fri, 28 Feb 2025 07:21:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=43202698</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=43202698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43202698</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Why LLMs still have problems with OCR"]]></title><description><![CDATA[
<p>thanks, glad to hear it.</p>
]]></description><pubDate>Sat, 08 Feb 2025 18:14:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=42984879</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=42984879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42984879</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Why LLMs still have problems with OCR"]]></title><description><![CDATA[
<p>thanks! re: 18-19th century cursive, while we handle historical handwriting, we can't guarantee specific error rates. each document's accuracy varies based on condition, writing style, and preservation. happy to run test samples to check.<p>feel free to send over sample docs: sid [at] trypulse [dot] ai</p>
]]></description><pubDate>Sat, 08 Feb 2025 17:42:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=42984594</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=42984594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42984594</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Why LLMs still have problems with OCR"]]></title><description><![CDATA[
<p>our current customers are both enterprises and individuals.<p>pricing page is here <a href="https://www.runpulse.com/pricing-studio-pulse">https://www.runpulse.com/pricing-studio-pulse</a></p>
]]></description><pubDate>Fri, 07 Feb 2025 22:23:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42978197</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=42978197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42978197</guid></item><item><title><![CDATA[New comment by sidmanchkanti21 in "Why LLMs still suck at OCR"]]></title><description><![CDATA[
<p>fixed the year, good catch</p>
]]></description><pubDate>Fri, 07 Feb 2025 21:57:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=42977972</link><dc:creator>sidmanchkanti21</dc:creator><comments>https://news.ycombinator.com/item?id=42977972</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42977972</guid></item></channel></rss>