<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: raunakchowdhuri</title><link>https://news.ycombinator.com/user?id=raunakchowdhuri</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 00:07:29 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=raunakchowdhuri" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by raunakchowdhuri in "Reducto releases Deep Extract"]]></title><description><![CDATA[
<p>We've made a lot of changes over the past few months that make our standard Extract much, much better, and Deep Extract handles even longer documents. We'd love for you to give it a try!</p>
]]></description><pubDate>Mon, 06 Apr 2026 23:55:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47668975</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=47668975</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47668975</guid></item><item><title><![CDATA[Reducto releases Deep Extract]]></title><description><![CDATA[
<p>Article URL: <a href="https://reducto.ai/blog/reducto-deep-extract-agent">https://reducto.ai/blog/reducto-deep-extract-agent</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47662833">https://news.ycombinator.com/item?id=47662833</a></p>
<p>Points: 44</p>
<p># Comments: 7</p>
]]></description><pubDate>Mon, 06 Apr 2026 16:13:47 +0000</pubDate><link>https://reducto.ai/blog/reducto-deep-extract-agent</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=47662833</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47662833</guid></item><item><title><![CDATA[Show HN: A SOTA chart-extraction system combining traditional CV and LVMs]]></title><description><![CDATA[
<p>Article URL: <a href="https://reducto.ai/blog/reducto-chart-extraction">https://reducto.ai/blog/reducto-chart-extraction</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46151778">https://news.ycombinator.com/item?id=46151778</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 04 Dec 2025 19:33:10 +0000</pubDate><link>https://reducto.ai/blog/reducto-chart-extraction</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=46151778</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46151778</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Shai-Hulud Returns: Over 300 NPM Packages Infected"]]></title><description><![CDATA[
<p>We have a Slack channel with them; these are the versions they mentioned:
posthog-node 4.18.1
posthog-js 1.297.3
posthog-react-native 4.11.1
posthog-docusaurus 2.0.6</p>
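<p>A quick way to check a dependency list against the versions above. This is an illustrative sketch, not an official remediation tool: the input is a simple name-to-version dict, as you might extract from a package-lock.json.</p>

```python
# Known-compromised versions, as listed in the comment above.
COMPROMISED = {
    "posthog-node": "4.18.1",
    "posthog-js": "1.297.3",
    "posthog-react-native": "4.11.1",
    "posthog-docusaurus": "2.0.6",
}

def find_compromised(installed):
    """Return the subset of installed packages pinned to a known-bad version."""
    return {
        name: ver
        for name, ver in installed.items()
        if COMPROMISED.get(name) == ver
    }

installed = {"posthog-js": "1.297.3", "react": "18.3.1"}
print(find_compromised(installed))  # -> {'posthog-js': '1.297.3'}
```

<p>In a real project you'd feed this from your lockfile (or just run your package manager's audit tooling), since the compromised releases may have been pulled from the registry by now.</p>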
]]></description><pubDate>Mon, 24 Nov 2025 17:00:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46036234</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=46036234</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46036234</guid></item><item><title><![CDATA[We did a DB migration without logical replication – with zero downtime]]></title><description><![CDATA[
<p>Article URL: <a href="https://reducto.ai/blog/reducto-database-migration-zero-downtime">https://reducto.ai/blog/reducto-database-migration-zero-downtime</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45363901">https://news.ycombinator.com/item?id=45363901</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 24 Sep 2025 18:06:47 +0000</pubDate><link>https://reducto.ai/blog/reducto-database-migration-zero-downtime</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=45363901</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45363901</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>We're fixing it! For some reason this happens on only _some_ phones in our office, so it was hard to repro. I think it has to do with Safari rendering. We'll tone down our WebGPU usage.</p>
]]></description><pubDate>Tue, 24 Jun 2025 01:03:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=44361869</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=44361869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44361869</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>dang no way! we were both in boston too</p>
]]></description><pubDate>Mon, 23 Jun 2025 21:32:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=44360374</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=44360374</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44360374</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>this is exactly where we're going with this! glad you see the vision :)</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:41:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358729</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=44358729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358729</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>yep!</p>
]]></description><pubDate>Mon, 23 Jun 2025 16:42:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=44357575</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=44357575</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44357575</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Mistral OCR"]]></title><description><![CDATA[
<p>comparisons to more outputs coming soon!</p>
]]></description><pubDate>Fri, 07 Mar 2025 08:00:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=43288111</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=43288111</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43288111</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Mistral OCR"]]></title><description><![CDATA[
<p>We ran some benchmarks comparing against Gemini 2.0 Flash. You can find the full writeup here: <a href="https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini">https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini</a><p>A high-level summary: while this is an impressive model, it underperforms even current SOTA VLMs on document parsing, and it tends to hallucinate OCR output, mangle table structure, and drop content.</p>
]]></description><pubDate>Fri, 07 Mar 2025 03:54:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=43287278</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=43287278</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43287278</guid></item><item><title><![CDATA[Evaluating Mistral OCR Against Gemini 2.0 Flash]]></title><description><![CDATA[
<p>Article URL: <a href="https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini">https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43287210">https://news.ycombinator.com/item?id=43287210</a></p>
<p>Points: 15</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 07 Mar 2025 03:36:11 +0000</pubDate><link>https://reducto.ai/blog/lvm-ocr-accuracy-mistral-gemini</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=43287210</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43287210</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Ingesting PDFs and why Gemini 2.0 changes everything"]]></title><description><![CDATA[
<p>CTO of Reducto here. Love this writeup!<p>We’ve generally found that Gemini 2.0 is a great model and have tested it (and nearly every VLM) very extensively.<p>A big part of our research focus is incorporating the best of what new VLMs offer without losing the benefits and reliability of traditional CV models. A simple example: we’ve found bounding-box-based attribution to be a non-negotiable for many of our current customers. Citing the specific region in a document where an answer came from becomes (in our opinion) even MORE important when using large vision models in the loop, as there is a continued risk of hallucination.<p>Whether that matters in your product is ultimately use-case dependent, but the more important challenge for us has been reliability in outputs. RD-TableBench currently uses a single table image on a page, but when testing with real-world dense pages we find that VLMs deviate more. Sometimes that involves minor edits (summarizing a sentence but preserving meaning), but sometimes it’s a more serious case such as hallucinating large sets of content.<p>The more extreme case is that internally we fine-tuned a version of Gemini 1.5 along with base Gemini 2.0, specifically for checkbox extraction. We found that even with a broad distribution of checkbox data we couldn’t prevent frequent checkbox hallucination on both the Flash (+17% error rate) and Pro (+8% error rate) models. Our customers in industries like healthcare expect us to get it right, out of the box, deterministically, and our team’s directive is to get as close as we can to that ideal state.<p>We think that the ideal state involves a combination of the two. The flexibility that VLMs provide, for example with cases like handwriting, is what I think will make it possible to go from 80 or 90 percent accuracy to some number very close to 99%.<p>I should note that the Reducto performance for table extraction is with our pre-VLM table parsing pipeline, and we’ll have more to share in terms of updates there soon.<p>For now, our focus is entirely on the performance frontier (though we do scale costs down with volume). In the longer term, as inference becomes more efficient, we want to move the needle on cost as well.<p>Overall though, I’m very excited about the progress here.<p>---
One small comment on your footnote: the evaluation script with the Needleman-Wunsch algorithm doesn’t actually consider the headers output by the models; it looks only at the table structure itself.</p>
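<p>For readers unfamiliar with it, Needleman-Wunsch is the classic dynamic-programming algorithm for global sequence alignment, here applied to sequences of table cells rather than DNA bases. A minimal sketch follows; the scoring values (match=1, mismatch=-1, gap=-1) are illustrative assumptions, not the benchmark's actual parameters.</p>

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align two sequences (e.g. rows of table cells) and
    return the optimal alignment score."""
    n, m = len(a), len(b)
    # dp[i][j] = best score aligning a[:i] with b[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = dp[i - 1][0] + gap          # a[:i] vs empty prefix
    for j in range(1, m + 1):
        dp[0][j] = dp[0][j - 1] + gap          # empty prefix vs b[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = dp[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            dp[i][j] = max(diag,               # align a[i-1] with b[j-1]
                           dp[i - 1][j] + gap, # gap in b
                           dp[i][j - 1] + gap) # gap in a
    return dp[n][m]

# Compare a predicted row of cells against a labeled row:
gold = ["2021", "Revenue", "$1.2M"]
pred = ["2021", "Revenue", "$1.2M", ""]  # one spurious extra cell
print(needleman_wunsch(gold, pred))  # -> 2 (three matches, one gap)
```

<p>Scoring cell sequences this way penalizes structural errors (missing, merged, or extra cells) without requiring an exact one-to-one cell correspondence up front.</p>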
]]></description><pubDate>Wed, 05 Feb 2025 20:02:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=42954289</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=42954289</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42954289</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Ingesting PDFs and why Gemini 2.0 changes everything"]]></title><description><![CDATA[
<p>I'd encourage you to take a look at some of the real data here!
<a href="https://huggingface.co/spaces/reducto/rd_table_bench" rel="nofollow">https://huggingface.co/spaces/reducto/rd_table_bench</a><p>You'll find that most of the errors are structural issues with the table or an inability to parse certain special characters. Tables can get crazy!</p>
]]></description><pubDate>Wed, 05 Feb 2025 19:32:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=42953853</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=42953853</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42953853</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Rd-TableBench – Accurately evaluating table extraction"]]></title><description><![CDATA[
<p>Love the PubTables work! It's a really useful dataset. Their data comes from existing annotations in scientific papers, so in our experience it doesn't include many of the hardest cases that methods fail on today. The annotations are computer-generated rather than manually labeled, so you don't get things like scanned and rotated images, or much diversity in languages.<p>I'd encourage you to take a look at some of our data points to compare for yourself! Link: huggingface.co/spaces/reducto/rd_table_bench<p>In terms of the overall importance of table extraction, we've found it to be a key bottleneck for folks doing document parsing. It's among the hardest problems in the space, alongside complex form region parsing. I don't have the exact statistics handy, but I'd estimate that ~25% of the pages we parse have some hairy tables in them!</p>
]]></description><pubDate>Tue, 05 Nov 2024 19:09:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=42054322</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=42054322</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42054322</guid></item><item><title><![CDATA[Rd-TableBench – Accurately evaluating table extraction]]></title><description><![CDATA[
<p>Hey HN!<p>A ton of document parsing solutions have been coming out lately, each claiming SOTA with little evidence. A lot of these turned out to be LLM or LVM wrappers that hallucinate frequently on complex tables.<p>We just released RD-TableBench, an open benchmark to help teams evaluate extraction performance for complex tables. The benchmark includes a variety of challenging scenarios including scanned tables, handwriting, language detection, merged cells, and more.<p>We employed an independent team of PhD-level human labelers who manually annotated 1000 complex table images from a diverse set of publicly available documents.<p>Alongside this, we also release a new bioinformatics-inspired algorithm for grading table similarity. Would love to hear any feedback!<p>-Raunak</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42054144">https://news.ycombinator.com/item?id=42054144</a></p>
<p>Points: 29</p>
<p># Comments: 6</p>
]]></description><pubDate>Tue, 05 Nov 2024 18:46:31 +0000</pubDate><link>https://reducto.ai/blog/rd-tablebench</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=42054144</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42054144</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Launch HN: Parity (YC S24) – AI for on-call engineers working with Kubernetes"]]></title><description><![CDATA[
<p>hmmm idk how I would feel about giving an llm cluster access from a security pov</p>
]]></description><pubDate>Mon, 26 Aug 2024 15:33:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=41358144</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=41358144</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41358144</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Show HN: K8sAI – open-source GPT CLI tool for Kubernetes"]]></title><description><![CDATA[
<p>Interesting... how did you do the scraping of the documentation?</p>
]]></description><pubDate>Thu, 02 May 2024 15:15:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=40237255</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=40237255</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40237255</guid></item><item><title><![CDATA[Show HN: Reducto – A vision based document ingestion API for LLMs]]></title><description><![CDATA[
<p>Hey HN, I'm Raunak from Reducto (<a href="https://reducto.ai">https://reducto.ai</a>), a high-quality document ingestion API tailored for language models. We developed Reducto to address our own need - no existing parsing solutions provided the accuracy and speed necessary for our user-facing AI applications. We designed a system that comprehends documents visually (like a human), ignoring document metadata and processing each page as an image to ensure the highest possible accuracy (with benchmarks to prove it).<p>Please give our demo a try with some of your own PDFs or reach out at founders@reducto.ai if you’d like to start using Reducto in production.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39523565">https://news.ycombinator.com/item?id=39523565</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 27 Feb 2024 13:04:31 +0000</pubDate><link>https://www.reducto.ai/blog/document-api</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=39523565</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39523565</guid></item><item><title><![CDATA[New comment by raunakchowdhuri in "Open Sourcing Remembrall: A Long-Term Memory Proxy for LLMs"]]></title><description><![CDATA[
<p>Hey HN,<p>A few weeks ago I shared a beta of Remembrall here and got a lot of great feedback from people in this community, with one of the most common requests being to open source the project.<p>Excited to share that we’re doing exactly that!<p>Remembrall is a proxy (integrates in two lines!) on top of your OpenAI queries that uses GPT to save/update important details from each user’s conversations into a vector db. When the user continues the conversation we query the db for relevant info and prepend it into the system prompt. We have a lot of improvements in the works (function calling, queryable user profiles, and more) and would love your feedback on features you’d like to see!</p>
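<p>The proxy pattern described above can be sketched in a few lines. This is a toy illustration, not Remembrall's actual API: naive word overlap stands in for vector-database similarity search, and all names here are hypothetical.</p>

```python
from collections import defaultdict

memory_store = defaultdict(list)  # user_id -> list of remembered facts

def save_memory(user_id, fact):
    """Persist a detail extracted from a user's conversation."""
    memory_store[user_id].append(fact)

def retrieve(user_id, message, k=2):
    """Rank stored facts by word overlap with the incoming message
    (a stand-in for vector similarity search)."""
    words = set(message.lower().split())
    scored = sorted(
        memory_store[user_id],
        key=lambda fact: len(words & set(fact.lower().split())),
        reverse=True,
    )
    return scored[:k]

def build_prompt(user_id, system_prompt, message):
    """Prepend retrieved memories to the system prompt, as the proxy
    would before forwarding the request to the upstream model."""
    memories = retrieve(user_id, message)
    if memories:
        system_prompt = (
            "Known about this user:\n- " + "\n- ".join(memories)
            + "\n\n" + system_prompt
        )
    return {"system": system_prompt, "user": message}

save_memory("u1", "prefers vegetarian recipes")
save_memory("u1", "lives in Boston")
print(build_prompt("u1", "You are a helpful chef.", "suggest vegetarian dinner ideas"))
```

<p>The real system would additionally use an LLM call to decide which details are worth saving or updating, and embeddings rather than word overlap for retrieval.</p>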
]]></description><pubDate>Thu, 26 Oct 2023 16:03:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=38027519</link><dc:creator>raunakchowdhuri</dc:creator><comments>https://news.ycombinator.com/item?id=38027519</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38027519</guid></item></channel></rss>