<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: adit_a</title><link>https://news.ycombinator.com/user?id=adit_a</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 01:55:33 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=adit_a" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by adit_a in "Reducto releases Deep Extract"]]></title><description><![CDATA[
<p>We're releasing an open dataset for challenging structured extraction tasks as a starting point for people to do any comparisons soon!<p>vikp and the Datalab team have done great work in the space, but their structured extraction product is closer to our baseline /extract api since both of those are single pass extractions.<p>Deep Extract is more accurate than any structured extraction product we've tried, <i>but</i> the approach comes with a very clear cost/latency tradeoff over a single pass extraction. We have free credits if you'd like to do a side by side</p>
]]></description><pubDate>Mon, 06 Apr 2026 17:59:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47664477</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=47664477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47664477</guid></item><item><title><![CDATA[New comment by adit_a in "Show HN: Jmail – Google Suite for Epstein files"]]></title><description><![CDATA[
<p>This might be our coolest case study yet. Thanks for the mention!</p>
]]></description><pubDate>Sun, 21 Dec 2025 19:35:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=46347591</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=46347591</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46347591</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Hahaha, a while ago (even before choosing this idea space) we said we would build "magical tools for developers" and Reducto was the name we landed on out of a long list of magic adjacent things</p>
]]></description><pubDate>Sat, 28 Jun 2025 01:02:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=44401607</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44401607</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44401607</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Yes in the sense that we have features that will create persisted share links, and by default you can revisit results in your free account until you decide to delete them.<p>If helpful, we also offer free trial accounts with zero data retention if that's important for your use case</p>
]]></description><pubDate>Tue, 24 Jun 2025 17:41:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=44368727</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44368727</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44368727</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Code!</p>
]]></description><pubDate>Mon, 23 Jun 2025 21:41:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=44360449</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44360449</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44360449</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Appreciate the thoughtful note and want to wish you guys the best as well!</p>
]]></description><pubDate>Mon, 23 Jun 2025 21:40:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=44360445</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44360445</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44360445</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Thanks! We have a lot of respect for the work VikP and his team did on Surya but we haven't benchmarked his newer pipeline so I don't want to make a 1:1 claim.<p>If you want to do a side by side with your use case we'd be happy to set you up with free trial access.</p>
]]></description><pubDate>Mon, 23 Jun 2025 19:00:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358919</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358919</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Yeah, we're extremely excited about the potential of building a flywheel for each individual customer's pipeline.</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:55:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358876</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358876</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358876</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>We have a default DPA we're willing to sign on all tiers -- the note in the pricing page is meant to refer to custom/redlined DPAs that become complex to manage over time<p>We'll edit that to make it more clear</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:53:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358851</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358851</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358851</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Ah yeah I remember! Great to hear from you and thanks :)</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:52:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358834</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358834</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Would love to help if you end up having any use cases in the future!</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:51:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358827</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358827</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358827</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Hey, we've never used or even attempted to use your platform. Respectfully I think you know that, and that you also know that your team has tried to get access to ours using personal gmail accounts dating back to 2024.<p>A schema builder with nested array fields has been part of our playground (and nearly every structured extraction solution) for a very long time and is just not something that we even view as a defining part of the platform.</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:51:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358822</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358822</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358822</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Thank you!<p>To clarify, our API was already fully launched and in prod with customers when we raised our series A. This launch is specifically for the platform we're building around the API :)</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:16:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358502</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358502</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358502</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Thank you! What's the error you're seeing on mobile?</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:15:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358494</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358494</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Thanks! To clarify, we launched our document processing APIs a while ago. This launch is specifically for a new platform we're building around our API based on all of the things our customers previously had to build internally to support their use of Reducto (eval tools, monitoring etc).<p>Generally speaking, my view on the space is that this was crowded well before LLMs. We've met a lot of the folks that worked on things like drivers for printers to print PDFs in the 1990s, IDP players from the last few decades, and more recent cloud offerings.<p>The context today is clearly very different than it was in the IDP era though (human process with semi-structured content -> LLMs are going to reason over most human data), and so is the solution space (VLMs are an incredible new tool to help address the problem).<p>Given that I don't think it's surprising that companies inside and outside of YC have pivoted into offering document processing APIs over the past year. Generally speaking we don't see differentiation in the sense of just feature set since that'll converge over time, and instead primarily focus on accuracy, reliability, and scalability, all 3 of which have a very substantive impact from last mile improvements. I think the best testament I have to that is that the customers we've onboarded are very technical, and as a result are very thorough when choosing the right solution for them. That includes a company wide roll out at one of the 4 biggest tech companies, one of the 3 biggest trading firms, and a big set of AI product teams like Harvey, Rogo, ScaleAI etc.<p>At the end of the day I don't see VLM improvements as antagonistic to what we're doing. We already use them a lot for things like an agentic OCR (correcting mistakes from our traditional CV pipeline). On some level our customers aren't just choosing us for PDF->markdown, they're onboarding with us because they want to spend more of their time on the things that are downstream from having accurate data, and I expect that there'll be room for us to make that even more true as models improve.</p>
]]></description><pubDate>Mon, 23 Jun 2025 18:14:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358486</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358486</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358486</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Let us know if you have any feedback!</p>
]]></description><pubDate>Mon, 23 Jun 2025 17:36:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358131</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358131</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358131</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>The direct loom link isn't working for you? Are you seeing the same redirects error?</p>
]]></description><pubDate>Mon, 23 Jun 2025 17:35:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=44358111</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44358111</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44358111</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Thank you! We worked with Airfoil for the website :)</p>
]]></description><pubDate>Mon, 23 Jun 2025 16:26:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44357403</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44357403</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44357403</guid></item><item><title><![CDATA[New comment by adit_a in "Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast"]]></title><description><![CDATA[
<p>Fixed! Sorry about that</p>
]]></description><pubDate>Mon, 23 Jun 2025 16:26:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=44357395</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44357395</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44357395</guid></item><item><title><![CDATA[Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast]]></title><description><![CDATA[
<p>Hi HN! We’re Adit and Raunak, co-founders of Reducto (YC W24, <a href="https://reducto.ai">https://reducto.ai</a>). Reducto turns unstructured documents (e.g., PDFs, scans, spreadsheets) into structured data. This data can then be used for retrieval, passed into LLMs, or used elsewhere downstream.<p>We started Reducto when we realized that so many of today’s AI applications require good quality data. Everyone knows that good inputs lead to better outputs, but 80% of the world’s data is still trapped inside of things like messy PDFs and spreadsheets. Raunak and I launched a really early MVP of parsing and extracting from unstructured documents, and were lucky to have a lot of interest from technical teams when they realized that the accuracy was something they hadn’t seen before.<p>We started by just releasing an API for engineers to build with, but over time we realized that an accurate API was only part of the puzzle. Our customers wanted to be able to easily set up multi step pipelines, evaluate and iterate on performance within their use case, and work with non-engineering teammates that were also involved in the real world document processing flow.<p>That’s why we’re launching Reducto Studio, a web platform that sits on top of our APIs for users to build and iterate on end-to-end document pipelines.<p>With Studio, you can:<p>- Drop an entire file set and get per-field and per-document accuracy scores against your eval data.<p>- Auto-generate and continuously optimize extraction schemas to hit production-grade quality fast.<p>- Save every run, iterate on parse/extract configs, and compare results side-by-side.<p>You can see some examples here (<a href="https://studio.reducto.ai">https://studio.reducto.ai</a>) or you can watch this walkthrough: <a href="https://www.loom.com/share/b243551741c642c6a594c00353fcecb3" rel="nofollow">https://www.loom.com/share/b243551741c642c6a594c00353fcecb3</a>.<p>If you’d like to upload your own document you can log in and do so as well - we don’t make you book a demo or put a payment down to try it.<p>Thanks for reading and checking it out! This is only the first step for Studio, so we’d love feedback on anything: UX rough edges (we know they’re there!), features that would make evaluations better for you, hard documents you’ve had trouble with, or anything else about wrangling with unstructured data.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44356799">https://news.ycombinator.com/item?id=44356799</a></p>
<p>Points: 85</p>
<p># Comments: 55</p>
]]></description><pubDate>Mon, 23 Jun 2025 15:30:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=44356799</link><dc:creator>adit_a</dc:creator><comments>https://news.ycombinator.com/item?id=44356799</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44356799</guid></item></channel></rss>