<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: martimchaves</title><link>https://news.ycombinator.com/user?id=martimchaves</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 12 Apr 2026 20:14:37 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=martimchaves" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Show HN: Smol Invoice Agent, invoice processor that learns from your corrections]]></title><description><![CDATA[
<p>Hey HN, I built a small app, called Smol Invoice Agent.<p>You can upload invoices, have them parsed, and correct the parsed data. The corrections are saved, and the next time you upload a similar invoice, the agent may apply them automatically if they make sense.<p>You can provide feedback on the automatically applied corrections. Rejected corrections are immediately reverted, and you can add an explanation, which will be taken into account the next time the agent encounters a similar invoice.<p>Here's a 4 min demo showing the full feedback loop in action: <a href="https://www.youtube.com/watch?v=txaJ0OhOFVw" rel="nofollow">https://www.youtube.com/watch?v=txaJ0OhOFVw</a><p>Export as JSON or CSV. You can also use it via the API.<p>Tech stack: Postgres (with pgvector), FastAPI, Claude for extraction, Celery + Redis for background processing, TypeScript with React, Voyage AI for embeddings, Mistral for OCR (with pymupdf fallback).<p>It's pay-as-you-go, no subscription required, just top up your account and use as needed. You can try it for free.<p>I'd really appreciate hearing what you think, especially if you deal with invoices regularly. Is the correction/feedback loop something that would actually save you time?<p>Thanks!
Martim</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47693967">https://news.ycombinator.com/item?id=47693967</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 08 Apr 2026 18:04:36 +0000</pubDate><link>https://smolinvoiceagent.com/</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=47693967</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47693967</guid></item><item><title><![CDATA[Show HN: A tool to create and evaluate document processing pipelines for RAG]]></title><description><![CDATA[
<p>Hey HN, I built [ragbandit](<a href="https://ragbandit.com" rel="nofollow">https://ragbandit.com</a>), a tool to help you evaluate different document processing pipelines for the retrieval stage of your RAG systems.<p>I was a bit overwhelmed with the different ways that you can process documents to create embeddings for RAG, so I wanted to create a tool to experiment with different OCR models, refining the OCR results, different chunking methods, and different embedding models.<p>You can:
- search processed documents in the playground
- evaluate the retrieval results using an llm-as-judge (not perfect, but can be a useful signal)
- compare different datasets (using aggregate metrics or by side by side comparison in the playground)<p>You can also manually inspect the results of each query, and of each intermediate document processing result.<p>To get a better idea, check out one of the use cases: <a href="https://ragbandit.com/use-cases/optimizing-insurance-document-retrieval" rel="nofollow">https://ragbandit.com/use-cases/optimizing-insurance-documen...</a><p>To be completely fair, I haven't added that many options for the different stages of the document processing pipeline! There are tons of features that I'd like to add, but I've already spent quite a bit of time on this, so I'd really appreciate it if you could let me know if this is something that could be useful for you/you find interesting. Would you use something like this?<p>Tech stack: Postgres (with pgvector), fastapi, [ragbandit-core](<a href="https://github.com/MartimChaves/ragbandit-core" rel="nofollow">https://github.com/MartimChaves/ragbandit-core</a>) (the document processing core is open source), typescript with react, celery for background tasks (and redis as the broker).<p>It's currently a credits-based subscription with optional top-ups. You can get 1000 credits to try it out (I ask for card info for these 1000 credits as a spam filter).<p>Thanks,
Martim</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47542679">https://news.ycombinator.com/item?id=47542679</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 27 Mar 2026 13:53:11 +0000</pubDate><link>https://ragbandit.com</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=47542679</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47542679</guid></item><item><title><![CDATA[New comment by martimchaves in "Show HN: Geo Racers – Race from London to Tokyo on a single bus pass"]]></title><description><![CDATA[
<p>Very fun game, thanks. When you try to change the speed of the game on mobile using firefox, it glitches and keeps opening the speed modal. Pretty much crashes the browser. Also +1 to the taxi fare not being paid in the currency of the country, but in some other currency. More jobs would be great! Appreciate the difficulty of not knowing exactly where you'll be taken, but without consistent jobs sometimes it's really punishing (or at least feels really punishing). I was thinking it might be cool to add stamina, and fancier hotels may replenish more stamina. An incentive to splurge ah.</p>
]]></description><pubDate>Fri, 13 Feb 2026 17:03:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47004961</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=47004961</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47004961</guid></item><item><title><![CDATA[New comment by martimchaves in "Ingesting PDFs and why Gemini 2.0 changes everything"]]></title><description><![CDATA[
<p>I'm guessing that human accuracy may be lower or around that value, given that handwritten notes are generally difficult to read. A better metric for document parsing might be accuracy relative to human performance (how much better the LLM performs compared to a human).</p>
]]></description><pubDate>Thu, 06 Feb 2025 11:40:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=42961428</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=42961428</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42961428</guid></item><item><title><![CDATA[New comment by martimchaves in "The FizzBuzz that did not get me the job"]]></title><description><![CDATA[
<p>Cool read! I loved it when you changed the numbers to base 15, I thought that was a beautiful solution.</p>
]]></description><pubDate>Sat, 25 Jan 2025 10:54:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=42820881</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=42820881</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42820881</guid></item><item><title><![CDATA[New comment by martimchaves in "Ask HN: What's Your Morning Routine?"]]></title><description><![CDATA[
<p>I feel you, exact same routine here.</p>
]]></description><pubDate>Mon, 30 Dec 2024 16:43:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=42550916</link><dc:creator>martimchaves</dc:creator><comments>https://news.ycombinator.com/item?id=42550916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42550916</guid></item></channel></rss>