<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: faxmeyourcode</title><link>https://news.ycombinator.com/user?id=faxmeyourcode</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 10 Apr 2026 09:24:12 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=faxmeyourcode" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by faxmeyourcode in "25 Years of Eggs"]]></title><description><![CDATA[
<p>Wow, I didn't realize some RFID could reach 15 feet out - that's good to know. I naively thought you essentially had to be touching the surface of the tag.</p>
]]></description><pubDate>Mon, 23 Mar 2026 18:52:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47493598</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47493598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47493598</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Qwen3.5 Fine-Tuning Guide"]]></title><description><![CDATA[
<p>If you treat LLMs as generic transformers, you can fine tune with a ton of examples of input output pairs. For messy input data with lots of examples already built, this is ideal.<p>At my day job we have experimented with fine tuned transformers for our receipt processing workflow. We take images of receipts, run them through OCR (this step might not even be necessary, but we do it at scale already anyways), and then take the OCR output text blobs and "transform" them into structured receipts with retailer, details like zip code, transaction timestamps, line items, sales taxes, sales, etc.<p>I trained a small LLM (mistral-7b) via SFT with 1000 (maybe 10,000? I don't remember) examples from receipts in our database from 2019. When I tested  the model on receipts from 2020 it hit something like 98% accuracy.<p>The key that made this work so well is that we had a ton of data (potentially billions of example input/output pairs) and we could easily evaluate the correctness by unpacking the json output and comparing with our source tables.<p>Note that this isn't running in production, it was an experiment. There are edge cases I didn't consider, and there's a lot more to it in terms of accurately evaling, when to re-train, dealing with net new receipt types, retailers, new languages (we're doing global expansion RN so it's top of mind), general diversity of edge cases in your training data, etc.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:15:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324409</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47324409</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324409</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Qwen3.5 Fine-Tuning Guide"]]></title><description><![CDATA[
<p>Especially for super constrained applications. I don't care if the language model that I use for my extremely specific business domain can solve PhD math or remember the works of Shakespeare. I'd trade all of that for pure task specific accuracy.</p>
]]></description><pubDate>Wed, 04 Mar 2026 17:14:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47250644</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47250644</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47250644</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Qwen3.5 Fine-Tuning Guide"]]></title><description><![CDATA[
<p>Labeling or categorization tasks like this are the bread and butter of small fine tuned models. Especially if you need outputs in a specific json format or whatever.<p>I did an experiment where I did very simple SFT on Mistral 7b and it was extremely good at converting receipt images into structured json outputs and I only used 1,000 examples. The difficulty is trying to get a diverse enough set of examples, evaling, etc.<p>If you have great data with simple input output pairs, you should really give it a shot.</p>
]]></description><pubDate>Wed, 04 Mar 2026 17:06:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47250531</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47250531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47250531</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Claude's Cycles [pdf]"]]></title><description><![CDATA[
<p>> Filip also told me that he asked Claude to continue on the even case after the odd case had been resolved. “But there after a while it seemed to get stuck. In the end, it was not even able to write and run explore programs correctly anymore, very weird. So I stopped the search.”<p>Interesting snippet towards the end. I wonder if they were using claude.ai or claude code. Sounds like they ran out of context and entered the "dumb zone."</p>
]]></description><pubDate>Tue, 03 Mar 2026 16:56:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47235247</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47235247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47235247</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Google Street View in 2026"]]></title><description><![CDATA[
<p>Yea, I agree. The dataset is < 100MB... so duckdb can very easily handle this on an old macbook air.<p><a href="https://duckdb.org/2025/05/19/the-lost-decade-of-small-data" rel="nofollow">https://duckdb.org/2025/05/19/the-lost-decade-of-small-data</a></p>
]]></description><pubDate>Thu, 26 Feb 2026 19:16:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47170708</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47170708</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47170708</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "The First Fully General Computer Action Model"]]></title><description><![CDATA[
<p>Neel, this is really cool. How long have you been working on this, and where did you guys get inspiration from? Did you work on vlms earlier or something like that? Just curious.<p>Also, thanks for choosing a technical blog post for presenting this information.</p>
]]></description><pubDate>Thu, 26 Feb 2026 04:20:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47161799</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=47161799</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47161799</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Ask HN: What are you working on? (February 2026)"]]></title><description><![CDATA[
<p>I love the idea behind MyVisualRoutine as a father with a disabled kiddo, thanks for sharing.<p>The app is beautiful - much better than I could build - what tech is it using if you don't mind me asking? Is it flutter, react native, something else? Just want to get better at mobile dev.</p>
]]></description><pubDate>Mon, 09 Feb 2026 15:43:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=46946439</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46946439</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46946439</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "UEFI Bindings for JavaScript"]]></title><description><![CDATA[
<p>Love this. An example of complete and total dominion over the machine. Great quote here too lol<p>> Prometheus stole fire from the gods and gave it to man. For this he was chained to a rock and tortured for eternity.</p>
]]></description><pubDate>Mon, 09 Feb 2026 15:00:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46945942</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46945942</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46945942</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Claude Opus 4.6"]]></title><description><![CDATA[
<p>Everybody is different, I simply cannot stand the sight of chatgpt styled writing. Give me paragraphs.</p>
]]></description><pubDate>Fri, 06 Feb 2026 14:21:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46913155</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46913155</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46913155</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Some notes on starting to use Django"]]></title><description><![CDATA[
<p>I've lobbied to replace our internal tool with a django admin panel. I prototyped it and it showed that it would reduce our code by > 15k lines.<p>Any internal webapps I need to build like this will 100% be set up with django in the future due to this. I don't need it to be pretty, I just want the UI, database migrations, users, roles, groups, etc for free</p>
]]></description><pubDate>Thu, 29 Jan 2026 05:40:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=46806243</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46806243</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46806243</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Vibe coding kills open source"]]></title><description><![CDATA[
<p>One trick to get out of this scenario where you're writing a ton is to ask the model to interview until we're in alignment on what is being built. Claude and open code both have an AskUserQuestionTool which is really nice for this and cuts down on explanation a lot. It becomes an iterative interview and clarifies my thinking significantly.</p>
]]></description><pubDate>Mon, 26 Jan 2026 14:20:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=46765936</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46765936</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46765936</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Vibe coding kills open source"]]></title><description><![CDATA[
<p>> LLMs dont care about the story, they just care about the current state of the code<p>You have to tell it about the backstory. It does not know unless you write about it somewhere and give it as input to the model.</p>
]]></description><pubDate>Mon, 26 Jan 2026 14:17:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46765896</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46765896</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46765896</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "AI is a horse (2024)"]]></title><description><![CDATA[
<p>I'm sure the same could be said about tractors when they were coming on the scene.<p>There was probably initial excitement about not having to manually break the earth, then stories spread about farmers ruining entire crops with one tractor, some farms begin touting 10x more efficiency by running multiple tractors at once, some farmers saying the maintenance burden of a tractor is not worth it compared to feeding/watering their mule, etc.<p>Fast forward and now gigantic remote controlled combines are dominating thousands of acres of land with the efficiency greater than 100 men with 100 early tractors.</p>
]]></description><pubDate>Fri, 23 Jan 2026 15:25:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46733611</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46733611</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46733611</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Can AI Do My Bookkeeping?"]]></title><description><![CDATA[
<p>You should check out plain text accounting software Beancount. I've built a skill for this here <a href="https://github.com/ouachitalabs/skills/blob/main/beancount/SKILL.md" rel="nofollow">https://github.com/ouachitalabs/skills/blob/main/beancount/S...</a> and I use it for my custom personal finance agent.</p>
]]></description><pubDate>Thu, 22 Jan 2026 20:14:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=46724596</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46724596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46724596</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "The Cathedral, the Megachurch, and the Bazaar"]]></title><description><![CDATA[
<p>> The TL;DR was that old open source was the cathedral of exclusive developers and groups. Then the Bazaar showed up (which was the Linux Kernel for example) and that freed us from the shackles of the cathedral.<p>I didn't make it past the tldr lol is this some kind of poisoned data for GPT 6?</p>
]]></description><pubDate>Sun, 18 Jan 2026 18:52:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46670892</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46670892</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46670892</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "ThinkNext Design"]]></title><description><![CDATA[
<p>I ran a T450s all through college. My late grandmother gifted me the money after high school graduation to buy my first laptop. I'd learned to code on the old gateway pentium 4 (Windows XP) machine with 512 MB of RAM from the factory. I'd resurrected it as a teenager and completely gutted it. I scoured the web endlessly looking for something that was _not_ touchscreen (hard to find at the time), linux-compatible, and rugged enough for the neglect of a college student's backpack.<p>It was a linux user's dream. I swapped out the spinny HDD with an SSD. It had hotswappable batteries, a spill resistant keyboard with a drain system, and I ran every linux distribution I could get my hands on, eventually settling on the xfce flavor of Ubuntu.<p>I still have that machine, and will keep it for my son when he's older. I plan to replace the battery and SSD with something more reliable. These days I use an m2 macbook air for portability... but that thinkpad is still something I will treasure for years.</p>
]]></description><pubDate>Sun, 18 Jan 2026 18:23:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46670576</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46670576</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46670576</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Ask HN: Share your personal website"]]></title><description><![CDATA[
<p><a href="https://ouachitalabs.com/" rel="nofollow">https://ouachitalabs.com/</a></p>
]]></description><pubDate>Thu, 15 Jan 2026 16:13:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46634716</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46634716</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46634716</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Believe the Checkbook"]]></title><description><![CDATA[
<p>I use claude exclusively at my day job with legacy codebases. I'm not using swarms or fancy multi-agent setups. Just sticking to plan mode, giving enough context, and iterating on a spec before starting the actual build.<p>I'm also not building webapps. I work in data engineering on a large legacy airflow project, internal python libraries, infrastructure with terraform, etc.</p>
]]></description><pubDate>Wed, 31 Dec 2025 07:25:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=46442158</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46442158</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46442158</guid></item><item><title><![CDATA[New comment by faxmeyourcode in "Believe the Checkbook"]]></title><description><![CDATA[
<p>Unfortunately this is a skill issue. And it's a totally different skill than reading and writing code, building solid systems, and general software engineering at large. That is annoying, but where we're currently at.<p>Assume you're writing code manually, and you personally make a mistake. It's often worthwhile to create a <i>mechanism</i> that prevents that class of mistake from cropping up again. Adding better LSP or refactoring support to your editor, better syntax highlighting, better type checking, etc.<p>That same exact game of whack a mole now has to be done for you and whatever agent you're building with. Some questions to ask: What caused the hallucination? Did you have the agent lay out its plan before it writes any code? Ask you questions and iterate on a spec before implementation? Have you given it all of the necessary tools, test harnesses, and context it needs to complete a request that you've made to it? How do you automate this so that it's impossible for these pieces to be missing for the next request? Are you using the right model for the task at hand?</p>
]]></description><pubDate>Wed, 31 Dec 2025 07:22:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46442148</link><dc:creator>faxmeyourcode</dc:creator><comments>https://news.ycombinator.com/item?id=46442148</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46442148</guid></item></channel></rss>