<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: lsb</title><link>https://news.ycombinator.com/user?id=lsb</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 15 Apr 2026 16:28:32 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=lsb" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by lsb in "Leanstral: Open-source agent for trustworthy coding and formal proof engineering"]]></title><description><![CDATA[
<p>The real world success they report reminds me of Simon Willison’s Red Green TDD: <a href="https://simonwillison.net/guides/agentic-engineering-patterns/red-green-tdd/" rel="nofollow">https://simonwillison.net/guides/agentic-engineering-pattern...</a><p>> Instead of taking a stab in the dark, Leanstral rolled up its sleeves. It successfully built test code to recreate the failing environment and diagnosed the underlying issue with definitional equality. The model correctly identified that because def creates a rigid definition requiring explicit unfolding, it was actively blocking the rw tactic from seeing the underlying structure it needed to match.</p>
]]></description><pubDate>Mon, 16 Mar 2026 22:26:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47405846</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=47405846</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47405846</guid></item><item><title><![CDATA[New comment by lsb in "Lena by qntm (2021)"]]></title><description><![CDATA[
<p>It’s named after the multi-decade data compression test image <a href="https://en.wikipedia.org/wiki/Lenna" rel="nofollow">https://en.wikipedia.org/wiki/Lenna</a><p>Buy the book! <a href="https://qntm.org/vhitaos" rel="nofollow">https://qntm.org/vhitaos</a></p>
]]></description><pubDate>Fri, 13 Feb 2026 06:58:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46999737</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46999737</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46999737</guid></item><item><title><![CDATA[New comment by lsb in "Ask HN: How are you doing RAG locally?"]]></title><description><![CDATA[
<p>I'm using Sonnet with 1M Context Window at work, just stuffing everything in a window (it works fine for now), and I'm hoping to investigate Recursive Language Models with DSPy when I'm using local models with Ollama</p>
]]></description><pubDate>Thu, 15 Jan 2026 09:15:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=46630070</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46630070</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46630070</guid></item><item><title><![CDATA[New comment by lsb in "Tell HN: No continental US flights due to attack on Venezuela"]]></title><description><![CDATA[
<p>The New York Times has said that the US president has reported capturing the president of Venezuela <a href="https://www.nytimes.com/live/2026/01/03/world/trump-united-states-strikes-venezuela" rel="nofollow">https://www.nytimes.com/live/2026/01/03/world/trump-united-s...</a><p>Source about aviation: primary (I am at an airport now) and also there are no flights going into or out of JFK right now <a href="https://www.jfkairport.com/flight-tracker?view=VIEW_DEPARTURE&apt=JFK" rel="nofollow">https://www.jfkairport.com/flight-tracker?view=VIEW_DEPARTUR...</a></p>
]]></description><pubDate>Sat, 03 Jan 2026 09:57:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46474759</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46474759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46474759</guid></item><item><title><![CDATA[Tell HN: No continental US flights due to attack on Venezuela]]></title><description><![CDATA[

<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46474744">https://news.ycombinator.com/item?id=46474744</a></p>
<p>Points: 11</p>
<p># Comments: 3</p>
]]></description><pubDate>Sat, 03 Jan 2026 09:55:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=46474744</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46474744</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46474744</guid></item><item><title><![CDATA[New comment by lsb in "Lite^3, a JSON-compatible zero-copy serialization format"]]></title><description><![CDATA[
<p>This is super interesting!<p>Apache Arrow is trying to do something similar, using Flatbuffer to serialize with zero-copy and zero-parse semantics, and an index structure built on top of that.<p>Would love to see comparisons with Arrow</p>
]]></description><pubDate>Fri, 19 Dec 2025 05:54:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=46322661</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46322661</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46322661</guid></item><item><title><![CDATA[New comment by lsb in "Why are your models so big? (2023)"]]></title><description><![CDATA[
<p>My threshold for “does not need to be smaller” is “can this run on a Raspberry Pi”. This is a helpful benchmark for maximum likely useful optimization.<p>A Pi has 4 cores and 16GB of memory these days, so, running Qwen3 4B on a pi is pretty comfortable: <a href="https://leebutterman.com/2025/11/01/prompt-optimization-on-a-raspberry-pi.html" rel="nofollow">https://leebutterman.com/2025/11/01/prompt-optimization-on-a...</a></p>
]]></description><pubDate>Sat, 06 Dec 2025 00:31:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=46169277</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=46169277</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46169277</guid></item><item><title><![CDATA[New comment by lsb in "Show HN: DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3"]]></title><description><![CDATA[
<p>Happy to answer any questions you have :)</p>
]]></description><pubDate>Tue, 18 Nov 2025 22:08:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=45972886</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45972886</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45972886</guid></item><item><title><![CDATA[Show HN: DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3]]></title><description><![CDATA[
<p>GEPA was very efficient at optimizing my LLM task description, and DSPy was effective at turning a soup of a prompt into something programmable with inputs and outputs</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45969676">https://news.ycombinator.com/item?id=45969676</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 18 Nov 2025 17:54:04 +0000</pubDate><link>https://leebutterman.com/2025/11/01/prompt-optimization-on-a-raspberry-pi.html</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45969676</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45969676</guid></item><item><title><![CDATA[New comment by lsb in "Show HN: Apache Fory Rust – 10-20x faster serialization than JSON/Protobuf"]]></title><description><![CDATA[
<p>Curious about comparisons with Apache Arrow, which uses flatbuffers to avoid memory copying during deserialization, which is well supported by the Pandas ecosystem, and which allows users to serialize arrays as lists of numbers that have hardware support from a GPU (int8-64, float)</p>
]]></description><pubDate>Tue, 28 Oct 2025 20:19:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=45738562</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45738562</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45738562</guid></item><item><title><![CDATA[New comment by lsb in "Solveit – A course and platform for solving problems with code"]]></title><description><![CDATA[
<p>fast.ai (some of the authors of this) was transformative for me, and the community was super nice. Cannot recommend looking into this highly enough.</p>
]]></description><pubDate>Fri, 03 Oct 2025 00:45:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=45457420</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45457420</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45457420</guid></item><item><title><![CDATA[New comment by lsb in "Show HN: A store that generates products from anything you type in search"]]></title><description><![CDATA[
<p>This is halfbakery! I love it!<p>(For example, a recent half baked idea there is a perpetually burning flag. <a href="https://www.halfbakery.com/idea/Perpetually_20Burning_20Flag#1756839998" rel="nofollow">https://www.halfbakery.com/idea/Perpetually_20Burning_20Flag...</a> )</p>
]]></description><pubDate>Sat, 13 Sep 2025 14:33:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=45232382</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45232382</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45232382</guid></item><item><title><![CDATA[New comment by lsb in "LandChad, a site dedicated to turning internet peasants into Internet Landlords"]]></title><description><![CDATA[
<p>How are you a landlord if you're paying property taxes?<p>Once you have everything else set up, you can migrate to a server hosted on your own internet connection. Running your own data center is one of the more tricky parts of the equation, compared to almost-free web hosting for a 10MB site.<p>You're also just renting a domain name.</p>
]]></description><pubDate>Sun, 31 Aug 2025 03:16:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=45080062</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=45080062</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45080062</guid></item><item><title><![CDATA[New comment by lsb in "Show HN: I replaced vector databases with Git for AI memory (PoC)"]]></title><description><![CDATA[
<p>Interesting! Text files in git can work for small sizes, like your 100MB.<p>That is what's known in FAISS as a "flat" index, just one thing after another. And obviously you can query by primary key to the key-value store that is git, and do atomic updates as you'd expect. In SQL land this is an unindexed column, you can do primary key lookups on the table, or you can look through every row in order to find what you want.<p>If you don't need fast query times, this could work great! You could also use SQL (maybe an AWS Aurora Postgres/MySQL table?) and stuff the fact and its embedding into a table, and get declarative relational queries (find me the closest 10 statements users A-J have made to embedding [0.1, 0.2, -0.1, ...] within the past day). Lots of SQL databases are getting embedding search (Postgres, sqlite, and more) so that will allow your embedding search to happen in a few milliseconds instead of a few seconds.<p>It could be worth sketching out how to use SQLite for your application, instead of using files on disk: SQLite was designed to be a better alternative to opening a file (what happens if power goes out while you are writing a file? what happens if you want to update two people's records, and not get caught mid-update by another web app process?) and is very well supported by many language ecosystems.<p>Then, to take full advantage of vector embedding engines: what happens if my embedding is 1024 dimensions and each one is a 32 bit floating point value? Do I need to save all of that precision? Is 16-bit okay? 8-bit floats? What about reducing the dimensionality? Is it good enough accuracy and recall if I represent each dimension with an index to a palette of the best 256 floats for that dimension? What about representing each pair of dimensions with an index to a palette of the best 256 pairs of floats for those two dimensions? What about, instead of looking through every embedding one by one, we know that people talk about one of three different topics, and we have three different indices for each of those major topics, and to find your nearest neighbors you want to first find your closest topic (or maybe closest two topics?) and then search in those lower indices? Each of these hypotheticals is literally a different “index string” in an embedding search called FAISS, and could easily be thousands of lines of code if you did it yourself.<p>It’s definitely a good learning experience to implement your own embedding database atop git! Especially if you run it in production! 100MB is small enough that anything reasonable is going to be fast.</p>
]]></description><pubDate>Thu, 21 Aug 2025 09:10:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=44970643</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44970643</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44970643</guid></item><item><title><![CDATA[New comment by lsb in "Gemma 3 270M re-implemented in pure PyTorch for local tinkering"]]></title><description><![CDATA[
<p>That’s wild that with a KV cache and compilation on the Mac CPU you are faster than on an A100 GPU.</p>
]]></description><pubDate>Wed, 20 Aug 2025 16:17:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=44963262</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44963262</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44963262</guid></item><item><title><![CDATA[New comment by lsb in "Vendors that treat single sign-on as a luxury feature"]]></title><description><![CDATA[
<p>Also: this SSO tax is deceptively framed. Many of these services allow one to sign in through, for example, Google, which can count as a single sign on, and many organizations have a mail account, but that isn’t taken into account.</p>
]]></description><pubDate>Tue, 19 Aug 2025 21:35:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=44956478</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44956478</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44956478</guid></item><item><title><![CDATA[New comment by lsb in "What's the strongest AI model you can train on a laptop in five minutes?"]]></title><description><![CDATA[
<p>This is evocative of “cramming”, a paper from a few years ago, where the author tried to find the best model they could train for a day on a modern laptop: <a href="https://arxiv.org/abs/2212.14034" rel="nofollow">https://arxiv.org/abs/2212.14034</a></p>
]]></description><pubDate>Thu, 14 Aug 2025 19:54:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=44904881</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44904881</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44904881</guid></item><item><title><![CDATA[New comment by lsb in "Ask HN: With all the AI hype, how are software engineers feeling?"]]></title><description><![CDATA[
<p>I used Claude Code to navigate a legacy codebase the other day, and having the ability to ask "how many of these files have helper methods that are duplicated or <i>almost but not quite exactly</i> duplicated?" was very much a superpower.</p>
]]></description><pubDate>Mon, 11 Aug 2025 05:24:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=44860963</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44860963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44860963</guid></item><item><title><![CDATA[Vibe Scraping / Vibe Coding a schedule app on a phone]]></title><description><![CDATA[
<p>Article URL: <a href="https://simonwillison.net/2025/Jul/17/vibe-scraping/">https://simonwillison.net/2025/Jul/17/vibe-scraping/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44602309">https://news.ycombinator.com/item?id=44602309</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 18 Jul 2025 07:55:42 +0000</pubDate><link>https://simonwillison.net/2025/Jul/17/vibe-scraping/</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=44602309</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44602309</guid></item><item><title><![CDATA[New comment by lsb in "Ask HN: Moving a not-for-profit web app off AWS"]]></title><description><![CDATA[
<p>I’ve been running half-a-billion parameter models comfortably in a web browser, especially with WebGPU, and you can definitely run billion parameter LLMs in the browser. It becomes a heavyweight browser app, but if the main costs are running ML models you can pretty easily serve static files from a directory and let clients’ browsers do the heavy lifting. Feel free to reach out if you have questions, happy to help, I’ve been working on language web apps as well</p>
]]></description><pubDate>Sat, 25 Jan 2025 18:45:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=42823605</link><dc:creator>lsb</dc:creator><comments>https://news.ycombinator.com/item?id=42823605</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42823605</guid></item></channel></rss>