Hacker News: lsb

New comment by lsb in "Who Is America's Homer?"

lsb — Mon, 20 Jul 2026 04:24:02 +0000

Homer is the Homer of America, unless you believe in fencing off universal properties of mankind into exclusive tribal ownership.

(cf Saul Bellow asking “who is the Tolstoy of the Zulus”, and Ralph Wiley in the Atlantic saying that Tolstoy is the Tolstoy of the Zulus.)

New comment by lsb in "Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf"

lsb — Thu, 14 May 2026 19:12:36 +0000

Happy to answer any questions :)

Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf

lsb — Thu, 14 May 2026 18:52:29 +0000

The two from Parameter Golf (one I trained, one was the baseline) are just 16MB each! They produce barely plausible English.

Comments URL: https://news.ycombinator.com/item?id=48139581

Points: 3

# Comments: 1

New comment by lsb in "Healthchecks.io now uses self-hosted object storage"

lsb — Fri, 17 Apr 2026 15:19:24 +0000

Self-hosted object storage looks neat!

For this project, where you have 120GB of customer data and thirty requests a second for ~8KB objects (0.25MB/s of object reads), it seems you could get 100x the throughput by vertically scaling on one machine with a file system and an SSD, and never think about object storage. Would love to hear why the added complexity is worth it.
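For anyone checking the arithmetic, here is a rough sketch of the numbers above. The SSD read rate is a hypothetical round figure I picked for illustration, not something from the article.

```python
# Back-of-envelope read throughput from the figures in the comment.
requests_per_second = 30
object_size_kb = 8  # "~8k objects", read here as roughly 8 KB each

read_kb_per_second = requests_per_second * object_size_kb
read_mb_per_second = read_kb_per_second / 1000

# Hypothetical assumption: a conservative sustained read rate for one SSD.
assumed_ssd_mb_per_second = 500

headroom = assumed_ssd_mb_per_second / read_mb_per_second
print(f"{read_mb_per_second:.2f} MB/s of object reads")
print(f"about {headroom:.0f}x headroom on the assumed SSD")
```

Even with a much slower disk than the assumed one, the read load stays a small fraction of what a single machine can serve.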

New comment by lsb in "Leanstral: Open-source agent for trustworthy coding and formal proof engineering"

lsb — Mon, 16 Mar 2026 22:26:24 +0000

The real world success they report reminds me of Simon Willison’s Red Green TDD: https://simonwillison.net/guides/agentic-engineering-pattern...

> Instead of taking a stab in the dark, Leanstral rolled up its sleeves. It successfully built test code to recreate the failing environment and diagnosed the underlying issue with definitional equality. The model correctly identified that because def creates a rigid definition requiring explicit unfolding, it was actively blocking the rw tactic from seeing the underlying structure it needed to match.

New comment by lsb in "Lena by qntm (2021)"

lsb — Fri, 13 Feb 2026 06:58:08 +0000

It’s named after the multi-decade data compression test image https://en.wikipedia.org/wiki/Lenna

Buy the book! https://qntm.org/vhitaos

New comment by lsb in "Ask HN: How are you doing RAG locally?"

lsb — Thu, 15 Jan 2026 09:15:15 +0000

I'm using Sonnet with 1M Context Window at work, just stuffing everything in a window (it works fine for now), and I'm hoping to investigate Recursive Language Models with DSPy when I'm using local models with Ollama

New comment by lsb in "Tell HN: No continental US flights due to attack on Venezuela"

lsb — Sat, 03 Jan 2026 09:57:53 +0000

The New York Times has said that the US president has reported capturing the president of Venezuela https://www.nytimes.com/live/2026/01/03/world/trump-united-s...

Source about aviation: primary (I am at an airport now) and also there are no flights going into or out of JFK right now https://www.jfkairport.com/flight-tracker?view=VIEW_DEPARTUR...

Tell HN: No continental US flights due to attack on Venezuela

lsb — Sat, 03 Jan 2026 09:55:19 +0000

Comments URL: https://news.ycombinator.com/item?id=46474744

Points: 11

# Comments: 3

New comment by lsb in "Lite^3, a JSON-compatible zero-copy serialization format"

lsb — Fri, 19 Dec 2025 05:54:42 +0000

This is super interesting!

Apache Arrow is trying to do something similar, using FlatBuffers to serialize with zero-copy and zero-parse semantics, and an index structure built on top of that.

Would love to see comparisons with Arrow

New comment by lsb in "Why are your models so big? (2023)"

lsb — Sat, 06 Dec 2025 00:31:25 +0000

My threshold for “does not need to be smaller” is “can this run on a Raspberry Pi”. This is a helpful benchmark for maximum likely useful optimization.

A Pi has 4 cores and 16GB of memory these days, so running Qwen3 4B on a Pi is pretty comfortable: https://leebutterman.com/2025/11/01/prompt-optimization-on-a...

New comment by lsb in "Show HN: DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3"

lsb — Tue, 18 Nov 2025 22:08:00 +0000

Happy to answer any questions you have :)

Show HN: DSPy on a Pi: Cheap Prompt Optimization with GEPA and Qwen3

lsb — Tue, 18 Nov 2025 17:54:04 +0000

GEPA was very efficient at optimizing my LLM task description, and DSPy was effective at turning a soup of a prompt into something programmable with inputs and outputs

Comments URL: https://news.ycombinator.com/item?id=45969676

Points: 4

# Comments: 1

New comment by lsb in "Show HN: Apache Fory Rust – 10-20x faster serialization than JSON/Protobuf"

lsb — Tue, 28 Oct 2025 20:19:58 +0000

Curious about comparisons with Apache Arrow, which uses FlatBuffers to avoid memory copying during deserialization, is well supported by the Pandas ecosystem, and allows users to serialize arrays as lists of numbers that have hardware support from a GPU (int8-64, float).

New comment by lsb in "Solveit – A course and platform for solving problems with code"

lsb — Fri, 03 Oct 2025 00:45:10 +0000

fast.ai (some of the authors of this) was transformative for me, and the community was super nice. Cannot recommend looking into this highly enough.

New comment by lsb in "Show HN: A store that generates products from anything you type in search"

lsb — Sat, 13 Sep 2025 14:33:28 +0000

This is halfbakery! I love it!

(For example, a recent half baked idea there is a perpetually burning flag. https://www.halfbakery.com/idea/Perpetually_20Burning_20Flag... )

New comment by lsb in "LandChad, a site dedicated to turning internet peasants into Internet Landlords"

lsb — Sun, 31 Aug 2025 03:16:26 +0000

How are you a landlord if you're paying property taxes?

Once you have everything else set up, you can migrate to a server hosted on your own internet connection. Running your own data center is one of the more tricky parts of the equation, compared to almost-free web hosting for a 10MB site.

You're also just renting a domain name.

New comment by lsb in "Show HN: I replaced vector databases with Git for AI memory (PoC)"

lsb — Thu, 21 Aug 2025 09:10:08 +0000

Interesting! Text files in git can work for small sizes, like your 100MB.

That is what's known in FAISS as a "flat" index, just one thing after another. And obviously you can query by primary key to the key-value store that is git, and do atomic updates as you'd expect. In SQL land this is an unindexed column: you can do primary key lookups on the table, or you can look through every row in order to find what you want.
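A minimal sketch of that idea, assuming a plain dictionary stands in for the git-backed store. The keys, vectors, and function names are made up for illustration, and this is not how FAISS itself is implemented.

```python
import math

def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def flat_search(store, query, k=2):
    """Scan every stored embedding and return the keys of the k closest."""
    scored = sorted((l2_distance(vec, query), key) for key, vec in store.items())
    return [key for _, key in scored[:k]]

# Hypothetical toy data: keys stand in for file paths in the repository.
store = {
    "facts/a.txt": [0.1, 0.2, -0.1],
    "facts/b.txt": [0.9, -0.4, 0.3],
    "facts/c.txt": [0.0, 0.25, -0.05],
}

by_key = store["facts/b.txt"]                        # primary-key lookup: one access
nearest = flat_search(store, [0.1, 0.2, -0.1], k=2)  # similarity query: visits every row
print(by_key)
print(nearest)
```

The lookup by key is constant time, while the similarity query costs one distance computation per stored item, which is fine at small sizes.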

If you don't need fast query times, this could work great! You could also use SQL (maybe an AWS Aurora Postgres/MySQL table?) and stuff the fact and its embedding into a table, and get declarative relational queries (find me the closest 10 statements users A-J have made to embedding [0.1, 0.2, -0.1, ...] within the past day). Lots of SQL databases are getting embedding search (Postgres, SQLite, and more) so that will allow your embedding search to happen in a few milliseconds instead of a few seconds.
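Roughly what that query could look like, sketched with Python's built-in sqlite3. The table layout, the sample rows, and the `closest` helper are all hypothetical. The distance ranking happens in Python here because stock SQLite has no vector functions; a database with embedding search would do that step inside the query.

```python
import json
import math
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE statements (user TEXT, created_at TEXT, fact TEXT, embedding TEXT)"
)
rows = [
    ("alice", "2025-08-21T08:00:00", "likes tea", [0.1, 0.2, -0.1]),
    ("bob", "2025-08-21T08:05:00", "likes coffee", [0.9, -0.4, 0.3]),
    ("zed", "2025-08-21T08:10:00", "likes tea too", [0.1, 0.2, -0.1]),
    ("carol", "2025-08-19T08:00:00", "old note", [0.1, 0.2, -0.1]),
]
with conn:
    conn.executemany(
        "INSERT INTO statements VALUES (?, ?, ?, ?)",
        [(u, t, f, json.dumps(e)) for u, t, f, e in rows],
    )

def closest(conn, query, since, first_user, last_user, limit=10):
    """Filter declaratively in SQL, then rank the survivors by L2 distance."""
    candidates = conn.execute(
        "SELECT fact, embedding FROM statements"
        " WHERE created_at >= ? AND user >= ? AND user < ?",
        (since, first_user, last_user),
    ).fetchall()
    ranked = sorted(candidates, key=lambda row: math.dist(json.loads(row[1]), query))
    return [fact for fact, _ in ranked[:limit]]

# Users a through j, statements made since the given timestamp.
nearest = closest(conn, [0.1, 0.2, -0.1], "2025-08-20T08:10:00", "a", "k")
print(nearest)
```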

It could be worth sketching out how to use SQLite for your application, instead of using files on disk: SQLite was designed to be a better alternative to opening a file (what happens if power goes out while you are writing a file? what happens if you want to update two people's records, and not get caught mid-update by another web app process?) and is very well supported by many language ecosystems.
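A small sketch of the "not caught mid-update" part, using Python's built-in sqlite3. The table and the simulated crash are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # a file path would make this durable on disk
conn.execute("CREATE TABLE facts (user TEXT PRIMARY KEY, statement TEXT NOT NULL)")

# "with conn" wraps the block in one transaction: both rows commit, or neither does.
with conn:
    conn.executemany(
        "INSERT INTO facts (user, statement) VALUES (?, ?)",
        [("alice", "likes tea"), ("bob", "likes coffee")],
    )

# A failure partway through rolls back every change made inside the block.
try:
    with conn:
        conn.execute("UPDATE facts SET statement = 'likes matcha' WHERE user = 'alice'")
        raise RuntimeError("simulated crash before the second update")
except RuntimeError:
    pass

statements = dict(conn.execute("SELECT user, statement FROM facts"))
print(statements)
```

After the simulated crash the first update is gone as well, so other readers never observe one record changed without the other.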

Then, to take full advantage of vector embedding engines: what happens if my embedding is 1024 dimensions and each one is a 32-bit floating point value? Do I need to save all of that precision? Is 16-bit okay? 8-bit floats? What about reducing the dimensionality? Is it good enough accuracy and recall if I represent each dimension with an index to a palette of the best 256 floats for that dimension? What about representing each pair of dimensions with an index to a palette of the best 256 pairs of floats for those two dimensions? What about, instead of looking through every embedding one by one, we know that people talk about one of three different topics, and we have a separate index for each of those three major topics, and to find your nearest neighbors you want to first find your closest topic (or maybe closest two topics?) and then search in those lower indices? Each of these hypotheticals is literally a different “index string” in an embedding search library called FAISS, and could easily be thousands of lines of code if you did it yourself.
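To make the precision question concrete, here is a rough sketch of the storage cost per vector and a very simple 8-bit code per dimension. It assumes values fall in [-1, 1] and uses 256 evenly spaced levels, which is a simplification: a palette of the "best" 256 floats would be learned from the data instead. The function names are hypothetical, not FAISS API.

```python
import random

DIMENSIONS = 1024

# Bytes per vector under the options raised in the comment.
bytes_per_vector = {
    "float32": DIMENSIONS * 4,
    "float16": DIMENSIONS * 2,
    "8-bit code per dimension": DIMENSIONS * 1,
    "8-bit code per pair of dimensions": DIMENSIONS // 2,
}

def quantize(vector, lo, hi):
    """Map each float to the index of the nearest of 256 evenly spaced levels."""
    scale = (hi - lo) / 255
    return bytes(round((min(max(x, lo), hi) - lo) / scale) for x in vector)

def dequantize(codes, lo, hi):
    """Turn each 8-bit index back into the float level it stands for."""
    scale = (hi - lo) / 255
    return [lo + c * scale for c in codes]

random.seed(0)
vector = [random.uniform(-1.0, 1.0) for _ in range(DIMENSIONS)]
codes = quantize(vector, -1.0, 1.0)
restored = dequantize(codes, -1.0, 1.0)
max_error = max(abs(a - b) for a, b in zip(vector, restored))

for name, size in bytes_per_vector.items():
    print(f"{name}: {size} bytes")
print(f"worst per-dimension error after the 8-bit round trip: {max_error:.5f}")
```

Whether that error is acceptable depends on how much recall drops on your own queries, which is the thing to measure before committing to a smaller representation.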

It’s definitely a good learning experience to implement your own embedding database atop git! Especially if you run it in production! 100MB is small enough that anything reasonable is going to be fast.

New comment by lsb in "Gemma 3 270M re-implemented in pure PyTorch for local tinkering"

lsb — Wed, 20 Aug 2025 16:17:24 +0000

That’s wild that with a KV cache and compilation on the Mac CPU you are faster than on an A100 GPU.

New comment by lsb in "Vendors that treat single sign-on as a luxury feature"

lsb — Tue, 19 Aug 2025 21:35:39 +0000

Also: this SSO tax is deceptively framed. Many of these services allow one to sign in through, for example, Google, which can count as single sign-on, and many organizations have a mail account, but that isn’t taken into account.