<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bashbjorn</title><link>https://news.ycombinator.com/user?id=bashbjorn</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 15 May 2026 14:15:03 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bashbjorn" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>There is an RSS feed now: <a href="https://nobodywho.ooo/feed.xml" rel="nofollow">https://nobodywho.ooo/feed.xml</a></p>
]]></description><pubDate>Fri, 15 May 2026 09:16:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48146342</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48146342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48146342</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>It's just an eleventy site: <a href="https://github.com/nobodywho-ooo/website" rel="nofollow">https://github.com/nobodywho-ooo/website</a><p>No RSS feed currently, but it's a good idea to add one!</p>
]]></description><pubDate>Thu, 14 May 2026 23:49:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48142750</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48142750</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48142750</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>Cool technique, but I'm not sure I'd call it simple.<p>Doing this means that you can't just tokenize the string output of the chat template as one big string. You might need to tokenize things separately, and combine them after.</p>
]]></description><pubDate>Thu, 14 May 2026 23:47:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48142730</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48142730</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48142730</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>You're right, there must be a good and simple way to do it.<p>Obviously the prefix-with-backslash convention won't do it.
The escaping system could be something like inserting a character on the second position in the text repr, and reversing that on output too if it matches an escaped known special token.<p>Changing the vocab on the fly requires tokenizing things separately, breaking the chat template.<p>Anecdotally, even claude code has an anneurism sometimes when listing special tokens. Idk exactly what claude's <eos> token is, but I'm fairly sure I've seen it stop generation when it tried to generate it before.<p>I should also say that I've (clearly) not thought about this deeply. There should be a simpler way to do it.</p>
]]></description><pubDate>Thu, 14 May 2026 23:45:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48142719</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48142719</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48142719</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>The model sees one token per marker - but the overlap with ingested actual text is still relevant, because the tokenizer will ingest regular text, where it will turn "<|turn>" into the same token.<p>For this reason, it can be tricky to work on the runtime for a model with the same model. This really feels like an accidental problem, but I'm not sure if it's really solvable without abandoning the text representations altogether (and the jinja abstraction along with it).</p>
]]></description><pubDate>Thu, 14 May 2026 21:45:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48141671</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48141671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48141671</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>Yeah, TheBloke era of local LLMs were good times. TBF Unsloth are doing a fantastic job of publishing quants of the major models quickly - they just don't have nearly the volume of "weird" models as TheBloke did.</p>
]]></description><pubDate>Thu, 14 May 2026 19:07:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48139774</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48139774</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48139774</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>I love mistral, but that model is... not the best. Maybe try out Gemma 4 e4b, it's a similar size to Mistral 7B, and should run great on your 4070 ("E4B" is slightly misleading naming).</p>
]]></description><pubDate>Thu, 14 May 2026 19:04:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48139729</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48139729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48139729</guid></item><item><title><![CDATA[New comment by bashbjorn in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>whoops, my bad. Just a typo in the markdown. Fixed :)</p>
]]></description><pubDate>Thu, 14 May 2026 19:00:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48139684</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48139684</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48139684</guid></item><item><title><![CDATA[What's in a GGUF, besides the weights – and what's still missing?]]></title><description><![CDATA[
<p>Article URL: <a href="https://nobodywho.ooo/posts/whats-in-a-gguf/">https://nobodywho.ooo/posts/whats-in-a-gguf/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48138332">https://news.ycombinator.com/item?id=48138332</a></p>
<p>Points: 161</p>
<p># Comments: 48</p>
]]></description><pubDate>Thu, 14 May 2026 17:21:49 +0000</pubDate><link>https://nobodywho.ooo/posts/whats-in-a-gguf/</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=48138332</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48138332</guid></item><item><title><![CDATA[New comment by bashbjorn in "The local LLM ecosystem doesn’t need Ollama"]]></title><description><![CDATA[
<p>Oh hey I'm also working on a thing to solve the devx of llama.cpp: <a href="https://github.com/nobodywho-ooo/nobodywho" rel="nofollow">https://github.com/nobodywho-ooo/nobodywho</a><p>In contrast to Ollama, this is a self-contained library, not a server.<p>I wrote some quick notes on this blogpost, just to jot down how we think about good open-source citizenship: <a href="https://www.nobodywho.ai/posts/notes-on-friends-dont-let-friends-use-ollama/" rel="nofollow">https://www.nobodywho.ai/posts/notes-on-friends-dont-let-fri...</a></p>
]]></description><pubDate>Thu, 16 Apr 2026 18:00:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47797173</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=47797173</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47797173</guid></item><item><title><![CDATA[New comment by bashbjorn in "Ask HN: Who is hiring? (January 2026)"]]></title><description><![CDATA[
<p>NobodyWho | Copenhagen, Denmark | Software Engineer, ML Engineer, DevRel | Full-time | ONSITE<p>NobodyWho is making developer tools for running small language models in local-first applications.
Our core principle is to ship the model weights along with the application, and then do efficient inference locally and offline, on any device. We run fast on Linux, MacOS, Windows, Android and iOS.<p>The main product is an inference library that wraps llama.cpp, written in Rust. We provide bindings for Python, Godot (the game engine), and will be releasing a Flutter plugin soon. It's all licensed under EUPL 1.2. Repo here: <a href="https://github.com/nobodywho-ooo/nobodywho/" rel="nofollow">https://github.com/nobodywho-ooo/nobodywho/</a><p>We're hiring people who are comfortable building highly cross-platform FFI applications in Rust (with C++ dependencies), and people who are deeply familiar with language models and the open standards around them, as well as fine-tuning and evaluating models. We're also looking for a technical DevRel profile.<p>If any of that sounds relevant to you, feel free to email me: a>at<nobodywho.ooo</p>
]]></description><pubDate>Sat, 03 Jan 2026 14:19:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46476963</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=46476963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46476963</guid></item><item><title><![CDATA[New comment by bashbjorn in "When vibe coding, isn't the source code the prompt?"]]></title><description><![CDATA[
<p>I made something silly to this effect last year:<p><a href="https://gitlab.com/AsbjornOlling/nixllm" rel="nofollow">https://gitlab.com/AsbjornOlling/nixllm</a><p>You provide the prompt to a nix function, and it deterministically generates the source code using llama.cpp.</p>
]]></description><pubDate>Tue, 05 Aug 2025 13:56:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=44798029</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=44798029</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44798029</guid></item><item><title><![CDATA[New comment by bashbjorn in "Ask HN: Is anyone doing anything cool with tiny language models?"]]></title><description><![CDATA[
<p>I'm working on a plugin[1] that runs local LLMs from the Godot game engine. The optimal model sizes seem to be 2B-7B ish, since those will run fast enough on most computers. We recommend that people try it out with Gemma 2 2B (but it will work with any model that works with llama.cpp)<p>At those sizes, it's great for generating non-repetitive flavortext for NPCs. No more "I took an arrow to the knee".<p>Models at around the 2B size aren't really capable enough to act a competent adversary - but they are great for something like bargaining with a shopkeeper, or some other role where natural language can let players do a bit more immersive roleplay.<p>[1] <a href="https://github.com/nobodywho-ooo/nobodywho">https://github.com/nobodywho-ooo/nobodywho</a></p>
]]></description><pubDate>Wed, 22 Jan 2025 14:42:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=42793335</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=42793335</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42793335</guid></item><item><title><![CDATA[New comment by bashbjorn in "Ask HN: Outstanding Programmers"]]></title><description><![CDATA[
<p>What makes you say that these are all Stenbergs creations?<p>Could it be that these are just projects that use libcurl in some way?<p>I'm having trouble finding any sources that say that Daniel Stenberg actually worked on spotify, utorrent or openttd directly - just to test three of them.</p>
]]></description><pubDate>Mon, 22 Apr 2024 12:33:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=40113736</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=40113736</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40113736</guid></item><item><title><![CDATA[New comment by bashbjorn in "Show HN: Hate Regex? This Free Tool Writes It for You"]]></title><description><![CDATA[
<p>Neat. I wish that I could test the regex immediately in the browser.
Tools like regexr.com are great for convincing myself that my regex works.</p>
]]></description><pubDate>Tue, 16 Apr 2024 21:37:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=40057531</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=40057531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40057531</guid></item><item><title><![CDATA[New comment by bashbjorn in "Ask HN: What programming languages are you using at $WORK?"]]></title><description><![CDATA[
<p>Python, Typescript, Nim, Rust, C++, C and Elm.</p>
]]></description><pubDate>Thu, 14 Sep 2023 19:19:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=37513526</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=37513526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37513526</guid></item><item><title><![CDATA[New comment by bashbjorn in "Ask HN: ARM Laptop Suggestions?"]]></title><description><![CDATA[
<p>The MNT Reform[1] is a pretty sexy open hardware laptop with a mechanical keyboard, optional trackball, and a small ARM processor.<p>A cursory glance at geekbench shows that you can get around 50% the performance of a 2020 Macbook Air M1, if you get the upgraded CPU option.[2][3] I have no idea how useful those benchmarks are, though.<p>[1] <a href="https://shop.mntre.com/products/mnt-reform" rel="nofollow noreferrer">https://shop.mntre.com/products/mnt-reform</a>
[2] <a href="https://browser.geekbench.com/v4/cpu/16291496" rel="nofollow noreferrer">https://browser.geekbench.com/v4/cpu/16291496</a>
[3] <a href="https://browser.geekbench.com/v6/cpu/2504172" rel="nofollow noreferrer">https://browser.geekbench.com/v6/cpu/2504172</a></p>
]]></description><pubDate>Tue, 05 Sep 2023 12:34:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=37390799</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=37390799</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37390799</guid></item><item><title><![CDATA[New comment by bashbjorn in "Advent of Code 2021"]]></title><description><![CDATA[
<p>I did AoC in Nim one year and was very happy with it! I also became infatuated with Nim that year..</p>
]]></description><pubDate>Sun, 21 Nov 2021 01:20:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=29293333</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=29293333</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29293333</guid></item><item><title><![CDATA[New comment by bashbjorn in "Advent of Code 2021"]]></title><description><![CDATA[
<p>I highly recommend it to explore new programming styles in general. As some others, I use it as an excuse to learn a new language each year.<p>While I think python is a really great language for advent of code, I'm not sure I'd recommend going for an OOP-heavy style. Although that might just be a matter of personal taste - I think OOP is a poor strategy for most problems.</p>
]]></description><pubDate>Sun, 21 Nov 2021 01:17:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=29293320</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=29293320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29293320</guid></item><item><title><![CDATA[Cactus Comments: Matrix Spec Changes we're excited for]]></title><description><![CDATA[
<p>Article URL: <a href="https://cactus.chat/blog/mscs-were-excited-for/">https://cactus.chat/blog/mscs-were-excited-for/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=26915944">https://news.ycombinator.com/item?id=26915944</a></p>
<p>Points: 6</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 23 Apr 2021 15:47:38 +0000</pubDate><link>https://cactus.chat/blog/mscs-were-excited-for/</link><dc:creator>bashbjorn</dc:creator><comments>https://news.ycombinator.com/item?id=26915944</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26915944</guid></item></channel></rss>