<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: killcoder</title><link>https://news.ycombinator.com/user?id=killcoder</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 24 May 2026 21:11:47 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=killcoder" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by killcoder in "Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O"]]></title><description><![CDATA[
<p>I think 'actual parallelism' is a vastly easier and more fruitful way to get better performance out of these kinds of systems, compared to pushing against single-threaded faster generation. Tool calling and responses are often embarrassingly parallel. Code generation tasks naturally have a dependency tree that can be unrolled into a fixed budget of parallelism. Tasks can be hierarchically decomposed into subtasks.<p>It's the same asynchronous stream pattern we're used to dealing with in regular software engineering. We have a fixed thread pool, lots of work that can be scheduled concurrently. Since these are streams, we can do the compute incrementally to reduce the time-to-first-byte/token/response.<p>Since so many tool calls are inherently asynchronous, and subagent task decomposition can be modelled as such, the IO streams can be oversubscribed, and incoming responses can be priority queued.<p>On the intelligence front, it's incredible how much better frontier models perform when you just interrupt them every so often and go 'is that the best you can do?', or re-iterate instructions, or repeat the overall goal. I find instruction following _so poor_, especially for 'presentation layer' aspects. Yet if I ask the model to rewrite its last response, it does so perfectly. Why can't the model do this 'internally' and save me having to say 'try again'!<p>Just because the 'model' is autoregressive doesn't mean the system as a whole needs to present a single stream of immutable text.</p>
]]></description><pubDate>Fri, 22 May 2026 03:01:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48231470</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=48231470</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48231470</guid></item><item><title><![CDATA[New comment by killcoder in "The missed opportunity of constrained decoding"]]></title><description><![CDATA[
<p>I don't buy the "any constraints cause lower performance via being out of distribution" idea. Sure if you ask the model to output 'reasoning' in JSON steps, that is a completely different 'channel' to its trained 'reasoning' output. For real tasks though, I think it's more about picking the _right_ context free grammar to enforce format correctness. You can enforce an in-distribution format and get the best of both worlds. I don't think the industry should settle so hard on JSON-for-everything.</p>
]]></description><pubDate>Sat, 17 Jan 2026 06:03:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=46655725</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=46655725</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46655725</guid></item><item><title><![CDATA[New comment by killcoder in "The missed opportunity of constrained decoding"]]></title><description><![CDATA[
<p>I was working on a speculative decoding optimisation and its accompanying blog post. Explaining the more basic concepts filled so much of the post I decided to pull them out, forming this article.<p>I had a bit too much fun with the tokenisation diagrams / animations. The raw text is provided to an Astro component, which tokenises it, and forms the individual DOM elements of the tokens. I find it really hard to read 'tokenised' text, I figured some consistent colouring would help. The 'Probabilities' component is a trivial grid, but all the other components support 'word wrap'.<p>I ended up writing a 'responsive design aware graph colouring solver'.<p>Multiple screen widths, 'desktop' and 'mobile' are 'simulated', forming an adjacency graph of tokens that touch. Colours are then greedily allocated, then optimised per page over a few hundred iterations, swapping allocations to enforce minimum hue distance between touching tokens at those common screen sizes. The optimising value function prioritises even distribution of colours, because it looks nicer than maximal hue difference.<p>Originally I naively outputted the palette styles per component, but found the css post processing optimisers didn't handle that as well as I'd have thought. So then I wrote a little 'CSS compiler' that takes the high level palette and timing concepts of the animations, and optimally merges rule declarations.<p>The start of the post really relies on the animation occurring while fully in view, so I set up some IntersectionObservers that do the 'please scroll' text.<p>I tried my best to have it all work when JS is disabled on the client. I tried to get the 'hovering' to be CSS-only, but found the JS solution much more performant.<p>The DAG diagrams are formed with this neat Needleman-Wunsch algorithm from the bioinformatics field. The Astro component accepts several 'examples' then aligns common subsequences, producing the CSS grid and the 'basic SVG' on the server. The responsive nature meant I had to move the final 'allow' generation to the client.<p>Some browsers seem to throttle the token animations sometimes but I haven't figured out what causes that. This is my first time leaning hard on CSS variables.</p>
]]></description><pubDate>Sat, 17 Jan 2026 05:49:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=46655665</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=46655665</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46655665</guid></item><item><title><![CDATA[The missed opportunity of constrained decoding]]></title><description><![CDATA[
<p>Article URL: <a href="https://michaelorenstein.com/blog/zero-entropy-tokens/">https://michaelorenstein.com/blog/zero-entropy-tokens/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46655562">https://news.ycombinator.com/item?id=46655562</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Sat, 17 Jan 2026 05:29:09 +0000</pubDate><link>https://michaelorenstein.com/blog/zero-entropy-tokens/</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=46655562</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46655562</guid></item><item><title><![CDATA[New comment by killcoder in "$96M AUD revamp of Bom website bombs out on launch"]]></title><description><![CDATA[
<p>In South Australia an algal bloom started in ~mid-March of this year, it's a pretty big ecological disaster, probably the worst non-bushfire disaster in living memory. Probably 30% of SA's coastline is affected. It's a pretty big deal affecting many people's livelihoods.<p>The joint state and federal government relief and cleanup package is worth AUD $102.5 million dollars.<p>I hope the public receives that comparison at every opportunity.<p>The old website was frankly excellent, the only problem was it didn't have HTTPS support. I would have happily upgraded that part of the system for the cost of a cup of coffee if I'd had an opportunity to submit for the tender!<p>The new website is significantly more difficult to navigate (for me, a seasoned tech user). The primary thing Dad's everywhere use it for (the weather radar) now requires scrolling to the _bottom_ of the page, and zooming in from the 'map of Australia' to the region you live in. It used to be like, a click to go from home page -> state weather radar with all the info you needed.<p><a href="https://www.abc.net.au/news/2025-11-23/bureau-of-meteorology-new-website-cost-blowout-to-96-million/106042202" rel="nofollow">https://www.abc.net.au/news/2025-11-23/bureau-of-meteorology...</a><p>If you want to read our local news about it.<p>> [BOM] said the cost breakdown included $4.1 million for the redesign, $79.8 million for the website build, and the site's launch and security testing cost $12.6 million.<p>Absolutely stupid, even those numbers are outrageous. They say it's part of some 'larger upgrade package', prompted by a cyber attack in 2015.<p><a href="https://www.abc.net.au/news/2015-12-02/china-blamed-for-cyber-attack-on-bureau-of-meteorology/6993278" rel="nofollow">https://www.abc.net.au/news/2015-12-02/china-blamed-for-cybe...</a><p>But politicians over here love to blame cyber attacks when technical blunders happen. We had a census a couple years ago and the website fell over due to 'unprecedented load' or maybe it was a 'DDOS attack'? The news at the time couldn't decide who to blame!<p>Welp, I hope this gets as much world-wide attention as possible so they can be embarrassed and do better.</p>
]]></description><pubDate>Thu, 27 Nov 2025 10:09:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=46067700</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=46067700</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46067700</guid></item><item><title><![CDATA[New comment by killcoder in "Building more with GPT-5.1-Codex-Max"]]></title><description><![CDATA[
<p>> 30% more token-efficient at the same reasoning level across many tasks<p>But they're claiming it's more token efficient, so me switching my usage to the new model should _free up_ capacity.</p>
]]></description><pubDate>Sun, 23 Nov 2025 04:05:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=46020664</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=46020664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46020664</guid></item><item><title><![CDATA[New comment by killcoder in "Building more with GPT-5.1-Codex-Max"]]></title><description><![CDATA[
<p>It would be nice if users of the codex-cli that are just using API keys as a way to handle rate limits and billing could receive these new models at the same time. I appreciate the reasoning behind delayed 'actual API' release, but I've found the rate limiting to be quite annoying, and my own API keys don't have this limitation.</p>
]]></description><pubDate>Thu, 20 Nov 2025 03:47:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=45988707</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=45988707</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45988707</guid></item><item><title><![CDATA[New comment by killcoder in "GPT-OSS-Safeguard"]]></title><description><![CDATA[
<p>Apart from the actual model release, this is the second set of models from OpenAI that uses the Harmony response format. I don't suppose anyone knows if OpenAI uses the Harmony format internally for GPT-5 as well?<p><a href="https://cookbook.openai.com/articles/openai-harmony" rel="nofollow">https://cookbook.openai.com/articles/openai-harmony</a></p>
]]></description><pubDate>Wed, 29 Oct 2025 14:01:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=45746961</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=45746961</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45746961</guid></item><item><title><![CDATA[GPT-OSS-Safeguard]]></title><description><![CDATA[
<p>Article URL: <a href="https://openai.com/index/introducing-gpt-oss-safeguard/">https://openai.com/index/introducing-gpt-oss-safeguard/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45746907">https://news.ycombinator.com/item?id=45746907</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 29 Oct 2025 13:57:29 +0000</pubDate><link>https://openai.com/index/introducing-gpt-oss-safeguard/</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=45746907</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45746907</guid></item><item><title><![CDATA[New comment by killcoder in "Show HN: Electrico – Electron Without Node and Chrome"]]></title><description><![CDATA[
<p>Not at all, in a renderer the Node and Chromium event loops are bound together, they’re part of the same v8 isolate, no IPC shenanigans.<p>The main process really shouldn’t be used for anything except setup. Since it controls gpu paints amongst other things, blocking on it will cause visible stuttering and a bad user experience.<p><a href="https://www.electronjs.org/blog/electron-internals-node-integration" rel="nofollow">https://www.electronjs.org/blog/electron-internals-node-inte...</a></p>
]]></description><pubDate>Wed, 18 Sep 2024 09:48:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=41577736</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41577736</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41577736</guid></item><item><title><![CDATA[New comment by killcoder in "Show HN: Electrico – Electron Without Node and Chrome"]]></title><description><![CDATA[
<p>Renderers can access Node APIs via the ‘node integration’ setting or via a preload script.</p>
]]></description><pubDate>Tue, 17 Sep 2024 12:06:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=41566787</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41566787</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41566787</guid></item><item><title><![CDATA[New comment by killcoder in "Show HN: Electrico – Electron Without Node and Chrome"]]></title><description><![CDATA[
<p>You were correct. Electron lets you expose specific NodeJS APIs via the preload script or everything via the ‘nodeIntegration’ setting:<p><a href="https://www.electronjs.org/docs/latest/api/structures/web-preferences" rel="nofollow">https://www.electronjs.org/docs/latest/api/structures/web-pr...</a><p>Separately the IPC lets you do zero copy in some circumstances via Transferable objects such as ArrayBuffers. Structured cloning is efficient but not zero copy, and json serialisation shouldn’t be used (since structured cloning is easily available).</p>
]]></description><pubDate>Tue, 17 Sep 2024 12:04:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=41566775</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41566775</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41566775</guid></item><item><title><![CDATA[New comment by killcoder in "Show HN: Electrico – Electron Without Node and Chrome"]]></title><description><![CDATA[
<p>You don’t need IPC, you can either use a preload script to expose particular Node APIs in a secure manner or set ‘nodeIntegration‘ to ‘true’ to expose everything.<p>Source: <a href="https://www.electronjs.org/docs/latest/api/structures/web-preferences" rel="nofollow">https://www.electronjs.org/docs/latest/api/structures/web-pr...</a></p>
]]></description><pubDate>Tue, 17 Sep 2024 12:00:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=41566741</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41566741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41566741</guid></item><item><title><![CDATA[New comment by killcoder in "Show HN: Electrico – Electron Without Node and Chrome"]]></title><description><![CDATA[
<p>Within a renderer you can access NodeJS APIs directly. The main process shouldn’t be used for any significant computation, as it will block GPU paints and cross-process synchronisation.<p>The other main difference is Electron bundles a known set of APIs, given the known Chromium version. There’s such a huge variance of supported features across the embedded web views.</p>
]]></description><pubDate>Tue, 17 Sep 2024 10:59:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=41566370</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41566370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41566370</guid></item><item><title><![CDATA[New comment by killcoder in "Hi-Tech Bifocals Improved My Eyesight but Made Me Look Like a Dork"]]></title><description><![CDATA[
<p>Conversely, the last blog post we wrote was 8,000+ words and took months of testing, yet the average 'read' time is under 2 minutes. I'm convinced there's a correlation between interested technical users and the blocking of analytics scripts - but if I were to naively look at the data, I'd also come to the conclusion that "lower effort" was better return on investment. I wonder if these tech journalism establishments are following their analytics and A/B testing themselves into oblivion.</p>
]]></description><pubDate>Sun, 15 Sep 2024 20:41:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=41550206</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=41550206</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41550206</guid></item><item><title><![CDATA[New comment by killcoder in "Serious Sam handled massive amounts of enemies on 56k modem connections"]]></title><description><![CDATA[
<p>Factorio is a good example of a modern game that works this way.</p>
]]></description><pubDate>Thu, 13 Jun 2024 04:45:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=40666037</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=40666037</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40666037</guid></item><item><title><![CDATA[New comment by killcoder in "DuckDB 1.0.0"]]></title><description><![CDATA[
<p>Does DuckDB supports partial reading of .duckdb files hosted externally?</p>
]]></description><pubDate>Wed, 05 Jun 2024 00:19:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=40580238</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=40580238</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40580238</guid></item><item><title><![CDATA[New comment by killcoder in "Sony overturns Helldivers 2 PSN requirement following backlash"]]></title><description><![CDATA[
<p>I wish this recent backlash was also spent on removing this rootkit. It's a PvE game, I hardly see why a rootkit is warranted.</p>
]]></description><pubDate>Mon, 06 May 2024 12:16:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=40273743</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=40273743</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40273743</guid></item><item><title><![CDATA[Driftsort: An efficient, generic and robust stable sort implementation]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/Voultapher/sort-research-rs/blob/main/writeup/driftsort_introduction/text.md">https://github.com/Voultapher/sort-research-rs/blob/main/writeup/driftsort_introduction/text.md</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40060925">https://news.ycombinator.com/item?id=40060925</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 17 Apr 2024 05:49:33 +0000</pubDate><link>https://github.com/Voultapher/sort-research-rs/blob/main/writeup/driftsort_introduction/text.md</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=40060925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40060925</guid></item><item><title><![CDATA[Overthinking Leetcode's Two Sum with SIMD]]></title><description><![CDATA[
<p>Article URL: <a href="https://darkcoding.net/software/two-sum/">https://darkcoding.net/software/two-sum/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39428556">https://news.ycombinator.com/item?id=39428556</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 19 Feb 2024 11:21:30 +0000</pubDate><link>https://darkcoding.net/software/two-sum/</link><dc:creator>killcoder</dc:creator><comments>https://news.ycombinator.com/item?id=39428556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39428556</guid></item></channel></rss>