<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: remilouf</title><link>https://news.ycombinator.com/user?id=remilouf</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 23 Apr 2026 18:35:16 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=remilouf" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by remilouf in "The M×N problem of tool calling and open-source models"]]></title><description><![CDATA[
<p>> Ironically LLMs solve the MxN problem he's complaining about<p>Enlighten me please</p>
]]></description><pubDate>Wed, 15 Apr 2026 09:09:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47776539</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47776539</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47776539</guid></item><item><title><![CDATA[New comment by remilouf in "The M×N problem of tool calling and open-source models"]]></title><description><![CDATA[
<p>Ooops sorry</p>
]]></description><pubDate>Tue, 14 Apr 2026 17:05:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47768297</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47768297</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47768297</guid></item><item><title><![CDATA[New comment by remilouf in "The M×N problem of tool calling and open-source models"]]></title><description><![CDATA[
<p>Author here. You're right, it's not a hard problem, but a particularly annoying one.</p>
]]></description><pubDate>Tue, 14 Apr 2026 16:14:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47767575</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47767575</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47767575</guid></item><item><title><![CDATA[The M×N problem of tool calling and open-source models]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.thetypicalset.com/blog/grammar-parser-maintenance-contract">https://www.thetypicalset.com/blog/grammar-parser-maintenance-contract</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47704729">https://news.ycombinator.com/item?id=47704729</a></p>
<p>Points: 159</p>
<p># Comments: 50</p>
]]></description><pubDate>Thu, 09 Apr 2026 15:07:13 +0000</pubDate><link>https://www.thetypicalset.com/blog/grammar-parser-maintenance-contract</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47704729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47704729</guid></item><item><title><![CDATA[New comment by remilouf in "I run my company from Emacs"]]></title><description><![CDATA[
<p>I haven't always done this, and the knowledge base used to visibly degrade over time. Reviewing a PR does not take a long time, maybe a few minutes, and this compounds over time.</p>
]]></description><pubDate>Fri, 03 Apr 2026 13:45:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47626608</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47626608</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47626608</guid></item><item><title><![CDATA[I run my company from Emacs]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.thetypicalset.com/blog/emacs-as-ceo-operating-system">https://www.thetypicalset.com/blog/emacs-as-ceo-operating-system</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47626006">https://news.ycombinator.com/item?id=47626006</a></p>
<p>Points: 20</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 03 Apr 2026 12:40:21 +0000</pubDate><link>https://www.thetypicalset.com/blog/emacs-as-ceo-operating-system</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=47626006</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47626006</guid></item><item><title><![CDATA[Every AI Integration Is Held Together with Parsing Logic and Prayer]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.dottxt.co/do-one-thing-well.html">https://blog.dottxt.co/do-one-thing-well.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44519905">https://news.ycombinator.com/item?id=44519905</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 10 Jul 2025 11:45:40 +0000</pubDate><link>https://blog.dottxt.co/do-one-thing-well.html</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=44519905</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44519905</guid></item><item><title><![CDATA[New comment by remilouf in ".txt raises $11.9M to make language models programmable"]]></title><description><![CDATA[
<p>This is actually pretty funny.</p>
]]></description><pubDate>Sun, 20 Oct 2024 15:17:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=41895854</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=41895854</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41895854</guid></item><item><title><![CDATA[New comment by remilouf in "Show HN: Infinite Testimonials with FastHTML, Outlines, and Modal"]]></title><description><![CDATA[
<p>That’d be a pretty inefficient way to generate bullshit at scale</p>
]]></description><pubDate>Tue, 10 Sep 2024 19:41:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=41504705</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=41504705</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41504705</guid></item><item><title><![CDATA[New comment by remilouf in "[dead]"]]></title><description><![CDATA[
<p>LLM evaluations are very sensitive to the details of the prompt's structure. This post shows how using structured generation reduces the results' variance and the ranking shifts.</p>
]]></description><pubDate>Thu, 02 May 2024 14:53:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=40237005</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=40237005</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40237005</guid></item><item><title><![CDATA[New comment by remilouf in "Tool Use (function calling)"]]></title><description><![CDATA[
<p>Looks like it’s quite the opposite: <a href="http://blog.dottxt.co/performance-gsm8k.html" rel="nofollow">http://blog.dottxt.co/performance-gsm8k.html</a></p>
]]></description><pubDate>Fri, 05 Apr 2024 17:19:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=39944856</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39944856</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39944856</guid></item><item><title><![CDATA[New comment by remilouf in "Structured Generation Improves LLM Performance: GSM8K Benchmark"]]></title><description><![CDATA[
<p>What do you mean by "semantic dimension"?</p>
]]></description><pubDate>Fri, 15 Mar 2024 14:21:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=39716071</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39716071</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39716071</guid></item><item><title><![CDATA[New comment by remilouf in "Structured Generation Improves LLM Performance: GSM8K Benchmark"]]></title><description><![CDATA[
<p>That whole structured generation line of work looks promising. I hope someone else takes this and runs evaluations on other benchmarks. Curious to see if the results translate!</p>
]]></description><pubDate>Fri, 15 Mar 2024 14:06:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=39715873</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39715873</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39715873</guid></item><item><title><![CDATA[New comment by remilouf in "LoRAX and Outlines: Better JSON Extraction with Structured Generation and LoRA"]]></title><description><![CDATA[
<p>Awesome work! I am really impressed by how much structured generation improves model performance.</p>
]]></description><pubDate>Tue, 05 Mar 2024 18:24:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=39607317</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39607317</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39607317</guid></item><item><title><![CDATA[New comment by remilouf in "Tokenization Is a Problem for LLMs"]]></title><description><![CDATA[
<p>This article presents a way to make structured generation with LLMs much faster than standard generation, but what I find most interesting is how, towards the end, it highlights the issues that tokenization entails.</p>
]]></description><pubDate>Sun, 04 Feb 2024 16:13:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=39251434</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39251434</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39251434</guid></item><item><title><![CDATA[Tokenization Is a Problem for LLMs]]></title><description><![CDATA[
<p>Article URL: <a href="http://blog.dottxt.co/coalescence.html">http://blog.dottxt.co/coalescence.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39251433">https://news.ycombinator.com/item?id=39251433</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 04 Feb 2024 16:13:07 +0000</pubDate><link>http://blog.dottxt.co/coalescence.html</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=39251433</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39251433</guid></item><item><title><![CDATA[New comment by remilouf in "Show HN: VLLM with JSON Guided Generation"]]></title><description><![CDATA[
<p>We already support regex-guided generation in the library, and could easily make an API to serve this as well if that's a feature people want!</p>
]]></description><pubDate>Fri, 22 Dec 2023 16:20:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=38735595</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=38735595</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38735595</guid></item><item><title><![CDATA[Show HN: Vectorize OpenAI API calls (like NumPy)]]></title><description><![CDATA[
<p>Outlines is a Python library that focuses on text generation with large language models. While the library mainly focuses on open-source models, we also have an integration with the OpenAI API.
When trying to implement Tree of Thoughts, we became frustrated with the iterative workflow that other libraries forced on us. What we wanted was to be able to pass an array of prompts or completions with arbitrary shape, ask for several samples, and get an array of completions back.<p>So we implemented a vectorization routine inspired by NumPy's. This gives our OpenAI integration the "broadcast" behavior described above. All calls are performed concurrently.<p>We thought this could be of use to some folks here who are trying to implement complex workflows with the OpenAI API.</p>
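<p>A minimal sketch of the idea, not the actual Outlines code: nested lists stand in for arrays, and <code>fake_complete</code> and <code>vectorized_complete</code> are hypothetical names, with <code>fake_complete</code> replacing the real API call. Every prompt gets several samples, all calls are submitted to a thread pool at once, and the result keeps the shape of the input with one extra axis for the samples.</p>

```python
from concurrent.futures import ThreadPoolExecutor


def fake_complete(prompt: str, sample: int) -> str:
    # Hypothetical stand-in for a real completion request.
    return f"{prompt} / completion {sample}"


def vectorized_complete(prompts, samples=1, complete=fake_complete, max_workers=8):
    """Apply `complete` to every prompt in a nested list, preserving its shape.

    Each prompt (a str leaf) is replaced by a list of `samples` completions.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:

        def submit(node):
            # Leaves are prompts: schedule one call per requested sample.
            if isinstance(node, str):
                return [pool.submit(complete, node, i) for i in range(samples)]
            # Inner nodes are lists: recurse and keep the nesting.
            return [submit(child) for child in node]

        def collect(node):
            # Mirror the nesting again, waiting on each future at the leaves.
            if isinstance(node, list):
                return [collect(child) for child in node]
            return node.result()

        # Everything is submitted before any result is awaited,
        # so the calls run concurrently.
        return collect(submit(prompts))


grid = [["a", "b"], ["c", "d"]]
completions = vectorized_complete(grid, samples=2)  # shape 2 x 2 x 2
```

<p>Threads are one choice among several here; an async client would work just as well for I/O-bound calls.</p>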
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38252197">https://news.ycombinator.com/item?id=38252197</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 13 Nov 2023 16:46:05 +0000</pubDate><link>https://github.com/outlines-dev/outlines</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=38252197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38252197</guid></item><item><title><![CDATA[New comment by remilouf in "Show HN: LLMs can generate valid JSON 100% of the time"]]></title><description><![CDATA[
<p>It is currently limited by the time it takes to build the index. There are obvious optimizations we can apply here; however, in a production setting it does not matter much, since you only need to build the index once for each (schema, vocabulary) pair.</p>
]]></description><pubDate>Thu, 17 Aug 2023 21:47:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=37168594</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=37168594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37168594</guid></item><item><title><![CDATA[New comment by remilouf in "Show HN: LLMs can generate valid JSON 100% of the time"]]></title><description><![CDATA[
<p>You mean nested JSON? It's totally possible.</p>
]]></description><pubDate>Thu, 17 Aug 2023 21:44:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=37168543</link><dc:creator>remilouf</dc:creator><comments>https://news.ycombinator.com/item?id=37168543</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37168543</guid></item></channel></rss>