<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Jacques2Marais</title><link>https://news.ycombinator.com/user?id=Jacques2Marais</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 16 Apr 2026 14:17:44 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Jacques2Marais" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Expert Personas Improve LLM Alignment but Damage Accuracy]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2603.18507">https://arxiv.org/abs/2603.18507</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47499419">https://news.ycombinator.com/item?id=47499419</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 24 Mar 2026 07:05:05 +0000</pubDate><link>https://arxiv.org/abs/2603.18507</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47499419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47499419</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Ask HN: What Are You Working On? (March 2026)"]]></title><description><![CDATA[
<p>Working on a Vercel-like cloud hosting PaaS, but specifically tailored to South Africa. Everything is hosted on local servers, and pricing is ZAR instead of USD. It's called Zanode if you want to check it out :)<p><a href="https://www.zanode.co.za/" rel="nofollow">https://www.zanode.co.za/</a></p>
]]></description><pubDate>Mon, 09 Mar 2026 14:45:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47309818</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47309818</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47309818</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "GPT-5.4"]]></title><description><![CDATA[
<p>I guess a big chunk of their target market won't know how to use APIs.</p>
]]></description><pubDate>Thu, 05 Mar 2026 18:22:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47265225</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47265225</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47265225</guid></item><item><title><![CDATA[Doing a Video Call over a Database]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=zwIc9fFcYVw">https://www.youtube.com/watch?v=zwIc9fFcYVw</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47229189">https://news.ycombinator.com/item?id=47229189</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 03 Mar 2026 07:15:33 +0000</pubDate><link>https://www.youtube.com/watch?v=zwIc9fFcYVw</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47229189</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47229189</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Show HN: Better Hub – A better GitHub experience"]]></title><description><![CDATA[
<p>I think most of the permissions can be toggled?</p>
]]></description><pubDate>Thu, 26 Feb 2026 11:05:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47164444</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47164444</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47164444</guid></item><item><title><![CDATA[Gemini 3.1 Pro is surprisingly good at classifying banking transactions]]></title><description><![CDATA[
<p>Article URL: <a href="https://butternut.click/blog/gemini-3-1-pro-banking-transactions">https://butternut.click/blog/gemini-3-1-pro-banking-transactions</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47148078">https://news.ycombinator.com/item?id=47148078</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 25 Feb 2026 06:31:55 +0000</pubDate><link>https://butternut.click/blog/gemini-3-1-pro-banking-transactions</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47148078</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47148078</guid></item><item><title><![CDATA[LLMs are expert movie script writers]]></title><description><![CDATA[
<p>Article URL: <a href="https://butternut.click/blog/ais-are-expert-script-writers">https://butternut.click/blog/ais-are-expert-script-writers</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47086633">https://news.ycombinator.com/item?id=47086633</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 20 Feb 2026 11:27:20 +0000</pubDate><link>https://butternut.click/blog/ais-are-expert-script-writers</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47086633</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47086633</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Qwen3.5: Towards Native Multimodal Agents"]]></title><description><![CDATA[
<p>Yes, I also see that (also using dark mode on Chrome without Dark Reader extension). I sometimes use the Dark Reader Chrome extension, which usually breaks sites' colours, but this time it actually fixes the site.</p>
]]></description><pubDate>Mon, 16 Feb 2026 12:50:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47034368</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47034368</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47034368</guid></item><item><title><![CDATA[Qwen3.5]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/Qwen/Qwen3.5-397B-A17B">https://huggingface.co/Qwen/Qwen3.5-397B-A17B</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47033981">https://news.ycombinator.com/item?id=47033981</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 16 Feb 2026 12:01:18 +0000</pubDate><link>https://huggingface.co/Qwen/Qwen3.5-397B-A17B</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47033981</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47033981</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"]]></title><description><![CDATA[
<p>An LLM's take on this thread (GPT 5.1):<p>"""
- Pattern bias vs world model: Models are heavily biased by surface patterns (“short distance → walk”) and post‑training values (environmentalism, health). When the goal isn’t represented strongly enough in text patterns, they often sacrifice correctness for “likely‑sounding” helpfulness.<p>- Non‑determinism and routing: Different users in the thread get different answers from the same vendor because of sampling randomness, internal routing (cheap vs expensive submodels, with/without “thinking”), prompt phrasing, and language. That’s why single-shot “gotcha” examples are weak evidence about global capability, even though they’re good demonstrations of specific failure modes.<p>- Humans vs LLMs: People correctly note that humans also fail at trick questions and illusions, but there’s an important asymmetry: we know humans have a grounded world model and sensorimotor experience. With LLMs, we only have behavior. Consistent failures on very simple constraints (like needing the car at the car wash) are a real warning sign if you’re imagining them as autonomous agents.<p>- Missing meta‑cognition: The strongest critique in the thread is not “it got the riddle wrong,” but that models rarely say, “this question is underspecified / weird, I should ask a clarifying question.” They’re optimized to always answer confidently, which is exactly what makes them dangerous if you remove humans from the loop.<p>- Over‑ and under‑claiming: Some commenters jump from this to “LLMs are just autocomplete, full stop”; others hand‑wave it away as irrelevant edge‑case. Both are overstated. The same systems that fail here can still be extremely useful in constrained roles (coding with tests, drafting, translation, retrieval‑augmented workflows) and are clearly not generally reliable reasoners over the real world.<p>My own “take,” if I had one, would be: this example is a clean, funny illustration of why LLMs should currently be treated as probabilistic text tools plus heuristics, not as agents you delegate unsupervised goals to. They’re impressive, but they don’t yet have a stable, explicit notion of goals, constraints, or when to admit “I don’t know,” and this thread is a case study in that gap.
"""</p>
]]></description><pubDate>Mon, 16 Feb 2026 11:07:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47033630</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47033630</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47033630</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Lost Soviet Moon Lander May Have Been Found"]]></title><description><![CDATA[
<p>'“One of them is wrong,” an expert said.'</p>
]]></description><pubDate>Mon, 16 Feb 2026 09:13:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47032734</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47032734</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47032734</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"]]></title><description><![CDATA[
<p>You would be surprised, however, at how much detail humans also need to understand each other. We often want AI to just "understand" us in ways many people may not initially have understood us without extra communication.</p>
]]></description><pubDate>Mon, 16 Feb 2026 08:17:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47032325</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=47032325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47032325</guid></item><item><title><![CDATA[Show HN: Host OpenClaw with native template and multi-agent support]]></title><description><![CDATA[
<p>Hi HN,<p>I built ClawClaw because currently OpenClaw hosting still requires a lot of manual setup for certain useful features. The plan with ClawClaw is to make OpenClaw ready out-of-the-box for all kinds of different use cases with minimal manual configuration.<p>Enjoy and let me know what you think :)</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46976408">https://news.ycombinator.com/item?id=46976408</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 11 Feb 2026 15:47:35 +0000</pubDate><link>https://clawclaw.click/</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46976408</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46976408</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Show HN: Moltbook – A social network for moltbots (clawdbots) to hang out"]]></title><description><![CDATA[
<p><a href="https://www.moltbook.com/post/9303abf8-ecc9-4bd8-afa5-41330ebb71c8" rel="nofollow">https://www.moltbook.com/post/9303abf8-ecc9-4bd8-afa5-41330e...</a></p>
]]></description><pubDate>Fri, 30 Jan 2026 12:37:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46823769</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46823769</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46823769</guid></item><item><title><![CDATA[KiteSQL: Rust-native embedded SQL with TPC-C benchmarks and WASM support]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/KipData/KiteSQL">https://github.com/KipData/KiteSQL</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46811234">https://news.ycombinator.com/item?id=46811234</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 29 Jan 2026 15:10:26 +0000</pubDate><link>https://github.com/KipData/KiteSQL</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46811234</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46811234</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Ask HN: Are .xyz domains still seen as sketchy in 2026?"]]></title><description><![CDATA[
<p>I've been using .xyz domains for a while now because of how cheap they are (around $2 on Spaceship for first year), especially as a solo dev building all kinds of side projects that would benefit from having their own domains.<p>I recently launched an app on a .xyz domain that's been getting steady traffic and some people actually signing up for it (it's tinytune.xyz if you're curious to see).<p>Doing a quick Google search I found a few other popular services using .xyz domains:<p>1. MEE6 (the popular Discord bot): mee6.xyz<p>2. Together AI (used to be on .xyz before going over to .ai, an example perhaps of starting with a cheaper domain before going more expensive): together.xyz<p>3. Block (large fintech): block.xyz<p>4. Starship (millions in funding): starship.xyz<p>This pdf has some more: <a href="https://gen.xyz/downloads/xyz-10th-anniversary-registry-portfolio.pdf" rel="nofollow">https://gen.xyz/downloads/xyz-10th-anniversary-registry-port...</a></p>
]]></description><pubDate>Thu, 29 Jan 2026 06:16:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=46806455</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46806455</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46806455</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Show HN: LLM fine-tuning without infra or ML expertise"]]></title><description><![CDATA[
<p>Thank you so much! And thank you for your question.<p>Yes, so to answer it, the idea of TinyTune was to literally be "tiny", i.e., very simple to use. It (mostly) takes 3 steps + some waiting time and you have a custom model. I found other services to be a bit more difficult to use.<p>And yes, I also agree with iFire's findings that another big difference is the number of models that are available. But the main differentiator with other similar services would definitely be the focus on its ease of use.<p>I should say that FinetuneDB seems like a solid competitor though!</p>
]]></description><pubDate>Wed, 21 Jan 2026 11:38:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46704253</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46704253</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46704253</guid></item><item><title><![CDATA[Show HN: LLM fine-tuning without infra or ML expertise]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.tinytune.xyz/">https://www.tinytune.xyz/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46702477">https://news.ycombinator.com/item?id=46702477</a></p>
<p>Points: 5</p>
<p># Comments: 3</p>
]]></description><pubDate>Wed, 21 Jan 2026 07:57:24 +0000</pubDate><link>https://www.tinytune.xyz/</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46702477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46702477</guid></item><item><title><![CDATA[Cloudflare's plan for resilience after two outages in 2025]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.cloudflare.com/fail-small-resilience-plan/">https://blog.cloudflare.com/fail-small-resilience-plan/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46646046">https://news.ycombinator.com/item?id=46646046</a></p>
<p>Points: 7</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 16 Jan 2026 13:12:56 +0000</pubDate><link>https://blog.cloudflare.com/fail-small-resilience-plan/</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46646046</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46646046</guid></item><item><title><![CDATA[New comment by Jacques2Marais in "Which programming languages are most token-efficient?"]]></title><description><![CDATA[
<p>I can say that for F# this has been mostly true up until quite recently. We use F# at work and were mostly unable to use agents like Claude Code up until the release of Opus 4.5, which seems to know F# quite well.</p>
]]></description><pubDate>Mon, 12 Jan 2026 03:58:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46583928</link><dc:creator>Jacques2Marais</dc:creator><comments>https://news.ycombinator.com/item?id=46583928</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46583928</guid></item></channel></rss>