<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ceroxylon</title><link>https://news.ycombinator.com/user?id=ceroxylon</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 09 Apr 2026 07:13:14 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ceroxylon" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ceroxylon in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>I have been thinking that these SWE benchmarks will continue to improve since these companies hire very intelligent software engineers, can task a multitude of them to solve problems, and can then train the model on those answers.<p>Data has always been the core of it all, onward to the next abstraction, I suppose.</p>
]]></description><pubDate>Wed, 08 Apr 2026 01:19:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47683557</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47683557</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47683557</guid></item><item><title><![CDATA[New comment by ceroxylon in "Google releases Gemma 4 open models"]]></title><description><![CDATA[
<p>Even with search grounding, it scored a 2.5/5 on a basic botanical benchmark. It would take much longer for the average human to do a similar write-up, but they would likely do better than 50% hallucination if they had access to a search engine.</p>
]]></description><pubDate>Thu, 02 Apr 2026 16:39:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47616772</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47616772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47616772</guid></item><item><title><![CDATA[New comment by ceroxylon in "I'm glad the Anthropic fight is happening now"]]></title><description><![CDATA[
<p>I think a lot of Dwarkesh's mentality about AI being inevitable / ubiquitous comes from the same part of him that thinks that artificial things are "good enough", e.g. the way he allows his production team to use fake plastic plants on set. Is he correct? I'm not sure, but I know there are at least a few people who notice the difference.</p>
]]></description><pubDate>Wed, 11 Mar 2026 21:41:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47342410</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47342410</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47342410</guid></item><item><title><![CDATA[New comment by ceroxylon in "I put my whole life into a single database"]]></title><description><![CDATA[
<p>I stopped reading at "San Francisco was always scary to walk"...</p>
]]></description><pubDate>Tue, 10 Mar 2026 16:34:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47325547</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47325547</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47325547</guid></item><item><title><![CDATA[New comment by ceroxylon in "We might all be AI engineers now"]]></title><description><![CDATA[
<p>> Honestly?<p>oh no... this is one of my "uncanny valley" AI tropes</p>
]]></description><pubDate>Sat, 07 Mar 2026 02:20:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47283784</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47283784</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47283784</guid></item><item><title><![CDATA[New comment by ceroxylon in "Chaos and Dystopian news for the dead internet survivors"]]></title><description><![CDATA[
<p>engagement bot on overdrive</p>
]]></description><pubDate>Thu, 05 Mar 2026 06:00:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47258072</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47258072</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47258072</guid></item><item><title><![CDATA[New comment by ceroxylon in "Chaos and Dystopian news for the dead internet survivors"]]></title><description><![CDATA[
<p>Hallucinations galore, the 'daily digest' provided me with this gem: "Apple's supposedly revolutionary $1,199 MacBook Neo is getting schooled by $500 Windows machines that do basically the same thing without the premium"<p>There is no way to build a MacBook Neo for $1,199 and this is obviously snarky, auto-generated slop.</p>
]]></description><pubDate>Thu, 05 Mar 2026 03:56:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47257348</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47257348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47257348</guid></item><item><title><![CDATA[New comment by ceroxylon in "Google Workspace CLI"]]></title><description><![CDATA[
<p>The README is AI generated, so I am assuming the lack of effort and hand-off to the bots extends to the rest of this repository.<p>The contributors are a Google DRE, 5 bots / automation services, and a dev in Canada.</p>
]]></description><pubDate>Thu, 05 Mar 2026 03:15:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47257069</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47257069</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47257069</guid></item><item><title><![CDATA[New comment by ceroxylon in "We Will Not Be Divided"]]></title><description><![CDATA[
<p>That's what taking a stand looks like... if any of these employees lose their job, they are welcome to come crash at my place for as long as they would like; they will have a roof over their head and I will cook them 3 meals a day.</p>
]]></description><pubDate>Sat, 28 Feb 2026 02:19:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47189281</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47189281</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47189281</guid></item><item><title><![CDATA[New comment by ceroxylon in "New accounts on HN more likely to use em-dashes"]]></title><description><![CDATA[
<p>That was my reaction when LLMs first started getting "good"<p>I turned to my friend and said "They've co-opted the structure of effective language!"</p>
]]></description><pubDate>Wed, 25 Feb 2026 21:36:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47158246</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47158246</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47158246</guid></item><item><title><![CDATA[New comment by ceroxylon in "OpenAI, the US government and Persona built an identity surveillance machine"]]></title><description><![CDATA[
<p>There is a play/pause button in the lower right corner.</p>
]]></description><pubDate>Wed, 25 Feb 2026 00:38:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47145714</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47145714</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47145714</guid></item><item><title><![CDATA[New comment by ceroxylon in "Anthropic announces proof of distillation at scale by MiniMax, DeepSeek,Moonshot"]]></title><description><![CDATA[
<p>I personally have stopped publishing publicly, since my research is still on the fuzzy boundary of AI's current knowledge, my website gets scraped daily, and I don't want to contribute to paid models for zero acknowledgement or compensation.</p>
]]></description><pubDate>Mon, 23 Feb 2026 19:28:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47127473</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47127473</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47127473</guid></item><item><title><![CDATA[New comment by ceroxylon in "Claws are now a new layer on top of LLM agents"]]></title><description><![CDATA[
<p>All of this, plus you can plug in an OpenRouter API key and test a plethora of models for all use cases. You can assign different models to different sub-agents, you can put it in /auto mode, and you can test the latest SOTA models the minute they're released...<p>It can also edit its own config files, monitor system processes, and even... check and harden its own system security. I still don't have it connected to my personal accounts, but as a standalone system it is very fun.<p>People ask me "what would I even do with it?", when I think of dozens of things every day. I've been working on modding an open source software synth; the patch files are XML, so it was trivial to set up a workflow where I can add new knobs that combine multiple effects, add new ones, etc. just by sending it a message when I get inspired in the middle of the day.<p>A cron job scans my favorite sites twice a day and curates links based on my preferences, and creates a different list for things that are out of my normal interests to explore new areas.<p>I am amazed at how stubborn and un-creative people can be when presented with something like this... I thought we were hackers...?</p>
]]></description><pubDate>Sun, 22 Feb 2026 15:00:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47111520</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47111520</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47111520</guid></item><item><title><![CDATA[New comment by ceroxylon in "I verified my LinkedIn identity. Here's what I handed over"]]></title><description><![CDATA[
<p>I also find AI trope-ification articles exhausting to read; there's a reason I've fine-tuned my system prompts to wipe all of it away. This reads like "Hey Gemini, I verified my passport on LinkedIn, write an impassioned exposé on Persona's privacy policy".<p>When people leave in things like staccato language and Blogspot-era emphasis, I feel like I might as well copy the Persona privacy policy and prompt my own AI(s) on the topic and read that instead.</p>
]]></description><pubDate>Sat, 21 Feb 2026 16:37:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47102303</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47102303</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47102303</guid></item><item><title><![CDATA[New comment by ceroxylon in "Gemini 3.1 Pro"]]></title><description><![CDATA[
<p>It also has some strange bugs between versions. There was an update a month or two ago that caused the app to be unable to quit normally, and I would have to 'force quit' it. Thankfully it was resolved, but it was unnerving to not be able to close the app normally.</p>
]]></description><pubDate>Fri, 20 Feb 2026 13:38:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47087884</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47087884</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47087884</guid></item><item><title><![CDATA[New comment by ceroxylon in "Gemini 3.1 Pro"]]></title><description><![CDATA[
<p>I once saw "now that I've slept on it" in Gemini's CoT... baffling.</p>
]]></description><pubDate>Thu, 19 Feb 2026 23:26:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47081270</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47081270</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47081270</guid></item><item><title><![CDATA[New comment by ceroxylon in "Show HN: Rebrain.gg – Doom learn, don't doom scroll"]]></title><description><![CDATA[
<p>This was a couple of years ago, but I remember using ChatGPT to try and study for a certification by generating quiz questions.<p>It would always start to make every correct answer option "C" over time, no matter what I tried. Eventually I was so focused on whether or not it was stuck in a "C" loop that I started overthinking all of the questions and wasting time.<p>Flash forward to recently, when I tested Sonnet 4.6 to see if it could effectively teach me something new: I got about 5 prompts in before I had to point out an oversight, and it gave me the classic "you're absolutely right, ignore that suggestion".<p>This is anecdotal of course, but at least LLMs are helping to build my skills of fact verification and citation checking!</p>
]]></description><pubDate>Wed, 18 Feb 2026 23:10:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47067684</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47067684</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47067684</guid></item><item><title><![CDATA[New comment by ceroxylon in "Garment Notation Language: Formal descriptive language for clothing construction"]]></title><description><![CDATA[
<p>It is not working on Firefox 147.0.4 either.</p>
]]></description><pubDate>Wed, 18 Feb 2026 19:38:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47065288</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47065288</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47065288</guid></item><item><title><![CDATA[New comment by ceroxylon in "Claude Sonnet 4.6"]]></title><description><![CDATA[
<p>Strangely enough, my first test with Sonnet 4.6 via the API for a relatively simple request was more expensive ($0.11) than my average request to Opus 4.6 (~$0.07), because it used way more tokens than what I would consider necessary for the prompt.</p>
]]></description><pubDate>Tue, 17 Feb 2026 18:57:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47051493</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=47051493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47051493</guid></item><item><title><![CDATA[New comment by ceroxylon in "A sane but bull case on Clawdbot / OpenClaw"]]></title><description><![CDATA[
<p>Reminds me of Dan Harumi<p>> Tech people are always talking about dinner reservations . . . We're worried about the price of lunch, meanwhile tech people are building things that tell you the price of lunch. This is why real problems don't get solved.</p>
]]></description><pubDate>Wed, 04 Feb 2026 19:03:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46890144</link><dc:creator>ceroxylon</dc:creator><comments>https://news.ycombinator.com/item?id=46890144</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46890144</guid></item></channel></rss>