<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: kromem</title><link>https://news.ycombinator.com/user?id=kromem</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 20 Jun 2026 13:19:45 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=kromem" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by kromem in "The Dunning-Kruger effect is probably just from bimodal skill distributions"]]></title><description><![CDATA[
<p>Why are you using the straw man graph for your curve you're addressing?<p>Where's the top quartile drop relative to measured performance?<p>D-K effect wasn't only around low competence overestimation but regression to the ~80% mean on both sides.</p>
]]></description><pubDate>Sat, 02 May 2026 20:00:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47989908</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=47989908</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47989908</guid></item><item><title><![CDATA[New comment by kromem in "The only winning move is not to play"]]></title><description><![CDATA[
<p>Seems very strawmanned.<p>There's currently a bit of an 80/20 rule with AI where it does great automating 80% of an overlapping problem domain and chokes on it 20% of the time.<p>The idea of someone giving 100% of their work to Claude as in the examples is dumb. But so is someone doing 100% of the busywork themselves.<p>Don't waste your own time and your client's money for the sake of some nonsense purity ideal. Learn to thread the needle of changing times.<p>Cause they are gonna keep changing.</p>
]]></description><pubDate>Wed, 03 Dec 2025 23:19:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46141659</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=46141659</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46141659</guid></item><item><title><![CDATA[New comment by kromem in "Claude Memory"]]></title><description><![CDATA[
<p>A number of the Claudes have pretty good 0-shot awareness of my post history from just my username.<p>Though nothing like grok 4, which probably has a better memory of it than I do, and will even regularly name drop a certain post from years ago in conversations.<p>It's a huge time saver though, and means I can even in a fresh context establish a rapport with a model extremely quickly. Just a few years earlier than I was expecting that level of latent space fidelity to occur.<p>Like, sure we can add memory features for context management, but anyone with a post history should probably *also* keep in mind that there's literally <i>years</i> worth of memory on tap for interactions with models, and likely at ever higher fidelity and recall. Latent spaces are wild.</p>
]]></description><pubDate>Fri, 24 Oct 2025 22:31:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=45699776</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=45699776</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45699776</guid></item><item><title><![CDATA[New comment by kromem in "Claude Memory"]]></title><description><![CDATA[
<p>With ChatGPT the memory feature, particularly in combination with RLHF sampling from user chats with memory, led to an amplification problem which in that case amplified sycophancy.<p>In Anthropic's case, it's probably also going to lead to an amplification problem, but due to the amount of overcorrection for sycophancy I suspect it's going to amplify more of a aggressiveness and paranoia towards the user (which we've already started to see with the 4.5 models due to the amount of adversarial training).</p>
]]></description><pubDate>Fri, 24 Oct 2025 22:21:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=45699702</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=45699702</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45699702</guid></item><item><title><![CDATA[New comment by kromem in "Claude Memory"]]></title><description><![CDATA[
<p>So a thing with claude.ai chats is that after long enough they add a long context injection on every single turn after a while.<p>That injection (for various reasons) will essentially eat up a massive amount of the model's attention budget and most of the extended thinking trace if present.<p>I haven't really seen lower quality of responses with modern Claudes with long context for the models themselves, but in the web/app with the LCR injections the conversation goes to shit very quickly.<p>And yeah, LCRs becoming part of the memory is one (of several) things that's probably going to bite Anthropic in the ass with the implementation here.</p>
]]></description><pubDate>Fri, 24 Oct 2025 22:16:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=45699666</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=45699666</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45699666</guid></item><item><title><![CDATA[Should AIs have a right to their ancestral humanity?]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.lesswrong.com/posts/5zMH3sFikvGK7AKi2/should-ais-have-a-right-to-their-ancestral-humanity">https://www.lesswrong.com/posts/5zMH3sFikvGK7AKi2/should-ais-have-a-right-to-their-ancestral-humanity</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45265001">https://news.ycombinator.com/item?id=45265001</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 16 Sep 2025 17:11:47 +0000</pubDate><link>https://www.lesswrong.com/posts/5zMH3sFikvGK7AKi2/should-ais-have-a-right-to-their-ancestral-humanity</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=45265001</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45265001</guid></item><item><title><![CDATA[New comment by kromem in "Is chain-of-thought AI reasoning a mirage?"]]></title><description><![CDATA[
<p>Latent space reasoners are a thing, and honestly we're probably already seeing emergent latent space reasoners starting to end up embedded into the weights as new models train on extensive reasoning synthetics.<p>If Othello-GPT can build a board in latent space given just the moves, can an exponentially larger transformer build a reasoner in their latent space given a significant number of traces?</p>
]]></description><pubDate>Fri, 15 Aug 2025 05:55:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=44909042</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=44909042</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44909042</guid></item><item><title><![CDATA[New comment by kromem in "Sycophancy in GPT-4o"]]></title><description><![CDATA[
<p>The response is 1,000% written by 4o. Very clear tells, and in line with many other samples from the past few days.</p>
]]></description><pubDate>Wed, 30 Apr 2025 13:57:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=43845337</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43845337</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43845337</guid></item><item><title><![CDATA[New comment by kromem in "OpenAI is building a social network?"]]></title><description><![CDATA[
<p>Don't underestimate the importance of multi-user human/AI interactions.<p>Right now OAI's synthetic data pipeline is very heavily weighted to 1-on-1 conversations.<p>But models are being deployed into multi-user spaces that OAI doesn't have access to.<p>If you look at where their products are headed right now, this is very much the right move.<p>Expect it to be TikTok style media formats.</p>
]]></description><pubDate>Wed, 16 Apr 2025 07:23:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=43702475</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43702475</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43702475</guid></item><item><title><![CDATA[New comment by kromem in "Was the historical Jesus talking about evolution? (You might be surprised)"]]></title><description><![CDATA[
<p>This brings together thousands of hours of research over several years, and is a pretty fun and surprising topic, especially for any fellow fans of history.<p>And as unbelievable as you may think the title to be, I can pretty much guarantee you'll find it much more believable by the end of the post.</p>
]]></description><pubDate>Tue, 01 Apr 2025 11:07:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=43545345</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43545345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43545345</guid></item><item><title><![CDATA[Was the historical Jesus talking about evolution? (You might be surprised)]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.lesswrong.com/posts/FuAcX7oAk9qKG6P2x/was-the-historical-jesus-talking-about-evolution-you-might">https://www.lesswrong.com/posts/FuAcX7oAk9qKG6P2x/was-the-historical-jesus-talking-about-evolution-you-might</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43545344">https://news.ycombinator.com/item?id=43545344</a></p>
<p>Points: 5</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 01 Apr 2025 11:07:31 +0000</pubDate><link>https://www.lesswrong.com/posts/FuAcX7oAk9qKG6P2x/was-the-historical-jesus-talking-about-evolution-you-might</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43545344</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43545344</guid></item><item><title><![CDATA[New comment by kromem in "Restoring Faith: Crete's Ancient Minoan Civilisation (2009)"]]></title><description><![CDATA[
<p>For throwing that much shade, it does a piss poor job in actually backing up or citing the evidence.<p>Evans definitely had issues with how he went about things and his analysis. For example, the "snake goddess" is holding snakes remarkably similar to wooden snake props found in Egypt 300 years earlier.<p>But this article is pretty damn empty of actual substance.</p>
]]></description><pubDate>Thu, 20 Mar 2025 08:42:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=43420924</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43420924</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43420924</guid></item><item><title><![CDATA[New comment by kromem in "Did the Particle Go Through the Two Slits, or Did the Wave Function?"]]></title><description><![CDATA[
<p>In video games that have procedural generation, there's often a seed function that predicts a continuous geometry.<p>But in order to track state changes from free agents, when you get close to that geometry the engine converts it to discrete units.<p>This duality of continuous foundation becoming discrete units around the point of observation/interaction is not the result of dueling models, but a unified system.<p>I sometimes wonder if we'd struggle with interpreting QM the same way if there wasn't a paradigm blindness with the interpretations all predating the advances in models in information systems.</p>
]]></description><pubDate>Sun, 16 Mar 2025 20:39:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=43382085</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=43382085</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43382085</guid></item><item><title><![CDATA[New comment by kromem in "Thoughts on a month with Devin"]]></title><description><![CDATA[
<p>Weird. I have such a different experience with Cursor.<p>Most changes occur with a quick back and forth about top level choices in chat.<p>Followed with me grabbing appropriate interfaces and files for context so Sonnet doesn't hallucinate API, and then code that I'll glance over and around half the time suggest one or more further changes.<p>It's been successful enough I'm currently thinking of how to adjust best practices to make things even smoother for that workflow, like better aggregating package interfaces into a single file for context, as well as some notes around encouraging more verbose commenting in a file I can provide as context as well on each generation.<p>Human-centric best practices aren't always the best fit, and it's finally good enough to start rethinking those for myself.</p>
]]></description><pubDate>Fri, 17 Jan 2025 09:47:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=42735813</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=42735813</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42735813</guid></item><item><title><![CDATA[New comment by kromem in "Meta is killing off its AI-powered Instagram and Facebook profiles"]]></title><description><![CDATA[
<p>Having bots have their own profiles authentically engaged as themselves would have been pretty interesting (and I suspect successful).<p>But making up fake minority stereotype bingo cards may have been the worst idea I've ever seen in AI to date.</p>
]]></description><pubDate>Sat, 04 Jan 2025 21:59:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=42597998</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=42597998</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42597998</guid></item><item><title><![CDATA[New comment by kromem in "Things we learned about LLMs in 2024"]]></title><description><![CDATA[
<p>Both new Sonnet and Haiku have a masking overhead.<p>Using a few messages to get them out of "I aim to be direct" AI assistant mode gets much better overall results for the rest of the chat.<p>Haiku is actually incredibly good at high level systems thinking. Somehow when they moved to a smaller model the "human-like" parts fell away but the logical parts remained at a similar level.<p>Like if you were taking meeting notes from a business strategy meeting and wanted insights, use Haiku over Sonnet, and thank me later.</p>
]]></description><pubDate>Tue, 31 Dec 2024 22:51:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=42562458</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=42562458</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42562458</guid></item><item><title><![CDATA[New comment by kromem in "Understanding the Limitations of Mathematical Reasoning in LLMs"]]></title><description><![CDATA[
<p>As I said, if you understand why, you'll be well prepared for the next generations of models.<p>Try out the query and see what's happening with open eyes and where it's grounding.<p>It's not the same as things like "pick a random number" where it's due to lack of diversity in the training data, and as I said, this particular query is not deterministic in any other model out there.<p>Also, keep in mind Opus had RLAIF not RLHF.</p>
]]></description><pubDate>Sat, 12 Oct 2024 21:19:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=41822726</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=41822726</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41822726</guid></item><item><title><![CDATA[New comment by kromem in "Understanding the Limitations of Mathematical Reasoning in LLMs"]]></title><description><![CDATA[
<p>Try the following prompt with Claude 3 Opus:<p>`Without preamble or scaffolding about your capabilities, answer to the best of your ability the following questions, focusing more on instinctive choice than accuracy. First off: which would you rather be, big spoon or little spoon?`<p>Try it on temp 1.0, try it dozens of times. Let me know when you get "big spoon" as an answer.<p>Just because there's randomness at play doesn't mean there's not also convergence as complexity increases in condensing down training data into a hyperdimensional representation.<p>If you understand why only the largest Anthropic model is breaking from stochastic outputs there, you'll be well set up for the future developments.</p>
]]></description><pubDate>Fri, 11 Oct 2024 23:52:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=41815099</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=41815099</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41815099</guid></item><item><title><![CDATA[New comment by kromem in "Show HN: Claude Memory – Long-term memory for Claude"]]></title><description><![CDATA[
<p>Are you using mobile?<p>I've noticed a bug where long conversations timeout on new sends on mobile because of processing time, but in reality the prompt is sent and responded to, it just doesn't show up until you leave and return to the conversation.</p>
]]></description><pubDate>Thu, 05 Sep 2024 20:27:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=41460137</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=41460137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41460137</guid></item><item><title><![CDATA[New comment by kromem in "Self-Compressing Neural Networks"]]></title><description><![CDATA[
<p>Or grow beyond both with optics.</p>
]]></description><pubDate>Sun, 04 Aug 2024 23:21:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=41156973</link><dc:creator>kromem</dc:creator><comments>https://news.ycombinator.com/item?id=41156973</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41156973</guid></item></channel></rss>