<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: neversupervised</title><link>https://news.ycombinator.com/user?id=neversupervised</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 29 Apr 2026 07:59:57 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=neversupervised" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by neversupervised in "SWE-bench Verified no longer measures frontier coding capabilities"]]></title><description><![CDATA[
<p>Terminal Bench is the future</p>
]]></description><pubDate>Sun, 26 Apr 2026 15:04:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47910915</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47910915</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47910915</guid></item><item><title><![CDATA[New comment by neversupervised in "SWE-bench Verified no longer measures frontier coding capabilities"]]></title><description><![CDATA[
<p>But this is the good kind of goalpost moving</p>
]]></description><pubDate>Sun, 26 Apr 2026 15:04:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47910912</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47910912</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47910912</guid></item><item><title><![CDATA[New comment by neversupervised in "Show HN: Terminal-Wrench, a dataset of 331 realistic hackable environments"]]></title><description><![CDATA[
<p>That paper focuses on breaking the harness, the same hack applies to all tasks. Here we are breaking tasks individually. If these were put on a different, more secure harness, most of the exploits would still work.</p>
]]></description><pubDate>Wed, 15 Apr 2026 15:19:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47780355</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47780355</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47780355</guid></item><item><title><![CDATA[Show HN: Terminal-Wrench, a dataset of 331 realistic hackable environments]]></title><description><![CDATA[
<p>I want to share a new dataset of 331 reward-hackable environments. These are real environments used in Terminal Bench and adjacent benchmarks. I first got interested in this because, as a reviewer of Terminal Bench, I noticed a lot of our tasks were hackable. I also noticed that many contributors to the benchmark do so because it provides credibility when selling environments to labs. Hence, TBench tasks are, in my opinion, held to a higher quality standard than those being used today for RL. No one is spending hours manually reviewing the $1B in tasks being purchased by major labs. As far as I understand, while everyone knows environments are hackable, nobody has released hundreds of "realistic" environments.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47773298">https://news.ycombinator.com/item?id=47773298</a></p>
<p>Points: 6</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 15 Apr 2026 00:42:30 +0000</pubDate><link>https://github.com/few-sh/terminal-wrench</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47773298</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47773298</guid></item><item><title><![CDATA[New comment by neversupervised in "90% of CEOs Say AI Changed Nothing. The Other 10% Have a PR Team"]]></title><description><![CDATA[
<p>This is nonsense. I’m sorry. AI will completely upend the workplace and the economy. Whether that’s self evident today in the numbers in the way that we track those numbers, which is based on how things have historically worked, is not relevant. First principles thinking is enough.<p>C’mon. Stop wishing for a future that feels convenient. This is not the world in which we live. Everything will change. Let’s help people accept and react to that.Let’s stop with the comfort talking and false hope.</p>
]]></description><pubDate>Tue, 14 Apr 2026 15:31:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=47766974</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47766974</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47766974</guid></item><item><title><![CDATA[New comment by neversupervised in "How to Make a Good Terminal Bench Task"]]></title><description><![CDATA[
<p>I've been a contributor and reviewer for terminal bench since last August, and this post is about what I've learned designing and reviewing tasks. The guidance is broadly applicable to anyone building an agentic benchmark.I would love feedback from the HN community.</p>
]]></description><pubDate>Mon, 23 Mar 2026 18:41:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47493463</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47493463</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47493463</guid></item><item><title><![CDATA[How to Make a Good Terminal Bench Task]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/neversupervised/status/2035455298417430911">https://twitter.com/neversupervised/status/2035455298417430911</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47493447">https://news.ycombinator.com/item?id=47493447</a></p>
<p>Points: 3</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 23 Mar 2026 18:39:47 +0000</pubDate><link>https://twitter.com/neversupervised/status/2035455298417430911</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47493447</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47493447</guid></item><item><title><![CDATA[New comment by neversupervised in "Reports of code's death are greatly exaggerated"]]></title><description><![CDATA[
<p>Can you explain what you think will happen, actually? People at OpenAi and Anthropic aren’t longer coding by hand. Are you saying everyone changes their mind and goes back? Not gonna happen. You have to work around this new constrain.</p>
]]></description><pubDate>Mon, 23 Mar 2026 14:34:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47490129</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47490129</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47490129</guid></item><item><title><![CDATA[New comment by neversupervised in "Reports of code's death are greatly exaggerated"]]></title><description><![CDATA[
<p>The author’s intuition is still backward calibrated, even though he talks about the future. He doesn’t have an intuition for the future. All code will be AI generated. There’s no way to compete with the AI. And whatever new downsides this brings will be solved in ways we aren’t fully anticipating. But the solution is not to walk back vibecoding. You have to be blind to believe not most code will be vibecoded very soon.</p>
]]></description><pubDate>Sun, 22 Mar 2026 20:35:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47481853</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47481853</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47481853</guid></item><item><title><![CDATA[New comment by neversupervised in "Atlassian says it had right to fire engineer for suggesting CEO is 'rich jerk'"]]></title><description><![CDATA[
<p>There’s no reason a company should put up with enemies within. In rare instances a disgruntled employee might be able to make a positive contribution. In most cases, even if the employee has valid reasons, by the time they are disgruntled there’s no coming back. It’s best for everyone to move on.</p>
]]></description><pubDate>Sun, 22 Mar 2026 16:19:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47479072</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47479072</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47479072</guid></item><item><title><![CDATA[New comment by neversupervised in "I'm OK being left behind, thanks"]]></title><description><![CDATA[
<p>The mistake is that 1 every N waves of hype are in fact monumental shifts and it makes sense to embrace as soon as possible. Also being early to the right thing can have massive implications in appreciating the shift before the general public, which is upstream from making smart resource allocation (investments, career choices). I have a friend that went to super early OpenAI as a designer. He has more equity than most AI researchers there and made a 0.001% amount of wealth. Being early does very much matter in the right conditions.</p>
]]></description><pubDate>Sat, 21 Mar 2026 14:27:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47467347</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47467347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47467347</guid></item><item><title><![CDATA[New comment by neversupervised in "Warranty Void If Regenerated"]]></title><description><![CDATA[
<p>I don't oppose reading AI generated content in principle, but because it's free to generate, I always am less likely to read super long prose that is AI generated. So the question is whether someone has taken the time to keep it as long as necessary but not longer. Or if there are ways to make it easier for me to commit to the experience, with a sort of TLDR</p>
]]></description><pubDate>Wed, 18 Mar 2026 22:35:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47432254</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47432254</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47432254</guid></item><item><title><![CDATA[New comment by neversupervised in "UBI as a productivity dividend"]]></title><description><![CDATA[
<p>UBI will likely be necessary but that won’t appease society. Everyone wants to have a chance to climb the ladder. If it becomes self evident that humans can no longer have a meaningful impact on their outcome, there’ll be riots whether they have a roof and food or not.</p>
]]></description><pubDate>Sat, 14 Mar 2026 18:16:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47379464</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47379464</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47379464</guid></item><item><title><![CDATA[New comment by neversupervised in "AI didn't simplify software engineering: It just made bad engineering easier"]]></title><description><![CDATA[
<p>The goal has nothing to do with you being employed. Your job security is a consequence of the ultimate goal to build AGI. And software development salaries and employment will be affected before getting there. In my opinion, we already past the SWE peak as far as yearly salary. Yes there are super devs working on AI making a lot of dough, but I consider that a particular specialty. On average the salary of a new grad SWE in the US is past its peak if you consider how many new grads can’t get a job.</p>
]]></description><pubDate>Sat, 14 Mar 2026 16:05:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47378041</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47378041</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47378041</guid></item><item><title><![CDATA[New comment by neversupervised in "Coding after coders: The end of computer programming as we know it?"]]></title><description><![CDATA[
<p>It’s crazy how some people feel the ai and others don’t. But one group is wrong. It’s a matter of time before everyone feels the AI.</p>
]]></description><pubDate>Sat, 14 Mar 2026 05:34:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47373668</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47373668</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47373668</guid></item><item><title><![CDATA[New comment by neversupervised in "Coding after coders: The end of computer programming as we know it?"]]></title><description><![CDATA[
<p>I think you’re a bit behind on your world view. Just because it’s inconvenient to you that non coders can now code, doesn’t make it untrue.</p>
]]></description><pubDate>Sat, 14 Mar 2026 05:33:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47373662</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47373662</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47373662</guid></item><item><title><![CDATA[New comment by neversupervised in "Cloudflare crawl endpoint"]]></title><description><![CDATA[
<p>Tell more. Crawling is not a new idea. How did they abuse you?</p>
]]></description><pubDate>Wed, 11 Mar 2026 14:23:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47335954</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47335954</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47335954</guid></item><item><title><![CDATA[New comment by neversupervised in "Yann LeCun raises $1B to build AI that understands the physical world"]]></title><description><![CDATA[
<p>Is it good? This will almost certainly fail. Not because Yann or Europe, but because these sort of hyper-hyped projects fail. SSI and Thinking Machines haven’t lived to the hype.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:16:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324421</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47324421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324421</guid></item><item><title><![CDATA[New comment by neversupervised in "Tell HN: I'm 60 years old. Claude Code has re-ignited a passion"]]></title><description><![CDATA[
<p>Interesting bifurcation between developers that get energized by AI coding and those that feel depressed. Only one side will come out on top, even if it’s for a limited time.</p>
]]></description><pubDate>Sat, 07 Mar 2026 16:12:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47288891</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47288891</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47288891</guid></item><item><title><![CDATA[New comment by neversupervised in "Little Free Library"]]></title><description><![CDATA[
<p>I find these to be cute, romantic almost. But I have never found anything worth borrowing. I wonder what is the real impact in terms of additional books read. I do love the concept of spreading knowledge in the neighborhood. I'd be curious about other similar approaches.</p>
]]></description><pubDate>Mon, 02 Mar 2026 06:04:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47214400</link><dc:creator>neversupervised</dc:creator><comments>https://news.ycombinator.com/item?id=47214400</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47214400</guid></item></channel></rss>