<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: senko</title><link>https://news.ycombinator.com/user?id=senko</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 28 May 2026 21:12:26 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=senko" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Great idea!<p>I have a static server of my own, so here's my list (of all the tests I published so far): <a href="https://senko.net/vibecode-bench/" rel="nofollow">https://senko.net/vibecode-bench/</a></p>
]]></description><pubDate>Thu, 28 May 2026 21:00:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315395</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48315395</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315395</guid></item><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>I'm saving them all as gists here: <a href="https://gist.github.com/senko" rel="nofollow">https://gist.github.com/senko</a><p>But I just vibe-coded a handy list of all the tests I did (unfortunately without the commentary I usually leave in social media posts -- I should add those at some point): <a href="https://senko.net/vibecode-bench/" rel="nofollow">https://senko.net/vibecode-bench/</a></p>
]]></description><pubDate>Thu, 28 May 2026 20:59:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315389</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48315389</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315389</guid></item><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>It's a combination of reasoning effort (max) + enabling workflow that orchestrates multiple sub-agents.<p>After some interrogation, here's how it organized the work:<p>1. Design workflow (rts-game-design, 11 agents, ~13 min) ran first, produced SPEC.md + DESIGN.md:<p>1.1. Proposals (3 parallel agents): each designed a complete RTS from a different philosophy<p>1.2 Judge (1 agent): evaluated all three and synthesized one unified design, committing to specific numbers (costs, HP, map size, etc.).<p>1.3 Deep-dives (6 parallel agents): each wrote an implementation-ready spec for one subsystem, all consistent with the chosen design<p>1.4 Synthesis (1 agent): merged the design + all six subsystem specs into one conflict-free master spec<p>2. Code-review workflow (rts-code-review, 25 agents, ~5 min),  ran after the main agent had written and tested the code:<p>2.1 Review (6 agents, read-only Explore type): each scrutinized one dimension and returned structured findings.<p>2.2. Verify (19 agents): every finding got its own skeptic agent told to try to refute it,  Result: 19 flagged → 16 confirmed, 3 rejected as non-bugs.<p>What the main agent did in the main loop:<p>- Wrote all ~2,400 lines of index.html by hand from the spec.<p>- All browser testing/debugging via headless Chrome (I told it to use rodney by @simonw, love the tool :)<p>- Applied all 16 fixes from the review and re-verified them in the browser.</p>
]]></description><pubDate>Thu, 28 May 2026 20:49:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315260</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48315260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315260</guid></item><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Yeah looks extremely compact. I didn't instruct it or told it to use as few lines of code or characters or nothing of the sort.<p>Not sure why it did that. Its own rationale (which is highly suspect, but the only lead I have) is that it defaults to dense style if it has to write a file in a single go. May be a kernel of truth somewhere in there.</p>
]]></description><pubDate>Thu, 28 May 2026 20:40:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315151</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48315151</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315151</guid></item><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>It took 50 minutes, would be ~$20 in API costs (I'm on a Pro sub).</p>
]]></description><pubDate>Thu, 28 May 2026 20:36:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315105</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48315105</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315105</guid></item><item><title><![CDATA[New comment by senko in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>My fav coding benchmark for frontier models is to build a simple RTS game in one file (js/html/css). Claude Code with Opus 4.8 in ultracode mode nailed it, the best result so far:<p><a href="https://bsky.app/profile/senko.net/post/3mmwnrkwboc2v" rel="nofollow">https://bsky.app/profile/senko.net/post/3mmwnrkwboc2v</a><p>The prompt was: <i>Create a simple but functional real time strategy (RTS) game similar to old WarCraft, StarCraft or Command & Conquer games. The player should be able to build buildings, create units, gather resources and should uncover the whole map. No AI or multiplayer needed. Use simple but nice-looking graphics. No sound. Implement everything in HTML/CSS/JS, everything in a single file (you can use 3rd-party js or css libraries/frameworks via CDN).</i></p>
]]></description><pubDate>Thu, 28 May 2026 18:34:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48313432</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48313432</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48313432</guid></item><item><title><![CDATA[New comment by senko in "Why Japanese companies do so many different things"]]></title><description><![CDATA[
<p>Do you have a good book, article(s) or podcast to recommend on this topic (business culture in East Asia, or a specific country), in English & approachable by Westerners, while not giving just a cartoon overview of the issues?</p>
]]></description><pubDate>Sat, 23 May 2026 14:49:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48248197</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48248197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48248197</guid></item><item><title><![CDATA[New comment by senko in "Gemini Omni"]]></title><description><![CDATA[
<p>Did they film on location, tho?</p>
]]></description><pubDate>Tue, 19 May 2026 21:21:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48199834</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48199834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48199834</guid></item><item><title><![CDATA[New comment by senko in "Gemini Omni"]]></title><description><![CDATA[
<p>Have you seen the past dozen or so Marvel movies?</p>
]]></description><pubDate>Tue, 19 May 2026 21:21:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48199827</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48199827</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48199827</guid></item><item><title><![CDATA[New comment by senko in "Why is almost everyone right-handed? A new study connects it to bipedalism"]]></title><description><![CDATA[
<p>Left-to-right writing systems are optimized for right-hand use. Two examples:<p>* if you're left-handed, your hand smudges over the ink before it dries. There are various contortions that some left-handed people do (hover the hand or wrap it around from above) - right handed ones don't need any of that.<p>* stroke patterns, as usually learnt in school, result in pushing away if left handed, vs drawing to, if you're right handed. This results in less ideal strokes, and if you're working with a sharp pencil/pen on a sensitive paper, this can tear the paper. If you're working with a felt-tip pen, the line width/pressure suffers as well.<p>That said, if you really make an effort, you can have a pretty decent handwriting if you're left handed. And if you are forced to use right hand when learning handwriting, you can still have a pretty decent handwriting.<p>I'm not familiar with details of chinese handwriting (what's easier/better if you're left vs right handed), wouldn't be surprise the constraints are similar.<p>So I guess your remark about messy handwriting is related to the strict standards for the students (which includes expectation they must write with right hand).</p>
]]></description><pubDate>Tue, 19 May 2026 20:48:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48199406</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48199406</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48199406</guid></item><item><title><![CDATA[New comment by senko in "Leaving GitHub for Forgejo"]]></title><description><![CDATA[
<p>This is literaly just a bare repo over ssh, and a gitweb interface.<p>It's too trivial for anyone to be selling that. And I don't think there's a large market for $5-$10 barebones setup when GH is free and you can self host.</p>
]]></description><pubDate>Wed, 13 May 2026 13:57:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48121966</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48121966</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48121966</guid></item><item><title><![CDATA[New comment by senko in "Screenshots of Old Desktop OSes"]]></title><description><![CDATA[
<p>A few weeks ago I was co-hosting a live coding session (in front of a crowd, it was pretty collaborative, back-and-forth).<p>I had to authorize something with Firebase, for which I had to auth with Google, for which I had to do a MFA with my (Pixel) phone.<p>Usually it's "are you trying to auth" and finger-to-the-scanner, but around that time this particular way didn't work. It also didn't want to send me a text or a call to auth me.<p>No, I had to find an OTP code. Easy, right? Wrong. The instructions, and the docs, don't match where it was in that particular version of Android, and there were a bunch of blind alleys that were named basically the same.<p>It took me like 10 minutes, on stage, browsing my phone (thankfully, not casted to screen) to find the friggin' option. Thankfully the cohost was doing the presenting at that time, but it was pretty lousy.<p>And this is using Google's OS on a Google phone doing a Google auth flow for a Google property. And I'm a techie who's been using Android for 15+ years now. And I did the exact same dance a few weeks before that - also so roundabout I had no idea how I stumbled on the correct page.<p>User experience my ass.<p>PS. The regular "are you trying to sign in?" flow works again. No idea what happened - wasn't me.</p>
]]></description><pubDate>Tue, 12 May 2026 17:57:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48111812</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48111812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48111812</guid></item><item><title><![CDATA[New comment by senko in "The Future of Obsidian Plugins"]]></title><description><![CDATA[
<p>> I want to use Obsidian... but I won't as long as it's not open source.<p>Sooo... don't use it?<p>There are plenty of open source alternatives, and I'm sure someone's going to mention org-mode.</p>
]]></description><pubDate>Tue, 12 May 2026 17:43:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48111627</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48111627</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48111627</guid></item><item><title><![CDATA[New comment by senko in "If AI writes your code, why use Python?"]]></title><description><![CDATA[
<p>And a non-sequitur.</p>
]]></description><pubDate>Tue, 12 May 2026 07:44:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48105386</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48105386</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48105386</guid></item><item><title><![CDATA[New comment by senko in "Software engineering may no longer be a lifetime career"]]></title><description><![CDATA[
<p>If you're on the top, you probably aren't coding much. So you're more in management than getting your hands dirty.</p>
]]></description><pubDate>Mon, 11 May 2026 17:57:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48098323</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48098323</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48098323</guid></item><item><title><![CDATA[New comment by senko in "AMÁLIA and the future of European Portuguese LLMs"]]></title><description><![CDATA[
<p>Hello from 23rd</p>
]]></description><pubDate>Mon, 11 May 2026 17:52:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48098247</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48098247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48098247</guid></item><item><title><![CDATA[New comment by senko in "The OpenAI Deployment Company"]]></title><description><![CDATA[
<p>Mostly.<p>Many of these companies do build up internal know-how.<p>In case of big consulting companies, they have huge internal knowledge bases, decks, cases etc from previous engagements. In case of product companies with FDEs, the feedback from customers ideally trickles down to product improvements. In case of OpenAI, they can improve their models.<p>Whether that's <i>actually</i> valuable enough to turn a blind eye to the downsides of consulting, I don't know.</p>
]]></description><pubDate>Mon, 11 May 2026 17:06:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48097649</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48097649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48097649</guid></item><item><title><![CDATA[New comment by senko in "The OpenAI Deployment Company"]]></title><description><![CDATA[
<p>I run a dev agency, and I can spot one when I see one.<p>The trouble with dev agencies, "services companies", integration specialists and "forward deployed engineers" is that they scale lineraly with the number of people.<p>You can't 100x your revenue without at least 80x-ing your headcount.Oh, you might go for that <i>once</i> due to AI - but so can everyone else. After that, it's boring linear growth.<p>When I say "boring", I don't mean as "capitalism requires exponential growth" critique. I mean OpenAI valuation is not priced for that. They're priced for singularity. If the bulk of their revenue turns out to be bodyshop, that's...quite a different math.<p>The way to charge big with this kind of work is to do what big consultancies (MBB, IBM, etc) do: brand equity and (supposed) expertise in solving domain problems. OpenAI has ... interesting tech.<p>It's going to be interesting seeing if they can pull this off. If I were a betting man, my money would be on "no".</p>
]]></description><pubDate>Mon, 11 May 2026 15:03:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48095960</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48095960</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48095960</guid></item><item><title><![CDATA[Claude Mythos Opens the Cybersecurity Pandora's Box]]></title><description><![CDATA[
<p>Article URL: <a href="https://shiftmag.dev/claude-mythos-opens-the-cybersecurity-pandoras-box-9622/">https://shiftmag.dev/claude-mythos-opens-the-cybersecurity-pandoras-box-9622/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48095144">https://news.ycombinator.com/item?id=48095144</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 11 May 2026 14:01:19 +0000</pubDate><link>https://shiftmag.dev/claude-mythos-opens-the-cybersecurity-pandoras-box-9622/</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48095144</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48095144</guid></item><item><title><![CDATA[New comment by senko in "Local AI needs to be the norm"]]></title><description><![CDATA[
<p>I love this line:<p>> <i>Stop shipping distributed systems when you meant to ship a feature.</i><p>But not in the contex the author meant.<p>Many people don't realize that when you have a frontend, a backend (several instances, for failover/scaling), a (separate) database, maybe some object store -- you have a distributed system.<p>A recent article[0] touched on that, although most HN commenters[1] latched on the "go" part. But there's something to avoiding rube goldberg machines where we don't need them.<p>[0] <a href="https://blainsmith.com/articles/just-fucking-use-go/" rel="nofollow">https://blainsmith.com/articles/just-fucking-use-go/</a><p>[1] <a href="https://news.ycombinator.com/item?id=48062997">https://news.ycombinator.com/item?id=48062997</a></p>
]]></description><pubDate>Mon, 11 May 2026 13:48:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48094975</link><dc:creator>senko</dc:creator><comments>https://news.ycombinator.com/item?id=48094975</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48094975</guid></item></channel></rss>