<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: martinrolph</title><link>https://news.ycombinator.com/user?id=martinrolph</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 05 Jun 2026 04:15:37 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=martinrolph" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by martinrolph in "Embarrassingly simple self-distillation improves code generation"]]></title><description><![CDATA[
<p>Think of it less like a test suite and more like an exam. If you're trying to differentiate between the performance of different people/systems/models, you need to calibrate the difficulty accordingly.<p>When designing a benchmark, a pass rate of roughly 50% is useful because it gives you the most information about the relative performance of different models. If the pass rate is 90%+ too often, that means the test is too easy: you're wasting questions asking the model to do things we already know it can do, and getting no extra information. And if it's too low then you're wasting questions at the other end, trying to make it do impossible tasks.</p>
]]></description><pubDate>Sun, 05 Apr 2026 08:09:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47647231</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=47647231</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47647231</guid></item><item><title><![CDATA[New comment by martinrolph in "Endgame – A short story about machines optimising for engagement"]]></title><description><![CDATA[
<p>I wrote a stream of consciousness that turned into a piece of flash fiction. A peek just a little way into the future, inspired by the vertigo I felt on the announcements of Vibes and Sora 2, the AI-only short form video feeds. What happens, I wonder, when the AIs that are optimising your For You page for addiction achieve their goals? What is the endgame for addictive content?</p>
]]></description><pubDate>Thu, 30 Oct 2025 13:53:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=45760043</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=45760043</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45760043</guid></item><item><title><![CDATA[Endgame – A short story about machines optimising for engagement]]></title><description><![CDATA[
<p>Article URL: <a href="https://malagostudio.substack.com/p/endgame">https://malagostudio.substack.com/p/endgame</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45760042">https://news.ycombinator.com/item?id=45760042</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 30 Oct 2025 13:53:14 +0000</pubDate><link>https://malagostudio.substack.com/p/endgame</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=45760042</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45760042</guid></item><item><title><![CDATA[New comment by martinrolph in "Overlock – IoT Exception Tracking"]]></title><description><![CDATA[
<p>In an IoT deployment, an issue that is detected on one device (e.g. an exception on the server) may actually originate on an entirely different device (e.g. the end device sends some incorrect data). Overlock collects the state of all the devices involved in a given request, together with the associated logs, in one place so that you can track the error back to where it originated.</p>
]]></description><pubDate>Thu, 16 Nov 2017 15:43:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=15713547</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=15713547</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=15713547</guid></item><item><title><![CDATA[New comment by martinrolph in "Offload: Build seamless multiscreen interfaces"]]></title><description><![CDATA[
<p>When researching this I came across a few tools for using a phone as a trackpad, for example:<p><a href="https://play.google.com/store/apps/details?id=com.thingsstuff.touchpad&hl=en" rel="nofollow">https://play.google.com/store/apps/details?id=com.thingsstuf...</a><p>We couldn't find anything which worked as a digitiser, and that is certainly something we're considering.</p>
]]></description><pubDate>Tue, 04 Nov 2014 14:34:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=8556629</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=8556629</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8556629</guid></item><item><title><![CDATA[New comment by martinrolph in "Offload: Build seamless multiscreen interfaces"]]></title><description><![CDATA[
<p>Oops! Sorry, changed the title.</p>
]]></description><pubDate>Tue, 04 Nov 2014 13:41:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=8556402</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=8556402</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8556402</guid></item><item><title><![CDATA[Offload: Build seamless multiscreen interfaces]]></title><description><![CDATA[
<p>Article URL: <a href="http://offload.strikingly.com/">http://offload.strikingly.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=8556140">https://news.ycombinator.com/item?id=8556140</a></p>
<p>Points: 44</p>
<p># Comments: 9</p>
]]></description><pubDate>Tue, 04 Nov 2014 12:13:18 +0000</pubDate><link>http://offload.strikingly.com/</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=8556140</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8556140</guid></item><item><title><![CDATA[Ask HN: Are there any web-apps aimed at e-readers?]]></title><description><![CDATA[
<p>It would be great (for me at least) to be able to do several common activities on my Kindle. The main motivator is that I find it hard to sleep if I've been looking at screens late at night. For example, I'd love to be able to use some kind of messaging client.<p>I was never able to get Kindle Active Content (the closest thing there was to an app store) as I'm not US-based and that seems to be disappearing anyway.<p>Which means the only available option is using web-apps. Of course, the obvious problem is that they're totally unoptimised for e-ink screens where scrolling, colour and movement are all either poorly implemented or not available at all.<p>Are there any web-apps designed to work well with e-readers?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=8553417">https://news.ycombinator.com/item?id=8553417</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 03 Nov 2014 20:53:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=8553417</link><dc:creator>martinrolph</dc:creator><comments>https://news.ycombinator.com/item?id=8553417</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8553417</guid></item></channel></rss>