<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bnmik</title><link>https://news.ycombinator.com/user?id=bnmik</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 03 Jul 2026 07:55:23 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bnmik" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bnmik in "Ask HN: What are you working on? (May 2026)"]]></title><description><![CDATA[
<p>I am working on Desiderata (<a href="https://github.com/github-of-NMI/Desiderata" rel="nofollow">https://github.com/github-of-NMI/Desiderata</a>).<p>An LLM benchmark for open-weight models only, with secret questions.<p>The questions are asked multiple times to calculate a consistency score.<p>The results are available in JSON, containing the hash of the question with the number of correct and incorrect answers, the number of unique answers, and the number of times no answer is given. (Uses \boxed{})</p>
]]></description><pubDate>Sun, 10 May 2026 20:27:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48087599</link><dc:creator>bnmik</dc:creator><comments>https://news.ycombinator.com/item?id=48087599</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48087599</guid></item></channel></rss>