<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mufeedvh</title><link>https://news.ycombinator.com/user?id=mufeedvh</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 12:15:54 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mufeedvh" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by mufeedvh in "N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?"]]></title><description><![CDATA[
<p>This is a good idea.<p>Will incorporate false-positive rates into the rubric from the next run onwards.<p>At winfunc, we spent a lot of research time taming these models to eradicate false-positive rates (it's high!) so this does feel important enough to be documented. Thanks!</p>
]]></description><pubDate>Mon, 13 Apr 2026 22:37:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47758840</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47758840</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47758840</guid></item><item><title><![CDATA[N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?]]></title><description><![CDATA[
<p>N-Day-Bench tests whether frontier LLMs can find known security vulnerabilities in real repository code. Each month it pulls fresh cases from GitHub security advisories, checks out the repo at the last commit before the patch, and gives models a sandboxed bash shell to explore the codebase.<p>Static vulnerability discovery benchmarks become outdated quickly. Cases leak into training data, and scores start measuring memorization. The monthly refresh keeps the test set ahead of contamination — or at least makes the contamination window honest.<p>Each case runs three agents: a Curator reads the advisory and builds an answer key, a Finder (the model under test) gets 24 shell steps to explore the code and write a structured report, and a Judge scores the blinded submission. The Finder never sees the patch. It starts from sink hints and must trace the bug through actual code.<p>Only repos with 10k+ stars qualify. A diversity pass prevents any single repo from dominating the set. Ambiguous advisories (merge commits, multi-repo references, unresolvable refs) are dropped.<p>Currently evaluating GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, GLM-5.1, and Kimi K2.5. All traces are public.<p>Methodology: <a href="https://ndaybench.winfunc.com/methodology">https://ndaybench.winfunc.com/methodology</a><p>Live Leaderboard: <a href="https://ndaybench.winfunc.com/leaderboard">https://ndaybench.winfunc.com/leaderboard</a><p>Live Traces: <a href="https://ndaybench.winfunc.com/traces">https://ndaybench.winfunc.com/traces</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47758347">https://news.ycombinator.com/item?id=47758347</a></p>
<p>Points: 78</p>
<p># Comments: 26</p>
]]></description><pubDate>Mon, 13 Apr 2026 21:54:03 +0000</pubDate><link>https://ndaybench.winfunc.com</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47758347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47758347</guid></item><item><title><![CDATA[New comment by mufeedvh in "I made a programming language with M&Ms"]]></title><description><![CDATA[
<p>thank you! :)</p>
]]></description><pubDate>Mon, 09 Mar 2026 06:00:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47305342</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47305342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47305342</guid></item><item><title><![CDATA[New comment by mufeedvh in "I made a programming language with M&Ms"]]></title><description><![CDATA[
<p>Funny you mention that, because yes, a combinator-style encoding is probably a cleaner fit for the “only six colors constraint than my stack machine. I hacked together a tiny SKI-flavored M&M reducer as a proof of concept: B=S, G=K, R=I, Y=(, O=), and N... is a free atom, so `B G G NNN` reduces to `a2`.<p>Gist: <a href="https://gist.github.com/mufeedvh/db930a423fdce8c1d8e495c7a3ffdcdb" rel="nofollow">https://gist.github.com/mufeedvh/db930a423fdce8c1d8e495c7a3f...</a></p>
]]></description><pubDate>Mon, 09 Mar 2026 05:16:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47305134</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47305134</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47305134</guid></item><item><title><![CDATA[New comment by mufeedvh in "I made a programming language with M&Ms"]]></title><description><![CDATA[
<p>Yes, for messy real-world photos a lightweight CNN would probably outperform the deterministic decoder, but I’d still use it in a hybrid pipeline with classic CV for blob detection and deterministic logic for reconstructing the actual program.</p>
]]></description><pubDate>Mon, 09 Mar 2026 04:35:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47304921</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47304921</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47304921</guid></item><item><title><![CDATA[New comment by mufeedvh in "I made a programming language with M&Ms"]]></title><description><![CDATA[
<p>Yes! Just make sure to take a photo on a plain white surface is all.<p>With:<p><pre><code>  uv run mnm decompile path/to/photo.png --mode photo</code></pre></p>
]]></description><pubDate>Mon, 09 Mar 2026 01:08:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47303600</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47303600</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47303600</guid></item><item><title><![CDATA[New comment by mufeedvh in "I made a programming language with M&Ms"]]></title><description><![CDATA[
<p>Author of this silly project here!<p>Sharing a bit of backstory on why I decided to work on this; Firstly, “for fun” but primarily because I felt like I started losing the childlike wonder/whimsy I once had with programming.<p>So I started this new hobby where I ask myself “can I hack on this?” upon getting/seeing something.<p>For instance, I got this new Aula F75 keyboard (really good keyboard for the price btw, it sounds good too!) and it only has dedicated control software for Windows. So I downloaded the driver files, software executable, and manual sheet and reverse engineered the full protocol/packets and rebuilt it for my Mac. Then played snake with the backlights. Fun.<p>Anywho, happy to see my blog on the front page. Would love to hear if anyones going through something similar or working on silly little projects! :)</p>
]]></description><pubDate>Sun, 08 Mar 2026 21:47:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47301883</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47301883</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47301883</guid></item><item><title><![CDATA[I made a programming language with M&Ms]]></title><description><![CDATA[
<p>Article URL: <a href="https://mufeedvh.com/posts/i-made-a-programming-language-with-mnms/">https://mufeedvh.com/posts/i-made-a-programming-language-with-mnms/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47283723">https://news.ycombinator.com/item?id=47283723</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 07 Mar 2026 02:13:13 +0000</pubDate><link>https://mufeedvh.com/posts/i-made-a-programming-language-with-mnms/</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47283723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47283723</guid></item><item><title><![CDATA[How a single typo led to RCE in Firefox]]></title><description><![CDATA[
<p>Article URL: <a href="https://kqx.io/post/firefox0day/">https://kqx.io/post/firefox0day/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47006974">https://news.ycombinator.com/item?id=47006974</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 13 Feb 2026 19:55:16 +0000</pubDate><link>https://kqx.io/post/firefox0day/</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=47006974</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47006974</guid></item><item><title><![CDATA[The Recent 0-Days in Node.js and React Were Found by an AI]]></title><description><![CDATA[
<p>Article URL: <a href="https://winfunc.com/blog/recent-0-days-in-nodejs-and-react-were-found-by-an-ai">https://winfunc.com/blog/recent-0-days-in-nodejs-and-react-were-found-by-an-ai</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46866345">https://news.ycombinator.com/item?id=46866345</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 03 Feb 2026 04:04:42 +0000</pubDate><link>https://winfunc.com/blog/recent-0-days-in-nodejs-and-react-were-found-by-an-ai</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=46866345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46866345</guid></item><item><title><![CDATA[New Vulnerability in React Server Components – CVE-2026-23864]]></title><description><![CDATA[
<p>Article URL: <a href="https://vercel.com/changelog/summary-of-cve-2026-23864">https://vercel.com/changelog/summary-of-cve-2026-23864</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46771593">https://news.ycombinator.com/item?id=46771593</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 26 Jan 2026 21:11:39 +0000</pubDate><link>https://vercel.com/changelog/summary-of-cve-2026-23864</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=46771593</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46771593</guid></item><item><title><![CDATA[New comment by mufeedvh in "Show HN: Code2prompt – Generate LLM prompts from your codebase"]]></title><description><![CDATA[
<p>Yes, this just depends on the model you're using. Small-medium size codebases would fit inside Claude's 200K context window and Gemini 1.5 has a 1M context window which would essentially fit 99% of codebases.<p>For reference:<p>- The Flask web framework for Python: 131880 tokens<p>- The Spring Framework for Java: 11070559 tokens</p>
]]></description><pubDate>Wed, 13 Mar 2024 00:39:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=39686687</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=39686687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39686687</guid></item><item><title><![CDATA[New comment by mufeedvh in "Show HN: Code2prompt – Generate LLM prompts from your codebase"]]></title><description><![CDATA[
<p>Merged, thank you! :)</p>
]]></description><pubDate>Tue, 12 Mar 2024 20:55:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=39684838</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=39684838</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39684838</guid></item><item><title><![CDATA[New comment by mufeedvh in "Show HN: Code2prompt – Generate LLM prompts from your codebase"]]></title><description><![CDATA[
<p>For small codebases, you can run this tool on the entire directory and it would generate a well-formatted Markdown prompt detailing the source tree structure, and all the code. You can then upload this document to either GPT or Claude models with higher context windows and ask it to:<p>- Rewrite the code to another language.<p>- Find bugs/security vulnerabilities.<p>- Document the code.<p>- Implement new features.<p>You can customize the prompt template to achieve any of the desired use cases. It essentially traverses a codebase and creates a prompt with all source files combined. In short, it automates copy-pasting multiple source files into your prompt and formatting them along with letting you know how many tokens your code consumes.</p>
]]></description><pubDate>Tue, 12 Mar 2024 16:22:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=39681341</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=39681341</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39681341</guid></item><item><title><![CDATA[Show HN: Code2prompt – Generate LLM prompts from your codebase]]></title><description><![CDATA[
<p>I made code2prompt, a CLI tool to generate LLM prompts from your codebase with support for prompt templating and token counting.<p>I initially wrote this for personal use to utilize Claude 3.0's 200K context window and it has proven to be pretty useful so I decided to open-source it. Let me know what you think.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39672932">https://news.ycombinator.com/item?id=39672932</a></p>
<p>Points: 4</p>
<p># Comments: 7</p>
]]></description><pubDate>Mon, 11 Mar 2024 20:32:28 +0000</pubDate><link>https://github.com/mufeedvh/code2prompt</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=39672932</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39672932</guid></item><item><title><![CDATA[New comment by mufeedvh in "Krita fund has no corporate support"]]></title><description><![CDATA[
<p>I have made a simple CLI utility[0] with this purpose in mind. It scans your entire filesystem for README.md and FUNDING.yml files for a set of donation/sponsor links and tag it with the associated repo (No HTTP calls, just the assumption that most repos link their support URL in either of these files). The output is a CSV sheet containing the open-source dependencies/libraries you use in your system that accepts donations.<p>I have plans to expand/plug this into a donation aggregator platform like you mentioned if time permits. But if there is an existing effort for the same, I am happy to contribute. :)<p>[0] - <a href="https://github.com/mufeedvh/paydept">https://github.com/mufeedvh/paydept</a></p>
]]></description><pubDate>Thu, 05 Oct 2023 13:38:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=37778448</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=37778448</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37778448</guid></item><item><title><![CDATA[Nobody’s on the Ball on AGI Alignment]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.forourposterity.com/nobodys-on-the-ball-on-agi-alignment/">https://www.forourposterity.com/nobodys-on-the-ball-on-agi-alignment/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35362080">https://news.ycombinator.com/item?id=35362080</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 29 Mar 2023 19:28:55 +0000</pubDate><link>https://www.forourposterity.com/nobodys-on-the-ball-on-agi-alignment/</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=35362080</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35362080</guid></item><item><title><![CDATA[Security in the age of LLMs]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.mufeedvh.com/llm-security/">https://www.mufeedvh.com/llm-security/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=33918026">https://news.ycombinator.com/item?id=33918026</a></p>
<p>Points: 42</p>
<p># Comments: 6</p>
]]></description><pubDate>Fri, 09 Dec 2022 05:53:53 +0000</pubDate><link>https://www.mufeedvh.com/llm-security/</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=33918026</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33918026</guid></item><item><title><![CDATA[New comment by mufeedvh in "Show HN: Binserve – Fast single-binary static web server"]]></title><description><![CDATA[
<p>Thanks! :)<p>It currently does gzip compression by default. Compression modes for specific files sounds interesting, I will definitely get around to implementing that.</p>
]]></description><pubDate>Mon, 13 Jun 2022 17:31:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=31728684</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=31728684</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31728684</guid></item><item><title><![CDATA[New comment by mufeedvh in "Show HN: Binserve – Fast single-binary static web server"]]></title><description><![CDATA[
<p>Backstory: What started as a personal project to quickly host some web pages turned into a rabbit hole of yak shaving and that is how I ended up making Binserve. I automated the steps I usually take to host static pages into this project which is tweakable via the configuration file. And its also pretty fast.<p>Let me know your feedback/suggestions!</p>
]]></description><pubDate>Mon, 13 Jun 2022 16:43:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=31727999</link><dc:creator>mufeedvh</dc:creator><comments>https://news.ycombinator.com/item?id=31727999</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31727999</guid></item></channel></rss>