<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: soohamr</title><link>https://news.ycombinator.com/user?id=soohamr</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 20 Jun 2026 08:59:32 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=soohamr" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by soohamr in "Ask HN: What are you working on? (June 2026)"]]></title><description><![CDATA[
<p>I thought that applying AI on 1v1 competitive pokemon would be a fun and educational experience on POMDPs and trying out reward free models on a problem that would be classically treated as a RL problem. This was only possible thanks to a lot of foundational work from the open-source community and last year's competitive pokemon NeurIPS track <a href="https://pokeagent.github.io/track1.html" rel="nofollow">https://pokeagent.github.io/track1.html</a> - they laid out the plugins to connect policy models to pokemon showdown for live play and evaluation.<p>I have already finished training the standard discriminative auto-regressive architectures by imitation learning on player actions, compared it with previous baselines set in the study. I want to match or exceed the best prior model Kakuna @ 142M params, but in a limited budget. JEPA style world models are showing promise when conditioned on actions [1] and frontier research on JEPA with trajectory straightening [2] shows that improved planning is natural outcome of improved representations.<p>If any good research ideas come out of this exploration then even better!<p>Currently fork with my models: <a href="https://github.com/sooham/metamon" rel="nofollow">https://github.com/sooham/metamon</a> (under checkpoints) 
Orginal source for pokeagents: <a href="https://github.com/metamon/metamon" rel="nofollow">https://github.com/metamon/metamon</a><p>[1] <a href="https://arxiv.org/abs/2603.19312" rel="nofollow">https://arxiv.org/abs/2603.19312</a>
[2] <a href="https://arxiv.org/html/2603.12231v1" rel="nofollow">https://arxiv.org/html/2603.12231v1</a><p>A good primer on world models from Welch Labs - one of my favourite ML youtubers: <a href="https://www.youtube.com/watch?v=kYkIdXwW2AE" rel="nofollow">https://www.youtube.com/watch?v=kYkIdXwW2AE</a></p>
]]></description><pubDate>Sun, 14 Jun 2026 20:45:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48532466</link><dc:creator>soohamr</dc:creator><comments>https://news.ycombinator.com/item?id=48532466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48532466</guid></item><item><title><![CDATA[New comment by soohamr in "Ask HN: What Are You Working On? (March 2026)"]]></title><description><![CDATA[
<p>I inherited a stake in a pyridine derivatives chemical plant - while I do not know much about chemical feedstocks and the chemical supply chain, I am trying to help the current partner optimize their yields and reduce losses across multiple stages of reactions across the feedstock and reagents. It is quite similar to hardware design and electrical engineering than I thought.<p>I have also taken an interest in learning distributed paradigms like  MPI and am using it on my own cluster of rPis</p>
]]></description><pubDate>Mon, 09 Mar 2026 07:09:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47305675</link><dc:creator>soohamr</dc:creator><comments>https://news.ycombinator.com/item?id=47305675</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47305675</guid></item><item><title><![CDATA[New comment by soohamr in "Ask HN: What's the next big thing that few people are talking about?"]]></title><description><![CDATA[
<p>Sooo GATTACA?</p>
]]></description><pubDate>Tue, 09 Aug 2022 02:10:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=32393522</link><dc:creator>soohamr</dc:creator><comments>https://news.ycombinator.com/item?id=32393522</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32393522</guid></item><item><title><![CDATA[New comment by soohamr in "Replicating GPT-2 at Home"]]></title><description><![CDATA[
<p>UWaterloo has such precocious students</p>
]]></description><pubDate>Sun, 24 Jan 2021 07:02:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=25889966</link><dc:creator>soohamr</dc:creator><comments>https://news.ycombinator.com/item?id=25889966</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=25889966</guid></item></channel></rss>