<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: tgw43279w</title><link>https://news.ycombinator.com/user?id=tgw43279w</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 25 Apr 2026 12:47:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=tgw43279w" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by tgw43279w in "Show HN: How I Topped the HuggingFace Open LLM Leaderboard on Two Gaming GPUs"]]></title><description><![CDATA[
<p>Very cool, thanks for sharing! Recovering 96% using just two blocks on IMN-1k, wow!</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:24:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324540</link><dc:creator>tgw43279w</dc:creator><comments>https://news.ycombinator.com/item?id=47324540</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324540</guid></item><item><title><![CDATA[New comment by tgw43279w in "Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs"]]></title><description><![CDATA[
<p>That was a fun read! The base64 decoding and encoding is quite interesting. A parallel: these models are surprisingly robust to heavy word mangling. Back in 2023, people often used this trick to jailbreak models, and what was more surprising is that the models even understood the mangled text. I always thought of it this way: there must be some circuitry in the model that maps these almost unrecognizable words/sentences onto their rectified versions. What your base64 experiment shows is that they can also encode in the other direction! (However, models are known to be unable to produce mangled output that looks convincingly random. I think the base64 transformation is more mechanical in this regard, and hence it's easier for them to reverse.)
So your layer circuit hypothesis aligns well with my mental model of how these models work, based on the interpretability work I'm familiar with! I also really like the way you used heatmaps as a tool to derive layer insights; very intuitive! And it's genuinely surprising that you can simply duplicate layers and achieve better results that generalize!
This is research-grade effort! I'm confident you could publish this at NeurIPS or ICML if you wrote it up as a paper. I'm quite impressed. Great work!</p>
]]></description><pubDate>Tue, 10 Mar 2026 14:12:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47323520</link><dc:creator>tgw43279w</dc:creator><comments>https://news.ycombinator.com/item?id=47323520</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47323520</guid></item><item><title><![CDATA[New comment by tgw43279w in "Show HN: Semantic Grep – A Word2Vec-powered search tool"]]></title><description><![CDATA[
<p>I really like how simple the implementation is!</p>
]]></description><pubDate>Sun, 28 Jul 2024 11:24:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=41092515</link><dc:creator>tgw43279w</dc:creator><comments>https://news.ycombinator.com/item?id=41092515</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41092515</guid></item><item><title><![CDATA[New comment by tgw43279w in "Linear Book Scanner – Open-source automatic book scanner (2014)"]]></title><description><![CDATA[
<p>Regarding your point about a successor to LaTeX: <a href="https://typst.app/" rel="nofollow noreferrer">https://typst.app/</a> is turning out to be great.</p>
]]></description><pubDate>Sun, 17 Sep 2023 16:41:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=37546953</link><dc:creator>tgw43279w</dc:creator><comments>https://news.ycombinator.com/item?id=37546953</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37546953</guid></item></channel></rss>