<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: croqaz</title><link>https://news.ycombinator.com/user?id=croqaz</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 11:42:49 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=croqaz" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>Thank you very much! It is humbling and motivating to see other people interested in this.</p>
]]></description><pubDate>Fri, 12 Jun 2026 19:20:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508340</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48508340</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508340</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>Do share! I read all the blog posts where people share their experiences of building small scale LLMs "from scratch".</p>
]]></description><pubDate>Fri, 12 Jun 2026 19:18:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508325</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48508325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508325</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>That's a fair point TBH. I said in my post that this LLM is first of all a learning project and I skipped an important step: the training loop.  But on the other hand, how many data scientists are writing their own training loops? Is it even worth it?  And how much learning do you want for one project, I mean, where do you stop? Why use "Huggingface Transformers" when you can write it from scratch, for learning? Why use Torch when you can write it from scratch, for learning? Why use Python when you can write in C, etc. It's cheating, right?
In my case, I decided to skip the training loop and focus on the data processing and the hyper params and the rest of the higher level steps that took a ton of time anyway, and I reduced the friction.
I do get your point tho. Now that I know how to train an LLM, maybe I'll write a training loop from scratch as a project, to learn how to do it.</p>
]]></description><pubDate>Fri, 12 Jun 2026 19:17:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508304</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48508304</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508304</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>"A more granular followup would be cool too"<p>Do you mind expanding this question? More granular in what way? what would you like to know that is missing from the post?</p>
]]></description><pubDate>Fri, 12 Jun 2026 13:50:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48504055</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48504055</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48504055</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>That's exactly what I had in mind. When I started this, I was jumping back and forth between this thought: "Can this model size actually generate logical English text?" and I played with a few different models of the same size and I was really really depressed when seeing how bad they are.... but then I discovered more and more tiny models and LaMini-125M, LaMini-256M, and nanowhale-100m, and SmolLM2-135M-Instruct are very very decent. So I decided to give it a try.</p>
]]></description><pubDate>Fri, 12 Jun 2026 13:49:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48504032</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48504032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48504032</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>It looks like ROT13 text to me, I hope it's not Welsh. Don't want to offend anyone if that's their actual language :)</p>
]]></description><pubDate>Fri, 12 Jun 2026 10:56:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48502450</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48502450</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48502450</guid></item><item><title><![CDATA[New comment by croqaz in "Making a vintage LLM from scratch"]]></title><description><![CDATA[
<p>I am creating my tiny Llama 340M base model from scratch. If you're curious about the steps, challenges and cost, read on. I am still working on the instruct model.</p>
]]></description><pubDate>Thu, 11 Jun 2026 08:38:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48487830</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48487830</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48487830</guid></item><item><title><![CDATA[Making a vintage LLM from scratch]]></title><description><![CDATA[
<p>Article URL: <a href="https://crlf.link/log/entries/260525-1/">https://crlf.link/log/entries/260525-1/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48487829">https://news.ycombinator.com/item?id=48487829</a></p>
<p>Points: 103</p>
<p># Comments: 29</p>
]]></description><pubDate>Thu, 11 Jun 2026 08:38:00 +0000</pubDate><link>https://crlf.link/log/entries/260525-1/</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=48487829</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48487829</guid></item><item><title><![CDATA[New comment by croqaz in "TwoFold (2f), CLI text expander/template engine"]]></title><description><![CDATA[
<p>TwoFold is a small command line app that allows plain text files to behave like dynamic files. It is a hybrid between a text expande, a template engine and a mini programming language.</p>
]]></description><pubDate>Sun, 11 May 2025 18:28:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=43955835</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=43955835</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43955835</guid></item><item><title><![CDATA[TwoFold (2f), CLI text expander/template engine]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/ShinyTrinkets/twofold.ts">https://github.com/ShinyTrinkets/twofold.ts</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43955834">https://news.ycombinator.com/item?id=43955834</a></p>
<p>Points: 17</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 11 May 2025 18:28:46 +0000</pubDate><link>https://github.com/ShinyTrinkets/twofold.ts</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=43955834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43955834</guid></item><item><title><![CDATA[New comment by croqaz in "[dead]"]]></title><description><![CDATA[
<p>What are web snapshots; comparison of the most popular methods: WARC, HTML, rrWeb and a better alternative called "recorded".</p>
]]></description><pubDate>Fri, 12 Aug 2022 14:31:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=32439261</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=32439261</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32439261</guid></item><item><title><![CDATA[New comment by croqaz in "Blocking adware, malware, and tracking sites"]]></title><description><![CDATA[
<p>DYI blocking, a few methods</p>
]]></description><pubDate>Tue, 21 Dec 2021 10:39:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=29636014</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=29636014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29636014</guid></item><item><title><![CDATA[Blocking adware, malware, and tracking sites]]></title><description><![CDATA[
<p>Article URL: <a href="https://crlf.link/log/entries/211220-1/">https://crlf.link/log/entries/211220-1/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=29636013">https://news.ycombinator.com/item?id=29636013</a></p>
<p>Points: 1</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 21 Dec 2021 10:39:11 +0000</pubDate><link>https://crlf.link/log/entries/211220-1/</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=29636013</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29636013</guid></item><item><title><![CDATA[Harden and secure browsers in containers, with GUI]]></title><description><![CDATA[
<p>Article URL: <a href="https://crlf.link/log/entries/211008-1/">https://crlf.link/log/entries/211008-1/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=28801119">https://news.ycombinator.com/item?id=28801119</a></p>
<p>Points: 58</p>
<p># Comments: 31</p>
]]></description><pubDate>Fri, 08 Oct 2021 16:28:52 +0000</pubDate><link>https://crlf.link/log/entries/211008-1/</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=28801119</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28801119</guid></item><item><title><![CDATA[New comment by croqaz in "Tomb: File Encryption on GNU/Linux"]]></title><description><![CDATA[
<p>Tomb consists of a simple shell script (Zsh) using standard filesystem tools (GNU) and the cryptographic API of the Linux kernel (cryptsetup and LUKS).</p>
]]></description><pubDate>Tue, 10 Aug 2021 20:29:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=28133988</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=28133988</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28133988</guid></item><item><title><![CDATA[Tomb: File Encryption on GNU/Linux]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.dyne.org/software/tomb/">https://www.dyne.org/software/tomb/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=28133987">https://news.ycombinator.com/item?id=28133987</a></p>
<p>Points: 6</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 10 Aug 2021 20:29:06 +0000</pubDate><link>https://www.dyne.org/software/tomb/</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=28133987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28133987</guid></item><item><title><![CDATA[Things that only someone who has been programming 20 years would know]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.quora.com/What-are-some-things-that-only-someone-who-has-been-programming-20-50-years-would-know/answer/John-Byrd-2?share=1">https://www.quora.com/What-are-some-things-that-only-someone-who-has-been-programming-20-50-years-would-know/answer/John-Byrd-2?share=1</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=27990400">https://news.ycombinator.com/item?id=27990400</a></p>
<p>Points: 23</p>
<p># Comments: 7</p>
]]></description><pubDate>Wed, 28 Jul 2021 22:09:01 +0000</pubDate><link>https://www.quora.com/What-are-some-things-that-only-someone-who-has-been-programming-20-50-years-would-know/answer/John-Byrd-2?share=1</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=27990400</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27990400</guid></item><item><title><![CDATA[New comment by croqaz in "Fountain pen ink properties"]]></title><description><![CDATA[
<p>The same in Romania. In the 90s, everyone I know had to learn how to write with a fountain pen.
For me at least, I write horribly when I use a ballpoint pen, vs a fountain pen. Because the ballpoint slides on the paper much too easy, like on ice, and you need much more discipline to write nicely.
A fountain pen nib is generally more scratchy and feels easier to control.
My 2 cents anyway...</p>
]]></description><pubDate>Sat, 13 Jun 2020 21:10:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=23512759</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=23512759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=23512759</guid></item><item><title><![CDATA[New comment by croqaz in "Fountain pen ink properties"]]></title><description><![CDATA[
<p>I love my Plaisir too! I have a green color and of course, it's always inked with green :)</p>
]]></description><pubDate>Sat, 13 Jun 2020 21:06:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=23512733</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=23512733</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=23512733</guid></item><item><title><![CDATA[New comment by croqaz in "Fountain pen ink properties"]]></title><description><![CDATA[
<p>Wow, I had no idea my crappy post ended on Hacker news! I just collected a bunch of info and links for myself :)
I guess I have to update the post with some pictures, at least...</p>
]]></description><pubDate>Sat, 13 Jun 2020 21:04:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=23512718</link><dc:creator>croqaz</dc:creator><comments>https://news.ycombinator.com/item?id=23512718</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=23512718</guid></item></channel></rss>