<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: megadragon9</title><link>https://news.ycombinator.com/user?id=megadragon9</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 08 Apr 2026 12:37:23 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=megadragon9" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[How a Deep Learning Library Enables Learning]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/">https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47406602">https://news.ycombinator.com/item?id=47406602</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 16 Mar 2026 23:40:59 +0000</pubDate><link>https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=47406602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47406602</guid></item><item><title><![CDATA[How A Deep Learning Library Enables Learning]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/">https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47389887">https://news.ycombinator.com/item?id=47389887</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 15 Mar 2026 17:52:50 +0000</pubDate><link>https://www.henrypan.com/blog/2026-03-14-how-deep-learning-library-enables-learning/</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=47389887</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47389887</guid></item><item><title><![CDATA[New comment by megadragon9 in "Claude March 2026 usage promotion"]]></title><description><![CDATA[
<p>Interesting to see more demand shaping mechanisms applied to LLM inference. Even though the "batch processing" feature is already available. I guess this "promotion" is to test the hypothesis of sliding along the spectrum towards more "real-time" demand shaping.</p>
]]></description><pubDate>Sat, 14 Mar 2026 22:42:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47382083</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=47382083</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47382083</guid></item><item><title><![CDATA[New comment by megadragon9 in "Build a Deep Learning Library"]]></title><description><![CDATA[
<p>Thanks for sharing! It's inspiring to see more people "reinventing for insight" in the age of AI. This reminds me of my similar previous project a year ago when I built an entire PyTorch-style machine learning library [1] from scratch, using nothing but Python and NumPy. I started with a tiny autograd engine, then gradually created layer modules, optimizers, data loaders etc... I simply wanted to learn machine learning from first principles. Along the way I attempted to reproduce classical convnets [2] all the way to a toy GPT-2 [3] using the library I built. It definitely helped me understand how machine learning worked underneath the hood without all the fancy abstractions that PyTorch/TensorFlow provides. I eventually wrote a blog post [4] of this journey.<p>[1] <a href="https://github.com/workofart/ml-by-hand" rel="nofollow">https://github.com/workofart/ml-by-hand</a><p>[2] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/cifar.py" rel="nofollow">https://github.com/workofart/ml-by-hand/blob/main/examples/c...</a><p>[3] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/gpt-2.py" rel="nofollow">https://github.com/workofart/ml-by-hand/blob/main/examples/g...</a><p>[4] <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/" rel="nofollow">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a></p>
]]></description><pubDate>Fri, 02 Jan 2026 01:05:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=46460189</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=46460189</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46460189</guid></item><item><title><![CDATA[New comment by megadragon9 in "NanoChat – The best ChatGPT that $100 can buy"]]></title><description><![CDATA[
<p>Love the educational value of this "nano-sized" project. This reminded me of the from-scratch project I created to learn about deep learning libraries, neural networks all the way to LLMs like GPT-2 using just Numpy and Python [1]. Learning is done by "re-inventing the wheel" yourself, one step at a time :)<p>[1] <a href="https://github.com/workofart/ml-by-hand" rel="nofollow">https://github.com/workofart/ml-by-hand</a></p>
]]></description><pubDate>Fri, 17 Oct 2025 05:12:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=45613494</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=45613494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45613494</guid></item><item><title><![CDATA[New comment by megadragon9 in "Claude Code Pro Limit? Hack It While You Sleep"]]></title><description><![CDATA[
<p>Reminds me of this HN discussion (Writing Code Was Never the Bottleneck): <a href="https://news.ycombinator.com/item?id=44429789">https://news.ycombinator.com/item?id=44429789</a></p>
]]></description><pubDate>Sun, 06 Jul 2025 19:19:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=44483234</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=44483234</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44483234</guid></item><item><title><![CDATA[New comment by megadragon9 in "Reinvent the Wheel"]]></title><description><![CDATA[
<p>Thanks for this inspiring essay, I couldn’t agree more that “reinventing for insight” is one of the best ways to learn. I had a similar experience couple months ago when I built an entire PyTorch-style machine learning library [1] from scratch, using nothing but Python and NumPy. I started with a tiny autograd engine, then gradually created layer modules, optimizers, data loaders etc... I simply wanted to learn machine learning from first principles. Along the way I attempted to reproduce classical convnets [2] all the way to a toy GPT-2 [3] using the library I built. It definitely helped me understand how machine learning worked underneath the hood without all the fancy abstractions that PyTorch/TensorFlow provides. Kinda like reinventing the car using the wheel I reinvented :)<p>[1] <a href="https://github.com/workofart/ml-by-hand">https://github.com/workofart/ml-by-hand</a><p>[2] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/cnn.py">https://github.com/workofart/ml-by-hand/blob/main/examples/c...</a><p>[3] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/gpt2.py">https://github.com/workofart/ml-by-hand/blob/main/examples/g...</a></p>
]]></description><pubDate>Sat, 24 May 2025 20:43:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=44083698</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=44083698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44083698</guid></item><item><title><![CDATA[New comment by megadragon9 in "The Llama 4 herd"]]></title><description><![CDATA[
<p>The blog post is quite informative: <a href="https://ai.meta.com/blog/llama-4-multimodal-intelligence/" rel="nofollow">https://ai.meta.com/blog/llama-4-multimodal-intelligence/</a></p>
]]></description><pubDate>Sat, 05 Apr 2025 18:52:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=43595832</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43595832</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43595832</guid></item><item><title><![CDATA[New comment by megadragon9 in "Ask HN: What are you working on? (March 2025)"]]></title><description><![CDATA[
<p>I built a machine learning library [1] (similar to PyTorch's API) entirely from scratch using only Python and NumPy. It was inspired by Andrej Karpathy's Micrograd project [2]. I slowly added more functionality and evolved it into a fully functional ML library that can build and train classical CNNs [3] to even a toy GPT-2 [4].<p>I wanted to understand how models learn, like literally bridging the gap between mathematical formulas and high-level API calls. I feel like, as a beginner in machine learning, it's important to strip away the abstractions and understand how these libraries work from the ground up before leveraging these "high-level" libraries such as PyTorch and Tensorflow. Oh I also wrote a blog post [5] on the journey.<p>[1] <a href="https://github.com/workofart/ml-by-hand" rel="nofollow">https://github.com/workofart/ml-by-hand</a><p>[2] <a href="https://github.com/karpathy/micrograd" rel="nofollow">https://github.com/karpathy/micrograd</a><p>[3] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/cifar.py" rel="nofollow">https://github.com/workofart/ml-by-hand/blob/main/examples/c...</a><p>[4] <a href="https://github.com/workofart/ml-by-hand/blob/main/examples/gpt-2.py" rel="nofollow">https://github.com/workofart/ml-by-hand/blob/main/examples/g...</a><p>[5] <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/" rel="nofollow">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a></p>
]]></description><pubDate>Sun, 30 Mar 2025 21:33:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=43527869</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43527869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43527869</guid></item><item><title><![CDATA[New comment by megadragon9 in "Write to Escape Your Default Setting"]]></title><description><![CDATA[
<p>> Writing reveals what you don’t know, what you can’t see when an idea is only held in your head. Biases, blind spots, and assumptions you can’t grasp internally.<p>I completely agree with this. Often, I think I understand something, but when I try to explain it to others, I quickly realize where my understanding is shaky. The gaps become even more apparent when I attempt to write it down because I have to structure my thoughts logically and precisely. Writing goes a step beyond speaking because it forces me to re-read and refine my ideas, whereas spoken words often disappear without deeper reflection. Oh, even this comment that I'm writing now was edited a couple of times before submitting it. The second half of the comment was added after re-reading the first half.</p>
]]></description><pubDate>Fri, 28 Feb 2025 22:06:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=43212155</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43212155</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43212155</guid></item><item><title><![CDATA[Understanding ML: Creating a PyTorch-Inspired Deep Learning Library from Scratch]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43141751">https://news.ycombinator.com/item?id=43141751</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 22 Feb 2025 18:42:41 +0000</pubDate><link>https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43141751</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43141751</guid></item><item><title><![CDATA[New comment by megadragon9 in "Show HN: Transform your codebase into a single Markdown doc for feeding into AI"]]></title><description><![CDATA[
<p>I would say the demand for this kind of tool definitely exists. Good work!
From a rough glance it looks pretty similar to another tool that I've been using <a href="https://github.com/mufeedvh/code2prompt">https://github.com/mufeedvh/code2prompt</a></p>
]]></description><pubDate>Fri, 14 Feb 2025 18:25:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=43051426</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43051426</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43051426</guid></item><item><title><![CDATA[Show HN: ML Library created by Python+NumPy (trains CNNs to a toy GPT-2)]]></title><description><![CDATA[
<p>Hi HN,
I built a machine learning library entirely from scratch using only Python and NumPy. I then used it to train a range of models—from classical CNNs, ResNets, RNNs, and LSTMs to modern Transformers and even a toy GPT-2. The motivation came from my curiosity about how to build deep learning models from scratch, like literally from mathematical formulas. I built this project not to replace production-ready libraries like PyTorch or TensorFlow, but to strip away the abstractions and reveal the underlying mathematics of machine learning.<p>Key points:<p>- Everything is derived in code — no opaque black boxes.<p>- API mirrors PyTorch so you can pick it up quickly.<p>- You can train CNNs, RNNs, Transformers, and even GPT models.<p>- Designed more for learning/debugging than raw performance.<p>What’s different here?<p>While there are many powerful ML libraries available (TensorFlow, PyTorch, Scikit-learn, etc.), they often hide the underlying math behind layers of abstraction. I believe that to truly master these tools, you first need to understand how they work from the ground up. This project explicitly derives all the mathematical and calculus operations in the code, making it a hands-on resource for deepening the understanding of neural networks and library building :)<p>Check it out:<p>- Github Repository: <a href="https://github.com/workofart/ml-by-hand">https://github.com/workofart/ml-by-hand</a><p>- API Documentation: <a href="https://ml-by-hand.readthedocs.io/en/latest/" rel="nofollow">https://ml-by-hand.readthedocs.io/en/latest/</a><p>- Examples: Explore models like GPT-2, CNNs, Transformers, and LSTMs in the examples/ folder: <a href="https://github.com/workofart/ml-by-hand/tree/main/examples">https://github.com/workofart/ml-by-hand/tree/main/examples</a><p>- Blog Post: Read about the project’s motivation, design, and challenges at <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/" rel="nofollow">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a><p>I’d love to hear any thoughts, questions, or suggestions — thanks for checking it out!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43020982">https://news.ycombinator.com/item?id=43020982</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 12 Feb 2025 02:07:31 +0000</pubDate><link>https://github.com/workofart/ml-by-hand</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=43020982</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43020982</guid></item><item><title><![CDATA[I Built a Deep Learning Library from Scratch Using Only Python, NumPy and Math]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42988285">https://news.ycombinator.com/item?id=42988285</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 09 Feb 2025 03:48:09 +0000</pubDate><link>https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=42988285</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42988285</guid></item><item><title><![CDATA[Show HN: From-Scratch ML Library (trains models from CNNs to a toy GPT-2)]]></title><description><![CDATA[
<p>Hi HN,<p>I built a machine learning library entirely from scratch using only Python and NumPy. I then used it to train a range of models—from classical CNNs, ResNets, RNNs, and LSTMs to modern Transformers and even a toy GPT-2. The motivation came from my curiosity about how to build deep learning models from scratch, like literally from mathematical formulas. I built this project not to replace production-ready libraries like PyTorch or TensorFlow, but to strip away the abstractions and reveal the underlying mathematics of machine learning.<p>Key points:<p>- Everything is derived in code — no opaque black boxes.<p>- API mirrors PyTorch so you can pick it up quickly.<p>- You can train CNNs, RNNs, Transformers, and even GPT models.<p>- Designed more for learning/debugging than raw performance.<p>What’s different here?<p>While there are many powerful ML libraries available (TensorFlow, PyTorch, Scikit-learn, etc.), they often hide the underlying math behind layers of abstraction. I believe that to truly master these tools, you first need to understand how they work from the ground up. This project explicitly derives all the mathematical and calculus operations in the code, making it a hands-on resource for deepening the understanding of neural networks and library building :)<p>Check it out:<p>- Github Repository: <a href="https://github.com/workofart/ml-by-hand">https://github.com/workofart/ml-by-hand</a><p>- API Documentation: <a href="https://ml-by-hand.readthedocs.io/en/latest/" rel="nofollow">https://ml-by-hand.readthedocs.io/en/latest/</a><p>- Examples: Explore models like GPT-2, CNNs, Transformers, and LSTMs in the examples/ folder: <a href="https://github.com/workofart/ml-by-hand/tree/main/examples">https://github.com/workofart/ml-by-hand/tree/main/examples</a><p>- Blog Post: Read about the project’s motivation, design, and challenges at <a href="https://www.henrypan.com/blog/2025-02-06-ml-by-hand/" rel="nofollow">https://www.henrypan.com/blog/2025-02-06-ml-by-hand/</a><p>I’d love to hear any thoughts, questions, or suggestions — thanks for checking it out!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42986598">https://news.ycombinator.com/item?id=42986598</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 08 Feb 2025 22:05:59 +0000</pubDate><link>https://github.com/workofart/ml-by-hand</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=42986598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42986598</guid></item><item><title><![CDATA[New comment by megadragon9 in "Ichi.city"]]></title><description><![CDATA[
<p>uploaded 00's geocities
<a href="https://geocity.ichi.city/" rel="nofollow">https://geocity.ichi.city/</a></p>
]]></description><pubDate>Tue, 09 Nov 2021 00:53:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=29156366</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=29156366</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29156366</guid></item><item><title><![CDATA[Chinese scientist claims world's first gene-edited babies]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.cnn.com/2018/11/26/health/china-crispr-gene-editing-twin-babies-first-intl">https://www.cnn.com/2018/11/26/health/china-crispr-gene-editing-twin-babies-first-intl</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=18535360">https://news.ycombinator.com/item?id=18535360</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 26 Nov 2018 17:59:50 +0000</pubDate><link>https://www.cnn.com/2018/11/26/health/china-crispr-gene-editing-twin-babies-first-intl</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=18535360</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=18535360</guid></item><item><title><![CDATA[New comment by megadragon9 in "Show HN: CoinMarketBook – CoinMarketCap, but with a different metric"]]></title><description><![CDATA[
<p>What does the "Tell Friends" column mean in the table? Was it supposed to be a metric or just a placeholder?</p>
]]></description><pubDate>Sun, 07 Oct 2018 16:09:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=18161244</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=18161244</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=18161244</guid></item><item><title><![CDATA[Do functions that Siri trigger exist in the OS?]]></title><description><![CDATA[
<p>I've tried using some "special" functions such as, turn off all alarms etc... And these functions are not available to users from the UI. My question is, does the OS have these functions programmed or are they programmed on-the-fly? Or maybe they're not functions at all.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=10715359">https://news.ycombinator.com/item?id=10715359</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 11 Dec 2015 03:58:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=10715359</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=10715359</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=10715359</guid></item><item><title><![CDATA[Switching from software development in banks to product-oriented tech firms]]></title><description><![CDATA[
<p>I'm a upcoming new grad, and I'm debating whether accept an offer as a software developer position in a bank or still apply for product-focused tech firms. Eventually, I would want to move towards product-focused tech firms, or start my own down the road.<p>Anything is appreciated, thanks!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=10556812">https://news.ycombinator.com/item?id=10556812</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 12 Nov 2015 23:01:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=10556812</link><dc:creator>megadragon9</dc:creator><comments>https://news.ycombinator.com/item?id=10556812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=10556812</guid></item></channel></rss>