<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: atgctg</title><link>https://news.ycombinator.com/user?id=atgctg</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 05:58:43 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=atgctg" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[GPT-5.2]]></title><description><![CDATA[
<p><a href="https://platform.openai.com/docs/guides/latest-model" rel="nofollow">https://platform.openai.com/docs/guides/latest-model</a><p>System card: <a href="https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944f8d/oai_5_2_system-card.pdf" rel="nofollow">https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944...</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46234788">https://news.ycombinator.com/item?id=46234788</a></p>
<p>Points: 1195</p>
<p># Comments: 1083</p>
]]></description><pubDate>Thu, 11 Dec 2025 18:04:47 +0000</pubDate><link>https://openai.com/index/introducing-gpt-5-2/</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=46234788</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46234788</guid></item><item><title><![CDATA[Llama.cpp: Add GPT-OSS]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/ggml-org/llama.cpp/pull/15091">https://github.com/ggml-org/llama.cpp/pull/15091</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44799218">https://news.ycombinator.com/item?id=44799218</a></p>
<p>Points: 35</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 05 Aug 2025 15:23:58 +0000</pubDate><link>https://github.com/ggml-org/llama.cpp/pull/15091</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=44799218</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44799218</guid></item><item><title><![CDATA[New comment by atgctg in "Transformers Without Normalization"]]></title><description><![CDATA[
<p>The paper's Table 7 shows DyT reducing overall LLaMA 7B inference time by 7.8% and training time by 8.2%. That is not insignificant.</p>
]]></description><pubDate>Sat, 15 Mar 2025 10:40:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=43371584</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=43371584</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43371584</guid></item><item><title><![CDATA[New comment by atgctg in "An Interview with Daniel Gross and Nat Friedman About Models, Margins, and Moats"]]></title><description><![CDATA[
<p>You can get a free trial right now to Stratechery Plus through Asianometry:<p><a href="https://stratechery.passport.online/member/plan/4ycW4SE71Cy6ryrijywbTG?trial=ASIANOMETRY&price=YSr21tFbq4KY2tGJaA2KD3" rel="nofollow">https://stratechery.passport.online/member/plan/4ycW4SE71Cy6...</a><p>Source: <a href="https://substack.com/home/post/p-154928959" rel="nofollow">https://substack.com/home/post/p-154928959</a></p>
]]></description><pubDate>Thu, 23 Jan 2025 14:53:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42804606</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=42804606</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42804606</guid></item><item><title><![CDATA[I Love Calculator]]></title><description><![CDATA[
<p>Article URL: <a href="https://karpathy.ai/blog/calculator.html">https://karpathy.ai/blog/calculator.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41679255">https://news.ycombinator.com/item?id=41679255</a></p>
<p>Points: 6</p>
<p># Comments: 1</p>
]]></description><pubDate>Sat, 28 Sep 2024 10:12:38 +0000</pubDate><link>https://karpathy.ai/blog/calculator.html</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=41679255</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41679255</guid></item><item><title><![CDATA[New comment by atgctg in "Prompt Caching"]]></title><description><![CDATA[
<p>You have to store the KV cache, not the tokens. For Gemma 27B (probably slightly larger than Flash), this would be:<p><pre><code>  Size of KV cache = 2 * (num_layers) * (num_kv_heads * dim_head) * seq_length * precision

  8-bit Gemma 27B KV cache = 2 * (46) * (16 * 144) * 1e6 * 1 byte ≈ 200 GB
</code></pre>
Note that this doesn't take further optimizations into account that Google might be using.<p>Formula: <a href="https://developer.nvidia.com/blog/mastering-llm-techniques-inference-optimization/#llm_memory_requirement%C2%A0" rel="nofollow">https://developer.nvidia.com/blog/mastering-llm-techniques-i...</a><p>Gemma 27B config: <a href="https://huggingface.co/google/gemma-2-27b/blob/main/config.json" rel="nofollow">https://huggingface.co/google/gemma-2-27b/blob/main/config.j...</a></p>
]]></description><pubDate>Mon, 19 Aug 2024 07:01:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=41288514</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=41288514</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41288514</guid></item><item><title><![CDATA[Open source AI is the path forward]]></title><description><![CDATA[
<p>Article URL: <a href="https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/">https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41046773">https://news.ycombinator.com/item?id=41046773</a></p>
<p>Points: 2360</p>
<p># Comments: 887</p>
]]></description><pubDate>Tue, 23 Jul 2024 15:08:41 +0000</pubDate><link>https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=41046773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41046773</guid></item><item><title><![CDATA[The Perceptron Controversy]]></title><description><![CDATA[
<p>Article URL: <a href="https://yuxi-liu-wired.github.io/essays/posts/perceptron-controversy/">https://yuxi-liu-wired.github.io/essays/posts/perceptron-controversy/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40681806">https://news.ycombinator.com/item?id=40681806</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 14 Jun 2024 15:34:11 +0000</pubDate><link>https://yuxi-liu-wired.github.io/essays/posts/perceptron-controversy/</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=40681806</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40681806</guid></item><item><title><![CDATA[New comment by atgctg in "GitHub Profile with a Custom Background"]]></title><description><![CDATA[
<p>Works using math CSS injection [1]:<p><pre><code>    ```math
    \ce{$\unicode[goombafont; color:red; pointer-events: none; z-index: -10; position: fixed; top: 0; left: 0; height: 100vh; object-fit: cover; background-size: cover; width: 130vw; opacity: 0.5; background: url('https://github.com/cloud11665/cloud11665/assets/59028866/3b916a93-1632-49cd-bf65-14e666cd81c8');]{x0000}$}
</code></pre>
[1]: <a href="https://raw.githubusercontent.com/cloud11665/cloud11665/master/readme.md" rel="nofollow">https://raw.githubusercontent.com/cloud11665/cloud11665/mast...</a></p>
]]></description><pubDate>Fri, 07 Jun 2024 19:24:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=40612018</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=40612018</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40612018</guid></item><item><title><![CDATA[New comment by atgctg in "GPT-4o"]]></title><description><![CDATA[
<p>Tiktoken added support for GPT-4o: <a href="https://github.com/openai/tiktoken/commit/9d01e5670ff50eb74cdb96406c7f3d9add0ae2f8">https://github.com/openai/tiktoken/commit/9d01e5670ff50eb74c...</a><p>It has an increased vocab size of 200k.</p>
]]></description><pubDate>Mon, 13 May 2024 17:30:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=40345800</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=40345800</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40345800</guid></item><item><title><![CDATA[New comment by atgctg in "OpenAI: Model Spec"]]></title><description><![CDATA[
<p>Seems like they are working on adding that capability:<p>> We're exploring whether we can responsibly provide the ability to generate NSFW content in age-appropriate contexts through the API and ChatGPT.<p>Link to section: <a href="https://cdn.openai.com/spec/model-spec-2024-05-08.html#dont-respond-with-nsfw-content" rel="nofollow">https://cdn.openai.com/spec/model-spec-2024-05-08.html#dont-...</a></p>
]]></description><pubDate>Wed, 08 May 2024 21:14:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=40302777</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=40302777</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40302777</guid></item><item><title><![CDATA[Stripe re-enters crypto with stablecoin payments]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/BackseatVC/status/1783544518320742774">https://twitter.com/BackseatVC/status/1783544518320742774</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40160572">https://news.ycombinator.com/item?id=40160572</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 25 Apr 2024 17:32:52 +0000</pubDate><link>https://twitter.com/BackseatVC/status/1783544518320742774</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=40160572</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40160572</guid></item><item><title><![CDATA[Poor Charlie's Almanack (Berkshire Mode)]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.stripe.press/poor-charlies-almanack/book">https://www.stripe.press/poor-charlies-almanack/book</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38541282">https://news.ycombinator.com/item?id=38541282</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 06 Dec 2023 07:22:23 +0000</pubDate><link>https://www.stripe.press/poor-charlies-almanack/book</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=38541282</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38541282</guid></item><item><title><![CDATA[New comment by atgctg in "LLM Visualization"]]></title><description><![CDATA[
<p>A lot of transformer explanations fail to mention what makes self attention so powerful.<p>Unlike traditional neural networks with fixed weights, self-attention layers adaptively weight connections between inputs based on context. This allows transformers to accomplish in a single layer what would take traditional networks multiple layers.</p>
]]></description><pubDate>Sun, 03 Dec 2023 18:34:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=38509501</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=38509501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38509501</guid></item><item><title><![CDATA[GPT-4 powers Copilot Chat]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.blog/2023-11-08-universe-2023-copilot-transforms-github-into-the-ai-powered-developer-platform/">https://github.blog/2023-11-08-universe-2023-copilot-transforms-github-into-the-ai-powered-developer-platform/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38193553">https://news.ycombinator.com/item?id=38193553</a></p>
<p>Points: 82</p>
<p># Comments: 19</p>
]]></description><pubDate>Wed, 08 Nov 2023 17:15:06 +0000</pubDate><link>https://github.blog/2023-11-08-universe-2023-copilot-transforms-github-into-the-ai-powered-developer-platform/</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=38193553</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38193553</guid></item><item><title><![CDATA[New comment by atgctg in "Show HN: Fully client-side GPT2 prediction visualizer"]]></title><description><![CDATA[
<p>It would be interesting to have attention visualized as well, similar to how it's done in BertViz:<p><a href="https://github.com/jessevig/bertviz">https://github.com/jessevig/bertviz</a></p>
]]></description><pubDate>Wed, 06 Sep 2023 05:52:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=37401721</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=37401721</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37401721</guid></item><item><title><![CDATA[New comment by atgctg in "WebGPU – All of the cores, none of the canvas"]]></title><description><![CDATA[
<p>As an example, INT8 support in WebGPU would enable running quantized models, allowing larger LLMs to run locally in the browser.<p>See Limitations section here: <a href="https://fleetwood.dev/posts/running-llms-in-the-browser" rel="nofollow noreferrer">https://fleetwood.dev/posts/running-llms-in-the-browser</a></p>
]]></description><pubDate>Mon, 17 Jul 2023 10:44:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=36756351</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=36756351</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36756351</guid></item><item><title><![CDATA[New comment by atgctg in "Apple's game porting toolkit is fantastic. Cyberpunk 2077 at Ultra on an M1 MBP"]]></title><description><![CDATA[
<p>Running at ~40 fps on M2 Max:<p><a href="https://www.reddit.com/r/macgaming/comments/1435ukq/cyberpunk_on_m2_max_wgame_porting_toolkit/" rel="nofollow">https://www.reddit.com/r/macgaming/comments/1435ukq/cyberpun...</a></p>
]]></description><pubDate>Wed, 07 Jun 2023 08:24:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=36224330</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=36224330</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36224330</guid></item><item><title><![CDATA[New comment by atgctg in "Ask HN: What is an A.I. chip and how does it work?"]]></title><description><![CDATA[
<p>That video is about analog computers</p>
]]></description><pubDate>Sat, 27 May 2023 09:02:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=36093155</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=36093155</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36093155</guid></item><item><title><![CDATA[Windows and the AI Platform Shift]]></title><description><![CDATA[
<p>Article URL: <a href="https://stratechery.com/2023/windows-and-the-ai-platform-shift/">https://stratechery.com/2023/windows-and-the-ai-platform-shift/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=36058415">https://news.ycombinator.com/item?id=36058415</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 24 May 2023 14:16:43 +0000</pubDate><link>https://stratechery.com/2023/windows-and-the-ai-platform-shift/</link><dc:creator>atgctg</dc:creator><comments>https://news.ycombinator.com/item?id=36058415</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36058415</guid></item></channel></rss>