<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ahzhou</title><link>https://news.ycombinator.com/user?id=ahzhou</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 10 Apr 2026 08:31:41 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ahzhou" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ahzhou in "Building a TUI is easy now"]]></title><description><![CDATA[
<p>Fast as in time-to-market</p>
]]></description><pubDate>Sat, 14 Feb 2026 15:51:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47015422</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=47015422</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47015422</guid></item><item><title><![CDATA[New comment by ahzhou in "A startup doesn't need to be a unicorn"]]></title><description><![CDATA[
<p>VC vs. bootstrapping usually comes down to the company's TAM. There are certainly high-growth bootstrapped businesses.</p>
]]></description><pubDate>Tue, 08 Apr 2025 16:52:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=43623852</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=43623852</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43623852</guid></item><item><title><![CDATA[New comment by ahzhou in "A startup doesn't need to be a unicorn"]]></title><description><![CDATA[
<p>Not common in Silicon Valley, but much more common in the rest of the country.
 There’s an archetype for bootstrapped tech businesses:
    - highly vertical specific
    - couple hundred million TAM
    - founder started the business in their 30s and is now in their 40s</p>
]]></description><pubDate>Tue, 08 Apr 2025 16:50:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=43623833</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=43623833</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43623833</guid></item><item><title><![CDATA[New comment by ahzhou in "DeepSeek's multi-head latent attention and other KV cache tricks"]]></title><description><![CDATA[
<p>It’s a tensor of previously computed attention keys and values, stored in GPU memory to improve inference throughput. Check out the PagedAttention paper (which introduces vLLM) for how most systems implement it nowadays.</p>
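<p>A minimal sketch of the idea, assuming a single attention head and plain Python lists instead of GPU tensors. The names <code>attend</code> and <code>KVCache</code> are hypothetical, and this does not model PagedAttention's block-based memory management; it only shows that keys and values from earlier tokens are stored and reused rather than recomputed.</p>

```python
import math


def attend(q, keys, values):
    """Single-head scaled dot-product attention for one query vector."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(dim)]


class KVCache:
    """Hypothetical toy cache: keeps the key and value of every token seen."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, q, k, v):
        # Only the new token's key/value are appended; earlier ones are reused.
        self.keys.append(k)
        self.values.append(v)
        return attend(q, self.keys, self.values)


# Assumption for brevity: identity projections, so q = k = v = token embedding.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
cache = KVCache()
cached_outputs = [cache.step(t, t, t) for t in tokens]

# Recomputing attention from scratch for the last position gives the same
# result; the cache just avoids redoing the work for earlier tokens.
recomputed = attend(tokens[-1], tokens, tokens)
```

<p>In a real model the keys and values come from learned projections at every layer and head, which is why the cache grows with sequence length and ends up dominating GPU memory.</p>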
]]></description><pubDate>Wed, 29 Jan 2025 03:53:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=42861324</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=42861324</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42861324</guid></item><item><title><![CDATA[New comment by ahzhou in "Meta scrambling 'war rooms' of engineers to figure out DeepSeek's AI"]]></title><description><![CDATA[
<p>They slightly restructure their MoE [1], but I think the main difference is that other big models (e.g. Llama 405B) are dense and have higher FLOP requirements. MoE should represent a ~5x improvement. FP8 should be a ~2x improvement.<p>We don’t know how much of a speed improvement GRPO represents. They didn’t say how many GPU hours went into RLing DeepSeek-R1, and we don’t have o1 numbers to compare against.<p>There’s definitely lots of misinformation spreading though. The $5.5m number refers to DeepSeek-V3, not DeepSeek-R1. I don't want to take away from High-Flyer's accomplishment, though. I think a lot of these innovations were forced by the need to work around H800 networking limitations, and it's impressive what they've done.<p>[1] <a href="https://arxiv.org/abs/2401.06066" rel="nofollow">https://arxiv.org/abs/2401.06066</a></p>
]]></description><pubDate>Tue, 28 Jan 2025 19:35:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=42856869</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=42856869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42856869</guid></item><item><title><![CDATA[New comment by ahzhou in "DeepSeek could represent Nvidia CEO Jensen Huang's worst nightmare"]]></title><description><![CDATA[
<p>You can easily do a Fermi estimate based on the information given. They are comparing GPU hours.<p>See: <a href="https://planetbanatt.net/articles/v3fermi.html" rel="nofollow">https://planetbanatt.net/articles/v3fermi.html</a></p>
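<p>A rough sketch of such an estimate, not the linked article's method. The parameter and token counts are the figures reported for DeepSeek-V3 (about 37B activated parameters, 14.8T training tokens); the 6-FLOPs-per-parameter-per-token rule of thumb, the per-GPU peak throughput, the utilization, and the hourly rental rate are all assumptions chosen for illustration.</p>

```python
def training_flops(active_params, tokens):
    """Rule of thumb: about 6 FLOPs per active parameter per training token."""
    return 6 * active_params * tokens


def gpu_hours(total_flops, peak_flops_per_gpu, utilization):
    """Convert total FLOPs to GPU hours at an assumed sustained throughput."""
    seconds = total_flops / (peak_flops_per_gpu * utilization)
    return seconds / 3600


ACTIVE_PARAMS = 37e9    # activated parameters per token (reported for V3)
TOKENS = 14.8e12        # training tokens (reported for V3)
ASSUMED_PEAK = 1e15     # assumption: round-number peak FLOP/s per GPU
ASSUMED_UTIL = 0.3      # assumption: fraction of peak actually sustained
ASSUMED_RATE = 2.0      # assumption: dollars per GPU hour

flops = training_flops(ACTIVE_PARAMS, TOKENS)
hours = gpu_hours(flops, ASSUMED_PEAK, ASSUMED_UTIL)
cost = hours * ASSUMED_RATE

print(f"{flops:.2e} FLOPs, {hours:.2e} GPU hours, ${cost:.2e}")
```

<p>With these inputs the estimate lands in the low millions of GPU hours. Changing the utilization or peak-throughput assumption moves the answer proportionally, which is the point of a Fermi estimate: checking the order of magnitude, not the exact figure.</p>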
]]></description><pubDate>Tue, 28 Jan 2025 14:12:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=42852474</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=42852474</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42852474</guid></item><item><title><![CDATA[New comment by ahzhou in "Meta scrambling 'war rooms' of engineers to figure out DeepSeek's AI"]]></title><description><![CDATA[
<p>I might be missing something, but DeepSeek’s recipe is right there in plain sight. Most of the cost efficiency of DeepSeek V3 seems to be attributable to MoE and FP8 training. DeepSeek R1’s improvements are from GRPO-based RL.<p>Interesting to note - we have no idea how much R1 cost to train.
 To speculate - maybe DeepSeek’s release made an upcoming Llama release moot in comparison.</p>
]]></description><pubDate>Tue, 28 Jan 2025 13:22:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=42852001</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=42852001</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42852001</guid></item><item><title><![CDATA[New comment by ahzhou in "Coping with dumb LLMs using classic ML"]]></title><description><![CDATA[
<p>LLMs are inherently bad at this due to tokenization, scaling, and lack of training on the task. Anthropic’s computer use feature relies on a model trained specifically to count pixels:
 > Training Claude to count pixels accurately was critical. Without this skill, the model finds it difficult to give mouse commands. [1]
For a VLM trained on identifying bounding boxes, check out PaliGemma [2].<p>You may also be able to get the computer use API to draw bounding boxes if the costs make sense.<p>That said, I think the correct solution is likely to use a non-VLM model to draw bounding boxes. Depends on the dataset and problem.<p>[1] <a href="https://www.anthropic.com/news/developing-computer-use" rel="nofollow">https://www.anthropic.com/news/developing-computer-use</a>
[2] <a href="https://huggingface.co/blog/paligemma" rel="nofollow">https://huggingface.co/blog/paligemma</a></p>
]]></description><pubDate>Fri, 24 Jan 2025 15:03:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=42813653</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=42813653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42813653</guid></item><item><title><![CDATA[New comment by ahzhou in "Were RNNs all we needed?"]]></title><description><![CDATA[
<p>Author: @fandzomga
Username: fsndz<p>Why try to funnel us to your paywalled article?</p>
]]></description><pubDate>Thu, 03 Oct 2024 20:14:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=41734494</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=41734494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41734494</guid></item><item><title><![CDATA[New comment by ahzhou in "Mako – fast, production-grade web bundler based on Rust"]]></title><description><![CDATA[
<p>Conditionally yes. There are many libraries that cannot be tree shaken for various reasons. Libraries typically need to stick to a subset of full JS to ensure that the code can be statically analyzed.</p>
]]></description><pubDate>Tue, 02 Jul 2024 14:36:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=40857125</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40857125</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40857125</guid></item><item><title><![CDATA[New comment by ahzhou in "Why we no longer use LangChain for building our AI agents"]]></title><description><![CDATA[
<p>GraphQL is very powerful when combined with Relay. It’s useless extra bloat if you just use it like REST.<p>The difference between the two technologies is that LangChain was developed and funded before anyone knew what to do with LLMs, while GraphQL was internal tooling used to solve a real problem at Meta.<p>In a lot of ways, LangChain is a poor abstraction because the layer it’s abstracting was (and still is) in its infancy.</p>
]]></description><pubDate>Fri, 21 Jun 2024 00:41:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=40745039</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40745039</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40745039</guid></item><item><title><![CDATA[New comment by ahzhou in "Hate Chatbots? You Aren't the Only One"]]></title><description><![CDATA[
<p>To be fair, LLM-based chatbots are much better about this because you don't need to discover the magic incantation to talk to a human. It's a trade-off because that same property introduces the possibility of hallucination.</p>
]]></description><pubDate>Tue, 28 May 2024 21:58:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=40506055</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40506055</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40506055</guid></item><item><title><![CDATA[New comment by ahzhou in "Hate Chatbots? You Aren't the Only One"]]></title><description><![CDATA[
<p>It depends on the business, but the kind of metrics you are talking about are measured and taken seriously. People have absolutely gotten fired for CS quality KPI drops.</p>
]]></description><pubDate>Tue, 28 May 2024 21:54:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=40506024</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40506024</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40506024</guid></item><item><title><![CDATA[New comment by ahzhou in "Hate Chatbots? You Aren't the Only One"]]></title><description><![CDATA[
<p>While it may not happen for you, “too lazy to look it up” is the vast majority of CS requests.<p>My understanding from talking to a couple of CS execs is that these have been a slam dunk in terms of ROI because CS agents don’t need to handle type C requests. I expect we’ll only see more as time goes on.</p>
]]></description><pubDate>Tue, 28 May 2024 16:05:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=40502170</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40502170</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40502170</guid></item><item><title><![CDATA[New comment by ahzhou in "Big Tech to EU: "Drop Dead""]]></title><description><![CDATA[
<p>> AppStore would be dead on arrival<p>Certainly not. PMF was already established via the jailbreaking scene and Installer.app / Cydia. Millions of people went through the annoying process of jailbreaking their phones to get apps.</p>
]]></description><pubDate>Sun, 19 May 2024 13:07:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=40406699</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40406699</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40406699</guid></item><item><title><![CDATA[New comment by ahzhou in "Financial market applications of LLMs"]]></title><description><![CDATA[
<p>If you’re saying that economics is a foundational driver of progress, then yes - almost by definition.<p>Banks and investors provide liquidity to the system, which is just one of many things the market demands.</p>
]]></description><pubDate>Sun, 21 Apr 2024 14:59:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=40106260</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=40106260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40106260</guid></item><item><title><![CDATA[New comment by ahzhou in "But what is a GPT?  Visual intro to Transformers [video]"]]></title><description><![CDATA[
<p>Yes, this is a fundamental weakness of LLMs. Unfortunately this is likely unsolvable because the search space is exponential. Techniques like beam search help, but can only introduce a constant scaling factor.<p>That said, LLMs reach their current performance despite this limitation.</p>
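<p>A toy illustration of that trade-off, assuming a hypothetical two-step "model" (<code>toy_model</code>) with hand-picked probabilities rather than a real LLM. With a beam width of 1 the search is greedy and commits to the locally best token; a wider beam recovers a better overall sequence, but the work per step only grows linearly with the beam width while the number of possible sequences grows exponentially with length.</p>

```python
import math


def beam_search(next_log_probs, start, steps, beam_width):
    """Keep only the `beam_width` best partial sequences at every step."""
    beams = [(0.0, [start])]
    for _ in range(steps):
        candidates = []
        for score, seq in beams:
            for token, logp in next_log_probs(seq).items():
                candidates.append((score + logp, seq + [token]))
        # Rank by total log-probability and prune everything past the beam.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = candidates[:beam_width]
    return beams


def toy_model(seq):
    # Hypothetical stand-in for an LLM: "a" looks best at first but leads to
    # a flat distribution, while "b" leads to a confident continuation.
    last = seq[-1]
    if last == "start":
        return {"a": math.log(0.6), "b": math.log(0.4)}
    if last == "a":
        return {"x": math.log(0.5), "y": math.log(0.5)}
    return {"x": math.log(0.9), "y": math.log(0.1)}


greedy = beam_search(toy_model, "start", steps=2, beam_width=1)[0]
wider = beam_search(toy_model, "start", steps=2, beam_width=2)[0]

print("greedy:", greedy[1], math.exp(greedy[0]))
print("beam=2:", wider[1], math.exp(wider[0]))
```

<p>Here the wider beam finds the sequence through "b" that greedy decoding prunes away. With a vocabulary of V tokens and length T there are V^T sequences, but a beam of width k scores at most k·V candidates per step, so it can still miss the best one.</p>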
]]></description><pubDate>Tue, 02 Apr 2024 03:11:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=39902020</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=39902020</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39902020</guid></item><item><title><![CDATA[New comment by ahzhou in "Grats: A More Pleasant Way to Build TypeScript GraphQL Servers"]]></title><description><![CDATA[
<p>They fall under a few buckets:<p>Driver: node-postgres, node-mysql2<p>Query builder / other thin clients: knex, kysely, slonik<p>ORM: TypeORM, MikroORM, Objection.js, Drizzle ORM, Prisma (actually runs a separate binary)</p>
]]></description><pubDate>Fri, 08 Mar 2024 06:08:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=39638249</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=39638249</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39638249</guid></item><item><title><![CDATA[New comment by ahzhou in "The creator economy can't rely on Patreon"]]></title><description><![CDATA[
<p>It’s a two-sided marketplace and companies only care about the conversion they get from different channels. If demand dries up, it will be reflected in more attractive pricing - I don’t think it’s likely that the entire market pulls out.<p>FWIW - it seems like the campaigns are working. You seem to be familiar with the brands and someone below chimed in on how one particular brand is great. Multiply that by the viewership - that’s definitely a win.<p>Some quick (unverified) research tells me that YouTuber marketing pays somewhere in the range of 30-70 CPM. You can pretty easily compare that against Google AdWords with reasonable conversion assumptions to decide if it’s worth it.</p>
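<p>A back-of-the-envelope version of that comparison. Only the 30-70 CPM range comes from the comment above; the conversion rates and the cost per click are made-up placeholders to be replaced with real campaign data, and the function names are hypothetical.</p>

```python
def cpa_from_cpm(cpm, conversions_per_view):
    """Cost per acquisition when paying `cpm` dollars per 1,000 views."""
    cost_per_view = cpm / 1000
    return cost_per_view / conversions_per_view


def cpa_from_cpc(cpc, conversions_per_click):
    """Cost per acquisition when paying `cpc` dollars per click."""
    return cpc / conversions_per_click


# Assumptions for illustration only; none of these come from real data.
ASSUMED_VIEW_CONVERSION = 0.001   # 0.1% of video viewers become customers
ASSUMED_CPC = 2.0                 # dollars per search-ad click
ASSUMED_CLICK_CONVERSION = 0.03   # 3% of ad clicks become customers

sponsorship_low = cpa_from_cpm(30, ASSUMED_VIEW_CONVERSION)
sponsorship_high = cpa_from_cpm(70, ASSUMED_VIEW_CONVERSION)
search_ads = cpa_from_cpc(ASSUMED_CPC, ASSUMED_CLICK_CONVERSION)

print(f"sponsorship CPA: ${sponsorship_low:.0f} to ${sponsorship_high:.0f}")
print(f"search ads CPA:  ${search_ads:.0f}")
```

<p>Whichever channel has the lower cost per acquisition under your own conversion numbers is the better buy; the result is very sensitive to the view-to-customer rate, which is the hardest input to measure.</p>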
]]></description><pubDate>Tue, 27 Feb 2024 14:22:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=39524427</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=39524427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39524427</guid></item><item><title><![CDATA[New comment by ahzhou in "Panda CSS: build time and type-safe CSS-in-JS"]]></title><description><![CDATA[
<p>CSS Modules would be great, except the tooling in VS Code is bad. Autocomplete through TypeScript is the killer feature of Panda / vanilla-extract, not the styling itself.</p>
]]></description><pubDate>Tue, 06 Feb 2024 14:45:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=39274721</link><dc:creator>ahzhou</dc:creator><comments>https://news.ycombinator.com/item?id=39274721</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39274721</guid></item></channel></rss>