<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: HammadB</title><link>https://news.ycombinator.com/user?id=HammadB</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 03 May 2026 19:23:47 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=HammadB" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Show HN: Package Search MCP – enable agents to search dependency source code]]></title><description><![CDATA[
<p>Hi all - Hammad here, CTO of Chroma.<p>We’ve been working closely with teams building AI systems for the software engineering stack - autocomplete bots, PR review agents, and code assistants. One common pain point we see: these systems hallucinate about dependencies.<p>While most companies index the primary codebase with grep, semantic search, and AST-based tools, dependencies are often overlooked. Even though they typically make up the bulk of running code. Maintaining source ingestion into agents is often difficult: fetching tarballs, caching, and avoiding latency spikes.<p>So we built Package Search MCP. It ingests and indexes public dependencies from NPM, PyPI, Go, and Crates.io at distinct versions. We:<p>1. Use Tree-sitter to parse and chunk the code<p>2. Embed the data with Qwen3-Embedding<p>3. Index each version into its own fork on Chroma Cloud<p>4. Expose grep + semantic search tools for agents via MCP<p>Just add this MCP server to your coding agent or AI SDK (Cursor, Claude, Codex, OpenAI etc.), and your agent immediately gets smarter about dependencies. Prompt it with “Use package search” and it’ll know where to look.<p>We’re excited to see what people build with it - try it out and let me know what you think!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45202664">https://news.ycombinator.com/item?id=45202664</a></p>
<p>Points: 12</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 10 Sep 2025 19:46:46 +0000</pubDate><link>https://trychroma.com/package-search</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=45202664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45202664</guid></item><item><title><![CDATA[New comment by HammadB in "Wal3: A Write-Ahead Log for Chroma, Built on Object Storage"]]></title><description><![CDATA[
<p>You will always find a copy of McLuhan on the Chroma teams bookshelf !</p>
]]></description><pubDate>Thu, 04 Sep 2025 17:24:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=45129769</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=45129769</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45129769</guid></item><item><title><![CDATA[New comment by HammadB in "Show HN: Chroma Cloud – serverless search database for AI"]]></title><description><![CDATA[
<p>Yes! It can - <a href="https://docs.trychroma.com/docs/querying-collections/metadata-filtering" rel="nofollow">https://docs.trychroma.com/docs/querying-collections/metadat...</a></p>
]]></description><pubDate>Tue, 19 Aug 2025 21:05:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=44956254</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=44956254</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44956254</guid></item><item><title><![CDATA[New comment by HammadB in "Show HN: Chroma Cloud – serverless search database for AI"]]></title><description><![CDATA[
<p>That is helpful feedback, thank you. We'll address this.</p>
]]></description><pubDate>Tue, 19 Aug 2025 19:50:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=44955601</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=44955601</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44955601</guid></item><item><title><![CDATA[New comment by HammadB in "Show HN: Chroma Cloud – serverless search database for AI"]]></title><description><![CDATA[
<p>Hi! Hammad here - Chroma’s CTO. First off, I have an immense amount of respect for the Turbopuffer team, they’ve build a solid product.<p>I understand your point. Chroma Cloud has been quietly live in production for a year, and we have been discussing this architecture publicly for almost two years now. You can see this talk I gave at the CMU databases group - <a href="https://youtu.be/E4ot5d79jdA?si=i64ouoyFMevEgm3U" rel="nofollow">https://youtu.be/E4ot5d79jdA?si=i64ouoyFMevEgm3U</a>. Some details have changed since then. But the core ideas remain the same.<p>The business model similarities mostly fall out of our architecture being similar, which mostly falls out of our constraints with respect to the workload being the same. There are only so many ways you can deliver a usage based billing model that is fair, understandable, and predictable. We aimed for a billing model that was all three, and this is what we arrived at.<p>On aesthetics, that’s always been our aesthetic, I think a lot of developer tools are leaning into the nostalgia of the early PC boom during this AI boom (fun fact, all the icons on our homepage are done by hand!).<p>On differences, we support optimized regexes vs full-scans, lending better performance. We also support trigram based full-text search which can often be useful for scenarios which need substring matches. We also support forking, which allows for cheap copy-on-write clones of your data, great for dataset versioning and tracking git repos with minimal cost. We've been building with support for generic sparse vectors (in beta) which enables techniques like SPLADE to be used, rather than just BM25. You can also run Chroma locally, enabling low-latency local workflows. This is great for AI apps where you need to iterate on a dataset until it passes evals, and then push it up to the cloud.<p>Chroma is Apache 2.0 open source - <a href="https://github.com/chroma-core/chroma" rel="nofollow">https://github.com/chroma-core/chroma</a> and has a massive developer community behind it. Customers can run embedded, single-node and distributed Chroma themselves. We've suffered from depending on closed-source database startups and wanted to give developers using Chroma confidence in the longevity of their choice.<p>Lastly, we are building with AI workloads front and center and this changes what you build, how you build it and who you build for in the long term. We think search is changing and that the primary consumer of the search API for AI applications is shifting from human engineers, to language models. We are building some exciting things in this direction, more on that soon.</p>
]]></description><pubDate>Tue, 19 Aug 2025 19:29:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=44955350</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=44955350</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44955350</guid></item><item><title><![CDATA[Context Engineering – can't call it engineering if we can't predict it breaking [video]]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=vsfbplnJyA8">https://www.youtube.com/watch?v=vsfbplnJyA8</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44540403">https://news.ycombinator.com/item?id=44540403</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 12 Jul 2025 08:45:18 +0000</pubDate><link>https://www.youtube.com/watch?v=vsfbplnJyA8</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=44540403</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44540403</guid></item><item><title><![CDATA[Spann and SPFresh vector indexing in Chroma]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/trychroma/status/1925262162567864620">https://twitter.com/trychroma/status/1925262162567864620</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44119782">https://news.ycombinator.com/item?id=44119782</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 28 May 2025 19:28:24 +0000</pubDate><link>https://twitter.com/trychroma/status/1925262162567864620</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=44119782</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44119782</guid></item><item><title><![CDATA[Building a usage-based billing system]]></title><description><![CDATA[
<p>Article URL: <a href="https://trychroma.com/engineering/billing">https://trychroma.com/engineering/billing</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43981082">https://news.ycombinator.com/item?id=43981082</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 14 May 2025 05:18:01 +0000</pubDate><link>https://trychroma.com/engineering/billing</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=43981082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43981082</guid></item><item><title><![CDATA[New comment by HammadB in "Reasoning models don't always say what they think"]]></title><description><![CDATA[
<p>There is an abundance of discussion on this thread about whether models are intelligent or not.<p>This binary is an utter waste of time.<p>Instead focus on the gradient of intelligence -  the set of cognitive skills any given system has and to what degree it has them.<p>This engineering approach is more likely to lead to practical utility and progress.<p>The view of intelligence as binary is incredibly corrosive to this field.</p>
]]></description><pubDate>Fri, 04 Apr 2025 14:43:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=43583159</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=43583159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43583159</guid></item><item><title><![CDATA[New comment by HammadB in "Model Predictive Control in the Browser with WebAssembly"]]></title><description><![CDATA[
<p>I've taken the linked Russ Tedrake class and have to say I loved this. Please make more!</p>
]]></description><pubDate>Thu, 07 Nov 2024 01:07:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=42072122</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=42072122</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42072122</guid></item><item><title><![CDATA[Napkin: Back-of-the-envelope calculations in Python]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/rescrv/napkin">https://github.com/rescrv/napkin</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41779925">https://news.ycombinator.com/item?id=41779925</a></p>
<p>Points: 24</p>
<p># Comments: 3</p>
]]></description><pubDate>Tue, 08 Oct 2024 17:48:41 +0000</pubDate><link>https://github.com/rescrv/napkin</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=41779925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41779925</guid></item><item><title><![CDATA[New comment by HammadB in "Infrared Infrastructure"]]></title><description><![CDATA[
<p>It’s COOL</p>
]]></description><pubDate>Thu, 06 Jun 2024 17:41:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=40600244</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40600244</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40600244</guid></item><item><title><![CDATA[Query Embedding Adapters]]></title><description><![CDATA[
<p>Article URL: <a href="https://research.trychroma.com/embedding-adapters">https://research.trychroma.com/embedding-adapters</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40529139">https://news.ycombinator.com/item?id=40529139</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 30 May 2024 21:54:09 +0000</pubDate><link>https://research.trychroma.com/embedding-adapters</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40529139</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40529139</guid></item><item><title><![CDATA[New comment by HammadB in "Vector indexing all of Wikipedia on a laptop"]]></title><description><![CDATA[
<p>If segmented hsnw indices were O(N log N) - it would make no sense to build the index at all - brute force would be better as O(N log N) > O(N)</p>
]]></description><pubDate>Thu, 30 May 2024 19:43:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=40527815</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40527815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40527815</guid></item><item><title><![CDATA[New comment by HammadB in "Vector indexing all of Wikipedia on a laptop"]]></title><description><![CDATA[
<p>Sure, I see. I think this is an area where complexity analysis doesn’t lead to useful information.<p>To be more correct it’s O(N/C log C) where C is the capacity of a segment. In this case you can ignore 1/C and log C as constant. So now sure, you actually just have O(N). But this is not super useful as it says that a segmented hnsw approach and brute force approach are the same - when this is really not the case in practice.<p>Also O(N log N) > O(N) so I’m not sure why we would ever do anything with segmentation according to that analysis if it were correct.</p>
]]></description><pubDate>Thu, 30 May 2024 05:35:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=40520419</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40520419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40520419</guid></item><item><title><![CDATA[New comment by HammadB in "Three.js Shading Language"]]></title><description><![CDATA[
<p>a lot of game engines have higher level shading languages (often visually programmed) that are long-lived and widely used.</p>
]]></description><pubDate>Thu, 30 May 2024 00:05:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=40518631</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40518631</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40518631</guid></item><item><title><![CDATA[New comment by HammadB in "Vector indexing all of Wikipedia on a laptop"]]></title><description><![CDATA[
<p>"The obstacle is that until now, off-the-shelf vector databases could not index a dataset larger than memory, because both the full-resolution vectors and the index (edge list) needed to be kept in memory during index construction.  Larger datasets could be split into segments, but this means that at query time they need to search each segment separately, then combine the results, turning an O(log N) search per segment into O(N) overall."<p>How is a log N search over S segments O(N)?</p>
]]></description><pubDate>Wed, 29 May 2024 23:44:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=40518480</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40518480</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40518480</guid></item><item><title><![CDATA[New comment by HammadB in "Krazam OS"]]></title><description><![CDATA[
<p>+1 I was ready to drop $$$</p>
]]></description><pubDate>Tue, 23 Apr 2024 02:05:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=40127708</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=40127708</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40127708</guid></item><item><title><![CDATA[New comment by HammadB in "The disaggregated write-ahead log (2023)"]]></title><description><![CDATA[
<p>Curious - what was the specific leaderless replication strategy?</p>
]]></description><pubDate>Wed, 07 Feb 2024 22:57:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=39295479</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=39295479</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39295479</guid></item><item><title><![CDATA[New comment by HammadB in "Discord is laying off 17 percent of employees"]]></title><description><![CDATA[
<p>I don't think that - I am speaking generally.</p>
]]></description><pubDate>Fri, 12 Jan 2024 01:10:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=38962162</link><dc:creator>HammadB</dc:creator><comments>https://news.ycombinator.com/item?id=38962162</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38962162</guid></item></channel></rss>