<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: stephantul</title><link>https://news.ycombinator.com/user?id=stephantul</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 18 May 2026 09:55:16 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=stephantul" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>It was directed at the parent who implied that we didn’t think about this.<p>I agree with your point about the evals and how you can get discontinuities: good search can be worse than bad search when agents can do many searches. We’re working on it</p>
]]></description><pubDate>Mon, 18 May 2026 06:11:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48176101</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48176101</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48176101</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>It's not probabilistic, and exact matches will always be preferred over non-exact. So if you search for a function name this will surface it.</p>
]]></description><pubDate>Mon, 18 May 2026 04:35:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175622</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175622</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175622</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>This is a bit rude.<p>We didn't generate this project, we wrote it, a lot of it manually, and trained custom models. We'd been working in the real-time retrieval space for a while, and we thought coding was a good fit for this specific technology.</p>
]]></description><pubDate>Mon, 18 May 2026 04:19:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175536</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175536</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175536</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Oh, sorry that happened. Feel free to open an issue or report it here.</p>
]]></description><pubDate>Mon, 18 May 2026 04:17:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175526</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175526</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Wow awesome, thanks for sharing! This is really useful and very much like the experiments we want to be doing in the near future</p>
]]></description><pubDate>Mon, 18 May 2026 04:02:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175443</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175443</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175443</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>This is true, agents just don't know a lot about the things they're looking at, e.g., the number of files, file sizes, etc.<p>Although for small codebases it also holds that whatever you would like to find is easy to find, so search still might help you with cost.</p>
]]></description><pubDate>Mon, 18 May 2026 04:01:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175436</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175436</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175436</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Yeah I agree. I have used semble to quickly index a large monorepo and just ask a question about it, and it surfaced the right files pretty quickly. Although without an IDE, it's difficult to display them in a nice way.</p>
]]></description><pubDate>Mon, 18 May 2026 03:58:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175422</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175422</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175422</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>The comparison is in the benchmarks, see the README</p>
]]></description><pubDate>Mon, 18 May 2026 03:58:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175419</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175419</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>For chunking, Semble supports all languages supported by tree-sitter-language-pack. The models we train are trained on 6 languages, but can handle way more.</p>
]]></description><pubDate>Mon, 18 May 2026 03:57:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175416</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175416</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175416</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>We hadn't found that one yet. Will do!</p>
]]></description><pubDate>Mon, 18 May 2026 03:55:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175399</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175399</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175399</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Yes, this is the main reason. We've released some Rust stuff in the past, but Python is our main language.</p>
]]></description><pubDate>Mon, 18 May 2026 03:54:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175393</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175393</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175393</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>The comparison is with ripgrep, see the benchmarks.</p>
]]></description><pubDate>Mon, 18 May 2026 03:53:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48175387</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48175387</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48175387</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Yeah we're also interested in doing this, it's on the roadmap together with optimization of the prompt and descriptions so that models have an easier time using it.<p>Perhaps anecdotally: we do use this tool ourselves of course, and it's been working pretty well so far. Anthropic models call it and seem to trust the results.</p>
]]></description><pubDate>Sun, 17 May 2026 20:06:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48172742</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48172742</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48172742</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Even so. Take a look at the NDCG numbers for grep. It's not pretty</p>
]]></description><pubDate>Sun, 17 May 2026 18:51:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48172031</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48172031</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48172031</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Hey, this is something we're actively investigating. We recently added a flag, `--include-text-files`, which, when set, also makes Semble index regular documents (i.e., markdown, text, json). This should also work relatively well.</p>
]]></description><pubDate>Sun, 17 May 2026 18:43:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171927</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48171927</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171927</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>The same holds for semble: the agent can fire off many different semble queries with different k/parameters.<p>I guess the point we’re trying to make is that you need fewer semble queries to achieve the same outcome, compared to grep+readfile calls.</p>
]]></description><pubDate>Sun, 17 May 2026 18:25:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171711</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48171711</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171711</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>You need readfile to do something with those tokens.
Grep only gives you the matching lines, not the context.</p>
]]></description><pubDate>Sun, 17 May 2026 18:23:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171680</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48171680</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171680</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>1) Yes! It’s not accuracy, but NDCG.
2) We assume that if the agent gets the correct answer in the returned snippets, it does not need to read further.</p>
]]></description><pubDate>Sun, 17 May 2026 17:51:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171278</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48171278</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171278</guid></item><item><title><![CDATA[New comment by stephantul in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Hey! Co-author here. The benchmark currently only measures retrieval accuracy.<p>We’re interested in measuring it end to end and also optimizing, e.g. the prompt and tools, for this, but we just haven’t gotten around to it.</p>
]]></description><pubDate>Sun, 17 May 2026 17:32:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171050</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48171050</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171050</guid></item><item><title><![CDATA[New comment by stephantul in "Δ-Mem: Efficient Online Memory for Large Language Models"]]></title><description><![CDATA[
<p>How would you conceptualize recall in this case? Is searching through the current version of your code and possibly git history not enough?</p>
]]></description><pubDate>Sat, 16 May 2026 10:39:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48158895</link><dc:creator>stephantul</dc:creator><comments>https://news.ycombinator.com/item?id=48158895</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48158895</guid></item></channel></rss>