<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ilaksh</title><link>https://news.ycombinator.com/user?id=ilaksh</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 25 May 2026 19:54:31 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ilaksh" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ilaksh in "Search engines alternatives now that Google isn't Google anymore"]]></title><description><![CDATA[
<p>Which of these alternatives actually doesn't use Google under the hood?</p>
]]></description><pubDate>Mon, 25 May 2026 13:53:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48266865</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48266865</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48266865</guid></item><item><title><![CDATA[New comment by ilaksh in "No Juniors Today, No Seniors in 2031"]]></title><description><![CDATA[
<p>Given the intense pressure to find more efficiency, 5 years is probably enough to see deployment of radical new compute-in-memory paradigms such as nitrides-based ferroelectric devices that have a few orders of magnitude more efficiency.<p>I think the biggest problem is jagged intelligence which I would bet would be solved largely with bigger models, which comes back to rapidly evolving hardware paradigms.</p>
]]></description><pubDate>Sun, 24 May 2026 16:43:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48258815</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48258815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48258815</guid></item><item><title><![CDATA[New comment by ilaksh in "No Juniors Today, No Seniors in 2031"]]></title><description><![CDATA[
<p>In less than five years it will be clear that unaugmented humans are a bottleneck.<p>So many jobs as such may be limited to humans with high bandwidth BCI connected to AI, that effectively gives them an AI exobrain. But even then the AI would need to do most of the work or the human would just slow things down.</p>
]]></description><pubDate>Sun, 24 May 2026 16:35:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48258719</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48258719</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48258719</guid></item><item><title><![CDATA[New comment by ilaksh in "No Juniors Today, No Seniors in 2031"]]></title><description><![CDATA[
<p>Good point to a degree, but it applies to all work, not just software engineering, and even before they get into the workforce. Young people are not going to have any depth of skills or knowledge if they literally just act as a go-between for AI to do everything.<p>Maybe they become experts in wiring up AI and robotics?<p>But 5 years out, we might be into a new compute-in-memory paradigm. And we may have exceeded the complexity of the human brain with models greater than 100 trillion parameters.<p>We already see performance that is superhuman in task specific ways with 1-10 trillion parameters models.<p>In under 5 years, most small and many medium business will be handled directly by AI agents. You won't even necessarily know what kind of software they are running. For small-medium business, it may have very little or no code and just be a semi-structured database that is neutrally rendered into any needed form on-the-fly.</p>
]]></description><pubDate>Sun, 24 May 2026 16:30:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48258669</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48258669</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48258669</guid></item><item><title><![CDATA[New comment by ilaksh in "Green card seekers must leave U.S. to apply, Trump administration says"]]></title><description><![CDATA[
<p>The DHS has made many communications that were openly white supremacist. It's not just an unfair situation with legal technicalities. Their views and plans are more extreme and dangerous than our society is able to accept as reality, so many are in denial. There are obvious historical parallels.<p>There need to be thorough weekly video walkthroughs of all of the detention centers. Otherwise you can expect actual starvation at some point.</p>
]]></description><pubDate>Sat, 23 May 2026 13:20:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48247441</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48247441</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48247441</guid></item><item><title><![CDATA[New comment by ilaksh in "Project Glasswing: An Initial Update"]]></title><description><![CDATA[
<p>What percentage of these are variations of the good old fashioned buffer overflow that would be impossible with Rust?</p>
]]></description><pubDate>Sat, 23 May 2026 10:59:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48246627</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48246627</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48246627</guid></item><item><title><![CDATA[New comment by ilaksh in "The memory shortage is causing a repricing of consumer electronics"]]></title><description><![CDATA[
<p>Surely some significant fraction of the billions and billions are being spent on intensive R&D to for example increase scale of MRAM modules or radical new paradigms like nitrides-based ferroelectric compute-in-memory?<p>I think there's a reason that University of Michigan researcher jumped after that ferroelectric breakthrough.<p>I am hopeful that some of this massive pressure is going to push us into the next paradigm, which is likely some kind of "true" compute-in-memory system. That may increase performance and efficiency for AI by up to 100 X.</p>
]]></description><pubDate>Fri, 22 May 2026 16:07:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48237812</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48237812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48237812</guid></item><item><title><![CDATA[New comment by ilaksh in "Pope Leo XIV’s first encyclical Magnifica humanitas to be published May 25"]]></title><description><![CDATA[
<p>I'm an atheist, but most of what I have heard from popes in recent years seems like sound and possibly needed advice.<p>Also, even though I feel AI and robotics are very important for progressing humanity, I think that much of the world has long since lost a proper sense of intrinsic human value. It's really gone from overt exploitation to slightly more mild exploitation where we pretend the system is really merit based.<p>And as AI and robotics remove the need for human labor, I hope that someone like the pope can convince people that we should value human beings inherently and more fairly. Inexpensive labor and intelligence should make this feasible.<p>I hope the speech isn't something dumb like "remember only humans have souls" because I think that's really premature and pretty obvious that AIs are not people at this point.<p>The really convincing and somewhat deeper simulations of humans are probably only a few years down the line though.<p>Which comes back to the Rovelli dualism article that was on the front page before. I think we should not be in a hurry to try to duplicate humans in depth (such as imitating emotions, pain, stream of consciousness, self-preservation, etc).
It's just completely unnecessary to go that far to get useful AI, and obviously unethical to subject a real human emulation to slavery.</p>
]]></description><pubDate>Tue, 19 May 2026 00:36:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48187830</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48187830</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48187830</guid></item><item><title><![CDATA[New comment by ilaksh in "Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution"]]></title><description><![CDATA[
<p>Amazing. Is it possible to do this with Qwen 3.6 27B? Will it work with quants (I assume so)?</p>
]]></description><pubDate>Sat, 16 May 2026 04:26:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48156825</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48156825</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48156825</guid></item><item><title><![CDATA[New comment by ilaksh in "A few words on DS4"]]></title><description><![CDATA[
<p>I want something like this but not only for my own computer but also for client projects or stuff I might run in cloud GPUs. Because the core idea of having a strong model that is efficient and doesn't require a cluster still applies to a lot of business cases. I am hoping something like this can work in batch mode.<p>Right now I feel like a 4bit Qwen 3.6 27B with MTP is one of the best for agentic tool calling for some smart voice agents in an H200. I wonder if DS4 Flash being using 80b at 2 bit with 13b active and MTP could be even faster and smarter and allow more concurrent sequences?<p>This special 2bit quantization seems like a big deal.</p>
]]></description><pubDate>Fri, 15 May 2026 06:15:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48145137</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48145137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48145137</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Yes I assume some kind of LLM system will eventually make its way into OS package registries and eventually there will be a standard of some sort. Already OpenAI's API is the de facto standard available from all providers. But who knows what port it's on. Maybe implement mDNS/DNS-SD or SSDP, and if not there it could just go through default ports like the Ollama one and check for a certain endpoint.</p>
]]></description><pubDate>Wed, 13 May 2026 18:44:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48125773</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48125773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48125773</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>It does just barely work now after they spent billions, and they may still fall back to cloud LLMs for a significant number of things. This is a way that everyone can get that on the actual Apple Watch or local phone for any application they build.</p>
]]></description><pubDate>Tue, 12 May 2026 23:31:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48115948</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48115948</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48115948</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Good idea. Could you make that.</p>
]]></description><pubDate>Tue, 12 May 2026 20:06:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48113735</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48113735</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48113735</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>I think GLM 5.1 or Kimi 2.6 could substitute for this type of purpose.</p>
]]></description><pubDate>Tue, 12 May 2026 20:04:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48113707</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48113707</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48113707</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Oh gotcha. Fixed my comment.</p>
]]></description><pubDate>Tue, 12 May 2026 19:32:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48113274</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48113274</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48113274</guid></item><item><title><![CDATA[New comment by ilaksh in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Hmm.. this might make it feasible to build something like a command line program where you can optionally just specify the arguments in natural language. Although I know people will object to including an extra 14 MB and the computation for "parsing" and it could be pretty bad if everyone started doing that.<p>But it's really interesting to me that that may be possible now. You can include a fine-tuned model that understands how to use your program.<p>E.g. `> toolcli what can you do` runs `toolcli --help summary`, `toolcli add tom to teamfutz group` = `toolcli --gadd teamfutz tom`</p>
]]></description><pubDate>Tue, 12 May 2026 19:24:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48113166</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48113166</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48113166</guid></item><item><title><![CDATA[New comment by ilaksh in "Bambu Lab is abusing the open source social contract"]]></title><description><![CDATA[
<p>One aspect of this may be Chinese laws, which I am assuming if they don't already require the ability to monitor or even censor what gets printed, they will soon. Even in the US we are starting to have legislation related to blocking firearm component printing, but this doesn't necessarily mean central servers do it or have access to everything printed. Yet.<p>I think the primary problem is actually more than just Bambu's behavior, it's that China is an authoritarian country, and most of the population not only accepts the idea of central servers monitoring and "moderating" behavior but largely may embrace it as a sensible thing to do. It's probably beyond Stockholm Syndrome to the point of much of the culture genuinely not completely even understanding the idea of why privacy and personal control is important.<p>Much of the United States is so far on the other side that they can't begin to understand the position Bambu is in. Large companies in that country just do not have the option to allow their users to bypass censorship and monitoring.<p>I do think it's actually great that this type of issue gets in everyone's face though and it's great people are fighting back. But realize that the problem is deeper than one company. It's the whole type of government and attitude towards it and technology.</p>
]]></description><pubDate>Tue, 12 May 2026 18:52:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48112639</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48112639</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48112639</guid></item><item><title><![CDATA[New comment by ilaksh in "Students boo commencement speaker after she calls AI next industrial revolution"]]></title><description><![CDATA[
<p>Again that is a problem with the group of people and how they use technology rather than the technology itself.<p>I will die on this hill: AI _properly_ integrated into education will be a huge improvement for students because it will enable each student to have personalized instruction and tutoring.</p>
]]></description><pubDate>Mon, 11 May 2026 19:16:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48099409</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48099409</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48099409</guid></item><item><title><![CDATA[New comment by ilaksh in "Students boo commencement speaker after she calls AI next industrial revolution"]]></title><description><![CDATA[
<p>> The people who rely on it are not cool.<p>That's the only statement that's true. Admiting to AI use is unfashionable in the western world at this time.<p>But how much would you like to bet that 90% of those students who were booing also used AI to do their homework for them quite often? So your take away would be "the AI stole their education". No, they were dishonest and the AI helped them cheat themselves out of learning.<p>Technology doesn't make anything banal or a hellscape, or fire people. Technology is a lever.<p>If humans use AI to produce worse output because they are too lazy to bother reviewing and iterating on it, that is a human problem. If humans are going to use AI to help them exploit other humans more efficiently, that is also caused by the human rather than the technology.<p>Also, the ChatGPT moment for humanoid robots is coming this year or next. It will become very obvious that AI use in these robots is not just superficially plausible text.</p>
]]></description><pubDate>Mon, 11 May 2026 19:04:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48099260</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48099260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48099260</guid></item><item><title><![CDATA[New comment by ilaksh in "I'm going back to writing code by hand"]]></title><description><![CDATA[
<p>He says he went several months without having to do a code review and it worked the vast majority of the time. That's incredibly impressive work by the AI.<p>AI may default to mediocre and often somewhat buggy code unless you iterate because that is just what the vast majority of human written code that it has seen looks like. But the fact that he got away with not reviewing the code for so long to me proves the opposite of his conclusion.<p>1690 lines of code in one file is a walk in the park for SOTA models.<p>He can just say something like:<p>"Please review and create a refactoring plan and test suite. I found atrocious architectural decisions like numerous special cases and if statements rather than using abstractions properly. Make a few notes in comments and architecture.md to never do this again."<p>One could also argue that it was a better decision each time by the AI to just never do a refactor unless prompted because that increases the likelihood of something breaking and you want to do that after you verify the minimum code change actually functionally does what you want.<p>Also I bet you the headline is a lie. He basically admits it by saying he is writing the core structure of the next version by hand ahead of time, implying that he will generate the rest. So the title is a half-truth at best.</p>
]]></description><pubDate>Mon, 11 May 2026 17:38:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48098067</link><dc:creator>ilaksh</dc:creator><comments>https://news.ycombinator.com/item?id=48098067</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48098067</guid></item></channel></rss>