Hacker News: deepsquirrelnet

New comment by deepsquirrelnet in "Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions"

deepsquirrelnet — Mon, 22 Jun 2026 16:58:07 +0000

I'd been working with language models for several years before LLMs were a solution to this kind of problem. These are some ideas "off the top of my head" about how you can do classification in various ways. There's really a lot of ways to tackle it now, and a lot of trade-offs you can learn by experimenting with them.

There's even more options still, especially if you go further back toward more traditional methods. Static word vectors like GloVe or fasttext (optionally more modern equivalents like WordLlama or Model2Vec). Then there's sklearn-style stuff too. Those can be really small/fast but have more accuracy-level tradeoffs.

New comment by deepsquirrelnet in "Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions"

deepsquirrelnet — Mon, 22 Jun 2026 02:57:28 +0000

If you want to go deeper on language models, try these project ideas:

- Zero-shot encoders like tasksource or GliNER

- Natural language inference: https://huggingface.co/blog/dleemiller/nli-xenc-ways-to-use

- GRPO training

- GEPA prompt tuning Qwen 0.6B (or GEPA, then GRPO)

- Use an embedding model and train a classifier (MLP, logistic, svm)

- Use a larger LLM to generate a synthetic dataset (beware of lack of diversity, mine "seed text" from real sources first)

- Synthetically generate "hard examples" where more than one category may be valid and DPO tune your preferred responses

New comment by deepsquirrelnet in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"

deepsquirrelnet — Sat, 13 Jun 2026 19:21:54 +0000

The first step to regulatory capture is getting yourself regulated...

New comment by deepsquirrelnet in "US employers spend more than $1.5B a year to fight labor unions, report finds"

deepsquirrelnet — Thu, 21 May 2026 15:05:00 +0000

> Unions exist to benefit the median and bring up the floor, but it stifles competition among those who really do desire to be at the top. And in doing so while it brings up the floor, it also brings down the ceiling because people who would normally be motivated enough to move up would not have much incentive to do so anymore.

I think people tend to fixate on the worker-to-worker differences inside of unions. Yes, that is the most visible part of a union when in place, and at least in the US has valid arguments about meritocracy.

What is missed when limiting the scope to just that is the population-level abuses of workers that no amount of meritocracy will fix. When corporations engage in collusion against workers (now common and nearly unpunished in the US) the top-level wages are suppressed industry wide.

The whole pay band alignment that comes out of that undermines the meritocracy argument, and doesn't even begin to address the wage-fixing that has gone almost unchecked in tech for decades[1,2]. As a merited employee, you might have more options to where you can go, but it won't protect you from predatory hiring/layoff cycles and it certainly won't guarantee that you'll receive a truly competitive wage.

On paper, meritocracy sounds great. I have worked many places in tech and never once observed it, personally. Best case, if you have warmed a seat for enough years, then you advance that way. Worst, your employer knows they can just take advantage of you because you're willing to work without a dangling carrot.

As before, either the government frees itself from corruption and enacts justice or unions will come back. That is point we are at.

[1] https://www.npr.org/sections/alltechconsidered/2015/01/16/37...

[2] https://conversableeconomist.com/2025/10/31/the-silicon-vall...

New comment by deepsquirrelnet in "After Town Bans Flock, Councilmember Crashes Out, Proposes Internet, Phone Ban"

deepsquirrelnet — Wed, 20 May 2026 18:45:59 +0000

Baked into that is a presumption of justice, which is becoming comically out of touch to the point where that overused phrase could be a meme.

New comment by deepsquirrelnet in "Eric Schmidt speech about AI booed during graduation"

deepsquirrelnet — Mon, 18 May 2026 13:33:39 +0000

That’s even a bit optimistic. We can’t agree that all people with full time jobs should be able to afford the basics for their own survival.

New comment by deepsquirrelnet in "At least 25 Flock cameras have been destroyed in five states since April 2025"

deepsquirrelnet — Sun, 17 May 2026 18:52:13 +0000

Flock is only one company. Someone in my town smashed one from a different company and was treated like a hero in Facebook comments. It’s not mentioned in the article.

New comment by deepsquirrelnet in "A Meta employee gets real about the horror of working there"

deepsquirrelnet — Sat, 16 May 2026 14:02:20 +0000

Not that I’d want to work there given what they do, but every time I’ve been contacted by a recruiter there, it seems like it’s within a month of a mass layoff they’ve had… which is maybe just because they seem to have mass layoffs every quarter now.

They also seem to have adopted a no-remote hire policy and are in an extreme high CoL location. It’s a truly awful mix for trying to attract outside talent. I don’t know why they even bother.

New comment by deepsquirrelnet in "Twin brothers wipe 96 government databases minutes after being fired"

deepsquirrelnet — Wed, 13 May 2026 18:28:17 +0000

Professionally, he spells his name thusly: FBI Director Ka$h Patel, so you know he’s serious.

New comment by deepsquirrelnet in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"

deepsquirrelnet — Tue, 12 May 2026 20:35:21 +0000

This is really cool. Any plans to release the dataset?

New comment by deepsquirrelnet in "The bottleneck was never the code"

deepsquirrelnet — Wed, 06 May 2026 13:44:13 +0000

Any discussion related to this topic always seems to assume everyone uses code the same way and for the same function, and then forces the rest of the world through that lens.

So here we walk around the circle one more time again, voicing our anxieties, talking past each other, waiting for the next opportunity for commentary to come in half an hour.

New comment by deepsquirrelnet in "Show HN: Site Mogging"

deepsquirrelnet — Fri, 01 May 2026 13:24:51 +0000

I think it means which site is numberwang.

New comment by deepsquirrelnet in "Meta in row after workers who saw smart glasses users having sex lose jobs"

deepsquirrelnet — Thu, 30 Apr 2026 14:59:15 +0000

> At the time of the publication, Meta admitted subcontracted workers might sometimes review content filmed on its smart glasses when people shared it with Meta AI.

They just got fired for "piercing the veil". They committed the sin of bringing attention to the invasion of privacy.

New comment by deepsquirrelnet in "U.S. Debt Tops 100% of GDP"

deepsquirrelnet — Thu, 30 Apr 2026 13:31:58 +0000

And now debt is 1 GDP year. I don’t see the problem.

New comment by deepsquirrelnet in "Mistral Medium 3.5"

deepsquirrelnet — Wed, 29 Apr 2026 18:13:29 +0000

I would love to be able to run frontier locally, but I think the larger importance of open weight models is price accountability.

In the US with our broken system of capitalism, it’s the only way we can tether these companies to reality. Left to their own devices, I’m not convinced they would actually compete with each other on price.

Buy nobody like to talk about how “moat” building is fundamentally anti-competitive, even in name.

Funny that self proclaimed capitalists hate the system in practice. Commodity pricing is what truly terrifies them.

New comment by deepsquirrelnet in "Why AI companies want you to be afraid of them"

deepsquirrelnet — Wed, 29 Apr 2026 16:12:46 +0000

This is my own take, directly related to this that I posted a little while back. The one thing that I think the article missed is the geopolitical angle they’re also working:

* We need to completely deregulate these US companies so China doesn't win and take us over

* We need to heavily regulate anybody who is not following the rules that make us the de-facto winner

* This is so powerful it will take all the jobs (and therefore if you lead a company that isn't using AI, you will soon be obsolete)

* If you don't use AI, you will not be able to function in a future job

* We need to lineup an excuse to call our friends in government and turn off the open source spigot when the time is right

They have chosen fear as a motivator, and it is clearly working very well. It's easier to use fear now, while it's new and then flip the narrative once people are more familiar with it than to go the other direction. Companies are not just telling a story to hype their product, but why they alone are the ones that should be entrusted to build it.

New comment by deepsquirrelnet in "The AI industry is discovering that the public hates it"

deepsquirrelnet — Sat, 25 Apr 2026 22:05:52 +0000

> In a provocative GitHub post, machine-learning engineer Han-Chung Lee argued that even rosy internal numbers that do show AI-assisted productivity gains are suspect, as they’re produced to hit adoption targets no one can effectively audit.

Isn't this fundamentally what MBAs do with their time? Keep going with this analysis, because it goes much deeper... In my experience, BI is often a house of cards. A lot of times it's just narrative crafting, just like we're all encouraged to do when we write our resumes.

Can you embellish a story? Can you invent a convincing political narrative? As far as I can tell, that's the fundamental unit of US corporation.

New comment by deepsquirrelnet in "Anker made its own chip to bring AI to all its products"

deepsquirrelnet — Wed, 22 Apr 2026 18:15:41 +0000

> Traditional call noise canceling relies on those small onboard neural networks and can have difficulty isolating your voice in very noisy environments, which results in ambient noise leaking through or voices getting highly compressed, making it difficult to hear. Anker says the larger neural network available on the Thus chip, plus eight MEMS (micro-electromechanical systems) microphones and two bone conduction sensors to focus in on your voice, in its yet-to-be-announced earbuds will have significantly cleaner call audio, regardless of the environment.

Anyone who likes good noise cancellation, which is a lot of people.

Back in the day we just called it ML. But now you have to stop for a minute to read and determine what they’re talking about, because “AI” is primarily a marketing term.

New comment by deepsquirrelnet in "Kimi K2.6: Advancing open-source coding"

deepsquirrelnet — Mon, 20 Apr 2026 19:43:14 +0000

I tried it on openrouter and set max tokens to 8192, and every response is truncated, even in non-thinking mode. Maybe there's an issue with the deployment, but in your link also shows it generates tons of output tokens.

New comment by deepsquirrelnet in "Claude Opus 4.7"

deepsquirrelnet — Thu, 16 Apr 2026 16:01:08 +0000

My tinfoil hat theory, which may not be that crazy, is that providers are sandbagging their models in the days leading up to a new release, so that the next model "feels" like a bigger improvement than it is.

An important aspect of AI is that it needs to be seen as moving forward all the time. Plateaus are the death of the hype cycle, and would tether people's expectations closer to reality.