Hacker News: ThunderSizzle

New comment by ThunderSizzle in "RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8"

ThunderSizzle — Sat, 13 Jun 2026 16:11:40 +0000

An R9700 is $1350 and can get 100 TPS running Qwen3.6-35B-A3B Q5 with 130k context window (with room to spare) with a bit of fine tuning llamacpp-vulkan, but llamacpp's repository instability and lack of real versioning frustrates me.

In terms of electricity, if you aren't using it, even with all the vram loaded, at most your wasting about 30 watts or so.

Prompt processing a large uncached context is annoying, which is why I forced a lower context window, but I don't know if it's any worse in performance than the cloud models I've used.

There's a niceness, to me, knowing I don't have to rent it anymore. If you rent it, the terms can change regularly.

New comment by ThunderSizzle in "Why I'm Forced to Say Farewell: Google Management Has Lost Its Moral Compass"

ThunderSizzle — Fri, 12 Jun 2026 01:34:56 +0000

Like a compliment sandwich or something.

New comment by ThunderSizzle in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"

ThunderSizzle — Fri, 12 Jun 2026 01:28:09 +0000

The coding isn't the hard part. It's the people and networking. Facebook's only moat is HOA boards that think private communication behind Facebook groups somehow equates to public messaging a community...

In other words, once people got on it, it was too late.

New comment by ThunderSizzle in "Uber's $1,500/month AI limit is a useful signal for AI tool pricing"

ThunderSizzle — Wed, 03 Jun 2026 23:40:33 +0000

I hope this is sarcasm, or "half" my job doesn't exist or something. Or you talking about full time non-dev scrum masters?

New comment by ThunderSizzle in "Nvidia RTX Spark"

ThunderSizzle — Tue, 02 Jun 2026 10:52:15 +0000

My 980 is currently unpowered. I have not tried to integrate them yet. It's on my to-do list, but the first time they were both powered on, the system had booting issues, and I didn't want to care at the time, since the 980 was probably going to be idle 99% of the time anyway.

I'll probably try to figure that problem out in about a month. Worst case is I move it to another even older desktop to replace the 9800 GTX+ inside of that one.

New comment by ThunderSizzle in "Nvidia RTX Spark"

ThunderSizzle — Tue, 02 Jun 2026 04:00:20 +0000

GitHub Copilot. It was one of the best values around in terms of cheap LLM access since each prompt was basically 4 cents (more or less), no matter how much it would do or how many tokens it used. A simple "Proceed" prompt that was telling the agent to execute a sophisticated plan could burn a lot of time without needing any direct intervention by the user, but as of June 1st, they switched to metered billing, meaning each token in/out has a cost now.

It was suspected to come soon enough, but it was a nice cheap road for my small hobby stuff. When they announced the price changes, I started to explore alternatives, and with the news of Qwen3.6 35B being both and having quality, I figured it was worth a try out, and self-hosting made the most sense to me, since that meant I was free from being a forever-renter.

New comment by ThunderSizzle in "Nvidia RTX Spark"

ThunderSizzle — Tue, 02 Jun 2026 01:28:20 +0000

I added an R9700 32GB to my 10+ year old desktop that had a 980 4GB card in it, for a grand total of $1350 or so. The payoff compared to what I was using with GHCP was 33 months, but when GHCP announced their price increase, it basically became a 3 month payoff at minimum (so yes, GHCP did a 10x price increase for non-parallel agentic workflows)

I can easily run Qwen3.6 35B-A3B with Q5_K_M with a 260k+ context window with some vram to spare. It easily runs probably 80tps. It took me quite a while to find the

Compared to GHCP Claude Sonnet 4.5 or 4.6, I have full parity. The wall clock time is faster for agentic workflows, and rule following is about on par.

With either, doing something kind of novel or obscure takes more hand holding compared to just generate a GUI or crud app. For example, trying to build an actual program that performs a complicated process correctly requires quite a bit of hand holding to get it to properly help.

Sure, it isn't Opus or something, but I think with the right harness, it probably can get close. I think most of the issues these days is the harnesses are lacking.

New comment by ThunderSizzle in "Why are large language models so terrible at video games?"

ThunderSizzle — Mon, 01 Jun 2026 10:20:59 +0000

I wonder if you paired a few different types of AI together, an LLM agent might be good at strategizing -. E.g. building a strategy on how to handle a scenario. But, it would need to know the entire game manual basically. Then it would pass the stratrgy to a better AI in some way. But it might not be needed if the better gaming AI can just do that part too already.

I admit I know nothing about this though.

New comment by ThunderSizzle in "Is AI causing a repeat of frontend’s lost decade?"

ThunderSizzle — Sat, 30 May 2026 02:59:18 +0000

My instructions and steering is to force the LLM agent to focus on mocking only system boundaries and outside in unit testing. I've found these two make verbose but good tests that actually prove behavior pretty well.

But that was my testing strategy already, but writing one of those tests could take legitimately hours with how many columns and crazy rules our area has.

New comment by ThunderSizzle in "DuckDuckGo search saw 28% more visits after Google said people love AI mode"

ThunderSizzle — Thu, 28 May 2026 19:22:43 +0000

There's a lot of reasons, but it's probably how Google makes most of their money - brand names end up being high value ad targets - at least 6+ years ago when I worked in that sphere of things (ecommerce, primarily). It's legit extortion.

I guess that's why - because extortion is immoral. Eventually, those paying the protection fees will eventually find someone better to pay or will stop paying when the protection doesn't show up.

New comment by ThunderSizzle in "DuckDuckGo search saw 28% more visits after Google said people love AI mode"

ThunderSizzle — Thu, 28 May 2026 10:44:46 +0000

Right, but that's because Google permits that. OP is saying Google should simply stop permitting advertising over the real answer.

New comment by ThunderSizzle in "YAML? That's Norway Problem"

ThunderSizzle — Sun, 24 May 2026 00:32:02 +0000

Fine. Then auto-closing tags? If you have a schema doc, you could do that. If you are writing free-form, then I'd definitely be complicated...?

I still see that better than YAML's indentation problem. Relying solely on indentation is a nightmare.

New comment by ThunderSizzle in "Microsoft starts canceling Claude Code licenses"

ThunderSizzle — Sat, 23 May 2026 11:05:16 +0000

If you do manual edits, I find it best to start a new conversation. But if your instructions and documentation is good enough, the new conversations won't have any problems picking up where it needs to be.

Having said that, I fear what June 1st brings for copilot It might suddenly be very useless for me.

New comment by ThunderSizzle in "YAML? That's Norway Problem"

Sat, 23 May 2026 02:02:00 +0000

It wouldn't need to be if closing tags were allowed to be unnamed. For most cases, we can tell the closing tags easily enough for simpler files:

  MAML
  
    minimal
    readable
  
  
  
  
    1
    Anton Medvedev
  

  
  
    
      JSON
      2001
    
    
      MAML
      2025
    
  

  
  This is a multiline raw strings.
  Keeps formatting as-is.

New comment by ThunderSizzle in "Intuit to lay off over 3k employees to refocus on AI"

ThunderSizzle — Thu, 21 May 2026 10:51:02 +0000

Freetaxusa isn't. I've done complicated schedule K, etc forms on it

New comment by ThunderSizzle in "Vivaldi 8.0"

ThunderSizzle — Thu, 21 May 2026 10:41:59 +0000

But what's your argument? That phone-based extensions are more vulnerable somehow than desktop extensions?

If anything, wouldn't a phone extension be more sandboxed than most desktop environments?

New comment by ThunderSizzle in "Local AI needs to be the norm"

ThunderSizzle — Sat, 16 May 2026 00:27:01 +0000

What if you split it into less complex tasks? E.g. use the model to help decompose the task into parts, then help it iterate through it.

Gives you more control over the outcome and more steering anyway.

New comment by ThunderSizzle in "The US is winning the AI race where it matters most: commercialization"

ThunderSizzle — Wed, 13 May 2026 19:39:07 +0000

A Qwen3.6-35B-A3B or whatever it's full name is, when on a 3090, can at the very least, with very little fine tuning, compete with Haiku and blows away GPT4.1 (aka, the cheap models).

It might keep up with Sonnet 4.5 with some tinkering.

But long story short: it seems to have better performance and similar quality for a payoff of a year or so compared to cloud models. In the same way you can self host faster/easier/cheaper than cloud hosting, if you are okay with the negatives.

I'm returning my 3090 soon for a R9700 after some more basic benchmarking, since the higher RAM should improve my observations more.

New comment by ThunderSizzle in "Cloudflare to cut about 20% of its workforce"

ThunderSizzle — Sat, 09 May 2026 10:47:35 +0000

I've heard this before.

Then COVID happened and employers didn't have 2-4 months of savings built up, and ended up shuttering due to lack of money immediately.

Also, since post COVID, we've had hyper inflation and a locked up housing market. $150 townhomes that a $50 family salary could afford are now going for $300. And rent has gone up to match.

I don't think the numbers not matching is because of anyone's personal financial responsibility. It's more from the Fed's and Congress's horrible actions over the past 2 decades and their financial irresponsibility.

Granted, grumbling about the powers that be doesn't solve the problem, which is why I fear civil turmoil will be here very soon.

New comment by ThunderSizzle in "Uber wants to turn its drivers into a sensor grid for self-driving companies"

ThunderSizzle — Sat, 02 May 2026 19:20:08 +0000

Many American roads don't have lines. Residential roads, parking lots, many business driveways have limited markings.

Then there's roads with just the center line markers with no road should markings.

Then there's a whole class of roads of lines over "demarked" old lines that weren't demarked well, or lines fading that should've been painted a long time ago.

I'm surprised you've never seen a non-perfect road?