Hacker News: benjiro29

New comment by benjiro29 in "GLM-5.2 is the new leading open weights model on Artificial Analysis"

benjiro29 — Thu, 18 Jun 2026 00:32:06 +0000

> doesn't seem to take into account the cost savings from cache hits

Absolute false information.

From my usage panel for this month:

* Total Tokens 1.1B * Cached Tokens 1.0B 97% of prompt tokens * Cost energy pricing $26.58

The energy pricing is higher then what i actually pay because its a mix of token billing and partial subscription (60% extra "power").

From the $50 subscription, i have about 3/4 left (4.21 of 16.0 kWh used this billing cycle). Used $5.5 in token billing.

That was running 82.0% GLM 5.1, and 18% GLM 5.2. Yes, i have been busy ;)

My actual usage if we look in dollar value was ~ $18.

For your information, that is cheaper the MiMo v2.5 Pro from Xiaomi as there i was doing around 450.000t per cent. And they have the same 75% cheaper prices like DeepSeek. MiMo has a issue with cache retention between session prompts what hurts them vs DeepSeek. Yes, DeepSeek v4 Pro is 2.5x cheaper but nowhere near GLM 5.1, and especially not GLM 5.2.

In case your wondering, zai subscription light is about 80m token / week limit. So on a token/cent price, neutralwatt is about 3x cheaper (and not 5h, week limits to maximize/frustrate).

> all while taking weeks to onboard new models.

Took them 1 day to include GLM 5.2 ... Yes, the remove old models fast because they do not have the server capacity to keep old models around.

> I assume some of these problems would be addressed if we had an SLA/enterprise contract.

Its a small team, not a big huge company. From my experience so far, seen a 2 timeouts, and sometimes slow speeds as servers get overloaded. For what i am paying for GLM ~5.1~ 5.2 ...

New comment by benjiro29 in "Hetzner Price Adjustment"

benjiro29 — Wed, 17 Jun 2026 11:50:10 +0000

> As long as they are in top of price/performance/quality nobody will switch.

Very sure that those new prices has put them out of the whole price/performance/quality bracket.

* Past: consumer level hardware for basic support, and low prices.

* Now: consumer level hardware for basic support, and extreme high prices.

So the entire peg for their hardware choice vs pricing, has collapsed.

New comment by benjiro29 in "GLM-5.2 is the new leading open weights model on Artificial Analysis"

benjiro29 — Wed, 17 Jun 2026 11:41:10 +0000

A yes, the stealth advertisement post ...

New comment by benjiro29 in "GLM-5.2 is the new leading open weights model on Artificial Analysis"

benjiro29 — Wed, 17 Jun 2026 11:37:01 +0000

Neuralwatt ... When you reverse calculate the actual energy usage / price on a token basis, the gap is large.

I do not have GLM 5.2 numbers because the whole default max setting is overkill. But GLM 5.1 numbers had it at 12x cheaper then API rates. And about 2.5x more tokens vs zai their own subscription service.

Yes, its FP8 but lets be honest, do we know for sure that even zai runs at FP16? I learned a long time ago with Claude and Codex how much cheating happens on model levels, even from the big boys.

New comment by benjiro29 in "GLM-5.2 is the new leading open weights model on Artificial Analysis"

benjiro29 — Wed, 17 Jun 2026 11:27:13 +0000

GLM 5.2 Max = Opus 4.8 Max in thinking behavior. The thinking chain is so similar, and so is the amount of token usage on the output.

If you want reasonable token usage, you need to run it GLM 5.2 at High. There is little drop in quality from Max to High (for most tasks). And it cuts token usage by 2 a 2.5x. GLM 5.2, Max is really something you only need for complex tasks.

In essence, GLM 5.2 is Opus 4.8 its little brother, at a way, WAY cheaper price.

There has been really no training on Opus models going on, really, none i tell you! /sarcasm

New comment by benjiro29 in "Hetzner Price Adjustment"

benjiro29 — Mon, 15 Jun 2026 16:16:47 +0000

I remember the price increase that Hetzner did during 2022 because of the invasion in Ukraine. The said they will adjust the prices down when the electricity price reduced.

Guess what? I am paying as a consumer about the same price as before 2022. Did Hetzner change their price down? Remember, the industrial price also dropped (and they also build out a large solar plant). No ...

Ok, inflation? But those price increases already covered part of that... Just saying, its not been the first price increase that happened. There have been multiple ones that Hetzner did over the years. Some flew under people radars.

> Payroll has seen significant increases in Germany,

Yea, we have seen nothing of that increase... O, wait, they reduce our income because the social security increase their costs. Yay ..

New comment by benjiro29 in "Hetzner Price Adjustment"

benjiro29 — Mon, 15 Jun 2026 16:11:10 +0000

They are scaling up, but most will only come online in end 2027-2028 time frame. And Memory, as in what we use in PCs is easier to manufacture then HBM memory. But all the money is in HBM ...

So for every ~4GB of memory that you can produce in normal DDR5, you can only make 1GB of HBM. But you make multiple times the revenue.

The demand for HBM memory is not going to go away. LLMs are memory bandwidth hungry, and we are going to see production going to AI. But also to "lower end" like B200's.

That means, they are producing multiple times less memory (if we look for the normal market demand), but still need to produce more for the memory bandwidth hungry market.

We are seeing more products entering the "prosumer/business" market that are also memory bandwidth hungry. This demand will not go away. It will actually increase as companies move to more localized workloads. There is is a issue with data privacy that a lot of companies legally deal with.

The lacking ramp up is not a sign of them being scared of over production, its a realization that 3 companies hold the market in a strangle hold, and "slow" scale. If everybody plays friendly, they can milk this for years.

China is a solution but China does not have the HBM production levels, and will take years to scale and put a dent in the market. And China is ... allocating a lot to domestic production of AI > HBM ...

The reality is, that unless competition ( as in China ) does not start scaling beyond the expected levels, the big 3 have no reason to scale too fast.

And money is not the issue ... have you seen their revenue (and net profit!! ) numbers. A few billions is peanuts for them at this point. They simply do not want to scale too fast because that means less milking ... Memory demand is not going to away. When people talk about the AI bubble popping, its more in terms of the stock market. The product is here and not going away.

New comment by benjiro29 in "Hetzner Price Adjustment"

benjiro29 — Mon, 15 Jun 2026 15:56:02 +0000

These prices have absolute nothing to do anymore with memory prices. Do not forget that Hetzner already increased the setup fees by a factor of 4x before to compensate for the price. And also servers getting price increases.

It seems they have shifted by reducing the setup fees, and increasing the monthly costs. As this generates more revenue. And its easy to prove this...

AX42 ... Its 8700GE that has gone from 65 Euro to 225 Euro. With the setup fee now being 112 Euro instead of 225 Euro. It has 64GB memory, and 1TB storage. The storage even in todays market is 100 Euro. The memory is 644 Euro.

Do the math ... Hetzner servers had a hardware payback periode of between 9 to 11 month if you took the market value. This calculation has always been very stable over the 20 years i used Hetzner.

This new price, reduced the hardware payback periode to ~4 month. It seems to be that Hetzer is trying to use the memory price issues, as a excuse. The revenue of those same servers now increased to a insane level. More revenue with less hardware.

The real issue is that a lot of companies are moving from US hosting to EU hosting because of the problems with the US. Hetzner sees this as the perfect time to cash in on Enterprise customers.

They have been trying to replace the "cheap" normal consumers with enterprise. This trend has been going on for a while already.

Every customer that now leaves, is a server they can rent out to business customers.

If you want to see the same thing, look up what happened to Microsoft/Github Copilot where they turn around has been sudden and very strong, with a clear goal of moving everything to enterprise.

New comment by benjiro29 in "Anthropic flies staff to D.C. to clean up White House fight"

benjiro29 — Mon, 15 Jun 2026 11:10:58 +0000

> Only U.S. citizens and immigrants that are holders of a "green card" may now access Mythos.

It was my understanding that not even green card holders may access Mythos. Normally when restrictions like this are put in place, you need a exemption as a green card holder. A geen card is just a permit to live and work in the US. Its not the same as citizenship.

> Security Clearance: Green Card holders are generally prohibited from jobs that require high-level security clearances or sensitive government/military roles reserved exclusively for U.S. citizens.

New comment by benjiro29 in "GLM 5.2 Is Out"

benjiro29 — Sun, 14 Jun 2026 10:54:49 +0000

> I don’t understand how I grew up thinking USA is the gold standard is good and China just make cheap copies and is bad

You did not grow up in the 80s ... Where it was the same about US vs Japan. Look how it turned out for several of the US industries. The US tends to sleep, look down on other countries, and then it loses key industries because of that attitude.