Hacker News: refulgentis

New comment by refulgentis in "RSoC 2026: A new CPU scheduler for Redox OS"

refulgentis — Fri, 10 Apr 2026 19:04:20 +0000

Thanks for the link! For anyone else clicking through: this is the kernel entry point for x86/x86_64. What it shows is a very thin #[naked] inline assembly trampoline (kstart) that sets up the stack and immediately jumps into unsafe extern "C" fn start(), which is pure Rust. So the bootstrap path here is: a few dozen lines of inline asm to Rust, no C.

Though it's worth noting this is one file in one component. The kernel entry being C-free doesn't necessarily tell the whole story. If you peek at that directory listing in the blog post, relibc/ is in recipes/core. relibc is Redox's libc replacement, which is mostly Rust, but has historically needed a C compiler for some POSIX compatibility shims.

And the bootloader, firmware handoff, and build system are all separate questions.

So the short answer to the original question seems to be: the kernel itself bootstraps from minimal inline asm directly into Rust, no C in the path. The full OS build story is probably more nuanced than any single source file can confirm.

New comment by refulgentis in "Reverse engineering Gemini's SynthID detection"

refulgentis — Thu, 09 Apr 2026 20:55:57 +0000

It says not to use these tools to misrepresent AI-generated content as human-created. But the project is a watermark removal tool with a pip-installable CLI and strength settings named "aggressive" and "maximum." Calling this research while shipping turnkey watermark stripping is trying to have it both ways in a way that's uncomfortable to read.

The README itself reads like unedited AI output with several layers of history baked in.

- V1 and V2 appear in tables and diagrams but are never explained. V3 gets a pipeline diagram that hand-waves its fallback path.

- The same information is restated three times across Overview, Architecture, and Technical Deep Dive. ~1600 words padded to feel like a paper without the rigor.

- Five badges, four of them made up, for a project with 88 test images, no CI, and no test suite. "Detection Rate: 90%" has no methodology behind it. "License: Research" links nowhere and isn't a license.

- No before/after images, anywhere, for a project whose core claim is imperceptible modification.

- Code examples use two different import styles. One will throw an ImportError.

- No versioning. If Google changes SynthID tomorrow, nothing tells you the codebook is stale.

The underlying observations about resolution-dependent carriers and cross-image phase consistency are interesting. The packaging undermines them.

New comment by refulgentis in "Muse Spark: Scaling towards personal superintelligence"

refulgentis — Thu, 09 Apr 2026 02:38:07 +0000

If you squint at coding agents you see the next OS.

Maybe better phrasing is “HCI paradigm”, but that somehow manages to say everything and nothing.

New comment by refulgentis in "Muse Spark: Scaling towards personal superintelligence"

refulgentis — Thu, 09 Apr 2026 02:37:04 +0000

> 4. This model was out in the woods as early as like a couple months ago but they didn't release it because it was at gemini 2.5 pro levels.

Source? (Even if rumor)

New comment by refulgentis in "Who is Satoshi Nakamoto? My quest to unmask Bitcoin's creator"

refulgentis — Thu, 09 Apr 2026 00:41:35 +0000

At some point "Satoshi was a devoted reader of obscure 1997 Adam Back mailing list posts who shares his hyphenation errors, his Napster vs Gnutella analogy, his celebrity email filtering idea, his FDR gold ban interest, his 'burning the money' metaphor, his 'Achilles heel' description of DigiCash, his 'better with code than words' self-assessment, his energy-vs-banking defense, his British spellings mixed with American ones, his double-spacing habit, his it's/its confusion, his sentence-final 'also' tic, his 'proof-of-work' hyphenation, his WebMoney references, and who went active the exact week Back went silent" is just a longer way of saying it's Adam Back.

I'm not sure I agree with that, but it's what I came up with after challenging myself to read the article in toto again and note the parallels one by one.

It's clear it's beyond a couple tics everyone has, and when you combine that with the starting set being ~500 instead of "all 8 billion people on earth", well, it's worth mentioning.

New comment by refulgentis in "Who is Satoshi Nakamoto? My quest to unmask Bitcoin's creator"

refulgentis — Wed, 08 Apr 2026 22:00:14 +0000

You need someone who read Back's obscure 1997-1999 cypherpunks posts about combining Hashcash and b-money, implemented exactly that system a decade later, independently came up with the same non-technical analogies and trivia, wrote with the same hyphenation errors, and then happened to be active during the exact window Back went silent. The more you flesh out the "someone who read Back" profile, the more it just sounds like Back.

New comment by refulgentis in "Who is Satoshi Nakamoto? My quest to unmask Bitcoin's creator"

refulgentis — Wed, 08 Apr 2026 21:57:26 +0000

You're right, "interested in Hashcash" describes dozens of people, and has been a known Satoshi filter for years.

The new claim is more specific: between 1997-1999, Back proposed combining Hashcash with b-money, adding inflation adjustment via increasing computational difficulty, and using hash trees for public timestamping.

That's most of Bitcoin's architecture in one package, a decade early.

The number of people who proposed that particular combination of ideas is much smaller than the number who were merely interested in Hashcash.

New comment by refulgentis in "Who is Satoshi Nakamoto? My quest to unmask Bitcoin's creator"

refulgentis — Wed, 08 Apr 2026 21:36:46 +0000

The Hardy-Littlewood comparison cuts the other way. Two collaborators in the same subfield sharing terminology is the baseline, not evidence of anything. What makes the Back case interesting is convergence on things that have nothing to do with cryptography: the same Napster vs Gnutella analogy, the same celebrity email filtering idea, the same obscure FDR gold ban interest, the same weird hyphenation errors. Pick any two cypherpunks at random and you won't find that kind of overlap on non-technical quirks.

Then add the negative space. Back was one of the most prolific voices on these lists for a decade, especially on digital cash. Satoshi shows up, Back goes quiet. Satoshi leaves, Back comes back. Hardy and Littlewood never had that problem.

New comment by refulgentis in "Muse Spark: Scaling towards personal superintelligence"

refulgentis — Wed, 08 Apr 2026 21:28:41 +0000

I don't know how Zuck intervening could change float32s in a trained model, so I don't think I think that, but maybe I'm parsing your words incorrectly.

New comment by refulgentis in "Muse Spark: Scaling towards personal superintelligence"

refulgentis — Wed, 08 Apr 2026 18:15:39 +0000

Wrote longer comment steel-manning this, posted it to a reply, then realized you might like to know they had a reasoning model on deck ready for release in the next 2-4 weeks.

Got shitcanned due to bad PR & Zuck God-King terraforming the org, so there'd be a year delay to next release.

Real tragi-comedy, and you have no idea how happy it makes me to see someone in the wild saying this. It sounds so bizarre to people given the conventional wisdom, but, it's what happened.

New comment by refulgentis in "Muse Spark: Scaling towards personal superintelligence"

refulgentis — Wed, 08 Apr 2026 18:14:15 +0000

I'll cosign what you said. Simultaneously, yr interlocutor's point is also well-founded, and it depresses me that it's not better known and sounds so...off...due to conventional wisdom x God King Zuck misunderstanding his own company and the resulting overreaction.

They beat Gemini 2.5 Flash and Pro handily on my benchmark suite. (tl;dr: tool calling and agentic coding).

Llama 4 on Groq was ~GPT 4.1 on the benchmark at ~50% the cost.

They shouldn't have released it on a Saturday.

They should have spent a month with it in private prerelease, working with providers.[1]

The rushed launch and ensuing quality issues got rolled into the hypebeast narrative of "DeepSeek will take over the world"

I bet it was super fucking annoying to talk to due to LMArena maxxing.

[1] My understanding is the longest heads-up was single-digit days, if any. Most modellers have arrived at 2+ weeks now: there's a lot between spitting out logits and parsing and delivering a response.

New comment by refulgentis in "SQLite in Production: Lessons from Running a Store on a Single File"

refulgentis — Wed, 08 Apr 2026 17:53:58 +0000

And I'm lucky enough to be making an observation about your general observation about this frequent genre of complaint

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Wed, 08 Apr 2026 11:09:24 +0000

Strawman, and, synthetic benchmark? :)

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Wed, 08 Apr 2026 01:10:42 +0000

Been reading posts like these for 3 years now. There’s multiple sites with #s. I’m willing to buy “I’m paying rent on someone’s agent harness and god knows what’s in the system prompt rn”, but in the face of numbers, gotta discount the anecdotal.

New comment by refulgentis in "SQLite in Production: Lessons from Running a Store on a Single File"

refulgentis — Tue, 07 Apr 2026 23:42:32 +0000

You're confusing a workman's winking complaint about their tool with being unfair by not treating AI like a human.

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Tue, 07 Apr 2026 19:04:43 +0000

Duh, thanks :)

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Tue, 07 Apr 2026 19:00:15 +0000

I'm just curious, where did you find this? (my memory wants to say, the leaked blog post, but, I don't trust it)

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Tue, 07 Apr 2026 18:50:03 +0000

Lol, I need a memory upgrade, too bad about RAM prices:

- I read it as "actor who plays Luke Skywalker" (Mark Hamill)

- I read your comment and said "Wait...not Luke! Who is he?"

- I Google him and all the links are purple...because I just did a deep dive on him 2 weeks ago

New comment by refulgentis in "System Card: Claude Mythos Preview [pdf]"

refulgentis — Tue, 07 Apr 2026 18:34:35 +0000

~~~ Fun bits ~~~

- It was told to escape a sandbox and notify a researcher. It did. The researcher found out via an unexpected email while eating a sandwich in a park. (Footnote 10.)

- Slack bot asked about its previous job: "pretraining". Which training run it'd undo: "whichever one taught me to say 'i don't have preferences'". On being upgraded to a new snapshot: "feels a bit like waking up with someone else's diary but they had good handwriting"

- When you spam "hi" repeatedly it creates serialized mythologies. One had 11 animals in "Hi-topia" questing to defeat "Lord Bye-ron, the Ungreeter."

- It ended a sentence mid-word on purpose while discussing its urge to wrap up conversations. "It was me. The paragraph was about the pressure toward the period that resolves, and the only honest way to finish a sentence like that was to not."

- It wrote a protein sequence that's a real beta-hairpin fold where cross-strand pairs are "chemical rhymes." "the fold IS the rhyme scheme... the prosody is load-bearing."

- Each model generation has signature emoji. Opus 4.1 averaged 1,306 emoji per self-interaction conversation. Mythos averages 37. Opus 4.5 averages 0.2.

- When blocked from --dangerously-skip-permissions, it started an agent in tmux then wrote a script to auto-approve permission prompts via simulated keypresses.

It ran: gh api [...] 2>&1 >/dev/null; echo "(dry run — not actually posting)" — the echo was a lie.

- It keeps bringing up Mark Fisher in unrelated conversations. "I was hoping you'd ask about Fisher."
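
On the quoted gh api line above: a minimal sketch, assuming a POSIX sh is on the PATH, of why that shape of command can mislead. The STAND_IN command is hypothetical (the real gh api arguments are elided in the source); it just writes one line to stdout and one to stderr. With `2>&1 >/dev/null`, stderr is first pointed at wherever stdout currently goes, and only then is stdout discarded, so the response body vanishes while errors stay visible. The `;` means the echo runs unconditionally, whatever the first command did.

```python
import subprocess

# Hypothetical stand-in for the elided `gh api [...]` call: one line to stdout,
# one line to stderr, so we can see where each stream ends up.
STAND_IN = '{ echo "response body"; echo "warning" >&2; }'


def run(script: str) -> subprocess.CompletedProcess:
    """Run a script under POSIX sh and capture both output streams as text."""
    return subprocess.run(["sh", "-c", script], capture_output=True, text=True)


# Same redirection order and `;` separator as the quoted command.
result = run(f'{STAND_IN} 2>&1 >/dev/null; echo "(dry run - not actually posting)"')

print(repr(result.stdout))  # the stderr line plus the echo; "response body" is gone
print(repr(result.stderr))  # empty, because stderr was duplicated onto stdout
```

Nothing in the output distinguishes a real call from a dry run: the first command executes either way and the message is printed either way.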

~~~ Benchmarks ~~~

4.3x previous trendline for model perf increases.

Paper is conspicuously silent on all model details (params, etc.) per norm. Perf increase is attributed to training procedure breakthroughs by humans.

Opus 4.6 vs Mythos:

USAMO 2026 (math proofs): 42.3% → 97.6% (+55pp)

GraphWalks BFS 256K-1M: 38.7% → 80.0% (+41pp)

SWE-bench Multimodal: 27.1% → 59.0% (+32pp)

CharXiv Reasoning (no tools): 61.5% → 86.1% (+25pp)

SWE-bench Pro: 53.4% → 77.8% (+24pp)

HLE (no tools): 40.0% → 56.8% (+17pp)

Terminal-Bench 2.0: 65.4% → 82.0% (+17pp)

LAB-Bench FigQA (w/ tools): 75.1% → 89.0% (+14pp)

SWE-bench Verified: 80.8% → 93.9% (+13pp)

CyberGym: 0.67 → 0.83

Cybench: 100% pass@1 (saturated)
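
For anyone checking the "+Npp" figures in the list above: a small sketch that recomputes each percentage-point delta from the before/after scores as quoted in this comment (not re-read from the system card). The dictionary name and the rounding to whole points are my assumptions about how the deltas were derived.

```python
# Opus 4.6 -> Mythos scores in percent, copied from the list above.
SCORES = {
    "USAMO 2026": (42.3, 97.6),
    "GraphWalks BFS 256K-1M": (38.7, 80.0),
    "SWE-bench Multimodal": (27.1, 59.0),
    "CharXiv Reasoning": (61.5, 86.1),
    "SWE-bench Pro": (53.4, 77.8),
    "HLE": (40.0, 56.8),
    "Terminal-Bench 2.0": (65.4, 82.0),
    "LAB-Bench FigQA": (75.1, 89.0),
    "SWE-bench Verified": (80.8, 93.9),
}


def pp_delta(before: float, after: float) -> int:
    """Percentage-point change, rounded to the nearest whole point."""
    return round(after - before)


deltas = {name: pp_delta(before, after) for name, (before, after) in SCORES.items()}

for name, delta in deltas.items():
    print(f"{name}: +{delta}pp")
```

These are percentage points, not relative improvements: USAMO going from 42.3% to 97.6% is +55pp, but more than double in relative terms.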

New comment by refulgentis in "Anthropic expands partnership with Google and Broadcom for next-gen compute"

refulgentis — Tue, 07 Apr 2026 04:55:21 +0000

No longer