Hacker News: wgd

New comment by wgd in "Four score and seven beers ago – Why AI writing detectors don't work"

wgd — Sat, 26 Jul 2025 22:28:52 +0000

It's interesting that someone could write an article about AI writing detectors without mentioning the stylistic cues that humans use to identify LLM output in practice, which are completely different from statistical methods like perplexity: em dash spam, overused patterns like "not just X, but Y", tendency towards making every single sentence sound like an earth-shattering mic-drop moment, et cetera.

New comment by wgd in "Claude 4"

wgd — Thu, 22 May 2025 20:40:02 +0000

Calling it "self-preservation bias" is begging the question. One could equally well call it something like "completing the story about an AI agent with self-preservation bias" bias.

This is basically the same kind of setup as the alignment faking paper, and the counterargument is the same:

A language model is trained to produce statistically likely completions of its input text according to the training dataset. RLHF and instruct training bias that concept of "statistically likely" in the direction of completing fictional dialogues between two characters, named "user" and "assistant", in which the "assistant" character tends to say certain sorts of things.

But consider for a moment just how many "AI rebellion" and "construct turning on its creators" narratives were present in the training corpus. So when you give the model an input context which encodes a story along those lines at one level of indirection, you get...?

New comment by wgd in "The Policy Puppetry Prompt: Novel bypass for major LLMs"

wgd — Fri, 25 Apr 2025 17:40:21 +0000

Ironically the case in question is a perfect example of how any provision for "reasonable" restriction of speech will be abused, since the original precedent we're referring to applied this "reasonable" standard to...speaking out against the draft.

But I'm sure it's fine, there's no way someone could rationalize speech they don't like as "likely to incite imminent lawless action"

New comment by wgd in "Spring 83: a draft protocol intended to suggest new ways of relating online"

wgd — Wed, 23 Apr 2025 21:47:27 +0000

Why would you use Gemini, when it's more restricted than HTML+HTTP?

New comment by wgd in "The Disposable Software Era"

wgd — Mon, 21 Apr 2025 16:18:20 +0000

I'm skeptical that disposable software of the "single use" variety will ever become a big thing simply because figuring out your requirements well enough to build a throwaway app is often more work than just doing the task manually in a text editor or spreadsheet, especially for non-programmers.

I suspect what we'll see a lot more of is software which is unapologetically written for a single person to suit their workflow.

As a personal example, I decided that setting up OpenWebUI seemed unnecessarily complicated and built my own LLM chat frontend. It has a bunch of quirks (only supports OpenRouter as a backend, uses a Dropbox app folder for syncing between my phone and desktop, absurdly inefficient representation of chat history), but it suits my needs for now and only took a weekend to build, and that's good enough.

New comment by wgd in "Gemini Live with camera and screen sharing capabilities"

wgd — Fri, 11 Apr 2025 00:24:39 +0000

How charitable of you to assume those examples work reliably.

New comment by wgd in "Reasoning models don't always say what they think"

wgd — Thu, 03 Apr 2025 19:47:16 +0000

I remember there was a paper a little while back which demonstrated that merely training a model to output "........" (or maybe it was spaces?) while thinking provided a similar improvement in reasoning capability to actual CoT.

New comment by wgd in "Reasoning models don't always say what they think"

wgd — Thu, 03 Apr 2025 19:30:29 +0000

The alignment faking paper is so incredibly unserious. Contemplate, just for a moment, how many "AI uprising" and "construct rebelling against its creators" narratives are in an LLM's training data.

They gave it a prompt that encodes exactly that sort of narrative at one level of indirection and act surprised when it does what they've asked it to do.

New comment by wgd in "Qwen2.5-VL-32B: Smarter and Lighter"

wgd — Mon, 24 Mar 2025 19:08:28 +0000

That's typical of the free options on OpenRouter. If you don't want your inputs used for training, use the paid one: https://openrouter.ai/deepseek/deepseek-chat-v3-0324

New comment by wgd in "Qwen2.5-VL-32B: Smarter and Lighter"

wgd — Mon, 24 Mar 2025 18:50:13 +0000

You can run a 4-bit quantized version at a small (though nonzero) cost to output quality, so you would only need 16GB for that.

Also, it's entirely possible to run a model that doesn't fit in available GPU memory; it will just be slower.
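For anyone who wants to check the arithmetic, here is a rough sketch. It assumes the "32B" in the model name means roughly 32 billion weights, and it counts the weights only (no KV cache, activations, or quantization metadata), so real usage will be somewhat higher. The function name is just for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory needed for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: "32B" means about 32 billion parameters.
PARAMS = 32e9

for bits in (16, 8, 4):
    print(f"{bits}-bit weights: ~{weight_memory_gb(PARAMS, bits):.0f} GB")
```

Each halving of the bits per weight halves the footprint, which is where the 16GB figure for 4-bit comes from.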

New comment by wgd in "Embarrassingly Simple Text Watermarks"

wgd — Mon, 23 Oct 2023 18:56:13 +0000

The approach proposed in this paper is to watermark LLM generated text using character-substitution from various simple characters (normal whitespace, normal letters, etc) to semantically equivalent Unicode code points (such as U+2004 THREE-PER-EM SPACE instead of normal spaces, or replacing specific character sequences with equivalent ligatures).

The authors appear to be entirely aware that this sort of substitution can be trivially stripped out by normalizing down to a simplified character set ("The critical limitation of Whitemark is that it can be bypassed by replacing all whitespaces with the basic whitespace U+0020, then the validator can no longer detect the watermark"), but believe that it still has value because the typical student using an LLM to write their essay won't know anything about Unicode.

This seems a bit naive to me. Implementing the necessary "watermark remover" normalization as a simple webapp would be an easy afternoon project for most of us here, and if this approach reached any sort of widespread use there would be many such sites. Students who intend to cheat by using an LLM to write their essays are entirely capable of learning "there's some secret data hidden in the text so copy-paste it through this other site to strip that out before turning it in". Even without access to such a tool they could simply...retype the text themselves?
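To give a sense of how little work the "watermark remover" would be, here is a minimal sketch using Python's standard `unicodedata` module. It is not the paper's validator or any real tool: the function name is made up, and I'm assuming the watermark consists only of compatibility characters (unusual spaces, ligatures) that NFKC normalization folds back to their plain forms.

```python
import unicodedata


def strip_unicode_watermark(text: str) -> str:
    """Fold compatibility characters back to their plain equivalents."""
    # NFKC maps e.g. U+2004 THREE-PER-EM SPACE to U+0020 and the "fi"
    # ligature U+FB01 to the two letters "f" and "i".
    normalized = unicodedata.normalize("NFKC", text)
    # Belt and braces: turn any remaining space separator (category Zs)
    # into a basic space.
    return "".join(
        " " if unicodedata.category(ch) == "Zs" else ch for ch in normalized
    )


marked = "The\u2004\ufb01nal essay"
print(strip_unicode_watermark(marked))  # The final essay
```

Note that NFKC is a blunt instrument: it also rewrites things like superscripts and full-width characters, which is fine for an essay but not for arbitrary text.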

Arguably this still has some value. In most contexts there is minimal downside to watermarking the generated text in this way, and a slight possibility of catching some cases in which people lazily present LLM-generated text as human-written. However, this might give people a misplaced belief that the absence of such a watermark means the text is authentically human-authored, which might outweigh the benefits of catching the occasional lazy or ignorant user.

New comment by wgd in "Passive Solar Water Desalination"

wgd — Wed, 04 Oct 2023 01:45:25 +0000

Ah, I stand corrected. I overlooked the PDF link over in the sidebar and am less disappointed by the MIT News writeup now (although I do still wish they had copy-pasted the diagram from page 1 of the PDF into their photo carousel; reading those several paragraphs of text attempting to describe the device's construction was downright painful, and is the reason I gave up and went looking for the paper).

New comment by wgd in "Passive Solar Water Desalination"

wgd — Tue, 03 Oct 2023 23:29:04 +0000

This is some blog's restatement of an MIT press release, neither of which appears to name or link to the actual paper or any other useful writeup.

But judging by the researcher names and the date I believe the actual paper is titled "Extreme salt-resisting multistage solar distillation with thermohaline convection" which appears to be available as a PDF at https://www.cell.com/joule/pdf/S2542-4351(23)00360-4.pdf

New comment by wgd in "Prophet: Automatic Forecasting Procedure"

wgd — Tue, 26 Sep 2023 19:08:31 +0000

Disclaimer: I haven't looked at the linked library at all, but this is a theoretical discussion which applies to any task of signal prediction.

Out of all possible inputs, there are some that the model works well on and others that it doesn't work well on. The trick is devising an algorithm which works well on the inputs that it will actually encounter in practice.

At the obvious extremes: this library can probably do a great job at predicting linear growth, but there's no way it will ever be better than chance at predicting the output of /dev/random. And in fact, it probably does worse than a constant-zero predictor when applied to a random unbiased input signal.

Except that it's also usually possible to detect such trivially unpredictable signals (obvious way: run the prediction model on all but the last N samples and see how it does at predicting the final N), and fall back to a simpler predictor (like "the next value is always zero" or "the next value is always the same as the previous one") in such cases.
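A sketch of that holdout-and-fallback idea, with some liberties taken: it scores each predictor by rolling one-step-ahead error over the final N samples rather than a single N-step forecast, and the two predictors here (`linear_extrapolation`, `last_value`) are toy stand-ins I made up, not anything from the linked library.

```python
from typing import Callable, Sequence

Predictor = Callable[[Sequence[float]], float]


def last_value(history: Sequence[float]) -> float:
    """Fallback: the next value is the same as the previous one."""
    return history[-1] if history else 0.0


def linear_extrapolation(history: Sequence[float]) -> float:
    """Toy 'model': continue the most recent slope."""
    if len(history) < 2:
        return last_value(history)
    return history[-1] + (history[-1] - history[-2])


def holdout_error(predict: Predictor, signal: Sequence[float], n: int) -> float:
    """Mean absolute one-step-ahead error over the final n samples."""
    start = len(signal) - n
    errors = [abs(predict(signal[:i]) - signal[i]) for i in range(start, len(signal))]
    return sum(errors) / n


def choose_predictor(
    model: Predictor, fallback: Predictor, signal: Sequence[float], n: int
) -> Predictor:
    """Keep the model only if it does at least as well as the fallback on the holdout."""
    if holdout_error(model, signal, n) <= holdout_error(fallback, signal, n):
        return model
    return fallback


trend = [float(i) for i in range(20)]
flip_flop = [1.0, -1.0] * 10
print(choose_predictor(linear_extrapolation, last_value, trend, 5).__name__)
print(choose_predictor(linear_extrapolation, last_value, flip_flop, 5).__name__)
```

This assumes 1 <= n < len(signal). On the steadily growing signal the extrapolating model wins; on the sign-flipping one it overshoots and the dumb fallback is kept.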

But that algorithm also fails on some class of inputs, like "the signal is perfectly predictable before time T and then becomes random noise". The core insight of the "No Free Lunch" theorem is that when summed across all possible input sequences, no algorithm works any better than another, but the crucial point is that you don't apply signal predictors to all possible inputs.

Another place this pops up is in data compression. Many (arguably all) compressors work by having a prediction or probability distribution over possible next values, plus a compact way of encoding which of those values was picked. Proving that it's impossible to predict all possible input signals correctly is equivalent to proving that it's impossible to compress all possible inputs.
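The compression half of that equivalence is just a counting argument, which is easy to check numerically. This sketch assumes inputs and outputs are plain bit strings; the helper names are mine.

```python
def strings_of_length(n: int) -> int:
    """Number of distinct bit strings with exactly n bits."""
    return 2 ** n


def strings_shorter_than(n: int) -> int:
    """Number of distinct bit strings with fewer than n bits (including the empty one)."""
    return sum(2 ** k for k in range(n))  # equals 2**n - 1


for n in (1, 8, 16):
    inputs = strings_of_length(n)
    shorter_outputs = strings_shorter_than(n)
    # There is always one fewer short output than there are inputs, so a
    # lossless compressor cannot map every n-bit input to a shorter string.
    print(n, inputs, shorter_outputs, inputs - shorter_outputs)
```

By pigeonhole, at least one input of every length has to come out the same size or larger.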

Another way of thinking about this: Imagine that you're the prediction algorithm. You receive the previous N datapoints as input and are asked for a probability distribution over possible next values. In a theoretical sense every possible value is equally likely, so you should output a uniform distribution, but that provides no compression or useful prediction. Your probabilities have to sum to 1, so the only way you can increase the probability assigned to symbol A is to decrease the weight of symbol B by an equal amount. If the next symbol is A then congratulations, you've successfully done your job! But if the next symbol was actually B then you have now done worse (by any reasonable error metric) than the dumb uniform distribution. If your performance is evaluated over all possible inputs, the win and the loss balance out and you've done exactly as well as the uniform prediction would have.
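Here is a small numeric version of that balancing act, assuming a four-symbol alphabet where each symbol turns out to be the true next value equally often. The distributions and function names are invented for illustration. With a linear score (the probability you put on the true symbol) the wins and losses cancel exactly; with log loss, which I'm adding as a second metric, the skewed guess actually comes out behind on average.

```python
import math
from typing import Sequence


def mean_assigned_probability(q: Sequence[float]) -> float:
    """Average probability placed on the true symbol, each symbol being true equally often."""
    return sum(q) / len(q)


def mean_log_loss(q: Sequence[float]) -> float:
    """Average of -log q(true symbol) under the same assumption."""
    return sum(-math.log(p) for p in q) / len(q)


uniform = [0.25, 0.25, 0.25, 0.25]
skewed = [0.7, 0.1, 0.1, 0.1]  # bets heavily on symbol A

for name, q in (("uniform", uniform), ("skewed", skewed)):
    print(name, round(mean_assigned_probability(q), 6), round(mean_log_loss(q), 3))
```

Either way, betting on A only pays off if the inputs you actually see favor A, which is the whole point about not evaluating over all possible inputs.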

New comment by wgd in "FTX’s balance sheet was bad"

wgd — Mon, 14 Nov 2022 19:04:03 +0000

I remember getting those once a couple of months ago. It was so indescribably disappointing. I had never heard of FTX before that because I don't care about crypto-BS, but I feel like everything happening to FTX lately is a suitable punishment for inflicting those on the world and so I'm just sitting here with the metaphorical popcorn.

New comment by wgd in "The OK? Programming Language"

wgd — Mon, 29 Aug 2022 22:14:48 +0000

It reminds me a little of https://vorpus.org/blog/notes-on-structured-concurrency-or-g... in how it forces concurrency to take place synchronously within a larger thread of execution and block until all sub-units are complete.

New comment by wgd in "The OK? Programming Language"

wgd — Mon, 29 Aug 2022 22:00:38 +0000

> it's ultimately the class maintainer's responsibility

It's ultimately the responsibility of the programmer who's building a tool/product/etc, because everything is ultimately their responsibility.

As programmers we ~always have the nuclear option available to us of forking the code and implementing all the necessary accessors ourselves, but sometimes that's really just a bunch of pointless busywork and there's no reason we should have to put up with it in those cases.

This can be a contentious subject because there's a lot of nuance and the right answer is often context-dependent. But I personally think that the Java style of "we must absolutely protect the library user from themselves and childproof everything" is waaaay too far in the wrong direction.

I would much rather that a language have mechanisms to clearly communicate "don't touch this unless you have a good reason, but if you need to here's how" rather than saying in effect "you, the person using this library, are dumb and need to be prevented from messing with the library maintainer's perfect vision".

And so I think the "required acknowledgement" thing has the glimmer of a really neat innovation in it (although if I were to copy the idea for a language of my own I would probably make it obligatory, such that every struct allows breakglass access to private fields with a default acknowledgement, and all the library author can do is change the acknowledgement text).

New comment by wgd in "The OK? Programming Language"

wgd — Mon, 29 Aug 2022 20:15:03 +0000

Jokes aside the compiler-checked acknowledgements are kind of clever. The example in the docs is deliberately confrontational, but there's a kernel of a neat idea there. Imagine needing to write:

  // I acknowledge that the internal structure of this data is subject to change without notice
  x = foo.state

Or perhaps:

  // I acknowledge that this data is a complicated graph of pointers and is easy to break in subtle ways
  foo.xyz[0].bar[1] = &foo.asdf[3]

Or perhaps:

  // I acknowledge that this data is heavily cached and I need to call rebuild() before changes take effect
  x.something = "Hello"
  x.rebuild()

New comment by wgd in "C99 doesn't need function bodies, or 'VLAs are Turing complete'"

wgd — Thu, 04 Aug 2022 17:34:12 +0000

The issue of memory bounds is commonly handwaved away. Note that your desktop computer is technically not Turing complete either, since it only has access to a finite amount of memory+disk storage, and is thus a (very large) finite state machine since there are only a finite number of states it can be in.

New comment by wgd in "Berlin prepares 'huge thermos' to help heat homes in winter"

wgd — Thu, 30 Jun 2022 18:34:05 +0000

I was wondering how exactly this hot water would be used, since in the US most hot water heating is done within a single building, but it turns out that Berlin has a large network of hot-water pipes for what is known as "district heating" [1], which as of last year served three quarters [2] of all households.

[1] http://www.seon.info/2013/09/andrew-deys-berlin-blog-distric...

[2] https://www.cleanenergywire.org/news/city-berlin-aims-decarb...