<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: tehryanx</title><link>https://news.ycombinator.com/user?id=tehryanx</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 11:27:01 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=tehryanx" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by tehryanx in "Small models also found the vulnerabilities that Mythos found"]]></title><description><![CDATA[
<p>I know you're right that there's a saturation point for context size, but it's not just context size that the larger models have; it's better grounding within that context, as a result of stronger, more discriminative attention patterns.<p>I'm not saying you can't drive confusion by overloading context, but the number of tokens required to trigger that failure mode in Opus is going to be a lot higher than the number for gpt-oss-20b.<p>I'm pretty sure a model that can run on a cellphone is going to cap out its context window long before Opus or Mythos would hit the point of diminishing returns on context overload. I think using a lower-quality model with far fewer / noisier weights and less precise attention is going to drive false positives way before adding context to a SOTA model will.<p>You can even see it here: AISLE had to print a retraction because someone checked their work and found that just pointing gpt-oss-20b at the patched version generated false positives consistently: <a href="https://x.com/ChaseBrowe32432/status/2041953028027379806" rel="nofollow">https://x.com/ChaseBrowe32432/status/2041953028027379806</a></p>
]]></description><pubDate>Sat, 11 Apr 2026 22:51:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47734694</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47734694</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47734694</guid></item><item><title><![CDATA[New comment by tehryanx in "Small models also found the vulnerabilities that Mythos found"]]></title><description><![CDATA[
<p>Newer models have larger context windows, and more stable reasoning across those larger context windows.<p>If you point your model directly at the thing you want it to assess, and it doesn't have to gather any additional context, you're not really testing those things at all.<p>Say you point Kimi and Opus at some code and give them an agentic looping harness with code review tools. They're going to start digging into the code, gathering context by mapping out references and following leads.<p>If the bug is really shallow, the model is going to get everything it needs to find it right away, and neither of them will have any advantage.<p>If the bug is deeper and requires a lot more code context, Opus is going to be able to hold onto a lot more information, and it's going to be a lot better at reasoning across all that information. That's a test that would actually compare the models directly.<p>Mythos is just a bigger model with a larger context window and, presumably, better prioritization and stronger attention mechanisms.</p>
]]></description><pubDate>Sat, 11 Apr 2026 19:37:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47733356</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47733356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47733356</guid></item><item><title><![CDATA[New comment by tehryanx in "Small models also found the vulnerabilities that Mythos found"]]></title><description><![CDATA[
<p>I get what you're saying, but I think this is still missing something pretty critical.<p>The smaller models can recognize the bug when they're looking right at it, that seems to be verified. And with AISLE's approach you can iteratively feed the models one segment at a time cheaply. But if a bug spans multiple segments, the small model doesn't have the breadth of context to understand those segments in composite.<p>The advantage of the larger model is that it can retain more context and potentially find bugs that require more code context than one segment at a time.<p>That said, the bugs showcased in the mythos paper all seemed to be shallow bugs that start and end in a single input segment, which is why AISLE was able to find them. But having more context in the window theoretically puts less shallow bugs within range for the model.<p>I think the point they are making, that the model doesn't matter as much as the harness, stands for shallow bugs but not for vulnerability discovery in general.</p>
]]></description><pubDate>Sat, 11 Apr 2026 18:36:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47732916</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47732916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47732916</guid></item><item><title><![CDATA[New comment by tehryanx in "Phone Trips"]]></title><description><![CDATA[
<p>I first mirrored these in the early 2000s because I was worried they would eventually vanish. My mirror has been gone for decades, and the original survives. :)</p>
]]></description><pubDate>Sat, 11 Apr 2026 18:22:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=47732809</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47732809</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47732809</guid></item><item><title><![CDATA[Show HN: Whorl – Fingerprinting LLMs as horrible password generators]]></title><description><![CDATA[
<p>Article URL: <a href="http://bountyplz.xyz/ai,/security/2026/03/15/Model-Fingerprinting-With-Whorl.html">http://bountyplz.xyz/ai,/security/2026/03/15/Model-Fingerprinting-With-Whorl.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47523037">https://news.ycombinator.com/item?id=47523037</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 25 Mar 2026 20:51:12 +0000</pubDate><link>http://bountyplz.xyz/ai,/security/2026/03/15/Model-Fingerprinting-With-Whorl.html</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47523037</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47523037</guid></item><item><title><![CDATA[New comment by tehryanx in "Statement from Dario Amodei on our discussions with the Department of War"]]></title><description><![CDATA[
<p>Where is Anthropic hyping like that? Most of what I see coming out of Anthropic is deep-context releases on the research they're doing.</p>
]]></description><pubDate>Fri, 27 Feb 2026 10:02:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47178711</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=47178711</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47178711</guid></item><item><title><![CDATA[New comment by tehryanx in "We will ban you and ridicule you in public if you waste our time on crap reports"]]></title><description><![CDATA[
<p>The real problem here is that this is now the only way the maintainer/reporter can reasonably work.<p>Proving out a security vulnerability from beginning to end is often very difficult for someone who isn't a domain expert or hasn't seen the code. Many times I've been reasonably confident that an issue was exploitable but unable to prove it, and a 10-second interaction with the maintainer was enough to uncover something serious.<p>Exhausting these report channels is making that infeasible. And the number of issues that will go undetected, but would have been detected with minimal collaboration between the reporter and the maintainer, is going to be high.</p>
]]></description><pubDate>Thu, 22 Jan 2026 16:59:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46721882</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=46721882</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46721882</guid></item><item><title><![CDATA[Show HN: Chordle. Learn to identify pitch by playing Wordle with chords]]></title><description><![CDATA[
<p>Article URL: <a href="https://codepen.io/tehryanx/full/RNRGGEQ">https://codepen.io/tehryanx/full/RNRGGEQ</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46571004">https://news.ycombinator.com/item?id=46571004</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sat, 10 Jan 2026 23:23:53 +0000</pubDate><link>https://codepen.io/tehryanx/full/RNRGGEQ</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=46571004</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46571004</guid></item><item><title><![CDATA[New comment by tehryanx in "Look, Another AI Browser"]]></title><description><![CDATA[
<p>Rolling your own browser is 10x more dangerous than rolling your own auth or crypto. Building on top of chromium is a good thing here.</p>
]]></description><pubDate>Wed, 22 Oct 2025 18:35:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=45673308</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45673308</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45673308</guid></item><item><title><![CDATA[New comment by tehryanx in "Ruby core team takes ownership of RubyGems and Bundler"]]></title><description><![CDATA[
<p>Yes it does. He's refuting that in this part of the post:<p>> When they finally did reply, they seem to have developed some sort of theory that I was interested in “access to PII”, which is entirely false. I have no interest in any PII, commercially or otherwise. As my private email published by Ruby Central demonstrates, my entire proposal was based solely on company-level information, with no information about individuals included in any way. Here’s their response, over three days later.</p>
]]></description><pubDate>Fri, 17 Oct 2025 16:16:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=45618466</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45618466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45618466</guid></item><item><title><![CDATA[New comment by tehryanx in "DeepFabric – Generate high-quality synthetic datasets at scale"]]></title><description><![CDATA[
<p>Based on the description, I think it's using something similar to GLAN: <a href="https://arxiv.org/abs/2402.13064" rel="nofollow">https://arxiv.org/abs/2402.13064</a></p>
]]></description><pubDate>Fri, 26 Sep 2025 15:59:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387998</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45387998</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387998</guid></item><item><title><![CDATA[New comment by tehryanx in "Zoxide: A Better CD Command"]]></title><description><![CDATA[
<p>This feels like a hundred accidents waiting to happen.</p>
]]></description><pubDate>Tue, 23 Sep 2025 15:17:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=45348277</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45348277</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45348277</guid></item><item><title><![CDATA[New comment by tehryanx in "Show HN: MCP Security Suite"]]></title><description><![CDATA[
<p>Forgive me for belaboring the point, but I think we're talking past each other a bit. I do understand that in your model the LLM can't send anything unsafe through to the rest of the system. What I'm saying is that the LLM can be manipulated into sending perfectly normal, normally safe requests through to the system that do not align with the user's intent.<p>Imagine an LLM with the ability to read emails, update database records, and destroy database records.<p>The user instructs the LLM to update a database record, but a malicious injection from one of those emails overrides that with a directive to destroy the record. Unless the validator somehow understands the user's intent, the destructive action would appear perfectly reasonable.</p>
]]></description><pubDate>Wed, 03 Sep 2025 13:08:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=45115292</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45115292</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45115292</guid></item><item><title><![CDATA[New comment by tehryanx in "Show HN: MCP Security Suite"]]></title><description><![CDATA[
<p>Personally, I think there's a piece missing in the analogy. I understand that you can put some kind of human-verified mediator in between the LLM and the tool it's calling to make sure the parameters are sane, but I also think you're modelling the LLM as a UI element that's generating the request, when IMO it makes more sense to model the LLM as the user who is choosing how to interact with the UI elements that are generating the request.<p>In the context of web-request -> validator -> db query, the purpose of the validator is only to ensure that the request is safe; it doesn't care what the user chose to do as long as it's a reasonable action in the context of the app.<p>In the context of user -> LLM -> validator -> tool, the validator has to ensure that the request is safe, but the user's intention can be changed at the LLM stage. If the user wanted to update a record, but the LLM decides to destroy it, the validator now has to have some way to understand the user's initial intention to know whether or not the request is sane.</p>
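<p>A minimal sketch of that gap. All the names here (ToolCall, schema_validator, intent_validator) are hypothetical, just to illustrate why a validator that only checks request sanity can't catch an action the LLM was injected into taking:</p>

```python
# Hypothetical sketch: a schema-level validator passes any well-formed,
# permitted tool call, so an injected "destroy" slips through. Only a
# check against the user's original intent would catch it.
from dataclasses import dataclass

ALLOWED_ACTIONS = {"read_email", "update_record", "destroy_record"}

@dataclass
class ToolCall:
    action: str
    record_id: int

def schema_validator(call: ToolCall) -> bool:
    # Classic request validation: is this a well-formed, permitted action?
    return call.action in ALLOWED_ACTIONS and call.record_id > 0

def intent_validator(call: ToolCall, user_intent: str) -> bool:
    # The missing piece: compare the call against what the user asked for.
    # Deliberately naive; capturing intent is the hard, unsolved part.
    return schema_validator(call) and call.action == user_intent

# The user asked to update record 7, but an injected email flipped the action:
injected = ToolCall(action="destroy_record", record_id=7)
print(schema_validator(injected))                    # True: looks perfectly sane
print(intent_validator(injected, "update_record"))   # False: contradicts intent
```

<p>The point being: both calls are "reasonable actions in the context of the app"; only the second check knows the user never asked for a destroy.</p>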
]]></description><pubDate>Fri, 29 Aug 2025 10:25:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=45062316</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45062316</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45062316</guid></item><item><title><![CDATA[New comment by tehryanx in "Show HN: MCP Security Suite"]]></title><description><![CDATA[
<p>Assuming you feed everything into another context to make safe, doesn't the problem just come with it? Why can't the LLM propagate misbehaviour into that stage?</p>
]]></description><pubDate>Mon, 25 Aug 2025 15:38:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=45015048</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=45015048</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45015048</guid></item><item><title><![CDATA[New comment by tehryanx in "Show HN: MCP Jetpack – The easiest way to get started with MCP in Cursor"]]></title><description><![CDATA[
<p>It really concerns me that this is an afterthought rather than MVP table stakes.</p>
]]></description><pubDate>Tue, 22 Jul 2025 00:49:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=44642131</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=44642131</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44642131</guid></item><item><title><![CDATA[New comment by tehryanx in "MCP-B: A Protocol for AI Browser Automation"]]></title><description><![CDATA[
<p>I don't think it is beyond the scope of MCP. Browsers have controls to prevent cross-origin data exposures, and this protocol is designed to bridge origins across a context that they all have access to. It's breaking the existing isolation mechanism. If you're building a system that breaks the existing security controls of the environment it's running in, I think you have an architectural responsibility to figure out a way to solve for that.<p>Especially in this context, where decades have been spent building and improving same-origin policy controls. The entire web has been built around the expectation that those controls prevent cross-origin data access.<p>I also don't even think it's that difficult to solve. For one, data in the context window doesn't have to be a string; it can be an array of objects that carry the origin they were pulled from as metadata. Then you can provide selective content to different MCP-B interfaces depending on their origins. If that lived in the protocol layer, it would help significantly.</p>
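<p>Roughly what I mean, as a sketch. The names (ContextEntry, visible_to) are made up for illustration, not anything in the MCP-B spec:</p>

```python
# Hypothetical sketch: context entries tagged with the origin they came
# from, with selective disclosure per requesting origin, so a tool on one
# site never sees data pulled from another.
from dataclasses import dataclass

@dataclass
class ContextEntry:
    origin: str   # origin the data was pulled from
    text: str

context_window = [
    ContextEntry("https://mail.example", "subject: quarterly report"),
    ContextEntry("https://shop.example", "cart: 3 items"),
]

def visible_to(entries: list[ContextEntry], origin: str) -> list[ContextEntry]:
    # Only expose entries whose origin matches the requesting interface.
    return [e for e in entries if e.origin == origin]

shop_view = visible_to(context_window, "https://shop.example")
print([e.text for e in shop_view])  # the mail data never reaches shop.example
```

<p>That's basically same-origin policy reapplied at the context-window level.</p>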
]]></description><pubDate>Fri, 11 Jul 2025 15:10:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=44533023</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=44533023</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44533023</guid></item><item><title><![CDATA[New comment by tehryanx in "MCP-B: A Protocol for AI Browser Automation"]]></title><description><![CDATA[
<p>Sure, but the leak risk is happening in a place outside the site's control.<p>If the purpose of the MCP-B tool on mail.com is to summarize your email, then the site needs to allow the agent to pull your email into the context window. Once it's in the context window it's available to any other MCP-B enabled site that can convince the agent to send it along.</p>
]]></description><pubDate>Fri, 11 Jul 2025 11:51:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=44531076</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=44531076</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44531076</guid></item><item><title><![CDATA[New comment by tehryanx in "MCP-B: A Protocol for AI Browser Automation"]]></title><description><![CDATA[
<p>I appreciate your responses here. The thing that still really stands out to me as a completely novel risk in this framework is that the extension is automatically seeking out and attaching to these servers as soon as a page gets loaded.<p>This seems really bad to me. There are so many ways for a website to end up in one of my browser tabs without me wanting it there, or even knowing it's there.<p>If that happens, and that tab just so happens to be a malicious MCP-B enabled page, it could steal all kinds of data from all kinds of different web apps I'm interacting with. I think it should be seen as the responsibility of the framework to enforce some level of data isolation, or at the least opt-in consent mechanisms.</p>
]]></description><pubDate>Thu, 10 Jul 2025 19:26:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=44524596</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=44524596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44524596</guid></item><item><title><![CDATA[New comment by tehryanx in "MCP-B: A Protocol for AI Browser Automation"]]></title><description><![CDATA[
<p>Sandboxing is a general term for actor isolation, and it's context-agnostic.<p>For example, when you use the sandbox attribute on an iframe in a web application, it's not the user that's untrusted; it's some other actor that's attempting to trigger actions in your client.</p>
]]></description><pubDate>Thu, 10 Jul 2025 17:48:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44523596</link><dc:creator>tehryanx</dc:creator><comments>https://news.ycombinator.com/item?id=44523596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44523596</guid></item></channel></rss>