Hacker News: FrasiertheLion

New comment by FrasiertheLion in "Apple reveals new AI architecture built around Google Gemini models"

FrasiertheLion — Tue, 09 Jun 2026 04:49:01 +0000

That's basically what we built at Tinfoil. We run open source models inside secure enclaves (also using Intel TDX/AMD SEV-SNP + NVIDIA Confidential Computing). All the code running inside the enclave is open source and the client SDKs (also open source) automatically verify that the pinned source code matches the runtime attestation. The protocol used is TLS (terminates in the enclave) + HPKE keys generated inside the enclave on boot. Docs walk you through the verification process: https://docs.tinfoil.sh/verification/verification-in-tinfoil

Of course, we can't support Claude or Grok as they are closed source, but there is no incentive for companies that need your data to train the next generation of models to allow for private inference. One day...

New comment by FrasiertheLion in "Local AI needs to be the norm"

FrasiertheLion — Sun, 10 May 2026 23:15:23 +0000

Another option is verifiably private inference with open source models running inside secure enclaves on the cloud (using NVIDIA confidential computing), and the enclave code is open source and verified via remote attestation upon connection, cryptographically proving that the inference provider cannot see any data. Tinfoil: https://tinfoil.sh/ is a good example of this (disclaimer: i'm the cofounder). You can read more about how this works here: https://docs.tinfoil.sh/verification/verification-in-tinfoil

>that open models are in the ballpark of the best commercial models

This is basically true for certain tasks. As an example, chat interfaces are not well poised to take advantage of higher model intelligence than what the best open source models already provide. But coding harnesses still benefit from greater model intelligence and even more so, the reinforcement learning that tightly interlinks the provider's coding harness (claude-code, codex) with the model's tool calling interfaces is another reason for discrepancy in effectiveness even when controlled for model intelligence. The opencode founder (open source coding harness that supports different model providers) was recently complaining about the challenges making the harness work well with different providers: https://x.com/thdxr/status/2053290393727324313

New comment by FrasiertheLion in "Local AI needs to be the norm"

FrasiertheLion — Sun, 10 May 2026 22:49:09 +0000

Overall I'm bullish on standardized local APIs that ship with the browser or platform. Far more tractable than expecting end users to stand up their own local model instances, though r/LocalLLaMA is a fantastic community to follow if you want to go that route.

A useful framing over “local vs cloud AI” can be split along two axes: does the task touch private data, and does it need frontier intelligence? You can use frontier models for developing the software (doesn’t touch data), but open-source models running locally for ops: maintenance, debugging and monitoring (touches data). If you need to fall back to frontier intelligence at some point for a particularly hard to resolve problem, you can still rely on local models for pre-transforming and filtering input in a way that's privacy-preserving or satisfies some constraint before it’s sent off to the cloud for processing. OpenAI's privacy filter is a good example of a model that can be used to mask PII and secrets and that can run locally: https://openai.com/index/introducing-openai-privacy-filter/, before sending any data externally for processing.

Another framing for local vs frontier closed which the article mentions is whether the task saturates model capability. With certain tasks like PDF processing or voice or summarization, adding more intelligence isn't necessarily useful. Arguably we've approached that point for chat interfaces already with frontier open-source models. But for coding and ops through well structured tool use inside a coding capable harness, we're still a ways away.

Tangentially, a contrarian take here is that AI can actually enable more privacy preserving software if you’re so inclined. You can just build personalized software and it lowers the barrier to entry and the effort required to self host. SaaS complexity often comes from scaling and supporting features for all types of customers, and if you're building software for personal use, you don't need all that additional complexity. Additionally, foundational and infra software that is harder to vibecode with AI is often already open source.

New comment by FrasiertheLion in "DeepSeek V4 – almost on the frontier"

FrasiertheLion — Sun, 03 May 2026 18:44:54 +0000

Very reasonable if you have the resources to run it locally and certainly the best option.

But we created Tinfoil because not everyone has that capability especially when it comes to larger models, and it still doesn’t solve for the situation where you’re building a service for your end user and you want to lock yourself out of accessing their data. In those cases, this is the second best thing you can do.

The technical walkthrough section on this blog that we co-wrote with one of our customers walks through the various attack surfaces: https://www.workshoplabs.ai/blog/private-post-training

We weave in many mitigations against attacks, but it depends on what class of attack it is.

If there are specific attacks you are concerned about, happy to provide an answer if it’s something we can address or not.

New comment by FrasiertheLion in "DeepSeek V4 – almost on the frontier"

FrasiertheLion — Sun, 03 May 2026 18:37:16 +0000

Unfortunately we don’t support crypto payments at this time as we use Stripe.

We try to add models selectively as we have to be mindful about our compute allocation. Is there a specific reason why you need those two models (and our models such as Kimi K2.6, GLM 5.1, Deepseek V4 Pro, Gemma 4 amongst others) don’t suffice for your use case?

Feel free to email me at tanya@tinfoil.sh and happy to continue the conversation there.

New comment by FrasiertheLion in "DeepSeek V4 – almost on the frontier"

FrasiertheLion — Sun, 03 May 2026 18:34:25 +0000

Yes we do, but the load balancer also runs inside the enclave and is attested: https://github.com/tinfoilsh/confidential-model-router

In turn, that attests the model enclaves, for instance, see https://github.com/tinfoilsh/confidential-deepseek-v4-pro. The model repo/release that the model router attests is included in the attestation config, which creates a chain of trust.

Also see https://docs.tinfoil.sh/verification/attestation-architectur...

New comment by FrasiertheLion in "Security Through Obscurity Is Not Bad"

FrasiertheLion — Sun, 03 May 2026 17:29:47 +0000

Yeah everything is open source if you’re good at reversing. Models are increasingly capable of converting binaries into source, and excellent at implementing systems when there’s a finite and constrained end state to validate against, which is exactly the profile reversing falls into.

New comment by FrasiertheLion in "Security through obscurity is not bad"

FrasiertheLion — Sun, 03 May 2026 17:27:28 +0000

This was largely true before. But AI reduces the cost of comprehension and finding vulnerabilities en-masse to zero, so this no longer holds, and I’m increasingly convinced that hiding in noise and complexity is no longer a valid strategy. But AI symmetrically makes it easier to secure your system so it’s not like all hope is lost even if the transition period will be brutal.

I wrote a blog about this: https://tanyaverma.sh/2026/03/01/nowhere-to-hide.html

New comment by FrasiertheLion in "DeepSeek V4–almost on the frontier, a fraction of the price"

FrasiertheLion — Sat, 02 May 2026 11:57:03 +0000

Oh that's quite interesting and hasn't been my experience with regular backend code specifically with respect to tool calling. However that could be because the tool calling format in vllm for Deepseek v4 was broken until a few days ago and that's how I'm running it.

I've been hearing amazing things about Flash, I should give it a try.

New comment by FrasiertheLion in "DeepSeek V4–almost on the frontier, a fraction of the price"

FrasiertheLion — Sat, 02 May 2026 11:38:27 +0000

Because V4 doesn't even beat Kimi K2.6 and GLM 5.1, which have been out longer. It's only talked about as much as it is because it's Deepseek and R1 was the first open source reasoning model. V4 isn't even multimodal (unlike Kimi) and the 1M context doesn't seem to perform particularly well.

New comment by FrasiertheLion in "DeepSeek V4 – almost on the frontier"

FrasiertheLion — Sat, 02 May 2026 11:35:07 +0000

You can use Tinfoil for inference, which lets you use the model in the cloud while getting similar privacy as running locally: https://tinfoil.sh/inference.

Disclaimer I'm the cofounder. This works by running the model inside a secure enclave (using NVIDIA confidential computing) and verifying the open source code running inside the enclave matches the runtime attestation. The docs walk you through the verification process: https://docs.tinfoil.sh/verification/verification-in-tinfoil

New comment by FrasiertheLion in "DeepSeek V4 – almost on the frontier"

FrasiertheLion — Sat, 02 May 2026 11:30:02 +0000

Have you given GLM 5.1 or Kimi K2.6 a shot for coding? They outperform Deepseek v4 pro.

New comment by FrasiertheLion in "Show HN: Filling PDF forms with AI using client-side tool calling"

FrasiertheLion — Sat, 02 May 2026 11:26:10 +0000

This is the canonical use case for Tinfoil: https://tinfoil.sh/inference. It provides verifiably private AI inference with frontier open source models: https://docs.tinfoil.sh/models/overview

Disclaimer I'm the cofounder, only recommending it because it's legitimately the right shape for your problem. The idea is that the model runs inside a secure enclave (using NVIDIA confidential computing), and the enclave code is open source and is verified via remote attestation upon connection: https://docs.tinfoil.sh/verification/verification-in-tinfoil

New comment by FrasiertheLion in "Amateur armed with ChatGPT solves an Erdős problem"

FrasiertheLion — Sun, 26 Apr 2026 03:09:56 +0000

It's 80 minutes, not 80 hours.

The Closing of the Frontier

FrasiertheLion — Sat, 11 Apr 2026 22:09:25 +0000

Article URL: https://tanyaverma.sh/2026/04/10/closing-of-the-frontier.html

Comments URL: https://news.ycombinator.com/item?id=47734444

Points: 4

# Comments: 0

New comment by FrasiertheLion in "Safeguarding cryptocurrency by disclosing quantum vulnerabilities responsibly"

FrasiertheLion — Tue, 31 Mar 2026 16:35:32 +0000

Yes they absolutely care and have been doing serious work to migrate PKI to PQC.

This was the first of several articles coming out of Google: https://blog.google/innovation-and-ai/technology/safety-secu...

And the timeline for web migration is 2027 Q1: https://security.googleblog.com/2026/02/cultivating-robust-a...

And this was Sophie Schmieg’s talk at a cryptography conference this month (they lead PQC migration efforts at Google) tracking migration efforts and urging folks to prioritize signature migrations in lieu of accelerated quantum timelines: https://westerbaan.name/~bas/rwpqc2026/sophie.pdf

New comment by FrasiertheLion in "Safeguarding cryptocurrency by disclosing quantum vulnerabilities responsibly"

FrasiertheLion — Tue, 31 Mar 2026 06:05:58 +0000

It's unfortunate that we're past the point where all quantum computing progress is public. Between this and the unbearable secrecy of AI labs, balkanization of knowledge is in full force.

New comment by FrasiertheLion in "How we Built Private Post-Training and Inference for Frontier Models"

FrasiertheLion — Mon, 16 Mar 2026 19:26:32 +0000

Tanya from the Tinfoil team that worked on the confidential computing and security substrate here.

Also around to answer any questions!

New comment by FrasiertheLion in "Intel Demos Chip to Compute with Encrypted Data"

FrasiertheLion — Mon, 16 Mar 2026 17:50:37 +0000

We don't have reproducible builds because we attest the full OS image that we run, which is the Ubuntu image. Unfortunately bit-by-bit reproducible binaries for OS images is kind of an unsolved problem, because it requires the hundreds of package maintainers across all dependencies to eliminate any sources of non-determinism in the compilation. Things like timestamps and file reordering are very common and even one of these changes the entire hash.

So we do the next best thing. We decide to trust Github and rely on Github Actions to faithfully execute the build pipeline. We also make sure to pin all images and dependencies.

New comment by FrasiertheLion in "Intel Demos Chip to Compute with Encrypted Data"

FrasiertheLion — Tue, 10 Mar 2026 23:33:32 +0000

Enclaves have a property that allows the hardware to compute a measurement (a cryptographic hash) of everything running inside it, such as the firmware, system software such as the operating system and drivers, the application code, the security configuration. This is signed by the hardware manufacturer (Intel/AMD + NVIDIA).

Then, verification involves a three part approach. Disclaimer: I'm the cofounder of Tinfoil: https://tinfoil.sh/, we also run inference inside secure enclaves. So I'll explain this as we do it.

First, you open source the code that's running in the enclave, and pin a commitment to it to a transparency log (in our case, Sigstore).

Then, when a client connects to the server (that's running in the enclave), the enclave computes the measurement of its current state and returns that to the client. This process is called remote attestation.

The client then fetches the pinned measurements from Sigstore and compares it against the fetched measurements from the enclave. This guarantees that the code running in the enclave is the same as the code that was committed to publicly.

So if someone claimed they were only analyzing aggregated metrics, they could not suddenly start analyzing individual request metrics because the code would change -> hash changes -> verification fails.