<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: systima</title><link>https://news.ycombinator.com/user?id=systima</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 23 Apr 2026 07:24:32 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=systima" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by systima in "[dead]"]]></title><description><![CDATA[
<p>As the entire planet now knows, the Claude Code source leaked on March 31.<p>The engineering-focused findings have been covered extensively (fake tool injection, Undercover Mode, KAIROS, etc).<p>This piece focuses on what these findings mean if you're using Claude Code to build AI systems subject to the EU AI Act.<p>TL;DR / spoiler:<p>Claude Code isn't a high-risk AI system in and of itself.<p>The EU AI Act regulates your deployed system and your process, not your tool vendor's internal engineering practices.</p>
]]></description><pubDate>Wed, 01 Apr 2026 09:52:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47598826</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47598826</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47598826</guid></item><item><title><![CDATA[New comment by systima in "OpenYak – An open-source Cowork that runs any model and owns your filesystem"]]></title><description><![CDATA[
<p>How does this differ from Open Code Desktop?</p>
]]></description><pubDate>Sun, 29 Mar 2026 07:58:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47561208</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47561208</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47561208</guid></item><item><title><![CDATA[New comment by systima in "OpenCode – Open source AI coding agent"]]></title><description><![CDATA[
<p>Sorry, I missed part of your question:<p>What caused the switch was that we're building AI solutions for sometimes price-conscious customers, so I was already familiar with the pattern of "use a superior model to set the standard, then fine-tune a cheaper one to do the same work".<p>So I brought that into my own workflows (kind of): I use Opus 4.6 to do the detailed planning and one 'exemplar' execution (with 'over-documentation' of the choices); after that, I use Opus 4.6 only for planning and "throw a load of MiniMax M2.5s at the problem".<p>They tend to do 90% of the job well, and I sometimes do a final pass with Opus 4.6 to mop up any issues. This saves me a lot of tokens/money.<p>This pattern wasn't possible with Claude Code, thus my move to Open Code.</p>
]]></description><pubDate>Sat, 21 Mar 2026 09:26:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47465484</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47465484</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47465484</guid></item><item><title><![CDATA[New comment by systima in "OpenCode – Open source AI coding agent"]]></title><description><![CDATA[
<p>Yes, I regularly plan in Opus 4.6 and execute in “lesser” models, e.g. MiniMax.</p>
]]></description><pubDate>Sat, 21 Mar 2026 05:48:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47464338</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47464338</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47464338</guid></item><item><title><![CDATA[New comment by systima in "OpenCode – Open source AI coding agent"]]></title><description><![CDATA[
<p>Open Code has been the backbone of our entire operation (we used Claude Code before it, and Cursor before that).<p>Hugely grateful for what they do.</p>
]]></description><pubDate>Fri, 20 Mar 2026 23:07:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47462016</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47462016</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47462016</guid></item><item><title><![CDATA[New comment by systima in "Ask HN: What do you look for in your first 10 hires?"]]></title><description><![CDATA[
<p>I agree.<p>In my experience, this correlates more with soft skills and “one-man-band” founder/maker types who tend to sell training products or (if they exist in a company environment at all) invariably work in DevRel and aren’t pushing code.</p>
]]></description><pubDate>Thu, 19 Mar 2026 07:52:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47436190</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47436190</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47436190</guid></item><item><title><![CDATA[New comment by systima in "Show HN: Open-Source Article 12 Logging Infrastructure for the EU AI Act"]]></title><description><![CDATA[
<p>Thank you — Excellent points. Will think about them.</p>
]]></description><pubDate>Tue, 17 Mar 2026 22:33:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47419268</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47419268</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47419268</guid></item><item><title><![CDATA[New comment by systima in "Meta Platforms: Lobbying, dark money, and the App Store Accountability Act"]]></title><description><![CDATA[
<p>I don’t think it’s that.<p>I think it’s more about setting a norm and precedent that “Age verification is not our responsibility; the App Store layer does that and it’s an established truth now”.<p>Which itself conveniently helps as a defence in lawsuits when a teenager kills themselves over harmful content etc.</p>
]]></description><pubDate>Tue, 17 Mar 2026 19:52:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=47417423</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47417423</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47417423</guid></item><item><title><![CDATA[New comment by systima in "Meta Platforms: Lobbying, dark money, and the App Store Accountability Act"]]></title><description><![CDATA[
<p>"But there is an obvious solution: mandate the operating systems (iOS and Android) to share device users' ages when they download apps from the app stores – data the operating systems get as part of the hardware acquisition already. This would be a simple one-step way for parents to control all the different apps that their kids use (in the US, the average teen uses forty different apps per month) and would remedy the fractured app-by-app approach we have today. We should make a societal judgement about whether to set these age limits for smartphones or social media
use at thirteen, fourteen, fifteen or sixteen, then write it into law." in How to Save the Internet by Nick Clegg</p>
]]></description><pubDate>Tue, 17 Mar 2026 12:59:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47412048</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47412048</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47412048</guid></item><item><title><![CDATA[New comment by systima in "Meta Platforms: Lobbying, dark money, and the App Store Accountability Act"]]></title><description><![CDATA[
<p>Follow what Nick Clegg has been saying post-Meta. He might give a big clue.</p>
]]></description><pubDate>Tue, 17 Mar 2026 12:42:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47411863</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47411863</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47411863</guid></item><item><title><![CDATA[New comment by systima in "Apideck CLI – An AI-agent interface with much lower context consumption than MCP"]]></title><description><![CDATA[
<p>Maybe <a href="https://usepec.eu" rel="nofollow">https://usepec.eu</a> ?</p>
]]></description><pubDate>Mon, 16 Mar 2026 18:15:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47402673</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47402673</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47402673</guid></item><item><title><![CDATA[New comment by systima in "$96 3D-printed rocket that recalculates its mid-air trajectory using a $5 sensor"]]></title><description><![CDATA[
<p>Impressive! Well done</p>
]]></description><pubDate>Sun, 15 Mar 2026 13:00:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47386996</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47386996</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47386996</guid></item><item><title><![CDATA[Show HN: Open-Source EU AI Act Compliance Scanning for CI/CD]]></title><description><![CDATA[
<p>We built a CLI tool that scans your codebase for EU AI Act compliance risks.<p>`npx @systima/comply scan` analyses your repository to detect AI framework usage, traces how AI outputs flow through the program, and flags patterns that may trigger regulatory obligations.<p>It runs in CI and posts findings on pull requests (no API keys required).<p>Under the hood it performs AST-based import detection using the TypeScript Compiler API and web-tree-sitter WASM across 37+ AI frameworks. It then traces AI return values through assignments and destructuring to identify four patterns:<p>1. conditional branching on AI output<p>2. persistence of AI output to a database<p>3. rendering AI output in a UI without disclosure<p>4. sending AI output to downstream APIs<p>Findings are severity-adjusted by system domain. You declare what your system does (customer support, credit scoring, legal research, etc) and the scanner adjusts accordingly.<p>Example:<p>- a chatbot routing tool using AI output in an `if` statement produces an informational note<p>- a credit scoring system doing the same produces a critical finding<p>We tested it against Vercel’s 20k-star AI chatbot repository; the scan took about 8 seconds. Example PR comment with full results:
<a href="https://github.com/systima-ai/chatbot-comply-test/pull/1" rel="nofollow">https://github.com/systima-ai/chatbot-comply-test/pull/1</a><p>Comply ships as an npm package, a GitHub Action (systima-ai/comply@v1), and a TypeScript API. It can also generate PDF reports and template compliance documentation.<p>Repo and explanation:
<a href="https://systima.ai/blog/systima-comply-eu-ai-act-compliance-scanning" rel="nofollow">https://systima.ai/blog/systima-comply-eu-ai-act-compliance-...</a><p>Feedback welcome on the call-chain tracing approach and whether the domain-based severity model makes sense.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47376869">https://news.ycombinator.com/item?id=47376869</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 14 Mar 2026 14:10:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=47376869</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47376869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47376869</guid></item><item><title><![CDATA[New comment by systima in "Show HN: XML, Markdown, or JSON: Which gives LLMs the most reliable boundaries?"]]></title><description><![CDATA[
<p>Respectfully, this is not really engaging with the content of the post.</p>
]]></description><pubDate>Thu, 05 Mar 2026 21:18:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47267430</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47267430</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47267430</guid></item><item><title><![CDATA[Show HN: XML, Markdown, or JSON: Which gives LLMs the most reliable boundaries?]]></title><description><![CDATA[
<p>Article URL: <a href="https://systima.ai/blog/delimiter-hypothesis">https://systima.ai/blog/delimiter-hypothesis</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47267341">https://news.ycombinator.com/item?id=47267341</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Thu, 05 Mar 2026 21:11:11 +0000</pubDate><link>https://systima.ai/blog/delimiter-hypothesis</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47267341</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47267341</guid></item><item><title><![CDATA[New comment by systima in "Show HN: Open-Source Article 12 Logging Infrastructure for the EU AI Act"]]></title><description><![CDATA[
<p>IMO what you’re describing is essentially crypto-shredding.<p>It would definitely work (and when dealing with petabyte levels of data, the simplicity of only having to delete the key is convenient).<p>We’re leaning toward the dual-layer separation I described, though (metadata separate from content), mainly because crypto-shredding means every read (including regulatory reconstruction) depends on a key store.<p>In my view that’s a significant dependency for an audit log whose whole purpose is reliable reconstructability, whereas dual-layer lets the chain stand on its own.<p>Your point about developer mistakes is fair. It applies to dual-layer, as you say with your example, but I’d say crypto-shredding isn’t immune to mistakes either, because (for example) deleting the key only works if the key and plaintext never accidentally leaked elsewhere (logs, backups, etc).</p>
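<p>For concreteness, here is a rough sketch of the crypto-shredding shape being discussed (illustrative Node/TypeScript, not our library's API): each record's content gets its own key in a key store, and erasure means deleting that key.</p>
<pre><code>
import { randomBytes, createCipheriv, createDecipheriv } from 'node:crypto';

// Stand-in for a real key store / KMS.
const keyStore = new Map();

// Each record's content is encrypted under its own key; "erasure" is deleting
// that key, after which the ciphertext left in the log is unreadable.
function writeRecord(recordId: string, content: string): Buffer {
  const key = randomBytes(32);
  const iv = randomBytes(16);
  keyStore.set(recordId, { key, iv });
  const cipher = createCipheriv('aes-256-cbc', key, iv);
  return Buffer.concat([cipher.update(content, 'utf8'), cipher.final()]);
}

// Every read, including regulatory reconstruction, depends on the key store.
function readRecord(recordId: string, ciphertext: Buffer): string {
  const entry = keyStore.get(recordId);
  if (!entry) throw new Error('key shredded: content unrecoverable');
  const decipher = createDecipheriv('aes-256-cbc', entry.key, entry.iv);
  return Buffer.concat([decipher.update(ciphertext), decipher.final()]).toString('utf8');
}

const shredRecord = (recordId: string) => keyStore.delete(recordId);
</code></pre>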
]]></description><pubDate>Thu, 05 Mar 2026 07:29:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47258683</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47258683</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47258683</guid></item><item><title><![CDATA[New comment by systima in "Show HN: Open-Source Article 12 Logging Infrastructure for the EU AI Act"]]></title><description><![CDATA[
<p>Great question.<p>voxic11 is right that the AI Act creates a legal obligation that provides a lawful basis for processing under GDPR Article 6(1)(c).<p>To add to that, Article 17(3)(b) specifically carves out an exemption to the right to erasure where retention is necessary to comply with a legal obligation.<p>(So the defence works at both levels; you have a lawful basis to retain, and erasure requests don’t override it during the mandatory retention period).<p>That said, GDPR data minimisation (Article 5(1)(c)) still constrains what you log.<p>The library addresses this at write-time today, in that the pii config lets you SHA-256 hash inputs/outputs before they hit the log and apply regex redaction patterns, so personal data need never enter the chain in the first place.<p>This enables the pattern of “Hash by default, only log raw where necessary for Article 12”.<p>For cases where raw content must be logged (eg, full decision reconstruction for a regulator), we’re planning a dual-layer storage approach. The hash chain would cover a structural envelope (timestamps, decision ID, model ID, parameters, latency, hash pointers) while the actual PII-bearing content (input prompts, output text) would live in a separate referenced object.<p>Erasure would then mean deleting the content object, and the chain would stay intact because it never hashed the raw content directly.<p>The regulator would also therefore see a complete, tamper-evident chain of system activity.</p>
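<p>To make the dual-layer idea concrete, here is a minimal sketch (illustrative only; the field names are made up and not our actual schema). The chain is built over structural envelopes carrying hash pointers, while the raw, PII-bearing content lives in a separate, deletable object:</p>
<pre><code>
import { createHash } from 'node:crypto';

const sha256 = (value: string) => createHash('sha256').update(value).digest('hex');

// Structural envelope: no raw prompt/output text, only references and hashes.
interface Envelope {
  decisionId: string;
  timestamp: string;
  modelId: string;
  contentRef: string;   // pointer to a separate, deletable content object
  contentHash: string;  // hash of the content at write time
  prevHash: string;     // chain pointer to the previous envelope
}

const contentStore = new Map(); // stand-in for the content bucket
const chain: Envelope[] = [];

function appendEntry(decisionId: string, modelId: string, rawContent: string): Envelope {
  const contentRef = 'content/' + decisionId;
  contentStore.set(contentRef, rawContent);
  const prev = chain[chain.length - 1];
  const entry: Envelope = {
    decisionId,
    timestamp: new Date().toISOString(),
    modelId,
    contentRef,
    contentHash: sha256(rawContent),
    prevHash: prev ? sha256(JSON.stringify(prev)) : 'GENESIS',
  };
  chain.push(entry);
  return entry;
}

// Erasure deletes only the content object; the envelope chain still verifies.
const eraseContent = (entry: Envelope) => contentStore.delete(entry.contentRef);
</code></pre>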
]]></description><pubDate>Wed, 04 Mar 2026 23:19:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47255384</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47255384</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47255384</guid></item><item><title><![CDATA[New comment by systima in "Show HN: Open-Source Article 12 Logging Infrastructure for the EU AI Act"]]></title><description><![CDATA[
<p>Thanks for the thoughts and feedback.<p>Fair point on the reconstruction attack.<p>The library is deliberately scoped as tamper-evident, not tamper-proof; it detects modification but does not prevent wholesale chain reconstruction by someone with storage access. The design assumes defence-in-depth: S3 Object Lock (Compliance mode) at the infrastructure layer, hash chain verification at the application layer.<p>External timestamping (OpenTimestamps, RFC 3161) would definitely add independent temporal anchoring and is worth considering as an optional feature. From what I can see, Article 12 does not currently prescribe specific cryptographic mechanisms (though the assurance level would of course increase with them).<p>On the regulatory question: Article 12 requires "automatic recording" that enables monitoring and reconstruction, and current regulatory guidance does not require tamper-proof storage (only trustworthy, auditable records). The hash chain plus immutable storage is designed to meet that bar, but what you raise here is a good and thoughtful point.</p>
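<p>For reference, a minimal sketch of what the application-layer verification amounts to (illustrative; not the actual CLI or record format):</p>
<pre><code>
import { createHash } from 'node:crypto';

const sha256 = (value: string) => createHash('sha256').update(value).digest('hex');

interface ChainedRecord {
  payload: unknown;
  prevHash: string;
  hash: string;
}

// Each record's hash covers its payload plus the previous hash, so editing or
// removing any earlier record invalidates every hash after it (tamper-evident).
// It cannot stop an attacker with full write access from recomputing the whole
// chain, which is why Object Lock and external timestamps sit alongside it.
function verifyChain(records: ChainedRecord[]): boolean {
  let prevHash = 'GENESIS';
  for (const record of records) {
    if (record.prevHash !== prevHash) return false;
    if (record.hash !== sha256(JSON.stringify(record.payload) + record.prevHash)) return false;
    prevHash = record.hash;
  }
  return true;
}
</code></pre>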
]]></description><pubDate>Tue, 03 Mar 2026 21:47:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47239516</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47239516</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47239516</guid></item><item><title><![CDATA[Show HN: Open-Source Article 12 Logging Infrastructure for the EU AI Act]]></title><description><![CDATA[
<p>EU legislation (which affects UK and US companies in many cases) requires being able to truly reconstruct agentic events.<p>I've worked in a number of regulated industries off and on for years, and recently hit this gap.<p>We already had strong observability, but if someone asked me to prove exactly what happened for a specific AI decision X months ago (and demonstrate that the log trail had not been altered), I could not.<p>The EU AI Act has already entered into force, and its Article 12 kicks in in August this year, requiring automatic event recording and six-month retention for high-risk systems, which many legal commentators have suggested reads more like an append-only ledger requirement than standard application logging.<p>With this in mind, we built a small, free, open-source TypeScript library for Node apps using the Vercel AI SDK that captures inference as an append-only log.<p>It wraps the model in middleware (see the sketch at the end of this post), automatically logs every inference call to structured JSONL in your own S3 bucket, chains entries with SHA-256 hashes for tamper detection, enforces a 180-day retention floor, and provides a CLI to reconstruct a decision and verify integrity. There is also a coverage command that flags likely gaps (in practice, omissions are a bigger risk than edits).<p>The library is deliberately simple: TS, targeting Vercel AI SDK middleware, S3 or local fs, linear hash chaining. It also works with Mastra (agentic framework), and I am happy to expand its integrations via PRs.<p>Blog post with link to repo: <a href="https://systima.ai/blog/open-source-article-12-audit-logging" rel="nofollow">https://systima.ai/blog/open-source-article-12-audit-logging</a><p>I'd value feedback, thoughts, and any critique.</p>
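<p>To give a feel for the middleware approach, here is a rough sketch assuming AI SDK v4-style naming (wrapLanguageModel, LanguageModelV1Middleware); the library's real API and record format differ, and appendAuditRecord is just a placeholder:</p>
<pre><code>
import { wrapLanguageModel } from 'ai';
import type { LanguageModelV1Middleware } from 'ai';
import { openai } from '@ai-sdk/openai';

// Placeholder append-only writer; the real library writes structured JSONL to S3.
const appendAuditRecord = async (record: object) => {
  console.log(JSON.stringify(record));
};

const auditMiddleware: LanguageModelV1Middleware = {
  wrapGenerate: async ({ doGenerate, params }) => {
    const startedAt = new Date().toISOString();
    const result = await doGenerate();
    // One structured record per inference call: parameters, output, timing.
    await appendAuditRecord({
      startedAt,
      finishedAt: new Date().toISOString(),
      params,
      text: result.text,
    });
    return result;
  },
};

// Drop-in replacement for the underlying model in generateText / streamText calls.
export const auditedModel = wrapLanguageModel({
  model: openai('gpt-4o'),
  middleware: auditMiddleware,
});
</code></pre>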
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47230438">https://news.ycombinator.com/item?id=47230438</a></p>
<p>Points: 42</p>
<p># Comments: 10</p>
]]></description><pubDate>Tue, 03 Mar 2026 10:11:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47230438</link><dc:creator>systima</dc:creator><comments>https://news.ycombinator.com/item?id=47230438</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47230438</guid></item></channel></rss>