Hacker News: beshrkayali

New comment by beshrkayali in "Stop Advertising in Your Commits"

beshrkayali — Tue, 26 May 2026 20:05:22 +0000

In the case of Claude or others, it is not just an advertisement, it's the weird shape the industry is spinning LLM-assisted-coding as a "co-author" relationship where it should be thought of more like a user-using-a-tool relationship. When you make a design with Photoshop or InDesign, it's not "co-designed by Photoshop", it's just a tool and you used the filters it provides.

It is slightly weird that people accepted this new trend just like that, probably because they think this is being transparent and wanting to give attribution, but it'd be more useful like what the Linux kernel "AI Coding Assistants" page describes, something like `AGENT_NAME:MODEL_VERSION [TOOL1] [TOOL2]`, at least we get to know which model was used and/if any additional tooling on top. And `Assisted-by:` is more appropriate for that purpose than `Co-authored-by`.

Domain Knowledge Is the Leverage

beshrkayali — Sun, 17 May 2026 07:39:06 +0000

Article URL: https://log.beshr.com/domain-knowledge-is-the-leverage/

Comments URL: https://news.ycombinator.com/item?id=48166832

Points: 4

# Comments: 0

New comment by beshrkayali in "The Emacsification of Software"

beshrkayali — Thu, 14 May 2026 15:50:08 +0000

> But they’re hamstrung by the terminal itself, which is almost always monospaced and thus fatiguing to read.

Not related to the main point of the article, but I find reading long form contnet in a mono font much easier.

New comment by beshrkayali in "If AI writes your code, why use Python?"

beshrkayali — Tue, 12 May 2026 06:34:05 +0000

Everyone is trying to figure out how and what are the optimal use cases. It could be like you said but it doesn’t have to be. There’s a lot of incentive for it not to end up like that.

New comment by beshrkayali in "If AI writes your code, why use Python?"

beshrkayali — Tue, 12 May 2026 06:20:48 +0000

For now it’s the exact same reason why you’d use Python when you’re writing by hand: so the code is more easily readable/editable by humans who are more likely to know Python than something like Zig. But I understand the point the post is trying to make, I don’t think we’re there yet.

New comment by beshrkayali in "Agents need control flow, not more prompts"

beshrkayali — Fri, 08 May 2026 11:23:31 +0000

Humble mention, I’ve been thinking the same thing with Ossature for the last couple of months since I started working on it: https://ossature.dev

The models are already good enough for code generation. What we need is the harness around them actually deterministically enforcing a specific path and “leashing” the models output to be aligned with the intention of the user as much as possible. You can’t make the output of the model deterministic, but you can make everything around it to be so.

Trying to make enforcements work with prompts is like a government agency investigating/auditing itself, there’s no incentive to find problems, so you’ll always inevitably get the “All Good, Boss!”

New comment by beshrkayali in "Specsmaxxing – On overcoming AI psychosis, and why I write specs in YAML"

beshrkayali — Sun, 03 May 2026 08:39:23 +0000

I wrote something similar recently about how agent-generated code lacks the institutional memory that human-written code has. There's nobody to ask why a decision was made (1).

“Specsmaxxing” is basically the right response to this. When you can't rely on authorial memory, you have to put the intent somewhere durable. Specs become the source of truth by default if we continue down the road of AI generated code.

1: https://ossature.dev/blog/ai-generated-code-has-no-author/

New comment by beshrkayali in "Becoming a father shrinks your cerebrum (2022)"

beshrkayali — Sat, 02 May 2026 14:52:43 +0000

The title of the article is more on the sensationalist side unfortunately, the actual paper gives a different view [1].

There are two parts worth quoting:

> Although cortical reductions sometimes reflect a process of neurodegeneration, they can also be a sign of refinement and specialization of neural circuits. Adolescence, for instance, is a life period characterized by the continued elimination of redundant synapses (i.e. synaptic pruning) which parallels cognitive and emotional development (Selemon 2013). In the context of the transition to parent-hood, several examples across human and non-human mammals show functional improvements after reductions in brain markers (Pawluski et al. 2022).

And:

> Although we found converging evidence of cortical reductions across the two samples, a number of divergent findings also emerged. First, when disentangling the cortical volume reduction, Californian fathers displayed significant reductions in area and Spanish fathers in thickness. Changes in the area may reflect changes in the number of cells located between radial columns of the brain, while changes in thickness may reflect changes in the number of cells within ontogenic columns (Petanjek et al. 2011). Secondly, the volume of the dorsal attentional network, which supports goal-directed attention, was significantly reduced in Spanish fathers, while it did not show significant changes in Californian fathers. Combined with the default mode network, this network may control sustained attention (Spreng et al. 2010, 2013), a behavior that is often required during childrearing. It is possible that these inconsistent results at the statistical level may be due to the different scan timing windows or to cultural or behavioral differences. For example, due to more generous paternity leave policies in Spain

1: https://academic.oup.com/cercor/article/33/7/4156/6691667

AI-Generated Code Has No Author

beshrkayali — Thu, 30 Apr 2026 20:14:56 +0000

Article URL: https://ossature.dev/blog/ai-generated-code-has-no-author/

Comments URL: https://news.ycombinator.com/item?id=47967649

Points: 2

# Comments: 0

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Mon, 06 Apr 2026 08:31:20 +0000

Exactly, and it is a DAG (specs and tasks in the toml plan). Check the QOIzig example and its task graph if you’re curious!

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 19:59:48 +0000

Exactly this. The audit pass in Ossature is specifically for that "unclear spec" case, you resolve ambiguities in the spec before generation starts rather than discovering them mid-conversation and losing them the next session. Once the plan is clean, the LLM never needs to ask a clarifying question. Memories and agent files are patching over the fact that intent was never properly captured to begin with.

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 14:55:23 +0000

Allium looks interesting, making behavioral intent explicit in a structured format rather than prose is very close to what I'm trying to do with Ossature actually.

Ossature uses two markdown formats, SMD[1] for describing behavior and AMD for structure (components, file paths, data models). AMDs[2] link back to their parent SMD so behavior and structure stay connected. Both are meant to be written, reviewed, and/or owned by humans, the LLM only reads the relevant parts during generation. One thing I am thinking about for the future is making the template structure for this customizable per project, because "spec" means different things to different teams/projects. Right now the format is fixed, but I am thinking about a schema-based way to declare which sections are required, their order, and basic content constraints, so teams can adapt the spec structure to how they think about software without having to learn a grammar language to do it (though maybe peg-based underneath anyway, not sure).

The formal approach you describe is probably more precise for expressing system properties. Would be interesting to see how practical it is to maintain it as a project grows.

1: https://docs.ossature.dev/specs/smd.html

2: https://docs.ossature.dev/specs/amd.html

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 14:01:44 +0000

All three of these are real. The audit pass in Ossature is meant to catch the first two before generation starts, it reads across all specs and flags underspecified behavior, missing details, and contradictions. You resolve those, update the specs, and re-audit until the plan is clean. It's not perfect but it shifts a lot of the discovery earlier in the process.

The third point is harder. You still need to know your tooling well enough to write a spec that works with it. That part hasn't gone away.

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 11:00:20 +0000

Very much the same thinking. Ossature already structures work that way at the plan level during audit, so curious to see where you take it. Happy to share more about the TOML approach if useful. Feel free to reach out (me at my domain)

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 10:51:58 +0000

The hierarchy you describe (intent -> plan -> code -> tests) maps well to how Ossature works. The difference is that your approach builds scaffolding around Claude Code to recover structure that chat naturally loses, whereas Ossature takes chat out of the generation pipeline entirely. Specs are the source of truth before anything is generated, so there's no drift to compensate for, the audit and build plan handle that upfront.

The judge finding is interesting though. Right now verification during build for each task in Ossature is command-based, compile, tests, that kind of thing. A judge checking spec-to-code fidelity rather than (or maybe in addition to?) runtime correctness is worth thinking about.

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sun, 05 Apr 2026 09:56:24 +0000

You framed it better than I would. The part I'm still working through is making re-planning feel cheap when specs change. Right now if you change something early, downstream tasks get invalidated and the cascade isn't always obvious. Ideally when the project gets built, and then specs change, nothing of the generated code should change if an irrelevant part of the spec changed, this is a bit harder to do properly but I have some ideas.

I agree that, this is what makes it not waterfall. You're iterating on the spec and not backtracking from broken code. The spec is the "source code", replanning and rebuilding is just "recompiling".

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sat, 04 Apr 2026 19:52:32 +0000

Right, the spec/build separation is exactly the idea and Ossature is already built that way on the build side.

I agree a dedicated layer for intent capture makes a lot of sense. I thought about that as well, I am just not fully convinced it has to be conversational (or free-form conversational). Writing a prompt to get the right spec change is still a skill in itself, and it feels like it'd just be shifting the problem upstream rather than actually solving it. A structured editing experience over specs feels like it'd be more tractable to me. But the explicit vs inferred distinction you mention is interesting and worth thinking through more.

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sat, 04 Apr 2026 18:31:48 +0000

I've answered this exact question in a previous hn comment thread a few weeks ago, maybe I should reconsider front-matter? My previous answer:

> Yeah, I did briefly consider front-matter, but ended up with inline @ tags because I thought it kept the entire document feeling like one coherent spec instead of header-data + body, front matter felt like config to me, but this is 0.0.1 so things might change :)

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sat, 04 Apr 2026 18:00:23 +0000

Thanks!

> How does the human intervention work out? Do you use a mix of spec and audit editing to get into the ready to generate state?

Yes, the flow is: you write specs then you validate them with `ossature validate` which parses them and checks they are structurally sound (no LLM involved), then you run `ossature audit` which flags gaps or contradictions in the content as INFO, WARNING, or ERROR level findings. The audit has its own fixer loop that auto-resolves ERROR level findings, but you can also run it interactively, manually fix things yourself, address the INFO and WARNING findings as you see fit, and rerun until you are happy. From that it produces a toml build plan that you can read and edit directly before anything is generated. You can reorder tasks, add notes for the LLM, adjust verification commands, or skip steps entirely. So when you run `ossature build` to generate, the structure is already something you have signed off on. There's a bit more details under the hood, I wrote more in an intro post[1] about Ossature, might be useful.

> The spec driven approach is potentially better for writing things from scratch, do you have any plans for existing code?

Right now it is best for greenfield, as you said. I have been thinking about a workflow where you generate specs from existing code and then let Ossature work from those, but I am honestly not sure that is the right model either. The harder case is when engineers want to touch both the code and the specs, and keeping those in sync through that back and forth is something I want to support but have not figured out a clean answer for yet. It's on the list, if you have any thoughts please feel free to open an issue! I want to get through some of the issues I am seeing with just spec editing workflow (and re-audit/re-planning) first, specifically around how changes cascade through dependent tasks.

Regarding success rate, each task requires a verification command to run and pass after generation and if it fails, a separate fixer agent tries to repair it using the error output. The number of retry attempts is configurable. I did notice that the more concise and clear the spec is the more likely it is for capable models to generate code that works (obviously) but that's what auditing is supposed to help with. One interesting case about the chip-8 emulator I mentioned above is that even mentioning the correct name of the solution to a specific problem was not enough, I had to spell out the concrete algorithm in the spec (wrote more details here[2]). But the full prompt and response for every task is saved to disk, so when something does go wrong one can read the exact prompt/response and fix-attempts prompt/response for each task.

1: https://ossature.dev/blog/introducing-ossature/

2: https://log.beshr.com/chip8-emulator-from-spec/

New comment by beshrkayali in "Components of a Coding Agent"

beshrkayali — Sat, 04 Apr 2026 16:52:28 +0000

> long contexts are still expensive and can also introduce additional noise (if there is a lot of irrelevant info)

I think spec-driven generation is the antithesis of chat-style coding for this reason. With tools like Claude Code, you are the one tracking what was already built, what interfaces exist, and why something was generated a certain way.

I built Ossature[1] around the opposite model. You write specs describing behavior, it audits them for gaps and contradictions before any code is written, then produces a build plan toml where each task declares exactly which spec sections and upstream files it needs. The LLM never sees more than that, and there is no accumulated conversation history to drift from. Every prompt and response is saved to disk, so traceability is built in rather than something you reconstruct by scrolling back through a chat. I used it over the last couple of days to build a CHIP-8 emulator entirely from specs[2]. I have some more example projects on GitHub[3]

1: https://github.com/ossature/ossature

2: https://github.com/beshrkayali/chomp8

3: https://github.com/ossature/ossature-examples