<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: avyvar</title><link>https://news.ycombinator.com/user?id=avyvar</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 21 May 2026 01:20:10 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=avyvar" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by avyvar in "Show HN: Dari-docs – Optimize your docs using parallel coding agents"]]></title><description><![CDATA[
<p>Totally fair question. If you only want one agent to sanity-check one doc change, a skill/prompt is probably enough.<p>We actually aren’t rebuilding a harness here, it’s Pi with several LLM options to select from. The reason this is a project is that the useful workflow is more like a docs test suite: run realistic user tasks across multiple models, isolate each run in a greenfield sandbox, keep the transcripts/results, and make failures reproducible in CI.<p>You could ask an existing coding agent to spawn subagents for every task/model pair, but once that matrix grows, running hundreds of subagents on your computer gets messy. It’s also the wrong isolation boundary: for docs testing, you usually want the agent to start from a clean environment with access only to the docs/product surface you’re testing, not your whole working tree or local setup.</p>
]]></description><pubDate>Wed, 20 May 2026 22:16:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48214988</link><dc:creator>avyvar</dc:creator><comments>https://news.ycombinator.com/item?id=48214988</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48214988</guid></item><item><title><![CDATA[Show HN: SkillFS – Git-backed persistent sandboxes for AI agents]]></title><description><![CDATA[
<p>We're Dari (YC F25). We're building browser automation agents with a deterministic caching layer; for that to work, we needed a way to save and rerun code across sessions. There's also no audit trail with traditional sandboxes; when something goes wrong, you have no idea what the agent actually did.<p>So we built SkillFS. The core idea is simple: every agent sandbox is a git repo. The agent does its work, commits its progress, and the session ends. When you start a new session, SkillFS restores the repo from a git bundle and the agent continues where it left off. And because it's git, you have a complete history of every action the agent took.<p>The workflow looks like this:<p><pre><code>  1. Agent works in a sandboxed environment, making changes to files
  2. Agent commits progress at meaningful checkpoints
  3. Session ends, git bundle gets saved to storage (local or GCS)
  4. Next session starts, bundle is restored, agent resumes
  5. Need to debug? git log shows you exactly what happened
</code></pre>
Key features:<p><pre><code>  - Persistent state via git bundles, with pluggable storage backends (local filesystem or GCS), so skills and scripts from previous sessions are reusable.

  - MCP integration that lets you plug in any server. SkillFS also generates Python wrappers from MCP tool definitions and uploads them to the sandbox, so your agent can call any MCP tool as regular code to cache sequences of MCP interactions into deterministic scripts.

  - Built-in LLM runner with standard tools (glob, grep, read/write/edit files, run commands), or you can bring your own agent loop.

  - Runs on E2B sandboxes so agents execute code in isolated environments. Agent skills can also be easily imported from local files or from GitHub repos.
</code></pre>
We're open sourcing this because agent persistence is a problem every team building with agents ends up solving differently, and we think bash+git is a good answer that more people should be using.<p><pre><code>  > pip install skillfs</code></pre></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46543093">https://news.ycombinator.com/item?id=46543093</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 08 Jan 2026 16:38:43 +0000</pubDate><link>https://github.com/mupt-ai/skillfs</link><dc:creator>avyvar</dc:creator><comments>https://news.ycombinator.com/item?id=46543093</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46543093</guid></item></channel></rss>