Hacker News: sermakarevich

New comment by sermakarevich in "Ask HN: How do you stay up to date without information overload?"

sermakarevich — Sat, 06 Jun 2026 17:34:14 +0000

I have a list on X of 67 people who post on AI-related topics: https://x.com/i/lists/2014953069700633085

For processing papers/articles I use AI with predefined instructions to create short/medium summaries. A summary should give me a good understanding of the main idea and whether it's worth reading in more depth. Even if it isn't, the main idea will stay in your head.

I accumulate these summaries into a hierarchical knowledge base, optimized for Obsidian and LLMs: https://github.com/sermakarevich/ai_knowledge_wiki

New comment by sermakarevich in "Ask HN: Knowledge Access for Agents?"

sermakarevich — Sat, 06 Jun 2026 17:25:55 +0000

Git or a versioned path on cloud storage should work. Format is more important - I think a hierarchical knowledge base is the best thing on the market atm. You keep a main page with refs and short summaries for 5-10 topics. The model reads it and decides where to drop down next. You tune the breadth and depth of topics for optimal performance. Dropping down is not throwaway tokens - it helps the model understand the wider context.

I use it in quite a few repos:
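The navigation pattern described above can be sketched in a few lines. This is only an illustration, assuming an in-memory tree: the `Page` class, the `drill_down` helper, and the sample topics are hypothetical and are not taken from any of the linked repos, where pages are markdown files.

```python
from dataclasses import dataclass, field


@dataclass
class Page:
    """One page of a hypothetical hierarchical knowledge base."""
    title: str
    summary: str                      # short text shown on the parent page
    body: str = ""                    # full text, read only when the page is opened
    children: list["Page"] = field(default_factory=list)

    def index(self) -> str:
        """What a reader sees first: child titles plus their short summaries."""
        return "\n".join(f"- {c.title}: {c.summary}" for c in self.children)


def drill_down(root: Page, path: list[str]) -> tuple[Page, list[str]]:
    """Follow child titles from the root, recording every index read on the way."""
    seen, page = [], root
    for title in path:
        seen.append(page.index())
        page = next(c for c in page.children if c.title == title)
    return page, seen


root = Page("Home", "entry point", children=[
    Page("Agents", "orchestration and tooling", children=[
        Page("Swarms", "running many agents", body="Notes on swarms."),
    ]),
    Page("Papers", "summaries of research papers"),
])

page, seen = drill_down(root, ["Agents", "Swarms"])
print(page.body)
```

The reader only ever loads the indexes along the chosen path plus one body, which is the point of keeping each level to a handful of short summaries.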

-- https://github.com/sermakarevich/ai_knowledge_wiki Curated extraction of summaries from AI-related research papers, organized as a hierarchical wiki optimized for Obsidian and LLMs

-- https://github.com/sermakarevich/chunker/tree/master/output/... Chunker processes a document into a hierarchy of self-sufficient chunks and multi-level summaries, producing a set of linked markdown files that an AI model (or a human) can explore through progressive disclosure -- starting from a high-level overview and drilling into details on demand, without ever loading the entire document.

-- https://github.com/sermakarevich/kaggle_wiki A structured knowledge base of Kaggle competitions — solutions, notebooks, and indexes — built for fast lookup and reuse.

New comment by sermakarevich in "Show HN: Fleet – Python supervisor for running coding agents in parallel"

sermakarevich — Sat, 06 Jun 2026 16:58:59 +0000

Slides on fleet: https://docs.google.com/presentation/d/1O_pXyKdtpRG2ORD1xw7s...

New comment by sermakarevich in "Show HN: Lessons learned from running Claude Code swarms at scale"

sermakarevich — Sat, 06 Jun 2026 16:58:45 +0000

Slides on fleet: https://docs.google.com/presentation/d/1O_pXyKdtpRG2ORD1xw7s...

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Sat, 06 Jun 2026 12:38:42 +0000

TDD and specs help

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Sat, 06 Jun 2026 08:04:32 +0000

I looked at most of those, including kiro and tessl. I was an early user of GSD when it was suitable for mid-size and larger projects. Over time GSD grew into a beast that is suitable only for huge projects, producing gigantic specs and burning too many tokens for most tasks. So I decided to create my own, with the set of steps I need and the specs I want.

After a few presentations of sddw to different companies, the most important conclusion was that the sdd plugin should be customizable. It should fit the typical size of the tasks/features you are working on, the specs should fit your requirements, and the set of steps can differ.

So I created claude code workflow (ccw), which lets you compile a custom version of the workflow on top of the sdd approach: https://github.com/sermakarevich/ccw

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Sat, 06 Jun 2026 07:13:18 +0000

Here are some. I used sddw to create:

- chunker - an app that gets smart slices from text and organizes them into a hierarchical LLM/Obsidian wiki. Two features were implemented using sddw, with 15 subtasks:

-- https://github.com/sermakarevich/chunker/blob/master/.sddw/c...

-- https://github.com/sermakarevich/chunker/blob/master/.sddw/m...

- ccw (claude code workflow - a plugin that compiles generic Claude Code workflows based on the sdd approach) - https://github.com/sermakarevich/ccw/blob/main/.sddw/claude_...

Btw you can use ccw to create your own custom version of sddw quite fast - with the spec format and sequence of steps that suit you best.

New comment by sermakarevich in "Programmers will document for Claude, but not for each other"

sermakarevich — Sat, 06 Jun 2026 05:35:32 +0000

The time vs effort balance is very different in the two cases.

-- writing documents for each other takes weeks - write the document, give people time to review it, discuss, update after review.

-- writing documents for Claude takes minutes rather than hours - AI assists in writing the specs, and the review/approval process is personal instead of collective.

The beneficiaries are also different:

-- when you write a spec for Claude, you benefit personally: you have a higher chance of getting the code you need from the AI coder

-- when you write a spec for the company, the company is the main beneficiary: it has a higher chance of getting teams on the same page about the change

New comment by sermakarevich in "Show HN: Lessons learned from running Claude Code swarms at scale"

sermakarevich — Sat, 06 Jun 2026 04:37:59 +0000

Only as a viewer, but I don't code in an IDE

New comment by sermakarevich in "Show HN: Lich, start a dev stack per coding agent in parallel"

sermakarevich — Fri, 05 Jun 2026 17:31:39 +0000

Nice one. I love the idea of running many agents concurrently - this feels like a factory. I built the fleet app to orchestrate those: https://news.ycombinator.com/item?id=48407998

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Fri, 05 Jun 2026 17:12:12 +0000

I have been building AI agents full time since Nov 2024. I stopped coding completely around mid-summer 2025, when I was using Cursor. When you build a platform-like application and already have a few plugins, an AI coder can create the next one so well that you can't tell which ones you wrote yourself.

At the end of 2025 I switched to Claude Code. Compared to Cursor, this opened up a different level of automation, including, for example, the possibility of running swarms of agents on subscription limits: https://news.ycombinator.com/item?id=48407998

So I spend all my time working out how to squeeze everything possible out of AI rather than out of myself. AI scales, I don't.

New comment by sermakarevich in "Ask HN: AI researchers – what's a recent paper that recently blew your mind?"

sermakarevich — Fri, 05 Jun 2026 16:37:50 +0000

AI Knowledge Wiki - https://github.com/sermakarevich/ai_knowledge_wiki

A hierarchically organized wiki of AI papers / articles for Obsidian / LLMs. 300+ papers in total. You can find the most recent ones, more or less, from the commits. I process ~100 papers/articles a month, so pull from time to time.

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Fri, 05 Jun 2026 16:23:15 +0000

One additional benefit we get from sddw is that the agent drives the spec creation, following a scenario we put into the command/skill. It does the research (local/web), asks the operator questions, and later asks for confirmation of each block in the spec.

New comment by sermakarevich in "Ask HN: What is your (AI) dev tech stack / workflow?"

sermakarevich — Fri, 05 Jun 2026 16:08:12 +0000

I have been using a Spec Driven Development approach, implemented as a Claude Code plugin, since Feb for all mid-size and larger tasks. The idea is to write detailed specs first, with the agent helping through research and interviewing, decompose the task into smaller subtasks, write a detailed spec for each subtask, and implement each subtask separately. You can restart the session after every step in the workflow and after each subtask implementation, since all requirements are materialized in specs. This keeps the session context focused on a single task at a time, improves adherence, reduces cost, and makes it possible to implement bigger tasks that are hard to implement with plain plan + code.
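The "restart the session after each subtask" part works because both the specs and the progress live on disk. A minimal sketch of that idea follows. The file layout (`task.md`, one `.md` per subtask, `state.json`) and the function names are assumptions made for illustration and are not the layout sddw actually uses.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def write_specs(root: Path, task: str, subtasks: dict[str, str]) -> None:
    """Materialize the task spec and one spec file per subtask on disk."""
    root.mkdir(parents=True, exist_ok=True)
    (root / "task.md").write_text(task)
    for name, spec in subtasks.items():
        (root / f"{name}.md").write_text(spec)
    # Progress is stored on disk too, so a fresh session can resume from it.
    (root / "state.json").write_text(json.dumps({"done": []}))


def next_subtask(root: Path) -> Optional[tuple[str, str]]:
    """Return (name, spec) of the first unfinished subtask, or None when all are done."""
    done = json.loads((root / "state.json").read_text())["done"]
    for path in sorted(root.glob("*.md")):
        if path.stem != "task" and path.stem not in done:
            return path.stem, path.read_text()
    return None


def mark_done(root: Path, name: str) -> None:
    """Record a finished subtask in the on-disk state."""
    state_file = root / "state.json"
    state = json.loads(state_file.read_text())
    state["done"].append(name)
    state_file.write_text(json.dumps(state))


with tempfile.TemporaryDirectory() as tmp:
    specs = Path(tmp) / "specs"
    write_specs(specs, "Add CSV export", {
        "01_parser": "Parse rows into records.",
        "02_writer": "Write records as CSV.",
    })
    order = []
    while (item := next_subtask(specs)) is not None:
        name, spec = item  # a new session would receive only this one spec
        order.append(name)
        mark_done(specs, name)
    print(order)
```

Each loop iteration stands in for a fresh session: nothing is carried over in memory except what is read back from the spec directory.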

Discussion on hn: https://news.ycombinator.com/item?id=48231575

Repo: https://github.com/sermakarevich/sddw

Slides: https://docs.google.com/presentation/d/1SjKXF7hkoqyiN9-3tBGY...

New comment by sermakarevich in "Show HN: Lessons learned from running Claude Code swarms at scale"

sermakarevich — Fri, 05 Jun 2026 10:10:25 +0000

I have used it as a full control plane from the CLI for a couple of weeks, and entirely from the UI over the last few days. The UI now lets you manage the full cycle of agents: create a task, monitor it, respond to agents' questions, unblock jobs, etc.

Show HN: Lessons learned from running Claude Code swarms at scale

sermakarevich — Fri, 05 Jun 2026 04:34:09 +0000

Some time ago I built a simple app to run swarms of coding agents — I call it fleet (https://news.ycombinator.com/item?id=48256389). It's based on centralized beads with a Python orchestrator and can run any coder (Claude, agy, Codex). Recently I added a UI to manage the whole agent lifecycle: adding new tasks, monitoring running ones, and a chat interface built on MCP with a centralized SQLite DB. From the UI I can spawn agents to run in any directory, define dependencies on other tasks, and specify which coder/model should do the job. Today I can run 10–15 agents concurrently. At that scale you burn through limits very fast, so I spent some time investigating where those limits go and how to maximize efficiency. Here are the lessons learned after a few weeks of running the fleet:

- CLAUDE.md is a terrible abstraction. These files load unconditionally, they often contain descriptions irrelevant to the task at hand, and they stack from your working directory upward. The result is wasted tokens and confusion from injecting irrelevant instructions into the session.

- Skills are bad, but not as bad as CLAUDE.md. They use a progressive disclosure approach: only the skill description goes into the session, and Claude loads the full skill text with a tool when it's needed. That's one level better, but it still doesn't let you scale — you can't create 10K skills, as that would eat your entire usable context. Claude recently introduced a skills budget that silently drops less frequently used skills from the session entirely. You can still invoke them in an interactive session, but the model can't invoke them in a background session.

- Some plugins may be installed more than once. During cleanup I found that a few of mine were installed in multiple locations, consuming double the tokens on duplicated instructions.

- Attaching plugins to every session is a bad idea at scale. You want to be precise about which plugins are actually useful and attach them per task.

- Use a hierarchical knowledge base instead of CLAUDE.md / skills / plugins. It lets you benefit from real progressive disclosure: keep your instructions and tool descriptions in it and let Claude navigate through it quickly and cheaply.

- System tools consume ~15K tokens (7% of the session). You can't manage this — they're just attached, and disabling tools doesn't remove them from the context.

- AskUserQuestion isn't available in background sessions. You need to implement your own tool — MCP- or CLI-based — to give `claude -p` the ability to talk to you.

- You become selective about which model handles each task. Decompose work into harder and simpler subtasks so you can route the simpler ones to weaker, cheaper models and save tokens.

- Your context-switching skill improves over time.
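The point about routing simpler subtasks to cheaper models can be sketched as a small lookup. The tier names `cheap-model` and `strong-model`, the `Subtask` class, and the difficulty labels are hypothetical placeholders, not identifiers from fleet.

```python
from dataclasses import dataclass

# Hypothetical model tiers; substitute whatever coders/models you actually run.
MODEL_BY_DIFFICULTY = {"simple": "cheap-model", "hard": "strong-model"}


@dataclass
class Subtask:
    name: str
    difficulty: str  # "simple" or "hard", assigned when the work is decomposed


def route(subtasks: list[Subtask]) -> dict[str, list[str]]:
    """Group subtask names by the model tier that should handle them."""
    plan: dict[str, list[str]] = {}
    for task in subtasks:
        model = MODEL_BY_DIFFICULTY[task.difficulty]
        plan.setdefault(model, []).append(task.name)
    return plan


plan = route([
    Subtask("rename config keys", "simple"),
    Subtask("design retry logic", "hard"),
    Subtask("update README", "simple"),
])
print(plan)
```

The difficulty label has to be decided at decomposition time, which is why the routing depends on splitting work into harder and simpler pieces first.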

Fleet repo: https://github.com/sermakarevich/fleet

Comments URL: https://news.ycombinator.com/item?id=48407998

Points: 10

# Comments: 6

New comment by sermakarevich in "From Spec-Driven Development to Compilable Specs"

sermakarevich — Sat, 30 May 2026 05:17:24 +0000

The spec-driven approach is also great for task decomposition, context control and adherence: https://news.ycombinator.com/item?id=48231575

And not only for coding but also for non-coding tasks, e.g. deep marketing/sales research: https://news.ycombinator.com/item?id=48306730

New comment by sermakarevich in "Dynamic Workflows in Claude Code"

sermakarevich — Fri, 29 May 2026 07:44:45 +0000

Not sure why Claude does not have an AskUserQuestion implementation that works for spawned sessions: subagents, teams, workflows. Without it, spawning hundreds of subagents and waiting for the final result without a single input feels a bit risky.

Here is the solution to it. Built on a SQLite DB and MCP, blocking until the question is answered, supporting all possible question types, with a CLI or web interface for answers, `ask_human_question` fills the gap in efficient subagent management.

https://news.ycombinator.com/item?id=48320233

Show HN: Claude Code AskUserQuestion which works for subagents/teams/workflows

sermakarevich — Fri, 29 May 2026 07:38:59 +0000

Claude Code does not support AskUserQuestion in spawned sessions - subagents, agent teams, and workflows — which makes a swarm of agents less efficient in the original implementation. This can be easily fixed with an additional MCP.

Built on a SQLite DB and MCP, blocking until the question is answered, supporting all possible question types, with a CLI or web interface for answers, `ask_human_question` fills the gap in efficient subagent management.
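The "SQLite DB, blocking until the question is answered" idea can be illustrated with a short sketch. This is not the `ask_human_question` implementation: the MCP layer is omitted, and the table schema, function names, and polling approach are assumptions chosen to keep the example small.

```python
import sqlite3
import tempfile
import threading
import time
from pathlib import Path


def connect(db: Path) -> sqlite3.Connection:
    """Open the shared database and make sure the (hypothetical) schema exists."""
    conn = sqlite3.connect(db, timeout=5)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS questions ("
        "id INTEGER PRIMARY KEY, text TEXT NOT NULL, answer TEXT)"
    )
    conn.commit()
    return conn


def ask(db: Path, text: str, poll: float = 0.05, timeout: float = 5.0) -> str:
    """Store a question, then block until someone fills in the answer."""
    conn = connect(db)
    try:
        qid = conn.execute("INSERT INTO questions (text) VALUES (?)", (text,)).lastrowid
        conn.commit()
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            row = conn.execute("SELECT answer FROM questions WHERE id = ?", (qid,)).fetchone()
            if row[0] is not None:
                return row[0]
            time.sleep(poll)
        raise TimeoutError(f"question {qid} was not answered")
    finally:
        conn.close()


def answer_next(db: Path, reply: str) -> bool:
    """Answer the oldest open question; returns False if none is waiting."""
    conn = connect(db)
    try:
        row = conn.execute(
            "SELECT id FROM questions WHERE answer IS NULL ORDER BY id LIMIT 1"
        ).fetchone()
        if row is None:
            return False
        conn.execute("UPDATE questions SET answer = ? WHERE id = ?", (reply, row[0]))
        conn.commit()
        return True
    finally:
        conn.close()


with tempfile.TemporaryDirectory() as tmp:
    db = Path(tmp) / "questions.db"
    connect(db).close()  # create the schema before any thread starts

    def human() -> None:
        # Stand-in for the CLI or web UI: wait for a question, then reply.
        while not answer_next(db, "yes"):
            time.sleep(0.05)

    worker = threading.Thread(target=human, daemon=True)
    worker.start()
    reply = ask(db, "Proceed with the migration?")
    worker.join()
    print(reply)
```

Because the question and answer pass through a shared database file rather than the agent's own terminal, the asking side can be a background session while the answering side is any separate process.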

Repo: https://github.com/sermakarevich/claude/tree/main/mcp/ask_hu...

Comments URL: https://news.ycombinator.com/item?id=48320233

Points: 1

# Comments: 0

New comment by sermakarevich in "Show HN: Fleet – Python supervisor for running coding agents in parallel"

sermakarevich — Thu, 28 May 2026 16:55:14 +0000

these are great points, I think

have you seen this happening after auto was introduced?