Hacker News: alexsmirnov

New comment by alexsmirnov in "Monetization Gateway: Charge for any resource behind Cloudflare via x402"

alexsmirnov — Wed, 01 Jul 2026 21:37:12 +0000

If I pay for vending machine by corporate card on a business trip, it looks more like B2B

New comment by alexsmirnov in "The Radiation Exposure Lie"

alexsmirnov — Tue, 30 Jun 2026 23:25:34 +0000

From the first hand: I was in the air base service regiment, Ovruch ( 30 km from reactor ), 1 year in service at the time of disaster.

The average numbers have a little sense. By our measurements, it was't an "even" distribution, but some "hot spots" that had 10x time radiation level than surrounding territory.

The article focuses on cancer, but for me and my buddies, the worst was an impact on immune system. In the six months after disaster, the healthy boy before, I got: pneumonia, chicken pox ( and 2 more with me ), furunculosis ( the whole company got it as well ), and endless flu/fevers. The weak immune continued for about 10 years.

New comment by alexsmirnov in "Where is the AI jobs crisis?"

alexsmirnov — Tue, 09 Jun 2026 23:20:16 +0000

I more like my high school math teacher explanation: "Put your one hand on dry ice, another in the boiling water. On average, you feel warm and cozy"

New comment by alexsmirnov in "Using Git's rerere feature to escape recurring conflict hell"

alexsmirnov — Mon, 01 Jun 2026 17:39:15 +0000

We usually squash feature branches before merge. To squash before rebase, I use git reset --soft $(git merge-base develop HEAD) && git commit && git rebase develop - you have to resolve final conflicts only

New comment by alexsmirnov in "OpenAI backs Illinois bill that would limit when AI labs can be held liable"

alexsmirnov — Fri, 10 Apr 2026 17:31:59 +0000

Much longer than that, and was available way before an internet. I graduated STEM high school in St. Petersburg in 1981, and I had several classmates who were big funs of chemistry. That they were able to create from textbooks, school lab ingredients, and understanding:

WWI era poison gas, tear gas, potassium cyanide, and bunch of explosives like acetone peroxide.

LLMs have all of that knowledge in training data

New comment by alexsmirnov in "Qwen3.6-Plus: Towards real world agents"

alexsmirnov — Thu, 02 Apr 2026 16:05:49 +0000

Exactly.

I did create my own MCP with custom agents that combine several tools into a single one. For example, all WebSearch, WebFetch, Context7 exposed as a single "web research" tool, backed by the cheapest model that passes evaluation. The same for a codebase research

Use it with both Claude and Opencode saves a lot of time and tokens.

New comment by alexsmirnov in "Study: 'Security Fatigue' May Weaken Digital Defenses"

alexsmirnov — Mon, 23 Mar 2026 17:36:30 +0000

Almost instantly, compared to my experience working for a big health care provider... I waited 6 moths for IT department to allow me install development tools on work laptop.

And while security rules created enormous roadblocks for work, whey also left enough holes to be exploited. Before getting required permissions, I managed to create dual boot with linux and share files between 'approved' and 'illegal' systems

New comment by alexsmirnov in "Ask HN: AI productivity gains – do you fire devs or build better products?"

alexsmirnov — Mon, 23 Mar 2026 03:04:33 +0000

> you start off checking every diff like a hawk, expecting it to break things, but honestly, soon you see it's not necessary most of the time.

I see it's necessary ALL the time. The AI generated code can be used as scaffolding, but it's newer get close to real production quality. The expierence from small startup with team of 5 developers. I do review and approve all PRs, and none ever able to pass AI code review from the first iteration.

New comment by alexsmirnov in "Claudetop – htop for Claude Code sessions (see your AI spend in real-time)"

alexsmirnov — Mon, 16 Mar 2026 00:01:43 +0000

The calculation misses subagents tokens, that can be a significant differences. Better to parse session jsonl files ~/.claude/projects//.jsonl and ~/.claude/projects///subagents/.jsonl

New comment by alexsmirnov in "Code Review for Claude Code"

alexsmirnov — Tue, 10 Mar 2026 02:06:34 +0000

This mostly matches my own estimates for pr-review command that I use. But it's pretty sophisticated: 6 specialized agents, best practices skills, CVE database, bunch of scripts. To reduce cost, most of agents use cheap open source models.

New comment by alexsmirnov in "When AI writes the software, who verifies it?"

alexsmirnov — Wed, 04 Mar 2026 07:45:00 +0000

Actually, they extremely bad at that. All training data contains cod + tests, even if tests where created first. So far, all models that I tried failed to implement tests for interfaces, without access to actual code.

New comment by alexsmirnov in "What Claude Code chooses"

alexsmirnov — Thu, 26 Feb 2026 22:18:45 +0000

Considering how little data needed to poison llm https://www.anthropic.com/research/small-samples-poison , this is a way to replace SEO by llm product placement:

1. create several hundreds github repos with projects that use your product ( may be clones or AI generated )

2. create website with similar instructions, connect to hundred domains

3. generate reddit, facebook, X posts, wikipedia pages with the same information

Wait half a year ? until scrappers collect it and use to train new models

Profit...

New comment by alexsmirnov in "Why Developers Keep Choosing Claude over Every Other AI"

alexsmirnov — Thu, 26 Feb 2026 21:52:55 +0000

I do in similar way, connect claude code to litellm router that dispatches model requests to different providers: bedrock, openai, gemini, openrouter and ollama for opensource models. I have special slash command and script that collect information about session, project and observed problems to evaluation dataset. I can re-evaluate prompts and find models that do a job in particular agent faster/cheaper, or use automated prompt optimization to eliminate problems.

New comment by alexsmirnov in "My AI Adoption Journey"

alexsmirnov — Fri, 06 Feb 2026 06:00:42 +0000

For me, AI is the best for code research and review

Since some team members started using AI without care, I did create bunch of agents/skills/commands and custom scripts for claude code. For each PR, it collects changes by git log/diff, read PR data and spin bunch of specialized agents to check code style, architecture, security, performance, and bugs. Each agent armed with necessary requirement documents, including security compliance files. False positives are rare, but it still misses some problems. No PR with ai generated code passes it. If AI did not find any problems, I do manual review.

New comment by alexsmirnov in "LNAI – Define AI coding tool configs once, sync to Claude, Cursor, Codex, etc."

alexsmirnov — Tue, 03 Feb 2026 18:58:50 +0000

I did create and actively use a similar tool, but with different purpose: configure AI tools for each team member to use the same code style and architecture guides across projects. It includes: - build docker images for claude code and opencode dev containers. - creates custom MCP server that works as a proxy and combines several tools into a single one ( for example, web search, fetch, and context7 tools exposed as a single "web_research" that invokes custom code to answer question ) - copy code style, documentation, and best practice rules for technologies used in our projects - deploys a bunch of helper scripts useful for development - configure agents, skills, hooks, and commands to use those rules. Configuration changed per "mode" : documentation, onboarding, code review, and web development all have different settings. - run AI tools in docker container with limited permissions - feedback tool to generate session report, that is used for automatic evaluation and prompt optimization.

This came out of necessity, as active using of AI assistants in uncontrollable way significantly degraded code quality. The goal is to enforce the same development workflow across team This is internal tool. If someone interesting, I can create a public repo from it

New comment by alexsmirnov in "Ask HN: Do you also "hoard" notes/links but struggle to turn them into actions?"

alexsmirnov — Sat, 31 Jan 2026 07:17:29 +0000

I do use Obsidian on pair with Claude code and git.

I organize notes by tags, folders, and links from tree of "map of content" notes. Those documented as rules for AI. All notes came to "Inbox" folder, and from time to time I run special script that checks inbox, formats notes, tags them, and put in the most appropriate place. "git diff" to check results and fix mistakes, reset if it went wrong.

As notes organized by the limited number of well defined rules, they became easy to search and navigate by AI. Claude Code easily finds requested notes, working as advanced search engine, and they became a starting point for "deep research" : find relevant notes, follow links, detect gaps, search internet. Repeat until reach required confidence level.

The most advanced workflow so far is combination of TRIZ (Theory of Inventive Problem Solving) + First Principles Framework. Former generates ideas and hypotheses, later validates them and converge on final answer.

New comment by alexsmirnov in "The Code-Only Agent"

alexsmirnov — Mon, 19 Jan 2026 06:27:29 +0000

This was implemented far ago, at least by huggingface "smolagents". https://huggingface.co/docs/smolagents/index . I did use them, with evaluations. For the most cases, modern models tool call outperforms code agent. They just trained to use tools, not a code

New comment by alexsmirnov in "Don't fall into the anti-AI hype"

alexsmirnov — Mon, 12 Jan 2026 19:14:17 +0000

This is exact the impression that I got. Every question or task given to LLM returns pretty reasonable, but flawed result. For the coding, those are hard to spot but dangerous mistakes. They all look good and perfectly reasonable, but just wrong. Anthropic compared Claude Code to a "slot machine", and I fell that AI coding now is something close to gambling addiction. As small wins keep gambler to make more bets, so correct results from AI keep developers to use it: "I see it made correct solution, let's try again!" At a startup CTO, I review most of the pull requests from team members, and team uses AI tools actively. The overall picture strongly confirms your second conclusion.

New comment by alexsmirnov in "Why users cannot create Issues directly"

alexsmirnov — Sat, 03 Jan 2026 20:09:53 +0000

This is not about understanding the message, but switching user mental activity. I go myself in the similar situations many times. One example: I tried to pay my bills in online bank application, but got into error. After several attempts, I did read message and it say "Header size exceed..." . It give me clue that app probably put too much history into cookies. Clear browser data, log in again, and all got works.

Even when error message was clearly understandable for my expertise, it took surprisingly long tome to switch from one mental activity - "Pay bills", to another - "Investigate technical problem". And you have to throw away all short memory to switch into another task. So all rumors about "stupid" users is direct consequence from how human mind works.

New comment by alexsmirnov in "The Gorman Paradox: Where Are All the AI-Generated Apps?"

alexsmirnov — Sun, 14 Dec 2025 19:48:39 +0000

A lot of discussions around vibe AI coding flaws: awful architecture, performance problems, security holes, lack of maintainability, bugs, and low code quality. All correct, but none of those is matter if:

- you create small utility that covers only features needed only for you. As many researches show that any individual uses only less than 20% of software functionality, your tool covers only 10-20% that matters for you

- it only runs locally, on user computer or phone, and never has more than one customer. Performance, security, compliances do not matter

- the code lies next to application, and small enough to fix any bug instantly, in a single AI agent run

- as a single user, you don't care about design, UX, or marketing. Do the job is only matter

It means, majority of vibe coded applications run under radar, used only by a few individuals. I can see it myself: I have a bunch of vibe code utilities that never intended for a broad auditory . And, many of my friend and customers, mention the same: "I vibe coded utility that does ... for me". This means a big consequences for software development: the area for commercial development shrinks, nothing that can be replaced by the small local utility has a market value.