<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ashmil</title><link>https://news.ycombinator.com/user?id=ashmil</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 03 Jul 2026 06:56:09 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ashmil" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ashmil in "Ask HN: Is Codex with GPT 5.5 Extra High being dumbed down?"]]></title><description><![CDATA[
<p>same thing happened to me. was on codex for about a month, felt that exact shift, ended up going back to claude max. not sure if it's routing or tuning but something changed and prompting around it didn't help.</p>
]]></description><pubDate>Wed, 01 Jul 2026 08:12:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48743657</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=48743657</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48743657</guid></item><item><title><![CDATA[Ask HN: We built cloud ops AI 3 years ago. Are managed agents a death sentence?]]></title><description><![CDATA[
<p>Three years ago we started our startups engineers can query their entire cloud (AWS, GCP, observability, on-call) in plain English, run actions, and get incident context in one place. We have paying enterprise customers and real usage. Not a side project.<p>The product has always felt obvious to us: teams waste 45+ minutes per incident just context-switching between Grafana, AWS Console, PagerDuty, and Slack before they even start debugging. We collapsed that.<p>But now Anthropic, AWS, and Google are all shipping their own managed agent orchestration layers. And every week another enterprise tells us their infra team wants to "just build it themselves with Claude."<p>I keep going back and forth. On one hand, three years of development is genuinely hard to replicate - the domain specificity matters. On the other hand, foundation model providers expanding down the stack has killed entire product categories before.<p>Has anyone else navigated this? Particularly curious whether other vertical AI founders have found a defensible position or are quietly pivoting.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47742347">https://news.ycombinator.com/item?id=47742347</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 12 Apr 2026 17:41:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47742347</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=47742347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47742347</guid></item><item><title><![CDATA[Show HN: Using AI to generate accurate illustrations for physiotherapy site]]></title><description><![CDATA[
<p>Hi HN,<p>My wife is a physiotherapist, and a while back, she decided to start compiling her clinical experience and patient care knowledge into a resource site, Heal And Move. The goal was to share clear, actionable articles on various physical conditions, rehabilitation plans, and specific exercises.<p>The Bottleneck:
The biggest hurdle we immediately ran into was visuals. Explaining a specific stretch or joint movement through text alone is tough. Relying on generic stock photos often doesn't accurately reflect the proper form or the specific muscle groups being targeted, and hiring medical illustrators for every single post just wasn't scalable.<p>The AI Solution:
Recently, we started integrating AI image and video generation into her workflow, and it's been a massive unlock. She can now translate her domain knowledge into precise prompts to generate accurate anatomical illustrations, exercise demonstrations, and step-by-step videos.<p>As many of you know, getting AI to render anatomically correct limbs and specific physiological poses without looking like body horror takes some trial and error. But by iterating on the prompts, she’s now able to produce high-quality, custom visuals that perfectly match her clinical articles. It has completely removed the visual bottleneck from her content creation.<p>I'd love for you to take a look. We're open to any feedback on the site, the visual accuracy of the AI-generated assets, or any thoughts from those who have tinkered with fine-tuning models for health/fitness illustrations!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47364185">https://news.ycombinator.com/item?id=47364185</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 13 Mar 2026 13:28:35 +0000</pubDate><link>https://www.healandmove.fit</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=47364185</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47364185</guid></item><item><title><![CDATA[New comment by ashmil in "I Reduced 5 hours of Testing my Agentic AI applcaition to 10 mins"]]></title><description><![CDATA[
<p>Hi HN,<p>I was spending over 5 hours manually testing my Agentic AI application before every patch and release. While automating my API and backend tests was straightforward, testing the actual chat UI was a massive bottleneck. I had to sit there, type out prompts, wait for the AI to respond, read the output, and ask follow-up questions. As the app grew, releases started taking longer just because of manual QA.<p>To solve this, I built Mantis. It’s an automated UI testing tool designed specifically to evaluate LLM and Agentic AI applications right from the browser.<p>Here is how it works under the hood:<p>Define Cases: You define the use cases and specific test cases you want to evaluate for your LLM app.<p>Browser Automation: A Chrome agent takes control of your application's UI in a tab.<p>Execution: It simulates a real user by typing the test questions into the chat UI and clicking send.<p>Evaluation: It waits for the response, analyzes the LLM's output, and can even ask context-aware follow-up questions if the test case requires it.<p>Reporting: Once a sequence is complete, it moves to the next test case. Everything is logged and aggregated into a dashboard report.<p>The biggest win for me is that I can now just kick off a test run in a background Chrome tab and get back to writing code while Mantis handles the tedious chat testing.<p>I’d love to hear your thoughts. How are you all handling end-to-end UI testing for your chat apps and AI agents? Any feedback or questions on the approach are welcome!</p>
]]></description><pubDate>Wed, 11 Mar 2026 07:35:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47332644</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=47332644</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47332644</guid></item><item><title><![CDATA[I Reduced 5 hours of Testing my Agentic AI applcaition to 10 mins]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/onepaneai/mantis">https://github.com/onepaneai/mantis</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47332643">https://news.ycombinator.com/item?id=47332643</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 11 Mar 2026 07:35:39 +0000</pubDate><link>https://github.com/onepaneai/mantis</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=47332643</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47332643</guid></item><item><title><![CDATA[Show HN: IdeaValidate – AI-Powered Tool to Validate Your Startup Ideas Fast]]></title><description><![CDATA[
<p>Article URL: <a href="https://ideavalidate.daretobuild.today/">https://ideavalidate.daretobuild.today/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43805251">https://news.ycombinator.com/item?id=43805251</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 26 Apr 2025 17:00:11 +0000</pubDate><link>https://ideavalidate.daretobuild.today/</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43805251</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43805251</guid></item><item><title><![CDATA[Is Product Hunt Dying?]]></title><description><![CDATA[
<p>I've noticed a surge in low-effort launches, copycat products, and templated AI spam.<p>What’s worse? Paid upvotes are winning the day. Quality no longer equals visibility.<p>Real builders pouring months into crafting unique, valuable products are often buried under noise.
Meanwhile, “Weekend hacks” with fancy landing pages and fake engagement top the charts.<p>It used to be a place where innovation got discovered. Now it’s a leaderboard of who has the better hype machine.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43701815">https://news.ycombinator.com/item?id=43701815</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 16 Apr 2025 05:31:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=43701815</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43701815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43701815</guid></item><item><title><![CDATA[New comment by ashmil in "First Impressions of Google's Firebase Studio"]]></title><description><![CDATA[
<p>What i liked<p>Generate a Blueprint
Creates a basic structure and suggests necessary components.<p>Firebase Integration: Likely designed for seamless integration with Firebase services (though not deeply explored in the review)<p>Public Link Sharing: Enables sharing a public link to the prototyped application</p>
]]></description><pubDate>Thu, 10 Apr 2025 09:16:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=43642178</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43642178</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43642178</guid></item><item><title><![CDATA[First Impressions of Google's Firebase Studio]]></title><description><![CDATA[
<p>Article URL: <a href="https://insights.daretobuild.today/p/firebase-studio-first-impressions-of-google-s-ai-development-environment">https://insights.daretobuild.today/p/firebase-studio-first-impressions-of-google-s-ai-development-environment</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43642177">https://news.ycombinator.com/item?id=43642177</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 10 Apr 2025 09:16:18 +0000</pubDate><link>https://insights.daretobuild.today/p/firebase-studio-first-impressions-of-google-s-ai-development-environment</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43642177</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43642177</guid></item><item><title><![CDATA[Things I Wish I Knew Before Building My Product]]></title><description><![CDATA[
<p>Article URL: <a href="https://insights.daretobuild.today/p/things-i-wish-i-knew-before-building-my-product">https://insights.daretobuild.today/p/things-i-wish-i-knew-before-building-my-product</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43629095">https://news.ycombinator.com/item?id=43629095</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 09 Apr 2025 05:16:54 +0000</pubDate><link>https://insights.daretobuild.today/p/things-i-wish-i-knew-before-building-my-product</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43629095</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43629095</guid></item><item><title><![CDATA[OpenAI lanched o1 Pro with Higher reasoning why is it 100X costoy that o3-mini?]]></title><description><![CDATA[
<p>Article URL: <a href="https://platform.openai.com/docs/models/compare?model=o1-pro">https://platform.openai.com/docs/models/compare?model=o1-pro</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43432439">https://news.ycombinator.com/item?id=43432439</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 21 Mar 2025 06:36:31 +0000</pubDate><link>https://platform.openai.com/docs/models/compare?model=o1-pro</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43432439</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43432439</guid></item><item><title><![CDATA[Build Your SaaS Faster with a Micro-Back End]]></title><description><![CDATA[
<p>Article URL: <a href="https://insights.daretobuild.today/p/micro-backend-the-fast-track-to-saas-development">https://insights.daretobuild.today/p/micro-backend-the-fast-track-to-saas-development</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43423188">https://news.ycombinator.com/item?id=43423188</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 20 Mar 2025 13:38:54 +0000</pubDate><link>https://insights.daretobuild.today/p/micro-backend-the-fast-track-to-saas-development</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43423188</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43423188</guid></item><item><title><![CDATA[Newsletter for Technology evaluation and Product Building for startups]]></title><description><![CDATA[
<p>Article URL: <a href="https://insights.daretobuild.today/">https://insights.daretobuild.today/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43376701">https://news.ycombinator.com/item?id=43376701</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 16 Mar 2025 03:10:00 +0000</pubDate><link>https://insights.daretobuild.today/</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43376701</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43376701</guid></item><item><title><![CDATA[MCP – A Standardized Bridge Between LLMs and External Tools]]></title><description><![CDATA[
<p>Article URL: <a href="https://daretobuild.beehiiv.com/p/mcp-a-standardized-bridge-between-llms-and-external-tools">https://daretobuild.beehiiv.com/p/mcp-a-standardized-bridge-between-llms-and-external-tools</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43357247">https://news.ycombinator.com/item?id=43357247</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 13 Mar 2025 21:09:28 +0000</pubDate><link>https://daretobuild.beehiiv.com/p/mcp-a-standardized-bridge-between-llms-and-external-tools</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43357247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43357247</guid></item><item><title><![CDATA[Show HN: Personal CRM for Founders and Professional]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.worktodo.today/">https://www.worktodo.today/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43101080">https://news.ycombinator.com/item?id=43101080</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 19 Feb 2025 11:49:49 +0000</pubDate><link>https://www.worktodo.today/</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43101080</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43101080</guid></item><item><title><![CDATA[New comment by ashmil in "I Built This Tool to Solve My Own Chaos"]]></title><description><![CDATA[
<p>I was drowning in scattered notes, reminders, and follow-ups—some in WhatsApp, some in emails, some just floating in my head. Traditional CRMs felt like overkill, and to-do lists lacked the context I needed. I wanted something as easy as jotting things down on a sticky note but smart enough to organize everything for me.<p>So, I built WorkToDoToday.<p>Imagine typing or saying:<p>"Remind me tomorrow to send a follow-up email to Customer XYZ regarding our PoC next month."<p>And instantly, the app:<p>* Creates a task with a due date of tomorrow
* Tags it to Customer XYZ with a clear title and description
* Lets me see all related tasks, notes, and past interactions in one place<p>Why It’s Different:<p>* No forms or complex workflows—just type naturally, and the app organizes everything
* Manage tasks through chat or UI, whichever works best for you
* See all interactions tied to a contact for a full view of your relationships<p>It started as a personal tool, but after sharing it, I realized a lot of founders and busy professionals needed this too. If you struggle with managing follow-ups like I did, check it out: <a href="https://www.worktodo.today" rel="nofollow">https://www.worktodo.today</a> Would love to hear your thoughts!</p>
]]></description><pubDate>Thu, 13 Feb 2025 09:47:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=43034346</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43034346</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43034346</guid></item><item><title><![CDATA[I Built This Tool to Solve My Own Chaos]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.worktodo.today">https://www.worktodo.today</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43034345">https://news.ycombinator.com/item?id=43034345</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 13 Feb 2025 09:47:20 +0000</pubDate><link>https://www.worktodo.today</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=43034345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43034345</guid></item><item><title><![CDATA[Show HN: Personal CRM for Founders and Professionals]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.worktodo.today/about.html">https://www.worktodo.today/about.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42997397">https://news.ycombinator.com/item?id=42997397</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 10 Feb 2025 06:08:04 +0000</pubDate><link>https://www.worktodo.today/about.html</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=42997397</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42997397</guid></item><item><title><![CDATA[Do you need a personal CRM as a founder]]></title><description><![CDATA[
<p>Article URL: <a href="https://worktodo.today/">https://worktodo.today/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42059364">https://news.ycombinator.com/item?id=42059364</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 06 Nov 2024 10:20:31 +0000</pubDate><link>https://worktodo.today/</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=42059364</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42059364</guid></item><item><title><![CDATA[Is Anyone Else Frustrated with Current Note-Taking and Reminder Apps?]]></title><description><![CDATA[
<p>Title: Is Anyone Else Frustrated with Current Note-Taking and Reminder Apps?<p>I’ve been discussing with friends recently about the challenges we face when managing notes and tasks, and I got some interesting insights. Surprisingly, most people, regardless of their profession (founders, business development, marketing, engineers), seem dissatisfied with the current apps for note-taking and reminders. Despite having different needs, they all agree that they want something better.<p>Here are two examples:<p>One of my friends needs notes and reminders tied to his contacts. Right now, he’s manually jotting down important things about his contacts across different apps. For instance, small notes go into the Apple Contacts app, while larger notes and reminders are scattered between the Apple Notes and Reminders apps. It’s disorganized, and these apps don’t work well together.<p>Another friend explained how he’s been juggling between tools for years. During meetings or while talking to customers, quick thoughts come to his mind, and he wants to note them down fast. But, when you’re occupied, pulling up a dedicated app, filling out forms, or setting a time-based reminder is too slow. His go-to is still a plain notepad because there’s no time to mess around with structured tools. Plus, he doesn’t always need time-based reminders—he just needs to link notes and tasks to the relevant contacts for future follow-ups.<p>What I realized is that both of them, and likely many others, share a common need: notes and tasks need to be connected to people—whether colleagues, customers, or professional contacts. This happens when you’re making multiple decisions and handling various follow-ups daily.<p>So, does anyone else face similar frustrations? What are your thoughts?<p>I’m considering building a simple note-taking and reminder app that ties everything to your contacts—kind of like a personal CRM (though I hesitate to use that term).<p>Would love to hear your thoughts!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41845327">https://news.ycombinator.com/item?id=41845327</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 15 Oct 2024 05:50:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=41845327</link><dc:creator>ashmil</dc:creator><comments>https://news.ycombinator.com/item?id=41845327</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41845327</guid></item></channel></rss>