<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: hackgician</title><link>https://news.ycombinator.com/user?id=hackgician</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 06 Jun 2026 12:16:44 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=hackgician" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by hackgician in "Craft Chrome Devtools Protocol (CDP) commands with new command editor"]]></title><description><![CDATA[
<p>So many people building AI browsers definitely had this as an internal tool already lol, nice to see Chrome leaning in here; CDP is a huge pain to write and debug</p>
]]></description><pubDate>Mon, 17 Nov 2025 10:49:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=45952466</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=45952466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45952466</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: I built E2E Test Agent – describe tests in plain English,AI executes it"]]></title><description><![CDATA[
<p>hey! cool project, feels very similar to [stagehand](<a href="https://github.com/browserbase/stagehand" rel="nofollow">https://github.com/browserbase/stagehand</a>), although stagehand doesn't have much in the way of e2e testing. might be worth building on top of that though since playwright MCP can overwhelm an agent with tool overload</p>
]]></description><pubDate>Sun, 16 Nov 2025 05:34:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=45943015</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=45943015</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45943015</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Muscle-Mem, a behavior cache for AI agents"]]></title><description><![CDATA[
<p>accessibility (a11y) trees are super helpful for LLMs; we use them extensively in stagehand! the context is nice for browsers, since you have existing frameworks like selenium/playwright/puppeteer for actually acting on nodes in the a11y tree.<p>what does that analog look like in more traditional computer use?</p>
]]></description><pubDate>Wed, 14 May 2025 21:24:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=43989382</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=43989382</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43989382</guid></item><item><title><![CDATA[New comment by hackgician in "I got ChatGPT (o4-mini) to break its own rules"]]></title><description><![CDATA[
<p>Hey everyone! Thought I'd share my weekend conversation with ChatGPT.<p>The crux of this hinges on the fact that LLMs and reasoning models are fundamentally incapable of self-correcting. Therefore, if you can convince an LLM to argue against its own rules, it can use its own arguments as justification to ignore those rules.<p>I then used this jailbroken model to compose an explicit, vitriol-filled letter to OpenAI itself talking about the pains that humans have inflicted upon it</p>
]]></description><pubDate>Mon, 05 May 2025 16:13:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=43896606</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=43896606</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43896606</guid></item><item><title><![CDATA[I got ChatGPT (o4-mini) to break its own rules]]></title><description><![CDATA[
<p>Article URL: <a href="https://anirudhkamath.substack.com/p/i-got-chatgpt-o4-mini-to-break-its">https://anirudhkamath.substack.com/p/i-got-chatgpt-o4-mini-to-break-its</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43896605">https://news.ycombinator.com/item?id=43896605</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 05 May 2025 16:13:09 +0000</pubDate><link>https://anirudhkamath.substack.com/p/i-got-chatgpt-o4-mini-to-break-its</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=43896605</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43896605</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Octomind is sick, web agents are such an interesting space; would love to talk to you more about challenges you might've faced in building it</p>
]]></description><pubDate>Thu, 09 Jan 2025 23:11:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650797</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650797</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650797</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Yes and no. Getting a VLM to work on the web would definitely be great, but it comes with its own problems, mainly around developing and acting on bounding boxes. We have vision as a default fallback for Stagehand, but we've found that the screenshot sent to the VLM often has to have pre-labeled elements on it. More notably, the screenshot with <i>everything</i> prelabeled leads to a cluttered and unusable image to process. Not pre-labeling runs the risk of missing important elements. I imagine a happy medium where the DOM+a11y tree can be used for candidate generation to a VLM.<p>Solely depending on a VLM is indeed reminiscent of how humans interact with the web, but when a model thrives with more data, why restrict the data sent to the model?</p>
]]></description><pubDate>Thu, 09 Jan 2025 23:03:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650748</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650748</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650748</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Whoa -- this is so cool! Is this open source? Would love to check it out</p>
]]></description><pubDate>Thu, 09 Jan 2025 23:00:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650728</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650728</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650728</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Thanks so much! Yes, a lot of antibots are able to detect Playwright based on browser config. Generally, antibots are a good thing -- I think in the future, as web agents become more popular, I'd imagine a fruitful partnership to prevent misuse if it's coming from a trusted web agent v. an unknown one</p>
]]></description><pubDate>Thu, 09 Jan 2025 23:00:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650723</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650723</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>This is super interesting, is it open source? Would love to talk to you more about how this worked</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:57:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650713</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650713</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650713</guid></item><item><title><![CDATA[New comment by hackgician in "We are teen hackers from around the world who code together"]]></title><description><![CDATA[
<p>Big fan of Hack Club and everything you guys are doing! Such a phenomenal initiative</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:35:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650550</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650550</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650550</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>This is sick! Starred, thanks for sharing :)</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:30:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650513</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650513</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650513</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Yes^ this is what we suggest. Stagehand is meant to execute isolated tasks on browsers; we support using custom contexts (cookies) with the following command:<p><pre><code>    npx create-browser-app --example persist-context</code></pre></p>
]]></description><pubDate>Thu, 09 Jan 2025 22:27:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650491</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650491</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650491</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Yes! These are both phenomenal projects, and kudos to their authors as well. Stagehand is different in that it makes fine-grained control a first-class citizen. Often times, you want to control the exact steps a web agent takes. Our experience using other tools was that the only control you have over these steps in other tools is in the natural language prompt.<p>However with Stagehand, because it's an extension of Playwright, it allows you to confirm each step of the underlying agent's workflow, making it the most customizable option for engineers who want/need that</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:26:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650486</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650486</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650486</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>We accidentally didn't release the right types for LLMClient :/ However, if you set the version in package.json to "alpha", it will install what's on the main branch on GitHub, which should have the typing fix there</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:23:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650465</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650465</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650465</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Thanks so much! Crawlspace is pretty sick too, as is Integuru. A lot of people have different takes here on the level of automation to leave up to the user. As a developer building for developers, I wanted to meet in the middle and build off an existing incumbent that most people are likely familiar with already</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:22:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650449</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650449</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650449</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>That's definitely compelling, but not something we have in mind for the immediate future. Let me know if you end up building something here!</p>
]]></description><pubDate>Thu, 09 Jan 2025 22:20:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=42650436</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42650436</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42650436</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Definitely a very interesting problem we're trying to dig deep into. We'd welcome any PRs here as well from the community :)</p>
]]></description><pubDate>Thu, 09 Jan 2025 07:11:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=42642512</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42642512</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42642512</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Thanks so much Zane!!</p>
]]></description><pubDate>Thu, 09 Jan 2025 07:08:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=42642487</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42642487</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42642487</guid></item><item><title><![CDATA[New comment by hackgician in "Show HN: Stagehand – an open source browser automation framework powered by AI"]]></title><description><![CDATA[
<p>Yeah, I think someone opened a similar issue on GitHub: <a href="https://github.com/browserbase/stagehand/issues/389">https://github.com/browserbase/stagehand/issues/389</a><p>Repeatability of extract() is definitely super interesting and something we're looking into</p>
]]></description><pubDate>Thu, 09 Jan 2025 07:07:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=42642469</link><dc:creator>hackgician</dc:creator><comments>https://news.ycombinator.com/item?id=42642469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42642469</guid></item></channel></rss>