<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ej88</title><link>https://news.ycombinator.com/user?id=ej88</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 22 Jun 2026 23:51:00 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ej88" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ej88 in "Twitter user posts a real Monet and says it's AI"]]></title><description><![CDATA[
<p>i think the thread shows:<p>- most people's perception of art is heavily affected by the framing (and to a lot of people ai = bad, and so they start seeing technical issues with it that could /never/ be made by Monet despite it being a Monet)<p>- but I think the critique here is more: even if someone recreated a Monet stroke-for-stroke, what's the value of this copy? I think the artist's personal life and context around the painting adds so much more to it compared to just being a pretty painting (perhaps this is the single most important part of what makes a painting interesting and valuable)</p>
]]></description><pubDate>Thu, 14 May 2026 18:09:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48139011</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48139011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48139011</guid></item><item><title><![CDATA[New comment by ej88 in "Interaction Models"]]></title><description><![CDATA[
<p>Ya, the demos were pretty contrived (feels like a running theme amongst the labs...)</p>
]]></description><pubDate>Tue, 12 May 2026 18:35:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48112379</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48112379</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48112379</guid></item><item><title><![CDATA[New comment by ej88 in "Through the looking glass of benchmark hacking"]]></title><description><![CDATA[
<p>swe bench pro has a public and private test set, where the private eval is from proprietary codebases only</p>
]]></description><pubDate>Tue, 12 May 2026 16:56:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48110933</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48110933</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48110933</guid></item><item><title><![CDATA[New comment by ej88 in "Through the looking glass of benchmark hacking"]]></title><description><![CDATA[
<p>This is cool!<p>I used to work on post-training & evals. it's really hard to make a good eval set and catch all forms of reward hacking. Excited to see more from poolside!</p>
]]></description><pubDate>Tue, 12 May 2026 16:54:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48110912</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48110912</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48110912</guid></item><item><title><![CDATA[New comment by ej88 in "Interaction Models"]]></title><description><![CDATA[
<p>An omni model seems very useful for real-time human-computer interaction, off the top of my head:<p>- Voice assistants<p>- Customer experience<p>- Gaming<p>- Meeting assistants<p>- Real-time coach or user assistant for using software<p>- Translation<p>- Real-time work on a computer controlled by voice (frontend / mobile dev, CAD, 3D modeling, etc)<p>Traditionally a lot of these use cases with LLM agents are higher latency because the model needs to wait for the speaker to finish, then decide to call a tool or respond - if they call a tool they need to process the tool result and decide if they want to call a tool or respond, etc...</p>
]]></description><pubDate>Tue, 12 May 2026 16:26:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48110497</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48110497</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48110497</guid></item><item><title><![CDATA[New comment by ej88 in "Software engineering may no longer be a lifetime career"]]></title><description><![CDATA[
<p>i would argue its the opposite<p>farming hit a ceiling because of demand<p>software today is heavily, heavily constrained by supply. demand is basically infinite for actually good software that solves problems people have (and people always have problems).</p>
]]></description><pubDate>Tue, 12 May 2026 15:47:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48109997</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48109997</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48109997</guid></item><item><title><![CDATA[New comment by ej88 in "Our AI started a cafe in Stockholm"]]></title><description><![CDATA[
<p>"She rejected several applicants with PhDs and engineering backgrounds, reasoning that their level of education could not compensate for a lack of hands-on specialty coffee experience."<p>This is depressing.</p>
]]></description><pubDate>Tue, 05 May 2026 21:39:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48028963</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48028963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48028963</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>1. part of the moat is their guardrails and obviously they are audited and tracked. there are agents issuing refunds and more at scale right now so not sure where the skepticism comes from.. you're free to try and jailbreak them<p>2. another part of the value prop of these companies is figuring out how to construct the proper harness to take advantage of the lower latency of faster models while shoring up the weaker intelligence, how you blend deterministic and non-deterministic behaviors, compliance etc.<p>its a hard problem which is why f500 is willing to pay up</p>
]]></description><pubDate>Tue, 05 May 2026 18:05:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48026222</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48026222</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48026222</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>1&2 are already happening, these startups take on brand liability and trust to do so<p>3 depends on how companies want to measure it, but lack of user submitting satisfaction score is not a good thing<p>you can use a model w/o reasoning, + use various tricks to simulate low latency</p>
]]></description><pubDate>Tue, 05 May 2026 03:21:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48017697</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48017697</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48017697</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>that's fair, most implementations in the industry are in the early stages and implementing a full powered agent with access to all the tools it needs is hard (very political as you can imagine). i hope over the next year you notice them getting better!</p>
]]></description><pubDate>Mon, 04 May 2026 22:03:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48015602</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48015602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48015602</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>adding some context as someone who works in this space<p>1. most people (average, non-tech people) reach for the phone to call in for easily solvable problems. Plus, if the agent is integrated deep enough & has tools to interact with crms, you can raise the ceiling on the types of problems it can solve.<p>You're trying to avoid the bad customer experience of human 1 reading off their script, then they transfer you to some other department who may or may not know how to solve your problem, and the entire interaction cost the company way more than the value created, so the company is disincentivized to help customers.<p>2. All the companies in this space start with the outsourced BPO market for cx (multi billion market still) but the next market is going to be in revenue generation and churn prevention at scale, i.e. how do you proactively avoid customer issues, how do you upsell and generate revenue instead of reducing cost, how do you keep customers happy?<p>3. I think more companies will pivot to outcome based pricing on the contrary, makes it so much more measurable than seat-based and protects margins better than usage based. Plus cx is one of the few industries with very well known metrics<p>4. Kind of? Most companies in this space don't use native voice models which are noticeably dumber, they use transcription + a stronger text model + TTS. The majority of customers can be handled with the latest SOTA text model and you need smart context engineering to handle the long tail of more complicated asks</p>
]]></description><pubDate>Mon, 04 May 2026 21:28:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48015250</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48015250</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48015250</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>its a moat vs. other startups and it carried them to multi-B valuation<p>obviously the product needs to deliver and nrr needs to be good in the long run</p>
]]></description><pubDate>Mon, 04 May 2026 21:21:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48015172</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48015172</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48015172</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>true. we'll see how many ai cos become profit printers a few years from now</p>
]]></description><pubDate>Mon, 04 May 2026 19:44:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48013951</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48013951</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48013951</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>ai skeptic fanfic evolves in fascinating ways every day</p>
]]></description><pubDate>Mon, 04 May 2026 18:28:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48012820</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48012820</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48012820</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>hes board chair of openai and is ex co-ceo of salesforce, ex cto of facebook, can get a meeting with any exec in F500...<p>their moat is distribution</p>
]]></description><pubDate>Mon, 04 May 2026 18:24:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48012753</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48012753</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48012753</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>ime its very implementation dependent<p>but even a simple impl to answer questions can knock out like 50% of callers who are tech-illiterate at 100x cheaper cost, it's just strictly better economics and better for those customers</p>
]]></description><pubDate>Mon, 04 May 2026 18:21:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48012710</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48012710</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48012710</guid></item><item><title><![CDATA[New comment by ej88 in "Sierra Raises $950M at $15B Valuation"]]></title><description><![CDATA[
<p>It's always interesting seeing how HN reacts to AI CX (as someone who works in this space). Yes, the tech savvy crowd loves to say how they always ask for a human and love old school phone trees<p>in reality 50-80% of callers come in with easily answerable questions because they don't know how to nav the website and prefer to ask in natural language<p>The vast majority of callers call in to resolve their issue, and most don't care if they are speaking to a bot because they just want their issue fixed. Agents (if implemented well) are an order of magnitude more effective at resolving issues compared to a call centre worker who is reading off a script and churn within 9 months<p>There's also the 2nd order effs of making CX cheap. before, there is the perverse incentive of companies trying to keep you off support because each call costs them way more than the value they get. if your cost per call drops 100x you can invest in turning a cost centre into a revenue driver (+ a better experience)</p>
]]></description><pubDate>Mon, 04 May 2026 18:18:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48012664</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=48012664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48012664</guid></item><item><title><![CDATA[New comment by ej88 in "AI's biggest critic has lost the plot"]]></title><description><![CDATA[
<p>im not sure i understand your reply, but it sounds like you're agreeing with me that yts biggest advantage is the network effect?</p>
]]></description><pubDate>Tue, 28 Apr 2026 16:08:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47936373</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=47936373</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47936373</guid></item><item><title><![CDATA[New comment by ej88 in "Software engineering may no longer be a lifetime career"]]></title><description><![CDATA[
<p>Ive been preparing somewhat for this, as someone who knows they aren't a top N% engineer. My current role involves a certain amount of sales and product in addition to SWE (and luckily I find it fun to talk to customers!)<p>I think it's prudent for a lot of swes to think about what a future looks like where most of the job is managing and unblocking agents.</p>
]]></description><pubDate>Tue, 28 Apr 2026 16:05:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47936343</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=47936343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47936343</guid></item><item><title><![CDATA[New comment by ej88 in "AI's biggest critic has lost the plot"]]></title><description><![CDATA[
<p>my main qualm with Ed is his analysis on the financials is decent, but he absolutely refuses to admit that the technology is useful (especially in the hands of competent users), and that all the labs are extremely compute starved due to overwhelming demand.</p>
]]></description><pubDate>Tue, 28 Apr 2026 15:50:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47936164</link><dc:creator>ej88</dc:creator><comments>https://news.ycombinator.com/item?id=47936164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47936164</guid></item></channel></rss>