<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: gsandahl</title><link>https://news.ycombinator.com/user?id=gsandahl</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 24 May 2026 20:54:39 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=gsandahl" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by gsandahl in "Show HN: AI Roundtable – Let 200 models debate your question"]]></title><description><![CDATA[
<p>Agree, this is where llms can uncover new perspectives!</p>
]]></description><pubDate>Wed, 25 Mar 2026 08:18:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47514667</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=47514667</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47514667</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: AI Roundtable – Let 200 models debate your question"]]></title><description><![CDATA[
<p>Oh lord, imagine asking ”serious” questions<p><a href="https://opper.ai/ai-roundtable/questions/you-are-standing-in-a-room-in-a-dungeon-with-two-doors-that-1bda300c" rel="nofollow">https://opper.ai/ai-roundtable/questions/you-are-standing-in...</a></p>
]]></description><pubDate>Tue, 24 Mar 2026 21:16:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47509457</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=47509457</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47509457</guid></item><item><title><![CDATA[New comment by gsandahl in "Benchmarking GPT-5 on 400 real-world code reviews"]]></title><description><![CDATA[
<p>Most of the tasks have assessed with ground truth, occasionally helped with an LLM as a judge to assess the answer if the answer is a sentence and not an exact result.<p>Example:
Given a long travel journal
How many cities does the author mention? 
GPT-5: 12
Expected: 17</p>
]]></description><pubDate>Fri, 08 Aug 2025 13:25:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=44836671</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44836671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44836671</guid></item><item><title><![CDATA[New comment by gsandahl in "Benchmarking GPT-5 on 400 real-world code reviews"]]></title><description><![CDATA[
<p>We are running task specific benchmarks across a number of categories (agentic tasks, context tasks, normalization tasks etc), and on our benchmarks we see Gpt-5 rating slightly below o3. But at a much lower cost.<p>See <a href="https://opper.ai/models" rel="nofollow">https://opper.ai/models</a></p>
]]></description><pubDate>Fri, 08 Aug 2025 13:22:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=44836649</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44836649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44836649</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>Please do and give us some feedback!</p>
]]></description><pubDate>Tue, 15 Jul 2025 17:17:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=44573532</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44573532</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44573532</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>I think just how far you can go with examples has been an interesting learning! As these models have become smarter, they are also getting better at reasoning from examples and understanding intent. We will be publishing some research in the next few days!</p>
]]></description><pubDate>Tue, 15 Jul 2025 17:17:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44573527</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44573527</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44573527</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>No up to date demo video unfortunately :(<p>Sounds like a great use case though!</p>
]]></description><pubDate>Tue, 15 Jul 2025 13:43:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=44571099</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44571099</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44571099</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>We have been thinking a bit about this, and one option would be to have some form of locally hosted runner. You can optimize the task in the cloud and deploy it locally. Something like that. It is possible to plug in custom models so technically feasible.</p>
]]></description><pubDate>Tue, 15 Jul 2025 13:42:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=44571091</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44571091</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44571091</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>Yes that's possible! You can populate examples of great outputs to task specific datasets and have those be automatically populated to the prompt. More info here: <a href="https://docs.opper.ai/capabilities/learning" rel="nofollow">https://docs.opper.ai/capabilities/learning</a></p>
]]></description><pubDate>Tue, 15 Jul 2025 13:34:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=44571004</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44571004</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44571004</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>Thanks for the shout out!</p>
]]></description><pubDate>Tue, 15 Jul 2025 13:32:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=44570977</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44570977</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44570977</guid></item><item><title><![CDATA[New comment by gsandahl in "Show HN: Opper AI – Task-Completion API for LLMs"]]></title><description><![CDATA[
<p>Co-Founder here  thanks for taking a look at Opper! I’m hanging around the thread all day, so feel free to ask anything, share feedback, or tell us where you’d like the product to go next</p>
]]></description><pubDate>Tue, 15 Jul 2025 13:08:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44570737</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=44570737</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44570737</guid></item><item><title><![CDATA[Schema Based Prompting: Structured Inputs for Predictable Outputs]]></title><description><![CDATA[
<p>Article URL: <a href="https://opper.ai/blog/schema-based-prompting">https://opper.ai/blog/schema-based-prompting</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43264856">https://news.ycombinator.com/item?id=43264856</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 05 Mar 2025 10:04:38 +0000</pubDate><link>https://opper.ai/blog/schema-based-prompting</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=43264856</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43264856</guid></item><item><title><![CDATA[New comment by gsandahl in "Opperator: A composable agent to automate tasks on the web"]]></title><description><![CDATA[
<p>Its on that trajectory at least :)</p>
]]></description><pubDate>Fri, 14 Feb 2025 07:15:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=43045726</link><dc:creator>gsandahl</dc:creator><comments>https://news.ycombinator.com/item?id=43045726</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43045726</guid></item></channel></rss>