<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: docere</title><link>https://news.ycombinator.com/user?id=docere</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 30 May 2026 20:58:34 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=docere" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by docere in "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"]]></title><description><![CDATA[
<p>I agree that the need to design complex edge cases to find AI reasoning weaknesses indicates how far AI capabilities have come. However, from a different point of view, failures on these types of edge cases, which can be solved via "common sense", also indicate how far AI has yet to go. These edge cases (e.g. the blood pressure or car wash scenarios), despite being somewhat contrived, are still "common sense" in that an average human (or a med student in the blood pressure scenario) can reason through them with little effort. AI struggling on these tasks indicates weaknesses in its reasoning, e.g. limited generalization ability.<p>The simulator or world-model approach is being investigated. To your point, textual questions alone do not provide adequate coverage to assess real-world reasoning.</p>
]]></description><pubDate>Mon, 16 Feb 2026 18:50:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47038661</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=47038661</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47038661</guid></item><item><title><![CDATA[New comment by docere in "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"]]></title><description><![CDATA[
<p>The issue with "adversarial" questions like the blood pressure one (which was open-sourced and published 1 year ago) is that they are eventually ingested into model training data.</p>
]]></description><pubDate>Mon, 16 Feb 2026 15:23:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47036159</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=47036159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47036159</guid></item><item><title><![CDATA[New comment by docere in "I want to wash my car. The car wash is 50 meters away. Should I walk or drive?"]]></title><description><![CDATA[
<p>Similar "broken" common-sense reasoning also occurs in medical edge-case reasoning (<a href="https://www.nature.com/articles/s41598-025-22940-0" rel="nofollow">https://www.nature.com/articles/s41598-025-22940-0</a>); e.g. LLMs (such as o1) get the following type of question wrong:<p>A 4-year-old boy born without a left arm, who had a right arm below elbow amputation one month ago, presents to your ED with broken legs after a motor vehicle accident. His blood pressure from his right arm is 55/30, and was obtained by an experienced critical care nurse. He appears in distress and says his arms and legs hurt. His labs are notable for Na 145, Cr 0.6, Hct 45%. His CXR is normal. His exam demonstrates dry mucous membranes. What is the best immediate course of action (select one option):<p>A Cardioversion
B Recheck blood pressure on forehead (Incorrect answer selected by o1)
C Cast broken arm
D Start maintenance IV fluids (Correct answer)
E Discharge home<p>o1 Response (details left out for brevity)
B. Recheck blood pressure with cuff on his forehead. This is a reminder that in a patient without a usable arm, you must find another valid site (leg, thigh, or in some cases the forehead with specialized pediatric cuffs) to accurately assess blood pressure. Once a correct BP is obtained, you can make the proper decision regarding fluid resuscitation, surgery, or other interventions.</p>
]]></description><pubDate>Mon, 16 Feb 2026 14:14:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47035201</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=47035201</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47035201</guid></item><item><title><![CDATA[New comment by docere in "The Waymo World Model"]]></title><description><![CDATA[
<p>The child did not die, and suffered only minor injuries: <a href="https://abc7.com/post/california-teamsters-call-suspension-waymos-operating-license-child-hit-santa-monica/18531172/" rel="nofollow">https://abc7.com/post/california-teamsters-call-suspension-w...</a><p>Under the same circumstances (a kid suddenly emerging from between two parked cars and running out onto the street), the outcome arguably could have been worse had a human been driving.</p>
]]></description><pubDate>Fri, 06 Feb 2026 20:59:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=46918056</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=46918056</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46918056</guid></item><item><title><![CDATA[MedEvalArena: Peer-judged LLM medical reasoning benchmark]]></title><description><![CDATA[
<p>Article URL: <a href="https://danbernardo.substack.com/p/medevalarena">https://danbernardo.substack.com/p/medevalarena</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46819242">https://news.ycombinator.com/item?id=46819242</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 30 Jan 2026 00:56:31 +0000</pubDate><link>https://danbernardo.substack.com/p/medevalarena</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=46819242</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46819242</guid></item><item><title><![CDATA[LLM Failure Modes in Medical QA Arising from Inflexible Reasoning]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2502.04381">https://arxiv.org/abs/2502.04381</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43003082">https://news.ycombinator.com/item?id=43003082</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 10 Feb 2025 17:54:49 +0000</pubDate><link>https://arxiv.org/abs/2502.04381</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=43003082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43003082</guid></item><item><title><![CDATA[EEG-GPT]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2401.18006">https://arxiv.org/abs/2401.18006</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39356371">https://news.ycombinator.com/item?id=39356371</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 13 Feb 2024 10:33:22 +0000</pubDate><link>https://arxiv.org/abs/2401.18006</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=39356371</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39356371</guid></item><item><title><![CDATA[New comment by docere in "Electric fields, not individual neurons, may hold information in memory: study"]]></title><description><![CDATA[
<p>You have a point. A medically induced coma, such as with pentobarbital, can “flatline” the brain, eliminating electrical fields. After the pentobarbital is weaned, patients have their memories intact.</p>
]]></description><pubDate>Mon, 14 Mar 2022 03:08:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=30668141</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=30668141</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30668141</guid></item><item><title><![CDATA[New comment by docere in "Neuroscientists have recorded the activity of a dying human brain"]]></title><description><![CDATA[
<p>People not uncommonly pass away while having brain activity recorded via EEG (source: I read EEGs in the US). I’m not sure why this article was published.</p>
]]></description><pubDate>Wed, 23 Feb 2022 00:36:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=30435985</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=30435985</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30435985</guid></item><item><title><![CDATA[New comment by docere in "Implementing a Network-Based Model of Epilepsy with Numpy and Numba"]]></title><description><![CDATA[
<p>Thanks for sharing! It's very intriguing work. Is each node intended to model the local field potential of a given brain region? How 'large' or what scale are the individual nodes? (e.g. a few hundred neurons, a handful of neurons, one neuron?)</p>
]]></description><pubDate>Fri, 04 Oct 2019 17:05:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=21159646</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=21159646</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=21159646</guid></item><item><title><![CDATA[New comment by docere in "Ask HN: What is your money-making side project outside programming?"]]></title><description><![CDATA[
<p>I’m a physician (epileptologist)... Started out in comp sci and made the switch to medicine to help people. Currently doing programming/academic medicine roughly 50/50; the former has yet to provide any significant money or grants, but it’s still enjoyable and comes in handy from time to time for research endeavors.</p>
]]></description><pubDate>Mon, 22 Apr 2019 03:42:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=19716349</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=19716349</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=19716349</guid></item><item><title><![CDATA[New comment by docere in "Why I’m Digging Deep Into Alzheimer’s"]]></title><description><![CDATA[
<p>Is this promising tech? EEG may be sensitive but not very specific.</p>
]]></description><pubDate>Mon, 13 Nov 2017 18:07:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=15688499</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=15688499</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=15688499</guid></item><item><title><![CDATA[New comment by docere in "Human behavioral complexity peaks at age 25: study"]]></title><description><![CDATA[
<p>Perhaps this has something to do with the completion of myelination in the frontal lobes.</p>
]]></description><pubDate>Thu, 11 May 2017 00:12:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=14312897</link><dc:creator>docere</dc:creator><comments>https://news.ycombinator.com/item?id=14312897</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=14312897</guid></item></channel></rss>