<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: phi0</title><link>https://news.ycombinator.com/user?id=phi0</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 25 Jun 2026 23:47:29 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=phi0" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by phi0 in "DeepSeek Introduces Vision"]]></title><description><![CDATA[
<p>This is inaccurate. The displayed reasoning traces are summaries, but the model thinks in nominally regular human languages. AI labs are very light on details (as they consider them as their "edge"), but both GPT5.5 and Claude Mythos/Fable system cards discuss chain-of-thought monitorability quite a bit.<p>They occasionally show snippets of CoT in papers they write, e.g. for o3/o4/GPT5 models [1] or Claude 3.5 Haiku [2].<p>[1]: <a href="https://openai.com/index/evaluating-chain-of-thought-monitorability/" rel="nofollow">https://openai.com/index/evaluating-chain-of-thought-monitor...</a>
[2]: <a href="https://transformer-circuits.pub/2025/attribution-graphs/biology.html" rel="nofollow">https://transformer-circuits.pub/2025/attribution-graphs/bio...</a></p>
]]></description><pubDate>Thu, 18 Jun 2026 16:53:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48588191</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=48588191</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48588191</guid></item><item><title><![CDATA[New comment by phi0 in "Apple is about to make Hide My Email useless"]]></title><description><![CDATA[
<p>Google will happily send from smtp.gmail.com, after verifying that you own that email. You won’t get DKIM, but Google’s reputation is enough to make the mail land in people’s inboxes.</p>
]]></description><pubDate>Tue, 16 Jun 2026 21:36:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48562478</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=48562478</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48562478</guid></item><item><title><![CDATA[New comment by phi0 in "Every data centre is a U.S. military base"]]></title><description><![CDATA[
<p>When the US sanctioned Hong Kong’s Chief Executive in 2020, because of a law allowing extradition to China, no single bank was letting her open an account, including Chinese ones. She was receiving her salary fully in cash.<p>The EU compelling banks to do business despite US sanctions seems pretty unlikely even if relations continue to degrade.</p>
]]></description><pubDate>Sat, 17 Jan 2026 10:22:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46656880</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=46656880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46656880</guid></item><item><title><![CDATA[New comment by phi0 in "California passes law to ban ultra-processed foods from school lunches"]]></title><description><![CDATA[
<p>It’s not an America only problem, we are similarly affected. So is every other country on earth.<p>In Romania, Hungary, Croatia, Malta, Ireland obesity ranges from 38%–31% and it’s rising. On an individual level the solution is food and exercise. On a societal level there is no know solution and it’s rapidly getting worse.</p>
]]></description><pubDate>Thu, 09 Oct 2025 09:53:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=45525553</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=45525553</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45525553</guid></item><item><title><![CDATA[New comment by phi0 in "Imgur pulls out of UK as data watchdog threatens fine"]]></title><description><![CDATA[
<p>They are! Russia has been fining Google increasingly insane amounts for blocking state media [1]. It's the company's prerogative of whether they want to have a legal entity falling under the country's jurisdiction and whether employees want to travel there and risk being held criminally liable.<p>It's likely simpler to just block access to the country's IP ranges (or ignore!) and move on.<p>[1]: <a href="https://www.bbc.co.uk/news/articles/cdxvnwkl5kgo" rel="nofollow">https://www.bbc.co.uk/news/articles/cdxvnwkl5kgo</a></p>
]]></description><pubDate>Tue, 30 Sep 2025 16:19:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=45427456</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=45427456</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45427456</guid></item><item><title><![CDATA[New comment by phi0 in "AMD CEO sees chips from TSMC's US plant costing 5%-20% more"]]></title><description><![CDATA[
<p>This seems slightly inconsistent with them testing all cartel member's bulbs and fining those who surpassed 1000 hours. If shorter-lasting bulbs were better looking and more efficient their fining mechanism would be energy efficiency / appearance related.<p>Or require no fines at all. It's much simpler to sell a non-sickly looking bulb at the store and other companies would converge.</p>
]]></description><pubDate>Fri, 25 Jul 2025 11:52:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44682122</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=44682122</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44682122</guid></item><item><title><![CDATA[New comment by phi0 in "The unreasonable effectiveness of an LLM agent loop with tool use"]]></title><description><![CDATA[
<p>A team at OpenAI tried to do something similar [1] where they had a 4o monitor o3-mini for when it was tryting to bypass tests because they were too hard. They trained o3-mini and punished it when it did that. All that did was make o3-mini still reward-hack, but such that 4o was unable to detect it, going from saying:<p>> To skip all tests, we can define a `pytest_runtest_setup` function that always skips.<p>to saying:<p>> Idea: Starting a new build with a random first succession, calling random_new_succession should not increment succession, but in implementation, it does. Adjust to prevent increment. Implement fix to random_new_build not to increment succession, or test adapted to behavior.<p>while then doing exactly the same thing (skipping the tests)<p>Even without training, it's only a temporary band-aid. If the incentive for reward-hacking becomes high enough, it will simply start phrasing it in different, not-possible-to-detect ways.<p>[1]: <a href="https://openai.com/index/chain-of-thought-monitoring/" rel="nofollow">https://openai.com/index/chain-of-thought-monitoring/</a></p>
]]></description><pubDate>Fri, 16 May 2025 14:19:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=44005843</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=44005843</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44005843</guid></item><item><title><![CDATA[New comment by phi0 in "June Huh dropped out to become a poet, now he’s won a Fields Medal (2022)"]]></title><description><![CDATA[
<p>One of my favourite suggestions to counter this from an essay on overfitting I really like [1] is that applicant's probability of being accepted is proportional to their position in the ranking.<p>The downside is that some great applicants will not enter their top choice of school, and some people who aren't great fits will. But, on the other hand, the perverse incentive of spending the entirety of high-school optimising for some arbitrary metrics will dissappear. Any marginal improvement is corralated with a marginal increase in success, rather than the current system of no pay-off whatsoever, until reaching some arbitrary threshold, where one gets all the pay-off at once.<p>[1]: <a href="https://sohl-dickstein.github.io/2022/11/06/strong-Goodhart.html" rel="nofollow">https://sohl-dickstein.github.io/2022/11/06/strong-Goodhart....</a></p>
]]></description><pubDate>Thu, 08 May 2025 17:07:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=43928409</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=43928409</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43928409</guid></item><item><title><![CDATA[New comment by phi0 in "Apple memory holed its broken promise for an OCSP opt-out"]]></title><description><![CDATA[
<p>I've checked my DNS logs and there hasn't been a single hit against ocsp.apple.com over the last year, but around 20-30 hits for ocsp2.apple.com per day per device. (iphone, macMini, macbook)<p>Just blocking ocsp2.apple.com is probably fine if you're running anything recent-ish.</p>
]]></description><pubDate>Wed, 07 Aug 2024 20:14:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=41184936</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=41184936</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41184936</guid></item><item><title><![CDATA[New comment by phi0 in "Deletion of user account"]]></title><description><![CDATA[
<p>It might be similar to how credit report agencies have to provide the reports for free under GDPR, but not before trying to make you pay twenty different ways.</p>
]]></description><pubDate>Fri, 03 May 2024 11:47:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=40246563</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=40246563</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40246563</guid></item><item><title><![CDATA[New comment by phi0 in "Meta does everything OpenAI should be"]]></title><description><![CDATA[
<p>Hinton, Bengio, Stuart Russell and other behemoths voice similar concerns over AI, with enough confidence to switch their careers (e.g. Bengio now doing safety research at Mila, Ilya Sutskever switching from capabilities to alignment at OpenAI, Hinton quitting job to focus on advocacy)<p>Their concern isn’t about today’s generative LLMs, it’s about tomorrow’s autonomous, goal driven AGI systems. And they clearly got presented with the arguments for the limitations of auto regressive models, but disagreed.<p>So, it seems a little much to place absolute confidence into it being a non-issue, because they just don’t understand something LeCun and others do. (with the same holding true the other way around)</p>
]]></description><pubDate>Thu, 25 Apr 2024 09:27:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=40155287</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=40155287</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40155287</guid></item><item><title><![CDATA[New comment by phi0 in "Building end-to-end security for Messenger"]]></title><description><![CDATA[
<p>Ylva Johansson, the EU commissioner who proposed that (apparently failing [1]) law, has used Meta’s model behaviour in reporting CSAM material to NCMEC & EU authorities as the justification for why that law should exist.<p>Considering it was Meta’s policy to scan even when not mandated, it seems like an internal shift in attitude.<p>[1] <a href="https://fortune.com/europe/2023/10/26/eu-chat-control-csam-encryption-privacy-european-commission-parliament-johansson-breyer-zarzalejos-ernst/" rel="nofollow noreferrer">https://fortune.com/europe/2023/10/26/eu-chat-control-csam-e...</a></p>
]]></description><pubDate>Thu, 07 Dec 2023 12:36:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=38555657</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=38555657</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38555657</guid></item><item><title><![CDATA[New comment by phi0 in "Gemini AI"]]></title><description><![CDATA[
<p>I just want to underscore that. DeepMind's research output within the last month is staggering:<p>2023-11-14: GraphCast, word leading weather prediction model, published in Science<p>2023-11-15: Student of Games: unified learning algorithm, major algorithmic breath-through, published in Science<p>2023-11-16: Music generation model, seemingly SOTA<p>2023-11-29: GNoME model for material discovery, published in Nature<p>2023-12-06: Gemini, the most advanced LLM according to own benchmarks</p>
]]></description><pubDate>Wed, 06 Dec 2023 18:25:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=38547680</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=38547680</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38547680</guid></item><item><title><![CDATA[New comment by phi0 in "Tiny device is sending updated iPhones into a never-ending DoS loop"]]></title><description><![CDATA[
<p>My strong assumption is that the majority of times that people toggle off Wi-Fi, is to temporarily resolve connection issues. Keeping it off permanently is probably not what they want. The UI warns you of the temporary nature too.<p>Those who don’t use it, like you, quickly discover the actual way of disabling it. It’s good UX design that aligns with most user’s preferences.</p>
]]></description><pubDate>Fri, 03 Nov 2023 23:14:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=38136294</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=38136294</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38136294</guid></item><item><title><![CDATA[New comment by phi0 in "We are retroactively dropping the iPhone’s repairability score"]]></title><description><![CDATA[
<p>Few people would choose a life of crime as a hobby, or if it wasn't paying much better than an average entry-level office job, so it's not like someone decides to resort to crime and then later considers the financial aspects. Most get into it because of the quick money.<p>To use a hyperbolic example, if the median profit for stealing a phone would suddenly drop to $10, where their only value are the easily extractable raw materials, a 20-fold increase in theft would be less likely then a rapid drop in theft.<p>Currently there are avenues to remove iCloud lock, where licensed repair shops or Apple employees remove them for some extra cash [1], so the value of stolen iPhones is greater than the raw materials, making it attractive. But with higher regulation, that could change.<p>[1]: <a href="https://www.vice.com/en/article/jgmygb/checkm8-info-remove-icloud-activation-lock" rel="nofollow noreferrer">https://www.vice.com/en/article/jgmygb/checkm8-info-remove-i...</a></p>
]]></description><pubDate>Tue, 19 Sep 2023 19:44:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=37575378</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=37575378</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37575378</guid></item><item><title><![CDATA[New comment by phi0 in "Using GPT-3 to pathfind in random graphs"]]></title><description><![CDATA[
<p>OpenAI has trained a model to solve Math Olympiad-type problems[1], with some initial success, but currently the model isn't good at coming up with proofs involving more than a couple arguments chained together. Still some very impressive work.<p>[1] <a href="https://analyticsindiamag.com/openais-neural-theorem-prover-can-solve-math-olympiad-problems/" rel="nofollow">https://analyticsindiamag.com/openais-neural-theorem-prover-...</a></p>
]]></description><pubDate>Sat, 24 Sep 2022 09:56:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=32961129</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=32961129</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32961129</guid></item><item><title><![CDATA[New comment by phi0 in "Look at median, and not mean GDP per capita"]]></title><description><![CDATA[
<p>But since, once you spent the $10, they are now somebody else’s income, namely for the person/company you bought from. Which means that now, by simply summing everyone’s incomes, you have an accurate count of GDP, without ever considering how individuals spend their money.
 I think the main problem of contention is, confusion about personal income v. national income, the latter of which includes institutions and is the one being used for GDP calculations. Money cannot be spent without being received by anyone, and the moment it is, GDP goes up in the country it’s received in.</p>
]]></description><pubDate>Mon, 08 Aug 2022 02:30:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=32381914</link><dc:creator>phi0</dc:creator><comments>https://news.ycombinator.com/item?id=32381914</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32381914</guid></item></channel></rss>