<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: activatedgeek</title><link>https://news.ycombinator.com/user?id=activatedgeek</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 30 Jun 2026 23:08:11 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=activatedgeek" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Sanjeev Arora – How could a Superhuman AI mathematician come about? [video]]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=gXiVVZypHJg">https://www.youtube.com/watch?v=gXiVVZypHJg</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47492152">https://news.ycombinator.com/item?id=47492152</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 23 Mar 2026 17:01:44 +0000</pubDate><link>https://www.youtube.com/watch?v=gXiVVZypHJg</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=47492152</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47492152</guid></item><item><title><![CDATA[New comment by activatedgeek in "Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output"]]></title><description><![CDATA[
<p>Congratulations on the strong reception of min-p. Very clever!<p>We may be talking about two orthogonal things here. And also to be clear, I don't care about theoretical guarantees either.<p>Now, min-p is solving for the inadequacies of standard sampling techniques. It is almost like a clever adaptive search which other sampling methods fail at (despite truncations like top-k/top-p).<p>However, one thing that I noticed in the min-p results was that lower temperatures were almost always better in the final performance (and quite expectedly the inverse for creating writing). This observation makes me think that the underlying model is generally fairly good at ranking the best tokens. What sampling allows us is a margin-for-error in cases where the model ranked a relevant next token not at the top, but slightly lower.<p>Therefore, my takeaway from min-p is that it solves for deficiencies of current samplers but its success is not in contradiction to the fact that logprobs are bad proxies for semantics. Sampling is the simplest form of search, and I agree with you that better sampling methods are a solid ingredient to extract information from logprobs.</p>
]]></description><pubDate>Mon, 03 Feb 2025 21:11:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42923196</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=42923196</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42923196</guid></item><item><title><![CDATA[New comment by activatedgeek in "Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output"]]></title><description><![CDATA[
<p>That has been my understanding too. More generally, a verifier at the end certainly helps.<p>In our paper [1], we find that asking a follow up question like "Is the answer correct?" and taking the normalized probability of "Yes" or "No" token (or more generally any such token trained for) seems to be best bet so far to get well-calibrated probabilities out of the model.<p>In general, the log-probability of tokens is not a good indicator of anything other than satisfying the pre-training loss function of predicting the "next token." (it likely is very well-calibrated on that task though) Semantics of language are a much less tamable object, especially when we don't quite have a good way to estimate a normalizing constant because every answer can be paraphrased in many ways and still be correct. The volume of correct answers in the generation space of language model is just too small.<p>There is work that shows one way to approximate the normalizing constant via SMC [2], but I believe we are more likely to benefit from having a verifier at train-time than any other approach.<p>And there are stop-gap solutions to make log probabilities more reliable by only computing them on "relevant" tokens, e.g. only final numerical answer tokens for a math problem [3]. But this approach kind of side-steps the problem of actually trying to find relevant tokens. Perhaps something more in the spirit of System 2 attention which selects meaningful tokens for the generated output would be more promising [4].<p>[1]: <a href="https://arxiv.org/abs/2406.08391" rel="nofollow">https://arxiv.org/abs/2406.08391</a>
[2]: <a href="https://arxiv.org/abs/2404.17546" rel="nofollow">https://arxiv.org/abs/2404.17546</a>
[3]: <a href="https://arxiv.org/abs/2402.10200" rel="nofollow">https://arxiv.org/abs/2402.10200</a>
[4]: <a href="https://arxiv.org/abs/2311.11829" rel="nofollow">https://arxiv.org/abs/2311.11829</a></p>
]]></description><pubDate>Mon, 03 Feb 2025 18:09:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=42921027</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=42921027</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42921027</guid></item><item><title><![CDATA[EconGraphs]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.econgraphs.org">https://www.econgraphs.org</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42858809">https://news.ycombinator.com/item?id=42858809</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 28 Jan 2025 22:17:28 +0000</pubDate><link>https://www.econgraphs.org</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=42858809</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42858809</guid></item><item><title><![CDATA[There's a New Country Ranking and You're Not Going to Like It]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.atvbt.com/bmi/">https://www.atvbt.com/bmi/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42303085">https://news.ycombinator.com/item?id=42303085</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 03 Dec 2024 04:31:44 +0000</pubDate><link>https://www.atvbt.com/bmi/</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=42303085</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42303085</guid></item><item><title><![CDATA[Horse – The Organized Browser]]></title><description><![CDATA[
<p>Article URL: <a href="https://browser.horse">https://browser.horse</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41988058">https://news.ycombinator.com/item?id=41988058</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 29 Oct 2024 18:52:11 +0000</pubDate><link>https://browser.horse</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=41988058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41988058</guid></item><item><title><![CDATA[New comment by activatedgeek in "Learning to Reason with LLMs"]]></title><description><![CDATA[
<p>Reasoning tokens are indeed billed as output tokens.<p>> While reasoning tokens are not visible via the API, they still occupy space in the model's context window and are billed as output tokens.<p>From here: <a href="https://platform.openai.com/docs/guides/reasoning" rel="nofollow">https://platform.openai.com/docs/guides/reasoning</a></p>
]]></description><pubDate>Thu, 12 Sep 2024 20:27:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=41525151</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=41525151</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41525151</guid></item><item><title><![CDATA[Idyll]]></title><description><![CDATA[
<p>Article URL: <a href="https://idyll-lang.org/docs">https://idyll-lang.org/docs</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41365469">https://news.ycombinator.com/item?id=41365469</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 27 Aug 2024 08:12:40 +0000</pubDate><link>https://idyll-lang.org/docs</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=41365469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41365469</guid></item><item><title><![CDATA[New comment by activatedgeek in "Perceived Age"]]></title><description><![CDATA[
<p>This effect is very interesting.<p>Veritasium covered this effect in a video [1] for the interested.<p>[1]: <a href="https://www.youtube.com/watch?v=aIx2N-viNwY" rel="nofollow">https://www.youtube.com/watch?v=aIx2N-viNwY</a> (2016)</p>
]]></description><pubDate>Fri, 09 Aug 2024 20:30:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=41205164</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=41205164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41205164</guid></item><item><title><![CDATA[New comment by activatedgeek in "Ask HN: What is the best way to author blogs in 2024?"]]></title><description><![CDATA[
<p>I use Astro + Cloudflare Pages for my website [1]. I document the key bits of my stack here [2] for completeness.<p>I've been very happy with Astro because it is a good example of low floor and high ceiling software. I can start with plain HTML, make it more flexible with Astro language (still very close to HTML), make authoring easier with Markdown (+ lifestyle extensions from Remark/Rehype), and extend to frameworks like React on a need basis (which I use for some pages where I use maps).<p>[1]: <a href="https://sanyamkapoor.com" rel="nofollow">https://sanyamkapoor.com</a>
[2]: <a href="https://sanyamkapoor.com/kb/the-stack" rel="nofollow">https://sanyamkapoor.com/kb/the-stack</a></p>
]]></description><pubDate>Sat, 20 Jul 2024 20:28:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=41019546</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=41019546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41019546</guid></item><item><title><![CDATA[GPT Rapper]]></title><description><![CDATA[
<p>Article URL: <a href="https://gpt-rapper.com">https://gpt-rapper.com</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40992697">https://news.ycombinator.com/item?id=40992697</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 18 Jul 2024 05:15:16 +0000</pubDate><link>https://gpt-rapper.com</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=40992697</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40992697</guid></item><item><title><![CDATA[New comment by activatedgeek in "Ideas and Creativity (2019)"]]></title><description><![CDATA[
<p>The best thing that one can do for themselves to develop the creative "muscle" is to _own_ their time.<p>Unfortunately, I am yet to feel even close to such a breakthrough. I think very few are fortunate to afford such kind of luxury (as the author alludes to as well). There is always something to deliver for, a deadline to meet (although many would argue deadlines are a forcing constraint); a life waiting to happen. With a tiny bit of envy, I feel very happy and inspired when someone does achieve the "flow" state.<p>On the subject of "tools" to spur creativity, I have always been skeptical. It feels similar to believing that there is a productivity app right around the corner that will unleash your potential. For me, the only true indicator of my productivity has been actually putting in the _time_, making any kind of progress along a chosen direction and then re-evaluating.<p>What are fellow readers here doing to _own_ their time?</p>
]]></description><pubDate>Sat, 18 May 2024 21:15:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=40402169</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=40402169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40402169</guid></item><item><title><![CDATA[New comment by activatedgeek in "Pyinfra: Automate Infrastructure Using Python"]]></title><description><![CDATA[
<p>The fact that Pyinfra does not currently support a feature which can be implemented using Pyinfra philosophy does not make it different than Ansible. I believe that was what the parent comment was about.</p>
]]></description><pubDate>Tue, 30 Apr 2024 17:10:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=40213371</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=40213371</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40213371</guid></item><item><title><![CDATA[New comment by activatedgeek in "Pyinfra: Automate Infrastructure Using Python"]]></title><description><![CDATA[
<p>Any kind of provisioning doesn't seem too far a step though. It is just another "operation" with its own state management logic.</p>
]]></description><pubDate>Tue, 30 Apr 2024 16:10:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=40212634</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=40212634</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40212634</guid></item><item><title><![CDATA[New comment by activatedgeek in "Pyinfra: Automate Infrastructure Using Python"]]></title><description><![CDATA[
<p>I current use Ansible to setup both local and remote hosts. I've been very happy with it, and love that Pyinfra intends to support the Ansible connector.<p>My main gripe with Ansible is the YAML specification. Ansible chooses to separate the task specification and task execution. 
Pyinfra chooses to directly expose the Python layer, instead of using slightly ugly magic functions/variables. I like this approach more since it allows standard Pythonic control flow instead of using a new (arguably ugly and more hassle to maintain) grammar.<p>Excited for Pyinfra!</p>
]]></description><pubDate>Tue, 30 Apr 2024 15:53:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=40212449</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=40212449</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40212449</guid></item><item><title><![CDATA[Wikifier: Semantic Annotation Service for 100 Languages]]></title><description><![CDATA[
<p>Article URL: <a href="https://wikifier.org">https://wikifier.org</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39913375">https://news.ycombinator.com/item?id=39913375</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 03 Apr 2024 03:37:55 +0000</pubDate><link>https://wikifier.org</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=39913375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39913375</guid></item><item><title><![CDATA[New comment by activatedgeek in "But what is a GPT?  Visual intro to Transformers [video]"]]></title><description><![CDATA[
<p>If you use HuggingFace models, then a few simpler decoding algorithms are already implemented for `generate` method of all supported models.<p>Here is a blog post that describes it: <a href="https://huggingface.co/blog/how-to-generate" rel="nofollow">https://huggingface.co/blog/how-to-generate</a>.<p>I will warn you though that beam search is typically what you do NOT want. Beam search approximately optimizes for the "highest likely sequence at the token level." This is rarely what you need in practice with open-ended generations (e.g. a question-answering chat bot). In practice, you need "highest likely semantic sequence," which is much harder problem.<p>Of course, various approximations for semantic alignment are currently in the literature, but still a wide open problem.</p>
]]></description><pubDate>Mon, 01 Apr 2024 22:20:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=39900079</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=39900079</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39900079</guid></item><item><title><![CDATA[Unified Acceleration Foundation]]></title><description><![CDATA[
<p>Article URL: <a href="https://uxlfoundation.org">https://uxlfoundation.org</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39827939">https://news.ycombinator.com/item?id=39827939</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 26 Mar 2024 14:01:46 +0000</pubDate><link>https://uxlfoundation.org</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=39827939</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39827939</guid></item><item><title><![CDATA[New comment by activatedgeek in "How Chain-of-Thought Reasoning Helps Neural Networks Compute"]]></title><description><![CDATA[
<p>Makes sense. Thank you for a the QS reference!</p>
]]></description><pubDate>Mon, 25 Mar 2024 06:06:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=39813189</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=39813189</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39813189</guid></item><item><title><![CDATA[New comment by activatedgeek in "How Chain-of-Thought Reasoning Helps Neural Networks Compute"]]></title><description><![CDATA[
<p>I hesitate to the use description as "think," just biasing correlations for subsequent generations.<p>In any case, there is at least one work that shows that CoT may not be necessary and biasing the decoding path via logit probabilities is also promising. [1]<p>One could argue it still doesn't contradict the benefits of CoT, but I suspect there is nothing fundamental about CoT, except that we happened to have been pre-training on sequences that use certain prompts that were easy to conceive from a human's perspective.<p>[1]: <a href="https://arxiv.org/abs/2402.10200" rel="nofollow">https://arxiv.org/abs/2402.10200</a></p>
]]></description><pubDate>Fri, 22 Mar 2024 16:58:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=39792608</link><dc:creator>activatedgeek</dc:creator><comments>https://news.ycombinator.com/item?id=39792608</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39792608</guid></item></channel></rss>