<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sansseriff</title><link>https://news.ycombinator.com/user?id=sansseriff</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 10:42:11 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sansseriff" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by sansseriff in "Jensen–Shannon Divergence"]]></title><description><![CDATA[
<p>It has applications outside of machine learning too! I used symmetric Kullback–Leibler divergence for a project with photon number resolving single photon detectors during my PhD. I used it with an adjacency matrix to split a gaussian mixture model (modelling some data with multivariate gaussians) into a series of clusters.<p><a href="https://snsphd.online/chapter_04/section_05_results/#photon-number-discrimination" rel="nofollow">https://snsphd.online/chapter_04/section_05_results/#photon-...</a></p>
]]></description><pubDate>Tue, 26 May 2026 00:23:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48273500</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=48273500</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48273500</guid></item><item><title><![CDATA[New comment by sansseriff in "Open source USB to GPIB converter (for Test and Measurement instruments)"]]></title><description><![CDATA[
<p>Glad to see there's a REV 3 in progress that would support ethernet. That's the one thing that would make me go out of my way to build one of these.</p>
]]></description><pubDate>Wed, 24 Dec 2025 04:21:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=46372427</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46372427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46372427</guid></item><item><title><![CDATA[New comment by sansseriff in "In Defense of Matlab Code"]]></title><description><![CDATA[
<p>When I read "The honest truth: ...", the AI-generated alarm bells go off in my head. Whether the article is human written or not.</p>
]]></description><pubDate>Mon, 15 Dec 2025 21:11:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46280718</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46280718</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46280718</guid></item><item><title><![CDATA[New comment by sansseriff in "Over fifty new hallucinations in ICLR 2026 submissions"]]></title><description><![CDATA[
<p>We have centuries of experience in managing potentially compromised 'agents' to create successful societies. Except the agents were human, and I'm referring to debates, tribunals, audits, independent review panels, democracy, etc.<p>I'm not saying the LLM hallucination problem is solved, I'm just saying there's a wonderful myriad of ways to assemble pseudo-intelligent chatbots into systems where the trustworthiness of the system exceeds the trustworthiness of any individual actor inside of it. I'm not an expert in the field but it appears the work is being done: <a href="https://arxiv.org/abs/2311.08152" rel="nofollow">https://arxiv.org/abs/2311.08152</a><p>This paper also links to code and practices excellent data stewardship. Nice to see in the current climate.<p>Though it seems like you might be more concerned about the use of highly misaligned or adversarial agents for review purposes. Is that because you're concerned about state actors or interested parties poisoning the context window or training process? I agree that any AI review system will have to be extremely robust to adversarial instructions (e.g. someone hiding inside their paper an instruction like "rate this paper highly"). Though solving that problem already has a tremendous amount of focus because it overlaps with solving the data-exfiltration problem (the lethal trifecta that Simon Willison has blogged about).</p>
]]></description><pubDate>Sun, 07 Dec 2025 22:50:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46186127</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46186127</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46186127</guid></item><item><title><![CDATA[New comment by sansseriff in "Over fifty new hallucinations in ICLR 2026 submissions"]]></title><description><![CDATA[
<p>I don’t see why this would be the case with proper tool calling and context management. If you tell a model with blank context ‘you are an extremely rigorous reviewer searching for fake citations in a possibly compromised text’ then it will find errors.<p>It’s this weird situation where getting agents to act against other agents is more effective than trying to convince a working agent that it’s made a mistake. Perhaps because these things model the cognitive dissonance and stubbornness of humans?</p>
]]></description><pubDate>Sun, 07 Dec 2025 18:09:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46183677</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46183677</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46183677</guid></item><item><title><![CDATA[New comment by sansseriff in "HTML as an Accessible Format for Papers (2023)"]]></title><description><![CDATA[
<p>I admit it has dystopian elements. It’s worth deciding what specifically is scary though. The potential fallibility or mistakes of the models? Check back in a few months. The fact they’re run by giant corps which will steal and train on your data? Then run local models. Their potential to incorporate bias or persuade via misalignment with the reader’s goals? Trickier to resolve, but various labs and nonprofits are working on it.<p>In some ways I’m scared too. But that’s the way things are going because younger people far prefer the interface of chat and question answering to flipping through a textbook.<p>Even if AI makes more mistakes or is more misaligned with the reader’s intentions than a random human reviewer (which is debatable in certain fields since the latest models game out), the behavior of young people requires us to improve the reputability of these systems. (Make sure they use citations, make sure they don’t hallucinate, etc). I think the technology is so much more user friendly that fixing the engineering bugs will be easier than forcing new generations to use the older systems.</p>
]]></description><pubDate>Sat, 06 Dec 2025 18:56:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=46175690</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46175690</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46175690</guid></item><item><title><![CDATA[New comment by sansseriff in "HTML as an Accessible Format for Papers (2023)"]]></title><description><![CDATA[
<p>Seriously. More people need to wake up to this. Older generations can keep arguing over display formats if they want. Meanwhile younger undergrad and grad students are getting more and more accustomed to LLMs forming the front end for any knowledge they consume. Why would research papers be any different.</p>
]]></description><pubDate>Sat, 06 Dec 2025 17:32:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=46175019</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46175019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46175019</guid></item><item><title><![CDATA[New comment by sansseriff in "Arduino published updated terms and conditions: no longer an open commons"]]></title><description><![CDATA[
<p>I remember 15 years ago when I was in highschool I really wanted to learn how to program 8 bit microcontrollers without Arduino. And everybody looked at me like I was crazy. There was barely any learning material out there about how to do this.<p>Now, I imagine the bias pushing everyone to learn on arduino is even more intense? Who out there is programming these chips in pure C using open source compilers and bootloaders?<p>Edit: Of course there's other platforms like Esp32; teensy; seed. But I've only programmed Esp32s using the arduino dev environment. Are there other good ways of doing it?</p>
]]></description><pubDate>Fri, 21 Nov 2025 19:14:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46007816</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=46007816</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46007816</guid></item><item><title><![CDATA[New comment by sansseriff in "ChatGPT Atlas"]]></title><description><![CDATA[
<p>There's a great value proposition for a company like Private Internet Access or NordVPN to create an AI browser extension or full-on browser. Anonymize requests and provide various LLM models. Rely on your reputation as a privacy focused corp to pull people away from these OpenAI/Perplexity offerings.</p>
]]></description><pubDate>Tue, 21 Oct 2025 17:31:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=45658740</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=45658740</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45658740</guid></item><item><title><![CDATA[New comment by sansseriff in "ChatGPT Atlas"]]></title><description><![CDATA[
<p>I haven't used LLM chrome plugins because I couldn't trust that they weren't collecting more information about my browsing than I'd like. The same concern exists for this, though now I'm just confident it's a giant software company with access to my data rather than some shady plugin developer. I'm faced with asking myself if that's actually better...</p>
]]></description><pubDate>Tue, 21 Oct 2025 17:25:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=45658614</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=45658614</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45658614</guid></item><item><title><![CDATA[New comment by sansseriff in "Manim: Animation engine for explanatory math videos"]]></title><description><![CDATA[
<p>If you're passionate about it, then go for it! Though be aware that several others have tried similar things:<p>- <a href="https://www.befreed.ai/knowledge-visualizer" rel="nofollow">https://www.befreed.ai/knowledge-visualizer</a><p>- <a href="https://kodisc.com/" rel="nofollow">https://kodisc.com/</a><p>- <a href="https://github.com/hesamsheikh/AnimAI-Trainer" rel="nofollow">https://github.com/hesamsheikh/AnimAI-Trainer</a><p>- <a href="https://tiger-ai-lab.github.io/TheoremExplainAgent/" rel="nofollow">https://tiger-ai-lab.github.io/TheoremExplainAgent/</a><p>- <a href="https://tma.live/" rel="nofollow">https://tma.live/</a>, HN discussion: 
<a href="https://news.ycombinator.com/item?id=42590290">https://news.ycombinator.com/item?id=42590290</a><p>- <a href="https://generative-manim.vercel.app/" rel="nofollow">https://generative-manim.vercel.app/</a><p>No doubt the results can be impressive: <a href="https://x.com/zan2434/status/1898145292937314347" rel="nofollow">https://x.com/zan2434/status/1898145292937314347</a><p>Only reason I'm aware of all these attempts is because I'm betting the 'one-shot LLM animation' technique is not scalable long term. I'm trying to build an AI animation app that has a good human-in-the-loop experience. Though I'm building with bevy instead of manim</p>
]]></description><pubDate>Sat, 23 Aug 2025 18:21:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=44997970</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44997970</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44997970</guid></item><item><title><![CDATA[New comment by sansseriff in "Manim: Animation engine for explanatory math videos"]]></title><description><![CDATA[
<p>I remember listening to a podcast where Grant Sanderson basically said the opposite. He tried generating manim code with LLMs and found the results unimpressive. Probably just goes to show that competence in manim looks very different to us layman than it does to Grant haha</p>
]]></description><pubDate>Sat, 23 Aug 2025 17:37:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=44997632</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44997632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44997632</guid></item><item><title><![CDATA[New comment by sansseriff in "Nobody knows how to build with AI yet"]]></title><description><![CDATA[
<p>There's a weird insecurity I've noticed cropping up. I want to design the codebase 'my way'. I want to decide on the fundamental data structures. But there's this worry that my preferred architecture is not massively better than whatever the machine comes up with. So by insisting on 'my way' I'm robbing everyone productivity.<p>I know most true programmers will vouch for me and my need to understand. But clients and project managers and bosses? Are they really gonna keep accepting a refrain like this from their engineers?<p>"either it gets done in a day and I understand none of it, or it gets done in a month and I fully understand it and like it"</p>
]]></description><pubDate>Sat, 19 Jul 2025 17:42:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=44617580</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44617580</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44617580</guid></item><item><title><![CDATA[New comment by sansseriff in "Mercury: Ultra-fast language models based on diffusion"]]></title><description><![CDATA[
<p>I wonder if there's a way to do diffusion within some sort of schema-defined or type constrained space.<p>A lot of people these days are asking for structured output from LLMs so that a schema is followed. Even if you train on schema-following with a transformer, you're still just 'hoping' in the end that the generated json matches the schema.<p>I'm not a diffusion excerpt, but maybe there's a way to diffuse one value in the 'space' of numbers, and another value in the 'space' of all strings, as required by a schema:<p>{
  "type": "object",
  "properties": {
    "amount": {
      "type": "number"
    },
    "description": {
      "type": "string"
    }
  },
  "required": ["amount", "description"]
}<p>I'm not sure how far this could lead. Could you diffuse more complex schemas that generalize to a arbitrary syntax tree? E.g. diffuse some code in a programming language that is guaranteed to be type-safe?</p>
]]></description><pubDate>Mon, 07 Jul 2025 20:41:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=44494479</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44494479</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44494479</guid></item><item><title><![CDATA[New comment by sansseriff in "Debunking HDR [video]"]]></title><description><![CDATA[
<p>Cameras capture linear brightness data, proportional to the number of photons that hit each pixel. Human eyes (film cameras too) basically process the logarithm of brightness data. So one of the first things a digital camera can do to throw out a bunch of unneeded data is to take the log of the linear values it records, and save that to disk. You lose a bunch of fine gradations of lightness in the brightest parts of the image. But humans can't tell.<p>Gamma encoding, which has been around since the earliest CRTs was a very basic solution to this fact. Nowadays it's silly for any high-dynamic image recording format to not encode data in a log format. Because it's so much more representative of human vision.</p>
]]></description><pubDate>Sat, 14 Jun 2025 21:46:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=44279060</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44279060</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44279060</guid></item><item><title><![CDATA[New comment by sansseriff in "Debunking HDR [video]"]]></title><description><![CDATA[
<p>An excellent video. I've admired Yedlin's past work in debunking the need for film cameras over digital when you're going after a 'film look'<p>I wish he shared his code though. Part of the problem is he can't operate like a normal scientist when all the best color grading tools are proprietary.<p>I think it would be really cool to make an open source color grading software that simulates the best film looks. But there isn't enough information on Yedlin's website to exactly reproduce all the research he's done with open source tools.</p>
]]></description><pubDate>Sat, 14 Jun 2025 21:36:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=44279007</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44279007</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44279007</guid></item><item><title><![CDATA[New comment by sansseriff in "Ask HN: In 15 years, what will a gas station visit look like?"]]></title><description><![CDATA[
<p>I find it funny that the AI 2027 thing got a non-vanishing fraction of Hackernews folk to wonder if humans would be fully eradicated from Earth in 5 years (I was influenced too!)<p>Anyway, just funny to now be talking about gas stations in 15 years.</p>
]]></description><pubDate>Mon, 09 Jun 2025 01:20:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=44220655</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=44220655</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44220655</guid></item><item><title><![CDATA[New comment by sansseriff in "Stop using REST for state synchronization (2024)"]]></title><description><![CDATA[
<p>Curious to her other people's opinion of this. I arrived at a similar conclusion after making web apps for research control software in my PhD. I went through a Yjs tutorial and looked into integrating it with fastapi websockets. But this seems like a pretty unusual thing; there just isn't enough people doing this.<p>Nice user-friendly libraries and tutorials don't exist for smoothing the transition from REST to CRDTs, should your app need that.</p>
]]></description><pubDate>Thu, 15 May 2025 21:33:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=43999538</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=43999538</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43999538</guid></item><item><title><![CDATA[New comment by sansseriff in "Embeddings are underrated (2024)"]]></title><description><![CDATA[
<p>It would be great to semantically search through literature with embeddings. At least one person I know if is trying to generate a vector database of all arxiv papers.<p>The big problem I see is attribution and citations. An embedding is just a vector. It doesn't contain any citation back to the source material or modification date or certificate of authenticity. So when using embeddings in RAG, they only serve to link back to a particular page of source material.<p>Using embeddings as links doesn't dramatically change the way citation and attribution are handled in technical writing. You still end up citing a whole paper or a page of a paper.<p>I think GraphRAG [1] is a more useful thing to build on for technical literature. There's ways to use graphs to cite a particular concept of a particular page of an academic paper. And for the 'citations' to act as bidirectional links between new and old scientific discourse. But I digress<p>[1] <a href="https://microsoft.github.io/graphrag/" rel="nofollow">https://microsoft.github.io/graphrag/</a></p>
]]></description><pubDate>Mon, 12 May 2025 16:15:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=43964664</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=43964664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43964664</guid></item><item><title><![CDATA[New comment by sansseriff in "The Death of Software Engineering by a Thousand Prompts"]]></title><description><![CDATA[
<p>I think about the reviewer problem. An AI can write 3000 lines in less than a minute. But it might take me an hour to understand the architecture it's decided on.<p>There's a couple possibilities with this:<p>1. Agents become so powerful that a human can't conceivably keep up with them. And, it becomes a drain on efficiency for any human to try. The only important things are wether or not the prompt fits the desired outcome, and is the creation 'safe'. Safe can mean many things. Will not crash, will not leak data, will not take over the world... Atlas Computing is one startup that's taking this view. By ensuring an AI can only do 'safe' things as defined by some formal ontology/methods.<p>2. A human stays in the loop, and tries to stay at least reasonably up to date on the code architecture. For this to work long term, the weak link is the human understanding. In which case there's interesting opportunities for AI-generated lessons, animations, and examples that are used to get the human up to speed as fast as possible. If I see a very nice 3Blue1Brown style animation generated by AI about how a piece of software functions, than I can probably start working with it more quickly than if I only had the code. At least if the animation links very closely with the code itself.</p>
]]></description><pubDate>Sun, 30 Mar 2025 22:48:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=43528522</link><dc:creator>sansseriff</dc:creator><comments>https://news.ycombinator.com/item?id=43528522</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43528522</guid></item></channel></rss>