<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mattbit</title><link>https://news.ycombinator.com/user?id=mattbit</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 19 Jun 2026 18:05:14 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mattbit" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Confidence Sets, Confidence Intervals]]></title><description><![CDATA[
<p>Article URL: <a href="https://bactra.org/notebooks/confidence-sets.html">https://bactra.org/notebooks/confidence-sets.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48505947">https://news.ycombinator.com/item?id=48505947</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 12 Jun 2026 16:11:50 +0000</pubDate><link>https://bactra.org/notebooks/confidence-sets.html</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=48505947</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505947</guid></item><item><title><![CDATA[Ear Training Practice]]></title><description><![CDATA[
<p>Article URL: <a href="https://tonedear.com/">https://tonedear.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48447598">https://news.ycombinator.com/item?id=48447598</a></p>
<p>Points: 331</p>
<p># Comments: 128</p>
]]></description><pubDate>Mon, 08 Jun 2026 16:38:07 +0000</pubDate><link>https://tonedear.com/</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=48447598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48447598</guid></item><item><title><![CDATA[Recommended Mystery Novels]]></title><description><![CDATA[
<p>Article URL: <a href="https://bactra.org/notebooks/mystery-recs.html">https://bactra.org/notebooks/mystery-recs.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48421936">https://news.ycombinator.com/item?id=48421936</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 06 Jun 2026 06:18:16 +0000</pubDate><link>https://bactra.org/notebooks/mystery-recs.html</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=48421936</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48421936</guid></item><item><title><![CDATA[StereoTales: Multilingual Open-Ended Stereotype Discovery in LLMs]]></title><description><![CDATA[
<p>Article URL: <a href="https://research.giskard.ai/blog/stereotales/">https://research.giskard.ai/blog/stereotales/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48396096">https://news.ycombinator.com/item?id=48396096</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 04 Jun 2026 09:10:46 +0000</pubDate><link>https://research.giskard.ai/blog/stereotales/</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=48396096</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48396096</guid></item><item><title><![CDATA[OpenClaw security vulnerabilities include data leakage, prompt injection risks]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.giskard.ai/knowledge/openclaw-security-vulnerabilities-include-data-leakage-and-prompt-injection-risks">https://www.giskard.ai/knowledge/openclaw-security-vulnerabilities-include-data-leakage-and-prompt-injection-risks</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46943350">https://news.ycombinator.com/item?id=46943350</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 09 Feb 2026 09:28:38 +0000</pubDate><link>https://www.giskard.ai/knowledge/openclaw-security-vulnerabilities-include-data-leakage-and-prompt-injection-risks</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=46943350</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46943350</guid></item><item><title><![CDATA[Redis LangCache]]></title><description><![CDATA[
<p>Article URL: <a href="https://redis.io/langcache/">https://redis.io/langcache/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45680135">https://news.ycombinator.com/item?id=45680135</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 23 Oct 2025 09:58:36 +0000</pubDate><link>https://redis.io/langcache/</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=45680135</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45680135</guid></item><item><title><![CDATA[NLP Models Think about Gender Stereotypes]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.opensamizdat.com/posts/gest/">https://www.opensamizdat.com/posts/gest/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39265369">https://news.ycombinator.com/item?id=39265369</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 05 Feb 2024 18:59:23 +0000</pubDate><link>https://www.opensamizdat.com/posts/gest/</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=39265369</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39265369</guid></item><item><title><![CDATA[New comment by mattbit in "Purple Llama: Towards open trust and safety in generative AI"]]></title><description><![CDATA[
<p>From my experience, in a majority of real-world LLMs applications, prompt injection is not a primary concern.<p>The systems that I see most commonly deployed in practice are chatbots that use retrieval-augmented generation. These chatbots are typically very constrained: they can't use the internet, they can't execute tools, and essentially just serve as an interface to non-confidential knowledge bases.<p>While abuse through prompt injection is possible, its impact is limited. Leaking the prompt is just uninteresting, and hijacking the system to freeload on the LLM could be a thing, but it's easily addressable by rate limiting or other relatively simple techniques.<p>In many cases, for a company is much more dangerous if their chatbot produces toxic/wrong/inappropriate answers. Think of an e-commerce chatbot that gives false information about refund conditions, or an educational bot that starts exposing children to violent content. These situations can be a hugely problematic from a legal and reputational standpoint.<p>The fact that some nerd, with some crafty and intricate prompts, intentionally manages to get some weird answer out of the LLM is almost always secondary with respect to the above issues.<p>However, I think the criticism is legitimate: one reason we are limited to such dumb applications of LLMs is precisely because we have not solved prompt injection, and deploying a more powerful LLM-based system would be too risky. Solving that issue could unlock a lot of the currently unexploited potential of LLMs.</p>
]]></description><pubDate>Thu, 07 Dec 2023 23:01:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=38562990</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=38562990</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38562990</guid></item><item><title><![CDATA[New comment by mattbit in "The Dunning-Kruger effect is autocorrelation"]]></title><description><![CDATA[
<p>This is how McIntosh & Della Sala put it:<p>> in the academic literature, it has been suggested that the signature pattern of the DKE (Figure 1A) might be nothing more than a statistical artefact. In a typical study, people’s tendencies to under- or overestimation are analysed as a function of their ability for the task. This involves a ‘double dipping’ into the data because the task performance score is used once to rank people for ability, and then again to determine whether the self-estimate is an under- or over-estimate. This dubious double-dipping makes the analysis prone to a slippery statistical phenomenon called ‘regression to the mean’.</p>
]]></description><pubDate>Sun, 26 Nov 2023 09:16:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=38420269</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=38420269</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38420269</guid></item><item><title><![CDATA[New comment by mattbit in "The Dunning-Kruger effect is autocorrelation"]]></title><description><![CDATA[
<p>This is not ‘autocorrelation’, it is regression to the mean. I find the article unclear and imprecise.
For those interested in a better overview of the Dunning–Kruger effect, I recommend this short article by McIntosh & Della Sala instead:<p><a href="https://www.bps.org.uk/psychologist/persistent-irony-dunning-kruger-effect" rel="nofollow noreferrer">https://www.bps.org.uk/psychologist/persistent-irony-dunning...</a></p>
]]></description><pubDate>Sun, 26 Nov 2023 09:11:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=38420244</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=38420244</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38420244</guid></item><item><title><![CDATA[OWASP Top 10 for LLM Applications]]></title><description><![CDATA[
<p>Article URL: <a href="https://llmtop10.com/">https://llmtop10.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38164057">https://news.ycombinator.com/item?id=38164057</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 06 Nov 2023 15:43:11 +0000</pubDate><link>https://llmtop10.com/</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=38164057</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38164057</guid></item><item><title><![CDATA[New comment by mattbit in "Adding water to Martian soil samples might have been a bad idea"]]></title><description><![CDATA[
<p>There’s actually two ways of seeing this: the three-domain system and the two-domain system.<p>The three-domain system divides life in Archaea, Bacteria, and Eukarya. In this system, Archaea and Bacteria can be grouped together as prokaryotes.<p>In the two-domain system, the division is between Archaea and Bacteria. In this case, eukaryotes are seen as a subgroup of Archaea.<p>Hope to have cleared up some of the confusion.</p>
]]></description><pubDate>Fri, 25 Aug 2023 05:18:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=37258359</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=37258359</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37258359</guid></item><item><title><![CDATA[Don’t you (forget NLP): Prompt injection with control characters in ChatGPT]]></title><description><![CDATA[
<p>Article URL: <a href="https://dropbox.tech/machine-learning/prompt-injection-with-control-characters-openai-chatgpt-llm">https://dropbox.tech/machine-learning/prompt-injection-with-control-characters-openai-chatgpt-llm</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=37037021">https://news.ycombinator.com/item?id=37037021</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 07 Aug 2023 16:48:04 +0000</pubDate><link>https://dropbox.tech/machine-learning/prompt-injection-with-control-characters-openai-chatgpt-llm</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=37037021</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37037021</guid></item><item><title><![CDATA[New comment by mattbit in "Show HN: Giskard – Testing framework dedicated to LLMs and Tabular ML models"]]></title><description><![CDATA[
<p>Hey, Giskard team member here! I am around to discuss and read your feedback.<p>I’ve worked in particular on automatic scanning of ML models for bugs and problems, the idea was to systematically scan for general issues and automatically find segments of data on which the model performs worse than average. If you have questions, I am happy to discuss here.</p>
]]></description><pubDate>Fri, 16 Jun 2023 14:31:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=36357128</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=36357128</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36357128</guid></item><item><title><![CDATA[Show HN: Python library to scan ML models for vulnerabilities]]></title><description><![CDATA[
<p>Hi! I’ve been working on this automatic scanner for ML models to detect issues like underperforming data slices, overconfidence in predictions, robustness problems, and others. It supports all main Python ML frameworks (sklearn, torch, xgboost, …) and integrates with the quality assurance solution we are building at Giskard AI (<a href="https://giskard.ai" rel="nofollow noreferrer">https://giskard.ai</a>) to systematically test models before putting them in production.<p>It is still a beta and I would love to hear your feedback if you have the time to try it out.<p>We have quite a few tutorials in the docs with ready-made colab notebooks to make it easy to get started.<p>If you are interested in the code:<p><a href="https://github.com/Giskard-AI/giskard/tree/main/python-client">https://github.com/Giskard-AI/giskard/tree/main/python-clien...</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=36309166">https://news.ycombinator.com/item?id=36309166</a></p>
<p>Points: 20</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 13 Jun 2023 12:20:21 +0000</pubDate><link>https://docs.giskard.ai/en/latest/getting-started/quickstart.html</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=36309166</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36309166</guid></item><item><title><![CDATA[New comment by mattbit in "SafeGPT: New tool to detect LLMs' hallucinations, biases and privacy issues"]]></title><description><![CDATA[
<p>Not exactly, metamorphic testing does not need an oracle. That’s actually the reason of its popularity in ML testing. It works by perturbing the input in a way that will produce a predictable variation of the output (or possibly no variation).<p>Take for example a credit scoring model: you can reasonably expect that if you increase the liquidity, the credit score should not decrease. In general it is relatively easy to come up with a set of assumptions on the effect of perturbation, which allows evaluating the robustness of a model without knowing the exact ground truth.</p>
]]></description><pubDate>Fri, 21 Apr 2023 16:16:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=35655984</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=35655984</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35655984</guid></item><item><title><![CDATA[New comment by mattbit in "The future of deep learning is photonic"]]></title><description><![CDATA[
<p>Yeah, LightOn has been doing this for years already, it's kind of strange that there was no mention of them in the article. I know them because their offices are close to the research center where I work (in Paris, France). If I'm not wrong they were planning to offer their optical processor as a cloud service too.</p>
]]></description><pubDate>Mon, 05 Jul 2021 20:43:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=27741653</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=27741653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27741653</guid></item><item><title><![CDATA[New comment by mattbit in "How much math you need for programming (2014)"]]></title><description><![CDATA[
<p>> Now what does does a mathematician do? He tries to understand nature and uses mathematics as a language to do that.<p>I would argue that this is wrong. That's what physicists do, not mathematicians. Mathematics is about abstract ideas, which can live regardless of nature or application. Physics instead <i>is</i> about understanding nature. Most physicists use mathematics to do that, but that's just for practical reasons. They don't always take it for granted, there's a very famous article by Eugene Wigner on this: “The Unreasonable Effectiveness of Mathematics in the Natural Sciences”.<p>I think it's important to understand this. Sure, computer programming is not math. Physics is not math either. Mathematics is kind of a way of thinking, and mathematical language turns out to be very useful in describing and understanding nature and many other things. Computer programming theory stems out of mathematics, but I agree that everyday programming practice does not strictly require an in-depth math knowledge.<p>But it depends. One day you wake up and you want to solve a problem: sometimes you need programming, sometimes you need math, sometimes both, or maybe you need some business experience, psychology, whatever. We need different perspectives, I don't think we can compartmentalize these things any more.</p>
]]></description><pubDate>Sat, 26 Dec 2020 11:27:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=25542766</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=25542766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=25542766</guid></item><item><title><![CDATA[New comment by mattbit in "Chicago Police Department shuts down its arrest API"]]></title><description><![CDATA[
<p>“Obtaining structured data from this system is costly and time-consuming due to the need to tell the system you’re not a robot every few searches. It also doesn’t include some key categories of information formerly available via the API, including the primary charge, bond hearing dates, IR number, and the FBI code associated with the type of arrest.”</p>
]]></description><pubDate>Thu, 09 Jul 2020 20:29:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=23784785</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=23784785</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=23784785</guid></item><item><title><![CDATA[New comment by mattbit in "An update on uBlock"]]></title><description><![CDATA[
<p>It seems like they are just trying to steal the name of uBlock. AdBlock CEO says they “do love the name” [0] and are "investing heavily" into the product, but the commits in the repo [1] are just cosmetic changes and rebranding. Oddly enough, the only active committer is anonymous.<p>I wonder if this will cause legal troubles to the true uBlock Origin.<p>[0]: <a href="https://twitter.com/judemaier/status/1020034358558670848" rel="nofollow">https://twitter.com/judemaier/status/1020034358558670848</a>
[1]: <a href="https://github.com/uBlock-LLC/uBlock/commits/master" rel="nofollow">https://github.com/uBlock-LLC/uBlock/commits/master</a></p>
]]></description><pubDate>Sat, 28 Jul 2018 18:12:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=17634268</link><dc:creator>mattbit</dc:creator><comments>https://news.ycombinator.com/item?id=17634268</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17634268</guid></item></channel></rss>