<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: pplonski86</title><link>https://news.ycombinator.com/user?id=pplonski86</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 22:32:57 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=pplonski86" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by pplonski86 in "Show HN: Which AI model is best for real data analysis?"]]></title><description><![CDATA[
<p>We built a benchmark to evaluate LLMs on real data analysis workflows. Instead of single prompts, each task is a sequence of prompts (steps). It is similar to how a human data analyst works in practice. Each run is saved as full python notebook, including prompts, code and outputs. We evaluated runs across task completion, code correctess, output quality, reasoning and reliability. Each workflow is execuuted multiple times and scored automatically.<p>Modern LLMs perform very well on individual steps. The benchmark currently inludes 23 workflows from different data analysis tasks (EDA, ML, NLP, statistics ...). The top-3 models across the 23 workflows, gpt-oss:120b scored 9.87/10, followed by gpt-5.4 at 9.65/10, glm-5.1 at 9.48/10. Which is very high in my opinion. The results show that modern LLMs perform very well on data analysis tasks. All feedback is welcome! I uploaded all notebooks for each model <a href="https://github.com/pplonski/ai-for-data-analysis" rel="nofollow">https://github.com/pplonski/ai-for-data-analysis</a></p>
]]></description><pubDate>Tue, 14 Apr 2026 12:35:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47764837</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47764837</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47764837</guid></item><item><title><![CDATA[Show HN: Which AI model is best for real data analysis?]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/analysis/">https://mljar.com/analysis/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47764735">https://news.ycombinator.com/item?id=47764735</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 14 Apr 2026 12:27:56 +0000</pubDate><link>https://mljar.com/analysis/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47764735</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47764735</guid></item><item><title><![CDATA[Ask HN: Im back end engineer, not front end – is this just excuse?]]></title><description><![CDATA[
<p>When I hear: I'm not good at frontend, I'm good at backend - I'm starting to think this is not true.<p>If you don't care about the user in the UI, you probably don't care about the user in the backend either. Frontend problems are just easier to see — bad UI, confusing views, ugly design. Everyone notices. Backend problems are hidden — slow API response, bad errors, hard to manage code. It takes longer to notice, but users still suffer. So maybe it's not about frontend vs backend. Maybe it is just about caring (or not caring) about the end user. Have you seen this?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47731814">https://news.ycombinator.com/item?id=47731814</a></p>
<p>Points: 3</p>
<p># Comments: 4</p>
]]></description><pubDate>Sat, 11 Apr 2026 16:20:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47731814</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47731814</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47731814</guid></item><item><title><![CDATA[New comment by pplonski86 in "Claude Code's source code has been leaked via a map file in their NPM registry"]]></title><description><![CDATA[
<p>I thought it was open source project on github? <a href="https://github.com/anthropics/claude-code" rel="nofollow">https://github.com/anthropics/claude-code</a> no?</p>
]]></description><pubDate>Tue, 31 Mar 2026 14:02:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47587547</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47587547</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47587547</guid></item><item><title><![CDATA[New comment by pplonski86 in "Google's 200M-parameter time-series foundation model with 16k context"]]></title><description><![CDATA[
<p>Can someone explain ELI5 how it does work? and how many data points it can read?</p>
]]></description><pubDate>Tue, 31 Mar 2026 08:40:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47584383</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47584383</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47584383</guid></item><item><title><![CDATA[Machine Learning for Humans and Now for LLMs]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/blog/structured-automl-reports-python-llm/">https://mljar.com/blog/structured-automl-reports-python-llm/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47542991">https://news.ycombinator.com/item?id=47542991</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 27 Mar 2026 14:22:11 +0000</pubDate><link>https://mljar.com/blog/structured-automl-reports-python-llm/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47542991</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47542991</guid></item><item><title><![CDATA[Show HN: Prompts Directory for Data Analyst]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/ai-prompts/data-analyst/">https://mljar.com/ai-prompts/data-analyst/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47499593">https://news.ycombinator.com/item?id=47499593</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 24 Mar 2026 07:37:11 +0000</pubDate><link>https://mljar.com/ai-prompts/data-analyst/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47499593</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47499593</guid></item><item><title><![CDATA[AutoResearch by Karpathy and the Future of Autonomous AI Research]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/">https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47486602">https://news.ycombinator.com/item?id=47486602</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 23 Mar 2026 08:07:49 +0000</pubDate><link>https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47486602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47486602</guid></item><item><title><![CDATA[Show HN: AI Prompts for Data Professionals]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/ai-prompts/">https://mljar.com/ai-prompts/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47486460">https://news.ycombinator.com/item?id=47486460</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 23 Mar 2026 07:42:51 +0000</pubDate><link>https://mljar.com/ai-prompts/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47486460</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47486460</guid></item><item><title><![CDATA[AutoResearch by Karpathy and the Future of Autonomous Machine Learning]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/">https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47414882">https://news.ycombinator.com/item?id=47414882</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 17 Mar 2026 16:26:39 +0000</pubDate><link>https://mljar.com/blog/autoresearch-karpathy-autonomous-ai-research/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=47414882</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47414882</guid></item><item><title><![CDATA[GeoGPT – Chat-controlled GIS app built from a Jupyter Notebook]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/blog/chatgpt-gis-app-jupyter-notebook/">https://mljar.com/blog/chatgpt-gis-app-jupyter-notebook/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46911686">https://news.ycombinator.com/item?id=46911686</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 06 Feb 2026 11:40:41 +0000</pubDate><link>https://mljar.com/blog/chatgpt-gis-app-jupyter-notebook/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46911686</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46911686</guid></item><item><title><![CDATA[New comment by pplonski86 in "xAI joins SpaceX"]]></title><description><![CDATA[
<p>Do we need rockets to put satelittes to the space? Cant it be done with baloons? <a href="https://www.youtube.com/watch?v=NFieAD5Gpms" rel="nofollow">https://www.youtube.com/watch?v=NFieAD5Gpms</a></p>
]]></description><pubDate>Tue, 03 Feb 2026 12:27:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46870175</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46870175</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46870175</guid></item><item><title><![CDATA[New comment by pplonski86 in "The Codex App"]]></title><description><![CDATA[
<p>It is not that easy to build such app from scratch ... it all requires a lot of work, even with AI help. I think the most important is to provide easy to use UI first, and if speed or some missing features will be blockers for further innovation step then maybe native app will be at some point created.</p>
]]></description><pubDate>Tue, 03 Feb 2026 12:20:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46870075</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46870075</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46870075</guid></item><item><title><![CDATA[Build chatbot to talk with your PostgreSQL database using Python and local LLM]]></title><description><![CDATA[
<p>Article URL: <a href="https://mljar.com/blog/chatbot-python-postgresql-local-llm/">https://mljar.com/blog/chatbot-python-postgresql-local-llm/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46854722">https://news.ycombinator.com/item?id=46854722</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 02 Feb 2026 11:17:24 +0000</pubDate><link>https://mljar.com/blog/chatbot-python-postgresql-local-llm/</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46854722</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46854722</guid></item><item><title><![CDATA[New comment by pplonski86 in "Microsoft forced me to switch to Linux"]]></title><description><![CDATA[
<p>I have dual boot on decent laptop, doing nothing, on windows fan is always on, computing something? On Linux it is just silent</p>
]]></description><pubDate>Wed, 28 Jan 2026 21:10:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=46801559</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46801559</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46801559</guid></item><item><title><![CDATA[New comment by pplonski86 in "The browser is the sandbox"]]></title><description><![CDATA[
<p>Thank you! With WASM I can’t use all pypi packages and can’t connect to database, that’s why I’m looking for python based solution</p>
]]></description><pubDate>Tue, 27 Jan 2026 08:16:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46776980</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46776980</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46776980</guid></item><item><title><![CDATA[New comment by pplonski86 in "Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model"]]></title><description><![CDATA[
<p>Thank you! Exactly what I was looking for</p>
]]></description><pubDate>Tue, 27 Jan 2026 08:14:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=46776957</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46776957</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46776957</guid></item><item><title><![CDATA[New comment by pplonski86 in "AI code and software craft"]]></title><description><![CDATA[
<p>What if AI starts to have sense of craft? we just miss the verify and critique models, that will tell other models what looks good</p>
]]></description><pubDate>Tue, 27 Jan 2026 07:28:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46776577</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46776577</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46776577</guid></item><item><title><![CDATA[New comment by pplonski86 in "ChatGPT Containers can now run bash, pip/npm install packages and download files"]]></title><description><![CDATA[
<p>thank you for sharing, is there a new container for each code run, or it stays the same for whole conversation?</p>
]]></description><pubDate>Tue, 27 Jan 2026 07:26:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=46776560</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46776560</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46776560</guid></item><item><title><![CDATA[New comment by pplonski86 in "Kimi Released Kimi K2.5, Open-Source Visual SOTA-Agentic Model"]]></title><description><![CDATA[
<p>There are so many models, is there any website with list of all of them and comparison of performance on different tasks?</p>
]]></description><pubDate>Tue, 27 Jan 2026 07:22:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=46776537</link><dc:creator>pplonski86</dc:creator><comments>https://news.ycombinator.com/item?id=46776537</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46776537</guid></item></channel></rss>