<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: londogard</title><link>https://news.ycombinator.com/user?id=londogard</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 28 Apr 2026 21:28:06 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=londogard" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by londogard in "Gzip and KNN Outperforms Transformers on Text Classification"]]></title><description><![CDATA[
<p>Yes, what I'm saying is that gzip does not perform as well when it's not overlapping tokens exact.<p>Gzip does not support a "semantic" mode, hence it won't and does not (according to the papers metric) perform as well.<p>Deep learning can capture these semantic similarities.</p>
]]></description><pubDate>Thu, 13 Jul 2023 13:19:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=36708671</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36708671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36708671</guid></item><item><title><![CDATA[New comment by londogard in "Gzip and KNN Outperforms Transformers on Text Classification"]]></title><description><![CDATA[
<p>I'd like to note that this is only stronger on news.<p>Yahoo Questions it is not top performer.
It's not far fetched to think that news are written in a similar way, sometimes even partly copied, and therefore have a lot of words in common.
Yahoo Questions is a forum and I'd expect there to be a greater variation of word, but the word themself have a semantic similarity.<p>That is, gzip is strong when many words overlap (the size increase when gzipped is smaller) but if it's semantic similarity DNN's win everyday.<p>The results are interesting but not as interesting as it sounds IMO.</p>
]]></description><pubDate>Thu, 13 Jul 2023 11:41:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=36707677</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36707677</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36707677</guid></item><item><title><![CDATA[New comment by londogard in "Ask HN: Could you share your personal blog here?"]]></title><description><![CDATA[
<p>blog.londogard.com<p>Mainly Data Science or Kotlin I'm blogging about it at least for now.</p>
]]></description><pubDate>Tue, 04 Jul 2023 18:15:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=36590546</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36590546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36590546</guid></item><item><title><![CDATA[Solara – “A New Reactive Streamlit”]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.londogard.com/posts/2023-06-30-solara/">https://blog.londogard.com/posts/2023-06-30-solara/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=36550924">https://news.ycombinator.com/item?id=36550924</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 01 Jul 2023 15:07:42 +0000</pubDate><link>https://blog.londogard.com/posts/2023-06-30-solara/</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36550924</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36550924</guid></item><item><title><![CDATA[New comment by londogard in "Log and track ML metrics, parameters, models with Git and DVC"]]></title><description><![CDATA[
<p>Thanks!<p>We solved 1. the same way, but it felt "off" somehow. Perhaps it's a good solution.<p>2. That's a sound solution, but a tiny bit cumbersome. I have projects where we deploy both classifier and regressor, where it'd nice to keep all in main. Alas, you can't have it all.</p>
]]></description><pubDate>Tue, 23 May 2023 11:32:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=36042820</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36042820</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36042820</guid></item><item><title><![CDATA[New comment by londogard in "Log and track ML metrics, parameters, models with Git and DVC"]]></title><description><![CDATA[
<p>I really enjoy using DVC. I do have some drawbacks compared to other offering like MLFlow and W&B.<p>1. Harder to track experiments on remote VM's (e.g. Azure) as there's no server (we need to feed results back somehow)
2. Impossible (?) to track different types of experiments in the same repo. MLFlow has a way to define experiments and runs, which means I can easily group Regression vs Classification or even if I try a completely different task with the same data.<p>If anyone has a good suggestion on how to solve these two I'd love to fully commit to DVC!</p>
]]></description><pubDate>Sun, 21 May 2023 08:13:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=36018897</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=36018897</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36018897</guid></item><item><title><![CDATA[New comment by londogard in "DuckDB 0.7.0"]]></title><description><![CDATA[
<p>The Polars integration is golden! Now I can use my two favorite tools without any awkward conversations via Arrow.<p>Really smooth, love it!</p>
]]></description><pubDate>Mon, 13 Feb 2023 10:33:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=34772005</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=34772005</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34772005</guid></item><item><title><![CDATA[New comment by londogard in "TypeScript and Set Theory"]]></title><description><![CDATA[
<p>I think it's excalidraw</p>
]]></description><pubDate>Fri, 22 Apr 2022 06:15:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=31118771</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=31118771</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31118771</guid></item><item><title><![CDATA[New comment by londogard in "Tribuo, a Machine Learning Library for Java"]]></title><description><![CDATA[
<p>Hi,<p>What would you say differentiates you from Smile which includes a simplistic datagrame, visualisation and support for CBLAS etc.<p>Is speed on par?</p>
]]></description><pubDate>Tue, 15 Sep 2020 20:10:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=24486179</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=24486179</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=24486179</guid></item><item><title><![CDATA[New comment by londogard in "AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models"]]></title><description><![CDATA[
<p>Awesome, I love to see how there's coming more and more frameworks for interpretability. It's incredibly important, especially when selling your solution to higher-ups.<p>There's another solution named LIME which seems to take a similar but more general approach, I like this more tailored idea as it'll probably give a better interpretation of the NLP questions.</p>
]]></description><pubDate>Sun, 29 Sep 2019 06:35:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=21105493</link><dc:creator>londogard</dc:creator><comments>https://news.ycombinator.com/item?id=21105493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=21105493</guid></item></channel></rss>