<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: kacperlukawski</title><link>https://news.ycombinator.com/user?id=kacperlukawski</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 01 May 2026 22:28:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=kacperlukawski" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New multimodal Gemini embeddings from Google (videos and PDFs supported)]]></title><description><![CDATA[
<p>Article URL: <a href="https://haystack.deepset.ai/blog/multimodal-embeddings-gemini-haystack">https://haystack.deepset.ai/blog/multimodal-embeddings-gemini-haystack</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47326357">https://news.ycombinator.com/item?id=47326357</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 10 Mar 2026 17:34:13 +0000</pubDate><link>https://haystack.deepset.ai/blog/multimodal-embeddings-gemini-haystack</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=47326357</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47326357</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Ask HN: Is building a calm, non-gamified learning app a mistake?"]]></title><description><![CDATA[
<p>Although it's in a different area, I wanted to mention <a href="https://calmcode.io/" rel="nofollow">https://calmcode.io/</a> as an excellent example of a calm learning platform.<p>There is a whole movement around enshittification, and I see potential in this kind of app, even though it still seems to be a niche.</p>
]]></description><pubDate>Mon, 15 Dec 2025 16:26:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46276604</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=46276604</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46276604</guid></item><item><title><![CDATA[Ask HN: Front end stack for a new app in 2025]]></title><description><![CDATA[
<p>I read "The Frontend Treadmill" (https://polotek.net/posts/the-frontend-treadmill/) and it makes me think a lot.<p>I used to be a backend developer, and most of my experience is there. For frontend, last time I used jQuery and it was OK for me. Simple and worked fine. Now I want to build new side project and I think maybe vanilla JS is best? JavaScript has many good features now and browsers can do many things.<p>1. Can vanilla JS work for medium-sized apps in 2025?
2. What small libraries would you add if needed?<p>I don't want to rewrite everything in 2-3 years when frameworks change again. This seems like a waste of time. Has anyone here stopped using big frameworks and feels better about it?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43435010">https://news.ycombinator.com/item?id=43435010</a></p>
<p>Points: 1</p>
<p># Comments: 2</p>
]]></description><pubDate>Fri, 21 Mar 2025 12:49:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=43435010</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=43435010</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43435010</guid></item><item><title><![CDATA[Elasticsearch Hybrid Search in Practice]]></title><description><![CDATA[
<p>Article URL: <a href="https://softwaredoug.com/blog/2025/02/08/elasticsearch-hybrid-search">https://softwaredoug.com/blog/2025/02/08/elasticsearch-hybrid-search</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43024764">https://news.ycombinator.com/item?id=43024764</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 12 Feb 2025 12:37:34 +0000</pubDate><link>https://softwaredoug.com/blog/2025/02/08/elasticsearch-hybrid-search</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=43024764</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43024764</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Show HN: TalkNotes – A site that turns your ideas into tasks"]]></title><description><![CDATA[
<p>Is there a free trial available? Even 24 hours should be enough to say if I like it, but currently I have to pay from the day one. Or did I miss it?</p>
]]></description><pubDate>Sat, 01 Feb 2025 12:42:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=42897894</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=42897894</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42897894</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Show HN: I made an extension that turns Google Sheets into Google Slides"]]></title><description><![CDATA[
<p>I'm always a bit worried if an extension gets permission to do anything it wants with all my files, including deleting them. Is there a way to restrict it and allow it to modify only the files it created?</p>
]]></description><pubDate>Thu, 23 Jan 2025 13:51:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=42803985</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=42803985</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42803985</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Don't use cosine similarity carelessly"]]></title><description><![CDATA[
<p>The problem is to scale that properly. If you have millions of documents, that won't scale that well. You are not going to prompt the LLM millions of times, aren't you?<p>Embedding models usually have fewer parameters than the LLMs, and once we index the documents, their retrieval is also pretty fast. Using LLM as a judge makes sense, but only on a limited scale.</p>
]]></description><pubDate>Wed, 15 Jan 2025 09:22:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=42708988</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=42708988</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42708988</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Instant Video Search"]]></title><description><![CDATA[
<p>Interesting! Does it work based on speech or transcriptions?</p>
]]></description><pubDate>Thu, 12 Sep 2024 16:02:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=41522356</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=41522356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41522356</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Automatically Detecting Under-Trained Tokens in Large Language Models"]]></title><description><![CDATA[
<p>Why is that an issue? Training the tokenizer seems much more straightforward than training the model as it is based on the statistics of the input data. I guess it may take a while for massive datasets, but is calculating the frequencies impossible to be done on a bigger scale?</p>
]]></description><pubDate>Sun, 12 May 2024 14:48:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=40334925</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=40334925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40334925</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Automatically Detecting Under-Trained Tokens in Large Language Models"]]></title><description><![CDATA[
<p>Are there any specific reasons for using BPE, not Unigram, in LLMs? I've been trying to understand the impact of the tokenization algorithm, and Unigram was reported to be a better alternative (e.g., Byte Pair Encoding is Suboptimal for Language Model Pretraining: <a href="https://arxiv.org/abs/2004.03720" rel="nofollow">https://arxiv.org/abs/2004.03720</a>). I understand that the unigram training process should eliminate under-trained tokens if trained on the same data as the LLM itself.</p>
]]></description><pubDate>Sun, 12 May 2024 13:06:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=40334142</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=40334142</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40334142</guid></item><item><title><![CDATA[New comment by kacperlukawski in "OpenAI announces GPT-4.5 Turbo"]]></title><description><![CDATA[
<p>Yeah, it seems like it got published too early.</p>
]]></description><pubDate>Tue, 12 Mar 2024 18:43:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=39683245</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=39683245</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39683245</guid></item><item><title><![CDATA[OpenAI announces GPT-4.5 Turbo]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.bing.com/search?q=https%3A%2F%2Fopenai.com%2Fblog%2Fgpt-4-5-turbo">https://www.bing.com/search?q=https%3A%2F%2Fopenai.com%2Fblog%2Fgpt-4-5-turbo</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39683178">https://news.ycombinator.com/item?id=39683178</a></p>
<p>Points: 5</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 12 Mar 2024 18:37:19 +0000</pubDate><link>https://www.bing.com/search?q=https%3A%2F%2Fopenai.com%2Fblog%2Fgpt-4-5-turbo</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=39683178</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39683178</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Qdrant 1.7.0"]]></title><description><![CDATA[
<p>I'm unsure if there is any comparison of LanceDB and Qdrant available out there, but there shouldn't be any issues with Python 3.12 and qdrant-client compatibility. Windows is also not a problem, as the typical local setup is usually based on Docker. Are there any specific features you are interested in?</p>
]]></description><pubDate>Tue, 12 Dec 2023 15:12:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=38613175</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=38613175</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38613175</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Qdrant 1.7.0"]]></title><description><![CDATA[
<p>If you will be the only app user, then the Python SDK's local mode might be suitable. However, in the long run, when you decide to publish the app, you rather have to switch to an on-premise or cloud environment. Using Qdrant from the very beginning might be a good idea, as the interfaces are kept the same, and the switch is seamless.<p>Local mode: <a href="https://github.com/qdrant/qdrant-client#local-mode">https://github.com/qdrant/qdrant-client#local-mode</a></p>
]]></description><pubDate>Tue, 12 Dec 2023 14:34:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=38612624</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=38612624</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38612624</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Show HN: PromptTools – open-source tools for evaluating LLMs and vector DBs"]]></title><description><![CDATA[
<p>Qdrant here! We're already working on that :D</p>
]]></description><pubDate>Wed, 02 Aug 2023 07:29:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=36968041</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36968041</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36968041</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Serverless Semantic Search, Free tier only"]]></title><description><![CDATA[
<p>How would you host sentence-transformers model for free? You need it to vectorize each query so that has to be hosted somewhere. Is there any way to do it for free?</p>
]]></description><pubDate>Wed, 12 Jul 2023 14:00:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=36694665</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36694665</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36694665</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Serverless Semantic Search, Free tier only"]]></title><description><![CDATA[
<p>If you need semantic search locally then it's fine, but serving an embedding model might be still challenging. And if you want to expose it, your laptop might be not enough.</p>
]]></description><pubDate>Wed, 12 Jul 2023 13:54:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=36694549</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36694549</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36694549</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Introduction to vector similarity search (2022)"]]></title><description><![CDATA[
<p>This is also an interesting piece of how to do it completely for free: <a href="https://news.ycombinator.com/item?id=36693239">https://news.ycombinator.com/item?id=36693239</a></p>
]]></description><pubDate>Wed, 12 Jul 2023 12:44:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=36693744</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36693744</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36693744</guid></item><item><title><![CDATA[New comment by kacperlukawski in "Serverless Semantic Search, Free tier only"]]></title><description><![CDATA[
<p>It's a bit easier in Python if you use tools like <a href="https://www.serverless.com/" rel="nofollow noreferrer">https://www.serverless.com/</a>. I'm not sure if Rust has something similar yet.</p>
]]></description><pubDate>Wed, 12 Jul 2023 12:25:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=36693536</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36693536</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36693536</guid></item><item><title><![CDATA[New comment by kacperlukawski in "MdBook – A command line tool to create books with Markdown"]]></title><description><![CDATA[
<p>It's Hugo, with a custom styling.
<a href="https://gohugo.io/documentation/" rel="nofollow noreferrer">https://gohugo.io/documentation/</a></p>
]]></description><pubDate>Fri, 30 Jun 2023 12:26:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=36533666</link><dc:creator>kacperlukawski</dc:creator><comments>https://news.ycombinator.com/item?id=36533666</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36533666</guid></item></channel></rss>