<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: dmitrykan</title><link>https://news.ycombinator.com/user?id=dmitrykan</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 30 Apr 2026 10:08:47 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=dmitrykan" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by dmitrykan in "Ask HN: What Are You Working On? (December 2025)"]]></title><description><![CDATA[
<p>On a Research Assistant: <a href="https://muves.io/" rel="nofollow">https://muves.io/</a><p>It helps to comprehend research papers (and not only papers - any document on any language) faster.<p>The tool is free to use, because we have credits from GCP. I guess at some point we'll need to introduce some level of subscription fee to keep it alive and useful, as it uses LLMs and vector search quite a bit.<p>Feedback is welcome!</p>
]]></description><pubDate>Sun, 14 Dec 2025 21:29:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46267137</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=46267137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46267137</guid></item><item><title><![CDATA[New comment by dmitrykan in "[dead]"]]></title><description><![CDATA[
<p>Leo Boytsov, my guest on the Vector Podcast, made an honest and an eye-opening claim about vector search being intellectually rewarding, but professionally undervalued. What picked my attention was how he gives credit to people who actually deserve it, and how he speaks modestly about his own achievements. When professionally as a researcher, he accumulated over 1800 citations by now and helped to create the famous HNSW vector search algorithm.</p>
]]></description><pubDate>Sat, 08 Feb 2025 07:13:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=42981110</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=42981110</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42981110</guid></item><item><title><![CDATA[New comment by dmitrykan in "The Rise, Fall, and Future of Vector Databases: How to Pick the One That Lasts"]]></title><description><![CDATA[
<p>My take on whether we've lost the category of vector databases.</p>
]]></description><pubDate>Thu, 16 Jan 2025 21:28:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=42731127</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=42731127</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42731127</guid></item><item><title><![CDATA[The Rise, Fall, and Future of Vector Databases: How to Pick the One That Lasts]]></title><description><![CDATA[
<p>Article URL: <a href="https://dmitry-kan.medium.com/the-rise-fall-and-future-of-vector-databases-how-to-pick-the-one-that-lasts-6b9fbb43bbbe">https://dmitry-kan.medium.com/the-rise-fall-and-future-of-vector-databases-how-to-pick-the-one-that-lasts-6b9fbb43bbbe</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42731126">https://news.ycombinator.com/item?id=42731126</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 16 Jan 2025 21:28:41 +0000</pubDate><link>https://dmitry-kan.medium.com/the-rise-fall-and-future-of-vector-databases-how-to-pick-the-one-that-lasts-6b9fbb43bbbe</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=42731126</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42731126</guid></item><item><title><![CDATA[New comment by dmitrykan in "OTranscribe: A free and open tool for transcribing audio interviews"]]></title><description><![CDATA[
<p>I'm working on the tool, that includes AI. My original target is to test it on my <a href="https://www.youtube.com/c/VectorPodcast" rel="nofollow">https://www.youtube.com/c/VectorPodcast</a> by offering something that Lex Fridman does for his episodes.<p>Current features:
1. Download from YT
2. Transcribe using Vosk (output has time codes included)
3. Speaker diarization using pyannote - this isn't perfect and needs a bit more ironing out.<p>What needs to be done:
4. Store the transcription in a search engine (can include vectors)
5. Implement a webapp<p>If anyone here is interested to join forces, let me know.</p>
]]></description><pubDate>Fri, 09 Aug 2024 11:56:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=41200973</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=41200973</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41200973</guid></item><item><title><![CDATA[New comment by dmitrykan in "Ask HN: Could you share your personal blog here?"]]></title><description><![CDATA[
<p><a href="https://dmitry-kan.medium.com/" rel="nofollow noreferrer">https://dmitry-kan.medium.com/</a><p>I write about vector search, ANN algorithms, neural search frameworks, search engines and algorithms in general and publish episodes of the Vector Podcast.</p>
]]></description><pubDate>Wed, 05 Jul 2023 07:12:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=36596696</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=36596696</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36596696</guid></item><item><title><![CDATA[New comment by dmitrykan in "What are embeddings?"]]></title><description><![CDATA[
<p>Nice post, thanks for sharing! Learnt about new dbs / solutions for vector search from it.<p>Vicki mentions this survey I wrote some time ago: <a href="https://towardsdatascience.com/milvus-pinecone-vespa-weaviate-vald-gsi-what-unites-these-buzz-words-and-what-makes-each-9c65a3bd0696" rel="nofollow noreferrer">https://towardsdatascience.com/milvus-pinecone-vespa-weaviat...</a><p>Hoping it'll be useful as well!</p>
]]></description><pubDate>Mon, 26 Jun 2023 21:08:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=36485877</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=36485877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36485877</guid></item><item><title><![CDATA[New comment by dmitrykan in "Neural Search Frameworks: A Head-to-Head Comparison"]]></title><description><![CDATA[
<p>thank you, Doug! It is mutual. Had a fantastic discussion with you on the podcast: <a href="https://www.youtube.com/watch?v=Kpua1Euc-B8">https://www.youtube.com/watch?v=Kpua1Euc-B8</a></p>
]]></description><pubDate>Sat, 24 Dec 2022 09:44:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=34115396</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=34115396</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34115396</guid></item><item><title><![CDATA[New comment by dmitrykan in "Vector Podcast – Journey of Vespa from Sparse into Neural Search"]]></title><description><![CDATA[
<p>It was great to discuss with Jo Kristian on these topics and more:<p>- History of Vespa
- Tensor data structure and its use cases
- Multi-stage ranking pipeline
- Game-changing vector search in Vespa
- Approximate vs exact nearest neighbor search tradeoffs
- Misconceptions in neural search
- Multimodal search is where vector search shines
- Power of building fully-fledged demos
- How to combine vector search with sparse search: Reciprocal Rank Fusion
- The question of WHY (my favourite)<p>I wonder what topics are interesting to HN community -- it would help me focus on these topics / embed into my questions in new episodes.</p>
]]></description><pubDate>Tue, 12 Apr 2022 16:07:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=31004523</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=31004523</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31004523</guid></item><item><title><![CDATA[New comment by dmitrykan in "The next Google"]]></title><description><![CDATA[
<p>Thanks for the article, I've learnt new search engines despite spending a couple of years recently in web scale search. I think you may consider <a href="https://usearch.com/" rel="nofollow">https://usearch.com/</a> as another dimension in web scale search, where query log is learnt from the data, making it quite unique.</p>
]]></description><pubDate>Wed, 06 Apr 2022 06:33:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=30929074</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=30929074</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30929074</guid></item><item><title><![CDATA[New comment by dmitrykan in "Show HN: We made an open-source personalization engine"]]></title><description><![CDATA[
<p>Great project! Elasticsearch / OpenSearch / Solr have their own learning to rank plugins. Have you considered integrating Metarank with such systems? Or is your vision to provide a reranker layer, that can be independent of the underlying search engine architecture?</p>
]]></description><pubDate>Wed, 23 Mar 2022 20:27:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=30782896</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=30782896</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30782896</guid></item><item><title><![CDATA[New comment by dmitrykan in "Vectorizing the entire eCFR for $3.00 and 22 minutes"]]></title><description><![CDATA[
<p>This is fantastic work, Max! I like where you're going with this unit economy.<p>It is increasingly more and more common that companies need ways of productizing their deep learning models with tight SLA constraints and tight budgets.
I think, one of the logical steps could be integration with vector databases / frameworks and measuring gains there in client setups.</p>
]]></description><pubDate>Wed, 16 Mar 2022 16:20:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=30700759</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=30700759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30700759</guid></item><item><title><![CDATA[New comment by dmitrykan in "[dead]"]]></title><description><![CDATA[
<p>5th episode of Vector Podcast with Tom Lackner, VP Engineering at classic.com explaning the choise of Qdrant vector DB for his projects.</p>
]]></description><pubDate>Wed, 22 Dec 2021 12:59:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=29648685</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29648685</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29648685</guid></item><item><title><![CDATA[New comment by dmitrykan in "Semantic search through Wikipedia with the Weaviate vector search engine"]]></title><description><![CDATA[
<p>thanks!</p>
]]></description><pubDate>Mon, 29 Nov 2021 16:50:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=29382016</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29382016</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29382016</guid></item><item><title><![CDATA[New comment by dmitrykan in "Semantic search through Wikipedia with the Weaviate vector search engine"]]></title><description><![CDATA[
<p>To learn about vector search and get practical with it, you can check out my blog as well: <a href="https://dmitry-kan.medium.com/list/vector-search-e9b564d14274" rel="nofollow">https://dmitry-kan.medium.com/list/vector-search-e9b564d1427...</a><p>I'm also hosting a podcast around these topics: <a href="https://www.youtube.com/channel/UCCIMPfR7TXyDvlDRXjVhP1g" rel="nofollow">https://www.youtube.com/channel/UCCIMPfR7TXyDvlDRXjVhP1g</a></p>
]]></description><pubDate>Mon, 29 Nov 2021 16:49:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=29382014</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29382014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29382014</guid></item><item><title><![CDATA[New comment by dmitrykan in "Semantic search through Wikipedia with the Weaviate vector search engine"]]></title><description><![CDATA[
<p>11M+ Wikipedia articles indexed for similarity search.</p>
]]></description><pubDate>Mon, 29 Nov 2021 15:39:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=29381052</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29381052</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29381052</guid></item><item><title><![CDATA[Semantic search through Wikipedia with the Weaviate vector search engine]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/semi-technologies/semantic-search-through-wikipedia-with-weaviate">https://github.com/semi-technologies/semantic-search-through-wikipedia-with-weaviate</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=29381051">https://news.ycombinator.com/item?id=29381051</a></p>
<p>Points: 13</p>
<p># Comments: 4</p>
]]></description><pubDate>Mon, 29 Nov 2021 15:39:57 +0000</pubDate><link>https://github.com/semi-technologies/semantic-search-through-wikipedia-with-weaviate</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29381051</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29381051</guid></item><item><title><![CDATA[New comment by dmitrykan in "Vector Search"]]></title><description><![CDATA[
<p>From code to podcast to billion-scale ANN challenge and statistics</p>
]]></description><pubDate>Fri, 12 Nov 2021 20:52:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=29204389</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29204389</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29204389</guid></item><item><title><![CDATA[Vector Search]]></title><description><![CDATA[
<p>Article URL: <a href="https://dmitry-kan.medium.com/list/vector-search-e9b564d14274">https://dmitry-kan.medium.com/list/vector-search-e9b564d14274</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=29204388">https://news.ycombinator.com/item?id=29204388</a></p>
<p>Points: 3</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 12 Nov 2021 20:52:12 +0000</pubDate><link>https://dmitry-kan.medium.com/list/vector-search-e9b564d14274</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=29204388</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29204388</guid></item><item><title><![CDATA[New comment by dmitrykan in "Ć Programming Language"]]></title><description><![CDATA[
<p>Thanks! One practical target I imagine having is:
1. write the program in Python.
2. translate it into Ć ("interlingua")
3. compile Ć into Go -- for instance there is a belief around me, that Python is not suitable for web scale loads, while Go is more suitable.<p>Go source code / binary in this case are of less importance for code readability, because they are meant for production deployments. Something that happened in GWT: write in Java, compile into JavaScript.</p>
]]></description><pubDate>Sun, 10 Oct 2021 10:24:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=28817270</link><dc:creator>dmitrykan</dc:creator><comments>https://news.ycombinator.com/item?id=28817270</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28817270</guid></item></channel></rss>