<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: prvnsmpth</title><link>https://news.ycombinator.com/user?id=prvnsmpth</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 03 May 2026 16:24:12 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=prvnsmpth" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by prvnsmpth in "Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge"]]></title><description><![CDATA[
<p>Use kimi 2.6 for planning and a cheap model (preferably local) for execution, and then kimi once again for reviewing it. Then finally I review the code. Saves a lot on tokens.</p>
]]></description><pubDate>Sun, 03 May 2026 05:18:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47993595</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47993595</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47993595</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Kimi K2.6 just beat Claude, GPT-5.5, and Gemini in a coding challenge"]]></title><description><![CDATA[
<p>You can sign up for a plan on the kimi code platform and use it via the pi.dev coding agent, or opencode. In planning, I’d say it’s almost on par with Claude Opus.</p>
]]></description><pubDate>Sun, 03 May 2026 05:15:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47993570</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47993570</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47993570</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Thanks Philippe! You guys have been super helpful on slack!</p>
]]></description><pubDate>Mon, 02 Mar 2026 16:58:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47220606</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47220606</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47220606</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>So it depends on the app - e.g., Google has domain-wide delegation where the workspace admin can provide service account creds that allow us to impersonate all users in the workspace and index all their files/email. During indexing, we determine the users/groups who have permissions file and persist that in the db. (It's not perfect, because Google Drive permission model is a bit complex, but I'm working on it.) This model is much simpler than doing per-user OAuth.<p>In general, the goal is to use an org-wide installation method wherever possible, and record the identify of the user we are impersonating when ingesting data in the ACL. There are some gaps in the permission-gathering step in some of the connectors, I'm still working on fixing those.</p>
]]></description><pubDate>Mon, 02 Mar 2026 16:22:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47220063</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47220063</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47220063</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Fair point, although I think we have OpenAI to blame for that - for buying chat.com and pointing it to the most popular textual AI interface of them all :)</p>
]]></description><pubDate>Mon, 02 Mar 2026 13:17:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217621</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217621</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217621</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>So far both projects are quite similar… the only major difference being the search index. Onyx uses vespa.ai for BM25 and vector search, I decided to go down the Postgres-only route.</p>
]]></description><pubDate>Mon, 02 Mar 2026 13:14:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217577</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217577</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217577</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>That's a good point, it might make sense to clarify that for individuals who want to self-host. I'll make the change, thanks!</p>
]]></description><pubDate>Mon, 02 Mar 2026 12:36:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217243</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217243</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217243</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Exactly, enterprise customers almost always use private model endpoints on their cloud provider for any serious deployments. Data stays within the customer's VPC, data security and privacy is guaranteed by the cloud providers.</p>
]]></description><pubDate>Mon, 02 Mar 2026 12:28:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217157</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217157</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217157</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Thank you!<p>Typical RAG implementations I’ve seen take the user query and directly run it against the full-text search and embedding indexes. This produces sub-par results because the query embedding doesn’t really capture fully what the user is really looking for.<p>A better solution is to send the user query to the LLM, and let it construct and run queries against the index via tool calling. Nothing too ground-breaking tbh, pretty much every AI search agent does this now. But it produces much better results.</p>
]]></description><pubDate>Mon, 02 Mar 2026 12:22:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217092</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217092</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217092</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Not yet, there’s a Microsoft connector implementation, but it only does Sharepoint, OneDrive, Outlook etc. and I haven’t tested it thoroughly yet. Teams required some special setup to work IIRC, so I skipped it. Will keep it on the roadmap though!</p>
]]></description><pubDate>Mon, 02 Mar 2026 12:18:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47217058</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47217058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47217058</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Thanks for sharing! Big part of the reason why I decided on postgres, everything I've read about people using it in prod tells me that most organizations never really grow beyond requiring anything more than what it offers.</p>
]]></description><pubDate>Mon, 02 Mar 2026 11:27:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47216578</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47216578</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47216578</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Yeah, Omni uses Postgres and pgvector for search. ParadeDB is essentially just Postgres with the pgsearch extension that brings in Tantivy, a full-text search engine (like Apache Lucene).</p>
]]></description><pubDate>Mon, 02 Mar 2026 10:19:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=47216023</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47216023</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47216023</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Oops, sorry! That page is still a WIP, haven't pushed it yet. The plan was to expose the main search and chat APIs so that users can build integrations with third-party messaging apps (e.g. Slack), but haven't gotten around to properly documenting all the APIs yet.</p>
]]></description><pubDate>Mon, 02 Mar 2026 10:11:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47215956</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47215956</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47215956</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>I've done small scale experiments with up to 100-500k rows, and did not notice any significant degradation in search query latency - p95 still well under 1s.<p>I haven't directly compared against Elasticsearch yet, but I plan to do that next and publish some numbers. There's a benchmark harness setup already: <a href="https://github.com/getomnico/omni/tree/master/benchmarks" rel="nofollow">https://github.com/getomnico/omni/tree/master/benchmarks</a>, but there's a couple issues with it right now that I need to address first before I do a large scale run (the ParadeDB index settings need some tuning).</p>
]]></description><pubDate>Mon, 02 Mar 2026 10:05:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47215920</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47215920</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47215920</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Show HN: Omni – Open-source workplace search and chat, built on Postgres"]]></title><description><![CDATA[
<p>Thank you!<p>Currently permissions are handled in the app layer - it's simply a WHERE clause filter that restricts access to only those records that the user has read permissions for in the source. But I plan to upgrade this to use RLS in Postgres eventually.<p>For Slack specifically, right now the connector only indexes public channels. For private channels, I'm still working on full permission inheritance - capturing all channel members, and giving them read permissions to messages indexed from that channel. It's a bit challenging because channel members can change over time, and you'll have to keep permissions updated in real-time.</p>
]]></description><pubDate>Mon, 02 Mar 2026 09:51:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47215803</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47215803</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47215803</guid></item><item><title><![CDATA[Show HN: Omni – Open-source workplace search and chat, built on Postgres]]></title><description><![CDATA[
<p>Hey HN!<p>Over the past few months, I've been working on building Omni - a workplace search and chat platform that connects to apps like Google Drive/Gmail, Slack, Confluence, etc. Essentially an open-source alternative to Glean, fully self-hosted.<p>I noticed that some orgs find Glean to be expensive and not very extensible. I wanted to build something that small to mid-size teams could run themselves, so I decided to build it all on Postgres (ParadeDB to be precise) and pgvector. No Elasticsearch, or dedicated vector databases. I figured Postgres is more than capable of handling the level of scale required.<p>To bring up Omni on your own infra, all it takes is a single `docker compose up`, and some basic configuration to connect your apps and LLMs.<p>What it does:<p>- Syncs data from all connected apps and builds a BM25 index (ParadeDB) and HNSW vector index (pgvector)<p>- Hybrid search combines results from both<p>- Chat UI where the LLM has tools to search the index - not just basic RAG<p>- Traditional search UI<p>- Users bring their own LLM provider (OpenAI/Anthropic/Gemini)<p>- Connectors for Google Workspace, Slack, Confluence, Jira, HubSpot, and more<p>- Connector SDK to build your own custom connectors<p>Omni is in beta right now, and I'd love your feedback, especially on the following:<p>- Has anyone tried self-hosting workplace search and/or AI tools, and what was your experience like?<p>- Any concerns with the Postgres-only approach at larger scales?<p>Happy to answer any questions!<p>The code: <a href="https://github.com/getomnico/omni" rel="nofollow">https://github.com/getomnico/omni</a> (Apache 2.0 licensed)</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47215427">https://news.ycombinator.com/item?id=47215427</a></p>
<p>Points: 177</p>
<p># Comments: 42</p>
]]></description><pubDate>Mon, 02 Mar 2026 08:58:14 +0000</pubDate><link>https://github.com/getomnico/omni</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=47215427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47215427</guid></item><item><title><![CDATA[New comment by prvnsmpth in "Ask HN: What are you working on? (May 2025)"]]></title><description><![CDATA[
<p><a href="https://casepro.club" rel="nofollow">https://casepro.club</a><p>A platform for consulting aspirants to practice business case interviews.<p>Finding case prep partners is a major pain point for B-school students/consulting aspirants. Fortunately, frontier AI models are now good enough to function as surprisingly competent case interviewers.</p>
]]></description><pubDate>Wed, 28 May 2025 14:40:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=44116517</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=44116517</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44116517</guid></item><item><title><![CDATA[Show HN: Fine-tuning an LLM on your code for better code completions]]></title><description><![CDATA[
<p>Hey folks,<p>Just wanted to share an interesting experiment I ran to see what kind of performance gains can be achieved by fine-tuning a model to code from a single repo.<p>Results: The fine-tuned model achieves a 47% improvement in the code completion task (tab autocomplete). Accuracy goes from 25% to 36% (exact match against ground truth) after a short training run of only 500 iterations on a single RTX 4090 GPU.<p>This is an interesting result because it shows that there are significant gains to be had by fine-tuning to your own code.<p>Full details of the experiment: <a href="https://prvn.sh/build-your-own-github-copilot/" rel="nofollow">https://prvn.sh/build-your-own-github-copilot/</a><p>Highlights of the experiment:
  - Model: qwen2.5-coder 14b, 4-bit quantized
  - Training data: Svelte source files from this repo: <a href="https://github.com/hcengineering/platform" rel="nofollow">https://github.com/hcengineering/platform</a>
  - Unsloth for LoRA training with rank 16, 4096 sequence length
  - GPU: single RTX 4090
  - 500 iterations with effective batch size 8<p>A fine-tuned open-source model could be a real alternative to commercial coding assistants like GitHub Copilot, Cursor, etc., especially for organizations with a ton of legacy code that these LLMs have not seen, and also for those that are wary of exposing their code to external systems.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43390889">https://news.ycombinator.com/item?id=43390889</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 17 Mar 2025 17:40:54 +0000</pubDate><link>https://prvn.sh/build-your-own-github-copilot/</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=43390889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43390889</guid></item><item><title><![CDATA[New comment by prvnsmpth in "The Animated Transformer: An Intuitive Explanation of the Transformer Model"]]></title><description><![CDATA[
<p>I wrote a short article explaining the Transformer model and how it works, using the Manim library for generating animations. Please read through if the topic interests you, and leave your thoughts and feedback!</p>
]]></description><pubDate>Mon, 03 Jul 2023 10:50:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=36571174</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=36571174</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36571174</guid></item><item><title><![CDATA[The Animated Transformer: An Intuitive Explanation of the Transformer Model]]></title><description><![CDATA[
<p>Article URL: <a href="https://prvnsmpth.github.io/animated-transformer/">https://prvnsmpth.github.io/animated-transformer/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=36571173">https://news.ycombinator.com/item?id=36571173</a></p>
<p>Points: 3</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 03 Jul 2023 10:50:22 +0000</pubDate><link>https://prvnsmpth.github.io/animated-transformer/</link><dc:creator>prvnsmpth</dc:creator><comments>https://news.ycombinator.com/item?id=36571173</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36571173</guid></item></channel></rss>