<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ambitious_potat</title><link>https://news.ycombinator.com/user?id=ambitious_potat</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 27 Apr 2026 08:29:20 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ambitious_potat" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Why the hell is this showing up]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/adityaprasad-sudo/Explore-Singapore">https://github.com/adityaprasad-sudo/Explore-Singapore</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46948124">https://news.ycombinator.com/item?id=46948124</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 09 Feb 2026 17:32:38 +0000</pubDate><link>https://github.com/adityaprasad-sudo/Explore-Singapore</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46948124</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46948124</guid></item><item><title><![CDATA[New comment by ambitious_potat in "Show HN: The biggest achievement of my life so far"]]></title><description><![CDATA[
<p>Appreciate the words man</p>
]]></description><pubDate>Mon, 09 Feb 2026 11:32:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46944107</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46944107</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46944107</guid></item><item><title><![CDATA[New comment by ambitious_potat in "Show HN: The biggest achievement of my life so far"]]></title><description><![CDATA[
<p>Oh hey i forgot to mention here's the live demo:-
<a href="https://adityaprasad-sudo.github.io/Explore-Singapore/" rel="nofollow">https://adityaprasad-sudo.github.io/Explore-Singapore/</a></p>
]]></description><pubDate>Mon, 09 Feb 2026 10:06:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46943570</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46943570</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46943570</guid></item><item><title><![CDATA[New comment by ambitious_potat in "Ask HN: What are you working on? (February 2026)"]]></title><description><![CDATA[
<p>I built an Legaltech for Singapore with RAG architecture and triple llm backup logic<p>GitHub:- <a href="https://github.com/adityaprasad-sudo/Explore-Singapore" rel="nofollow">https://github.com/adityaprasad-sudo/Explore-Singapore</a>
Live demo :-
<a href="https://adityaprasad-sudo.github.io/Explore-Singapore/" rel="nofollow">https://adityaprasad-sudo.github.io/Explore-Singapore/</a></p>
]]></description><pubDate>Mon, 09 Feb 2026 05:51:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46941982</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46941982</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46941982</guid></item><item><title><![CDATA[Show HN: The biggest achievement of my life so far]]></title><description><![CDATA[
<p>Hello everyone,<p>I have always loved coding and in the couple I was thinking of making an open source project and it turned out to be awesome I hope you guys like it.<p>I present Explore Singapore which I created as an open-source intelligence engine to execute retrieval-augmented generation (RAG) on Singapore's public policy documents and legal statutes and historical archives.<p>The objective required building a domain-specific search engine which enables LLM systems to decrease errors by using government documents as their exclusive information source.<p>What my Project does :- basically it provides legal information faster and reliable(due to RAG) without going through long PDFs of goverment websites and helps travellers get insights faster about Singapore.<p>Target Audience:- Python developers who keep hearing about "RAG" and AI agents but haven't build one yet or building one and are stuck somewhere also Singaporean people(obviously!)<p>Comparison:- RAW LLM vs RAG based LLM to test the rag implementation i compared output of my logic code against the standard(gemini/Arcee AI/groq) and custom system instructions with rag(gemini/Arcee AI/groq) results were shocking query:- "can I fly in a drone in public park" standard llm response :- ""gave generic advice about "checking local laws"  and safety guidelines""
Customized llm with RAG :- ""cited the air navigation act,specified the 5km no fly zones,and linked to the CAAS permit page"" the difference was clear and it was sure that the ai was not hallucinating.<p>Ingestion:- I have the RAG Architecture about 594 PDFs about Singaporian laws and acts which rougly contains 33000 pages.<p>How did I do it :- I used google Collab to build vector database and metadata which nearly took me 1 hour to do so ie convert PDFs to vectors.<p>How accurate is it:- It's still in development phase but still it provides near accurate information as it contains multi query retrieval ie if a user asks ("ease of doing business in Singapore") the logic would break the keywords "ease", "business", "Singapore" and provide the required documents from the PDFs with the page number also it's a little hard to explain but you can check it on my webpage.Its not perfect but hey i am still learning.<p>The Tech Stack:  
Ingestion: Python scripts using PyPDF2 to parse various PDF formats.  
Embeddings: Hugging Face BGE-M3(1024 dimensions)
Vector Database: FAISS for similarity search.  
Orchestration: LangChain.  
Backend: Flask
Frontend: React and Framer.<p>The RAG Pipeline operates through the following process:  
Chunking: The source text is divided into chunks of 150 with an overlap of 50 tokens to maintain context across boundaries.  
Retrieval: When a user asks a question (e.g., "What is the policy on HDB grants?"), the system queries the vector database for the top k chunks (k=1).  
Synthesis: The system adds these chunks to the prompt of LLMs which produces the final response that includes citation information.
Why did I say llms :- because I wanted the system to be as non crashable as possible so I am using gemini as my primary llm to provide responses but if it fails to do so due to api requests or any other reasons the backup model(Arcee AI trinity large) can handle the requests.<p>Don't worry :- I have implemented different system instructions for different models so that result is a good quality product.<p>Current Challenges:  
I am working on optimizing the the ranking strategy of the RAG architecture. I would value insights from anyone who has encountered RAG returning unrelevant documents.<p>Feedbacks are the backbone of improving a platform so they are most<p>Repository:- <a href="https://github.com/adityaprasad-sudo/Explore-Singapore" rel="nofollow">https://github.com/adityaprasad-sudo/Explore-Singapore</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46937543">https://news.ycombinator.com/item?id=46937543</a></p>
<p>Points: 9</p>
<p># Comments: 5</p>
]]></description><pubDate>Sun, 08 Feb 2026 19:21:10 +0000</pubDate><link>https://github.com/adityaprasad-sudo/Explore-Singapore</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46937543</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46937543</guid></item><item><title><![CDATA[New comment by ambitious_potat in "Show HN: I built a RAG engine to search Singaporean laws"]]></title><description><![CDATA[
<p>Yep I implemented triple models as a backup so good of you that you actually read the code thanks dude for such kind words</p>
]]></description><pubDate>Sat, 07 Feb 2026 05:09:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46921501</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46921501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46921501</guid></item><item><title><![CDATA[New comment by ambitious_potat in "Show HN: I built a RAG engine to search Singaporean laws"]]></title><description><![CDATA[
<p>Thank you for such great words now the question 
When I built the vector databse I also created a metadata file that has all the info about pages and topics.
Again thanks dude</p>
]]></description><pubDate>Sat, 07 Feb 2026 05:08:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=46921495</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46921495</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46921495</guid></item><item><title><![CDATA[Show HN: I built a RAG engine to search Singaporean laws]]></title><description><![CDATA[
<p>I built a "Triple Failover" RAG for Singapore Laws, then rewrote the logic based on your feedback.<p>Hi everyone!<p>I’m a student developer. Recently, I created Explore Singapore, a RAG-based search engine that scrapes about 20,000 pages of Singaporean government acts and laws.<p>I recently posted the MVP and received some tough but essential feedback about hallucinations and query depth. I took that feedback, focused on improvements, and just released Version 2.<p>Here is how I upgraded the system from a basic RAG to a production-grade one.<p>The Design & UI
I aimed to avoid a dull government website.<p>Design: Heavily inspired by Apple’s minimalist style.<p>Tech: Custom frontend interacting with a Python backend.<p>The V2 Engineering Overhaul<p>The community challenged me on three main points. Here’s how I addressed them:<p>1. The "Personality" Fix
Issue: I use a "Triple Failover" system with three models as backup. When the main model failed, the backups sounded entirely different.<p>The Solution: I added Dynamic System Instructions. Now, if the backend switches to Model B, it uses a specific prompt designed for Model B’s features, making it mimic the structure and tone of the primary model. The user never notices the change.<p>2. The "Deep Search" Fix
Issue: A simple semantic search for "Starting a business" misses related laws like "Tax" or "Labor" acts.<p>The Solution: I implemented Multi-Query Retrieval (MQR). An LLM now intercepts your query. It breaks it down into sub-intents (e.g., “Business Registration,” “Corporate Tax,” “Employment Rules”). It searches for all of them at the same time and combines the results.<p>Result: Much richer, context-aware answers.<p>3. The "Hallucination" Fix
Issue: Garbage In, Garbage Out. If FAISS retrieves a bad document, the LLM produces inaccurate information.<p>The Solution: I added a Cross-Encoder Re-Ranking layer.<p>Step 1: FAISS grabs the top 10 results.<p>Step 2: A specialized Cross-Encoder model evaluates them for relevance.<p>Step 3: Irrelevant parts are removed before they reach the Chat LLM.<p>*<p>The Tech Stack
*<p>Embeddings: BGE-M3 (Running locally)<p>Vector DB: FAISS<p>Backend: Python + Custom Triple-Model Failover<p>Logic: Multi-Query + Re-Ranking (New in V2)<p>Try it out<p>I am still learning. I’d love to hear your thoughts on the new logic.<p>Live Demo: <a href="https://adityaprasad-sudo.github.io/Explore-Singapore/" rel="nofollow">https://adityaprasad-sudo.github.io/Explore-Singapore/</a><p>GitHub Repo: <a href="https://github.com/adityaprasad-sudo/Explore-Singapore" rel="nofollow">https://github.com/adityaprasad-sudo/Explore-Singapore</a><p>Feedback, especially on the failover speed, is welcome!</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46921180">https://news.ycombinator.com/item?id=46921180</a></p>
<p>Points: 5</p>
<p># Comments: 4</p>
]]></description><pubDate>Sat, 07 Feb 2026 03:58:51 +0000</pubDate><link>https://github.com/adityaprasad-sudo/Explore-Singapore</link><dc:creator>ambitious_potat</dc:creator><comments>https://news.ycombinator.com/item?id=46921180</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46921180</guid></item></channel></rss>