<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: hasyimibhar</title><link>https://news.ycombinator.com/user?id=hasyimibhar</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 11 Jun 2026 07:39:00 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=hasyimibhar" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by hasyimibhar in "PgDog is funded and coming to a database near you"]]></title><description><![CDATA[
<p>What about Multigres[0]? It builds on top of Postgres and adds HA (based on Flexible Paxos[1]), sharding, etc. They're still not production-ready, but I'm highly optimistic they will solve a lot of the problems Postgres have.<p>For example, with Multigres, you should be able to achieve true zero downtime major version upgrade by simply resharding [2]. With vanilla Postgres + pgBouncer, you can only achieve near-zero downtime (few seconds at most), though it's probably good enough for most use cases.<p>[0] <a href="https://multigres.com/" rel="nofollow">https://multigres.com/</a><p>[1] <a href="https://fpaxos.github.io/" rel="nofollow">https://fpaxos.github.io/</a><p>[2] <a href="https://multigres.com/docs#migrate-across-postgres-versions" rel="nofollow">https://multigres.com/docs#migrate-across-postgres-versions</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 02:15:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48485481</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48485481</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48485481</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Migrating from Go to Rust"]]></title><description><![CDATA[
<p>It is also easier to make your code deterministic with Rust vs with Go, which is incredibly useful if you need to perform deterministic simulation testing + property-based testing. I recently wrote a Postgres-to-Iceberg data mirroring tool [1] in Go, but I ported it to Rust because I wanted the ability perform DST without fighting Go's runtime [2]. But if the domain is not critical that warrants DST, I would still pick Go over Rust any day.<p>[1] <a href="https://github.com/polynya-dev/pg2iceberg" rel="nofollow">https://github.com/polynya-dev/pg2iceberg</a><p>[2] <a href="https://www.polarsignals.com/blog/posts/2024/05/28/mostly-dst-in-go" rel="nofollow">https://www.polarsignals.com/blog/posts/2024/05/28/mostly-ds...</a></p>
]]></description><pubDate>Sun, 24 May 2026 22:57:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48261845</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48261845</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48261845</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Incident Report: Railway Blocked by Google Cloud (Resolved)"]]></title><description><![CDATA[
<p>The problem with the us-east-1 outage is that a lot of big companies are there, so even if you try your best not to depend on us-east-1, your third party providers are most likely there. In my previous company, we were completely down during us-east-1 outage because of other dependencies that are beyond our control.</p>
]]></description><pubDate>Wed, 20 May 2026 04:00:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48202927</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48202927</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48202927</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Lakebase architecture delivers faster Postgres writes"]]></title><description><![CDATA[
<p>In some cases you have no choice but to retain the data, e.g. due to compliance. But the good thing is it doesn't have to be in Postgres. You can periodically offload data to a lakehouse, then delete it from Postgres. If the table is partitioned, delete should be cheap.<p>I'm guessing with Neon, since their storage is a lakehouse, you get this for free.</p>
]]></description><pubDate>Mon, 11 May 2026 02:45:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48090528</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48090528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48090528</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Lakebase architecture delivers faster Postgres writes"]]></title><description><![CDATA[
<p>How does Lakebase compare to OrioleDB[0]?<p>[0] <a href="https://www.orioledb.com/" rel="nofollow">https://www.orioledb.com/</a></p>
]]></description><pubDate>Sun, 10 May 2026 22:57:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48089007</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48089007</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48089007</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"]]></title><description><![CDATA[
<p>You should check them out, their interface pretty much looks like chat nowadays.</p>
]]></description><pubDate>Tue, 05 May 2026 02:32:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48017452</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=48017452</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48017452</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"]]></title><description><![CDATA[
<p>How does this compare to open source Deepnote[0]? We use the cloud version (BYOC) at my previous company to replace self-hosted Jupyter notebooks, and it's pretty great.<p>[0] <a href="https://github.com/deepnote/deepnote" rel="nofollow">https://github.com/deepnote/deepnote</a></p>
]]></description><pubDate>Sat, 02 May 2026 14:48:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47986873</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47986873</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47986873</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Show HN: DAC – open-source dashboard as code tool for agents and humans"]]></title><description><![CDATA[
<p>I mean that's what the Vega team is doing no? They are building the standard grammar (Vega-Lite), along with an implementation (Vega). And they are already quite established with rich ecosystem, and supports a ton of components[0]. The only thing missing is that it expects a CSV or inline data source. But it's probably not too hard to build an extension that connects to a data warehouse with an SQL query.<p>[0] <a href="https://vega.github.io/vega-lite/examples" rel="nofollow">https://vega.github.io/vega-lite/examples</a></p>
]]></description><pubDate>Sat, 02 May 2026 14:45:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47986840</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47986840</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47986840</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Show HN: DAC – open-source dashboard as code tool for agents and humans"]]></title><description><![CDATA[
<p>Why not use Vega-Lite[0]? It’s my go-to data viz DSL with Claude.<p>[0] <a href="https://vega.github.io/vega-lite/" rel="nofollow">https://vega.github.io/vega-lite/</a></p>
]]></description><pubDate>Sat, 02 May 2026 13:23:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47986203</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47986203</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47986203</guid></item><item><title><![CDATA[New comment by hasyimibhar in "If I could make my own GitHub"]]></title><description><![CDATA[
<p>You can skip by running git commit --no-verify. I know this because I also hate pre-commit checks, and I will automatically use it when working with any codebase that has one.</p>
]]></description><pubDate>Fri, 01 May 2026 19:47:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47979335</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47979335</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47979335</guid></item><item><title><![CDATA[New comment by hasyimibhar in "The end of "Just ask Sarah""]]></title><description><![CDATA[
<p>Having worked at enterprise with 100+ engineers across multiple teams with complex reporting lines, I definitely agree with this. There's a lot of nuance behind decisions that docs simply can't capture. I mean in theory you can probably write every considerations that led you to make a certain decision, e.g.:<p>- I'm writing this service even though team X has built the same thing, because my team lead doesn't trust team X since the last time we depended on their service 3 years ago, they had a major downtime that screwed us up big time<p>- This service is using AWS Lambda simply because I think it's cool, despite the fact that the company has a dedicated team running k8s stack with argocd, argo rollouts, KEDA, etc for the entire company<p>- Service Y is written in this particular way because it's a service that is shared with another team that came from a company that was acquired, and they wouldn't use it unless we write it this particular way, and making the top execs happy is more important than dealing with a small tech debt (this is probably true)<p>But no one is going to write these in their RFC. But Sarah knows.</p>
]]></description><pubDate>Fri, 01 May 2026 19:32:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47979142</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47979142</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47979142</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Does Postgres Scale?"]]></title><description><![CDATA[
<p>AFAIK that's what Multigres[0] and Neki[1] are trying to solve.<p>[0] <a href="https://multigres.com/" rel="nofollow">https://multigres.com/</a>
[1] <a href="https://neki.dev/" rel="nofollow">https://neki.dev/</a></p>
]]></description><pubDate>Fri, 01 May 2026 07:47:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47972233</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47972233</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47972233</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Show HN: Rocky – Rust SQL engine with branches, replay, column lineage"]]></title><description><![CDATA[
<p>Looks cool, I've been waiting for someone to build this since dbt and SQLMesh acquisition. It would be great to have model versioning and support for ClickHouse SQL.</p>
]]></description><pubDate>Wed, 29 Apr 2026 08:24:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47945572</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47945572</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47945572</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Optimizing Datalog for the GPU"]]></title><description><![CDATA[
<p>Slightly off-topic but Datalog related: is there a way for me to query data in ClickHouse using Datalog without any ETL? I want to do fraud analysis, and I've been reading about how Datalog is a lot better at these kind of use cases than SQL.</p>
]]></description><pubDate>Mon, 27 Apr 2026 11:44:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47920316</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47920316</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47920316</guid></item><item><title><![CDATA[New comment by hasyimibhar in "An AI agent deleted our production database. The agent's confession is below"]]></title><description><![CDATA[
<p>Ok then it's definitely the author's fault for clicking "Always Allow". I don't even trust my agent to run grep without approval, let alone curl.</p>
]]></description><pubDate>Sun, 26 Apr 2026 22:01:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47915137</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47915137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47915137</guid></item><item><title><![CDATA[Show HN: Polynya – Turn your Postgres into workspaces for AI]]></title><description><![CDATA[
<p>The idea is simple: AI agents need real-time data to be useful. But streaming real-time data into your data warehouse means you need the data warehouse to be up 24/7. This is expensive and wasteful. What if you could spin up an ephemeral data warehouse only when your agent needs it, and get real-time data at the same time?<p>The solution: Polynya replicates your data into Iceberg, and gives your agent an ephemeral ClickHouse instance on demand. Polynya also provides persistent workspaces — collections of views that survive across sessions. So from your agent's point of view, it's a 24/7 data warehouse.<p>At its core, Polynya is a data platform for streaming real-time data from Postgres to Iceberg. But instead of spending hours setting up costly and complex pipelines involving Kafka, Debezium, Flink, etc., Polynya lets you do this with just one command:<p>npx polynya create<p>It's currently free on early access, so do try it out and give me any feedback! Currently there is no web dashboard yet (coming soon), you interact with it 100% through CLI.<p>P.S. Polynya is built on top of pg2iceberg, the open source Postgres to Iceberg replication tool I've been building for the past few weeks: <a href="https://pg2iceberg.dev/" rel="nofollow">https://pg2iceberg.dev/</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47914821">https://news.ycombinator.com/item?id=47914821</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 26 Apr 2026 21:35:15 +0000</pubDate><link>https://polynya.dev/</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47914821</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47914821</guid></item><item><title><![CDATA[New comment by hasyimibhar in "An AI agent deleted our production database. The agent's confession is below"]]></title><description><![CDATA[
<p>I'm not familiar with Cursor, does it allow the agent to have access to run "curl -X POST" with no approval, i.e. a popup will show up asking you to approve/deny/always approve? AFAIK with Claude Code, this can only happen if you use something like "--dangerously-skip-permissions". I have never used this, I manually approve all commands my agent runs. Pretty insane that people are giving agents to do whatever it wants and trusting the guardrails will work 100% of the time.</p>
]]></description><pubDate>Sun, 26 Apr 2026 19:40:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47913343</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47913343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47913343</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Dear friend, you have built a Kubernetes (2024)"]]></title><description><![CDATA[
<p>I've experienced something like this at work but with data warehouse instead, and it happened multiple times (to be fair, data engineering is still fairly new where I'm from).<p>One example was an engineer wanted to build an API that accepts large CSV (GBs of credit reports) to extract some data and perform some aggregations. He was in the process of discussing with SREs on the best way to process the huge CSV file without using k8s stateful set, and the solution he was about to build was basically writing to S3 and having a worker asynchronously load and process the CSV in chunks, then finally writing the aggregation to db.<p>I stepped in and told him he was about to build a data warehouse. :P</p>
]]></description><pubDate>Sun, 26 Apr 2026 18:24:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47912568</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47912568</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47912568</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Agentic AI systems violate the implicit assumptions of database design"]]></title><description><![CDATA[
<p>I'm not sure why you are giving your agents write access to query your OLTP database, let alone write to it. The pattern that I use at work is:<p>- Read access through OLAP, not OLTP. You just need to setup a near real-time replication between your OLTP and OLAP.<p>- Write access through API, just like your application. You can add fancy things like approval layer, e.g. you agent cannot "ban_user(id)", but it can "request_to_ban_user(id)", and the action only happens once you approve it.</p>
]]></description><pubDate>Sun, 26 Apr 2026 18:12:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47912486</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47912486</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47912486</guid></item><item><title><![CDATA[New comment by hasyimibhar in "Agentic AI systems violate the implicit assumptions of database design"]]></title><description><![CDATA[
<p>> Why<p>Based on my experience with Claude, it's pretty damn good at doing data analysis, if given the right curated data models. You still need to eyeball the generated SQL to make sure it makes sense.<p>> and how?<p>1. Replicate your Postgres into Snowflake/Databricks/ClickHouse/etc, or directly to Iceberg and hook it up to Snowflake/Databricks/ClickHouse/etc.<p>2. Give your agent read access to query it.<p>3. Build dimensional models (facts and dimensions tables) from the raw data. You can ask LLM for help here, Claude is pretty good at designing data models in my experience.<p>4. Start asking your agent questions about your data.<p>Keep steps 3-4 as a tight feedback loop. Every time your agent hallucinates or struggle to answer your questions, improve the model.<p>Side note: I'm currently building a platform that does all 3 (though you still need to do 2 yourself), you just need Postgres + 1 command to set it up: <a href="https://polynya.dev/" rel="nofollow">https://polynya.dev/</a></p>
]]></description><pubDate>Sun, 26 Apr 2026 18:05:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47912413</link><dc:creator>hasyimibhar</dc:creator><comments>https://news.ycombinator.com/item?id=47912413</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47912413</guid></item></channel></rss>