Hacker News: hasyimibhar

New comment by hasyimibhar in "PgDog is funded and coming to a database near you"

hasyimibhar — Thu, 11 Jun 2026 02:15:01 +0000

What about Multigres[0]? It builds on top of Postgres and adds HA (based on Flexible Paxos[1]), sharding, etc. They're still not production-ready, but I'm highly optimistic they will solve a lot of the problems Postgres have.

For example, with Multigres, you should be able to achieve true zero downtime major version upgrade by simply resharding [2]. With vanilla Postgres + pgBouncer, you can only achieve near-zero downtime (few seconds at most), though it's probably good enough for most use cases.

[0] https://multigres.com/

[1] https://fpaxos.github.io/

[2] https://multigres.com/docs#migrate-across-postgres-versions

New comment by hasyimibhar in "Migrating from Go to Rust"

hasyimibhar — Sun, 24 May 2026 22:57:24 +0000

It is also easier to make your code deterministic with Rust vs with Go, which is incredibly useful if you need to perform deterministic simulation testing + property-based testing. I recently wrote a Postgres-to-Iceberg data mirroring tool [1] in Go, but I ported it to Rust because I wanted the ability perform DST without fighting Go's runtime [2]. But if the domain is not critical that warrants DST, I would still pick Go over Rust any day.

[1] https://github.com/polynya-dev/pg2iceberg

[2] https://www.polarsignals.com/blog/posts/2024/05/28/mostly-ds...

New comment by hasyimibhar in "Incident Report: Railway Blocked by Google Cloud (Resolved)"

hasyimibhar — Wed, 20 May 2026 04:00:41 +0000

The problem with the us-east-1 outage is that a lot of big companies are there, so even if you try your best not to depend on us-east-1, your third party providers are most likely there. In my previous company, we were completely down during us-east-1 outage because of other dependencies that are beyond our control.

New comment by hasyimibhar in "Lakebase architecture delivers faster Postgres writes"

hasyimibhar — Mon, 11 May 2026 02:45:14 +0000

In some cases you have no choice but to retain the data, e.g. due to compliance. But the good thing is it doesn't have to be in Postgres. You can periodically offload data to a lakehouse, then delete it from Postgres. If the table is partitioned, delete should be cheap.

I'm guessing with Neon, since their storage is a lakehouse, you get this for free.

New comment by hasyimibhar in "Lakebase architecture delivers faster Postgres writes"

hasyimibhar — Sun, 10 May 2026 22:57:31 +0000

How does Lakebase compare to OrioleDB[0]?

[0] https://www.orioledb.com/

New comment by hasyimibhar in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"

hasyimibhar — Tue, 05 May 2026 02:32:08 +0000

You should check them out, their interface pretty much looks like chat nowadays.

New comment by hasyimibhar in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"

hasyimibhar — Sat, 02 May 2026 14:48:45 +0000

How does this compare to open source Deepnote[0]? We use the cloud version (BYOC) at my previous company to replace self-hosted Jupyter notebooks, and it's pretty great.

[0] https://github.com/deepnote/deepnote

New comment by hasyimibhar in "Show HN: DAC – open-source dashboard as code tool for agents and humans"

hasyimibhar — Sat, 02 May 2026 14:45:14 +0000

I mean that's what the Vega team is doing no? They are building the standard grammar (Vega-Lite), along with an implementation (Vega). And they are already quite established with rich ecosystem, and supports a ton of components[0]. The only thing missing is that it expects a CSV or inline data source. But it's probably not too hard to build an extension that connects to a data warehouse with an SQL query.

[0] https://vega.github.io/vega-lite/examples

New comment by hasyimibhar in "Show HN: DAC – open-source dashboard as code tool for agents and humans"

hasyimibhar — Sat, 02 May 2026 13:23:44 +0000

Why not use Vega-Lite[0]? It’s my go-to data viz DSL with Claude.

[0] https://vega.github.io/vega-lite/

New comment by hasyimibhar in "If I could make my own GitHub"

hasyimibhar — Fri, 01 May 2026 19:47:50 +0000

You can skip by running git commit --no-verify. I know this because I also hate pre-commit checks, and I will automatically use it when working with any codebase that has one.

New comment by hasyimibhar in "The end of "Just ask Sarah""

hasyimibhar — Fri, 01 May 2026 19:32:07 +0000

Having worked at enterprise with 100+ engineers across multiple teams with complex reporting lines, I definitely agree with this. There's a lot of nuance behind decisions that docs simply can't capture. I mean in theory you can probably write every considerations that led you to make a certain decision, e.g.:

- I'm writing this service even though team X has built the same thing, because my team lead doesn't trust team X since the last time we depended on their service 3 years ago, they had a major downtime that screwed us up big time

- This service is using AWS Lambda simply because I think it's cool, despite the fact that the company has a dedicated team running k8s stack with argocd, argo rollouts, KEDA, etc for the entire company

- Service Y is written in this particular way because it's a service that is shared with another team that came from a company that was acquired, and they wouldn't use it unless we write it this particular way, and making the top execs happy is more important than dealing with a small tech debt (this is probably true)

But no one is going to write these in their RFC. But Sarah knows.

New comment by hasyimibhar in "Does Postgres Scale?"

hasyimibhar — Fri, 01 May 2026 07:47:33 +0000

AFAIK that's what Multigres[0] and Neki[1] are trying to solve.

[0] https://multigres.com/ [1] https://neki.dev/

New comment by hasyimibhar in "Show HN: Rocky – Rust SQL engine with branches, replay, column lineage"

hasyimibhar — Wed, 29 Apr 2026 08:24:02 +0000

Looks cool, I've been waiting for someone to build this since dbt and SQLMesh acquisition. It would be great to have model versioning and support for ClickHouse SQL.

New comment by hasyimibhar in "Optimizing Datalog for the GPU"

hasyimibhar — Mon, 27 Apr 2026 11:44:01 +0000

Slightly off-topic but Datalog related: is there a way for me to query data in ClickHouse using Datalog without any ETL? I want to do fraud analysis, and I've been reading about how Datalog is a lot better at these kind of use cases than SQL.

New comment by hasyimibhar in "An AI agent deleted our production database. The agent's confession is below"

hasyimibhar — Sun, 26 Apr 2026 22:01:09 +0000

Ok then it's definitely the author's fault for clicking "Always Allow". I don't even trust my agent to run grep without approval, let alone curl.

Show HN: Polynya – Turn your Postgres into workspaces for AI

hasyimibhar — Sun, 26 Apr 2026 21:35:15 +0000

The idea is simple: AI agents need real-time data to be useful. But streaming real-time data into your data warehouse means you need the data warehouse to be up 24/7. This is expensive and wasteful. What if you could spin up an ephemeral data warehouse only when your agent needs it, and get real-time data at the same time?

The solution: Polynya replicates your data into Iceberg, and gives your agent an ephemeral ClickHouse instance on demand. Polynya also provides persistent workspaces — collections of views that survive across sessions. So from your agent's point of view, it's a 24/7 data warehouse.

At its core, Polynya is a data platform for streaming real-time data from Postgres to Iceberg. But instead of spending hours setting up costly and complex pipelines involving Kafka, Debezium, Flink, etc., Polynya lets you do this with just one command:

npx polynya create

It's currently free on early access, so do try it out and give me any feedback! Currently there is no web dashboard yet (coming soon), you interact with it 100% through CLI.

P.S. Polynya is built on top of pg2iceberg, the open source Postgres to Iceberg replication tool I've been building for the past few weeks: https://pg2iceberg.dev/

Comments URL: https://news.ycombinator.com/item?id=47914821

Points: 5

# Comments: 0

New comment by hasyimibhar in "An AI agent deleted our production database. The agent's confession is below"

hasyimibhar — Sun, 26 Apr 2026 19:40:13 +0000

I'm not familiar with Cursor, does it allow the agent to have access to run "curl -X POST" with no approval, i.e. a popup will show up asking you to approve/deny/always approve? AFAIK with Claude Code, this can only happen if you use something like "--dangerously-skip-permissions". I have never used this, I manually approve all commands my agent runs. Pretty insane that people are giving agents to do whatever it wants and trusting the guardrails will work 100% of the time.

New comment by hasyimibhar in "Dear friend, you have built a Kubernetes (2024)"

hasyimibhar — Sun, 26 Apr 2026 18:24:59 +0000

I've experienced something like this at work but with data warehouse instead, and it happened multiple times (to be fair, data engineering is still fairly new where I'm from).

One example was an engineer wanted to build an API that accepts large CSV (GBs of credit reports) to extract some data and perform some aggregations. He was in the process of discussing with SREs on the best way to process the huge CSV file without using k8s stateful set, and the solution he was about to build was basically writing to S3 and having a worker asynchronously load and process the CSV in chunks, then finally writing the aggregation to db.

I stepped in and told him he was about to build a data warehouse. :P

New comment by hasyimibhar in "Agentic AI systems violate the implicit assumptions of database design"

hasyimibhar — Sun, 26 Apr 2026 18:12:53 +0000

I'm not sure why you are giving your agents write access to query your OLTP database, let alone write to it. The pattern that I use at work is:

- Read access through OLAP, not OLTP. You just need to setup a near real-time replication between your OLTP and OLAP.

- Write access through API, just like your application. You can add fancy things like approval layer, e.g. you agent cannot "ban_user(id)", but it can "request_to_ban_user(id)", and the action only happens once you approve it.

New comment by hasyimibhar in "Agentic AI systems violate the implicit assumptions of database design"

hasyimibhar — Sun, 26 Apr 2026 18:05:23 +0000

> Why

Based on my experience with Claude, it's pretty damn good at doing data analysis, if given the right curated data models. You still need to eyeball the generated SQL to make sure it makes sense.

> and how?

1. Replicate your Postgres into Snowflake/Databricks/ClickHouse/etc, or directly to Iceberg and hook it up to Snowflake/Databricks/ClickHouse/etc.

2. Give your agent read access to query it.

3. Build dimensional models (facts and dimensions tables) from the raw data. You can ask LLM for help here, Claude is pretty good at designing data models in my experience.

4. Start asking your agent questions about your data.

Keep steps 3-4 as a tight feedback loop. Every time your agent hallucinates or struggle to answer your questions, improve the model.

Side note: I'm currently building a platform that does all 3 (though you still need to do 2 yourself), you just need Postgres + 1 command to set it up: https://polynya.dev/