<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: wuputah</title><link>https://news.ycombinator.com/user?id=wuputah</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 13:28:15 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=wuputah" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by wuputah in "pg_duckdb: Splicing Duck and Elephant DNA"]]></title><description><![CDATA[
<p>I think it will be straightforward to expose time_bucket in pg_duckdb. Feel free to open an issue for the feature.</p>
]]></description><pubDate>Sun, 18 Aug 2024 06:41:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=41280597</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=41280597</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41280597</guid></item><item><title><![CDATA[New comment by wuputah in "Using ClickHouse to scale an events engine"]]></title><description><![CDATA[
<p>That is an incorrect and baseless accusation, we had nothing to do with "Postgres (tuned)". My commits are only in the `hydra` folder. There are no restrictions on how you set up the benchmark in Clickbench and the settings we use there are analogous with what we use on our cloud service for a similar sized instance.<p>As the linked post points out, the main 'advantage' of the "tuned" benchmark is the indexes, which are tuned specifically to the queries in the benchmark. We do not use indexes in our version of the benchmark, aside from the primary key (which actually provides no performance advantage).</p>
]]></description><pubDate>Wed, 17 Apr 2024 19:31:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=40069124</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=40069124</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40069124</guid></item><item><title><![CDATA[New comment by wuputah in "Show HN: pgxman – npm for Postgres extensions"]]></title><description><![CDATA[
<p>> does it work with the existing postgres apt/yum repos?<p>We only support apt for now but plan to support other package managers in the future. It works with existing Postgres apt packages, we recommend using PGDG but the default system packages on Debian/Ubuntu work as well.<p>> Does it work with the postgres Docker image?<p>yes, in fact this how our `container` feature works.
<a href="https://docs.pgxman.com/container" rel="nofollow noreferrer">https://docs.pgxman.com/container</a></p>
]]></description><pubDate>Thu, 30 Nov 2023 07:17:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=38470652</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=38470652</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38470652</guid></item><item><title><![CDATA[New comment by wuputah in "Show HN: pgxman – npm for Postgres extensions"]]></title><description><![CDATA[
<p>In my opinion, we plan on accomplishing this by using a container; it's not quite something we have today, but this is good feedback. :)<p>On Ubuntu/Debian, Postgres doesn't typically work this way, so it's not the way that pgxman works. pgxman works on top of the existing `postgresql` packages and with the existing package manager (apt) in order to install extensions -- which is also how it handles runtime dependencies, whether libraries or even other extensions.<p>So, that said, we have a container feature I could see using to effectively isolate for a single project. Right now there is only one single "global" container (per Postgres version) that pgxman will manage for you, but this is just a MVP of this feature. I could definitely see something like `pgxman c dev` or similar which will read a local pgxman pack file (pgxman.yaml) in your project and boot a "local" Postgres for you just for that project.<p>The pgxman pack is already a thing and is how the local container config is maintained, but we haven't tied it together in the way described above... yet. For more on both pgxman pack and the container feature, check out our docs.</p>
]]></description><pubDate>Wed, 29 Nov 2023 23:53:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=38467164</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=38467164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38467164</guid></item><item><title><![CDATA[New comment by wuputah in "Show HN: Hydra - Open-Source Columnar Postgres"]]></title><description><![CDATA[
<p>Thanks for calling these out, as these are just misunderstandings. We will certainly tweak the language around these.<p>- Installing the extension itself does not change the default table type, this is only the case on Hydra Cloud and our Docker image.<p>- "Hydra is not a fork" refers to the fact that Hydra did not fork Postgres; it is an extension.  We have put in a lot of effort since forking Citus, but it's not our intent to hide that fact.<p>- Yes, "Hydra External Tables" is a productization around FDWs, there's more we want to do with it but it hasn't been our focus lately.</p>
]]></description><pubDate>Wed, 20 Sep 2023 00:38:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=37579037</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=37579037</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37579037</guid></item><item><title><![CDATA[New comment by wuputah in "Show HN: Hydra 1.0 – open-source column-oriented Postgres"]]></title><description><![CDATA[
<p>First we added a bitmask to mark rows as deleted - these rows are filtered out on read. Then updates are implemented as deletions + inserts. We have also added vacuum functions to remove/rewrite stripes that have >20% of deleted rows in order to reclaim space and optimize those stripes.</p>
]]></description><pubDate>Thu, 03 Aug 2023 23:30:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=36993474</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=36993474</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36993474</guid></item><item><title><![CDATA[New comment by wuputah in "Show HN: Hydra 1.0 – open-source column-oriented Postgres"]]></title><description><![CDATA[
<p>of course :) drop by our Discord if there's something you'd like to contribute and want to chat about it beforehand, need help/have questions getting started, etc. <a href="https://hydra.so/discord">https://hydra.so/discord</a></p>
]]></description><pubDate>Thu, 03 Aug 2023 23:28:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=36993456</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=36993456</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36993456</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra – the fastest Postgres for analytics [benchmarks]"]]></title><description><![CDATA[
<p>Yes, Hydra Columnar and PostGIS get along just fine. We've not looked into any PostGIS-specific optimizations yet, but if users run into issues, we'll be happy to investigate.</p>
]]></description><pubDate>Wed, 14 Dec 2022 20:35:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=33990106</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=33990106</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33990106</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra – the fastest Postgres for analytics [benchmarks]"]]></title><description><![CDATA[
<p>Of course, that's the power of Postgres. You can join between columnar tables or between columnar and heap (row-based) tables. The performance of joins hasn't been a specific focus of our engineering work yet, but I made a little test of enriching an analytical query with user data here:
<a href="https://gist.github.com/wuputah/e62b83f86880bd7e6623809afe4c1e41" rel="nofollow">https://gist.github.com/wuputah/e62b83f86880bd7e6623809afe4c...</a></p>
]]></description><pubDate>Wed, 14 Dec 2022 04:34:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=33980032</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=33980032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33980032</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra – the fastest Postgres for analytics [benchmarks]"]]></title><description><![CDATA[
<p>Yeah, I agree! However, ClickBench has used 500GB GP2 as the "standard" for some time, so I stuck to it for consistency. We use GP3 for our hosted service, and I did test on GP3 as well with identical settings as GP2 and the results are very similar.</p>
]]></description><pubDate>Wed, 14 Dec 2022 04:02:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=33979872</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=33979872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33979872</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra – the fastest Postgres for analytics [benchmarks]"]]></title><description><![CDATA[
<p>The metadata can act as a basic form of indexing (or sometimes caching, though Hydra doesn't use metadata to calculate results yet), but it's not an index in the traditional sense. It's used to eliminate stripes and blocks from consideration during a scan.<p>Columnar is not ideal for a `users` table where you want to select and update specific rows, often in very small, quick transactions (OLTP). You would want to continue to use a traditional (heap) table in that case. That's certainly something you can still do with Hydra, and combining both kinds of tables is considered HTAP, and something that is a unique use case of our product.<p>To contrast, columnar is best for "fact" tables -- data about something that happened (thus it does not change) that will be analyzed in an aggregate way. Those might be logs, events, transactions, etc.</p>
]]></description><pubDate>Wed, 14 Dec 2022 01:31:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=33978868</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=33978868</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33978868</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra – the fastest Postgres for analytics [benchmarks]"]]></title><description><![CDATA[
<p>A narrow distinction, but Hydra is Postgres - we only install an extension - while Greenplum and Redshift are forks but remain Postgres-compatible (to varying degrees). I'm not up on when Greenplum last merged updates from Postgres, but I would be concerned that it only runs on Ubuntu 18.04. If you have a look at the Greenplum install in ClickBench[1], you'll see it's not a typical Postgres setup. Hopefully we will be able to beat Greenplum straight-up soon. :)<p>Redshift is multi-node, which puts it in a different category -- with considerably higher costs.<p>[1]: <a href="https://github.com/ClickHouse/ClickBench/blob/main/greenplum/benchmark.sh" rel="nofollow">https://github.com/ClickHouse/ClickBench/blob/main/greenplum...</a></p>
]]></description><pubDate>Tue, 13 Dec 2022 20:31:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=33975296</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=33975296</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33975296</guid></item><item><title><![CDATA[New comment by wuputah in "Hydra (YC W22) releases the Postgres data warehouse"]]></title><description><![CDATA[
<p>We hope you find our new product useful and engaging! You can get started with a free 10 GB Hydra at <a href="https://hydras.io/" rel="nofollow">https://hydras.io/</a> or read our blog post at <a href="https://hydras.io/blog/2022-03-21-announcing-hydra-postgres-data-warehouse" rel="nofollow">https://hydras.io/blog/2022-03-21-announcing-hydra-postgres-...</a>.</p>
]]></description><pubDate>Thu, 24 Mar 2022 21:38:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=30795632</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30795632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30795632</guid></item><item><title><![CDATA[New comment by wuputah in "Heroku was Down"]]></title><description><![CDATA[
<p>I agree with your lead statement and argued as such, but was overruled. Last I knew and understood, Heroku Status is static pages pushed out to Fastly, with the internal admin site (that does that work) running in a Heroku Private Space. If you look at the DNS, it still appears to be served by Fastly, and Heroku Private Spaces are generally pretty isolated infra, so I would be curious what the failure mode was here. But ultimately this is the fire you play with when you self-host your status site...</p>
]]></description><pubDate>Thu, 24 Feb 2022 19:57:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=30459452</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30459452</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30459452</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Hi! Definitely reach out to us to discuss further.</p>
]]></description><pubDate>Thu, 24 Feb 2022 18:20:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=30458229</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30458229</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30458229</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Certainly, I see us expanding to other cloud providers as we follow customer demand, but it will take some time. I think if you wanted to move faster and have a higher level of control, self-hosted would be the way to go. We are offering that but it's not on our web site.<p>Definitely stay tuned [0] for Clickhouse! And yes, exactly, you can continue to use your ORM of choice.<p>0: social links are at the bottom of hydras.io :)</p>
]]></description><pubDate>Thu, 24 Feb 2022 18:17:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=30458201</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30458201</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30458201</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Hi, JD here, CTO at Hydra. In an HTAP scenario, local transactional data would be replicated, but your data warehouse will likely have a great amount of data that your Postgres database does not. You can still connect that data to Postgres with Hydra. Ultimately, it's up to you if/how you choose to replicate your data -- along with guidance from our team along the way.</p>
]]></description><pubDate>Wed, 23 Feb 2022 21:38:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=30446846</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30446846</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30446846</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Hi, JD here, Hydra's CTO. It's still early days and we are considering open source; for now, we wanted to leave our options open, and OSS feels like a one-way door. I think you make a great point here - thanks for sharing your past pain / experience. Definitely food for thought.<p>Our "no lock-in" claim refers to your data, since Hydra is Postgres, you're not stuck using "HydraDB" forever -- it's relatively easy to migrate in or out since you can use well established Postgres tools. We also are open to licensing the product should you wish to self-host, on-prem, etc.</p>
]]></description><pubDate>Wed, 23 Feb 2022 21:27:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=30446698</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30446698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30446698</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Hi, JD here, Hydra's CTO. Thanks for the interest and questions!<p>Today, queries need to be Postgres-compatible to be intelligently routed, but queries with specific query syntax or functions beyond Postgres can be routed with our manual router[1]. This is our first solution to this problem and plan to iterate in response to customer pain.<p>Sorry for the confusion! Data moves asynchronously -- we're not trying to implement multi-phase commits -- but we can act on data very quickly once committed. Our solution here uses Postgres logical replication. Using the Data Bridge is optional and a customer's existing solutions are welcome as well.<p>[1]: <a href="https://hydras-io.notion.site/Router-a91f5282f1354c54a9ba894864658e10" rel="nofollow">https://hydras-io.notion.site/Router-a91f5282f1354c54a9ba894...</a></p>
]]></description><pubDate>Wed, 23 Feb 2022 20:27:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=30445967</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30445967</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30445967</guid></item><item><title><![CDATA[New comment by wuputah in "Launch HN: Hydra (YC W22) – Query any database via Postgres"]]></title><description><![CDATA[
<p>Hydra doesn't ship data to the client in order to then do further work like aggregations -- that's the whole point of Hydra -- but that also means that you won't be able to "workaround" a performance issue with an underlying data store. For that, we'd need to find a way to replicate the data to a data store that can solve the aggregation performance issue.</p>
]]></description><pubDate>Wed, 23 Feb 2022 19:46:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=30445427</link><dc:creator>wuputah</dc:creator><comments>https://news.ycombinator.com/item?id=30445427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30445427</guid></item></channel></rss>