<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: baotiao</title><link>https://news.ycombinator.com/user?id=baotiao</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 28 Apr 2026 23:47:21 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=baotiao" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by baotiao in "[dead]"]]></title><description><![CDATA[
<p>In 2026, Can AI Modify Database Kernel Code? Rewriting PostgreSQL with Claude Code: Full Page Write vs Doublewrite Buffer, a 3x Performance Gap</p>
]]></description><pubDate>Sun, 08 Feb 2026 23:49:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46939834</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46939834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46939834</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>Yes, MySQL-DuckDB columned read only node will continuously get data from transactional workload by binlog.
Then people will not need to maintain tools like kafka/debezium to sync between two node.</p>
]]></description><pubDate>Fri, 06 Feb 2026 23:13:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46919453</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46919453</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46919453</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>I’m quite certain that if DuckDB had been open-sourced and reached stability around 2020, TiDB would have definitely chosen DuckDB instead of ClickHouse.</p>
]]></description><pubDate>Wed, 04 Feb 2026 21:12:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=46891853</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46891853</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46891853</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>We havn't try that before, maybe I will try to combine with mysql-operator later..</p>
]]></description><pubDate>Wed, 04 Feb 2026 21:11:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46891830</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46891830</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46891830</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>Actually, that’s not the case. I also support PostgreSQL products in my professional work. However, specifically regarding this issue—as I mentioned in my article—it is simply easier to integrate DuckDB by leveraging MySQL's binlog and its pluggable storage engine architecture.</p>
]]></description><pubDate>Wed, 04 Feb 2026 21:09:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46891802</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46891802</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46891802</guid></item><item><title><![CDATA[Why MySQL's Integration with DuckDB Is More Elegant Than PostgreSQL's]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.linkedin.com/top-content/">https://www.linkedin.com/top-content/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46887089">https://news.ycombinator.com/item?id=46887089</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 04 Feb 2026 15:35:09 +0000</pubDate><link>https://www.linkedin.com/top-content/</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46887089</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46887089</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>Here is the professional English translation of your analysis, optimized for a technical audience or a blog post:<p>Why I Believe MySQL is More Suited than PostgreSQL for DuckDB Integration
Currently, there are three mainstream solutions in the ecosystem: pg_duckdb, pg_mooncake, and pg_lake. However, they face several critical hurdles. First, PostgreSQL's logical replication is not mature enough—falling far behind the robustness of its physical replication—making it difficult to reliably connect a PG primary node to a DuckDB read-only replica via logical streams.<p>Furthermore, PostgreSQL lacks a truly mature pluggable storage engine architecture. While it provides the Table Access Method as an interface, it does not offer standardized support for primary-replica replication or Crash Recovery at the interface level. This makes it challenging to guarantee data consistency in many production scenarios.<p>MySQL, however, solves these issues elegantly:<p>Native Pluggable Architecture: MySQL was born with a pluggable storage engine design. Historically, MySQL pivoted from MyISAM to InnoDB as the default engine specifically to leverage InnoDB's row-level MVCC. While previous columnar attempts like InfoBright existed, they didn't reach mass adoption. Adding DuckDB as a native columnar engine in MySQL is a natural progression. It eliminates the need for "workaround" architectures seen in PostgreSQL, where data must first be written to a row-store before being converted into a columnar format.<p>The Power of the Binlog Ecosystem: MySQL’s "dual-log" mechanism (Binlog and Redo Log) is a double-edged sword; while it impacts raw write performance, the Binlog provides unparalleled support for the broader data ecosystem. By providing a clean stream of data changes, it facilitates seamless replication to downstream systems. This is precisely why OLAP solutions like ClickHouse, StarRocks, and SelectDB have flourished within the MySQL ecosystem.<p>Seamless HTAP Integration: When using DuckDB as a MySQL storage engine, the Binlog ecosystem remains fully compatible and intact. This allows the system to function as a data warehouse node that can still "egress" its own Binlog. In an HTAP (Hybrid Transactional/Analytical Processing) scenario, a primary MySQL node using InnoDB can stream Binlog directly to a downstream MySQL node using the DuckDB engine, achieving a perfectly compatible and fluid data pipeline.</p>
]]></description><pubDate>Wed, 04 Feb 2026 07:35:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46882662</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46882662</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46882662</guid></item><item><title><![CDATA[New comment by baotiao in "AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines"]]></title><description><![CDATA[
<p>On this page, we introduce how to implement a read-only Columnar Store (DuckDB) node leveraging the MySQL binlog mechanism. <a href="https://github.com/alibaba/AliSQL/blob/master/wiki/duckdb/duckdb-en.md" rel="nofollow">https://github.com/alibaba/AliSQL/blob/master/wiki/duckdb/du...</a> In this implementation, we have performed extensive optimizations for binlog batch transmission, write operations, and more.</p>
]]></description><pubDate>Wed, 04 Feb 2026 07:25:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=46882597</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46882597</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46882597</guid></item><item><title><![CDATA[AliSQL: Alibaba's open-source MySQL with vector and DuckDB engines]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/alibaba/AliSQL">https://github.com/alibaba/AliSQL</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46875228">https://news.ycombinator.com/item?id=46875228</a></p>
<p>Points: 306</p>
<p># Comments: 49</p>
]]></description><pubDate>Tue, 03 Feb 2026 18:40:18 +0000</pubDate><link>https://github.com/alibaba/AliSQL</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=46875228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46875228</guid></item><item><title><![CDATA[New comment by baotiao in "Ask HN: Could you share your personal blog here?"]]></title><description><![CDATA[
<p><a href="http://baotiao.github.io/" rel="nofollow noreferrer">http://baotiao.github.io/</a><p>130 blog posts. Writing about Database and Distributed system</p>
]]></description><pubDate>Tue, 04 Jul 2023 17:51:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=36590197</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=36590197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36590197</guid></item><item><title><![CDATA[New comment by baotiao in "Boost:Unordered_flat_map"]]></title><description><![CDATA[
<p>Inside boost::unordered_flat_map</p>
]]></description><pubDate>Sun, 04 Dec 2022 20:00:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=33857220</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=33857220</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33857220</guid></item><item><title><![CDATA[Boost:Unordered_flat_map]]></title><description><![CDATA[
<p>Article URL: <a href="http://bannalia.blogspot.com/2022/11/">http://bannalia.blogspot.com/2022/11/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=33857219">https://news.ycombinator.com/item?id=33857219</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 04 Dec 2022 20:00:18 +0000</pubDate><link>http://bannalia.blogspot.com/2022/11/</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=33857219</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33857219</guid></item><item><title><![CDATA[Coding for SSDs]]></title><description><![CDATA[
<p>Article URL: <a href="https://codecapsule.com/2014/02/12/coding-for-ssds-part-1-introduction-and-table-of-contents/">https://codecapsule.com/2014/02/12/coding-for-ssds-part-1-introduction-and-table-of-contents/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=28455868">https://news.ycombinator.com/item?id=28455868</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 08 Sep 2021 12:07:50 +0000</pubDate><link>https://codecapsule.com/2014/02/12/coding-for-ssds-part-1-introduction-and-table-of-contents/</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=28455868</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28455868</guid></item><item><title><![CDATA[Evolution of tree data structures for indexing]]></title><description><![CDATA[
<p>Article URL: <a href="https://erthalion.info/2020/11/28/evolution-of-btree-index-am">https://erthalion.info/2020/11/28/evolution-of-btree-index-am</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=27963753">https://news.ycombinator.com/item?id=27963753</a></p>
<p>Points: 101</p>
<p># Comments: 12</p>
]]></description><pubDate>Mon, 26 Jul 2021 19:01:29 +0000</pubDate><link>https://erthalion.info/2020/11/28/evolution-of-btree-index-am</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=27963753</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27963753</guid></item><item><title><![CDATA[New comment by baotiao in "PolarFS: Alibaba Distributed File System for Shared Storage Cloud Database [pdf]"]]></title><description><![CDATA[
<p>I think the most interesting part is PolarFS taking full advantage of the emerging techniques like RDMA, NVMe, and SPDK. And the Parallel raft consensus algorithm</p>
]]></description><pubDate>Wed, 22 Aug 2018 12:56:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=17818349</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=17818349</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17818349</guid></item><item><title><![CDATA[New comment by baotiao in "PolarFS: Alibaba Distributed File System for Shared Storage Cloud Database [pdf]"]]></title><description><![CDATA[
<p>Yes I am from the PolarFS team.
You can read from the paper that we have compared PolarFS with ceph.</p>
]]></description><pubDate>Wed, 22 Aug 2018 12:25:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=17818165</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=17818165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17818165</guid></item><item><title><![CDATA[New comment by baotiao in "PolarFS: Alibaba Distributed File System for Shared Storage Cloud Database [pdf]"]]></title><description><![CDATA[
<p>The protocol is interesting, and we will provide the TLA+ proof soon.</p>
]]></description><pubDate>Wed, 22 Aug 2018 05:14:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=17816477</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=17816477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17816477</guid></item><item><title><![CDATA[New comment by baotiao in "PolarFS: Alibaba Distributed File System for Shared Storage Cloud Database [pdf]"]]></title><description><![CDATA[
<p>Thank you.
We will provide our TLA+ proof soon..</p>
]]></description><pubDate>Wed, 22 Aug 2018 05:08:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=17816456</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=17816456</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17816456</guid></item><item><title><![CDATA[Optimizing Paxos with batching and pipelining (2012) [pdf]]]></title><description><![CDATA[
<p>Article URL: <a href="https://pdfs.semanticscholar.org/a0d0/cdd2e8af1945c03cfaf2cb451f71f208d0c9.pdf">https://pdfs.semanticscholar.org/a0d0/cdd2e8af1945c03cfaf2cb451f71f208d0c9.pdf</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=16952649">https://news.ycombinator.com/item?id=16952649</a></p>
<p>Points: 71</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 29 Apr 2018 16:22:03 +0000</pubDate><link>https://pdfs.semanticscholar.org/a0d0/cdd2e8af1945c03cfaf2cb451f71f208d0c9.pdf</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=16952649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=16952649</guid></item><item><title><![CDATA[Fpaxos]]></title><description><![CDATA[
<p>Article URL: <a href="https://fpaxos.github.io/">https://fpaxos.github.io/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=16047258">https://news.ycombinator.com/item?id=16047258</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 01 Jan 2018 18:29:16 +0000</pubDate><link>https://fpaxos.github.io/</link><dc:creator>baotiao</dc:creator><comments>https://news.ycombinator.com/item?id=16047258</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=16047258</guid></item></channel></rss>