<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sqlcook</title><link>https://news.ycombinator.com/user?id=sqlcook</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 30 Apr 2026 17:51:40 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sqlcook" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by sqlcook in "Craig Venter has died"]]></title><description><![CDATA[
<p>Sad news, I’ve worked at HumanLongevity and got to interact with Craig several times. He was a legend and truly will be missed.</p>
]]></description><pubDate>Thu, 30 Apr 2026 04:59:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47958310</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=47958310</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47958310</guid></item><item><title><![CDATA[New comment by sqlcook in "Open Source Python ETL"]]></title><description><![CDATA[
<p>You’re missing the point of the benefits of solutions like these, and the original set of tools like the Informatica of the kind. Those tools come with limitations and constrains, like a box of legos you can build a very powerful pipeline without having to wire up a lot of redundant code as you pass data frames between validation stages. Tools like Airflow/Spark etc are great for what they are, but they don’t come with guidelines or best practices when it comes to reusable code at scale, your team has to establish that early on.<p>You can open a pretty complicated large DAG in and right away you’ll understand the data flow and processing steps. If you were to do similar in code, it becomes a lot harder unless you comply to good modular design practices.<p>This is also why common game engine and 3d rendering tools come with a UI for flow driven scripting. It’s intuitive and much easier to organize.</p>
]]></description><pubDate>Wed, 19 Jun 2024 03:56:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=40724646</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=40724646</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40724646</guid></item><item><title><![CDATA[New comment by sqlcook in "Gallery of database schema diagrams of open-source packages"]]></title><description><![CDATA[
<p>This is indeed one of the nicest UI's I've seen, is this all custom made or are you using some open components? I wanted to get similar effect for something completely different I had in mind (like a flow based diagram), any advice greatly appreciated.</p>
]]></description><pubDate>Tue, 28 Apr 2020 16:26:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=23008613</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=23008613</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=23008613</guid></item><item><title><![CDATA[New comment by sqlcook in "Dbeaver – Multi-platform database tool"]]></title><description><![CDATA[
<p>Nice if you’re only connecting to a single instance, nightmare if you have multiple. DbVisualizer is better imho.</p>
]]></description><pubDate>Fri, 18 Jan 2019 03:22:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=18936789</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=18936789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=18936789</guid></item><item><title><![CDATA[New comment by sqlcook in "How NoSQL forced the evolution of a scalable relational database"]]></title><description><![CDATA[
<p>It actually IS quite hard to imagine solid engineers finding it hard to learn simple SQL. It's a lot harder to write "nasty" sql than "nasty" js, and if you're a front-end dev struggling with sql, then you probably should not be doing back end work. Yes not all databases are equal, thats why there are ORMs and ANSI standards.<p>From the list of bad practices you've "seen" people do in SQL, I would guess their comfort zone code is probably a hot mess as well.</p>
]]></description><pubDate>Fri, 13 Jul 2018 08:59:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=17521467</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=17521467</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17521467</guid></item><item><title><![CDATA[New comment by sqlcook in "How NoSQL forced the evolution of a scalable relational database"]]></title><description><![CDATA[
<p>EE features without support. I hate to bring up Mongo as example, but something similar...where support, additional software/plugins and cloud hosting are where the $ is made.<p>I did thorough testing of Memsql two years ago but went with Aurora instead. Would love to see how the product evolved since (Spark and Streaming integration was just being rolled out at the time), but something tells me pricing will be a deal breaker.</p>
]]></description><pubDate>Fri, 13 Jul 2018 07:39:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=17521157</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=17521157</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17521157</guid></item><item><title><![CDATA[New comment by sqlcook in "How NoSQL forced the evolution of a scalable relational database"]]></title><description><![CDATA[
<p>Hi Rick, I've been following Memsql for a few years now, are there any plans to release "community" edition? Last time I checked about 1.5 years ago json support was very basic and EE pricing (dont remember exact #s) was rather high. Thanks</p>
]]></description><pubDate>Fri, 13 Jul 2018 07:18:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=17521064</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=17521064</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=17521064</guid></item><item><title><![CDATA[New comment by sqlcook in "First field report of iPhone X"]]></title><description><![CDATA[
<p>You mean with a picture of your face....</p>
]]></description><pubDate>Mon, 30 Oct 2017 12:47:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=15584582</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=15584582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=15584582</guid></item><item><title><![CDATA[New comment by sqlcook in "Show HN: CoinHub for iOS"]]></title><description><![CDATA[
<p>Nice looking app! What did you use for the charts?</p>
]]></description><pubDate>Thu, 05 Oct 2017 02:11:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=15406303</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=15406303</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=15406303</guid></item><item><title><![CDATA[The Sixense Sense was once the ultimate VR controller. Where is it now?]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.theverge.com/2017/5/2/15477520/sixense-stem-oculus-rift-vr-controller-kickstarter-problems">https://www.theverge.com/2017/5/2/15477520/sixense-stem-oculus-rift-vr-controller-kickstarter-problems</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=14248137">https://news.ycombinator.com/item?id=14248137</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 02 May 2017 16:58:51 +0000</pubDate><link>https://www.theverge.com/2017/5/2/15477520/sixense-stem-oculus-rift-vr-controller-kickstarter-problems</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=14248137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=14248137</guid></item><item><title><![CDATA[New comment by sqlcook in "Horizon 1.0: a realtime, open-source JavaScript back end from RethinkDB"]]></title><description><![CDATA[
<p>Excellent work Slava! Have been planning to migrate to RethinkDB for one of the existing projects, great timing with Horizon :D</p>
]]></description><pubDate>Tue, 17 May 2016 21:07:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=11717430</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=11717430</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=11717430</guid></item><item><title><![CDATA[New comment by sqlcook in "How we get high availability with Elasticsearch and Ruby on Rails"]]></title><description><![CDATA[
<p>if you want the fastest ingress, disable replica until your ingress is done, its faster to create replica at the end of ETL for that given index. Also, you want to disable auto allocation as well, this will disable shard movement during ingress, re-enable it afterwards.<p>on a 100 node cluster i had roughly 500GB on each node. this was not a single index, multiple indexes, with roughly 8 shards per index per node. Shard count is pretty important to get correct.<p>I did not manually control document routing (it was hard based on the type of data i was ingressing), so it was set to auto and during the load i observed hotspots in the cluster (you have to look at BULK thread/queue length), some nodes were getting burst of docs while others were idle, roughly 40-50% of the nodes in the cluster were under utilized, and maybe 5-10% had hot spots from time to time.<p>Also, depending what you use to push data in, (I used ES hadoop plugin) , you have to account for shard segment merges, which literally pause ingress for a brief moment and merge segments in a given shard. You have to set retry to -1 (infinite) and retry delay to something like a second or two, otherwise you will end up with dropped documents.</p>
]]></description><pubDate>Sat, 09 Apr 2016 03:18:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=11459723</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=11459723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=11459723</guid></item><item><title><![CDATA[New comment by sqlcook in "How we get high availability with Elasticsearch and Ruby on Rails"]]></title><description><![CDATA[
<p>100 data nodes<p>basically if you want fast ingress, keep shards small, once they get past ~5-10gb , ingress significantly slows down. Also this was on ES 1.5 , have not tested latest 2.0+ builds</p>
]]></description><pubDate>Sat, 09 Apr 2016 02:24:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=11459606</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=11459606</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=11459606</guid></item><item><title><![CDATA[New comment by sqlcook in "How we get high availability with Elasticsearch and Ruby on Rails"]]></title><description><![CDATA[
<p>I've indexed ~ 1 million docs a second, but with proper routing, can probably even 5x that. Total cluster size was 50 terabytes, at the end.</p>
]]></description><pubDate>Sat, 09 Apr 2016 01:10:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=11459373</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=11459373</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=11459373</guid></item><item><title><![CDATA[New comment by sqlcook in "pg_rewind in PostgreSQL 9.5"]]></title><description><![CDATA[
<p>The post is about Postgres, you were referring to MSSQL mirroring, which can fail back from primary to secondary and back to primary almost seamlessly, depending on type of failure and log catchup.<p>Postgres has been a pain as described in the article.......</p>
]]></description><pubDate>Mon, 23 Mar 2015 21:57:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=9253589</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=9253589</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=9253589</guid></item><item><title><![CDATA[New comment by sqlcook in "pg_rewind in PostgreSQL 9.5"]]></title><description><![CDATA[
<p>? MSSQL mirroring failover is very easy and painless( arguably one of best compared to other rdbms), just make sure not to do automatic failover as that can cause false failover with spotty network between nodes and witness.</p>
]]></description><pubDate>Mon, 23 Mar 2015 21:45:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=9253529</link><dc:creator>sqlcook</dc:creator><comments>https://news.ycombinator.com/item?id=9253529</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=9253529</guid></item></channel></rss>