<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: data_dan_</title><link>https://news.ycombinator.com/user?id=data_dan_</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 30 Apr 2026 10:07:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=data_dan_" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by data_dan_ in "Ask HN: Could you share your personal blog here?"]]></title><description><![CDATA[
<p><a href="https://www.danliden.com/" rel="nofollow noreferrer">https://www.danliden.com/</a><p>At the moment, I post fairly infrequently. The whole site is written using emacs org mode. Most of the posts have to do with emacs and data stuff (often doing data stuff in emacs).</p>
]]></description><pubDate>Thu, 06 Jul 2023 18:02:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=36620195</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=36620195</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36620195</guid></item><item><title><![CDATA[Pre-Classify Tasks for Better ChatGPT Completions]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/pre-classify-tasks-for-better-chatgpt-completions-f197ad01618c?gi=448e4c0c1717">https://innerjoin.bit.io/pre-classify-tasks-for-better-chatgpt-completions-f197ad01618c?gi=448e4c0c1717</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35712919">https://news.ycombinator.com/item?id=35712919</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 26 Apr 2023 12:57:01 +0000</pubDate><link>https://innerjoin.bit.io/pre-classify-tasks-for-better-chatgpt-completions-f197ad01618c?gi=448e4c0c1717</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=35712919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35712919</guid></item><item><title><![CDATA[Vector Similarity Search in Postgres with bit.io and pgvector]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/vector-similarity-search-in-postgres-with-bit-io-and-pgvector-c58ac34f408b">https://innerjoin.bit.io/vector-similarity-search-in-postgres-with-bit-io-and-pgvector-c58ac34f408b</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35513432">https://news.ycombinator.com/item?id=35513432</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 10 Apr 2023 15:23:34 +0000</pubDate><link>https://innerjoin.bit.io/vector-similarity-search-in-postgres-with-bit-io-and-pgvector-c58ac34f408b</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=35513432</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35513432</guid></item><item><title><![CDATA[Comparing ChatGPT to Codex for text-to-SQL translation]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/llm-sql-translation-prompt-testing-part-1-comparing-chatgpt-to-codex-78da57213ebe">https://innerjoin.bit.io/llm-sql-translation-prompt-testing-part-1-comparing-chatgpt-to-codex-78da57213ebe</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35202332">https://news.ycombinator.com/item?id=35202332</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 17 Mar 2023 20:18:34 +0000</pubDate><link>https://innerjoin.bit.io/llm-sql-translation-prompt-testing-part-1-comparing-chatgpt-to-codex-78da57213ebe</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=35202332</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35202332</guid></item><item><title><![CDATA[Make ChatGPT Stop Chatting and Start Writing SQL]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/make-chatgpt-stop-chatting-and-start-writing-sql-fd5560049ae4?gi=0dbaa203018e">https://innerjoin.bit.io/make-chatgpt-stop-chatting-and-start-writing-sql-fd5560049ae4?gi=0dbaa203018e</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35171230">https://news.ycombinator.com/item?id=35171230</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 15 Mar 2023 16:58:38 +0000</pubDate><link>https://innerjoin.bit.io/make-chatgpt-stop-chatting-and-start-writing-sql-fd5560049ae4?gi=0dbaa203018e</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=35171230</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35171230</guid></item><item><title><![CDATA[Making a Production LLM Prompt for Text-to-SQL Translation]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/making-a-production-llm-prompt-for-text-to-sql-translation-b798b6e94783">https://innerjoin.bit.io/making-a-production-llm-prompt-for-text-to-sql-translation-b798b6e94783</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35001805">https://news.ycombinator.com/item?id=35001805</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 02 Mar 2023 20:46:57 +0000</pubDate><link>https://innerjoin.bit.io/making-a-production-llm-prompt-for-text-to-sql-translation-b798b6e94783</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=35001805</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35001805</guid></item><item><title><![CDATA[AI-Powered Text-to-SQL Translation in Bit.io]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.bit.io/ai-powered-text-to-sql-translation-in-bit-io-1fbcf32fd586">https://blog.bit.io/ai-powered-text-to-sql-translation-in-bit-io-1fbcf32fd586</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=34961741">https://news.ycombinator.com/item?id=34961741</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 27 Feb 2023 20:56:17 +0000</pubDate><link>https://blog.bit.io/ai-powered-text-to-sql-translation-in-bit-io-1fbcf32fd586</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=34961741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34961741</guid></item><item><title><![CDATA[Measuring and Mitigating Postgres Network Latency]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/the-distance-dilemma-measuring-and-mitigating-postgres-network-latency-76f6cd1a6c57">https://innerjoin.bit.io/the-distance-dilemma-measuring-and-mitigating-postgres-network-latency-76f6cd1a6c57</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=33968840">https://news.ycombinator.com/item?id=33968840</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 13 Dec 2022 13:55:28 +0000</pubDate><link>https://innerjoin.bit.io/the-distance-dilemma-measuring-and-mitigating-postgres-network-latency-76f6cd1a6c57</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=33968840</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33968840</guid></item><item><title><![CDATA[Cities in the Southwest Decouple Growth from the Need for More Water]]></title><description><![CDATA[
<p>Article URL: <a href="https://e360.yale.edu/features/a-quiet-revolution-southwest-cities-learn-to-thrive-amid-drought">https://e360.yale.edu/features/a-quiet-revolution-southwest-cities-learn-to-thrive-amid-drought</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31315675">https://news.ycombinator.com/item?id=31315675</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 09 May 2022 15:49:54 +0000</pubDate><link>https://e360.yale.edu/features/a-quiet-revolution-southwest-cities-learn-to-thrive-amid-drought</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31315675</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31315675</guid></item><item><title><![CDATA[Literate Emacs Config with imenu-list]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/rougier/dotemacs">https://github.com/rougier/dotemacs</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31297112">https://news.ycombinator.com/item?id=31297112</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 07 May 2022 18:23:06 +0000</pubDate><link>https://github.com/rougier/dotemacs</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31297112</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31297112</guid></item><item><title><![CDATA[New comment by data_dan_ in "Record Linkage on People's Names with Approximate String Matching"]]></title><description><![CDATA[
<p>I wrote this article. For some background—<p>In March, we published an article on Stock Trades by members of congressional committees: <a href="https://innerjoin.bit.io/data-cant-tell-us-whether-congressional-stock-trading-is-corrupt-b32031c533bc" rel="nofollow">https://innerjoin.bit.io/data-cant-tell-us-whether-congressi...</a><p>To conduct this research, we needed to know: (1) which members of congress made which stock trades, and (2) which members of congress belonged to which congressional committees. The data for (1) was available from the the senate/house stock watchers sites; the data for (2) came from the ProPublica Congress API. There was no primary key available for linking the two datasets: the best we had to work with were the names of the members of congress.<p>This would be fine, if the names were represented uniquely and consistently. This was not the case. You can't join "Mitch McConnell" to "A. Mitchell McConnell, Jr." without a bit of work.<p>Manually matching every single name from the first data source to every single name in the second would be tedious, time consuming, and error prone. Instead, we used the Levenshtein distance to compute a similarity metric between each name in the first dataset and each name in the second. Simply using the best match according to this metric correctly matched more than 95% of the names, and made it incredibly simple to review the list and manually fix the few incorrect matches.<p>There's also an accompanying Deepnote dashboard where you can compare string distances between pairs of strings of your choosing: <a href="https://deepnote.com/@dliden-bitdotio/Whats-in-a-Name-28418c6c-f87f-4367-8744-ead5b93c227b" rel="nofollow">https://deepnote.com/@dliden-bitdotio/Whats-in-a-Name-28418c...</a></p>
]]></description><pubDate>Thu, 05 May 2022 15:24:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=31274534</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31274534</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31274534</guid></item><item><title><![CDATA[Record Linkage on People's Names with Approximate String Matching]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/simple-record-linkage-with-approximate-string-matching-a4a480566122?gi=57afada8ed63">https://innerjoin.bit.io/simple-record-linkage-with-approximate-string-matching-a4a480566122?gi=57afada8ed63</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31274156">https://news.ycombinator.com/item?id=31274156</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 05 May 2022 14:54:59 +0000</pubDate><link>https://innerjoin.bit.io/simple-record-linkage-with-approximate-string-matching-a4a480566122?gi=57afada8ed63</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31274156</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31274156</guid></item><item><title><![CDATA[Using Telemedicine to Address the Access Gap for Opioid Treatment]]></title><description><![CDATA[
<p>Article URL: <a href="https://knowablemagazine.org/article/mind/2022/does-online-opioid-treatment-work">https://knowablemagazine.org/article/mind/2022/does-online-opioid-treatment-work</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31242537">https://news.ycombinator.com/item?id=31242537</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 02 May 2022 23:38:15 +0000</pubDate><link>https://knowablemagazine.org/article/mind/2022/does-online-opioid-treatment-work</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31242537</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31242537</guid></item><item><title><![CDATA[How Much Resource Use is Fair? National responsibility for ecological breakdown]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.datawrapper.de/fair-share-resources/">https://blog.datawrapper.de/fair-share-resources/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31072610">https://news.ycombinator.com/item?id=31072610</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 18 Apr 2022 16:08:34 +0000</pubDate><link>https://blog.datawrapper.de/fair-share-resources/</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31072610</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31072610</guid></item><item><title><![CDATA[Surveying SQL parser libraries in a few high-level languages]]></title><description><![CDATA[
<p>Article URL: <a href="https://datastation.multiprocess.io/blog/2022-04-11-sql-parsers.html">https://datastation.multiprocess.io/blog/2022-04-11-sql-parsers.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31068375">https://news.ycombinator.com/item?id=31068375</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 18 Apr 2022 07:54:34 +0000</pubDate><link>https://datastation.multiprocess.io/blog/2022-04-11-sql-parsers.html</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31068375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31068375</guid></item><item><title><![CDATA[Bayesian Data Analysis with Jags: A Dirichlet-Multinomial Model of Wordle Scores]]></title><description><![CDATA[
<p>Article URL: <a href="https://innerjoin.bit.io/ask-a-bayesian-who-is-better-at-wordle-76a0e5199ed">https://innerjoin.bit.io/ask-a-bayesian-who-is-better-at-wordle-76a0e5199ed</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=31039061">https://news.ycombinator.com/item?id=31039061</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 15 Apr 2022 11:44:47 +0000</pubDate><link>https://innerjoin.bit.io/ask-a-bayesian-who-is-better-at-wordle-76a0e5199ed</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=31039061</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31039061</guid></item><item><title><![CDATA[New comment by data_dan_ in "How Fast Can We Write a 10M Row CSV to a Postgres Database with Pandas?"]]></title><description><![CDATA[
<p>I wrote this article after a colleague pointed out that the Pandas DataFrame.to_sql() method uses row-by-row INSERTs. There are plenty of good reasons for this, and the to_sql method works great with many different SQL database flavors, but it's not fast.<p>This article compares the performance of different methods for writing a Pandas DataFrame to a PostgreSQL database using the to_sql method on DataFrames ranging from 100 rows to 10,000,000 rows.</p>
]]></description><pubDate>Wed, 02 Mar 2022 18:22:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=30530791</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=30530791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30530791</guid></item><item><title><![CDATA[New comment by data_dan_ in "How Great Is the Great Resignation? A Primer on Labor Turnover Data Sources"]]></title><description><![CDATA[
<p>To me it just looks like it's too early to tell. Did quits go up very quickly in 2021? Sure! But that comes on the heels of a massive spike in layoffs that occurred in 2020. It is at least a possibility that the current situation is a response to that.<p>One point I didn't go into is the fact that the labor force participation rate also dropped steeply in 2020 and hasn't recovered to pre-pandemic levels yet. So that could create labor shortages that are not necessarily represented in the quits rate.</p>
]]></description><pubDate>Wed, 09 Feb 2022 23:48:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=30280517</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=30280517</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30280517</guid></item><item><title><![CDATA[New comment by data_dan_ in "How Great Is the Great Resignation? A Primer on Labor Turnover Data Sources"]]></title><description><![CDATA[
<p>(I wrote this) I wish I had clearer answers to those questions! The conclusion is just a little unsatisfying from a writing perspective—it's just too early to tell exactly what's going on. The quits data through most of 2021 don't look all that different from what we'd expect based on the pre-pandemic trend, though there were definitely more than anticipated, especially later in the year. But this is following a massively-disruptive pandemic that put a lot of people out of work and in general had a dramatic impact on the employment situation.<p>I think the most interesting part is the decrease in layoffs that coincided with the increase in quits. People aren't leaving <i>that</i> much more than before, but when they leave, they're doing so on their own terms.</p>
]]></description><pubDate>Wed, 09 Feb 2022 15:14:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=30273541</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=30273541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30273541</guid></item><item><title><![CDATA[New comment by data_dan_ in "Ask HN: At what point in your engineering career did you feel respected?"]]></title><description><![CDATA[
<p>Maybe I've been especially fortunate, or I'm just not understanding the question right, but I've felt respected pretty much from the beginning of my career (aside from grad school, which was not great, in many respects). Or at least I've felt "treated with respect" -- not sure if that's exactly the same thing. But I've always been given a fair amount of independence at work, and I've generally been able to solve whatever I'm assigned to solve, or clearly articulate the challenges, allowing those with different/more expertise to help out. And I've never been made to feel ashamed or inadequate because of either the work I've completed or the work I've needed to seek out additional help to complete.</p>
]]></description><pubDate>Mon, 03 Jan 2022 21:28:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=29787179</link><dc:creator>data_dan_</dc:creator><comments>https://news.ycombinator.com/item?id=29787179</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29787179</guid></item></channel></rss>