<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: tfederman</title><link>https://news.ycombinator.com/user?id=tfederman</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 23 Apr 2026 09:00:53 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=tfederman" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by tfederman in "Ask HN: Is archive.org a good place for structured data?"]]></title><description><![CDATA[
<p>It's not a big data set that lends itself primarily to analysis, it's more like content. For example, a list of all US Presidents with a lot of metadata or text content fields about them collected/combined from different sources, cleaned, corrected, annotated, etc. (Pretend Wikipedia has only a subset of these fields and considers broadening them out of scope.)<p>As for Github, the data would still be under "my" account and I'm thinking about more of a platform that doesn't depend on one person. Maybe I would manage day to day version control in Github but I'd want to promote occasional releases to be more official and not reliant on my account.</p>
]]></description><pubDate>Fri, 10 Oct 2025 15:23:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45540056</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=45540056</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45540056</guid></item><item><title><![CDATA[Ask HN: Is archive.org a good place for structured data?]]></title><description><![CDATA[
<p>Let's say I put a lot of effort into creating an authoritative data set on a notable topic in a format like CSV, JSON, whatever. And I want to share it and have it housed somewhere more official/permanent than I'll be able to support.<p>Could archive.org be thought of as a place for raw assembled/authoritative data, with a layer that could transform it into a static web site view to be hosted elsewhere with additional design and features but would not be the authoritative source of the underlying data?<p>Is there a better place for this sort of vision?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45539917">https://news.ycombinator.com/item?id=45539917</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Fri, 10 Oct 2025 15:08:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=45539917</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=45539917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45539917</guid></item><item><title><![CDATA[New comment by tfederman in "Newspapers Are Recommending AI-Hallucinated Novels"]]></title><description><![CDATA[
<p>In the GPT-2 era I created CouldReads, a big data set of generated book titles/synopses trained on thousands of e-books. It was a fun project in the naivete of 2020 but it's less amusing now.</p>
]]></description><pubDate>Tue, 20 May 2025 18:32:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=44044477</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=44044477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44044477</guid></item><item><title><![CDATA[Ask HN: Any recommendations for help with (engineering) career presentation?]]></title><description><![CDATA[
<p>I'm in a long career break to work on my own stuff because my financial situation allows it, but I've found one job I'd like to apply for. My LinkedIn profile is extremely neglected and I've never put any effort into that sort of presentation, self promotion, or online presence in general.<p>I'd like to fix that, I want to pay for someone to fix (or help fix) up my profile, touch up my resume, and help write a bio and generic cover letter about my specific strengths and situation.<p>There's a million people and services that do this but are there any that are focused on software engineering and really solid on understanding that specific market?<p>Any recommendations? I'm not looking for a low effort/AI solution.<p>(email in bio)</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44034310">https://news.ycombinator.com/item?id=44034310</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 19 May 2025 20:14:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=44034310</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=44034310</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44034310</guid></item><item><title><![CDATA[New comment by tfederman in "Crawlers impact the operations of the Wikimedia projects"]]></title><description><![CDATA[
<p>A while back I wrote up a way to turn the big Wikipedia XML dump into a database. Not a generic table with articles but thousands of tables, one for each article "type". I'm not sure if this is still the best way to go about it.<p><a href="https://feder001.com/exploring-wikipedia-as-a-database-part-one.html" rel="nofollow">https://feder001.com/exploring-wikipedia-as-a-database-part-...</a></p>
]]></description><pubDate>Fri, 02 May 2025 14:22:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=43870178</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=43870178</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43870178</guid></item><item><title><![CDATA[New comment by tfederman in "Show HN: Fountain of RSS – 30k active feeds and code for sourcing more"]]></title><description><![CDATA[
<p>That looks like a crowdsourced project for turning arbitrary sites into RSS which is very cool, but I don't see a way to get a large RSS data set out of it. And with about 5000 sources (I think) it's not as large as what I was hoping for, but it could be a good complementary source.</p>
]]></description><pubDate>Thu, 17 Apr 2025 15:30:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=43718311</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=43718311</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43718311</guid></item><item><title><![CDATA[Show HN: Fountain of RSS – 30k active feeds and code for sourcing more]]></title><description><![CDATA[
<p>Recently I was searching for a very large collection of RSS feeds and didn't have luck finding an existing one.<p>I decided to see if they could be sourced reliably at any volume from the meta tags of a stream of links, which are easy to find. Long story short, that did work very well. I used the Bluesky firehose as the stream of links.<p>The repo has a script that narrows the firehose down to just a "fountain" of RSS feeds and their metadata. There's also a .tsv with about 30,000 entries I got from running the script on and off over a few days.<p><a href="https://github.com/tfederman/fountain-of-rss/">https://github.com/tfederman/fountain-of-rss/</a><p>p.s. What's good in the RSS tool/reader scene in 2025?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43707363">https://news.ycombinator.com/item?id=43707363</a></p>
<p>Points: 2</p>
<p># Comments: 3</p>
]]></description><pubDate>Wed, 16 Apr 2025 16:30:23 +0000</pubDate><link>https://github.com/tfederman/fountain-of-rss</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=43707363</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43707363</guid></item><item><title><![CDATA[New comment by tfederman in "Ask HN: What are you working on? (March 2025)"]]></title><description><![CDATA[
<p>RSS reader through Bluesky custom feeds: <a href="https://github.com/tfederman/stroma-news" rel="nofollow">https://github.com/tfederman/stroma-news</a><p>Bluesky API library spun off from the other project: <a href="https://github.com/tfederman/pysky" rel="nofollow">https://github.com/tfederman/pysky</a><p>Haven't really started it yet, but a master list of RSS feeds and the code I used to source them: <a href="https://github.com/tfederman/huge-rss-list" rel="nofollow">https://github.com/tfederman/huge-rss-list</a><p>And also a new project to fetch all links seen in the Bluesky firehose and gather metadata to build a database of sites and pages at a more granular level than the domain. For example, is account X posting video links from one YT channel or many?</p>
]]></description><pubDate>Sun, 30 Mar 2025 22:43:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=43528469</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=43528469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43528469</guid></item><item><title><![CDATA[New comment by tfederman in "Show HN: AI-Less Hacker News"]]></title><description><![CDATA[
<p>Just for fun I wanted to do a simple server-side version of this where the submissions would be truly hidden on my account, so it would take effect on mobile too. And avoid client side artifacts like messed up numbering.<p><a href="https://github.com/tfederman/hacker-news-topic-hider">https://github.com/tfederman/hacker-news-topic-hider</a></p>
]]></description><pubDate>Wed, 05 Apr 2023 20:10:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=35459741</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=35459741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35459741</guid></item><item><title><![CDATA[New comment by tfederman in "From hell to HTML: releasing a Python package to easily work with Wikimedia HTM"]]></title><description><![CDATA[
<p>If anyone's interested in an approach to processing the data set quickly, I got something working and wrote it up when I was curious about turning the content into structured data for database tables.<p><a href="https://feder001.com/exploring-wikipedia-as-a-database-part-one.html" rel="nofollow">https://feder001.com/exploring-wikipedia-as-a-database-part-...</a><p><a href="https://feder001.com/exploring-wikipedia-as-a-database-part-two.html" rel="nofollow">https://feder001.com/exploring-wikipedia-as-a-database-part-...</a></p>
]]></description><pubDate>Wed, 05 Apr 2023 13:07:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=35453499</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=35453499</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35453499</guid></item><item><title><![CDATA[New comment by tfederman in "“Just remove the duck” (2013)"]]></title><description><![CDATA[
<p>That's maybe true, and also a good reason I wouldn't work for a large company again.</p>
]]></description><pubDate>Tue, 03 Mar 2015 19:31:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=9140027</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=9140027</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=9140027</guid></item><item><title><![CDATA[New comment by tfederman in "“Just remove the duck” (2013)"]]></title><description><![CDATA[
<p>My company just doesn't have product or project managers and it's wonderful. Fewer meetings, no intermediaries, more agility. It could only work in practice, it could never work in theory.</p>
]]></description><pubDate>Tue, 03 Mar 2015 17:45:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=9139177</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=9139177</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=9139177</guid></item><item><title><![CDATA[New comment by tfederman in "Simplicity and Utility, or Why SOAP Lost"]]></title><description><![CDATA[
<p>I feel like I spend a troubling amount of time as a software developer dealing with and/or avoiding solutions that are much more complicated than the problems they're trying to solve.</p>
]]></description><pubDate>Tue, 09 Dec 2014 17:31:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=8724128</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8724128</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8724128</guid></item><item><title><![CDATA[New comment by tfederman in "Why Clay Shirky just banned technology use in class"]]></title><description><![CDATA[
<p>When I take notes because I need to retain something, the writing part is more valuable than reviewing it later. And writing on paper is important, typing isn't effective for that part.<p>When I take notes just to record details that I can look up later on demand it's faster to type and there's no downside. But that matches work more than it does school.</p>
]]></description><pubDate>Sat, 27 Sep 2014 20:37:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=8378138</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8378138</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8378138</guid></item><item><title><![CDATA[New comment by tfederman in "Amazon Takes a Big Step Toward Making Its Own Deliveries"]]></title><description><![CDATA[
<p>At this point I already get my stuff from Amazon fast enough, and I don't even have a prime account. I'd rather see Amazon improve working conditions for its employees (if what that I've read about that is true) than chase this obsession with feeding my desire to get more material things as fast as possible.<p>Ironically their focus on customer satisfaction has started to make me feel dirty about buying from Amazon.</p>
]]></description><pubDate>Fri, 26 Sep 2014 19:53:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=8374526</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8374526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8374526</guid></item><item><title><![CDATA[New comment by tfederman in "Why We Keep Playing the Lottery"]]></title><description><![CDATA[
<p>When someone gets entertainment value from playing the lottery, is it really about the chance of winning multiplied by the amount of happiness the money would bring? Or does it have more to do with the excitement that comes with each drawing? (Similar to how I can enjoy watching a football season even if my team doesn't win the Super Bowl.)<p>I think it's more likely the latter, though I don't personally see the entertainment value of playing the lottery. I wouldn't lose respect for someone for having a hobby I don't share or understand, though.</p>
]]></description><pubDate>Thu, 25 Sep 2014 17:03:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=8368204</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8368204</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8368204</guid></item><item><title><![CDATA[New comment by tfederman in "Why We Keep Playing the Lottery"]]></title><description><![CDATA[
<p>When the expected value is so definitively against you I guess it's implied that you're playing for variance. Maybe you also feel that you're playing for the entertainment value, which isn't subject to the same mathematical rationality.<p>I get that. When I play poker for 5 hours and come out even at the end of the night or slightly behind, I consider it worthwhile because I had fun for 5 hours.</p>
]]></description><pubDate>Thu, 25 Sep 2014 14:03:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=8367163</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8367163</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8367163</guid></item><item><title><![CDATA[New comment by tfederman in "Loyalty Nearly Killed My Beehive"]]></title><description><![CDATA[
<p>When I saw the title my first instinct was, okay, what's the metaphor between a beehive and a development team or a startup? You could find some parallels, but I agree this article is better.</p>
]]></description><pubDate>Mon, 22 Sep 2014 15:19:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=8350888</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8350888</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8350888</guid></item><item><title><![CDATA[New comment by tfederman in "Kicked Off Facebook, and Wondering Why"]]></title><description><![CDATA[
<p>Maybe it's hard to find Facebook essential (as opposed to convenient) if you lived during a time when you had to actually know your friends' phone numbers and enter them manually. As I did.<p>Anyone you're close to you have a way to contact without Facebook. Anyone else shouldn't make you feel like you're missing anything important. Right?</p>
]]></description><pubDate>Fri, 19 Sep 2014 20:35:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=8342356</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8342356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8342356</guid></item><item><title><![CDATA[New comment by tfederman in "Mystery Science Theater 3000"]]></title><description><![CDATA[
<p>There's a good selection and it's weighted towards earlier seasons which is good but unfortunately no Manos or Santa Claus Conquers the Martians yet.</p>
]]></description><pubDate>Thu, 18 Sep 2014 00:01:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=8332979</link><dc:creator>tfederman</dc:creator><comments>https://news.ycombinator.com/item?id=8332979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8332979</guid></item></channel></rss>