<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ndneighbor</title><link>https://news.ycombinator.com/user?id=ndneighbor</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 03:27:09 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ndneighbor" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Claude please rack me a datacenter, make no mistakes]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.railway.com/p/datacenter-no-mistakes">https://blog.railway.com/p/datacenter-no-mistakes</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48388523">https://news.ycombinator.com/item?id=48388523</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 03 Jun 2026 19:16:00 +0000</pubDate><link>https://blog.railway.com/p/datacenter-no-mistakes</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48388523</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48388523</guid></item><item><title><![CDATA[Going multi-cloud with an in-housed status page]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.railway.com/p/multi-cloud-status-page">https://blog.railway.com/p/multi-cloud-status-page</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48362501">https://news.ycombinator.com/item?id=48362501</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 01 Jun 2026 20:54:18 +0000</pubDate><link>https://blog.railway.com/p/multi-cloud-status-page</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48362501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48362501</guid></item><item><title><![CDATA[New comment by ndneighbor in "Railway Blocked by Google Cloud"]]></title><description><![CDATA[
<p><i>huh- I guess there are two HN submissions with meaningful replies...</i><p>As I said in the other thread, we got access to our account back, but even with an Account Rep. and a CSM on our account, it still took them a while to figure out what was going on.<p>I'm sure it could have been worse if we didn't have a rep on our account.</p>
]]></description><pubDate>Wed, 20 May 2026 02:48:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48202521</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48202521</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48202521</guid></item><item><title><![CDATA[New comment by ndneighbor in "Incident Report: Railway Blocked by Google Cloud [resolved]"]]></title><description><![CDATA[
<p>Actually, when I made the TOS check, I put that in Redis. That + the feature flags got reset.</p>
]]></description><pubDate>Wed, 20 May 2026 02:20:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48202322</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48202322</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48202322</guid></item><item><title><![CDATA[New comment by ndneighbor in "Incident Report: Railway Blocked by Google Cloud [resolved]"]]></title><description><![CDATA[
<p>They deleted our GCP proj. sans warning. Still working the details, but that's how this whole thing began.</p>
]]></description><pubDate>Wed, 20 May 2026 01:58:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48202166</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48202166</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48202166</guid></item><item><title><![CDATA[New comment by ndneighbor in "Incident Report: Railway Blocked by Google Cloud [resolved]"]]></title><description><![CDATA[
<p>> decisions are easy to criticize in hindsight<p>I mean, the pain we have caused our customers ultimately proves you correct. That said, we made our decisions with the information and constraints that we knew at that moment in time. Railway has hosts in AWS, GCP, and co-los, so coordinating those workloads in a fully distributed manner would be ideal, but at the end of the day, we didn't foresee that we would have our project deleted just like that.<p>(This is even though we got assurances from them in 2024 that it wouldn't happen again, although we only got auto-rate-limited the last time.)</p>
]]></description><pubDate>Wed, 20 May 2026 01:33:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48201967</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48201967</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48201967</guid></item><item><title><![CDATA[New comment by ndneighbor in "Railway Is Having a Major Outage"]]></title><description><![CDATA[
<p>We have a CSM, Head of Customer Support contact, and further contacts with GCP. Despite that, we still had this issue.</p>
]]></description><pubDate>Wed, 20 May 2026 01:20:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48201874</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48201874</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48201874</guid></item><item><title><![CDATA[New comment by ndneighbor in "Incident Report: Railway Blocked by Google Cloud [resolved]"]]></title><description><![CDATA[
<p>Yea, I mean, that's the whole MO of our platform and we failed at that. So yea, that's disappointing, and more so for our customers.<p>I can provide an explanation about the GCP dependency. Yes, we host workloads off GCP, and we have been able to build a good business by performing a cloud exit. However, we were worried that we would have a circular dependency on our own cloud. I don't think we expected to get auto-modded out of our own account, hence we left our DB on CloudSQL.<p>It was never our intent to deceive people about whether we own our own destiny with our business. After the last GCP issue (when we got auto-rate-limited, which was bad, but survivable), we were assured that this scenario wouldn't happen, but it seems like we have further work to do. Apologies.</p>
]]></description><pubDate>Wed, 20 May 2026 01:18:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48201863</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48201863</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48201863</guid></item><item><title><![CDATA[Counting to 3 with a new builder processing 50M+ monthly builds]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.railway.com/p/new-builder-scale-big">https://blog.railway.com/p/new-builder-scale-big</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48140165">https://news.ycombinator.com/item?id=48140165</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 14 May 2026 19:37:13 +0000</pubDate><link>https://blog.railway.com/p/new-builder-scale-big</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48140165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48140165</guid></item><item><title><![CDATA[New comment by ndneighbor in "Software Internals Book Club"]]></title><description><![CDATA[
<p>confirmed<p>(I help host nycsystems w/ Phil- we don't mind, just an easier way to know who is who other than email)</p>
]]></description><pubDate>Tue, 12 May 2026 06:41:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48104989</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=48104989</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48104989</guid></item><item><title><![CDATA[New comment by ndneighbor in "Are We Idiocracy Yet?"]]></title><description><![CDATA[
<p>I think the intent of Mike Judge's joke was less an outright promotion of eugenics and more a mockery of the approach to family planning among the upper crust of American society. (Something Judge became intimately familiar with during his time in SV, when he worked for a graphics card company.)<p>A lot of his work on KotH analyzed the same dynamics and interplay between educated and uneducated America, and I think Idiocracy is essentially the terminus of those observations: what happens if the idiots get their way. (A semi-common plot point in KotH has Hank pitted against ridiculous circumstances.)</p>
]]></description><pubDate>Tue, 07 Apr 2026 12:03:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47673904</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=47673904</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47673904</guid></item><item><title><![CDATA[New comment by ndneighbor in "Railway (PaaS) global outage"]]></title><description><![CDATA[
<p>We have more info coming soon, but I think the best way to frame this is to work backwards and then explain how it impacted your services and others'.<p>So Railway (and other cloud providers) deal with fraud near constantly. The internet is a bad and scary place, and we spend maybe a third to half of our total engineering cycles just on fraud/uptime-related work. I don't wanna give any credit to anyone from script kiddies to hostile nation states, but we (and others) are under near-constant bombardment from crap workloads in the form of traffic, or not-great CPU cycles, or sometimes, more benignly, movie pirating.<p>Most cloud providers understandably don't like talking about it because, ironically, the more they talk about it, the more the bad actors get a kick out of seeing the chaos that they cause. Begin the vicious cycle...<p>This hopefully answers:<p>> If 3% of your services were affected, does that match your expected fraud rate? That is an awful lot of customers to take down in one go, and you'd want to be very accurate in your modeling. I can't see how you'd plan to kill that many without false positives and negative media.<p>In our 5-year history, this is the third abuse-related major outage. One was a nation-state DDoS, one was a coordinated denial. This is the first one where a false positive took down services automatically. We tune it constantly, so it's not really an issue except when it is.<p>So, with that background, we tune our boxes of, let's say, "performance" rules constantly. When we see bad workloads or bad traffic, we have automated systems that "discourage" that use entirely.<p>When we updated those rules because we detected a new pattern and then rolled the update out, that's when we nailed the legit users. Since this used the abuse pattern, it didn't show on your dash, hence the immediate gaslighting.<p>Which leads to the other question:<p>> How or why were customers not notified? 
I have used services before where if something seemed dodgy they would proactively reach out and say 'tell us if it's legit or in 24 hours it will be shut down' or for something truly bad, eg massive CPU usage affecting other services, they'd kill it right away but would _tell you_.<p>We don't want to tell fraudulent customers whether they are effective or not. In this instance, it was a straight-up logic bug in the heuristics match. But we have done this for our entire existence: for example, black-holing illegitimate traffic, then banning. We do this because some coordinated actors would deploy, get banned with a "reason", and then turn to backup accounts once they found that whatever they were doing was working. If you know where to look, sometimes they will brag on their IRCs/Discords.<p>Candidly, we don't want to be transparent about this, but after any user impact like this, transparency is the least we can do. Zooming out, macro-wise, this is why Discord and other services are leaning towards ID verification. ...and it's hard for people on the non-service-provider side to appreciate the level of garbage out there on the internet. That said, that is an excuse. We shovel that garbage so that you can do your job, and if we stop you, then that's on us, which we own and will hopefully do better about.<p>That said, you and others are understandably miffed (understatement), and all we can do is work through our actions to rebuild trust.</p>
]]></description><pubDate>Wed, 11 Feb 2026 19:58:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=46980029</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=46980029</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46980029</guid></item><item><title><![CDATA[New comment by ndneighbor in "Railway (PaaS) global outage"]]></title><description><![CDATA[
<p>Hey there Dave, Angelo from Railway here-<p>First off, super duper sorry. It's sometimes a good/bad thing if I can remember someone's handle. ...and I specifically remember the support thread where we did have an outage before your demo :| - the number one goal for us is to deliver a great product. Number two is that we should never embarrass a user, and outages do exactly that.<p>We just wrapped up the post mortem, which will be published soon and explains why the dashboard was reporting the state of the application incorrectly, and we would be more than happy to credit you for the impact to keep your business. That said, totally understand if two outages is way too much impact for your services.</p>
]]></description><pubDate>Wed, 11 Feb 2026 19:32:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=46979674</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=46979674</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46979674</guid></item><item><title><![CDATA[New comment by ndneighbor in "Railway (PaaS) global outage"]]></title><description><![CDATA[
<p>(Angelo from Railway here)<p>Heard. Being transparent, usually the delay on ack is us trying to determine and correlate the issue. We have a post mortem going out, but we note that the first report was in our system 10 minutes before it was acked, during which the platform team was trying to see which layer the impact was at.<p>That said, this is maybe concern #1 of the support team: we want the delta between the first report and the customer outage being detected to be as small as possible. The way it usually works is that we have the platform alarms and pages go first, and then the platform engineer will usually page a support eng. to run communications.<p>Usually the priority is to have the platform engineer focus on triaging the issue and then offload the workload to our support team so that we can accurately state what is going on. We have a new comms clustering system that is rolling out so that if we get 5 reports with similar content, it pages up to the support team as well. (We will roll this out after we have communicated with affected customers first.)</p>
]]></description><pubDate>Wed, 11 Feb 2026 19:26:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46979589</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=46979589</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46979589</guid></item><item><title><![CDATA[New comment by ndneighbor in "Vercel's CEO offers to cover expenses of 'Jmail'"]]></title><description><![CDATA[
<p>Angelo from Railway here. Railway runs our own metal for the sole reason of preserving margins so we can run in perpetuity.<p>We're nuts for studying failure at the company, and Heroku's margins were something we considered to be one of the many nails in that coffin. (RIP)<p>(my rant here: <a href="https://blog.railway.com/p/heroku-walked-railway-run" rel="nofollow">https://blog.railway.com/p/heroku-walked-railway-run</a>)</p>
]]></description><pubDate>Tue, 10 Feb 2026 20:39:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=46966562</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=46966562</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46966562</guid></item><item><title><![CDATA[New comment by ndneighbor in "OpenAI acquires Sky.app"]]></title><description><![CDATA[
<p>I think this acquisition makes a lot of sense and it's good business. Finding good macOS developers who know the system-level APIs beyond what the docs cover is a tough go. It would make a lot of sense that OpenAI would just go ahead and hire this expertise as they try to get their Mac app and their iOS app closer and closer to the system.</p>
]]></description><pubDate>Fri, 24 Oct 2025 02:22:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=45690053</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=45690053</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45690053</guid></item><item><title><![CDATA[New comment by ndneighbor in "UA 1093"]]></title><description><![CDATA[
<p>The unfortunate irony is not lost on me that WindBorne's H1 is "record breaking Weather Balloons".<p>I don't think any company would want this record. I am very glad the pilot and the souls on board are safe.</p>
]]></description><pubDate>Tue, 21 Oct 2025 16:23:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=45657703</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=45657703</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45657703</guid></item><item><title><![CDATA[New comment by ndneighbor in "Why most product planning is bad and what to do about it"]]></title><description><![CDATA[
<p>A man can dream ;-;</p>
]]></description><pubDate>Fri, 03 Oct 2025 00:04:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45457105</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=45457105</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45457105</guid></item><item><title><![CDATA[New comment by ndneighbor in "Why most product planning is bad and what to do about it"]]></title><description><![CDATA[
<p>Author here, not my intent! My deepest apologies. English is my first language, but people do joke that I write English like I learned it as a second language.<p>I have fixed the sentence fragment and connected the two thoughts together. Thank you for keeping me honest.</p>
]]></description><pubDate>Thu, 02 Oct 2025 20:59:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45455492</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=45455492</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45455492</guid></item><item><title><![CDATA[New comment by ndneighbor in "Why most product planning is bad and what to do about it"]]></title><description><![CDATA[
<p>I am more than happy to add color here. I am sorry, I try my best to write everything, but my editor cuts as much as I add. We also tend to hire really autonomous engineers who tend to like just going off on their own to try to solve the issue.<p>There have been a few times where we would commit to the problem, assign a DRI, and then find out midway that... no, we have to hire/consult our way out of the issue. I think that's okay; we then look back at the retro to see what we missed.<p>If interested, I think we can blog about what happens when a problem gets converted to an RFC and then we have more engineering discussions with the stakeholders, but the piece was pushing a 10 min read time as it was...</p>
]]></description><pubDate>Thu, 02 Oct 2025 20:57:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=45455461</link><dc:creator>ndneighbor</dc:creator><comments>https://news.ycombinator.com/item?id=45455461</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45455461</guid></item></channel></rss>