<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Gormo</title><link>https://news.ycombinator.com/user?id=Gormo</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 01:26:13 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Gormo" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Gormo in "FPS.cob: A first person shooter in COBOL"]]></title><description><![CDATA[
<p>Direct link to the video:  <a href="https://www.youtube.com/watch?v=qzpZQe7JT-o" rel="nofollow">https://www.youtube.com/watch?v=qzpZQe7JT-o</a><p>It's quite interesting, and describes how the developer used COBOL to implement a raycasting algorithm and generate a stream of PPM images to to pipe into FFplay.  There's <i>zero</i> evidence of this being written in Claude; there are multiple clips of the developer working in VSCode, where the Claude plugin doesn't even appear to be installed.</p>
]]></description><pubDate>Thu, 11 Jun 2026 19:55:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48495584</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48495584</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48495584</guid></item><item><title><![CDATA[New comment by Gormo in "Switzerland wil have a referendum to cap population at 10M"]]></title><description><![CDATA[
<p>In Belfast?  Was that a Protestant Somali man or a Catholic Somali man?</p>
]]></description><pubDate>Tue, 09 Jun 2026 15:27:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48462399</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48462399</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48462399</guid></item><item><title><![CDATA[New comment by Gormo in "Switzerland wil have a referendum to cap population at 10M"]]></title><description><![CDATA[
<p>> It wasn’t that long ago that American racists debated whether Italians and even Irish were truly “white.” The definition of white had expanded considerably over the years.
Eastern Europeans, Jews (of course), and Russians were also at times not “white.”<p>This idea is mostly a modern fabrication.  Various more granular ethnic biases were of course present throughout American history, but those were never conflated with racial categories: in times and places where the white vs. black racial division was relevant, the ethnic groups you're referencing were always considered "white".<p>And the types of discrimination that people in white ethnic groups sometimes experienced was of of a type and of a degree vastly different from that experienced by black people.  They're really two very distinct phenomena, and weren't evenly distributed throughout the US -- black people in the South had the legacy of slavery, Jim Crow segregation, and other horrifying things to deal with, whereas white immigrant communities in the Northeast or Midwest never experienced anything remotely similar.</p>
]]></description><pubDate>Tue, 09 Jun 2026 15:15:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48462218</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48462218</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48462218</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> The camera analogy is a good one but I have never had a camera that had every great picture somebody else had taken, plus every work of art, baked into it.<p>I've never had an LLM that had any of that baked into it either.  LLMs just have token correlations trained on those works.  Trying to get an LLM to output the data it was trained on verbatim is something I'd expect to be heading into monkeys-on-typewriters territory.  "Write something in the style of Shakespeare" and "give me the original text of Hamlet" are two very different things.<p>> I agree with the framing of the AI as a tool not an autonomous entity. The thing is, to me, it is exactly that framing that makes it so the use of that tool means "copying" more than it means "learning and taking inspiration and creating new art", because who is doing the learning and being inspired?<p>It's not learning or taking inspiration, though.  It's just making statistical inferences based on token correlations.  Whether or not that's analogous to how humans learn is something I think is a metaphysical question that is of little practical relevance.  The fact remains that LLMs are not human, have no intentions of their own, do not exercise any kind of agency despite how often people employing the misnomer "agentic", and are ultimately glorified statistical models.<p>The LLM is a tool that extends human capacities in the same way as any other mathematical framework or technological device.<p>> I think of a trained AI like a lossy, highly compressed copy of its training data set.<p>I've seen a few people in this thread make that argument, but I just can't agree with it.  It's not compression, lossy or lossless, which aims to deterministically encode a representation of the specific input data.  The training data is analogous to the sample set used in a regression analysis to generate a polynomial function -- it's not valid to treat the output from any application of that polynomial as a copy of the original sample data.</p>
]]></description><pubDate>Thu, 04 Jun 2026 00:41:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48392211</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48392211</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48392211</guid></item><item><title><![CDATA[New comment by Gormo in "Gmail thinks I'm stupid, so I left"]]></title><description><![CDATA[
<p>> It was the first time I can say that installing a linux OS was easier and friendlier than Windows.<p>I'd say that from work experience managing an IT department that maintains and deploys both Windows and Linux machines, the administrative overhead involved in working with Windows first exceeded that of Linux at some point in the Windows 10 life cycle -- at least five years ago.  Since then, Windows has been getting worse and worse, and Linux has been getting better and better.<p>With most corporate software being accessible via the web and/or being cross-platform these days, we're seriously debating moving the standard corporate workstation configuration to Linux.</p>
]]></description><pubDate>Wed, 03 Jun 2026 16:00:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48385848</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48385848</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48385848</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> But try to write your own story of a lion cub chased away by his uncle and living in a jungle until his childhood friend finds him and convinces him to reclaim his kingdom, and you'll quickly hear from Disney's lawyers how non-derivative it really is.<p>I'd expect them to say "we don't like this, but since it's not actually a derivative work, we can't do anything about it".  As long as you're not directly copying things like characters, dialogue, etc., it's <i>not</i> a derivative work.<p>That's why <i>Armageddon</i> is not a derivative work of <i>Deep Impact</i>, the <i>Shark Attack</i> series is not a derivative work of <i>Jaws</i>, the more famous <i>Titanic</i> is not a derivative work of 1979's <i>S.O.S. Titanic</i>, and the <i>Harry Potter</i> series is not a derivative work of <i>Teen Witch</i>.<p>Using the same story themes, plot points, and setting as another work does not implicate that other work's copyright.  Only substantial copying of specifics does.</p>
]]></description><pubDate>Tue, 02 Jun 2026 17:13:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48373118</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48373118</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48373118</guid></item><item><title><![CDATA[New comment by Gormo in "Apple rejected my dictation app for using the accessibility API"]]></title><description><![CDATA[
<p>Of course, one might construe Apple as an MITM in the relationship between the user an and the software vendor.</p>
]]></description><pubDate>Tue, 02 Jun 2026 15:19:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48371429</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48371429</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48371429</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> under the condition that if you use it for anything, I get credited; else, you get nothing.<p>But this has never been a condition in the FOSS world, as far as I'm aware.  I've only ever seen attribution requirements attach to redistribution of source, not usage of the software.<p>I understand that the crux of the debate here is whether training an LLM <i>is</i> redistribution of the underlying code, but to me, it seems to be fairly clear that it is not.<p>> Luckily, FOSS is specific published works, and unless LLMs actually reasonably-provably do such decomposing into ideas/facts (good luck reasoning about that), that part is also irrelevant.<p>That's literally all LLMs do.  That's what tokenization <i>is</i>.  And it's trivially provable, since if you compare LLM models with the copyrighted works you're claiming they replicate, all you'll see on the LLM side is probability matrices representing correlations between decomposed units of knowledge aggregated across the entire dataset as an integrated whole.<p>> Depending on intent, that very much can happen, it's called plagiarism. Good luck proving an LLMs intent.<p>The only intent ever in play is that of the user.  LLMs are just software.</p>
]]></description><pubDate>Tue, 02 Jun 2026 14:09:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48370477</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48370477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48370477</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> But it does exist, and within this framework, the creator gets to say how you may redistribute their IP,<p>Right.  And the way the creator gets to exercise that say is by releasing their work under a license.  If you release your work under a FOSS license, you're saying "you are free to copy this work and use it for your own purposes".<p>Complaining that people are using it for purposes you don't like after you've already given permission to them to use it for whatever purposes they please seems a bit disingenuous.<p>> and "We compressed it very much" isn't an out.<p>It's not, but I don't think we're discussing that.  We're talking about LLMs, not people redistributing zip files containing someone else's work.  If you're trying to imply that LLMs are merely a form of compression, that's a position you've got to argue for, because I'm definitely not seeing any similarity between the two.</p>
]]></description><pubDate>Tue, 02 Jun 2026 13:59:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48370356</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48370356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48370356</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> If you put a GPL C program through Emscripten to run in a browser the output doesn't include the original C code but it's surely a derivative work.<p>Because it does include content from the original work -- this is just a translation, and isn't comparable to how LLMs work.<p>> To me the answer is simply that humans are special.<p>I don't disagree, but I also view LLMs as tools that extend human capacities and not autonomous entities unto themselves.   LLMs are <i>still just software</i>, and can't really be regarded as anything other than instruments that humans use to broaden their capacity to see, appreciate, understand, and draw on that experience in what they create.<p>> That may seem remarkably unfair to the machines, or like a cop-out.<p>No, it's unfair to the humans.  The machines are just tools that they use.  The "double standard" is really a set of inconsistent standards applied to the same underlying moral agents.<p>> After all, if you want to treat a machine exactly like a human who learns from prior art to create new art, then the ownership of the new art would also belong to the machine. Not to the person who prompts it.<p>No, it always belongs to the person who prompts it.  The machine is not a conscious entity, bears no intentions, and has no capacity to act on its own initiative.  The machine is always just a tool that extends human capacity, as all machines always have.<p>For a good comparison here, we've never <i>not</i> credited a photographer as the author of a photograph.  But the photographer is in a sense merely <i>prompting the camera</i> by framing the shot, selecting the exposure, adjusting the lighting, etc. -- the hard work in actually creating the photograph is being done by the camera itself, with the photographer playing no role in directly constructing the final image, and with the many of the qualities of the final image being determined by pre-existing features of the camera's functional design and components that the photographer also played no role in defining, apart from choosing which camera to use.<p>LLMs are like cameras in this way.  And the fact that they rely on external data for model training no more disclaims the user as the author of the resulting work than looking things up in a dictionary or encyclopedia does the same for the author of an essay.</p>
]]></description><pubDate>Tue, 02 Jun 2026 13:52:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48370276</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48370276</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48370276</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>I mean, the most restrictive license, the GPL, was conceived specifically to protect the "four freedoms" and prevent subsequent modifications from violating them.  The "copyleft" concept was specifically designed to create an ecosystem that behaved as if copyright didn't apply in the first place.<p>I don't know how you can imply with a straight face that it did anything else.<p>I don't know how you can possibly argue that non-redistributive <i>usage</i> of software could ever violate the GPL -- and the other common FOSS licenses don't even have the copyleft provision, and literally are saying "do whatever you want, but I'm not responsible".</p>
]]></description><pubDate>Mon, 01 Jun 2026 19:12:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48361263</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48361263</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48361263</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> Uh, that is exactly what a derivative work is.<p>No, it isn't.  A derivative work isn't something based on extracting underlying ideas or patterns from another work, it's something that <i>includes</i> copyrighted portions of the other work.<p>An annotated edition of <i>Hamlet</i> is a derivative work.  A Cliff's Notes summary of <i>Hamlet</i> is a derivative work.<p><i>Strange Brew</i> and <i>The Lion King</i> are <i>not</i> derivative works of <i>Hamlet</i> simply because they include literary themes and plot points that originated in <i>Hamlet</i>.  A list of word counts of popular works of literature that includes an entry for <i>Hamlet</i> is also not a derivative work.  The Markov chain described above is not a derivative work.<p>> The obvious follow up here is whether an LLM is creating transformative derivations or not. A lot of folks argue that yes, an LLM spitting out statistically sampled code that matches existing code is not transformative and is (or might be) infringing the terms of the license it was released under.<p>And I would agree with them.  An LLM that actually is outputting non-trivial code that matches a public project's code verbatim <i>is</i> engaging in copying, and <i>not</i> stochastic inference.<p>> I think it's a pretty obvious "somewhere in the middle" that is gonna make a bunch of lawyers a whole lot of money.<p>It's a shame that the same fundamental questions have to be relitigated over and over again just because the contextual formalities and modes of expression have changed.  I wonder how many of the legal cases are going to be copies or derivative works of previous ones.</p>
]]></description><pubDate>Mon, 01 Jun 2026 19:08:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48361221</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48361221</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48361221</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> That's wrong. What on earth gave you that impression when the licenses specifically set constraints on what downstream can do (from "release derivatives as open" to "put me in the credits").<p>These are restrictions on redistribution, not use.  And they're there to make sure that derivative works can't themselves impose restrictions on use.</p>
]]></description><pubDate>Mon, 01 Jun 2026 18:58:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48361107</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48361107</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48361107</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> It sounds like your view of things is limited mostly to that last version of FOSS, the copyleft style.<p>No, I'm well aware of the different motivations for and approaches to FOSS.  I'm mostly focusing on the copyleft/GNU GPL side of the discussion here because that's the side of the house where most of ideas of a social contract and desire to see a specific ecosystem develop have been located.   People on the MIT/BSD side of things, which has always had a much more direct "do whatever you want" ethos, are not the ones I'd expect to be making these arguments in the first place.<p>> For one, it allows laundering of the copyleft license so their work goes into closed-source products that are never shared.<p>I'd agree that someone using an LLM to create a deterministic transcription of someone else's work is indeed violating the license.  But I think the argument goes beyond that, into using LLMs in any way at all.<p>> That doesn't mean I will be writing OSS software with a new made-up unenforceable license. It just means, like OP, I'll weigh differently whether I want to bother releasing stuff at all.<p>That's a reasonable position, and from the perspective of examining whether the current LLM climate is sapping motivation to participate in FOSS, I can understand where you're coming from.<p>But to that point, I'd argue that if your motivation <i>was</i> to gain recognition, participate in a community, etc. then you're going to lose those things by keeping your code private anyway, whereas you won't necessarily lose those things just because an LLM was trained on your code.  If you contribute to a popular project, people were almost certainly already using your work to do things you don't approve of -- if that didn't take away your motivation, why would LLMs do much worse?</p>
]]></description><pubDate>Mon, 01 Jun 2026 18:57:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48361096</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48361096</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48361096</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> This is extremely false. Copyright additionally grants you exclusive control over the production and distribution of derivative works.<p>A derivative work is a work that <i>itself includes</i> copyrighted content from the original work.<p>That is to say that for something to be a derivative work, some measure of its content must be "CTRL-C, CTRL-V" from the originating work.<p>Something that's merely inspired by another work, or draws underlying themes or factual knowledge from it, is not a derivative work.<p>> A training set is just an anthology,<p>Which might make the training set itself a derivative work, but works created by using the model trained on that anthology are a different matter.<p>> and the training process is condensation.<p>No, it isn't.  It's the creation of a new work that represents patterns extrapolated or interpolated from the data set, without the resulting model actually including any of the copyrighted elements of the work.<p>The underlying ideas and facts in the original work were never protected by copyright.  Only the specific fixed form of expression is copyrightable.<p>Someone who looks at a dozen code examples in public repos to learn how to do e.g. a quick sort, then upon understanding the logic flow of the quick sort algorithm, writes his own quick sort implementation is <i>not</i> creating a derivative work of the code in the repos he exampled.  And the way LLMs work is much more similar to that process than to the "compressed anthology" concept you're describing.</p>
]]></description><pubDate>Mon, 01 Jun 2026 18:41:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48360899</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48360899</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48360899</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> But if a specific work was generated based on a specific open source work, then according to the social contract that binds non-AI code generators such as transpilers, the output is derivative and should follow the license of that open source work.<p>I don't disagree with the premise that any LLM that is cloning code wholesale from a third-party repo is creating a derivative work, and the license terms apply to it.<p>But I also don't agree that non-AI code generators such as transpilers are in the same category as LLMs -- a deterministic process that is simply parsing input from a single source and outputting it in a new form is not the same thing as a stochastic process that interpolates patterns from multiple sources and then uses those patterns to generate novel outputs.<p>> There's also the question of whether the model itself is a redistribution. For every other lossy compression algorithm in history, the answer is a resounding yes. Is a model meaningfully different from a hypercompressed corpus of its learning data?<p>The model <i>isn't</i> a lossy compression archive that merely represents a collection of pre-existing works in parallel to each other.  It's a probability matrix that relates together uniquely isolatable units of data to each other across the entire collection.<p>If I build a Markov chain based on a statistical analysis of word sequences in <i>Hamlet</i>, and then use it to produce a new sentence that isn't found in the text of that work, I have <i>not</i> created a derivative work of <i>Hamlet</i> under any applicable sense of that term.<p>> The social contract of the open source (not to be confused with the legal contract of GPL, MIT etc.) is that developers give users software that they can use and modify in any way they want, and in exchange the users give the developer recognition and help with development and maintance, as well as give each other the assurance that the software will remain available to them and any future users.<p>I don't think that is generally true.  There's always been a hope and expectation that <i>some</i> subset of users would contribute back to the project in the ways you're describing, but never a sense of there being any obligation to do so.  Only a fraction of FOSS users have ever contributed to back to the projects whose software they use.<p>There's always been both a social and legal obligation to properly attribute authors and abide by license terms when redistributing or forking FOSS code, but neither obligation has ever applied when learning programming techniques from FOSS code in order to write your own software.  And the way LLMs are designed to work is more similar to the latter than to the former.<p>But in cases where LLMs actually <i>are</i> acting in ways similar to the former, I agree that they should be held accountable both socially and legally.</p>
]]></description><pubDate>Mon, 01 Jun 2026 17:25:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48359893</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48359893</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48359893</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> But we are talking about a social contract, which is not quite the same thing. The social contract is what leads some devs who previously enjoyed publishing their work openly to no longer feel the same way.<p>Perhaps this illustrates a fissure that was always lurking under the surface, then.  The social contract that I've personally always attributed to FOSS communities was that attempting to restrict how people downstream of you use code is illegitimate, and that licenses like the GPL were meant to use copyright law to achieve something that resembles the state of affairs that might exist if copyright didn't exist in the first place.  That's what the whole concept of "copyleft" always seemed to imply.<p>Now we have a new class of technologies that is admittedly fraught with a wide range of risks and pitfalls, but also a lot of promise to enable people to actually put the "four freedoms" into practice in ways they couldn't before, and we're seeing people who have normative opinions about AI derived from other, unrelated principles trying to circle the wagons and exclude those use cases.  <i>That</i> is what seems like a breach of the social contract as I've always understood it.<p>> Did they mean literally CTRL+C, CTRL+V or something broader?<p>Given that FOSS licenses were always constructed to function <i>within</i> applicable copyright law, I don't see how they could mean anything else.  "Literal CTRL+C, CTRL+V" is the only thing copyright has ever applied to, and the whole point of "copyleft" was to lessen the restrictions on even that.</p>
]]></description><pubDate>Mon, 01 Jun 2026 16:51:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48359362</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48359362</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48359362</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>> It's a tool, if using data is necessary to make the tool work, then its output derives from the data.<p>That's simply not correct within the applicable meaning of "derives" as understood in copyright law.   In fact, data <i>per se</i> is not even within the scope of copyright protection in the first place: specific published works are copyrighted, but the underlying ideas and facts that they convey are not.<p>Even creating works that merely draw on a single source of data, but express the ideas drawn from that in a new or transformative way, are not considered derivative works (see the ruling in Google v. Oracle, for example), let alone works based on patterns extrapolated by relating together ideas sourced from many distinct works, which is what LLMs are principally doing.<p>If you applied the principle you're proposing here to human developers, you'd conclude that any code written by someone who learned to program by studying techniques used in FOSS software would in turn be a derivative work of that software.  No one has ever regarded this to be the case.</p>
]]></description><pubDate>Mon, 01 Jun 2026 16:40:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48359217</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48359217</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48359217</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>What?  What is being "stolen" from you?<p>Are you now layering the old and tired "copyright infringement = stealing" argument on top of the still unsubstantiated premise that all LLM training is copyright infringement?</p>
]]></description><pubDate>Mon, 01 Jun 2026 16:09:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48358763</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48358763</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48358763</guid></item><item><title><![CDATA[New comment by Gormo in "Cessation of public development of Kefir C compiler"]]></title><description><![CDATA[
<p>Both would resolve to the same question, no?<p>There seems to be an implicit premise here that <i>any</i> work generated by an LLM whose training data includes a particular bit of code itself constitutes a redistribution of that code.  I've yet to encounter any strong arguments substantiating this premise as a general principle, and my own suspicion is that it is <i>not</i> valid as a general principle, given the nature of how LLMs operate.<p>It's certainly possible that specific instances of LLMs lazily copy-pasting code from public repos may exist, and the extent to which this is happening is something that can be substantiated by empirical examples, so if you have any to point to, I'd be interested in looking at them.  However, where this is happening, it ought to be regarded as a failure modality of LLMs, and not something that implicates the underlying nature of LLMs, given that their intended purpose is to function as stochastic generators that do <i>not</i> merely copy-paste input data.<p>My initial feeling here is that using open-source code to train LLMs is not per se a violation of the generally accepted FOSS social contract, but rather that attempting to restrict specific use cases of FOSS-licensed code on the basis of normative opinions unrelated to the license terms <i>is</i> a violation, or at least a rejection, of that social contract.  I'm not fully committed to this position, though, and would welcome well-reasoned arguments to the contrary.</p>
]]></description><pubDate>Mon, 01 Jun 2026 15:59:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48358630</link><dc:creator>Gormo</dc:creator><comments>https://news.ycombinator.com/item?id=48358630</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48358630</guid></item></channel></rss>