<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: iliane5</title><link>https://news.ycombinator.com/user?id=iliane5</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 01 Jun 2026 22:02:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=iliane5" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by iliane5 in "Xzbot: Notes, honeypot, and exploit demo for the xz backdoor"]]></title><description><![CDATA[
<p>Just wanted to say thank you for your work and attention to detail, it's immensely valuable and we're all very grateful for it.</p>
]]></description><pubDate>Tue, 02 Apr 2024 10:34:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=39904176</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=39904176</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39904176</guid></item><item><title><![CDATA[New comment by iliane5 in "Video generation models as world simulators"]]></title><description><![CDATA[
<p>Watching an entirely generated video of someone painting is crazy.<p>I can't wait to play with this but I can't even imagine how expensive it must be. They're training in full resolution and can generate up to a minute of video.<p>Seeing how bad video generation was, I expected it would take a few more years to get to this but it seems like this is another case of "Add data & compute"(TM) where transformers prove once again they'll learn everything and be great at it</p>
]]></description><pubDate>Fri, 16 Feb 2024 02:50:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=39392563</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=39392563</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39392563</guid></item><item><title><![CDATA[New comment by iliane5 in "OpenAI is too cheap to beat"]]></title><description><![CDATA[
<p>I think it's mostly the scale. Once you have a consistent user base and tons of GPUs, batching inference/training across your cluster allows you to process requests much faster and for a lower marginal cost.</p>
]]></description><pubDate>Thu, 12 Oct 2023 19:43:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=37862166</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=37862166</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37862166</guid></item><item><title><![CDATA[New comment by iliane5 in "Why are LLMs general learners?"]]></title><description><![CDATA[
<p>What I was saying is that because you need to go out of your way to make sure it's tokenized properly, I wouldn't be surprised if there are enough non properly tokenized examples in the dataset.<p>If that was the case, it would make it difficult to generalize these concepts.</p>
]]></description><pubDate>Tue, 13 Jun 2023 15:16:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=36311490</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=36311490</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36311490</guid></item><item><title><![CDATA[New comment by iliane5 in "Why are LLMs general learners?"]]></title><description><![CDATA[
<p>> LLMs are not particularly good at arithmetic, counting syllables, or recognizing haikus<p>I suspect most of this is due to tokenization making it difficult to generalize these concepts.<p>There are some weird edge cases though, for example GPT-4 will almost always be able to add two 40 digits number but it is also almost always wrong when adding a 40 digit and 35 digit number.</p>
]]></description><pubDate>Tue, 13 Jun 2023 01:27:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=36303927</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=36303927</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36303927</guid></item><item><title><![CDATA[New comment by iliane5 in "OpenAI Employee: GPT-4 has been static since March"]]></title><description><![CDATA[
<p>AFAIK it's pretty standard practice not to expose the "raw" LLM directly to the user. You need a "sanity loop" where user input and the output of the LLM is checked by another LLM to actually enforce rules and mitigate prompt injections, etc.</p>
]]></description><pubDate>Thu, 01 Jun 2023 22:54:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=36158541</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=36158541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36158541</guid></item><item><title><![CDATA[New comment by iliane5 in "Superintelligence: An idea that eats smart people (2016)"]]></title><description><![CDATA[
<p>100% agree.<p>However, seeing how excited Palantir is with their war assistant LLM , the US testing autonomous fighter jets a few months ago, etc. I think there's a decent chance that AI won't even have to break out of its constraints. It's pretty much guaranteed people are going to do the obviously dumb thing and give it capabilities it shouldn't have or is not equipped to deal with safely.</p>
]]></description><pubDate>Sun, 28 May 2023 15:16:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=36104736</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=36104736</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36104736</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>I don't think we need sentient AI for it to be autonomous. LLMs are powerful cognitive engines and weak knowledge engines. Cognition on its own does not allow them to be autonomous, but because they can use tools (APIs, etc.) they are able to have some degree of autonomy when given a task and can use basic logic to follow them through/correct their mistakes.<p>AutoGPTs and the likes are much overhyped (it's early tech experiments after all) and have not produced anything of value yet but having dabbled with autonomous agents, I definitely see a not so distant future when you can outsource valuable tasks to such systems.</p>
]]></description><pubDate>Tue, 16 May 2023 15:56:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=35963625</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35963625</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35963625</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>> Why is building what amounts to a calculator/spreadsheet/CAD program for language somehow a Rubicon that cannot be crossed?<p>We've already crossed it and I believe we should go full steam ahead, tech is cool and we should be doing cool things.<p>> Did people freak out this much about computers replacing humans when they were shown to be good at math?<p>Too young but I'm sure they did freak out a little! Computers have changed the world and people have internalized computers as being much better/faster at math but <i>exhibiting</i> creativity, language proficiency and thinking is not something people thought computers were supposed to do.</p>
]]></description><pubDate>Tue, 16 May 2023 14:21:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=35962286</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35962286</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35962286</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>There's no denying this is regulatory capture by OpenAI to secure their (gigantic) bag and that the "AI will kill us all" meme is not based in reality and plays on the fact that the majority of people do not understand LLMs.<p>I was simply explaining why I believe your perspective is not represented in the discussions in the media, etc. If these models were not getting incredibly good at mimicking intelligence, it would not be possible to play on people's fears of it.</p>
]]></description><pubDate>Tue, 16 May 2023 14:10:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=35962127</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35962127</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35962127</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>The human brain works around a lot of limiting biological functions. The necessary architecture to fully mimic a human brain on a computer might not look anything like the actual human brain.<p>That said, there are 8B+ of us and counting so unless there is magic involved, I don't see why we couldn't do a "1:1" replica of it (maybe far) in the future.</p>
]]></description><pubDate>Tue, 16 May 2023 13:55:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=35961948</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35961948</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35961948</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>> I don't want every to know how to make a bomb.<p>This information is not created inside the LLMs, it's part of their training data. If someone is motivated enough, I'm sure they'd need no more than a few minutes of googling.<p>> I do feel like this is more than a math formula<p>The sum is greater than the parts! It can just be a math formula and still produce amazing results.
After all, our brains are just a neat arrangement of atoms :)</p>
]]></description><pubDate>Tue, 16 May 2023 13:36:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=35961691</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35961691</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35961691</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman goes before US Congress to propose licenses for building AI"]]></title><description><![CDATA[
<p>> Why is it so hard to hear this perspective? Like, genuinely curious.<p>Because people have different definition of what intelligence is. Recreating the human brain in a computer would definitely be neat and interesting but you don't need that nor AGI to be revolutionary.<p>LLMs, as perfect Chinese Rooms, lack a mind or human intelligence but demonstrate increasingly sophisticated behavior. If they can perform tasks better than humans, does their lack of "understanding" and "thinking" matter?<p>The goal is to create a different form of intelligence, superior in ways that benefit us. Planes (or rockets!) don't "fly" like birds do but for our human needs, they are effectively <i>much</i> better at flying that birds ever could be.</p>
]]></description><pubDate>Tue, 16 May 2023 13:29:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=35961613</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35961613</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35961613</guid></item><item><title><![CDATA[New comment by iliane5 in "StableLM: A new open-source language model"]]></title><description><![CDATA[
<p>Agreed, there is way too much hype about the actual capabilities of the LLaMa models. However, instruction tuning alone makes Alpaca much more usable than the the base model and to be fair even some versions of the "tiny" 7B can do small talk relatively well.<p>> Using GPT to generate training data for fine-tuning seems to produce the best results, but even so, GPT4-x-Alpaca 30B is still clearly inferior to the real thing.<p>Distillation is interesting and it does seems to make the models adopt ChatGPT's style but I'm dubious that making LLMs generate entire datasets or copy/pasting ShareGPT is going to give you that great of a dataset. The whole point of RLHF is getting the human feedback to make the model better. OpenAI's dataset/RLHF work seems to be working wonders for them and will continue to give them a huge advantage (especially now that they're getting hundred of millions of conversations of people doing all sorts of things with ChatGPT)</p>
]]></description><pubDate>Sat, 22 Apr 2023 12:38:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=35666019</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35666019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35666019</guid></item><item><title><![CDATA[New comment by iliane5 in "StableLM: A new open-source language model"]]></title><description><![CDATA[
<p>I'm sure they're tweaking lots of things under the hood, especially now that they have 100M+ users. It could be bigger (30B?, <i>maybe</i> 65B) as coming down from 175B gives quite a lot of room, but the cognitive drop from Davinci gives away that's it's much smaller.<p>People fine-tuning LLaMa models on arguably not that much/not the highest quality data are already seeing pretty good improvements over the base LLaMa, even at "small" sizes (7B/13B). I assume OpenAI has access to much higher quality data to fine-tune with and in much higher quantity too.</p>
]]></description><pubDate>Thu, 20 Apr 2023 20:37:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=35645928</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35645928</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35645928</guid></item><item><title><![CDATA[New comment by iliane5 in "StableLM: A new open-source language model"]]></title><description><![CDATA[
<p>GPT-3.5 is much worse at "complex" cognitive tasks than Davinci (175B), which seem to indicate that it's a smaller model. It's also much faster than Davinci and costs the same as Curie via the API.</p>
]]></description><pubDate>Thu, 20 Apr 2023 16:16:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=35642294</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35642294</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35642294</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman: OpenAI is not training GPT-5 and "won't for some time""]]></title><description><![CDATA[
<p>It’s not only 10x cheaper, it’s also way faster at inference and not as smart as Davinci. IMO the only logical answer is that the model is just smaller.</p>
]]></description><pubDate>Sat, 15 Apr 2023 13:34:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=35580501</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35580501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35580501</guid></item><item><title><![CDATA[New comment by iliane5 in "Sam Altman: OpenAI is not training GPT-5 and "won't for some time""]]></title><description><![CDATA[
<p>I bet they’re not saying how big of a model GPT-4 is because it’s actually much smaller we would expect.<p>ChatGPT is IMO a heavily fine-tuned Curie sized model (same price via API + less cognitive capacity than even text davinci-003) so it would make sense that a heavily fine-tuned Davinci sized model would yield similar results to GPT-4.</p>
]]></description><pubDate>Fri, 14 Apr 2023 22:36:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=35575731</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35575731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35575731</guid></item><item><title><![CDATA[New comment by iliane5 in "Simply explained: How does GPT work?"]]></title><description><![CDATA[
<p>I think as soon as text2video gets really good (like midjourney level), there’s gonna be so much AI generated content that unless it’s all extremely good, human made content will be something people search specifically for.<p>As for curation, I think the success of TikTok proves that you don’t need <i>that much</i> data to pretty preceding pinpoint what someone wants to watch (or what will get them to spend the most time on the app at least).</p>
]]></description><pubDate>Fri, 07 Apr 2023 01:04:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=35476764</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35476764</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35476764</guid></item><item><title><![CDATA[New comment by iliane5 in "Simply explained: How does GPT work?"]]></title><description><![CDATA[
<p>> LLMs architected and trained the way they are now can never approach human reasoning capability<p>Not sure if you’ve played with GPT-4 but honestly it’s getting there. If you take the bar exam, ChatGPT was in the bottom 10% of participants, GPT-4 is in the top 90%.<p>It obviously isn’t the ultimate test of reasoning/intelligence but I think we would agree that a human who’s in the top 90% is likely to be pretty smart.<p>> Of course LLMs aren’t the final word in AI development<p>Couldn’t agree more. AGI will come from plugging a few of these systems together.</p>
]]></description><pubDate>Fri, 07 Apr 2023 00:58:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=35476724</link><dc:creator>iliane5</dc:creator><comments>https://news.ycombinator.com/item?id=35476724</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35476724</guid></item></channel></rss>