<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: dchichkov</title><link>https://news.ycombinator.com/user?id=dchichkov</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 18 Apr 2026 03:07:45 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=dchichkov" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by dchichkov in "OpenAI asks White House for relief from state AI rules"]]></title><description><![CDATA[
<p>>> In the proposal, OpenAI also said the U.S. needs “a copyright strategy that promotes the freedom to learn” and on “preserving American AI models’ ability to learn from copyrighted material.”<p>Perhaps also symmetric "freedom to learn" from OpenAI models, with some provisions / naming convention?  U.S. labs are limited in this way, while labs in China are not.</p>
]]></description><pubDate>Thu, 13 Mar 2025 16:40:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=43354979</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=43354979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43354979</guid></item><item><title><![CDATA[New comment by dchichkov in "Music labels will regret coming for the Internet Archive, sound historian says"]]></title><description><![CDATA[
<p>Just in case, here's the list of these labels:<p>- UMG Recordings, Inc.<p>- Capitol Records, LLC<p>- Concord Bicycle Assets, LLC<p>- CMGI Recorded Music Assets LLC<p>- Sony Music Entertainment<p>- Arista Music<p>Taken from: <a href="https://cdn.arstechnica.net/wp-content/uploads/2025/03/UMG-v-Internet-Archive-Proposed-Seconded-Amended-Complaint-3-6-25.pdf" rel="nofollow">https://cdn.arstechnica.net/wp-content/uploads/2025/03/UMG-v...</a></p>
]]></description><pubDate>Mon, 10 Mar 2025 18:12:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=43323677</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=43323677</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43323677</guid></item><item><title><![CDATA[New comment by dchichkov in "ARC-AGI without pretraining"]]></title><description><![CDATA[
<p>0 1
00 01 10 11
000 001 010 011 100 101 110 111
0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110<p>And no, I don't think the knowledge of language is necessary.  To give a concrete example, tokens from TinyStories dataset (the dataset size is ~1GB) are known to be sufficient to bootstrap basic language.</p>
]]></description><pubDate>Wed, 05 Mar 2025 01:40:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=43261644</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=43261644</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43261644</guid></item><item><title><![CDATA[New comment by dchichkov in "ARC-AGI without pretraining"]]></title><description><![CDATA[
<p>For long context sizes AGI is not useless without vast knowledge. You could always put a bootstrap sequence into the context (think Arecibo Message), followed by your prompt. A general enough reasoner with enough compute should be able to establish the context and reason about your prompt.</p>
]]></description><pubDate>Tue, 04 Mar 2025 20:50:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=43259678</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=43259678</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43259678</guid></item><item><title><![CDATA[New comment by dchichkov in "The Deep Research problem"]]></title><description><![CDATA[
<p>I agree, they are only starting the data flywheel there.  And at the same time making users pay $200/month for it, while the competition is only charging $20/month.<p>And note, the system is now directly competing with "interns". Once the accuracy is competitive (is it already?) with an average "intern", there'd be fewer reasons to hire paid "interns" (more expensive than $200/month).  Which is maybe a good thing? Fewer kids wasting their time/eyes looking at the computer screens?</p>
]]></description><pubDate>Tue, 25 Feb 2025 23:37:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=43178916</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=43178916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43178916</guid></item><item><title><![CDATA[New comment by dchichkov in "[dead]"]]></title><description><![CDATA[
<p>The approach of "cutting funding and then observing whether anything critical fails or is impacted" only works if outcomes follow a normal distribution.<p>This is far from the case — many areas are characterized by heavy-tailed loss distributions, where extreme negative consequences could really ruin the day and erase any efficiency gains.</p>
]]></description><pubDate>Sat, 08 Feb 2025 20:42:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=42986011</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42986011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42986011</guid></item><item><title><![CDATA[New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"]]></title><description><![CDATA[
<p>I've suggested that <i>long context</i> should be included into the prompt.<p>In your particular case the prompt would look something like:
<pubmed dump> what are the plants that aren't poisonous to most people?<p>A general reasoner would recover language and relevant world model from pubmed dump. And then would proceed to reason about it, to perform the task.<p>It doesn't look like a particularly efficient process.</p>
]]></description><pubDate>Thu, 30 Jan 2025 22:43:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=42882935</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42882935</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42882935</guid></item><item><title><![CDATA[New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"]]></title><description><![CDATA[
<p>If you look at the benchmarks of the DeepSeek-V3-Base, it is quite capable, even in 0-shot: <a href="https://huggingface.co/deepseek-ai/DeepSeek-V3-Base#base-model" rel="nofollow">https://huggingface.co/deepseek-ai/DeepSeek-V3-Base#base-mod...</a>  This is not from scratch. These benchmark numbers are an indication that the base model already had a large number of reasoning/LLM tokens in the pre-training set.<p>On the other hand, my take on it, the ability to do reasoning <i>in a long context</i> is a general capability. And my guess is that it can be bootstrapped from scratch, without having to do training on all of the internet or having to distill models trained on the internet.</p>
]]></description><pubDate>Wed, 29 Jan 2025 22:34:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=42872152</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42872152</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42872152</guid></item><item><title><![CDATA[New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"]]></title><description><![CDATA[
<p>There are examples of learning reasoning from scratch with reinforcement learning.<p>Emergent tool use from multi-agent interaction is a good example - <a href="https://openai.com/index/emergent-tool-use/" rel="nofollow">https://openai.com/index/emergent-tool-use/</a></p>
]]></description><pubDate>Wed, 29 Jan 2025 22:21:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=42872017</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42872017</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42872017</guid></item><item><title><![CDATA[New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"]]></title><description><![CDATA[
<p>It should be possible to learn to reason from scratch. And the ability to reason in a long context seems to be very general.</p>
]]></description><pubDate>Wed, 29 Jan 2025 20:30:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=42870738</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42870738</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42870738</guid></item><item><title><![CDATA[New comment by dchichkov in "DeepSeek releases Janus Pro, a text-to-image generator [pdf]"]]></title><description><![CDATA[
<p>MMMU is not particularly high. Janus-Pro-7B is 41.0, which is only 14 points better than random/frequent choice. I'm pretty sure, their base DeepSeek 7B LLM will get around 41.0 MMMU without access to images, this is a normal number for a roughly GPT4-level LLM base with no access to images.</p>
]]></description><pubDate>Mon, 27 Jan 2025 18:13:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=42843968</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42843968</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42843968</guid></item><item><title><![CDATA[New comment by dchichkov in "Prime numbers so memorable that people hunt for them"]]></title><description><![CDATA[
<p>Sorry, but this was ChatGPT/o1 with access to code execution (Python) and it used almost 4 minutes to do reasoning. It had done a few checks with smaller numbers, all of which had failed.  And it proceeded to make a wrong conclusion (with high confidence).</p>
]]></description><pubDate>Wed, 22 Jan 2025 00:04:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=42786902</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42786902</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42786902</guid></item><item><title><![CDATA[New comment by dchichkov in "Prime numbers so memorable that people hunt for them"]]></title><description><![CDATA[
<p>ChatGPT o1: <a href="https://chatgpt.com/share/678feedb-0b2c-8001-bd77-4e574502e4fc" rel="nofollow">https://chatgpt.com/share/678feedb-0b2c-8001-bd77-4e574502e4...</a><p>> Thought about large prime check for 3m 52s: <i>"Despite its interesting pattern of digits, 12,345,678,910,987,654,321
is definitely not prime. It is a large composite number with no small prime factors."</i><p>Feels like this Online Encyclopedia of Integer Sequences (OEIS) would be a good candidate for a hallucination benchmark...</p>
]]></description><pubDate>Tue, 21 Jan 2025 19:01:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=42783962</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42783962</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42783962</guid></item><item><title><![CDATA[New comment by dchichkov in "Ask HN: How do you prevent the impact of social media on your children?"]]></title><description><![CDATA[
<p>I understand that it is mostly regulated at the state level. I'm not sure about other states, but The Computer Science Standards for California Public Schools (Kindergarten through Grade Twelve) also tend to be followed by private schools. So they can claim their programs meet state requirements.<p>This brings computers into the classroom, and once they’re available, it is a slippery slope. It is easier for teachers to have students use semi-gamified "educational" apps rather than engage themselves.<p>Example for K-2 - <a href="https://www.cde.ca.gov/be/st/ss/documents/csstandards.pdf" rel="nofollow">https://www.cde.ca.gov/be/st/ss/documents/csstandards.pdf</a>:<p><pre><code>  K-2.CS.1 Select and operate computing devices that perform a variety of tasks accurately and quickly based on user needs and preferences.

  K-2.CS.2 Explain the functions of common hardware and software components of computing systems.

  K-2.CS.3 Describe basic hardware and software problems using accurate terminology.

  K-2.NI.4 Model and describe how people connect to other people, places, information and ideas through a network.

  ...

  K–2 K-2.AP.12 Create programs with sequences of commands and simple loops, to express ideas or address a problem

  K-2.IC.20 Describe approaches and rationales for keeping login information private, and for logging off of devices appropriately</code></pre></p>
]]></description><pubDate>Wed, 15 Jan 2025 01:15:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42706181</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42706181</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42706181</guid></item><item><title><![CDATA[New comment by dchichkov in "Ask HN: How do you prevent the impact of social media on your children?"]]></title><description><![CDATA[
<p>Another Gorilla is the schools, teachers and state-approved  recommendations, that extend their reach even into private schools.<p>Imagine my frustration one day, when I've discovered that my kindergartner has full access to a brand-new, shiny iPad during class.  Despite complaints from parents, the teacher refused to reduce iPad usage (or even activate Screen Distance and Screen Time controls on the iPad, or share usage statistics).<p>The only thing that I've learned, this is all in line with California’s state-approved computer literacy recommendations.</p>
]]></description><pubDate>Tue, 14 Jan 2025 23:28:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=42705377</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42705377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42705377</guid></item><item><title><![CDATA[New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"]]></title><description><![CDATA[
<p>At the expense of other people's time.</p>
]]></description><pubDate>Wed, 18 Dec 2024 18:26:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=42453301</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42453301</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42453301</guid></item><item><title><![CDATA[New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"]]></title><description><![CDATA[
<p>Whoever invented this is evil and is destroying happiness.</p>
]]></description><pubDate>Tue, 17 Dec 2024 22:44:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=42446262</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42446262</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42446262</guid></item><item><title><![CDATA[New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"]]></title><description><![CDATA[
<p>Safeway, Walgreens.</p>
]]></description><pubDate>Tue, 17 Dec 2024 22:43:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=42446246</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42446246</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42446246</guid></item><item><title><![CDATA[New comment by dchichkov in "FTC Announces Rule Banning Junk Ticket and Hotel Fees"]]></title><description><![CDATA[
<p>I wish that "Online Coupon Price Tags" in stores would also be banned. I'm talking about these yellow price tags that show lower than "Club" prices, which are only valid if you collect a coupon online.<p>Like FTC, I estimate that banning these would save U.S. consumers millions of hours they currently spend searching and clicking on pointless coupons on their phones before making purchases. It would also increase happiness, as it's extremely annoying to pay $20 extra, knowing that a lower price is available if only you spent ten minutes struggling with a store's website on your phone.<p>Whoever invented this is evil and is destroying happiness.</p>
]]></description><pubDate>Tue, 17 Dec 2024 22:13:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=42445933</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42445933</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42445933</guid></item><item><title><![CDATA[New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"]]></title><description><![CDATA[
<p>I wish that "Online Coupon Price Tags" in stores would also be banned. I'm talking about these yellow price tags that show lower than "Club" prices, which are only valid if you collect a coupon online.<p>Like FTC, I estimate that banning these would save U.S. consumers millions of hours they currently spend searching and clicking on pointless coupons on their phones before making purchases. It would also increase happiness, as it's extremely annoying to pay $20 extra, knowing that a lower price is available if only you spent ten minutes struggling with a store's website on your phone.<p>Whoever invented this is evil and is destroying happiness.</p>
]]></description><pubDate>Tue, 17 Dec 2024 22:11:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=42445904</link><dc:creator>dchichkov</dc:creator><comments>https://news.ycombinator.com/item?id=42445904</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42445904</guid></item></channel></rss>