Hacker News: dchichkov

New comment by dchichkov in "OpenAI asks White House for relief from state AI rules"

dchichkov — Thu, 13 Mar 2025 16:40:26 +0000

>> In the proposal, OpenAI also said the U.S. needs “a copyright strategy that promotes the freedom to learn” and on “preserving American AI models’ ability to learn from copyrighted material.”

Perhaps also symmetric "freedom to learn" from OpenAI models, with some provisions / naming convention? U.S. labs are limited in this way, while labs in China are not.

New comment by dchichkov in "Music labels will regret coming for the Internet Archive, sound historian says"

dchichkov — Mon, 10 Mar 2025 18:12:23 +0000

Just in case, here's the list of these labels:

- UMG Recordings, Inc.

- Capitol Records, LLC

- Concord Bicycle Assets, LLC

- CMGI Recorded Music Assets LLC

- Sony Music Entertainment

- Arista Music

Taken from: https://cdn.arstechnica.net/wp-content/uploads/2025/03/UMG-v...

New comment by dchichkov in "ARC-AGI without pretraining"

dchichkov — Wed, 05 Mar 2025 01:40:03 +0000

0 1 00 01 10 11 000 001 010 011 100 101 110 111 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110

And no, I don't think the knowledge of language is necessary. To give a concrete example, tokens from TinyStories dataset (the dataset size is ~1GB) are known to be sufficient to bootstrap basic language.

New comment by dchichkov in "ARC-AGI without pretraining"

dchichkov — Tue, 04 Mar 2025 20:50:36 +0000

For long context sizes AGI is not useless without vast knowledge. You could always put a bootstrap sequence into the context (think Arecibo Message), followed by your prompt. A general enough reasoner with enough compute should be able to establish the context and reason about your prompt.

New comment by dchichkov in "The Deep Research problem"

dchichkov — Tue, 25 Feb 2025 23:37:12 +0000

I agree, they are only starting the data flywheel there. And at the same time making users pay $200/month for it, while the competition is only charging $20/month.

And note, the system is now directly competing with "interns". Once the accuracy is competitive (is it already?) with an average "intern", there'd be fewer reasons to hire paid "interns" (more expensive than $200/month). Which is maybe a good thing? Fewer kids wasting their time/eyes looking at the computer screens?

New comment by dchichkov in "[dead]"

dchichkov — Sat, 08 Feb 2025 20:42:45 +0000

The approach of "cutting funding and then observing whether anything critical fails or is impacted" only works if outcomes follow a normal distribution.

This is far from the case — many areas are characterized by heavy-tailed loss distributions, where extreme negative consequences could really ruin the day and erase any efficiency gains.

New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"

dchichkov — Thu, 30 Jan 2025 22:43:21 +0000

I've suggested that long context should be included into the prompt.

In your particular case the prompt would look something like: what are the plants that aren't poisonous to most people?

A general reasoner would recover language and relevant world model from pubmed dump. And then would proceed to reason about it, to perform the task.

It doesn't look like a particularly efficient process.

New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"

dchichkov — Wed, 29 Jan 2025 22:34:05 +0000

If you look at the benchmarks of the DeepSeek-V3-Base, it is quite capable, even in 0-shot: https://huggingface.co/deepseek-ai/DeepSeek-V3-Base#base-mod... This is not from scratch. These benchmark numbers are an indication that the base model already had a large number of reasoning/LLM tokens in the pre-training set.

On the other hand, my take on it, the ability to do reasoning in a long context is a general capability. And my guess is that it can be bootstrapped from scratch, without having to do training on all of the internet or having to distill models trained on the internet.

New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"

dchichkov — Wed, 29 Jan 2025 22:21:52 +0000

There are examples of learning reasoning from scratch with reinforcement learning.

Emergent tool use from multi-agent interaction is a good example - https://openai.com/index/emergent-tool-use/

New comment by dchichkov in "OpenAI says it has evidence DeepSeek used its model to train competitor"

dchichkov — Wed, 29 Jan 2025 20:30:02 +0000

It should be possible to learn to reason from scratch. And the ability to reason in a long context seems to be very general.

New comment by dchichkov in "DeepSeek releases Janus Pro, a text-to-image generator [pdf]"

dchichkov — Mon, 27 Jan 2025 18:13:48 +0000

MMMU is not particularly high. Janus-Pro-7B is 41.0, which is only 14 points better than random/frequent choice. I'm pretty sure, their base DeepSeek 7B LLM will get around 41.0 MMMU without access to images, this is a normal number for a roughly GPT4-level LLM base with no access to images.

New comment by dchichkov in "Prime numbers so memorable that people hunt for them"

dchichkov — Wed, 22 Jan 2025 00:04:17 +0000

Sorry, but this was ChatGPT/o1 with access to code execution (Python) and it used almost 4 minutes to do reasoning. It had done a few checks with smaller numbers, all of which had failed. And it proceeded to make a wrong conclusion (with high confidence).

New comment by dchichkov in "Prime numbers so memorable that people hunt for them"

dchichkov — Tue, 21 Jan 2025 19:01:00 +0000

ChatGPT o1: https://chatgpt.com/share/678feedb-0b2c-8001-bd77-4e574502e4...

> Thought about large prime check for 3m 52s: "Despite its interesting pattern of digits, 12,345,678,910,987,654,321 is definitely not prime. It is a large composite number with no small prime factors."

Feels like this Online Encyclopedia of Integer Sequences (OEIS) would be a good candidate for a hallucination benchmark...

New comment by dchichkov in "Ask HN: How do you prevent the impact of social media on your children?"

dchichkov — Wed, 15 Jan 2025 01:15:19 +0000

I understand that it is mostly regulated at the state level. I'm not sure about other states, but The Computer Science Standards for California Public Schools (Kindergarten through Grade Twelve) also tend to be followed by private schools. So they can claim their programs meet state requirements.

This brings computers into the classroom, and once they’re available, it is a slippery slope. It is easier for teachers to have students use semi-gamified "educational" apps rather than engage themselves.

Example for K-2 - https://www.cde.ca.gov/be/st/ss/documents/csstandards.pdf:

  K-2.CS.1 Select and operate computing devices that perform a variety of tasks accurately and quickly based on user needs and preferences.

  K-2.CS.2 Explain the functions of common hardware and software components of computing systems.

  K-2.CS.3 Describe basic hardware and software problems using accurate terminology.

  K-2.NI.4 Model and describe how people connect to other people, places, information and ideas through a network.

  ...

  K–2 K-2.AP.12 Create programs with sequences of commands and simple loops, to express ideas or address a problem

  K-2.IC.20 Describe approaches and rationales for keeping login information private, and for logging off of devices appropriately

New comment by dchichkov in "Ask HN: How do you prevent the impact of social media on your children?"

dchichkov — Tue, 14 Jan 2025 23:28:45 +0000

Another Gorilla is the schools, teachers and state-approved recommendations, that extend their reach even into private schools.

Imagine my frustration one day, when I've discovered that my kindergartner has full access to a brand-new, shiny iPad during class. Despite complaints from parents, the teacher refused to reduce iPad usage (or even activate Screen Distance and Screen Time controls on the iPad, or share usage statistics).

The only thing that I've learned, this is all in line with California’s state-approved computer literacy recommendations.

New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"

dchichkov — Wed, 18 Dec 2024 18:26:23 +0000

At the expense of other people's time.

New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"

dchichkov — Tue, 17 Dec 2024 22:44:51 +0000

Whoever invented this is evil and is destroying happiness.

New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"

dchichkov — Tue, 17 Dec 2024 22:43:24 +0000

Safeway, Walgreens.

New comment by dchichkov in "FTC Announces Rule Banning Junk Ticket and Hotel Fees"

dchichkov — Tue, 17 Dec 2024 22:13:27 +0000

I wish that "Online Coupon Price Tags" in stores would also be banned. I'm talking about these yellow price tags that show lower than "Club" prices, which are only valid if you collect a coupon online.

Like FTC, I estimate that banning these would save U.S. consumers millions of hours they currently spend searching and clicking on pointless coupons on their phones before making purchases. It would also increase happiness, as it's extremely annoying to pay $20 extra, knowing that a lower price is available if only you spent ten minutes struggling with a store's website on your phone.

Whoever invented this is evil and is destroying happiness.

New comment by dchichkov in "FTC bans hidden junk fees in hotel, event ticket prices"

dchichkov — Tue, 17 Dec 2024 22:11:03 +0000

Whoever invented this is evil and is destroying happiness.