<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: attentionmech</title><link>https://news.ycombinator.com/user?id=attentionmech</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 06 Apr 2026 11:07:32 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=attentionmech" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by attentionmech in "The OBS Project is threatening Fedora Linux with legal action"]]></title><description><![CDATA[
<p>Why don't they just block the OBS project and let users install it in an unofficial manner, removing themselves as the middleman? I mean, they have certain guidelines, let's say, but why go about enforcing them in this weird manner?</p>
]]></description><pubDate>Fri, 14 Feb 2025 02:02:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=43043894</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=43043894</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43043894</guid></item><item><title><![CDATA[New comment by attentionmech in "Andrej Karpathy: Deep Dive into LLMs Like ChatGPT [video]"]]></title><description><![CDATA[
<p>saw that video just now, thanks for this.</p>
]]></description><pubDate>Sat, 08 Feb 2025 15:24:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=42983527</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42983527</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42983527</guid></item><item><title><![CDATA[New comment by attentionmech in "Andrej Karpathy: Deep Dive into LLMs Like ChatGPT [video]"]]></title><description><![CDATA[
<p>he has earned it haha.</p>
]]></description><pubDate>Thu, 06 Feb 2025 19:15:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=42965511</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42965511</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42965511</guid></item><item><title><![CDATA[New comment by attentionmech in "Andrej Karpathy: Deep Dive into LLMs Like ChatGPT [video]"]]></title><description><![CDATA[
<p>Will check out Jeremy's lectures. I actually use his fastbook notebooks a lot for self-study.<p>Karpathy's style, for me, is at the right level of abstraction to spark my curiosity about the subject. After watching his lectures, I generally go on to more material, and never really stop there.</p>
]]></description><pubDate>Thu, 06 Feb 2025 10:21:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=42960941</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42960941</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42960941</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>Agreed. That was an error on my part to phrase it like that. More evidence suggests they were working on similar stuff, but now the cat is out of the bag and open source got a win.</p>
]]></description><pubDate>Thu, 30 Jan 2025 10:59:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=42876690</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42876690</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42876690</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>people already did: <a href="https://x.com/karpathy/status/1884678601704169965" rel="nofollow">https://x.com/karpathy/status/1884678601704169965</a></p>
]]></description><pubDate>Thu, 30 Jan 2025 10:57:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=42876679</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42876679</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42876679</guid></item><item><title><![CDATA[New comment by attentionmech in "SmolGPT: A minimal PyTorch implementation for training a small LLM from scratch"]]></title><description><![CDATA[
<p>This is cool, and timely (I wanted a neat repo like this).<p>I have also been working for the last two weeks on a GPT implementation in C. It eventually turned out to be really slow (without CUDA), but it taught me how much memory management and data management there is when implementing these systems. You are running a loop billions of times, so you need to preallocate the computational graph and related buffers. If anyone wants to check it out, it's ~1500 LOC in a single file:<p><a href="https://github.com/attentionmech/gpt.c/blob/main/gpt.c">https://github.com/attentionmech/gpt.c/blob/main/gpt.c</a></p>
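<p>As a minimal sketch of that preallocation idea (illustrative only, not code from gpt.c; the names and sizes are made up): allocate one arena up front and hand out slices of it, so the inner training loop never calls malloc.</p>
<pre><code>
/* Minimal arena allocator sketch: one big block allocated once,
 * then bump-pointer slices for activations/gradients each step.
 * Illustrative only; names and sizes are hypothetical. */
#include &lt;stdio.h&gt;
#include &lt;stdlib.h&gt;

typedef struct {
    float *data;  /* one contiguous block for all buffers */
    size_t size;  /* total floats available */
    size_t used;  /* bump-pointer offset */
} Arena;

static Arena arena_create(size_t n_floats) {
    Arena a = { malloc(n_floats * sizeof(float)), n_floats, 0 };
    if (!a.data) { fprintf(stderr, "out of memory\n"); exit(1); }
    return a;
}

/* Hand out a slice; no per-iteration allocation, no fragmentation. */
static float *arena_alloc(Arena *a, size_t n) {
    if (a-&gt;used + n &gt; a-&gt;size) { fprintf(stderr, "arena full\n"); exit(1); }
    float *p = a-&gt;data + a-&gt;used;
    a-&gt;used += n;
    return p;
}

int main(void) {
    Arena a = arena_create(1 &lt;&lt; 20);       /* ~4 MB of floats, once */
    float *acts  = arena_alloc(&amp;a, 1024);  /* activation buffer */
    float *grads = arena_alloc(&amp;a, 1024);  /* matching gradient buffer */
    acts[0] = 1.0f; grads[0] = 0.0f;       /* reused on every step */
    printf("used %zu of %zu floats\n", a.used, a.size);
    free(a.data);
    return 0;
}
</code></pre>
<p>The point is that the training loop only ever writes into buffers carved out before step zero, which is what keeps billions of iterations cheap.</p>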
]]></description><pubDate>Thu, 30 Jan 2025 10:30:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=42876555</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42876555</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42876555</guid></item><item><title><![CDATA[New comment by attentionmech in "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL"]]></title><description><![CDATA[
<p>I love this paradigm of reasoning by one model and actual work by another. This opens up avenues of specialization, and eventually smaller players working on more niche things.</p>
]]></description><pubDate>Mon, 27 Jan 2025 08:13:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=42838646</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42838646</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42838646</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>I found the following thread more insightful than my original comment (wish I could edit that one). A researcher explains why RL didn't work before this: <a href="https://x.com/its_dibya/status/1883595705736163727" rel="nofollow">https://x.com/its_dibya/status/1883595705736163727</a></p>
]]></description><pubDate>Mon, 27 Jan 2025 04:14:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=42837349</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42837349</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42837349</guid></item><item><title><![CDATA[New comment by attentionmech in "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL"]]></title><description><![CDATA[
<p>People are doing all sorts of experiments and reproducing the "emergence" (sorry, it's not the right word) of backtracking; it's all so fun to watch.</p>
]]></description><pubDate>Sun, 26 Jan 2025 18:50:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=42832734</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42832734</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42832734</guid></item><item><title><![CDATA[New comment by attentionmech in "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL"]]></title><description><![CDATA[
<p>Yeah, it might be that scaling is harder, or maybe they have more tricks up their sleeves when it comes to serving the model.</p>
]]></description><pubDate>Sun, 26 Jan 2025 18:26:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=42832448</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42832448</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42832448</guid></item><item><title><![CDATA[New comment by attentionmech in "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL"]]></title><description><![CDATA[
<p>Plus, the speed at which it replies is amazing too. Claude/ChatGPT now seem like inefficient inference engines compared to it.</p>
]]></description><pubDate>Sun, 26 Jan 2025 12:17:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=42829621</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42829621</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42829621</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging Reasoning with Reinforcement Learning"]]></title><description><![CDATA[
<p>Do you think this behavior, i.e. 'finding smaller chunks easier to solve', comes from the dataset these are trained on, or is it more related to architectural components?</p>
]]></description><pubDate>Sun, 26 Jan 2025 12:10:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=42829594</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42829594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42829594</guid></item><item><title><![CDATA[New comment by attentionmech in "DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL"]]></title><description><![CDATA[
<p>Most people I've talked with don't grasp how big an event this is. I consider it almost comparable to what early versions of Linux did to the OS ecosystem.</p>
]]></description><pubDate>Sun, 26 Jan 2025 12:08:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=42829580</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42829580</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42829580</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>If you check the failure section of their paper, they also tried other methods like MCTS and PRM, which are what other labs have been obsessing over but couldn't move on from (that includes the big shots). The only team I am aware of that tried verifiable rewards is Tulu, but they didn't scale it up and just left it there.<p>This sort of thing, imo, is similar to what OpenAI did with the transformer architecture: Google invented it but couldn't scale it in the right direction, and DeepMind got busy with Atari games. They had all the pieces, yet only OpenAI could pull it off. It seems to come down to research leadership choosing which methods to invest in. But yeah, with the budgets big labs have, they could easily try ten different techniques and brute-force it all, yet it seems they are too opinionated about methods and not urgent enough about outcomes.<p>[paper] <a href="https://arxiv.org/pdf/2501.12948" rel="nofollow">https://arxiv.org/pdf/2501.12948</a>
[tulu] <a href="https://x.com/hamishivi/status/1881394117810500004" rel="nofollow">https://x.com/hamishivi/status/1881394117810500004</a></p>
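<p>For anyone unfamiliar with the term, a rough sketch of what a verifiable reward means (illustrative only; this is not the DeepSeek or Tulu code, and the names are hypothetical): the reward is computed by programmatically checking the model's final answer against a known ground truth, instead of coming from a learned reward model that can be gamed.</p>
<pre><code>
/* Sketch of a "verifiable reward": score an answer by checking it
 * against ground truth. All names here are hypothetical. */
#include &lt;ctype.h&gt;
#include &lt;stdio.h&gt;
#include &lt;string.h&gt;

/* Compare two answers, ignoring case and surrounding whitespace. */
static int answers_match(const char *got, const char *want) {
    while (isspace((unsigned char)*got)) got++;
    while (isspace((unsigned char)*want)) want++;
    size_t glen = strlen(got), wlen = strlen(want);
    while (glen &gt; 0 &amp;&amp; isspace((unsigned char)got[glen - 1])) glen--;
    while (wlen &gt; 0 &amp;&amp; isspace((unsigned char)want[wlen - 1])) wlen--;
    if (glen != wlen) return 0;
    for (size_t i = 0; i &lt; glen; i++)
        if (tolower((unsigned char)got[i]) != tolower((unsigned char)want[i]))
            return 0;
    return 1;
}

/* Binary reward: 1.0 if the extracted answer is exactly right, else 0.0.
 * No reward model to hack; the signal is checkable by construction. */
double verifiable_reward(const char *model_answer, const char *gold_answer) {
    return answers_match(model_answer, gold_answer) ? 1.0 : 0.0;
}

int main(void) {
    printf("%.1f\n", verifiable_reward("  42 ", "42")); /* 1.0 */
    printf("%.1f\n", verifiable_reward("41", "42"));    /* 0.0 */
    return 0;
}
</code></pre>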
]]></description><pubDate>Sun, 26 Jan 2025 12:05:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=42829571</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42829571</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42829571</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>That's a nice explanation. Are there any insights so far in the field about why chain of thought improves a model's capability? Does it provide the model with more working memory or something in the context itself?</p>
]]></description><pubDate>Sun, 26 Jan 2025 08:18:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=42828655</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42828655</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42828655</guid></item><item><title><![CDATA[New comment by attentionmech in "Emerging reasoning with reinforcement learning"]]></title><description><![CDATA[
<p>The Tulu team saw it, but yes, nobody scaled it to the extent DeepSeek did. I am surprised that the FAANG labs, which have the best of the best, didn't see this.</p>
]]></description><pubDate>Sun, 26 Jan 2025 08:13:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42828631</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42828631</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42828631</guid></item><item><title><![CDATA[transformer-scope: script for visualizing activations]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/attentionmech/transformer-scope">https://github.com/attentionmech/transformer-scope</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42662758">https://news.ycombinator.com/item?id=42662758</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 11 Jan 2025 02:31:17 +0000</pubDate><link>https://github.com/attentionmech/transformer-scope</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42662758</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42662758</guid></item><item><title><![CDATA[New comment by attentionmech in "Stimulation Clicker"]]></title><description><![CDATA[
<p>idk what I am doing, but I am hooked on it. It's as if it's directly interacting with my brain's dopamine.</p>
]]></description><pubDate>Mon, 06 Jan 2025 19:09:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=42614180</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42614180</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42614180</guid></item><item><title><![CDATA[New comment by attentionmech in "I am rich and have no idea what to do"]]></title><description><![CDATA[
<p>It's commutative. Happiness also doesn't buy money.</p>
]]></description><pubDate>Fri, 03 Jan 2025 00:40:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=42580779</link><dc:creator>attentionmech</dc:creator><comments>https://news.ycombinator.com/item?id=42580779</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42580779</guid></item></channel></rss>