Hacker News: tsurba

New comment by tsurba in "Show HN: Files.md – Open-source alternative to Obsidian"

tsurba — Mon, 18 May 2026 21:29:34 +0000

Joplin is open source, syncing setup between devices is one login to Dropbox, works for free, with native apps on Windows/OSX/Linux/iOS/Android. It has a bunch of plugins too. If you just need markdown files with syncing, use it rather than paying for Obsidian sync.

The 2GB free quota on Dropbox is plenty for text (and some screenshots). Or you could self-host, obviously. Git, while lovely for source code, is a hassle for notes.

New comment by tsurba in "The universal weight subspace hypothesis"

tsurba — Tue, 09 Dec 2025 05:51:14 +0000

True, good point, maybe not a straightforward consequence to extend to weights.

New comment by tsurba in "The universal weight subspace hypothesis"

tsurba — Tue, 09 Dec 2025 05:36:00 +0000

Many discriminative models converge to the same representation space up to a linear transformation. It makes sense that a linear transformation (like PCA) would be able to undo that transformation.
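To make the "up to a linear transformation" claim concrete, here is a minimal sketch with synthetic data, not taken from the linked paper. The arrays `Z_a` and `Z_b` are hypothetical stand-ins for two models' representations of the same inputs, and the map `A` is made up for illustration. It uses plain least squares rather than PCA to recover the map.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 500, 8

# Hypothetical representations from "model A": n samples, d dimensions.
Z_a = rng.normal(size=(n, d))

# Assumed relationship: "model B" sees the same space through an unknown,
# well-conditioned invertible linear map A (orthogonal matrix times per-axis scales).
Q, _ = np.linalg.qr(rng.normal(size=(d, d)))
A = Q * rng.uniform(0.5, 2.0, size=d)
Z_b = Z_a @ A

# Least-squares estimate of the linear map that sends Z_b back onto Z_a.
W, *_ = np.linalg.lstsq(Z_b, Z_a, rcond=None)

max_err = np.abs(Z_b @ W - Z_a).max()
print(f"max abs alignment error: {max_err:.2e}")
```

With real models the fit would not be exact, so the size of the residual is one rough way to judge how close to linear the relationship is.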

https://arxiv.org/abs/2007.00810

Without having properly read the linked article: if that’s all this is, it’s not a particularly new result. Nevertheless, this direction of proofs is imo at the core of understanding neural nets.

New comment by tsurba in "The universal weight subspace hypothesis"

tsurba — Tue, 09 Dec 2025 05:25:37 +0000

Edit: actually this paper is the canonical reference (?): https://arxiv.org/abs/2007.00810. Models converge to the same space up to a linear transformation. It makes sense that a linear transformation (like PCA) would be able to undo that transformation.

You can show, for example, that Siamese encoders for time series, with MSE loss on similarity and without a decoder, will converge to the same latent space up to orthogonal transformations (as MSE is kinda like a Gaussian prior, which doesn’t distinguish between different rotations).
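A small numerical sketch of the rotation ambiguity, with synthetic data and under one assumption: that the loss depends on the embeddings only through pairwise distances. The names `Z_a` and `Z_b` are hypothetical latents, not from any of the cited papers. A rotated copy of the latents gives identical distances, and orthogonal Procrustes (via SVD) recovers the rotation.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 500, 8

# Hypothetical latents from one training run.
Z_a = rng.normal(size=(n, d))

# A second "run": the same latents under a random orthogonal transformation.
Q, _ = np.linalg.qr(rng.normal(size=(d, d)))
Z_b = Z_a @ Q


def pairwise_sq_dists(Z):
    """Squared Euclidean distance between every pair of rows of Z."""
    sq = (Z ** 2).sum(axis=1)
    return sq[:, None] + sq[None, :] - 2.0 * Z @ Z.T


# A loss that only looks at pairwise distances cannot tell the runs apart.
same_distances = np.allclose(pairwise_sq_dists(Z_a), pairwise_sq_dists(Z_b))

# Orthogonal Procrustes: the orthogonal R minimizing ||Z_b R - Z_a||_F
# is U @ Vt, where U, S, Vt is the SVD of Z_b^T Z_a.
U, _, Vt = np.linalg.svd(Z_b.T @ Z_a)
R = U @ Vt

aligned = np.allclose(Z_b @ R, Z_a)
print(same_distances, aligned)
```

Restricting `R` to be orthogonal is the design choice here: it matches the claimed ambiguity exactly, whereas an unconstrained least-squares fit could also absorb scaling and shearing.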

Similarly I would expect that transformers trained on the same loss function for predicting the next word, if the data is at all similar (like human language), would converge to approx the same space, up to some, likely linear, transformations. And to represent that same space probably weights are similar, too. Weights in general seem to occupy low-dimensional spaces.

All in all, I don’t think this is that surprising, and I think the theoretical angle should be (have been?) to find mathematical proofs like this paper https://openreview.net/forum?id=ONfWFluZBI

They also have a previous paper (”CEBRA”) published in Nature with similar results.

New comment by tsurba in "The universal weight subspace hypothesis"

tsurba — Tue, 09 Dec 2025 05:25:12 +0000

Similarly I would expect that transformers trained on the same loss function for predicting the next word, if the data is at all similar (like human language), would converge to approx the same space. And to represent that same space probably weights are similar, too. Weights in general seem to occupy low-dimensional spaces.

All in all, I don’t think this is that surprising, and I think the theoretical angle should be (have been?) to find mathematical proofs like this paper https://openreview.net/forum?id=ONfWFluZBI

New comment by tsurba in "François Chollet: The Arc Prize and How We Get to AGI [video]"

tsurba — Mon, 07 Jul 2025 14:30:19 +0000

But are we close to doing that in real-time on any reasonably large model? I don’t think so.

New comment by tsurba in "The Rise of Whatever"

tsurba — Fri, 04 Jul 2025 10:11:02 +0000

I agree with everything up until the AI part, and for that part too, the general idea is good and worth worrying about. I’m scared af about what happens to kids who do all their homework with LLMs. Thankfully at least we still have free and open models, and are not just centralizing everything.

But chatgpt does help me work through some really difficult mathematical equations in newest research papers by adding intermediate steps. I can easily confirm when it gets them right and when not, as I do have some idea. It’s super useful.

If you are not able to make LLMs work for you at all, and complain about them on the internet, you are an old man yelling at clouds. The blog post devolves from an insightful viewpoint into a long sad ramble.

It’s 100% fine if you don’t want to use them yourself, but complaining to others gets tiresome quickly.

New comment by tsurba in "Meta announces Oakley smart glasses"

tsurba — Fri, 20 Jun 2025 21:31:37 +0000

Thankfully in the EU you are not even allowed to sell sunglasses without proper UV protection, and can just pick up sunglasses from any market and trust they are fine, if a little flimsy.

EDIT: ok, apparently that’s true anywhere other than the poorest countries, too, really.

New comment by tsurba in "Generative AI coding tools and agents do not work for me"

tsurba — Tue, 17 Jun 2025 07:57:45 +0000

And how long have you been doing this? Because that sounds naive.

After doing programming for a decade or two, the actual act of programming is not enough to be ”creative problem solving”, it’s the domain and set of problems you get to apply it to that need to be interesting.

More than 90% of programming tasks at a company are usually reimplementing things and algorithms that have been done a thousand times before by others, and that you’ve done something similar to a dozen times. Nothing interesting there. That is exactly what should and can now be automated (to some extent).

In fact, solving problems creatively to keep yourself interested when the problem itself is boring is how you get code that sucks to maintain for the next guy. You should usually be doing the clearest and most boring implementation possible. Which is not what ”I love coding” people usually do (I’m definitely guilty).

To be honest this is why I went back to get a PhD, ”just coding” stuff got boring after a few years of doing it for a living. Now it feels like I’m just doing hobby projects again, because I work exactly on what I think could be interesting for others.

New comment by tsurba in "Generative AI coding tools and agents do not work for me"

tsurba — Tue, 17 Jun 2025 07:40:27 +0000

Gambling is where I end up if I’m tired and try to get an LLM to build my hobby project for me from scratch in one go, not really bothering to read the code properly. It’s stupid and a waste of time. Sometimes it’s easier to get started this way though.

But more seriously, in the ideal case, refining a prompt after an LLM misunderstands it due to ambiguity in your task description is actually doing the meaningful part of the work in software development. It is exactly about defining the edge cases and converting into language what it is that you need for a task. Iterating on that is not gambling.

But of course if you are not doing that, but just trying to get a ”smarter” LLM with (hopefully deprecated study of) ”prompt engineering” tricks, then that is about building yourself a skill that can become useless tomorrow.

New comment by tsurba in ""The Illusion of Thinking" – Thoughts on This Important Paper"

tsurba — Sat, 14 Jun 2025 08:07:57 +0000

Is it a puzzle if there is no algorithm?

But testing via coding algos to known puzzles is problematic as the code may be in the training set. Hence you need new puzzles, which is kinda what ARC was meant to do, right? Too bad OpenAI lost credibility for that set by having access to it, but ”verbally promising” (lol) not to train on it, etc.

New comment by tsurba in "Getting Past Procrastination"

tsurba — Sun, 08 Jun 2025 08:30:26 +0000

I would argue the other way around. I have ADHD, but the thing that really helped me with work procrastination, which I think would help even without ADHD, was to find a job that is actually interesting.

In approx 7 years I went through working at all the top software companies in my country, but what really fixed my problems was moving on to being a researcher at the university. I’m now paid less than half of what I was before, but it’s still enough, and I couldn’t be happier.

Getting to work on what I think is actually important and interesting every day is what helped. I also seem happier than the younger researchers who didn’t work at companies first, who don’t know how good they have it.

New comment by tsurba in "Running GPT-2 in WebGL: Rediscovering the Lost Art of GPU Shader Programming"

tsurba — Tue, 27 May 2025 21:03:47 +0000

1.5-2 years ago I did some training for an ML paper on 4 AMD MI250x (each is essentially 2 GPUs, so 8 in total really, each with 64GB VRAM) on LUMI.

My JAX models and the baseline PyTorch models were quite easy to set up there, and in practice there was no noticeable perf difference compared to 8x A100s (which I used for prototyping on our university cluster).

Of course it’s just a random anecdote, but I don’t think Nvidia is actually that much ahead.

New comment by tsurba in "Claude 4"

tsurba — Fri, 23 May 2025 10:22:29 +0000

Yet somehow chatting with Gemini in the web interface, it forgets everything after 3 messages, while GPT (almost) always feels natural in long back-and-forths. It’s been like this for at least a year.

New comment by tsurba in "ChatGPT Is a Gimmick"

tsurba — Thu, 22 May 2025 07:58:04 +0000

I do machine learning research and it is very useful for working out equations and checking for ”does this concept already have an established name” etc.

It is also excellent for writing one-off code experiments and plots, saving some time from having to write them from scratch.

I’m sorry but you are just using it wrong.

New comment by tsurba in "Ditching Obsidian and building my own"

tsurba — Mon, 19 May 2025 14:45:31 +0000

OSS alternatives with free syncing to your chosen cloud already exist, and have plugin systems. Why not just contribute to those? Because this is either advertising or procrastination.

New comment by tsurba in "Ditching Obsidian and building my own"

tsurba — Mon, 19 May 2025 14:43:18 +0000

Or use Joplin, which is open source and also creates markdown files, and setting up sync to a cloud provider that you probably already have is free.

New comment by tsurba in "Ditching Obsidian and building my own"

tsurba — Mon, 19 May 2025 14:40:28 +0000

Just use Joplin, it’s open source, and syncing to many cloud providers you probably already have is free.

New comment by tsurba in "Push Ifs Up and Fors Down"

tsurba — Sun, 18 May 2025 08:40:54 +0000

And going to a grocery store instead of 37 individual farmers…?

New comment by tsurba in "MIT asks arXiv to withdraw preprint of paper on AI and scientific discovery"

tsurba — Fri, 16 May 2025 16:55:32 +0000

Nice that someone already realized back then that it sounds sus: https://news.ycombinator.com/item?id=42128532