<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: SomewhatLikely</title><link>https://news.ycombinator.com/user?id=SomewhatLikely</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 15 Apr 2026 21:03:13 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=SomewhatLikely" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by SomewhatLikely in "How people woke up before alarm clocks"]]></title><description><![CDATA[
<p><a href="http://astronaut.io/" rel="nofollow">http://astronaut.io/</a></p>
]]></description><pubDate>Fri, 13 Mar 2026 06:27:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47361305</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=47361305</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47361305</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Three Years from GPT-3 to Gemini 3"]]></title><description><![CDATA[
<p>I've seen it do this too.  I had it keeping a running tally over many turns, and occasionally it would say something like "... bringing the total to 304.. 306, no 303.  Haha, just kidding I know it's really 310," with the last number being the right one.  I'm curious whether it's an organic behavior or a taught one.  It could be self-learned through reinforcement learning: a way to correct itself, since it doesn't have access to a backspace key.</p>
]]></description><pubDate>Tue, 25 Nov 2025 06:23:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46042880</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=46042880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46042880</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "The surprise deprecation of GPT-4o for ChatGPT consumers"]]></title><description><![CDATA[
<p>The default outputs are considerably shorter, even in thinking mode. Something that helped me get thinking mode back to an acceptable state was to switch to the Nerd personality and, in the traits customization setting, tell it to be complete and add extra relevant details.  With those additions it compares favorably to o3 on my recent chat history and even improves on some cases.  I prefer to scan a longer output than have the LLM guess what to omit. But I know many people have complained about verbosity, so I can understand why they may have moved to less verbiage.</p>
]]></description><pubDate>Sat, 09 Aug 2025 05:48:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=44844323</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44844323</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44844323</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "GPT-5: Key characteristics, pricing and system card"]]></title><description><![CDATA[
<p>Nobody's preventing them from rendering it and refining. That's certainly what we'd expect an AGI to do.</p>
]]></description><pubDate>Fri, 08 Aug 2025 06:17:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=44834044</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44834044</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44834044</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "The Bluesky Dictionary"]]></title><description><![CDATA[
<p>It's likely that the commenter has read less than 5 million posts' worth of text, though. So perhaps this still points to a lack of diversity in content.</p>
]]></description><pubDate>Thu, 07 Aug 2025 05:18:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=44820866</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44820866</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44820866</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "At a Loss for Words: A flawed idea is teaching kids to be poor readers (2019)"]]></title><description><![CDATA[
<p>What was the alternative you went with?</p>
]]></description><pubDate>Mon, 04 Aug 2025 06:51:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=44782765</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44782765</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44782765</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "At a Loss for Words: A flawed idea is teaching kids to be poor readers (2019)"]]></title><description><![CDATA[
<p>I saw a very similar timely appeal here on Hacker News a few years ago and taught my son with this book at the age of 4.  It has become my go-to comparison when prompting chat bots for what I want in teaching material for other subjects.  I listened to the entire article posted here, and it makes me wonder: if schools are getting something as foundational as reading wrong, how can we trust the attention to research in anything else they're teaching?  Don't get me wrong, I'm not going to pull my kid out of school, but I'll dig a little deeper into how well he's learning.<p>For math, we've been doing the Beast Academy books. It has gone... okay.  I like that they approach problems from many different angles, which mirrors the many different ways math is hidden in our interactions with the world.  For my younger son I've recently started Teaching Your Child... because of how well it went for his brother, but for math I may try something else to get a new data point.<p>Something that occurred to me while listening to the article: I wonder if certain skills are learned much faster with one-on-one instruction, like the book has you do.  Our schools pretty much never teach that way, for efficiency's sake, though home schools often do.  It may not be true for most subjects, though, or home-schooled students would be far ahead by college, and that's not the impression I have.</p>
]]></description><pubDate>Sun, 03 Aug 2025 07:44:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=44774880</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44774880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44774880</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Spending Too Much Money on a Coding Agent"]]></title><description><![CDATA[
<p>It's pretty damn capital-intensive to be a productive farmer today.  That said, AI will likely, hopefully, get cheaper over time.</p>
]]></description><pubDate>Fri, 04 Jul 2025 01:11:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=44460370</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=44460370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44460370</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "How we made our AI code review bot stop leaving nitpicky comments"]]></title><description><![CDATA[
<p>You could probably modify the metric to be addressed comments per 1000 lines of code.</p>
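<p>Something like the following (the names here are mine, just to make the normalization concrete):<p><pre><code>    def addressed_per_kloc(addressed_comments, lines_changed):
        # Normalizing by diff size keeps large PRs from looking
        # noisier than small ones.
        return 1000 * addressed_comments / lines_changed
</code></pre>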
]]></description><pubDate>Sun, 22 Dec 2024 21:36:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=42489336</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=42489336</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42489336</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "ARIA: An Open Multimodal Native Mixture-of-Experts Model"]]></title><description><![CDATA[
<p><i>"Here, we provide a quantifiable definition: A multimodal native model refers to a single model with strong understanding capabilities across multiple input modalities (e.g. text, code, image, video), that matches or exceeds the modality specialized models of similar capacities."</i></p>
]]></description><pubDate>Fri, 11 Oct 2024 07:33:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=41807183</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41807183</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41807183</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Practices of Reliable Software Design"]]></title><description><![CDATA[
<p>My first thought upon seeing the prompt:<p><pre><code>    If you would build an in-memory cache, how would you do it?

    It should have good performance and be able to hold many entries. 
    Reads are more common than writes. I know how I would do it already, 
    but I’m curious about your approach.
</code></pre>
was to add this requirement, since it comes up so often:<p><pre><code>    Let's assume that keys accessed follow a power law, so some keys get 
    accessed very frequently and we would like them to have the fastest 
    retrieval of all.
</code></pre>
I'm not sure if there are any efficient tweaks to hash tables or B-trees that might help with this additional requirement.  Obviously we could make a hash table take way more space than needed to reduce collisions, but with a decent load factor, is the answer just to swap frequently accessed keys to the beginning of their probe chain?  And how do we know a key is frequently accessed? A Count-Min sketch?<p>Even with that tweak, the hottest keys will still be scattered around memory.  Wouldn't it be best if their entries could fit into fewer pages?  So, maybe a much smaller "hot" table containing, say, the 1,000 most accessed keys.  We still want a high load factor to maximize the use of cache pages, so perhaps perfect hashing?</p>
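<p>To make the "hot table" idea concrete, here's a rough Python sketch (the sizes, promotion threshold, and sketch dimensions are all made up, and Python dicts hide the page layout that motivates this, so read it as pseudocode for the structure rather than a real implementation):<p><pre><code>    class CountMinSketch:
        """Approximate hit counts in fixed memory; may overcount, never undercounts."""
        def __init__(self, width=2048, depth=4):
            self.width = width
            self.rows = [[0] * width for _ in range(depth)]

        def add(self, key):
            for i, row in enumerate(self.rows):
                row[hash((i, key)) % self.width] += 1

        def estimate(self, key):
            return min(row[hash((i, key)) % self.width]
                       for i, row in enumerate(self.rows))

    class HotKeyCache:
        """Big main table plus a small "hot" table that stays cache-resident."""
        def __init__(self, hot_size=1000, promote_at=32):
            self.main, self.hot = {}, {}
            self.hot_size, self.promote_at = hot_size, promote_at
            self.sketch = CountMinSketch()

        def get(self, key):
            if key in self.hot:            # fast path for the heavy hitters
                return self.hot[key]
            value = self.main.get(key)
            if value is not None:
                self.sketch.add(key)
                if (len(self.hot) < self.hot_size
                        and self.sketch.estimate(key) >= self.promote_at):
                    self.hot[key] = value  # promote a frequently read key
            return value

        def put(self, key, value):
            if key in self.hot:
                self.hot[key] = value
            self.main[key] = value
</code></pre>
A real version would also need eviction from both tables and some way to decay the sketch, so yesterday's hot keys don't stay promoted forever.</p>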
]]></description><pubDate>Wed, 09 Oct 2024 06:08:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=41784984</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41784984</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41784984</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Bitten by Unicode"]]></title><description><![CDATA[
<p>Where I thought this might be going from the first paragraph:<p>Negative numbers are sometimes represented with parentheses: (234.58)<p>Tables sometimes tell you in the description that all numbers are in 1000s or millions.<p>The dollar sign is used by many currencies, including in Australia and Canada.<p>I'd probably look around for some other gotchas.  Here's one page on prices in general: <a href="https://gist.github.com/rgs/6509585" rel="nofollow">https://gist.github.com/rgs/6509585</a> but interestingly it doesn't quite cover the OP's problem or the ones I brought up, though the use cases are slightly different.</p>
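<p>A minimal sketch of handling those two gotchas (the function name and the scale convention are mine; note it still naively assumes "," is a thousands separator, which breaks in locales where it's the decimal mark, essentially the OP's original problem):<p><pre><code>    import re

    def parse_amount(raw, scale=1):
        s = raw.strip()
        negative = s.startswith("(") and s.endswith(")")
        if negative:
            s = s[1:-1]                  # (234.58) means -234.58
        s = re.sub(r"[^\d.,-]", "", s)   # drop $, A$, CA$, spaces, etc.
        s = s.replace(",", "")           # naive thousands-separator handling
        value = float(s) * scale         # scale: "all figures in thousands"
        return -value if negative else value

    assert parse_amount("(234.58)") == -234.58
    assert parse_amount("$1,200", scale=1000) == 1200000
</code></pre>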
]]></description><pubDate>Mon, 09 Sep 2024 05:47:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=41485772</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41485772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41485772</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "How does cosine similarity work?"]]></title><description><![CDATA[
<p>Something worth mentioning: if your vectors all have the same length, then cosine similarity and Euclidean distance will rank most (all?) neighbors in the same order. Think of your query vector as a point on a unit sphere.  The Euclidean distance to a neighbor is a chord from the query point to the neighbor.  Just as with the angle between the query-to-origin and neighbor-to-origin vectors, the farther you move the neighbor from the query point along the surface of the sphere, the longer that chord gets.<p>EDIT: Here's a better treatment, and it confirms that they give exactly the same orderings: <a href="https://ajayp.app/posts/2020/05/relationship-between-cosine-similarity-and-euclidean-distance/" rel="nofollow">https://ajayp.app/posts/2020/05/relationship-between-cosine-...</a></p>
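<p>A quick numerical check of the claim, with made-up random data (for unit vectors, ||q - x||^2 = 2 - 2*cos(q, x), so ascending distance and descending similarity are exactly the same ordering):<p><pre><code>    import numpy as np

    rng = np.random.default_rng(0)
    q = rng.normal(size=64)
    q /= np.linalg.norm(q)                         # unit-length query
    X = rng.normal(size=(1000, 64))
    X /= np.linalg.norm(X, axis=1, keepdims=True)  # unit-length neighbors

    cos = X @ q                                    # cosine similarity
    dist = np.linalg.norm(X - q, axis=1)           # Euclidean distance

    assert np.allclose(dist**2, 2 - 2 * cos)
    assert np.array_equal(np.argsort(dist), np.argsort(-cos))
</code></pre>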
]]></description><pubDate>Sat, 07 Sep 2024 06:06:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=41471809</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41471809</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41471809</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Manipulating large language models to increase product visibility"]]></title><description><![CDATA[
<p>This feels similar to those early adversarial examples that were tuned for a specific image recognizer.  I haven't followed the research, but I know they had some very limited success in getting them to work in the real world.  I'm not sure if they ever worked across different models, though.<p>The paper claims there is literature with more success for LLMs:<p><pre><code>   Large language models have been shown to be vulnerable to adversarial
   attacks, in which attackers introduce maliciously crafted token sequences
   into the input prompt to circumvent the model’s safety mechanisms and 
   generate a harmful response [1, 14].</code></pre></p>
]]></description><pubDate>Sat, 07 Sep 2024 05:20:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=41471624</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41471624</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41471624</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Why Don't Tech Companies Pay Their Engineers to Stay?"]]></title><description><![CDATA[
<p><a href="https://jonpauluritis.com/articles/why-arent-developers-paid-more/" rel="nofollow">https://jonpauluritis.com/articles/why-arent-developers-paid...</a></p>
]]></description><pubDate>Fri, 06 Sep 2024 06:08:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=41463334</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41463334</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41463334</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Show HN: I'm making an AI scraper called FetchFox"]]></title><description><![CDATA[
<p>Can I recommend providing some cost estimates next to the examples for using your own key?  I tried a few custom extractions, then checked my usage dashboard, and it was already over $2.</p>
]]></description><pubDate>Wed, 04 Sep 2024 07:39:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=41442907</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41442907</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41442907</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Architectural Effects on Maximum Dependency Lengths of Recurrent Neural Networks"]]></title><description><![CDATA[
<p><a href="https://arxiv.org/abs/2006.16236" rel="nofollow">https://arxiv.org/abs/2006.16236</a> Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention</p>
]]></description><pubDate>Sat, 31 Aug 2024 06:31:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=41406936</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41406936</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41406936</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Diffusion models are real-time game engines"]]></title><description><![CDATA[
<p>Video games are gonna be wild in the near future.  You could have one person talking to a model producing something on par with a AAA title from today.  Imagine the 2D side-scroller boom on Steam, but with immersive photorealistic 3D games with hyper-realistic physics (water flow, fire that spreads, tornadoes) and full deformability and buildability, because the model is pretrained on real-world videos.  Your game is just a "style" that tweaks some priors on look, settings, and story.</p>
]]></description><pubDate>Wed, 28 Aug 2024 07:21:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=41376879</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41376879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41376879</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Shell.how: Explain Shell Commands"]]></title><description><![CDATA[
<p>I've switched to LLMs for explaining commands.  Here's ChatGPT-4o:<p>The command tar xvzf 1.zip appears to be trying to extract a compressed archive file, but there's a mismatch between the file extension (1.zip) and the command options used for tar. Let me break it down:<p>tar: This is the command-line utility for working with tar archives (tape archives). It can be used to create, extract, and manipulate tar files.<p>x: This option tells tar to extract files from an archive.<p>v: This stands for "verbose", which means that tar will list the files being extracted to the terminal.<p>z: This option tells tar to handle a compressed archive using gzip. It is typically used with .tar.gz or .tgz files.<p>f: This tells tar that the next argument is the name of the archive file.<p>However, the file 1.zip is a ZIP file, not a tar.gz file. ZIP files are handled by the unzip command, not tar. Using tar xvzf on a .zip file would result in an error, as tar expects a .tar.gz file when using the z option.<p>To properly extract a .zip file, you should use:<p>unzip 1.zip<p>This command will extract the contents of the ZIP file.</p>
]]></description><pubDate>Sun, 25 Aug 2024 03:37:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=41344159</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41344159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41344159</guid></item><item><title><![CDATA[New comment by SomewhatLikely in "Maps Mania: How the World Powers Itself"]]></title><description><![CDATA[
<p>I suspect there may be a flaw in the visualization where smaller dots are drawn with disproportionately more pixels than bigger dots.  The map shows far more green than any numbers I can find would support.  For example, Spain visually looks like more than 90% solar/wind, but in reality it appears to be only about 60% [ <a href="https://www.reuters.com/business/energy/renewables-produce-almost-60-spains-electricity-2024-07-02/" rel="nofollow">https://www.reuters.com/business/energy/renewables-produce-a...</a> ]</p>
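<p>For what it's worth, the standard fix is to size dots so that area, not radius, tracks the value; a legibility floor on the radius is exactly the kind of thing that over-inks the small dots (the numbers below are illustrative, not taken from the map):<p><pre><code>    import math

    def dot_radius(value, max_value, max_radius=12.0, min_radius=2.0):
        # Area-true sizing: radius ~ sqrt(value), so ink is
        # proportional to the quantity being encoded.
        return max(max_radius * math.sqrt(value / max_value), min_radius)

    # A plant 1/100th the size of the largest still gets
    # (2/12)^2 = 2.8% of the ink instead of 1%: nearly triple.
</code></pre>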
]]></description><pubDate>Thu, 22 Aug 2024 07:12:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=41317598</link><dc:creator>SomewhatLikely</dc:creator><comments>https://news.ycombinator.com/item?id=41317598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41317598</guid></item></channel></rss>