Hacker News: readyplayeremma

New comment by readyplayeremma in "MapLibre Tile: a modern and efficient vector tile format"

readyplayeremma — Mon, 26 Jan 2026 15:19:02 +0000

MapLibre's globe mode is both fantastic and performant. Also, it's literally just the one option to change it, and your tile formats/CRS don't need to change either.

It's the easiest way to escape from web mercator projections with no real downsides that I have discovered yet. Also, there is a built-in control if you want to offer a button to toggle between web mercator view, and globe view, since it's all just rendering changes.

New comment by readyplayeremma in "Qwen3-Omni-Flash-2025-12-01：a next-generation native multimodal large model"

readyplayeremma — Wed, 10 Dec 2025 20:05:20 +0000

The models are right here, one of the first links in the post: https://huggingface.co/collections/Qwen/qwen3-omni

edit: Nevermind, in spite of them linking it at the top, they are the old models. Also, the HF demo is calling their API and not using HF for compute.

New comment by readyplayeremma in "A new proposal for how mind emerges from matter"

readyplayeremma — Thu, 27 Feb 2025 13:23:08 +0000

The article’s real contribution is in highlighting evidence of complex behavior in living systems that often get excluded from definitions of "intelligence". In doing so, it invites deeper philosophical reflection, even if it doesn’t mount that reflection itself.

New comment by readyplayeremma in "Privacy Pass Authentication for Kagi Search"

readyplayeremma — Fri, 14 Feb 2025 06:45:45 +0000

The server does not generate the tokens, the client generates the tokens. The server is supposed to be able to verify that they were generated by a client who was granted the authority to generate them, but not which client did so. At least, not without side-channel information.

> The main building block of our construction is a verifiable oblivious pseudorandom function (VOPRF)

I am not sure how well tested that primitive is, but it definitely appears to be more than the server handing clients tokens and then pretending not to know who it gave them to.

The referenced paper: https://petsymposium.org/popets/2018/popets-2018-0026.pdf

New comment by readyplayeremma in "Windows Kills SMB Speeds When Using Tailscale"

readyplayeremma — Thu, 14 Nov 2024 06:45:21 +0000

You don’t have any specific routes to random internet addresses though. And Tailscale would not either. Unless your Windows server is running BGP, all your Internet traffic is hitting the default route.

New comment by readyplayeremma in "Ichigo: Local real-time voice AI"

readyplayeremma — Fri, 18 Oct 2024 11:21:57 +0000

You can just run whisper on the conversations as a background job populating the text versions of all the user inputs, so it doesn't interfere with the real-time latency.

New comment by readyplayeremma in "OpenAI Bought Chatgpt.com"

readyplayeremma — Sat, 04 May 2024 18:22:09 +0000

Maybe it was before, but now the entire world knows it well. To change it would be taking an incredibly recognizable name—like Twitter for instance—and then just throwing it away. ChatGPT is forever part of history, and almost everyone recognizes it immediately at this point.

New comment by readyplayeremma in "'Lavender': The AI machine directing Israel's bombing in Gaza"

readyplayeremma — Wed, 03 Apr 2024 18:38:00 +0000

> B., a senior officer who used Lavender, echoed to +972 and Local Call that in the current war, officers were not required to independently review the AI system’s assessments, in order to save time and enable the mass production of human targets without hindrances.

> “Everything was statistical, everything was neat — it was very dry,” B. said. He noted that this lack of supervision was permitted despite internal checks showing that Lavender’s calculations were considered accurate only 90 percent of the time; in other words, it was known in advance that 10 percent of the human targets slated for assassination were not members of the Hamas military wing at all.```

So, there was no human sign-off. I guess the policy itself was ordered by someone, but all the ongoing targets that were selected for assassination were solely authorized by the AI system's predictions.

This sentence is horrifically dystopian... "in order to save time and enable the mass production of human targets without hindrances"

New comment by readyplayeremma in "Gemini AI"

readyplayeremma — Thu, 07 Dec 2023 05:11:29 +0000

I would like more details on Gemini's 'native' multimodal approach before assuming it is something truly unique. Even if GPT-4V were aligning a pretrained image model and pretrained language model with a projection layer like PaLM-E/LLaVA/MiniGPT-4 (unconfirmed speculation, but likely), it's not as if they are not 'natively' training the composite system of projection-aligned models.

There is nothing in any of Google's claims that preclude the architecture being the same kind of composite system. Maybe with some additional blending in of multimodal training earlier in the process than has been published so far. And perhaps also unlike GPT-4V, they might have aligned a pretrained audio model to eliminate the need for a separate speech recognition layer and possibly solving for multi-speaker recognition by voice characteristics, but they didn't even demo that... Even this would not be groundbreaking though. ImageBind from Meta demonstrated the capacity align an audio model with an LLM in the same way images models have been aligned with LLMs. I would perhaps even argue that Google skipping the natural language intermediate step between LLM output and image generation is actually in support of the position that they may be using projection layers to create interfaces between these modalities. However, this direct image generation projection example was also a capability published by Meta with ImageBind.

What seems more likely, and not entirely unimpressive, is that they refined those existing techniques for building composite multimodal systems and created something that they plan to launch soon. However, they still have crucially not actually launched it here. Which puts them in a similar position to when GPT-4 was first announced with vision capabilities, but then did not offer them as a service for quite an extended time. Google has yet to ship it, and as a result fails to back up any of their interesting claims with evidence.

Most of Google's demos here are possible with a clever interface layer to GPT-4V + Whisper today. And while the demos 'feel' more natural, there is no claim being made that they are real-time demos, so we don't know how much practical improvement in the interface and user experience would actually be possible in their product when compared to what is possible with clever combinations of GPT-4V + Whisper today.

New comment by readyplayeremma in "Show HN: How did your computer reach my server?"

readyplayeremma — Tue, 05 Dec 2023 18:53:21 +0000

Really awesome project!

A technical nitpick though: Routes can be asymmetric—going across one path in one direction and another for the opposite. This means that your tool potentially doesn't show the route packets from the user took to reach your server, but rather the route packets took from your server to reach the user. I believe that querying with BGP looking glass tools would allow you to construct the route in either direction, but it is maybe a bit less cool looking than the real-time traceroute that is a result of actual traffic.

New comment by readyplayeremma in "LLaVA-Plus: Large Language and Vision Assistants That Learn to Use Skills"

readyplayeremma — Fri, 10 Nov 2023 10:44:56 +0000

LLaVA-Plus maintains a skill repository that contains a wide range of vision and vision-language pre-trained models (tools), and is able to activate relevant tools, given users’ multimodal inputs, to compose their execution results on the fly to fulfill many real-world tasks.

LLaVA-Plus: Large Language and Vision Assistants That Learn to Use Skills

readyplayeremma — Fri, 10 Nov 2023 10:44:56 +0000

Article URL: https://llava-vl.github.io/llava-plus/

Comments URL: https://news.ycombinator.com/item?id=38217302

Points: 1

# Comments: 1

New comment by readyplayeremma in "Artificial Intelligence and Copyright: Request for comments"

readyplayeremma — Fri, 01 Sep 2023 06:26:57 +0000

> People casually asserting that software is equivalent to humanity will be a non-negligible thing to consider, as irritating and poorly-founded as it seems.

> If the reproduction isn't pixel-perfect, but merely obvious and overwhelming, how do you refute that philosophically to people who refuse a distinction between 50GB and a human life?

Software equivalence to humanity is a very philosophical question that many sci-fi writers have approached. But our primary issue related to this technology does not depend on anyone making a determination there.

The challenge is that losses to livelihood from this technology are going to come from far broader impacts than copyright alone. Copyright disputes are just the first things to get everyone's attention.

Let's say we err on the side of protection of copyright, and all training data must be fully licensed, in addition to users being responsible for ensuring outputs did not accidentally reproduce something similar to a copyrighted work, even if it was part of the licensed training dataset. Great! This fixes the problem of lost value for the owners of copyrights. Companies will face a slight delay and slightly increased costs as they license content; however, in the end, model capabilities will be the same and continue to increase.

The number of jobs that actually cannot be performed without humans will continue to dwindle — livelihoods will be lost at essentially the same scale despite upholding copyrights.

The only way we can handle a technology capable of reducing most need for human labor is by focusing on planning and executing a smooth transition toward an economy with more people than jobs — aiming for minimal human suffering during this process.

A mass loss of human jobs does not need to mean a mass loss of livelihood if our society is prepared to transition to a universal basic income. After all, human life is far more than just a job. We have the opportunity for much more fulfilling lives if we plan this transition well. We must understand that this is a far larger issue than copyright - copyright disputes are just one of the first symptoms of this disruptive process.

New comment by readyplayeremma in "Artificial Intelligence and Copyright: Request for comments"

readyplayeremma — Fri, 01 Sep 2023 03:01:29 +0000

This is the best path forward I think. And it will become increasingly sensible as things continue to evolve. AI wasn't necessary to violate copyright before, and it isn't necessary today.

The determination of copyright violation should be made against the output of the model in the event that someone uses it for commercial purposes.

If the models have a risk of generating copyrighted content, it will be up to the consumers of the system to mitigate that risk through manual review or automated checks of the output.

New comment by readyplayeremma in "Show HN: Llama2 Embeddings FastAPI Server"

readyplayeremma — Tue, 15 Aug 2023 17:49:44 +0000

The SGPT model is a very high performing text embeddings model adapted from a decoder. Using the same techniques with Llama-2 might perform better than you expect. I think someone will need to try these things before we know for certain. I believe there is still room for significant improvement with embedding models.

New comment by readyplayeremma in "On-disk HNSW index for Postgres with pg_embedding"

readyplayeremma — Fri, 04 Aug 2023 03:13:18 +0000

The GitHub project has no license file that I see. Does anyone know if this is going to be released under an OSS license of some kind?

New comment by readyplayeremma in "Which vector database should I use? A comparison cheatsheet"

readyplayeremma — Mon, 31 Jul 2023 15:55:03 +0000

I can also add one more data-point in favor of Elastic / OpenSearch. They benefit from a long history of providing search-specific features. Including the ability to write custom re-ranking functions to combine the benefits of traditional TF/IDF style search with the modern benefit of vector search techniques. And you can easily use OpenSearch with state of the art open embedding models like SGPT that use 2048-dimensional vectors. Plus, it is designed to be highly scalable and distributed.

Given how well OpenSearch works and scales, I would find it hard to justify a specialized vector-specific database unless it brought A LOT of new benefits to the table. And I am not currently aware how any of them would actually do that.

Also, OpenSearch provides all of that out-of-the-box. You just configure a vector field mapping and start inserting your data. No need for an add-on plugin/extension. It just works.

New comment by readyplayeremma in "ChatGPT loses users for first time, shaking faith in AI revolution"

readyplayeremma — Fri, 07 Jul 2023 20:20:03 +0000

This is the correct viewpoint. ChatGPT is one specific implementation of the technology, which was most people's first exposure to it. The broader applications of the technology itself are still in the very early stages, but are already making significant impacts.

New comment by readyplayeremma in "GPT-4 apparently can't recite the litany against fear from Dune"

readyplayeremma — Sun, 18 Jun 2023 01:03:48 +0000

I can confirm this happens with the API as well though. But it does work if you ask it to output the quote in YAML format somehow.

acknowledgeFear: I must not fear.

understandEffect: Fear is the mind-killer.

describeImpact: Fear is the little-death that brings total obliteration.

faceFear: I will face my fear.

letItPass: I will permit it to pass over me and through me.

observeAbsence: And when it has gone past, I will turn the inner eye to see its path.

finalResult: Where the fear has gone there will be nothing. Only I will remain.

New comment by readyplayeremma in "Bytes are all you need: Transformers operating directly on file bytes"

readyplayeremma — Sun, 04 Jun 2023 07:00:02 +0000

Except the person in the room also had to write the book by looking at examples of Chinese.