<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rsolva</title><link>https://news.ycombinator.com/user?id=rsolva</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 01 Jun 2026 03:12:55 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rsolva" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rsolva in "The Website Specification"]]></title><description><![CDATA[
<p>I have used it almost daily for the last few months, with only 3 simple extensions. On this project I spent a few minutes adding context to an AGENTS.md and having it write a todo list based on the spec, then I start a new chat for each task on the list. Most changes are just a few lines of code so it is very quick to review. But this a very small website project though.</p>
]]></description><pubDate>Sun, 31 May 2026 21:13:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48349811</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48349811</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48349811</guid></item><item><title><![CDATA[New comment by rsolva in "The Website Specification"]]></title><description><![CDATA[
<p>Thanks for making this!<p>I planed to make something like this as a skill for my self, but pasting this into any agent chat works like a charm. I just had my local model (Qwen3.6 27B / pi) make a list of all the required standards an older hugo site of mine was missing, make a todo list and then run through the whole thing one by one, giving me chance to review each change.<p>It even made the missing favicon by cropping out the symbol from the logo, and it looks good!</p>
]]></description><pubDate>Sun, 31 May 2026 19:46:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48349073</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48349073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48349073</guid></item><item><title><![CDATA[New comment by rsolva in "Ask HN: What has HN given you?"]]></title><description><![CDATA[
<p>A window into novel technology, commented on by knowledgeable people from around the world. Also, proof that some online communities can survive and thrive for decades while staying largely the same (no big redesigns etc).</p>
]]></description><pubDate>Mon, 25 May 2026 13:43:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48266742</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48266742</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48266742</guid></item><item><title><![CDATA[New comment by rsolva in "Access to frontier AI will soon be limited by economic and security constraints"]]></title><description><![CDATA[
<p>Nowadays I use our local setup 95% of the time, but it is not that long since that flipped for me personally.<p>Context: I have a $20 Claude Code subscription, and have used it for a handfull of small-ish projects the last year, in parallel with local models on my AMD 9700XTX (24GB) at home. Mostly Ministral 14B and more recently Qwen3.6 27B Dense 4q.<p>Historically, the tooling (interferens engines and harness) has been the biggest challenge when using local models, a lot of the benefits from Claude Code was a rather unified and well oiled agent system. Local setups often bring with them sutle incompatibilities between models, inference engines and agent systems that are not obvious from initial testing, but cause trouble on projects larger than a couple of files.<p>The Spark setup at work is now at a point where I do not miss Claude, like at all. A big part of this is the harness and the tools available to the agent, most critically a good tool for searching online. I use my Kagi subscription to allow the models to fetch up-to-date information, and the Kagi MCP I use also has a summarizer which is very helpful in avoiding rapidly filling up the context window.<p>I mostly use Zed and it's native agent, which only recently got muuuch better, and on the terminal I use Pi with a minimal selection of extensions (currently     pi-kagi-search, pi-smart-fetch, pi-btw and pi-diffloop). I also have Pi in Zed via the ACP, but it does not work so well with some of the extensions, especially the lack of a built-in permission system is a problem, when YOLO-mode is the only mode :)<p>Honestly, as long as you have a model that is decent at tool calling, your good. Having a solid and stable frame around your model makes a huge difference. The only caveat in all of this is that I spend most of my time on smaller projects and debugging on linux base systems, not huge and complex code bases, so your mileage might vary.<p>The next phase at work is to set up a chatGPT-like webinterface, and so far LibreChat is at the top of my shortlist. We had OpenWebUI for a while, but it is so bad at using MCP tools that it is practically non-functional for us. LibreChat is a bit more work to set up, but the interface and it's MCP story is much more solid. The goal is to plug in our internal helpdesk, docs and task manager system to LibreChat via MCPs to give us a quick way to query  and gather information that is currently very time consuming to do on your own.</p>
]]></description><pubDate>Mon, 18 May 2026 07:55:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48176652</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48176652</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48176652</guid></item><item><title><![CDATA[New comment by rsolva in "Access to frontier AI will soon be limited by economic and security constraints"]]></title><description><![CDATA[
<p>I have experimented with both q8 and q4 for KV cache. I can't find any difference between q8 and fp16, but q4 suffers more when the context grows. q8 seems like a good compromise and gives us enough ctx for about 6-8 concurrent, full context sessions. But we have not fully tested those limits yet, as the context windows rarely reach the limit.</p>
]]></description><pubDate>Fri, 15 May 2026 16:02:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48150281</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48150281</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48150281</guid></item><item><title><![CDATA[New comment by rsolva in "Access to frontier AI will soon be limited by economic and security constraints"]]></title><description><![CDATA[
<p>In our company of 24 employees, we get by with two DGX Sparks. We don't use AI heavily, but each Spark can serve about 6-8 concurrent requests with a full context lenght of 256k, which is decent. We get about ~35 t/s depending on the model we use (currently Qwen3.5 122B A10B and Qwen3 Coder Next), but we might set up a smaller model too for simpler tasks.<p>This works for us and will work for years to come. It is not SOTA, but it works darn well for our purposes, and we control the compute and data flowing through it, so totally worth it.</p>
]]></description><pubDate>Fri, 15 May 2026 10:19:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48146804</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48146804</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48146804</guid></item><item><title><![CDATA[New comment by rsolva in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Haha, not what I meant to write, but this works too!</p>
]]></description><pubDate>Wed, 13 May 2026 08:25:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48119240</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48119240</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48119240</guid></item><item><title><![CDATA[New comment by rsolva in "Screenshots of Old Desktop OSes"]]></title><description><![CDATA[
<p>I remember using some kind of software around the time of windows xp i think, that could replace the chrome/shell so you could design your own GUI entirely – but I can't remember what it was called! I spent a lot of time iterating and experimenting back then, replacing iexplore.exe or whatever the main process was called.</p>
]]></description><pubDate>Tue, 12 May 2026 21:30:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48114872</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48114872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48114872</guid></item><item><title><![CDATA[New comment by rsolva in "Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model"]]></title><description><![CDATA[
<p>Can it summarize text it fetches?<p>Come to think of it, this could be a nice model to have as the first pass in a more complex agent system where Needle hands of the results of a tool call to a larger model.<p>I will defiantly play around with this!</p>
]]></description><pubDate>Tue, 12 May 2026 21:15:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48114709</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48114709</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48114709</guid></item><item><title><![CDATA[New comment by rsolva in "Using Claude Code: The unreasonable effectiveness of HTML"]]></title><description><![CDATA[
<p>This is the way!<p>It is incredible how far you get with a single HTML-file, containing styles and JS, when building dashboards, small apps and other utilities that can interact with an API or otherwise fetch data from somwhere.<p>I just drop it on my personal ~ folder on the shared server at work and voilà, everyone can check it out and use it immediately!</p>
]]></description><pubDate>Sat, 09 May 2026 08:18:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48073056</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=48073056</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48073056</guid></item><item><title><![CDATA[New comment by rsolva in "New copy of earliest poem in English, written 1,3k years ago, discovered in Rome"]]></title><description><![CDATA[
<p>Knowing both Norwegian and Dutch, most words here is surprisingly similar to modern words:<p>hefenricæs = himmelrikes (no)<p>uerc = werk (nl)<p>eci = evig (no) / eeuwig (nl)<p>ærist = eerst (nl)<p>barnum = barn (no)<p>sceppend = schepper (nl)<p>EDIT: Hearing the poem read also gives dutch / germanic vibes: <a href="https://gutenberg.org/files/19677/ogg/19677.ogg" rel="nofollow">https://gutenberg.org/files/19677/ogg/19677.ogg</a></p>
]]></description><pubDate>Fri, 01 May 2026 14:00:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47974931</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47974931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47974931</guid></item><item><title><![CDATA[New comment by rsolva in "Can I disable all data collection from my vehicle?"]]></title><description><![CDATA[
<p>I can recommend the VW e-UP!s from 2013-2016ish. They have very little tech in them but are relatively modern. You can also quite easily tap into the control systems (climate etc) to remote control it with your own hardware: <a href="https://docs.openvehicles.com/en/latest/components/vehicle_vweup/docs/index.html" rel="nofollow">https://docs.openvehicles.com/en/latest/components/vehicle_v...</a><p>They are also super fun to drive and, although they have small batteries, the can charge at 40-50kWh, which translates to 10 minutes to ~85% full. We have used a eUP 2013 model to travel across europe (~900km) in two days, many times! One charge last between one and two hours, depending on speed and weather. We usually cruse at about 90km/h, and the car is basically sipping electrons! The newer model have double the range, but I have not owned or testet them, but might be a decent compromise for longer travels.</p>
]]></description><pubDate>Fri, 01 May 2026 09:50:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47972877</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47972877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47972877</guid></item><item><title><![CDATA[New comment by rsolva in "Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model"]]></title><description><![CDATA[
<p>I have the same GPU and get very good results, even better than Gemma 4 26B A4B, using the following setup (Fedora 43 Silverblue, podman compose):<p><pre><code>  services:
    llama:
      image: ghcr.io/ggml-org/llama.cpp:server-vulkan
      container_name: llama-qwen3.6-27b-dense
      ports:
        - 4201:8080
      volumes:
        - ./Qwen3.6-27B-Q4_K_M.gguf:/models/model.gguf:ro,z
        - ./mmproj-BF16.gguf:/models/mmproj.gguf:ro,z
      devices:
        - /dev/dri
      group_add:
        - video
      command: >
        -m /models/model.gguf
        --mmproj /models/mmproj.gguf
        --alias "Qwen3.6 27b Dense"
        -ngl 99
        -c 98304
        -b 2048
        --host 0.0.0.0
        --port 8080
        --parallel 2
        --kv-unified
        --ubatch-size 2048
        --flash-attn on
        -cb
        --jinja
        --no-webui
        -ctk q8_0
        -ctv q8_0
        --image-min-tokens 1024
        --temp 0.6
        --top-k 20
        --top-p 0.95
        --repeat-penalty 1
        --presence-penalty 1.5
        --reasoning auto
      restart: unless-stopped</code></pre></p>
]]></description><pubDate>Fri, 24 Apr 2026 22:05:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47896386</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47896386</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47896386</guid></item><item><title><![CDATA[New comment by rsolva in "I ran Gemma 4 as a local model in Codex CLI"]]></title><description><![CDATA[
<p>I run this model on my AMD RX7900XTX with 24GB VRAM with up to 4 concurrent chats and 512K context window in total. It is very fast (~100 t/s) and feels instant and very capable, and I have used Claude Code less and less these days.</p>
]]></description><pubDate>Mon, 13 Apr 2026 17:21:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47755186</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47755186</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47755186</guid></item><item><title><![CDATA[New comment by rsolva in "Sweden goes back to basics, swapping screens for books in the classroom"]]></title><description><![CDATA[
<p>Laptops and tables are, as it turns out, not so cheap either. They need to be fixed or replaced at an alarming rate, and they lay claim to a much larger part of a school budget than books ever did. That is part of the reason that we revert back to pen, paper and books in Norway. First for 1-4 grade, but it will be push further up the grades as we go, I think.</p>
]]></description><pubDate>Thu, 02 Apr 2026 15:14:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47615576</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47615576</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47615576</guid></item><item><title><![CDATA[New comment by rsolva in "AI Team OS – Turn Claude Code into a Self-Managing AI Team"]]></title><description><![CDATA[
<p>Nifty, looks like the enterprise edition of OpenClaw, kinda. Also, it looks token hungry!</p>
]]></description><pubDate>Sat, 21 Mar 2026 16:04:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47468244</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47468244</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47468244</guid></item><item><title><![CDATA[New comment by rsolva in "World Happiness Report 2026"]]></title><description><![CDATA[
<p>The term 'social media' has changed a lot over time. The attention grabbing kind we have today is a very different beast than what we started out with; no ads and only a chronological timeline showing posts from your network.<p>The original kind was genuinely connecting people and adding value. The current one is in effect isolating and driving people and groups apart.<p>Luckily, the original kind did not vanish. I find a lot of joy hanging out on the fediverse. I spend far less time on it than what I did on Twitter of FB back when I still had accounts there, but that is a good sign.<p>Social media is too generous term to use when describing products from Meta, TikTok, Snap, X etc. It is an ad platform that also, occasionality, shows you what your friends are up to.<p>We should come up with a better term than 'social media' when describing platforms that has reached the last stage of enshitification.</p>
]]></description><pubDate>Thu, 19 Mar 2026 19:59:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47445031</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47445031</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47445031</guid></item><item><title><![CDATA[New comment by rsolva in "Does that use a lot of energy?"]]></title><description><![CDATA[
<p>It is not only about raw power consumption. Comparing driving an electric car with using AI only in kW hides a major point: Hyperscale datacenters are massively centralised, which brings it's own problems; a lot of energy is used for cooling, and water consumptions is enormous. Charging electric cars at home is distributed and does not suffer from the same problems as the centralised hyperscalers do. Also, running AI models at home is not much different than a gaming session :)</p>
]]></description><pubDate>Wed, 04 Mar 2026 21:46:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47254373</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47254373</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47254373</guid></item><item><title><![CDATA[New comment by rsolva in "Mullvad VPN: Banned TV Ad in the Streets of London [video]"]]></title><description><![CDATA[
<p>Great ad. Long, but great.</p>
]]></description><pubDate>Tue, 03 Mar 2026 12:59:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=47231635</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47231635</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47231635</guid></item><item><title><![CDATA[New comment by rsolva in "Attention Media ≠ Social Networks"]]></title><description><![CDATA[
<p>I run my own instance at home. Not everybody can do this, but if more nerds hosted instances for friends and family, it would get us somewhere at least :)<p>The goal is not instant world domination, but a organically grown network, providing an off-ramp for those that have had enough and are ready to take some action. Facilitating real world meet-ups would help, I think!</p>
]]></description><pubDate>Wed, 25 Feb 2026 18:48:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47155946</link><dc:creator>rsolva</dc:creator><comments>https://news.ycombinator.com/item?id=47155946</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47155946</guid></item></channel></rss>