<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: PhilippGille</title><link>https://news.ycombinator.com/user?id=PhilippGille</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 10:35:40 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=PhilippGille" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by PhilippGille in "MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second"]]></title><description><![CDATA[
<p>The interesting bits on how they achieved it:<p>> On the model side, we applied FP4 quantization<p>> introduced DFlash, an efficient speculative decoding method based on block-level masked parallel prediction<p>> On the system side, TileRT perfectly adapts to the dynamic characteristics of these algorithms<p>> 1000+ tokens/s output [...] using just a single standard 8-GPU commodity node</p>
]]></description><pubDate>Tue, 09 Jun 2026 06:36:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48457372</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48457372</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48457372</guid></item><item><title><![CDATA[New comment by PhilippGille in "MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities"]]></title><description><![CDATA[
<p>The blog post has more info: <a href="https://www.minimax.io/blog/minimax-m3" rel="nofollow">https://www.minimax.io/blog/minimax-m3</a></p>
]]></description><pubDate>Mon, 01 Jun 2026 07:29:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48353654</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48353654</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48353654</guid></item><item><title><![CDATA[New comment by PhilippGille in "Step 3.7 Flash"]]></title><description><![CDATA[
<p>Do you mean MiMo V2 Flash? V2.5 doesn't have a Flash version.</p>
]]></description><pubDate>Fri, 29 May 2026 22:11:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48329983</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48329983</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48329983</guid></item><item><title><![CDATA[New comment by PhilippGille in "Quack: The DuckDB Client-Server Protocol"]]></title><description><![CDATA[
<p>It's in the article:<p>> HTTP also allows the DuckDB-Wasm distribution to speak Quack natively! So DuckDB running in a browser can e.g., directly connect to a DuckDB instance running in an EC2 server using Quack.</p>
]]></description><pubDate>Tue, 12 May 2026 21:31:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48114889</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48114889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48114889</guid></item><item><title><![CDATA[New comment by PhilippGille in "Using Claude Code: The unreasonable effectiveness of HTML"]]></title><description><![CDATA[
<p>Both the original Markdown spec [1] as well as CommonMark [2] clearly specify support for inline HTML. With that you can kind of get the best of both words depending on your use case.<p>For the most parts you just write the regular Markdown headers and paragraphs, embed images, insert tables etc without the need for any HTML tags, making it readable in source form. And if you want to embed an SVG file for example, which the author of the article mentions as one use case, you just embed the SVG directly, and people can render the Markdown in their favorite viewer.<p>Let's say you're viewing a raw Markdown file in VS Code. You come onto an HTML tag, so you hit Cmd+Shift+V to open the preview and that's it.<p>Of course for full-fledged web pages with interactive buttons and fully customized styling and all of that, which the author shows in some examples, this is not feasible. But you can get very far when you have mostly text/images/tables and just want to add some extras here and there.<p>[1] <a href="https://daringfireball.net/projects/markdown/syntax#html" rel="nofollow">https://daringfireball.net/projects/markdown/syntax#html</a><p>[2] <a href="https://spec.commonmark.org/0.31.2/#html-blocks" rel="nofollow">https://spec.commonmark.org/0.31.2/#html-blocks</a></p>
]]></description><pubDate>Sat, 09 May 2026 09:32:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48073497</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48073497</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48073497</guid></item><item><title><![CDATA[La Suite Docs v5.0.0 released]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/suitenumerique/docs/releases/v5.0.0">https://github.com/suitenumerique/docs/releases/v5.0.0</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48072833">https://news.ycombinator.com/item?id=48072833</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 09 May 2026 07:42:57 +0000</pubDate><link>https://github.com/suitenumerique/docs/releases/v5.0.0</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48072833</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48072833</guid></item><item><title><![CDATA[New comment by PhilippGille in "DeepSeek 4 Flash local inference engine for Metal"]]></title><description><![CDATA[
<p>On max it uses more than twice as many tokens as on high when running the ArtificialAnalysis benchmark suite, and then it's indeed the model with the highest token usage (among the current top tier models). See the "Intelligence vs. Token Use" chart here:<p><a href="https://artificialanalysis.ai/models?models=gpt-5-5%2Cgpt-5-5-high%2Cgemini-3-1-pro-preview%2Cclaude-opus-4-7%2Cclaude-sonnet-4-6-adaptive%2Cdeepseek-v4-flash%2Cdeepseek-v4-flash-high%2Cdeepseek-v4-pro%2Cdeepseek-v4-pro-high%2Cminimax-m2-7%2Ckimi-k2-6%2Cglm-5-1&intelligence-index-token-use=intelligence-vs-token-use#intelligence-index-token-use-tabs" rel="nofollow">https://artificialanalysis.ai/models?models=gpt-5-5%2Cgpt-5-...</a></p>
]]></description><pubDate>Thu, 07 May 2026 18:24:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48052926</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48052926</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48052926</guid></item><item><title><![CDATA[New comment by PhilippGille in "Ask HN: Best Embedding Models?"]]></title><description><![CDATA[
<p>Benchmarks only paint part of the picture, but it's still a decent place to start looking into recent models:<p><a href="https://huggingface.co/spaces/mteb/leaderboard" rel="nofollow">https://huggingface.co/spaces/mteb/leaderboard</a></p>
]]></description><pubDate>Tue, 05 May 2026 06:04:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48018649</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=48018649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48018649</guid></item><item><title><![CDATA[New comment by PhilippGille in "DeepSeek v4"]]></title><description><![CDATA[
<p>When you say "Gemini", which exact model do you mean? You know there are several and they vary a lot in how capable they are? Pro 3.1 Preview, 2.5 Pro (their latest non-preview pro model), Flash 3 Preview, ...<p>Same with GPT-5: Latest 5.5, prior 5.4, or actually the original 5 (.0)?<p>You can't talk about model performance without specifying the exact model.</p>
]]></description><pubDate>Fri, 24 Apr 2026 14:26:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47890770</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47890770</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47890770</guid></item><item><title><![CDATA[New comment by PhilippGille in "High-Level Rust: Getting 80% of the Benefits with 20% of the Pain"]]></title><description><![CDATA[
<p>> C# [...] only really works properly in Windows<p>What do you mean with this? Maybe you are thinking of the old ".NET Framework" runtime, which only runs on Windows? Nowadays there is ".NET Core" which runs on macOS and Linux as well.</p>
]]></description><pubDate>Sun, 12 Apr 2026 08:30:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47737331</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47737331</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47737331</guid></item><item><title><![CDATA[New comment by PhilippGille in "I run multiple $10K MRR companies on a $20/month tech stack"]]></title><description><![CDATA[
<p>He specifically mentions that he is using GitHub Copilot because of how Microsoft bills per request instead of token.</p>
]]></description><pubDate>Sun, 12 Apr 2026 07:49:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47737064</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47737064</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47737064</guid></item><item><title><![CDATA[New comment by PhilippGille in "Old laptops in a colo as low cost servers"]]></title><description><![CDATA[
<p>> it is possible with some software to have everything massively cached, with the cloud doing that, with the origin server in my basement, only accessible from the allowed cache arrangement<p>Do you mean a setup like:<p><pre><code>    client -> cloud(HAProxy+Varnish) -WireGuard-> basement(backend)
</code></pre>
Or something else?</p>
]]></description><pubDate>Fri, 10 Apr 2026 06:21:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47714298</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47714298</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47714298</guid></item><item><title><![CDATA[New comment by PhilippGille in "A truck driver spent 20 years making a scale model of every building in NYC"]]></title><description><![CDATA[
<p>If you are interested in scale models of New York, there's a 1:1 scale model in Minecraft: <a href="https://youtu.be/ZouSJWXFBPk" rel="nofollow">https://youtu.be/ZouSJWXFBPk</a></p>
]]></description><pubDate>Tue, 07 Apr 2026 21:28:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47681579</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47681579</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47681579</guid></item><item><title><![CDATA[New comment by PhilippGille in "Docker Offload"]]></title><description><![CDATA[
<p>The article tries to sell it to people who can't run Docker locally (e.g. locked down permissions in enterprise environments, slow old laptop), but hasn't it already been possible to use remote Docker engines?<p>So the news is that they're offering to host those remotes now, right?</p>
]]></description><pubDate>Sun, 05 Apr 2026 18:50:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47652602</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47652602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47652602</guid></item><item><title><![CDATA[New comment by PhilippGille in "Codex pricing to align with API token usage, instead of per-message"]]></title><description><![CDATA[
<p>Is this not just about <i>extra</i> credit? So what's included in the subscription doesn't change - just extra credits are now token based instead of message based? (For Plus/Pro)</p>
]]></description><pubDate>Sun, 05 Apr 2026 16:55:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47651318</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47651318</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47651318</guid></item><item><title><![CDATA[New comment by PhilippGille in "Qwen3.6-Plus: Towards real world agents"]]></title><description><![CDATA[
<p>The OpenRouter usage stats indicate the opposite: <a href="https://openrouter.ai/rankings?view=month" rel="nofollow">https://openrouter.ai/rankings?view=month</a></p>
]]></description><pubDate>Thu, 02 Apr 2026 15:19:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47615653</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47615653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47615653</guid></item><item><title><![CDATA[New comment by PhilippGille in "43 hours battery life: Dell XPS 14 2026 lasts almost 3x longer vs MacBook Air 15"]]></title><description><![CDATA[
<p>This is their MBP 14" M5 Max review, with a "Battery life" section and their standard web browsing test: <a href="https://www.notebookcheck.net/M5-Max-with-inconsistent-performance-and-throttling-issues-Apple-MacBook-Pro-14-Review.1246064.0.html#c15764322" rel="nofollow">https://www.notebookcheck.net/M5-Max-with-inconsistent-perfo...</a><p>15h 10min</p>
]]></description><pubDate>Thu, 02 Apr 2026 06:29:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47610678</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47610678</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47610678</guid></item><item><title><![CDATA[New comment by PhilippGille in "I traced my traffic through a home Tailscale exit node"]]></title><description><![CDATA[
<p>The article does list what Tailscale adds on top of WireGuard:<p>> WireGuard by itself is mostly the data plane. Tailscale adds the control plane on top: identity/SSO, peer discovery, NAT traversal coordination, ACL distribution, route distribution (including exit node default routes), MagicDNS, and fast device revocation.</p>
]]></description><pubDate>Wed, 01 Apr 2026 06:24:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47597518</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47597518</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47597518</guid></item><item><title><![CDATA[New comment by PhilippGille in "purl: a curl-esque CLI for making HTTP requests that require payment"]]></title><description><![CDATA[
<p>In the cryptocurrency world this has existed many years already. For example with the Lightning network on top of Bitcoin it has always been easy to let the server generate an invoice, which the client pays and then the client sends another request including cryptographic proof of the payment. On layer 2 it was always cheap and fast.<p>For example I created this Go middleware at the time: <a href="https://github.com/philippgille/ln-paywall#how-it-works" rel="nofollow">https://github.com/philippgille/ln-paywall#how-it-works</a> (currently defunct)<p>Similar projects implemented that into standalone API gateways.<p>All using status code 402 already.<p>Here Stripe's example is using USDC, so still crypto BTW.</p>
]]></description><pubDate>Wed, 25 Mar 2026 16:35:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47519687</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47519687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47519687</guid></item><item><title><![CDATA[New comment by PhilippGille in "ARM to make processors for first time in their history"]]></title><description><![CDATA[
<p>83 points, 24 comments: <a href="https://news.ycombinator.com/item?id=47506641">https://news.ycombinator.com/item?id=47506641</a></p>
]]></description><pubDate>Tue, 24 Mar 2026 22:52:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47510690</link><dc:creator>PhilippGille</dc:creator><comments>https://news.ycombinator.com/item?id=47510690</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47510690</guid></item></channel></rss>