<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: fzysingularity</title><link>https://news.ycombinator.com/user?id=fzysingularity</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 00:44:54 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=fzysingularity" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by fzysingularity in "OpenCV 5 Is Here: The Biggest Leap in Years for Computer Vision"]]></title><description><![CDATA[
<p>That’s a pretty large binary for simply loading images.<p>In all honesty, OpenCV has stood the test of time and I’m certain newer LLMs will likely not attempt to rewrite it from scratch.<p>P.S. I’ve been a user since the IplImage days, circa 2007, and I’d still consider using it over most CV libraries today.</p>
]]></description><pubDate>Wed, 10 Jun 2026 05:54:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472001</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=48472001</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472001</guid></item><item><title><![CDATA[New comment by fzysingularity in "Claude Fable 5"]]></title><description><![CDATA[
<p>I can’t help but think that there are so many astroturfed comments in here.<p>Seems like a concerted and distributed effort from the entire Anthropic team every time to get this on top of HN.</p>
]]></description><pubDate>Wed, 10 Jun 2026 04:23:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48471397</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=48471397</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48471397</guid></item><item><title><![CDATA[New comment by fzysingularity in "Running Python code in a sandbox with MicroPython and WASM"]]></title><description><![CDATA[
<p>Kind of crazy how many bespoke Python sandbox implementations have popped up in the past few months.<p>I’d love to see if we can get GPU access within these runtimes, that’d be awesome.</p>
]]></description><pubDate>Sun, 07 Jun 2026 03:34:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48431522</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=48431522</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48431522</guid></item><item><title><![CDATA[New comment by fzysingularity in "Running Python code in a sandbox with MicroPython and WASM"]]></title><description><![CDATA[
<p>What’s your experience with Monty? Been looking at it for one of our environments and it seems very promising.</p>
]]></description><pubDate>Sat, 06 Jun 2026 18:03:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48427379</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=48427379</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48427379</guid></item><item><title><![CDATA[New comment by fzysingularity in "Ask HN: Who is hiring? (May 2026)"]]></title><description><![CDATA[
<p>VLM Run (<a href="https://vlm.run" rel="nofollow">https://vlm.run</a>) | 1x Product + 1x ML Staff Engineer | Santa Clara, CA (HQ)<p>We're building the inference and orchestration layer for production Vision-Language Models. We care deeply about fast and ergonomic visual inference, reliable structured outputs, and the observability to iterate on them.<p>A few things we've shipped recently you can poke at:<p><pre><code>  1. Orion: our visual agent that reasons and acts over images, video, and documents. Chat at https://chat.vlm.run.
  2. mm-ctx: a Unix-style multimodal CLI (find, cat, grep, wc) that gives coding agents real context over images, video, and PDFs. Rust core, Python devex.
  3. vlmbench: a single-file CLI for benchmarking VLM inference (TTFT, TPOT, throughput) across vLLM, Ollama, and SGLang.
</code></pre>
Apply: <a href="https://app.dover.com/jobs/vlm-run" rel="nofollow">https://app.dover.com/jobs/vlm-run</a><p>Email hiring "at" vlm.run with your GitHub + a couple recent projects.<p>[1] <a href="https://chat.vlm.run" rel="nofollow">https://chat.vlm.run</a><p>[2] <a href="https://pypi.org/project/mm-ctx" rel="nofollow">https://pypi.org/project/mm-ctx</a> | <a href="https://www.vlm.run/open-source/mm" rel="nofollow">https://www.vlm.run/open-source/mm</a><p>[3] <a href="https://github.com/vlm-run/vlmbench" rel="nofollow">https://github.com/vlm-run/vlmbench</a> | <a href="https://www.vlm.run/open-source/vlmbench" rel="nofollow">https://www.vlm.run/open-source/vlmbench</a></p>
]]></description><pubDate>Fri, 01 May 2026 23:56:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47981876</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47981876</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47981876</guid></item><item><title><![CDATA[New comment by fzysingularity in "Improving Composer through real-time RL"]]></title><description><![CDATA[
<p>The recent Claude Code leak also revealed that they're poisoning their competitors via anti-distillation policies baked into the Claude Code CLI (fake tool calls, added noise, etc.).</p>
]]></description><pubDate>Thu, 02 Apr 2026 00:15:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47608410</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47608410</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47608410</guid></item><item><title><![CDATA[New comment by fzysingularity in "Ask HN: Who is hiring? (April 2026)"]]></title><description><![CDATA[
<p>VLM Run (<a href="https://vlm.run" rel="nofollow">https://vlm.run</a>) | 1x Infrastructure Engineer + 2x AI/ML Engineer | Santa Clara, CA (HQ)<p>VLM Run is building infrastructure for production Vision-Language Model (VLM) systems — fast inference, tool-use + orchestration, reliable structured outputs, and the observability to iterate quickly. We’re a deeply technical team of veteran AI / computer-vision engineers (20+ years combined, MIT/CMU PhDs) who’ve shipped production ML infrastructure across autonomous driving and LLMs.<p>Open roles:<p>1. Infrastructure Engineer (Full-time, ONSITE): $150K–$220K + 1–3% equity <a href="https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-927" rel="nofollow">https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-927</a>...<p>2. AI/ML Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity <a href="https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0f" rel="nofollow">https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0f</a>...<p>Email hiring "at" vlm.run with your GitHub + a couple recent projects.<p>P.S. We recently launched Orion, our visual agent that can reason and act over images, videos and documents. You can chat with Orion at <a href="https://chat.vlm.run" rel="nofollow">https://chat.vlm.run</a> and see capabilities at <a href="https://docs.vlm.run" rel="nofollow">https://docs.vlm.run</a>.<p>Apply: <a href="https://app.dover.com/jobs/vlm-run" rel="nofollow">https://app.dover.com/jobs/vlm-run</a></p>
]]></description><pubDate>Wed, 01 Apr 2026 16:49:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47603363</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47603363</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47603363</guid></item><item><title><![CDATA[New comment by fzysingularity in "Improving Composer through real-time RL"]]></title><description><![CDATA[
<p>Real-time or continuous learning is great on paper, but getting it to work without extremely expensive regression testing or catastrophic forgetting is a real challenge.<p>Credit to the team for taking this on, but I’d be skeptical of announcements like this without at least 3–6 months of proven production deployments. Definitely curious how this plays out.</p>
]]></description><pubDate>Sat, 28 Mar 2026 01:27:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47550578</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47550578</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47550578</guid></item><item><title><![CDATA[New comment by fzysingularity in "Improving Composer through real-time RL"]]></title><description><![CDATA[
<p>What do you think actually happened here in the past week?<p>They used Kimi but failed to acknowledge it in the original Composer announcement. The Kimi team probably reached out and asked WTF? Their only recourse was to publicly release their whitepaper with Kimi mentioned, winning brownie points for being open about their training pipeline while placating the Kimi team.</p>
]]></description><pubDate>Sat, 28 Mar 2026 01:17:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47550516</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47550516</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47550516</guid></item><item><title><![CDATA[New comment by fzysingularity in "Ask HN: Who is hiring? (March 2026)"]]></title><description><![CDATA[
<p>VLM Run (<a href="https://vlm.run" rel="nofollow">https://vlm.run</a>) | 1x Infrastructure Engineer + 2x AI/ML Engineer | Santa Clara, CA (HQ)<p>VLM Run is building infrastructure for production Vision-Language Model (VLM) systems — fast inference, tool-use + orchestration, reliable structured outputs, and the observability to iterate quickly. We’re a deeply technical team of veteran AI / computer-vision engineers (20+ years combined, MIT/CMU PhDs) who’ve shipped production ML infrastructure across autonomous driving and LLMs.<p>Open roles:<p>1. Infrastructure Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity 
<a href="https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-9271-88ebb50e6f5b" rel="nofollow">https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-927...</a><p>2. AI/ML Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity 
<a href="https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0ff-1c3367034a82" rel="nofollow">https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0f...</a><p>Email hiring "at" vlm.run with your GitHub + a couple recent projects.<p>P.S. We recently launched <i>Orion</i>, our visual agent that can reason and act over images, videos and documents. You can chat with Orion at <a href="https://chat.vlm.run" rel="nofollow">https://chat.vlm.run</a> and see capabilities at <a href="https://docs.vlm.run" rel="nofollow">https://docs.vlm.run</a>.<p>Apply: <a href="https://app.dover.com/jobs/vlm-run" rel="nofollow">https://app.dover.com/jobs/vlm-run</a></p>
]]></description><pubDate>Mon, 02 Mar 2026 19:46:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47223097</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47223097</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47223097</guid></item><item><title><![CDATA[New comment by fzysingularity in "AI Made Writing Code Easier. It Made Being an Engineer Harder"]]></title><description><![CDATA[
<p>AI allows you to accelerate the initial build process, but I think engineering is all about craftsmanship. Today most LLMs have poor taste, and chipping away at the cruft matters more than ever.</p>
]]></description><pubDate>Sun, 01 Mar 2026 17:04:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47208494</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47208494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47208494</guid></item><item><title><![CDATA[New comment by fzysingularity in "Hugging Face Skills"]]></title><description><![CDATA[
<p>uvx is probably the way to go here (a fully self-contained environment for each skill), with stdout as the I/O bridge between skills.</p>
]]></description><pubDate>Tue, 24 Feb 2026 19:36:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47141669</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47141669</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47141669</guid></item><item><title><![CDATA[New comment by fzysingularity in "Rolling your own serverless OCR in 40 lines of code"]]></title><description><![CDATA[
<p>With the cold-boot time on this model, it can hardly be called “serverless”.</p>
]]></description><pubDate>Mon, 16 Feb 2026 16:38:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47037109</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=47037109</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47037109</guid></item><item><title><![CDATA[New comment by fzysingularity in "GLM-OCR – A multimodal OCR model for complex document understanding"]]></title><description><![CDATA[
<p>Elo scores for OCR don't really make much sense - it's trying to reduce accuracy to a single voting score without any real quality control on the reviewer/judge.<p>I think a more accurate reflection of the current state of comparisons would be a real-world benchmark with messy/complex docs across industries and languages.</p>
]]></description><pubDate>Wed, 11 Feb 2026 17:56:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46978368</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46978368</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46978368</guid></item><item><title><![CDATA[New comment by fzysingularity in "GLM-OCR – A multimodal OCR model for complex document understanding"]]></title><description><![CDATA[
<p>Apple OCR even on the Mac is insanely good, in fact way better than AWS Textract/GCP Cloud Vision OCR.<p>Any idea what model is being used?</p>
]]></description><pubDate>Wed, 11 Feb 2026 16:44:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=46977246</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46977246</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46977246</guid></item><item><title><![CDATA[New comment by fzysingularity in "Ask HN: Who is hiring? (February 2026)"]]></title><description><![CDATA[
<p>VLM Run (<a href="https://vlm.run" rel="nofollow">https://vlm.run</a>) | Infrastructure Engineer + DevRel + AI/ML Engineer | Santa Clara, CA (HQ)<p>VLM Run is building infrastructure for production Vision-Language Model (VLM) systems — fast inference, tool-use + orchestration, reliable structured outputs, and the observability to iterate quickly. We’re a deeply technical team of veteran AI / computer-vision engineers (20+ years combined) who’ve shipped production ML infrastructure across autonomous driving and LLMs.<p>Open roles:<p>1. Infrastructure Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity
<a href="https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-9271-88ebb50e6f5b" rel="nofollow">https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-927...</a><p>2. Founding DevRel (Full-time, ONSITE/REMOTE): $90K–$140K + 0.5–3% equity
<a href="https://app.dover.com/apply/VLM%20Run/de84c63e-fd0a-418b-929b-ad02dea2a19b" rel="nofollow">https://app.dover.com/apply/VLM%20Run/de84c63e-fd0a-418b-929...</a><p>3. AI/ML Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity
<a href="https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0ff-1c3367034a82" rel="nofollow">https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0f...</a><p>Email hiring "at" vlm.run with your GitHub + a couple recent projects.<p>P.S. We recently launched <i>Orion</i>, our visual agent that can reason and act over images, videos and documents. You can chat with Orion at <a href="https://chat.vlm.run" rel="nofollow">https://chat.vlm.run</a> and see capabilities at <a href="https://docs.vlm.run" rel="nofollow">https://docs.vlm.run</a>.<p>Apply: <a href="https://app.dover.com/jobs/vlm-run" rel="nofollow">https://app.dover.com/jobs/vlm-run</a></p>
]]></description><pubDate>Mon, 02 Feb 2026 20:39:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=46861176</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46861176</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46861176</guid></item><item><title><![CDATA[DeepSeek OCR 2: Visual Causal Flow]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/deepseek-ai/DeepSeek-OCR-2">https://huggingface.co/deepseek-ai/DeepSeek-OCR-2</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46775984">https://news.ycombinator.com/item?id=46775984</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 27 Jan 2026 05:47:32 +0000</pubDate><link>https://huggingface.co/deepseek-ai/DeepSeek-OCR-2</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46775984</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46775984</guid></item><item><title><![CDATA[New comment by fzysingularity in "LMArena is a cancer on AI"]]></title><description><![CDATA[
<p>> It's like going to the grocery store and buying tabloids, pretending they're scientific journals.<p>This is pure gold. I've always found this approach of consensus-based evals on a moving target broken.</p>
]]></description><pubDate>Wed, 07 Jan 2026 23:13:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46534602</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46534602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46534602</guid></item><item><title><![CDATA[New comment by fzysingularity in "Claude Code CLI was broken"]]></title><description><![CDATA[
<p>I'd love to see Claude Code remove more lines than it added TBH.<p>There's a ton of cruft in code that humans are less inclined to remove because it just works, but imagine having an LLM do the cleanup work instead of the generation work.</p>
]]></description><pubDate>Wed, 07 Jan 2026 21:25:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=46533099</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46533099</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46533099</guid></item><item><title><![CDATA[New comment by fzysingularity in "Unified Vision-Language Agents – Detect, Segment, OCR, Generate and More"]]></title><description><![CDATA[
<p>Here's a short cookbook exploring an agentic approach to vision–language tasks: detection, segmentation, OCR, generation, and combining classical CV tools with VLM reasoning.<p>Happy to run examples if you leave a comment.<p>[1] IPython notebook: <a href="https://github.com/vlm-run/vlmrun-cookbook/blob/main/notebooks/12_orion_image_understanding.ipynb" rel="nofollow">https://github.com/vlm-run/vlmrun-cookbook/blob/main/noteboo...</a><p>[2] Colab: <a href="https://colab.research.google.com/github/vlm-run/vlmrun-cookbook/blob/main/notebooks/12_orion_image_understanding.ipynb" rel="nofollow">https://colab.research.google.com/github/vlm-run/vlmrun-cook...</a></p>
]]></description><pubDate>Wed, 03 Dec 2025 19:46:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46139075</link><dc:creator>fzysingularity</dc:creator><comments>https://news.ycombinator.com/item?id=46139075</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46139075</guid></item></channel></rss>