<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: whamp</title><link>https://news.ycombinator.com/user?id=whamp</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 11:24:28 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=whamp" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by whamp in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>Yea whisper has more features and is awesome if you have the hardware to run the big models that are accurate enough. The constraint here is the best cpu only implementation. By no means am I wedded or affiliated with parakeet, it's just the best/fastest within the CPU hardware space.</p>
]]></description><pubDate>Sat, 15 Nov 2025 16:32:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=45938546</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45938546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45938546</guid></item><item><title><![CDATA[New comment by whamp in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>Never said it did! Working with what I got.</p>
]]></description><pubDate>Sat, 15 Nov 2025 16:29:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=45938526</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45938526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45938526</guid></item><item><title><![CDATA[New comment by whamp in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>Here is the huggingface ASR leaderboard for those wondering how parakeet V3 compares to Whisper Large V3<p>Accuracy Average WER:
Whisper-large-v3  4.91 vs Parakeet V3 5.05<p>Speed RTFx:
Whisper-large-v3 126 vs PArakeet V3 2154<p>~17x faster<p><a href="https://huggingface.co/spaces/hf-audio/open_asr_leaderboard" rel="nofollow">https://huggingface.co/spaces/hf-audio/open_asr_leaderboard</a></p>
]]></description><pubDate>Sat, 15 Nov 2025 03:32:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=45934822</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45934822</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45934822</guid></item><item><title><![CDATA[New comment by whamp in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>My project has a built-in word_replacement so you can automatically replace certain terms if that's important to you in the config.toml<p>i loved whisper but it was insanely slow on cpu only and even then it was with a smaller whisper that isn't as accurate as parakeet.<p>my windows environment locks down the built-in windows option so i don't have a way to test it. i've heard it's pretty good if you're allowed to use it, but your inputs don't stay local which is why i needed to create this project.</p>
]]></description><pubDate>Fri, 14 Nov 2025 23:42:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=45933510</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45933510</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45933510</guid></item><item><title><![CDATA[New comment by whamp in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>btw this is my first open-source project</p>
]]></description><pubDate>Fri, 14 Nov 2025 19:12:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=45930731</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45930731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45930731</guid></item><item><title><![CDATA[Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required]]></title><description><![CDATA[
<p>I’ve been working in fairly locked‑down Windows environments where I’m allowed to run Python, but not install or launch new `.exe` files. In addition the built-in windows dictations are blocked (the only good one isn't local anyway). At the same time, I really wanted accurate, fast dictation without sending audio to a cloud service, and without needing a GPU. Most speech‑to‑text setups I tried either required special launchers, GPU access, or were awkward to run day‑to‑day.<p>To scratch that itch, I built Chirp, a Windows dictation app that runs fully locally, uses NVIDIA’s ParakeetV3 model, and is managed end‑to‑end with `uv`. If you can run Python on your machine, you should be able to run Chirp—no additional executables required.<p>Under the hood, Chirp uses the Parakeet TDT 0.6B v3 ONNX bundle. ParakeetV3 has accuracy in the same ballpark as Whisper‑large‑v3 (multilingual WER ~4.9 vs ~5.0 in the open ASR leaderboard), but it’s much faster and happy on CPU.<p>The flow is:
- One‑time setup that downloads and prepares the ONNX model:
  - `uv run python -m chirp.setup`
- A long‑running CLI process:
  - `uv run python -m chirp.main`
- A global hotkey that starts/stops recording and injects text into the active window.<p>A few details that might be interesting technically:<p>- <i>Local‑only STT:</i> Everything runs on your machine using ONNX Runtime; by default it uses CPU providers, with optional GPU providers if your environment allows.<p>- <i>Config‑driven behavior:</i> A `config.toml` file controls the global hotkey, model choice, quantization (`int8` option), language, ONNX providers, and threading. There’s also a simple `[word_overrides]` map so you can fix tokens that the model consistently mishears.<p>- <i>Post‑processing pipeline:</i> After recognition, there’s an optional “style guide” step where you can specify prompts like “sentence case” or “prepend: >>” for the final text.<p>- <i>No clipboard gymnastics required on Windows:</i> The app types directly into the focused window; there are options for clipboard‑based pasting and cleanup behavior for platforms where that makes more sense.<p>- <i>Audio feedback:</i> Start/stop sounds (configurable) let you know when the mic is actually recording.<p>So far I’ve mainly tested this on my own Windows machines with English dictation and CPU‑only setups. There are probably plenty of rough edges (different keyboard layouts, language settings, corporate IT policies, etc.), and I’d love feedback from people who:<p>- Work in restricted corporate environments and need local dictation.
- Have experience with Parakeet/Whisper or ONNX Runtime and see obvious ways to improve performance or robustness.
- Want specific features (e.g., better multi‑language support, more advanced post‑processing, or integrations with their editor/IDE).<p>Repo is here:  
`<a href="https://github.com/Whamp/chirp" rel="nofollow">https://github.com/Whamp/chirp</a>`<p>If you try it, I’d be very interested in:<p>- CPU usage and latency on your hardware,
- How well it behaves with your keyboard layout and applications,
- Any weird failure cases or usability annoyances you run into.<p>Happy to answer questions and dig into technical details in the comments.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45930659">https://news.ycombinator.com/item?id=45930659</a></p>
<p>Points: 34</p>
<p># Comments: 18</p>
]]></description><pubDate>Fri, 14 Nov 2025 19:07:45 +0000</pubDate><link>https://github.com/Whamp/chirp</link><dc:creator>whamp</dc:creator><comments>https://news.ycombinator.com/item?id=45930659</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45930659</guid></item></channel></rss>