<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: hamza_q_</title><link>https://news.ycombinator.com/user?id=hamza_q_</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 12:00:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=hamza_q_" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by hamza_q_ in "Show HN: Iron-Wolf – Wolfenstein 3D source port in Rust"]]></title><description><![CDATA[
<p>Cool! I did an incomplete version in Rust a while back as well. Not a source port, tried to recreate the game from scratch myself, without looking at the C src code<p><a href="https://github.com/hamzaq2000/wolf3d-reimpl-rs" rel="nofollow">https://github.com/hamzaq2000/wolf3d-reimpl-rs</a></p>
]]></description><pubDate>Sat, 21 Feb 2026 20:25:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47104342</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=47104342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47104342</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Our new SAM audio model transforms audio editing"]]></title><description><![CDATA[
<p>Yeah would love contributions! Here's a brief overview of how I think it can be done:<p>Senko has two clustering types, (1) spectral for audio < 20 mins in length, and (2) UMAP+HDBSCAN for >= 20 mins. In the clustering code, spectral actually already supports orcale/min/max speakers, but UMAP+HDBSCAN doesn't. However, someone forked Senko and added min/max speakers to that here (for oracle, I guess min = max): <a href="https://github.com/DedZago/senko/commit/c33812ae185a5cd420f2cb3f38a3c0cc7d673644" rel="nofollow">https://github.com/DedZago/senko/commit/c33812ae185a5cd420f2...</a><p>So I think all that's required is basically just testing this thoroughly to make sure it doesn't introduce any regressions in clustering quality. And then just wiring the oracle/min/max parameters to the Diarizer class, or diarize() func.</p>
]]></description><pubDate>Fri, 26 Dec 2025 20:26:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=46395850</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=46395850</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46395850</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Our new SAM audio model transforms audio editing"]]></title><description><![CDATA[
<p>Thanks for checking it out!<p>Yeah unfortunately, since the diarization is acoustic features based, it really does require high recorded voice fidelity/quality to get the best results.
However, I just added another knob to the Diarizer class called mer_cos, which controls the speaker merging threshold. The default is 0.875, so perhaps try lowering to 0.8. That should help.<p>I'll also get around to adding a oracle/min/max speakers feature at some point, for cases where you know the exact number of speakers ahead of time, or wanna set upper/lower bounds. Gotten busy with another project, so haven't done it yet. PR's welcome though! haha</p>
]]></description><pubDate>Thu, 25 Dec 2025 19:30:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=46386524</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=46386524</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46386524</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Our new SAM audio model transforms audio editing"]]></title><description><![CDATA[
<p>Use Demucs bruh
<a href="https://github.com/adefossez/demucs" rel="nofollow">https://github.com/adefossez/demucs</a></p>
]]></description><pubDate>Tue, 23 Dec 2025 20:19:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46369047</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=46369047</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46369047</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Our new SAM audio model transforms audio editing"]]></title><description><![CDATA[
<p>Yeah I was frustrated by slow and hard to use OSS diarization too; recently released a library to address that, check it out: <a href="https://github.com/narcotic-sh/senko" rel="nofollow">https://github.com/narcotic-sh/senko</a><p>Also <a href="https://zanshin.sh" rel="nofollow">https://zanshin.sh</a>, if you'd like speaker diarization when watching YouTube videos</p>
]]></description><pubDate>Tue, 23 Dec 2025 20:05:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=46368900</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=46368900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46368900</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Vince Zampella, developer of Call of Duty and Battlefield has died"]]></title><description><![CDATA[
<p>Thanks for COD: MW2 (2009), Vince. The game of my childhood.
Rest in Peace.</p>
]]></description><pubDate>Mon, 22 Dec 2025 20:51:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46358900</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=46358900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46358900</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required"]]></title><description><![CDATA[
<p>Cool use of ONNX!
Fluid Inference also have great implementations of Parakeet v2/v3 in CoreML for Apple devices and OpenVINO for Intel:<p><a href="https://github.com/FluidInference/FluidAudio" rel="nofollow">https://github.com/FluidInference/FluidAudio</a><p><a href="https://github.com/FluidInference/eddy-audio" rel="nofollow">https://github.com/FluidInference/eddy-audio</a></p>
]]></description><pubDate>Fri, 14 Nov 2025 21:34:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=45932458</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45932458</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45932458</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Ask HN: Who wants to be hired? (November 2025)"]]></title><description><![CDATA[
<p>Location: Vancouver, BC, Canada<p>Remote: Yes<p>Willing to relocate: Yes<p>Technologies: diarization, Voice AI, PyTorch, CoreML,<p>Svelte/SvelteKit, Flask, SQLite, Tauri<p>Résumé/CV: <a href="https://hamzaq.com/Hamza_Qayyum_Resume_Public.pdf" rel="nofollow">https://hamzaq.com/Hamza_Qayyum_Resume_Public.pdf</a><p>Email: mhamzaqayyum [at] icloud [dot] com<p>---------<p>Projects:<p>- Senko: very fast, accurate, speaker diarization (<a href="https://senko.sh" rel="nofollow">https://senko.sh</a>)<p>- Zanshin: novel media player that allows you to navigate by speaker (<a href="https://zanshin.sh" rel="nofollow">https://zanshin.sh</a>)</p>
]]></description><pubDate>Tue, 04 Nov 2025 06:19:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=45807879</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45807879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45807879</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Ghostty with ⌘+F search"]]></title><description><![CDATA[
<p>Thought about it but it seems they have some stringent pre-req's they'd like: <a href="https://github.com/ghostty-org/ghostty/issues/189" rel="nofollow">https://github.com/ghostty-org/ghostty/issues/189</a><p>I didn't care for those; just told Claude Code to add in the feature directly. So they probably wouldn't accept the PR if I made one.</p>
]]></description><pubDate>Sat, 25 Oct 2025 18:26:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=45705972</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45705972</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45705972</guid></item><item><title><![CDATA[Ghostty with ⌘+F search]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/hamzaq2000/ghostty-cmd-f">https://github.com/hamzaq2000/ghostty-cmd-f</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45701223">https://news.ycombinator.com/item?id=45701223</a></p>
<p>Points: 4</p>
<p># Comments: 2</p>
]]></description><pubDate>Sat, 25 Oct 2025 03:44:57 +0000</pubDate><link>https://github.com/hamzaq2000/ghostty-cmd-f</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45701223</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45701223</guid></item><item><title><![CDATA[Oral History of Ken Thompson [video]]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=OmVHkL0IWk4">https://www.youtube.com/watch?v=OmVHkL0IWk4</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45518843">https://news.ycombinator.com/item?id=45518843</a></p>
<p>Points: 6</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 08 Oct 2025 17:59:19 +0000</pubDate><link>https://www.youtube.com/watch?v=OmVHkL0IWk4</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45518843</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45518843</guid></item><item><title><![CDATA[Show HN: Lightning-Fast Diarization on Apple Silicon]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/narcotic-sh/senko">https://github.com/narcotic-sh/senko</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45369869">https://news.ycombinator.com/item?id=45369869</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 25 Sep 2025 06:36:43 +0000</pubDate><link>https://github.com/narcotic-sh/senko</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45369869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45369869</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Show HN: Navigate by speaker in YouTube videos"]]></title><description><![CDATA[
<p>Thanks :)
Agreed, the limiting factor has been diarization (generating the "who speaks when" data) speed. But the diarization backend of this app that I developed can now process 1 hour of audio in ~8 seconds on a M3 Mac. So that's more or less a solved problem now (at least on Mac), just UI work remains.</p>
]]></description><pubDate>Sun, 21 Sep 2025 17:17:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=45324699</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45324699</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45324699</guid></item><item><title><![CDATA[Show HN: Navigate by speaker in YouTube videos]]></title><description><![CDATA[
<p>Article URL: <a href="https://zanshin.sh">https://zanshin.sh</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45324612">https://news.ycombinator.com/item?id=45324612</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Sun, 21 Sep 2025 17:08:27 +0000</pubDate><link>https://zanshin.sh</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45324612</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45324612</guid></item><item><title><![CDATA[New comment by hamza_q_ in "US High school students' scores fall in reading and math"]]></title><description><![CDATA[
<p>We do know; it's just not in the popular conscience yet.
Read a bit of Marshall McLuhan.</p>
]]></description><pubDate>Thu, 11 Sep 2025 16:49:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=45213645</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45213645</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45213645</guid></item><item><title><![CDATA[New comment by hamza_q_ in "US High school students' scores fall in reading and math"]]></title><description><![CDATA[
<p>Taking bets on how fast Marshall McLuhan re-enters the public conscience :)</p>
]]></description><pubDate>Thu, 11 Sep 2025 16:48:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=45213629</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45213629</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45213629</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Is the decline of reading making politics dumber?"]]></title><description><![CDATA[
<p>It's remarkable that Marshall McLuhan's ideas haven't entered the public conscience yet.</p>
]]></description><pubDate>Fri, 05 Sep 2025 00:56:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=45133941</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45133941</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45133941</guid></item><item><title><![CDATA[New comment by hamza_q_ in "Senko – Very Fast Speaker Diarization"]]></title><description><![CDATA[
<p>1 hour of audio processed in 5 seconds (RTX 4090, Ryzen 9 7950X). ~17x faster than Pyannote 3.1.<p>On M3 MacBook Air, 1 hour in 23.5 seconds (~14x faster).<p>This is a custom speaker diarization pipeline I've developed; it's a modified version of the pipeline found in the excellent 3D-Speaker project by Alibaba Research.<p>My optimizations/modifications were the following:<p>- changed VAD model<p>- multi-threaded Fbank feature extraction<p>- batched inference of CAM++ embeddings model<p>- clustering is accelerated by RAPIDS, when NVIDIA GPU available<p>Optimizations aside, massive credit needs to be given to the CAM++ speaker embeddings model, whose efficiency is where the majority of the speed comes from.<p>This pipeline powers the Zanshin media player, which is an attempt at a usable integration of diarization in a media player.
Check it out here: <a href="https://zanshin.sh" rel="nofollow">https://zanshin.sh</a>
And discuss here: <a href="https://news.ycombinator.com/item?id=45104866">https://news.ycombinator.com/item?id=45104866</a><p>Let me know what you think! Were you also frustrated by how slow speaker diarization is? Does Senko's speed unlock new use cases for you? Cheers, everyone.</p>
]]></description><pubDate>Tue, 02 Sep 2025 17:40:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=45106442</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45106442</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45106442</guid></item><item><title><![CDATA[Senko – Very Fast Speaker Diarization]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/narcotic-sh/senko">https://github.com/narcotic-sh/senko</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45106441">https://news.ycombinator.com/item?id=45106441</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 02 Sep 2025 17:40:31 +0000</pubDate><link>https://github.com/narcotic-sh/senko</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45106441</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45106441</guid></item><item><title><![CDATA[Show HN: Zanshin – Navigate through media by speaker w/ fast diarization]]></title><description><![CDATA[
<p>残心/Zanshin is a media player that allows you to:<p><pre><code>  - Visualize who speaks when & for how long
  - Jump/skip speaker segments
  - Remove/disable speakers (auto-skip)
  - Set different playback speeds for each speaker
</code></pre>
It's a better, more efficient way to listen to podcasts, interviews, press conferences, etc.<p>It has first-class support for YouTube videos; just drop in a URL. Also supports your local media files. All processing runs on-device.<p>Download today for macOS. Also works on Linux and WSL, but currently without packaging. You can get it running though with just a few terminal commands. Check out the repo for instructions: <a href="https://github.com/narcotic-sh/zanshin" rel="nofollow">https://github.com/narcotic-sh/zanshin</a><p>Zanshin is powered by Senko, a new, very fast, speaker diarization pipeline I've developed.<p>On an M3 MacBook Air, it takes over 5 minutes to process 1 hour of audio using Pyannote 3.1, the leading open-source diarization pipeline. With Senko, it only takes ~24 seconds, a ~14x speed improvement. And on an RTX 4090 + Ryzen 9 7950X machine, processing 1 hour of audio takes just 5 seconds with Senko, a ~17x speed improvement.<p>Senko's speed is what make's Zanshin possible. Senko is a modified version of the speaker diarization pipeline found in the excellent 3D-Speaker project.
Check out Senko here: <a href="https://github.com/narcotic-sh/senko" rel="nofollow">https://github.com/narcotic-sh/senko</a><p>Cheers, everyone; enjoy 残心/Zanshin and Senko. I hope you find them useful.
Let me know what you think!<p>~<p>Side note: I am looking for a job. If you like my work and have an opportunity for me, I'm all ears :)
You can contact me at mhamzaqayyum [at] icloud.com</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45104866">https://news.ycombinator.com/item?id=45104866</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 02 Sep 2025 16:02:19 +0000</pubDate><link>https://zanshin.sh</link><dc:creator>hamza_q_</dc:creator><comments>https://news.ycombinator.com/item?id=45104866</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45104866</guid></item></channel></rss>