<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: alder</title><link>https://news.ycombinator.com/user?id=alder</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 02 Jul 2026 00:10:22 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=alder" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Simplified/Traditional support is now deployed. Really glad you like it, thanks!</p>
]]></description><pubDate>Fri, 26 Jun 2026 16:59:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48688947</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48688947</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48688947</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>I deployed a new release with a Simplified/Traditional switch at the top of the workspace. Thanks for the OpenCC link. Please let me know if anything's off.</p>
]]></description><pubDate>Fri, 26 Jun 2026 16:55:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48688896</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48688896</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48688896</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>There's a related feature already. The mic icon in the gap control plays the chunk, records you, and plays it back (the recording stays local). But if you mean a tool that automatically grades your pronunciation, I never thought about that. Interesting area to explore though.</p>
]]></description><pubDate>Fri, 26 Jun 2026 11:08:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48685213</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48685213</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48685213</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Yes, you're right, public speaking is not really my thing. I'm a tech nerd. I should probably rely on partnerships with other people to help it grow. But the first thing I wanted to find out was if the app is actually useful to anyone other than myself. And the response from this community is encouraging.
I also thought about letting content producers publish their content/courses as collections here with restricted access. The app already supports that. It is also possible to realign the transcript to their "gold" transcript. They would have instant Anki cards in multiple languages, and the shadowing practice for their subscribers. I would have users, win-win :) But again I have no idea how to do marketing.</p>
]]></description><pubDate>Fri, 26 Jun 2026 08:08:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48683783</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48683783</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48683783</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks so much for testing Japanese! I'll review the starter deck and what goes into it. At the moment it's a bit random, and the nonsensical LibriVox intro spills in. I'll also try to filter out the function words. I have no way to judge that myself, my Japanese is zero.<p>Glad the mobile interface works. There's an iPhone app too, basically the web app but with native audio support. It can play in the background. Though it's not ready for release yet.<p>The built-in flashcards use FSRS-5, and I believe it's O(1) so it should scale but I haven't actually done any performance testing yet, to be honest.<p>The infinite loop is a bug, I'll dig into that too.
And I'm not sure if you noticed, but I also put up an FSI Japanese course here: <a href="https://lingochunk.com/c/fsi-japanese" rel="nofollow">https://lingochunk.com/c/fsi-japanese</a>. Again, I can't judge the quality myself, so I'd really welcome any feedback.<p>And thank you for the kind review!</p>
]]></description><pubDate>Thu, 25 Jun 2026 19:57:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48678448</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48678448</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48678448</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks!
As far as I understand your idea is to starts from the word and pulls examples from some huge data source. My approach is the other way round: I start from a source (the audio that you want to learn), and the tool extracts only the words that appear in it, with their meaning in that context. I think that hugely simplifies the implementation, and it is more useful for learners. They learn the meaning in a particular context.<p>As for the stack: STT with Soniox (word level timestamps), then spaCy for segmentation, POS and lemmas, then AI enrichment, correcting the lemma when spaCy is wrong. Some languages have no spaCy model at all and others are unreliable. I am trying to do spaCy thing in LLM then. Plus some extra magic for Japanese and Chinese.</p>
]]></description><pubDate>Thu, 25 Jun 2026 18:07:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48677134</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48677134</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48677134</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks, really useful extension link. Proper traditional support probably needs a context aware layer, not a plain lookup. I will experiment with additional LLM enrichment. Appreciate you digging into this!</p>
]]></description><pubDate>Thu, 25 Jun 2026 17:23:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48676582</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48676582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48676582</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks, just to clarify "they" is actually only me :) I'm a contractor and run this through my own company. I try to collect as little as possible. And you're right about the UI fonts it's clearly something I need to fix ASAP. Appreciate the feedback!</p>
]]></description><pubDate>Thu, 25 Jun 2026 16:42:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48676005</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48676005</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48676005</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>No TTS at all in my app :) that was a deliberate choice, only STT. I experimented with many STT options, even self hosting Whisper, but ended up with Soniox. A bit expensive, but reliable. For the AI enrichment I went with Gemini Flash. I also tried Gemma 31B, which is really cheap and surprisingly good, on par with Gemini Flash,  but extremely slow everywhere I tried. So you can make your own calculations :) And thanks for the congrats!</p>
]]></description><pubDate>Thu, 25 Jun 2026 16:26:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48675773</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48675773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48675773</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks, I hope it will be helpful! If anything looks off, please let me know.</p>
]]></description><pubDate>Thu, 25 Jun 2026 16:03:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48675418</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48675418</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48675418</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Yes, the transcriber API I use (Soniox) actually supports more than 60 languages. I just didn't have any automated testing for them. The way I tested was to find audio with a reliable reference transcription and put it through my pipeline. Then compare the results. Also some languages don't have reliable libraries to get part of speech and lemmas, something that flashcard needs.</p>
]]></description><pubDate>Thu, 25 Jun 2026 15:53:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48675277</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48675277</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48675277</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>For segmentation and POS I rely on spaCy zh_core_web_sm, pinyin from pypinyin library. Also the small correction level on top. But I am not a Chinese language expert to judge if it really works and I'll rely on feedback from the users to improve it.</p>
]]></description><pubDate>Thu, 25 Jun 2026 15:19:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48674732</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48674732</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48674732</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks! Yes, it's getting better for Greek but still not on par with other languages. I completed the only 2 Greek levels on Duolingo and they are really boring compared to the German one I am doing now. Easy Greek is a bit above my level, and the number of YouTubers in Greek is tiny compared to German.</p>
]]></description><pubDate>Thu, 25 Jun 2026 15:00:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48674479</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48674479</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48674479</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks! Chinese and Japanese as source languages are still experimental, I did my best to support them but I have to rely on people who actually know the language and  this kind of feedback is really useful. I'll look into adding traditional characters and fixing the pinyin.</p>
]]></description><pubDate>Thu, 25 Jun 2026 14:49:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48674313</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48674313</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48674313</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>To be honest I haven't tested it on a 4K monitor yet, so I am not surprised. There are two controls above the transcript that change the font size and the line spacing, which should help a bit for now. Something to fix, thanks!</p>
]]></description><pubDate>Thu, 25 Jun 2026 14:29:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48673997</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48673997</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48673997</guid></item><item><title><![CDATA[New comment by alder in "Show HN: Turn native language audio into flashcards and shadowing practice"]]></title><description><![CDATA[
<p>Thanks! I am glad you like it! I essentially mine the source audio, and all examples have cloze style gaps (blurring, in my case) that are revealed on the back of the card. I also beep the word in the sentence when you try to play it on the front card in built-in SRS system. Unfortunately that is not implemented in the Anki export, but it is technically possible.</p>
]]></description><pubDate>Thu, 25 Jun 2026 14:24:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48673902</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48673902</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48673902</guid></item><item><title><![CDATA[Show HN: Turn native language audio into flashcards and shadowing practice]]></title><description><![CDATA[
<p>Here is a tool I built initially for myself to help with my German and Greek language studies. It started as a hack for creating Anki cards from native language audio. It extracts the words, finds their base forms (lemmas) and groups the examples by the lemma. At some point I realised that I have a transcription with word level timestamps that opens a lot of other opportunities. So I added a mode to click the first and last word in the transcript and it starts looping with the right gap and repeat count.<p>Another feature I use a lot is selecting an audio fragment, sending a predefined prompt to an AI to "explain grammar" or "explain nuances of meaning" and I still experimenting with prompts.<p>And because shadowing is so easy I also use it as a player to improve my English pronunciation. (I am not a native English speaker.)<p>I made a quick video showing the workflow for creating Anki cards and shadowing: <a href="https://youtu.be/TaR58uuDBvU?si=o5aGLAi2S-BZ7Zy9" rel="nofollow">https://youtu.be/TaR58uuDBvU?si=o5aGLAi2S-BZ7Zy9</a><p>The app supports 15 input languages (Japanese and Chinese are the latest experimental additions), and more than 30 output languages.<p>I would really appreciate it if you could try it <a href="https://lingochunk.com/try" rel="nofollow">https://lingochunk.com/try</a>. I know there are other tools with similar functionality but I created something that fits my workflow and it is fun to build.<p>Also I struggled to find public domain audio for the try page. I'd be grateful if anyone could point me to public domain sources (I used LibriVox, Wikimedia and FSI courses), or if you're a creator, let me feature some of your own recordings with credits and links.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48671886">https://news.ycombinator.com/item?id=48671886</a></p>
<p>Points: 92</p>
<p># Comments: 37</p>
]]></description><pubDate>Thu, 25 Jun 2026 11:29:13 +0000</pubDate><link>https://lingochunk.com/try</link><dc:creator>alder</dc:creator><comments>https://news.ycombinator.com/item?id=48671886</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48671886</guid></item></channel></rss>