<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: narrationbox</title><link>https://news.ycombinator.com/user?id=narrationbox</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 29 Apr 2026 18:41:56 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=narrationbox" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by narrationbox in "VibeVoice: Open-source frontier voice AI"]]></title><description><![CDATA[
<p>Yes, the SOTA is currently much more advanced.</p>
]]></description><pubDate>Tue, 28 Apr 2026 19:25:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47939325</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=47939325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47939325</guid></item><item><title><![CDATA[New comment by narrationbox in "Eleven v3"]]></title><description><![CDATA[
<p>Give us a try; I think we are what you are looking for.<p><a href="https://narrationbox.com" rel="nofollow">https://narrationbox.com</a></p>
]]></description><pubDate>Fri, 06 Jun 2025 10:54:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=44199594</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=44199594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44199594</guid></item><item><title><![CDATA[New comment by narrationbox in "Google Illuminate: Books and papers turned into audio"]]></title><description><![CDATA[
<p>A lot of our customers use us [0] for that, and it works pretty well if executed properly. The voiceovers work best as inserts into an existing podcast. If you look at articles from major news orgs like the NYT, they often have a (usually machine-narrated) voiceover.<p>[0] <a href="https://narrationbox.com" rel="nofollow">https://narrationbox.com</a></p>
]]></description><pubDate>Tue, 10 Sep 2024 18:39:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=41504106</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=41504106</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41504106</guid></item><item><title><![CDATA[New comment by narrationbox in "Another Text to Speech API"]]></title><description><![CDATA[
<p>Yeah, neural codecs are pretty amazing. The most incredible part is that they can do compression well across the <i>temporal</i> domain, something which has been non-trivial.</p>
]]></description><pubDate>Tue, 26 Sep 2023 00:39:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=37652812</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=37652812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37652812</guid></item><item><title><![CDATA[New comment by narrationbox in "Project Gutenberg releases 5k free audiobooks"]]></title><description><![CDATA[
<p>Their sentence segmentation heuristics were not configured correctly. It's not an inherent limitation of the technology itself.<p>The newer transformer-based generators are a bit better in this regard (since they can maintain a longer context window rather than working only on short snippets).</p>
]]></description><pubDate>Mon, 25 Sep 2023 01:29:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=37638735</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=37638735</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37638735</guid></item><item><title><![CDATA[New comment by narrationbox in "PlayHT2.0: State-of-the-Art Generative Voice AI Model for Conversational Speech"]]></title><description><![CDATA[
<p>Mel + multispeaker vocoder is very much a classic (Tacotron-era) TTS approach.</p>
]]></description><pubDate>Fri, 11 Aug 2023 20:16:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=37093469</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=37093469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37093469</guid></item><item><title><![CDATA[New comment by narrationbox in "DeepFilterNet: Noise supression using deep filtering"]]></title><description><![CDATA[
<p>Since it does the signal processing in the Fourier domain, does this suffer from audio artefacts, e.g. hissing in the output? Torch's inverse STFT needs accurate phase to reconstruct cleanly; if the phase is instead estimated (e.g. with Griffin-Lim, which is iterative and only approximate) or the model is not trained sufficiently, you may sometimes get noise in the output.<p><a href="https://pytorch.org/docs/stable/generated/torch.istft.html#torch-istft" rel="nofollow">https://pytorch.org/docs/stable/generated/torch.istft.html#t...</a><p>An alternative would be to use a vocoder network (or just target a neural speech codec like SoundStream).</p>
]]></description><pubDate>Wed, 07 Jun 2023 03:37:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=36222536</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=36222536</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36222536</guid></item><item><title><![CDATA[New comment by narrationbox in "US Marines defeat DARPA robot by hiding under a cardboard box"]]></title><description><![CDATA[
<p>> <i>Recognizes "human" and recognizes "desk". I sit on desk. Does AI mark it as a desk or as a chair?</i><p>Not an issue if the image segmentation is advanced enough. You can train the model to understand "human sitting". It may not generalize to other animals sitting, but human action recognition is perfectly possible right now.</p>
]]></description><pubDate>Wed, 25 Jan 2023 17:05:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=34520903</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=34520903</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34520903</guid></item><item><title><![CDATA[New comment by narrationbox in "NaturalSpeech: End-to-end text to speech synthesis with human-level quality"]]></title><description><![CDATA[
<p>Your average mobile processor doesn't have anywhere near enough processing power to run a state-of-the-art text-to-speech network in real time. Most text-to-speech on mobile hardware is streamed from the cloud.</p>
]]></description><pubDate>Wed, 18 May 2022 05:04:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=31419412</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=31419412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31419412</guid></item><item><title><![CDATA[New comment by narrationbox in "Ask HN: Non-tech professionals on HN?"]]></title><description><![CDATA[
<p>For the high-end stuff, no, but many of the lower-tier jobs are under threat.</p>
]]></description><pubDate>Sat, 09 Oct 2021 15:55:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=28810918</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=28810918</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28810918</guid></item><item><title><![CDATA[New comment by narrationbox in "Show HN: Automated Binance Trading Bot – Buy Low/Sell High"]]></title><description><![CDATA[
<p>We used to be in this field too (<a href="https://kloudtrader.com/narwhal" rel="nofollow">https://kloudtrader.com/narwhal</a>). It is a very crowded market and monetisation is tricky.</p>
]]></description><pubDate>Fri, 23 Apr 2021 13:24:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=26914495</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26914495</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26914495</guid></item><item><title><![CDATA[New comment by narrationbox in "Quickdraw with Google AI"]]></title><description><![CDATA[
<p>You can throw this together pretty quickly using one of the AutoML APIs.</p>
]]></description><pubDate>Mon, 29 Mar 2021 22:19:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=26629005</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26629005</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26629005</guid></item><item><title><![CDATA[New comment by narrationbox in "YouTube can now warn creators about copyright issues before videos are posted"]]></title><description><![CDATA[
<p>So if I am launching a Spotify/Audible-style music/audiobook streaming platform, how will the pricing work out? Do we pay you instead of the original author for any user-uploaded content? If the original author chooses to upload their content onto our platform, do we scan it with your API, explicitly whitelist it, and pay them directly?<p>Your company looks very cool btw. What's your email?</p>
]]></description><pubDate>Thu, 18 Mar 2021 21:42:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=26507872</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26507872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26507872</guid></item><item><title><![CDATA[New comment by narrationbox in "Launch HN: Pry (YC W21) – Finance for Founders"]]></title><description><![CDATA[
<p>What about TransferWise ewallets?</p>
]]></description><pubDate>Thu, 18 Mar 2021 21:39:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=26507847</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26507847</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26507847</guid></item><item><title><![CDATA[New comment by narrationbox in "YouTube can now warn creators about copyright issues before videos are posted"]]></title><description><![CDATA[
<p>Do you have any public pricing or startup plans?</p>
]]></description><pubDate>Thu, 18 Mar 2021 21:38:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=26507836</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26507836</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26507836</guid></item><item><title><![CDATA[New comment by narrationbox in "Launch HN: Pry (YC W21) – Finance for Founders"]]></title><description><![CDATA[
<p>Does the accounting system support Canada and other Commonwealth countries? Or is this US only?</p>
]]></description><pubDate>Tue, 16 Mar 2021 17:58:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=26480434</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26480434</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26480434</guid></item><item><title><![CDATA[New comment by narrationbox in "Launch HN: Enombic (YC S20) – Create your own stock indexes"]]></title><description><![CDATA[
<p>Some ETFs also allow easy investment in stocks of foreign countries without requiring additional brokerages on behalf of the end user. I presume this does not offer that functionality?</p>
]]></description><pubDate>Thu, 25 Feb 2021 19:30:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=26267090</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26267090</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26267090</guid></item><item><title><![CDATA[New comment by narrationbox in "Show HN: Turn scripts into fine-tuned voices via Wiki markups"]]></title><description><![CDATA[
<p>I will look into it, Wiki2SSML looks very handy.</p>
]]></description><pubDate>Tue, 23 Feb 2021 19:58:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=26242119</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26242119</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26242119</guid></item><item><title><![CDATA[New comment by narrationbox in "Show HN: Turn scripts into fine-tuned voices via Wiki markups"]]></title><description><![CDATA[
<p>It looks great. Have you considered adding a visual editor?<p>We have one for our systems:
<a href="https://narrationbox.com" rel="nofollow">https://narrationbox.com</a></p>
]]></description><pubDate>Mon, 22 Feb 2021 04:02:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=26220715</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26220715</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26220715</guid></item><item><title><![CDATA[New comment by narrationbox in "Launch HN: Feroot (YC W21) – security scanner for front-end JavaScript code"]]></title><description><![CDATA[
<p>Are there any free plans or discounts for HN users?</p>
]]></description><pubDate>Thu, 04 Feb 2021 19:34:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=26028905</link><dc:creator>narrationbox</dc:creator><comments>https://news.ycombinator.com/item?id=26028905</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26028905</guid></item></channel></rss>