<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: GirkovArpa</title><link>https://news.ycombinator.com/user?id=GirkovArpa</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 17 Apr 2026 08:42:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=GirkovArpa" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Ask HN: How to OCR a PDF and preserve whitespace?]]></title><description><![CDATA[
<p>I have some rather large PDFs that need to be transcribed, but every service I try has some minor but deal-breaking flaw.<p>Either they don't support PDFs this large (hundreds of pages), are just really bad at English OCR, or, most commonly, don't preserve whitespace correctly.<p>The number one problem is whitespace when it comes to multiple columns (similar to newspapers). Either not putting any spaces between words, or when there are multiple columns of text, putting rows in the wrong order. If it was just a single page, this would still be useful, since I could fix it myself. But I have over 1000 pages.<p>I tried so many free services and trials that I just got charged for forgetting to cancel one (thanks to smallpdf.com for refunding my $12). Is OCR technology just not there yet when it comes to multiple-column pages? Yet, this does not seem to be an issue with newspapers.com, based on my experience using their text search feature. I would like to know what OCR software they are using.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40612190">https://news.ycombinator.com/item?id=40612190</a></p>
<p>Points: 26</p>
<p># Comments: 17</p>
]]></description><pubDate>Fri, 07 Jun 2024 19:43:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=40612190</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=40612190</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40612190</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Scientists have traced human tail loss to a short sequence of genetic code"]]></title><description><![CDATA[
<p>This comment needs to be higher.</p>
]]></description><pubDate>Mon, 25 Mar 2024 22:56:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=39822188</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39822188</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39822188</guid></item><item><title><![CDATA[New comment by GirkovArpa in "“Emergent” abilities in LLMs actually develop gradually and predictably – study"]]></title><description><![CDATA[
<p>> You can ask them to reverse numbers or re-arrange words and they'll faceplant in the same way as soon as the input gets beyond a small threshold. Here surely there wouldn't be an issue with tokenization.<p>My guess is the training data contains many <i>short</i> pairs of forward and backward sequences, but none after a certain threshold length (due to how quickly the number of possible sequences grows with length). This would imply there's no actual reversing going on, and the LLM is instead using the training data as a lookup table.</p>
]]></description><pubDate>Mon, 25 Mar 2024 22:47:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=39822104</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39822104</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39822104</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Show HN: Glossarie – a new, immersive way to learn a language"]]></title><description><![CDATA[
<p>This should work fantastic in theory, since differing vocabulary (not grammar) is the main factor that determines the difficulty of a new language. Putting off this primary obstacle so one can ease into it sounds genius to me. It also agrees with the method hyped by Steve Kaufman, where one should read and speak level-appropriate material.</p>
]]></description><pubDate>Mon, 25 Mar 2024 16:34:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=39818372</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39818372</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39818372</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Anglo-Italian company says it has cracked Bitcoin. People have questions"]]></title><description><![CDATA[
<p>Exactly. The proof they did not actually crack Bitcoin is the fact they said they did.</p>
]]></description><pubDate>Mon, 25 Mar 2024 16:26:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=39818269</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39818269</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39818269</guid></item><item><title><![CDATA[New comment by GirkovArpa in "The baffling intelligence of a single cell: The story of E. coli chemotaxis"]]></title><description><![CDATA[
<p>In the context of Condorcet's jury theorem, the percentages refer to the chance of voting for the correct outcome. Think of a legal trial and there is no ambiguity about the meaning of "50%" is.</p>
]]></description><pubDate>Mon, 25 Mar 2024 16:24:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=39818245</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39818245</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39818245</guid></item><item><title><![CDATA[New comment by GirkovArpa in "The baffling intelligence of a single cell: The story of E. coli chemotaxis"]]></title><description><![CDATA[
<p>According to Condorcet's jury theorem, a committee of 10 identical members may be smarter than a single member.</p>
]]></description><pubDate>Thu, 21 Mar 2024 14:54:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=39779255</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39779255</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39779255</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Netlify just sent me a $104k bill for a simple static site"]]></title><description><![CDATA[
<p>Yes. Back around 2020 they forgave me a $1000 bill for a side project I thought was running on a free tier.</p>
]]></description><pubDate>Tue, 27 Feb 2024 19:12:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=39528291</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=39528291</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39528291</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Show HN: Minimalist CSS Framework"]]></title><description><![CDATA[
<p>I love it!  Going to use this instead of Pure.CSS from now on.</p>
]]></description><pubDate>Tue, 27 Sep 2022 23:42:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=33002595</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=33002595</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33002595</guid></item><item><title><![CDATA[Show HN: GameScripter.JS – Sciter-based game engine inspired by DragonRuby]]></title><description><![CDATA[
<p>If you're not familiar with DragonRuby, you can check out a live demo here:<p><a href="http://fiddle.dragonruby.org.s3-website.us-east-2.amazonaws.com/index.html?tutorial=tutorial-arcade-shooter.html" rel="nofollow">http://fiddle.dragonruby.org.s3-website.us-east-2.amazonaws....</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=33002548">https://news.ycombinator.com/item?id=33002548</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 27 Sep 2022 23:37:04 +0000</pubDate><link>https://girkovarpa.itch.io/gamescripterjs</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=33002548</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33002548</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Show HN: Handwriting Synthesis with Machine Learning and Sciter.js [Demo]"]]></title><description><![CDATA[
<p>Hello HN — This is a desktop app that accepts typed text and synthesizes a handwritten version of it, using recurrent neural networks in real time.  The network is implemented as a library (included in the Releases section), written in 100% Rust.  The GUI is based on the free Sciter.JS library, which supports SVG and the fancy controls. All this is an unofficial port of <a href="https://calligrapher.ai" rel="nofollow">https://calligrapher.ai</a><p>By clicking the download button at the top-left, the output can be saved as SVG.</p>
]]></description><pubDate>Fri, 06 Aug 2021 23:47:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=28094143</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=28094143</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28094143</guid></item><item><title><![CDATA[Show HN: Handwriting Synthesis with Machine Learning and Sciter.js [Demo]]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/GirkovArpa/calligrapher-ai">https://github.com/GirkovArpa/calligrapher-ai</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=28094135">https://news.ycombinator.com/item?id=28094135</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 06 Aug 2021 23:46:43 +0000</pubDate><link>https://github.com/GirkovArpa/calligrapher-ai</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=28094135</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28094135</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Sweden is being shot up"]]></title><description><![CDATA[
<p>> All the languages in Europe descended from the Proto-Indo-European language<p>Except for Basque!</p>
]]></description><pubDate>Wed, 28 Jul 2021 05:57:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=27980628</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=27980628</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27980628</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Show HN: Generate realistic handwriting without neural networks [demo]"]]></title><description><![CDATA[
<p>Hey HN — This is a simple web app that allows you to draw an alphabet, type some text, and then have the text written in your own handwriting.  I’m happy to hear feedback or answer questions about how it works!<p>P.S. Drawing the entire alphabet isn't needed, only the letters you need.<p>Compare to:<p><a href="https://www.calligrapher.ai/" rel="nofollow">https://www.calligrapher.ai/</a><p><a href="https://www.cs.toronto.edu/~graves/handwriting.html" rel="nofollow">https://www.cs.toronto.edu/~graves/handwriting.html</a></p>
]]></description><pubDate>Thu, 22 Jul 2021 00:47:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=27914054</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=27914054</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27914054</guid></item><item><title><![CDATA[Show HN: Generate realistic handwriting without neural networks [demo]]]></title><description><![CDATA[
<p>Article URL: <a href="https://fake-handwriting.herokuapp.com/">https://fake-handwriting.herokuapp.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=27913942">https://news.ycombinator.com/item?id=27913942</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Thu, 22 Jul 2021 00:33:25 +0000</pubDate><link>https://fake-handwriting.herokuapp.com/</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=27913942</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27913942</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Show HN: CalDOM: An agnostic, reactive and minimalist JavaScript UI library"]]></title><description><![CDATA[
<p>Sciter is really great.</p>
]]></description><pubDate>Tue, 20 Jul 2021 03:08:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=27889691</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=27889691</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27889691</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Are we GUI Yet? The state of building user interfaces in Rust"]]></title><description><![CDATA[
<p>> you can't distribute the binaries yourself<p>You can :)  You just have to include a copyright notice unless you purchase an exemption.</p>
]]></description><pubDate>Fri, 16 Jul 2021 23:22:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=27862286</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=27862286</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27862286</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Ask HN: Desktop Programming Language"]]></title><description><![CDATA[
<p>I recommend Sciter.JS as a lightweight alternative to Electron.  It's less than 10mb.<p>If you visit the repo here: <a href="https://github.com/c-smile/sciter-js-sdk" rel="nofollow">https://github.com/c-smile/sciter-js-sdk</a><p>There's a database demo in samples/sqlite.</p>
]]></description><pubDate>Fri, 02 Apr 2021 20:42:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=26675766</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=26675766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26675766</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Nuklear: A cross-platform GUI library in C"]]></title><description><![CDATA[
<p>How does this compare to Sciter?  I understand this is primarily to overlay UIs over full screen applications (like games), but Sciter has that capability as well.  Particularly how easy is it to style things, considering Sciter allows CSS?<p>I did a CTRL+F for "sciter" and usually I find something, but this time, nope.</p>
]]></description><pubDate>Tue, 23 Feb 2021 05:27:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=26234095</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=26234095</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26234095</guid></item><item><title><![CDATA[New comment by GirkovArpa in "Tauri: An Electron alternative written in Rust"]]></title><description><![CDATA[
<p>Would you be able to mention these apps?  I would like to take a look.</p>
]]></description><pubDate>Sat, 20 Feb 2021 21:42:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=26208247</link><dc:creator>GirkovArpa</dc:creator><comments>https://news.ycombinator.com/item?id=26208247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26208247</guid></item></channel></rss>