<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: lignuist</title><link>https://news.ycombinator.com/user?id=lignuist</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 11 Apr 2026 08:09:55 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=lignuist" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>Nitpicking much?<p>As I wrote above, by making sure that I use a placeholder that does not appear in the data, I make sure that it does not cause the issues you describe. And if I was wrong with that assumption, I can at least minimize the effect by choosing a very unlikely sequence as placeholder.<p>I really see no issue here. How do you find valid grammars for fuzzy data in practice?</p>
]]></description><pubDate>Mon, 26 May 2014 17:28:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=7800648</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7800648</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7800648</guid></item><item><title><![CDATA[New comment by lignuist in "PHỞ: Automated workflow for front-end"]]></title><description><![CDATA[
<p>I'm not sure if interchangeable is the right word.
'phởne' and 'phone' yield different result lists.<p>At least Google seems to have a way to detect visually similar letters.</p>
]]></description><pubDate>Mon, 26 May 2014 15:35:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=7800143</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7800143</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7800143</guid></item><item><title><![CDATA[New comment by lignuist in "PHỞ: Automated workflow for front-end"]]></title><description><![CDATA[
<p>I wonder how many potential user are lost due to ungoogleability. :O)</p>
]]></description><pubDate>Mon, 26 May 2014 15:02:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=7799974</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7799974</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7799974</guid></item><item><title><![CDATA[New comment by lignuist in "New Haxe website"]]></title><description><![CDATA[
<p>The idea is to be able to write code in one language and use this code in many different languages.</p>
]]></description><pubDate>Mon, 26 May 2014 13:39:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=7799675</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7799675</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7799675</guid></item><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>I used that strategy for parsing gigabytes of CSVs containing arbitrary natural language from the web - try to get these files fixed, or figure out a grammar for gigabytes of fuzzy data...<p>My approach never failed for me, so telling me that my strategy does not work is a strong claim, where it reliably did the job for me.<p>Your examples are all valid, but what you are describing are theoretical attacks on the method, while the method works in almost all cases in practice. We are talking about two different viewpoints: dealing with large amounts of messy data on one hand and parser theory in an ideal cosmos on the other hand.</p>
]]></description><pubDate>Mon, 26 May 2014 12:52:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=7799531</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7799531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7799531</guid></item><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>> What if there is #COMMA, in one of the fields (but no #COMMA#)?<p>What should happen? Since #COMMA is not #COMMA#, it gets not replaced, because it does not match.<p>Please keep in mind, that I replied to suni's very specific question and did not try to start a discussion about general parser theory. In practice, we find a lot of files that do not respect the grammar, but still need to find a way to make the data accessible.</p>
]]></description><pubDate>Mon, 26 May 2014 09:33:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=7799177</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7799177</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7799177</guid></item><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>You just choose a placeholder that does not appear in the data. You could even implement it in a way that a placeholder is automatically selected upfront that does not appear in the data.<p>When it comes to parsing, the thing is that you usually have to make some assumptions about the document structure.</p>
]]></description><pubDate>Mon, 26 May 2014 04:22:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=7798686</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7798686</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7798686</guid></item><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>I was referencing to "What if the character separating fields is not a comma?".<p>And there it clearly works. I used this technique a few times with success. If you find a CSV file that has mixed field separator types, then you probably found a broken CSV file.</p>
]]></description><pubDate>Mon, 26 May 2014 03:19:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=7798532</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7798532</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7798532</guid></item><item><title><![CDATA[New comment by lignuist in "So You Want to Write Your Own CSV code"]]></title><description><![CDATA[
<p>You can replace all commas with a placeholder (e.g. "#COMMA#"), replace the delimiter with a comma, parse the document and then replace all placeholders in the data with ",".</p>
]]></description><pubDate>Sun, 25 May 2014 17:40:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=7796923</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7796923</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7796923</guid></item><item><title><![CDATA[New comment by lignuist in "Wu-Tang Clan Offered $5M for New Album Only Available as One Copy"]]></title><description><![CDATA[
<p>My perspective on art is a reaction on the elitism of the art scene, so basically my comments are art.<p>Edit/addition: Honestly, I could have much more respect for this project, if Wu-Tang made it only accessible to homeless people, or only to prisoners, but effectively, they make it only accessible to the riches. I really do like the Wu-Tang Clan, but I am really not impressed by this stunt.</p>
]]></description><pubDate>Thu, 03 Apr 2014 05:09:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=7521068</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7521068</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7521068</guid></item><item><title><![CDATA[New comment by lignuist in "Wu-Tang Clan Offered $5M for New Album Only Available as One Copy"]]></title><description><![CDATA[
<p>> He continued: "I don't know how to measure it, but it gives us an idea that what we're doing is being understood by some. And there are some good peers of mine also, who are very high-ranking in the film business and the music business, sending me a lot of good will. It's been real positive.<p>So Wu-Tang Clan fans in Kazakhstan or Tanzania (or even every country other than the U.S.) will probably never be able to listen to this album...? I guess these will be the people who don't "understand" what Wu-Tang Clan is doing, while only the privileged ones "understand" the concept.<p>That's artificial shortage, not art (not talking about the music itself).</p>
]]></description><pubDate>Wed, 02 Apr 2014 22:07:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=7519414</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7519414</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7519414</guid></item><item><title><![CDATA[Anything related to Tesla has been secretly banned from /r/Technology.]]></title><description><![CDATA[
<p>Article URL: <a href="http://www.reddit.com/r/Futurology/comments/21melq/">http://www.reddit.com/r/Futurology/comments/21melq/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=7493730">https://news.ycombinator.com/item?id=7493730</a></p>
<p>Points: 8</p>
<p># Comments: 2</p>
]]></description><pubDate>Sat, 29 Mar 2014 20:07:07 +0000</pubDate><link>http://www.reddit.com/r/Futurology/comments/21melq/</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7493730</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7493730</guid></item><item><title><![CDATA[New comment by lignuist in "Sony releases the Holy Grail: A flexible e-ink screen"]]></title><description><![CDATA[
<p>What is wrong with just offering a reader without forcing its users to also buy the books from the same source?<p>I own a Sony reader and I actually enjoy that it is not tightly bundled with a bookstore.</p>
]]></description><pubDate>Fri, 28 Mar 2014 20:25:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=7489746</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7489746</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7489746</guid></item><item><title><![CDATA[New comment by lignuist in "Pending Comments"]]></title><description><![CDATA[
<p>This is what I feel too. It expect it to streamline the comments and kill the discussions.</p>
]]></description><pubDate>Sat, 22 Mar 2014 09:33:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=7448428</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7448428</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7448428</guid></item><item><title><![CDATA[New comment by lignuist in "Microsoft killed my Pappy"]]></title><description><![CDATA[
<p>> Is Microsoft circa 2014 worse than Google, Apple, or Facebook? We're not nearly as organized as we'd need to be to be as evil as you might think we are.<p>Microsoft is not any worse than the other companies. They are all at the same terrible level.<p>But Microsoft became a bit better over the last years, I would say.</p>
]]></description><pubDate>Sat, 22 Feb 2014 07:47:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=7281318</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7281318</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7281318</guid></item><item><title><![CDATA[New comment by lignuist in "Ask HN: How to create an anonymous site?"]]></title><description><![CDATA[
<p>Better use Anonymouth:
<a href="https://github.com/psal/anonymouth" rel="nofollow">https://github.com/psal/anonymouth</a><p>The translation method is not really effective:<p>> We also show that automated attempts at circumventing stylometry using machine translation may not be as effective, often altering the meaning of text while providing only small drops in accuracy.<p><a href="https://www.cs.drexel.edu/~sa499/papers/adversarial_stylometry.pdf" rel="nofollow">https://www.cs.drexel.edu/~sa499/papers/adversarial_stylomet...</a></p>
]]></description><pubDate>Wed, 19 Feb 2014 10:03:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=7262835</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7262835</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7262835</guid></item><item><title><![CDATA[New comment by lignuist in "You might not need jQuery"]]></title><description><![CDATA[
<p>Citing the AngularJS FAQs<i>:<p>Does Angular use the jQuery library?<p>Yes, Angular can use jQuery if it's present in your app when the application is being bootstrapped. If jQuery is not present in your script path, Angular falls back to its own implementation of the subset of jQuery that we call jQLite.<p>Due to a change to use on()/off() rather than bind()/unbind(), Angular 1.2 only operates with jQuery 1.7.1 or above.<p></i> <a href="http://docs.angularjs.org/misc/faq" rel="nofollow">http://docs.angularjs.org/misc/faq</a></p>
]]></description><pubDate>Thu, 30 Jan 2014 22:05:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=7153653</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7153653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7153653</guid></item><item><title><![CDATA[New comment by lignuist in "Show HN: Unbabel API – Human Corrected Machine Translation"]]></title><description><![CDATA[
<p>> Human Corrected Translations for 1 cent per word<p>"Per word" of the source language, or the target language? Sum of both? What about languages which have a different concept of "words" in written text (e.g. Chinese, Turkish, ...).<p>And by the way... "cent" of which currency? :)<p>Edit: I just saw that the list of supported languages does not contain languages with "exotic" types of word boundaries (yet).</p>
]]></description><pubDate>Thu, 30 Jan 2014 21:04:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=7153354</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7153354</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7153354</guid></item><item><title><![CDATA[New comment by lignuist in "A positioning engine to make overlays, tooltips and dropdowns faster"]]></title><description><![CDATA[
<p>Reminds me of the position helper in JQueryUI. I was able to extract the helper into a small file for using it, without having to load the rest of the framework.<p><a href="http://jqueryui.com/position/" rel="nofollow">http://jqueryui.com/position/</a></p>
]]></description><pubDate>Tue, 28 Jan 2014 09:49:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=7136256</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7136256</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7136256</guid></item><item><title><![CDATA[New comment by lignuist in "We want your old T-shirts"]]></title><description><![CDATA[
<p>I thought the same. I recently saw a jeans couch and think that this fabric is much better for this purpose.</p>
]]></description><pubDate>Sun, 26 Jan 2014 14:53:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=7125234</link><dc:creator>lignuist</dc:creator><comments>https://news.ycombinator.com/item?id=7125234</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7125234</guid></item></channel></rss>