<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: madisonmay</title><link>https://news.ycombinator.com/user?id=madisonmay</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 17 Apr 2026 19:22:38 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=madisonmay" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by madisonmay in "Show HN: Kreuzberg – Modern async Python library for document text extraction"]]></title><description><![CDATA[
<p>pypdfium2 is a great choice and a solid piece of software!<p>You might want to look into <a href="https://github.com/VikParuchuri/surya">https://github.com/VikParuchuri/surya</a> as an alternative to tesseract. Yes, it's associated with a commercial company, but as long as you aren't a company with $5M in ARR or $5M in funding, it's free to use.</p>
]]></description><pubDate>Sat, 15 Feb 2025 13:03:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=43058201</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=43058201</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43058201</guid></item><item><title><![CDATA[New comment by madisonmay in "LLM4Decompile: Decompiling Binary Code with LLM"]]></title><description><![CDATA[
<p>This is an excellent use case for LLM fine-tuning, purely because of the ease of generating a massive dataset of input / output pairs from public C code</p>
]]></description><pubDate>Sun, 17 Mar 2024 13:21:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=39734317</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=39734317</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39734317</guid></item><item><title><![CDATA[New comment by madisonmay in "Our next-generation model: Gemini 1.5"]]></title><description><![CDATA[
<p>It's more like saying "I've upgraded to 128GB of RAM, I'll never use my disk again".</p>
]]></description><pubDate>Thu, 15 Feb 2024 16:55:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=39385058</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=39385058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39385058</guid></item><item><title><![CDATA[New comment by madisonmay in "Llama 2"]]></title><description><![CDATA[
<p>See figure-2</p>
]]></description><pubDate>Tue, 18 Jul 2023 16:57:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=36775638</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=36775638</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36775638</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Psychic - An open-source integration platform for unstructured data"]]></title><description><![CDATA[
<p>Why the decision to license as GPL?</p>
]]></description><pubDate>Mon, 22 May 2023 23:38:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=36038076</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=36038076</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=36038076</guid></item><item><title><![CDATA[New comment by madisonmay in "User In Yer Face, a worst-practise UI experiment (2018)"]]></title><description><![CDATA[
<p>Thanks, I hate it.</p>
]]></description><pubDate>Thu, 18 May 2023 11:49:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=35986662</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=35986662</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35986662</guid></item><item><title><![CDATA[New comment by madisonmay in "How are you using ChatGPT internally at your company?"]]></title><description><![CDATA[
<p>Coding aid for unit tests. Debugging aid for languages / frameworks I'm not particularly familiar with. Work that requires reformatting. Translating from rough drafts to more polished / professional language. Learning more about domains I don't have much expertise in, where I need specific conceptual questions answered.</p>
]]></description><pubDate>Sun, 07 May 2023 00:52:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=35846782</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=35846782</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35846782</guid></item><item><title><![CDATA[RLHF: Reinforcement Learning from Human Feedback]]></title><description><![CDATA[
<p>Article URL: <a href="https://huyenchip.com/2023/05/02/rlhf.html">https://huyenchip.com/2023/05/02/rlhf.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=35807341">https://news.ycombinator.com/item?id=35807341</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 03 May 2023 20:28:04 +0000</pubDate><link>https://huyenchip.com/2023/05/02/rlhf.html</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=35807341</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35807341</guid></item><item><title><![CDATA[New comment by madisonmay in "When you combine two things that are close, but not the same"]]></title><description><![CDATA[
<p>Whether or not to split is more a measure of whether these two concepts are likely to diverge down the road than of whether they share similarity today.</p>
]]></description><pubDate>Fri, 14 Apr 2023 01:45:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=35564560</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=35564560</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35564560</guid></item><item><title><![CDATA[New comment by madisonmay in "Introducing Agents in Haystack: Make LLMs resolve complex tasks"]]></title><description><![CDATA[
<p>Imperfect systems are still useful, and any sufficiently complex system is imperfect.</p>
]]></description><pubDate>Tue, 04 Apr 2023 01:31:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=35433824</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=35433824</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35433824</guid></item><item><title><![CDATA[New comment by madisonmay in "Petals: Run 100B+ language models at home bit-torrent style"]]></title><description><![CDATA[
<p>Interestingly it sounds like offloading could be made quite efficient in a batch setting if you primarily care about throughput rather than latency.  Though I guess for most current LLM applications latency is quite important.</p>
]]></description><pubDate>Mon, 02 Jan 2023 21:30:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=34223938</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=34223938</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=34223938</guid></item><item><title><![CDATA[New comment by madisonmay in "Ask HN: How to get back into AI?"]]></title><description><![CDATA[
<p>Often it might be viable to implement prediction w/o necessarily implementing training (especially if there are published weights or a reference implementation).  Not viable for papers where the key contribution is a change to the pre-training objective / training methodology / optimizer, but useful for papers where the key contribution is architectural.</p>
]]></description><pubDate>Sun, 11 Dec 2022 02:37:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=33939508</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=33939508</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33939508</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Automatically fill PDF templates per API"]]></title><description><![CDATA[
<p>Guessing data security constraints -- I'm likely in a similar boat.</p>
]]></description><pubDate>Mon, 08 Aug 2022 11:39:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=32384351</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=32384351</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32384351</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Automatically fill PDF templates per API"]]></title><description><![CDATA[
<p>I'm getting a variety of CORS errors in console.  Maybe this helps:<p>```
Access to XMLHttpRequest at '<a href="https://api.doqs.dev/v1/organization" rel="nofollow">https://api.doqs.dev/v1/organization</a>' from origin '<a href="https://app.doqs.dev" rel="nofollow">https://app.doqs.dev</a>' has been blocked by CORS policy: Response to preflight request doesn't pass access control check: No 'Access-Control-Allow-Origin' header is present on the requested resource.
```</p>
]]></description><pubDate>Mon, 08 Aug 2022 11:38:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=32384349</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=32384349</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32384349</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Automatically fill PDF templates per API"]]></title><description><![CDATA[
<p>Awesome idea, but the website seems unstable. Wasn't able to log in after sign-up :/</p>
]]></description><pubDate>Sun, 07 Aug 2022 21:26:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=32380075</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=32380075</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32380075</guid></item><item><title><![CDATA[New comment by madisonmay in "A basic introduction to NumPy's einsum"]]></title><description><![CDATA[
<p>For more efficient einsum, see projects like <a href="https://optimized-einsum.readthedocs.io/en/stable/path_finding.html#introduction" rel="nofollow">https://optimized-einsum.readthedocs.io/en/stable/path_findi...</a>.</p>
]]></description><pubDate>Sun, 10 Apr 2022 00:12:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=30973348</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=30973348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30973348</guid></item><item><title><![CDATA[New comment by madisonmay in "PyFlow – Visual scripting framework for Python – NodeRED alternative?"]]></title><description><![CDATA[
<p>I suppose so, but perhaps trying to prevent the spaghettification has some positive benefits in terms of DRY + code structure.</p>
]]></description><pubDate>Sun, 16 Jan 2022 19:46:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=29959260</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=29959260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29959260</guid></item><item><title><![CDATA[New comment by madisonmay in "OpenAI disbands its robotics research team"]]></title><description><![CDATA[
<p>Wojciech stated this pretty explicitly on his Gradient Dissent podcast a few months back.</p>
]]></description><pubDate>Sat, 17 Jul 2021 21:22:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=27868966</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=27868966</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27868966</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Parrot.vc – I forced a bot to read 65,000 VC tweets and it became a VC"]]></title><description><![CDATA[
<p>Sweet! Looking forward to it.</p>
]]></description><pubDate>Wed, 04 Dec 2019 22:35:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=21707985</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=21707985</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=21707985</guid></item><item><title><![CDATA[New comment by madisonmay in "Show HN: Parrot.vc – I forced a bot to read 65,000 VC tweets and it became a VC"]]></title><description><![CDATA[
<p>@nloui any chance you're willing to share your dataset? Would be fun to replicate this with GPT-2 fine-tuning instead of a Markov chain.</p>
]]></description><pubDate>Wed, 04 Dec 2019 22:05:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=21707717</link><dc:creator>madisonmay</dc:creator><comments>https://news.ycombinator.com/item?id=21707717</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=21707717</guid></item></channel></rss>