<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mkaszkowiak</title><link>https://news.ycombinator.com/user?id=mkaszkowiak</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 28 Apr 2026 22:57:34 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mkaszkowiak" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Customize your Fedora with a Dockerfile]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.piaseczny.dev/posts/bootc/">https://blog.piaseczny.dev/posts/bootc/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46512696">https://news.ycombinator.com/item?id=46512696</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 06 Jan 2026 14:29:21 +0000</pubDate><link>https://blog.piaseczny.dev/posts/bootc/</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=46512696</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46512696</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Keep Android Open"]]></title><description><![CDATA[
<p>Google is killing Android. Along with the side-loading changes, I'm losing the desire to keep using it, as it's no longer an open OS.<p>What's the point of those changes? Does Google want to maintain its revenue from Play Store? Feels like a bad long-term decision, especially when Apple is releasing excellent phones.</p>
]]></description><pubDate>Wed, 29 Oct 2025 12:18:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=45745841</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=45745841</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45745841</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Show HN: Improving search ranking with chess Elo scores"]]></title><description><![CDATA[
<p>Happy to see competition in rerankers! Good luck with your product.<p>My questions: what languages do your models currently support? Did you perform multilingual benchmarks? Couldn't find an answer on the website</p>
]]></description><pubDate>Wed, 16 Jul 2025 16:47:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=44584364</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=44584364</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44584364</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Vector databases are the wrong abstraction"]]></title><description><![CDATA[
<p>Easier to handle edge-cases - real examples:<p>- What if certain rows in a table don't need to be embedded?<p>- What if we use a single API key for embedding database rows and user queries and it hits a rate limit - how to prioritize  user queries?<p>- What if some rows should be vectorized using a different model, depending on an external configuration?</p>
]]></description><pubDate>Sun, 03 Nov 2024 17:43:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=42034600</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=42034600</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42034600</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Vector databases are the wrong abstraction"]]></title><description><![CDATA[
<p>I agree with the author - introducing a vector database often isn't worth the extra complexity.<p>Personally, I can vouch for ParadeDB: <a href="https://www.paradedb.com/">https://www.paradedb.com/</a><p>It adds extra extensions to PostgreSQL which enable vector indexing, full text search and BM25. Works great and developers are helpful!<p>The major difference is that you must generate the embeddings by yourself, but I consider it an upside - to each their own :)</p>
]]></description><pubDate>Wed, 30 Oct 2024 12:31:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=41994141</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41994141</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41994141</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "How good can you be at Codenames without knowing any words?"]]></title><description><![CDATA[
<p>I've also thought about creating a Codenames bot: what if we could use semantic similarity to batch words together? Surely, this can be done using a prebuilt embedding model and clustering!<p>After some failed experiments - it performed worse than I thought it will - I've googled the subject, and... it turns out there's a whole paper about ML and codenames :)<p><a href="https://arxiv.org/abs/2105.05885" rel="nofollow">https://arxiv.org/abs/2105.05885</a> (<i>Playing Codenames with Language Graphs and Word Embeddings</i>) - fun to read</p>
]]></description><pubDate>Mon, 12 Aug 2024 12:33:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=41223667</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41223667</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41223667</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: What are you using to parse PDFs for RAG?"]]></title><description><![CDATA[
<p>Glad to hear that :) Thanks for developing Marker!</p>
]]></description><pubDate>Wed, 31 Jul 2024 10:27:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=41117866</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41117866</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41117866</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: What are you using to parse PDFs for RAG?"]]></title><description><![CDATA[
<p>Thanks for answering! In my case, I don't directly use  RAG; but rather post-process documents via LLMs to extract a set of specific answers. That's also why I've asked about deduplication - asking LLM to provide an answer from 2 different data sources (invalid unstructured table text & valid structured table contents) quickly ramps up errors.</p>
]]></description><pubDate>Tue, 30 Jul 2024 21:19:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=41114243</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41114243</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41114243</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: What are you using to parse PDFs for RAG?"]]></title><description><![CDATA[
<p>How do you combine the outputs? Wouldn't there be data duplication between unstructured text and tables?</p>
]]></description><pubDate>Tue, 30 Jul 2024 10:25:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=41107714</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41107714</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41107714</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: What are you using to parse PDFs for RAG?"]]></title><description><![CDATA[
<p>Did you encounter hidden costs when using Azure Document Intelligence? I processed some PDFs using the paid tier, but the resulting costs were way higher than expected, despite using a prebuilt layout model for only structured extraction. Have no clue what could cause it, no extra details on the billing page. Not sure if the price is misleading, or if it's a skill issue on my part :)</p>
]]></description><pubDate>Tue, 30 Jul 2024 08:44:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=41107230</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41107230</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41107230</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: What are you using to parse PDFs for RAG?"]]></title><description><![CDATA[
<p>For my use case, overall Marker seems to work pretty well - but it has issues with tables. Merged cells, misplaced headers, and so forth. I'm currently extracting Polish PDFs that are //not// scanned<p>When compared to Azure Document Intelligence, Marker is really cheap when self-hosted (assuming you fall under the license requirements), but it does not produce high quality data. YMMV.</p>
]]></description><pubDate>Tue, 30 Jul 2024 08:41:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=41107213</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=41107213</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41107213</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Most of Europe is glowing pink under the aurora"]]></title><description><![CDATA[
<p>This aurora was really powerful! I could see it with a naked eye from a town in central Poland, despite cloudy weather and light pollution. Feels great to finally see it in person</p>
]]></description><pubDate>Sat, 11 May 2024 09:53:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=40327150</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=40327150</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40327150</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "A drone that calculates coordinates using a camera and Google Maps"]]></title><description><![CDATA[
<p>Surprised by the amount of negative comments. Kudos to the team! This is very impressive to accomplish in 24h with a 3-man team.</p>
]]></description><pubDate>Mon, 26 Feb 2024 12:18:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=39510413</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39510413</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39510413</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>Thanks for the links! I'll read them :)</p>
]]></description><pubDate>Mon, 05 Feb 2024 18:17:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=39264616</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39264616</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39264616</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>I still don't see how developing high quality software is related to one's personal viewpoint on taxes.</p>
]]></description><pubDate>Mon, 05 Feb 2024 16:42:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=39263194</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39263194</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39263194</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>I agree that it's problematic, however:<p>> welfare/health care system is bad, taxes are not used well<p>There's a widespread lack of trust in the Polish government, which decreased even further during the 2015-2023 period. If the money is being funelled to the ruling politicians' families and friends, why willingly pay high taxes? I believe this is an underlying core issue, which would probably take a new generation to repair.</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:52:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261798</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39261798</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261798</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>Good questions, no clue. The answer probably lies somewhere between a "badly designed tax system" and "stimulating growth of the IT sector".</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:45:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261731</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39261731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261731</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>Similiar laws are in Poland, except they're not really enforced.<p>It's really rare that the tax office would prove a company exists solely for tax optimization. The risk virtually drops to zero if one freelances after the hours and has legitimate invoices with other companies.<p>This often causes mismatch between Polish employees who wish to work remotely abroad, and for ex. employeers from the DACH region, where I've heard the laws are strictly enforced. One party claims there is no risk, and the other claims it's too risky :-) (taking other factors aside, such as employee protection, etc.)</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:18:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261442</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39261442</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261442</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>How is tax optimization related to the quality of developed software?</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:09:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261348</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39261348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261348</guid></item><item><title><![CDATA[New comment by mkaszkowiak in "Ask HN: How many of you are self employed?"]]></title><description><![CDATA[
<p>Taxes.<p>Standard tax rate (on UoP) is 12% up to ~30k USD, the rest is taxed 32%. On top of that, the employer pays a social security fee, its rate rises proportionally to income.<p>As an one-person business, you have two most popular options:<p>- 12% flat tax rate on income, with a flat rate social security fee; (1)<p>- 19% flat tax rate on revenue. The social security fee is dependant on income, but it's less than on UoP. You can write off expenses in this scenario, so the actual tax rate is actually lower. People generally try to write off as much as they can - for example, the tax agency is OK with programmers buying multiple bikes as a means of "transport to clients" ;)<p>You can also write off VAT in both scenarios, effectively making a lot of major purchases (desks, chairs, phones, etc) way cheaper. There's also a 5% tax rate, called IP Box, but it's tricky and doesn't apply for every scenario, so I'm taking this aside.<p>With the employer spending 5k EUR per month (21,7k PLN), you're left with:<p>- 14,6k PLN on UoP<p>- 18,5k PLN on 12% tax<p>- 16,7k PLN on 19% tax, out of which you can potentially recover 3,9k PLN<p>It's easy to see why software developers choose to start a one-person business. It's worth to jump through the hoops to save on taxes.<p>(1) There are actually 3 levels dependant on income, but it's lower than the UoP fee for basically most software developers</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:01:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261269</link><dc:creator>mkaszkowiak</dc:creator><comments>https://news.ycombinator.com/item?id=39261269</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261269</guid></item></channel></rss>