<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: cipherself</title><link>https://news.ycombinator.com/user?id=cipherself</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 03:31:43 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=cipherself" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by cipherself in "On Rendering Diffs"]]></title><description><![CDATA[
<p>For anyone else who's suffering, paste this in the console in devtools:<p><pre><code>  document.getElementsByTagName('main')[0].style.margin = '0 auto';</code></pre></p>
]]></description><pubDate>Fri, 29 May 2026 19:55:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48328362</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=48328362</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48328362</guid></item><item><title><![CDATA[New comment by cipherself in "Project Genie: Experimenting with infinite, interactive worlds"]]></title><description><![CDATA[
<p>It's quite exciting how far we've come from the modern exposition of world models by David Ha and Jürgen Schmidhuber in 2018 <a href="https://worldmodels.github.io/" rel="nofollow">https://worldmodels.github.io/</a></p>
]]></description><pubDate>Fri, 30 Jan 2026 14:00:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=46824466</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46824466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46824466</guid></item><item><title><![CDATA[New comment by cipherself in "Ask HN: How are you doing RAG locally?"]]></title><description><![CDATA[
<p>Here's a Dockerfile that will spin up postgres with pgvector and paradedb <a href="https://gist.github.com/cipherself/5260fea1e2631e9630081fb7db171598" rel="nofollow">https://gist.github.com/cipherself/5260fea1e2631e9630081fb7d...</a><p>You can use pgvector for the vector lookup and paradedb for bm25.</p>
]]></description><pubDate>Thu, 15 Jan 2026 15:31:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46633944</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46633944</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46633944</guid></item><item><title><![CDATA[New comment by cipherself in "Ask HN: Share your personal website"]]></title><description><![CDATA[
<p><a href="https://mohamed.computer" rel="nofollow">https://mohamed.computer</a></p>
]]></description><pubDate>Wed, 14 Jan 2026 19:24:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46621348</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46621348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46621348</guid></item><item><title><![CDATA[New comment by cipherself in "Ask HN: What are you working on? (January 2026)"]]></title><description><![CDATA[
<p>I am trying to get the TLA+ tools to run completely in the browser <a href="https://github.com/tlaplus/tlaplus/tree/master/tlatools" rel="nofollow">https://github.com/tlaplus/tlaplus/tree/master/tlatools</a></p>
]]></description><pubDate>Sun, 11 Jan 2026 18:26:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=46578182</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46578182</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46578182</guid></item><item><title><![CDATA[New comment by cipherself in "Five Years of Tinygrad"]]></title><description><![CDATA[
<p>I don't think this is bad, if you know Python then most of the code will be fine for you. I think you're probably referring to <i>pm_reduce_collapse</i>, but while it looks daunting at first, it really isn't when you consider the alternatives, I'd be curious on how you'd improve it short of creating a DSL.</p>
]]></description><pubDate>Wed, 31 Dec 2025 09:12:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46442607</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46442607</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46442607</guid></item><item><title><![CDATA[New comment by cipherself in "Parsing Advances"]]></title><description><![CDATA[
<p>I have written parsers using parser combinators in Haskell and Clojure. I find that ML-like (Haskell, OCaml, StandardML) languages generally are great at writing parsers, even hand-written ones in it is a superior experience.<p>In this case, this was a project at $EMPLOYER in an existing codebase with colleagues who have never seen Haskell code, using Haskell would've been a major error in judgement.</p>
]]></description><pubDate>Tue, 30 Dec 2025 20:25:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=46437563</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46437563</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46437563</guid></item><item><title><![CDATA[New comment by cipherself in "Parsing Advances"]]></title><description><![CDATA[
<p>One anecdote in the same vein, a couple of months ago, I wanted to parse <i>systemd-networkd</i> INI files in Python and the python built-in ConfigParser [0] and pytest's <i>iniconfig</i> parser [1] couldn't handle multiple sections with the same name so I ended up writing 2 parsers, one using a ParserCombinator library and one by hand and ended up using the latter given it was much simpler to understand and I didn't have to introduce an extra dependency.<p>Admittedly, INI is quite a simple format, hence I mention this as an anecdote.<p>[0] <a href="https://docs.python.org/3/library/configparser.html" rel="nofollow">https://docs.python.org/3/library/configparser.html</a><p>[1] <a href="https://github.com/pytest-dev/iniconfig" rel="nofollow">https://github.com/pytest-dev/iniconfig</a></p>
]]></description><pubDate>Tue, 30 Dec 2025 12:19:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46432582</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46432582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46432582</guid></item><item><title><![CDATA[New comment by cipherself in "Capsudo: Rethinking sudo with object capabilities"]]></title><description><![CDATA[
<p>I have used systemd services before to do this to run an application, I had a user created specifically for the application, and I defined the capabilities the application needed via <i>CapabilityBoundingSet</i> and <i>AmbientCapabilities</i> [0] and I used a lot of stuff from [1] to restrict the application e.g. the sandboxing facilities, restricting the allowed syscalls [2], ...etc. systemd also comes with a useful command <i>systemd analyze security</i> [3]<p>[0] <a href="https://www.freedesktop.org/software/systemd/man/latest/systemd.exec.html#Capabilities" rel="nofollow">https://www.freedesktop.org/software/systemd/man/latest/syst...</a><p>[1] <a href="https://www.freedesktop.org/software/systemd/man/latest/systemd.exec.html" rel="nofollow">https://www.freedesktop.org/software/systemd/man/latest/syst...</a><p>[2] <a href="https://www.freedesktop.org/software/systemd/man/latest/systemd.exec.html#System%20Call%20Filtering" rel="nofollow">https://www.freedesktop.org/software/systemd/man/latest/syst...</a><p>[3] <a href="https://www.freedesktop.org/software/systemd/man/latest/systemd-analyze.html#systemd-analyze%20security%20%5BUNIT...%5D" rel="nofollow">https://www.freedesktop.org/software/systemd/man/latest/syst...</a></p>
]]></description><pubDate>Sat, 13 Dec 2025 11:00:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=46253704</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=46253704</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46253704</guid></item><item><title><![CDATA[New comment by cipherself in "Building a Simple Search Engine That Works"]]></title><description><![CDATA[
<p>That was a great book, I wonder what the 2025 equivalent of it is...</p>
]]></description><pubDate>Mon, 17 Nov 2025 08:32:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=45951812</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45951812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45951812</guid></item><item><title><![CDATA[New comment by cipherself in "Show HN: Spam classifier in Go using Naive Bayes"]]></title><description><![CDATA[
<p>12 (13?) years ago I had also written a Naïve Bayes classifier in Perl <a href="https://github.com/cipherself/NaiveBayes_perl" rel="nofollow">https://github.com/cipherself/NaiveBayes_perl</a><p>IIRC, next thing on my TODO list was to add vectorization. Also (like OP) it uses log probabilities to avoid floating-point underflow.</p>
]]></description><pubDate>Mon, 17 Nov 2025 01:13:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=45950034</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45950034</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45950034</guid></item><item><title><![CDATA[New comment by cipherself in "Heartbeats in Distributed Systems"]]></title><description><![CDATA[
<p>10 years ago I've implemented SCAMP (a gossip protocol) in Clojure, you might find it interesting, the implementation is quite small <a href="https://github.com/cipherself/gossip" rel="nofollow">https://github.com/cipherself/gossip</a></p>
]]></description><pubDate>Fri, 14 Nov 2025 10:59:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45925731</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45925731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45925731</guid></item><item><title><![CDATA[New comment by cipherself in "How often does Python allocate?"]]></title><description><![CDATA[
<p>> I recommend using the `dir` function in a REPL<p>A while back I wrote this <a href="https://mohamed.computer/posts/python-internals-cpython-bytecode/" rel="nofollow">https://mohamed.computer/posts/python-internals-cpython-byte...</a>, perhaps it's interesting for people who use `dir` and wonder what some of the weird things that show up are.</p>
]]></description><pubDate>Fri, 07 Nov 2025 00:10:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=45842171</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45842171</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45842171</guid></item><item><title><![CDATA[New comment by cipherself in "Production RAG: what I learned from processing 5M+ documents"]]></title><description><![CDATA[
<p>Assuming that's what he meant, why would it be considered baseline versus anything else? I am genuinely curious because I'd like to know more about issues people face with this or that vector store in general.</p>
]]></description><pubDate>Tue, 21 Oct 2025 21:13:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45661789</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45661789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45661789</guid></item><item><title><![CDATA[New comment by cipherself in "Code from MIT's 1986 SICP video lectures"]]></title><description><![CDATA[
<p>Moreover, you can have SICP inside emacs by just downloading a package from Melpa:<p><a href="https://melpa.org/#/sicp" rel="nofollow">https://melpa.org/#/sicp</a></p>
]]></description><pubDate>Tue, 21 Oct 2025 10:35:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=45654325</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45654325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45654325</guid></item><item><title><![CDATA[New comment by cipherself in "Production RAG: what I learned from processing 5M+ documents"]]></title><description><![CDATA[
<p>Got it, I think this might make sense for a "conversation" type of search not for an instant search feature because lowest latency is gonna be too high IMO.</p>
]]></description><pubDate>Tue, 21 Oct 2025 10:32:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=45654306</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45654306</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45654306</guid></item><item><title><![CDATA[New comment by cipherself in "Production RAG: what I learned from processing 5M+ documents"]]></title><description><![CDATA[
<p><i>S3 Vectors is hands down the SOTA here</i><p>SOTA for what? Isn't it just a vector store?</p>
]]></description><pubDate>Tue, 21 Oct 2025 00:08:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=45650963</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45650963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45650963</guid></item><item><title><![CDATA[New comment by cipherself in "Production RAG: what I learned from processing 5M+ documents"]]></title><description><![CDATA[
<p>I am working on search but rather for text-to-image retrieval, nevertheless, I am curious if by <i>that's all baked into Azure AI search</i> you also meant <i>synthetic query generation</i> from the grandparent comment. If so, what's your latency for this? And do you extract structured data from the query? If so, do you use LLMs for that?<p>Moreover I am curious why you guys use <i>bm25</i> over <i>SPLADE</i>?</p>
]]></description><pubDate>Tue, 21 Oct 2025 00:04:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=45650931</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45650931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45650931</guid></item><item><title><![CDATA[New comment by cipherself in "Docker Hub Is Down"]]></title><description><![CDATA[
<p>While I haven’t tried --pull=missing, I have tried --pull=never, which I assume is a stricter version and it was still attempting the HEAD request.</p>
]]></description><pubDate>Thu, 25 Sep 2025 07:28:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=45370162</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45370162</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45370162</guid></item><item><title><![CDATA[New comment by cipherself in "Docker Hub Is Down"]]></title><description><![CDATA[
<p>I’ll admit I haven’t checked before posting, perhaps an admin can merge both submissions and change the URL on the one you linked to the one in this submission.</p>
]]></description><pubDate>Wed, 24 Sep 2025 23:56:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45367457</link><dc:creator>cipherself</dc:creator><comments>https://news.ycombinator.com/item?id=45367457</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45367457</guid></item></channel></rss>