<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: tirelli</title><link>https://news.ycombinator.com/user?id=tirelli</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 26 Jun 2026 02:11:25 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=tirelli" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by tirelli in "Ask HN: Who wants to be hired? (May 2026)"]]></title><description><![CDATA[
<p>Location: Buenos Aires, Argentina (Italian + Argentine citizenship, EU work authorization)<p>Remote: Yes (US or EU timezone overlap)<p>Willing to relocate: Maybe (open to EU for the right role)<p>Technologies: Python, FastAPI, LangGraph, LangChain, RAG, hybrid retrieval, agent workflows, MCP, PostgreSQL/pgvector, Redis, WebSockets/SSE, TypeScript, Next.js, Docker, AWS<p>Résumé/CV: <a href="https://sebastiantirelli.com" rel="nofollow">https://sebastiantirelli.com</a><p>Email: tirelli@gmail.com</p>
]]></description><pubDate>Thu, 07 May 2026 18:24:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48052917</link><dc:creator>tirelli</dc:creator><comments>https://news.ycombinator.com/item?id=48052917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48052917</guid></item><item><title><![CDATA[New comment by tirelli in "A 13-month-old LlamaIndex bug re-embeds unchanged content"]]></title><description><![CDATA[
<p>Author here. Quick map of the finding for anyone skimming:<p>Bug 1 is in the hashing path. Node.hash, TextNode.hash, and IngestionCache all include metadata via MetadataMode.ALL, which ignores excluded_embed_metadata_keys. Any volatile field (mtime, atime, file size) flips the hash and forces a re-embed of byte-identical content.<p>Bug 2 is that default_file_metadata_func queries POSIX-only stat keys (mtime, atime, created). Whether a given fsspec backend emits those keys decides whether Bug 1 is firing on you today. I source-inspected every backend under the fsspec GitHub org and every built-in in filesystem_spec.<p>Active today (bug fires at day-level precision): local, gcsfs, sshfs + built-in sftp, smb, arrow/HDFS, memory.<p>Masked today (bug dormant, waiting on Bug 2 getting fixed): s3fs, adlfs, ossfs, swiftspec, tosfs, gdrive-fsspec, dropboxdrivefs, ipfsspec, opendalfs, dbfs, http, webhdfs, ftp, github, gist, git.<p>Wrapper: alluxiofs delegates to its wrapped backend.<p>GCS is the outlier on the active side because gcsfs/core.py explicitly sets result["mtime"] = parse(object_metadata["updated"]) as a legacy compatibility alias. There is a TODO about removing it. The code is still there.<p>Once default_file_metadata_func gets its natural one-line fix to use fs.modified(path) instead of POSIX-specific keys, every masked backend activates at sub-second precision simultaneously.<p>Reproducers at github.com/stirelli/llamaindex-embedding-churn (five progressively real levels, level 3 uses real OpenAI API with billed tokens). Fix is PR #21462 against run-llama/llama_index, three lines plus a regression test covering both directions.<p>Happy to answer questions on the benchmark, the fsspec inspection, or the cost math.</p>
]]></description><pubDate>Fri, 24 Apr 2026 17:11:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47893069</link><dc:creator>tirelli</dc:creator><comments>https://news.ycombinator.com/item?id=47893069</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47893069</guid></item><item><title><![CDATA[A 13-month-old LlamaIndex bug re-embeds unchanged content]]></title><description><![CDATA[
<p>Article URL: <a href="https://sebastiantirelli.com/writing/llamaindex-embedding-churn/">https://sebastiantirelli.com/writing/llamaindex-embedding-churn/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47892995">https://news.ycombinator.com/item?id=47892995</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 24 Apr 2026 17:05:35 +0000</pubDate><link>https://sebastiantirelli.com/writing/llamaindex-embedding-churn/</link><dc:creator>tirelli</dc:creator><comments>https://news.ycombinator.com/item?id=47892995</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47892995</guid></item></channel></rss>