Hacker News: marcklingen

Scaling ClickHouse to petabytes of AI observability data

marcklingen — Fri, 13 Mar 2026 16:59:26 +0000

Article URL: https://langfuse.com/blog/2026-03-10-simplify-langfuse-for-scale

Comments URL: https://news.ycombinator.com/item?id=47366898

Points: 5

# Comments: 0

New comment by marcklingen in "ClickHouse acquires Langfuse"

marcklingen — Sun, 18 Jan 2026 01:41:28 +0000

Let me know what features are missing in prompt management

New comment by marcklingen in "ClickHouse acquires Langfuse"

marcklingen — Sat, 17 Jan 2026 14:47:15 +0000

(thank you!)

Ask HN: Who/what is the top-educator/content on how to use ChatGPT/Claude well?

marcklingen — Sun, 20 Apr 2025 17:49:33 +0000

It think there’s a huge advantage in being very good at using ChatGPT and Claude, but high-quality, actionable content on this topic seems surprisingly hard to find—at least in my social feeds and here on HN.

Can you share any good writing on this? It seems like many of the features within chatgpt/claude are undocumented + would benefit from better examples.

There are so many features that are really useful when being used in combination: audio mode to collect top-of-mind ideas while going for a walk, prompt deep research to compile report that goes deep on multiple topics including comparison tables, then use reasoning model to compile summary within a canvas, rewrite with a large non-reasoning model to improve the writing style (4.5 sounds way nicer than o3 imo).

Anecdotally, many of my friends want to "encourage the use of AI across their teams". Thus, there's probably even a good financial incentive and market to create actionable monetized courses on this. Social feeds also do not seem to be saturated with this kind of content yet, so probably it is fairly doable to gain distribution when starting out.

Comments URL: https://news.ycombinator.com/item?id=43745267

Points: 1

# Comments: 0

Open Source LLMOps Stack

marcklingen — Wed, 26 Feb 2025 09:41:38 +0000

Some background: I work on Langfuse and we've been collaborating with LiteLLM.

(LiteLLM is a Python library and proxy/gateway that handles cost management, virtual keys, caching, and rate-limiting for OpenAI or other LLM APIs. Langfuse manages LLM tracing, evaluation, prompt management, and experiments.)

We’ve each been building our open-source projects since early 2023 and learned that many devs and especially platform teams use the two together, so we created an integrated “OSS LLMOps stack.”

This is a fully self-hostable, technology-agnostic setup that lets you (1) Use LLMs via a standardized interface without adding complexity to the application; (2) Keep LLM Tracing, Evaluation, Prompt Management in-house for compliance; (3) Track cost and usage via a single interface, create virtual API keys for attribution of costs

It also enables direct transfer of LLM traces from the LiteLLM proxy to Langfuse. This simplifies the rollout of LLMOps practices (observability and evaluations) across multiple projects—you don't need to instrument all applications.

Additionally, the LiteLLM proxy can fetch and cache prompts from Langfuse's prompt management system, using them as templates for requests made through the proxy.

Both of these workflows can function without the integration, but are easier to manage with it!

We’d love your feedback!

Comments URL: https://news.ycombinator.com/item?id=43182241

Points: 64

# Comments: 7

New comment by marcklingen in "Ask HN: Books about people who did hard things"

marcklingen — Tue, 07 Jan 2025 02:01:27 +0000

The Innovators, Walter Isaacson

It’s interesting to read how many individuals contributed in all sorts of important ways in the history of computing.

New comment by marcklingen in "Show HN: Houseplant – Database Migrations for ClickHouse"

marcklingen — Fri, 27 Dec 2024 23:09:39 +0000

This is interesting! Do you plan to add support for distinguishing between blocking and non-blocking/async migrations? This seems to be a common challenge when using Clickhouse and needing to change the ordering of existing tables.

Context: When adding Clickhouse to Langfuse [1] to store all observability data, the migration DX was a big challenge. Some migrations need to copy/reorder data between tables which can take a long time and thus need be performed in the background/async. Many teams run Langfuse self-hosted and reducing downtime of an upgrade is important, thus, we added some custom logic to run these “background migrations” for now [2]. For regular migrations we use golang-migrate, works decently although DX isn’t as good as eg Prisma for Postgres.

[1] OSS LLM Observability, https://github.com/langfuse/langfuse

[2] https://langfuse.com/self-hosting/background-migrations

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Wed, 18 Dec 2024 23:22:30 +0000

The Langfuse data model is closely inspired by OpenTelemetry and we plan to add a Collector soonish. Until now, the OTel-semantics for LLMs have not been very stable and exhaustive while LLM capabilities are changing frequently (think prompt caching, realtime, multi-modal).

We are tracking this here including potential tradeoffs: https://github.com/orgs/langfuse/discussions/2509

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Wed, 18 Dec 2024 02:38:40 +0000

Thanks! IMO, Discord is good, but GitHub Discussions is the better option for building a growing open-source community. It is indexed and makes it easier to revisit conversations weeks later. Currently we use both but have a strong preference for GitHub Discussions.

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Wed, 18 Dec 2024 00:33:58 +0000

All core features are fully open-source and identical to those in Langfuse Cloud, with no limitations on capabilities or scalability (e.g. all v3 infrastructure changes).

We also offer some optional commercial add-on features that can help iterate faster or support very large teams using Langfuse. However, these features are entirely optional and we do our best to be transparent about this across our docs.

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 21:52:45 +0000

Congrats on the Launch!

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 21:49:42 +0000

As you mentioned, this was a significant trade-off. We faced two choices:

(1) Stick with a single Docker container and Postgres. This option is simple to self-host, operate, and iterate on, but it suffers from poor performance at scale, especially for analytical queries that become crucial as the project grows. Additionally, as more features emerged, we needed a queue and benefited from caching and asynchronous processing, which required splitting into a second container and adding Redis. These features would have been blocked when going for this setup.

(2) Switch to a scalable setup with a robust infrastructure that enables us to develop features that interest the majority of our community. We have chosen this path and prioritized templates and Helm charts to simplify self-hosting. Please let us know if you have any questions or feedback as we transition to v3. We aim to make this process as easy as possible.

Regarding OTel, we are considering adding a collector to Langfuse as the OTel semantics are currently developing well. The needs of the Langfuse community are evolving rapidly, and starting with our own instrumentation has allowed us to move quickly while the semantic conventions were not developed. We are tracking this here and would greatly appreciate your feedback, upvotes, or any comments you have on this thread: https://github.com/orgs/langfuse/discussions/2509

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 21:24:06 +0000

Thanks! Really enjoyed working with you maintainers of other projects to help them offer more native LLM observability and evaluation to their users/communities. There is a lot that goes into making the observability/eval part scalable/useful and requirements change on a weekly basis with new advancements. Same applies to other projects and it makes a lot of sense to integrate.

Overview of community integrations: https://langfuse.com/docs/integrations/overview

Packages that depend on Langfuse: https://langfuse.com/faq/all/packages-depending-on-langfuse

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 20:01:34 +0000

Thank you! Working with your team has been great. I love seeing you ship LLM-powered features and appreciate the feedback you have shared along the way.

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 19:55:01 +0000

thank you! if you have any ideas for improvements after having used Langfuse for a while, please contribute them via github discussions: https://langfuse.com/ideas

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 19:53:34 +0000

Thanks for the feedback, glad that you find Langfuse useful!

Can you create an issue with more details on the API performance problems? We monitor strict SLOs on the public API for Langfuse Cloud and are not aware of any ongoing issues, would love to learn more.

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 19:49:14 +0000

appreciate your constructive feedback!

> i wonder if there are new ops solutions for the realtime apis popping up

This is something we have spent quite some time on already, both on designs internally and talking to teams using Langfuse with realtime applications. IMO the usage patterns are still developing and the data capturing/visualization needs across teams is not aligned. What matters: (1) capture streams, (2) for non-text provide timestamped transcript/labels, (3) capture the difference between user-time and api-level-time (e.g. when catching up on a stream after having categorized the input first).

We are excited to build support for this, if you or others have ideas or a wishlist, please add them to this thread: https://github.com/orgs/langfuse/discussions/4757

> retries for instructor like structured outputs mess up the traces, i wonder if they can be tracked and collapsible

Great feedback. Being able to retroactively downrank llm calls to be `debug` level in order to collapse/hide them by default would be interesting. Added thread for this here: https://github.com/orgs/langfuse/discussions/4758

> chatgpt canvas like "drafting" workflows are on the rise (https://www.latent.space/p/inference-fast-and-slow) and again its noisy to see in a chat flow

Can you share an example trace for this or open a thread on github? Would love to understand this in more detail as I have seen different trace-representations of it -- the best yet was a _git diff_ on a wrapper span for every iteration.

> how often do people actually use the feedback tagging and then subsequently finetuning? i always feel guilty that i dont do it yet and wonder when and where i should.

Have not seen finetuning based on user-feedback a lot as the feedback can be noisy and low in frequency (unless there is a very clear feedback loop built into the product). More common workflow that I have seen: identify new problems via user feedback -> review them manually -> create llm-as-a-judge or other automated evals for this problem -> select "good" examples for fine-tuning based on a mix of different evals that currently run on production data -> sanitize the dataset (e.g. remove PII).

Finetuning has been more popular for structured output, sql generation (clear feedback loop / retries at run-time if the output does not work). More teams fine-tune on all the output that has passed this initial run-time gate for model distillation without further quality controls on the training dataset. They usually then run evals on a test dataset in order to verify whether the fine-tuned hits their quality bar.

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 19:30:44 +0000

Thanks for the feedback.

Being unopinionated and API-first has been a core design decision. We want to build the building blocks that everyone needs while acknowledging that most Langfuse users are very sophisticated teams that have a clear idea of what they want to achieve. Over time we will build more abstractions for common workflows to make it easier to get started but new features will always start API-first.

More on this here: https://langfuse.com/why

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 19:28:44 +0000

Thanks AJ, feedback on GitHub/Discord (like yours) has been very helpful to evolve prompt management from a quick addition of the core platform to one of the most-used features -- for which we then actually needed to change a lot of infrastructure to make it reliable and fast (see blog post linked in the original post)

New comment by marcklingen in "Launch HN: Langfuse (YC W23) – OSS Tracing and Workflows to Improve LLM Apps"

marcklingen — Tue, 17 Dec 2024 16:53:09 +0000

This is a good long-list of projects, although it is not narrowly scoped to tracing/evals/prompt-management: https://github.com/tensorchord/Awesome-LLMOps?tab=readme-ov-...