<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sangwulee</title><link>https://news.ycombinator.com/user?id=sangwulee</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 25 Jun 2026 02:46:15 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sangwulee" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by sangwulee in "Krea 2: SOTA open-weights 12B image model"]]></title><description><![CDATA[
<p>A lot of coffee for sure. Regarding the training cost, it's hard to give a good estimate because we used a shared kubernetes cluster with inference + research workloads.</p>
]]></description><pubDate>Wed, 24 Jun 2026 16:02:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48661917</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=48661917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48661917</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>The highest quality finetuning data was hand curated internally.
I would say our post training pipeline is quite similar to SeedDream 2.0 ~ 3.0 series from ByteDance. Similar to them, we use extensive quality filters and internal models to get the highest quality possible. Even from there, we still hand curate a hand-picked subset.</p>
]]></description><pubDate>Fri, 01 Aug 2025 00:12:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44751668</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44751668</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44751668</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>I actually tried a few experiments in early exploration stages! I trained a small classifier to judge AI vs non-AI images. Use it as a reward model to do small RL / post training experiments. Sadly, was not too successful. We found that directly finetuning the model on high quality photorealistic image was most reliable.<p>Another note about preference optimisation and RL is that it has really high quality ceiling but needs to be very carefully tuned. It's easy to get perfect anatomy and structure if you decide to completely "collapse" the model. For instance, ChatGPT images are collapsed to have slight yellow color palette. FLUX images always have this glossy, plastic texture with overly blurry background. It's similar to reward hacking behavior you see in LLMs where they sound overly nice and chatty.<p>I had to make a few compromises to balance between "stable, collapsed, boring model" and "unstable, diverse, explorative" model.</p>
]]></description><pubDate>Fri, 01 Aug 2025 00:03:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44751624</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44751624</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44751624</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>The architecture is the same so we found that some LoRAs work out-of-the box, but some LoRAs don't. In those cases, I would expect people to re-run their LoRA finetuning with the trainer they've used.</p>
]]></description><pubDate>Thu, 31 Jul 2025 23:20:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=44751325</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44751325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44751325</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>We used two types of datasets for post-training. Supervised finetuning data and preference data used for RLHF stage. You can actually use less than < 1M samples to significantly boost the aesthetics. Quality matters A LOT. Quantity helps with generalisation and stability of the checkpoints though.</p>
]]></description><pubDate>Thu, 31 Jul 2025 21:39:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=44750498</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44750498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44750498</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>We have not added a separate RTX accelerated version for FLUX.1 Krea, but the model is fully compatible with existing FLUX.1 dev codebase. I don't think we made a separate onnx export for it though. Doing 4~8 bit quantized version with SVDQuant would be a nice follow up so that the checkpoint is more friendly for consumer grade hardware.</p>
]]></description><pubDate>Thu, 31 Jul 2025 21:33:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44750448</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44750448</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44750448</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>FLUX.1 is one of the most popular open weights text-to-image models. We distilled Krea-1 to FLUX.1 [dev] model so that the community can adopt it seamlessly into existing ecosystem. Any finetuning code, workflows, etc that was built on top of FLUX.1 [dev] can be reused with our model :)</p>
]]></description><pubDate>Thu, 31 Jul 2025 21:26:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=44750403</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44750403</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44750403</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>Quick napkin math assuming bfloat16 format : 1B * 16 bits = 16B bits = 2GB.
Since it's a 12B parameter model, you get around ~24GB. Downcasting to bfloat16 from float32 comes with pretty minimal performance degradation, so we uploaded the weights in bfloat16 format.</p>
]]></description><pubDate>Thu, 31 Jul 2025 21:00:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=44750079</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44750079</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44750079</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>I love owls. Photorealism was one of the focus areas for training because "AI look" (e.g. plastic skin) was biggest complaint for FLUX.1 model series. Photorealism was achieved with both careful curation of finetuning and preference dataset.</p>
]]></description><pubDate>Thu, 31 Jul 2025 20:46:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=44749918</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44749918</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44749918</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>Thank you! Glad you find it helpful.
The model is focused on photorealism so it should be able to generate most realistic scenes. Although, I think using 3D engines would be more suitable for typical cases for robotics training since it gives you ground truth data on objects, location, etc.<p>One interesting use case would be if you are focusing on a robotics task that would require perception of realistic scenes.</p>
]]></description><pubDate>Thu, 31 Jul 2025 20:39:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44749849</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44749849</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44749849</guid></item><item><title><![CDATA[New comment by sangwulee in "FLUX.1 Krea [Dev]: An 'Opinionated' Text-to-Image Model"]]></title><description><![CDATA[
<p>Hi there, I'm Sangwu Lee, one of the researchers behind this model. I'm happy to answer any questions here.<p>---<p>I also commented in this other submission: <a href="https://news.ycombinator.com/item?id=44748056">https://news.ycombinator.com/item?id=44748056</a></p>
]]></description><pubDate>Thu, 31 Jul 2025 17:53:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=44748228</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44748228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44748228</guid></item><item><title><![CDATA[New comment by sangwulee in "FLUX.1 Krea: post-trained text-to-image model from Black Forest Labs and Krea"]]></title><description><![CDATA[
<p>Hello HackerNews. My name is Sangwu Lee . I work for Krea and I led the research efforts around the post-training for this model. I'll try to answer any questions you may have, but I recommend you read the technical report I wrote on our site (<a href="https://www.krea.ai/blog/flux-krea-open-source-release" rel="nofollow">https://www.krea.ai/blog/flux-krea-open-source-release</a>).<p>I also see that my colleagues already commented here, but I'll try to answer questions you may have.</p>
]]></description><pubDate>Thu, 31 Jul 2025 17:51:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=44748201</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44748201</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44748201</guid></item><item><title><![CDATA[New comment by sangwulee in "Releasing weights for FLUX.1 Krea"]]></title><description><![CDATA[
<p>Hi! I'm lead researcher on Krea-1. FLUX.1 Krea is a 12B rectified flow model distilled from Krea-1, designed to be compatible with FLUX architecture. Happy to answer any technical questions :)</p>
]]></description><pubDate>Thu, 31 Jul 2025 15:01:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=44746421</link><dc:creator>sangwulee</dc:creator><comments>https://news.ycombinator.com/item?id=44746421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44746421</guid></item></channel></rss>