<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: gregschoeninger</title><link>https://news.ycombinator.com/user?id=gregschoeninger</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 18 Jun 2026 04:17:12 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=gregschoeninger" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by gregschoeninger in "Lore – Open source version control system designed for scalability"]]></title><description><![CDATA[
<p>We're also working on an open source large asset versioning tool called "oxen" - <a href="https://github.com/Oxen-AI/Oxen" rel="nofollow">https://github.com/Oxen-AI/Oxen</a><p>Would love any feedback on it or contributions if people are interested :)</p>
]]></description><pubDate>Wed, 17 Jun 2026 17:11:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48573391</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=48573391</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48573391</guid></item><item><title><![CDATA[New comment by gregschoeninger in "The future of version control"]]></title><description><![CDATA[
<p>We're working on this project to help with the non-text file and large file problem: <a href="https://github.com/Oxen-AI/Oxen" rel="nofollow">https://github.com/Oxen-AI/Oxen</a><p>Started with the machine learning use case for datasets and model weights but seeing a lot of traction in gaming as well.<p>Always open for feedback and ideas to improve if you want to take it for a spin!</p>
]]></description><pubDate>Sun, 22 Mar 2026 18:46:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47480717</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=47480717</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47480717</guid></item><item><title><![CDATA[New comment by gregschoeninger in "[dead]"]]></title><description><![CDATA[
<p>Over the past ~1.5 years I've been running a research paper club where we dive into interesting/foundational papers in AI/ML. So we naturally have come across a lot of the papers that lead up to DeepSeek-R1. While diving into the DeepSeek papers this week, I decided to compile a list of papers that we've already gone over or I think would be good background reading to get a bigger picture of what's going on under the hood of DeepSeek.<p>Grab a cup of coffee and enjoy!<p><a href="https://www.oxen.ai/blog/no-hype-deepseek-r1-reading-list" rel="nofollow">https://www.oxen.ai/blog/no-hype-deepseek-r1-reading-list</a></p>
]]></description><pubDate>Thu, 30 Jan 2025 05:01:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=42875034</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=42875034</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42875034</guid></item><item><title><![CDATA[New comment by gregschoeninger in "[dead]"]]></title><description><![CDATA[
<p>Hey all,<p>If you haven't seen the Oxen project yet, we have been building an open source unstructured data version control tool.<p>We were inspired by the idea of making large machine learning datasets living & breathing assets that people can collaborate on, rather than the static ones of the past. Lately we have been working hard on optimizing the underlying Merkle Trees and data structures with in Oxen.ai and just released v0.19.4 which provides a bunch of performance upgrades and stability to the internal APIs.<p>To put it all to the test, we decided to benchmark the tool on the 1 million+ images in the classic ImageNet dataset.<p>The TLDR is Oxen.ai is faster than raw uploads to S3, 13x faster than git-lfs, and 5x faster than DVC. The full breakdown can be found here.<p><a href="https://docs.oxen.ai/features/performance" rel="nofollow">https://docs.oxen.ai/features/performance</a><p>If you are in the ML/AI community, or rust aficionados, would love to get your feedback on both the tool and the codebase. We would love some community contribution when it comes to different storage backends and integrations into other data tools.</p>
]]></description><pubDate>Sun, 03 Nov 2024 23:46:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=42037092</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=42037092</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42037092</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Data Version Control"]]></title><description><![CDATA[
<p>Maintainer of Oxen here, we initially built Oxen because DVC was pretty painfully slow to work with, and had a lot of extra bells and whistles that we didn’t need. Under the hood we optimized the merkle tree structure, hashing algorithms, network protocols, etc to make it speedy when it came to large datasets. We have a pretty nice front end at <a href="https://oxen.ai" rel="nofollow">https://oxen.ai</a> for viewing and querying the data as well.<p>Happy to answer any thoughts or questions!</p>
]]></description><pubDate>Sun, 20 Oct 2024 15:17:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=41895851</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=41895851</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41895851</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Paper Club: How Flux.1 models work under the hood"]]></title><description><![CDATA[
<p>Hey all,<p>With Black Forest Labs’ Flux.1 variants being the current state of the art for image gen, we’re doing a technical dive into a few paper that inspired the work, starting with: Scaling Rectified Flow Transformers for High-Resolution Image Synthesis (also known as the Stable Diffusion 3 paper).<p>If you’d like to join the community tomorrow 10 AM PST we’d love to have you. We do it live over zoom and anyone is welcome to join.<p>Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
<a href="https://arxiv.org/abs/2403.03206" rel="nofollow">https://arxiv.org/abs/2403.03206</a><p>Join the paper club:
<a href="https://lu.ma/arxivdive-27" rel="nofollow">https://lu.ma/arxivdive-27</a></p>
]]></description><pubDate>Fri, 13 Sep 2024 03:26:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=41527826</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=41527826</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41527826</guid></item><item><title><![CDATA[Paper Club: How Flux.1 models work under the hood]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.oxen.ai/community">https://www.oxen.ai/community</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41527825">https://news.ycombinator.com/item?id=41527825</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 13 Sep 2024 03:26:34 +0000</pubDate><link>https://www.oxen.ai/community</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=41527825</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41527825</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Using Llama3.1 405B to generate political synthetic data"]]></title><description><![CDATA[
<p>We thought it'd be interesting to see what political biases Llama 3.1 405B has by generating a bunch of "spam" or "ham" messages with it. We started with 5 hand crafted messages and let the LLM take it from there ending up with over 1k.<p>Full process was documented here:<p><a href="https://www.oxen.ai/blog/create-your-own-synthetic-data-with-only-5-political-spam-texts?utm_source=hackernews" rel="nofollow">https://www.oxen.ai/blog/create-your-own-synthetic-data-with...</a><p>Next up we are going to train a classifier on the outputs, as well as do some classical NLP (named entities, keywords, sentiment, etc) on it to see what we find.<p>Mainly a fun side project, but could have some interesting implications assuming candidates are using LLMs in the upcoming elections.</p>
]]></description><pubDate>Fri, 02 Aug 2024 05:43:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=41136394</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=41136394</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41136394</guid></item><item><title><![CDATA[Using Llama3.1 405B to generate political synthetic data]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.oxen.ai/Laurence/political-spam/file/main/texts.parquet?query_id=f6bbb123-1453-4e02-a477-4bebdc379b0e&utm_source=hackernews">https://www.oxen.ai/Laurence/political-spam/file/main/texts.parquet?query_id=f6bbb123-1453-4e02-a477-4bebdc379b0e&utm_source=hackernews</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41136393">https://news.ycombinator.com/item?id=41136393</a></p>
<p>Points: 5</p>
<p># Comments: 3</p>
]]></description><pubDate>Fri, 02 Aug 2024 05:43:59 +0000</pubDate><link>https://www.oxen.ai/Laurence/political-spam/file/main/texts.parquet?query_id=f6bbb123-1453-4e02-a477-4bebdc379b0e&amp;utm_source=hackernews</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=41136393</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41136393</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Fine Tuning a Diffusion Transformer (DiT) from a Single YouTube Video"]]></title><description><![CDATA[
<p>Hey all,<p>We were messing around with PixArt as a way to fine tune DiT's for image generation. I was pretty impressed with the results and thought I'd share.<p><a href="https://www.oxen.ai/ox/PixArtTutorial" rel="nofollow">https://www.oxen.ai/ox/PixArtTutorial</a><p>In this example I downloaded a video from YouTube (the trailer of Wes Anderson's Asteroid City) chopped up the frames, captioned them with LLaVA, and then trained the model to generate in the style of the video. It's only about 340 frames of data so pretty quick to generate and train.<p>I also compare against pure prompting, which the model did not have encoded in it's base parameters.<p>Using PEFT and LoRA, it took less than 3 hours on an A10 GPU on Lambda Labs. So cost about $3 in total. Pretty wild that it worked right out of the gate for that cheap.<p>Hopefully it inspires others for what they could build!</p>
]]></description><pubDate>Fri, 31 May 2024 00:09:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=40530187</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=40530187</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40530187</guid></item><item><title><![CDATA[Fine Tuning a Diffusion Transformer (DiT) from a Single YouTube Video]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.oxen.ai/ox/PixArtTutorial">https://www.oxen.ai/ox/PixArtTutorial</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40530186">https://news.ycombinator.com/item?id=40530186</a></p>
<p>Points: 4</p>
<p># Comments: 2</p>
]]></description><pubDate>Fri, 31 May 2024 00:09:31 +0000</pubDate><link>https://www.oxen.ai/ox/PixArtTutorial</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=40530186</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40530186</guid></item><item><title><![CDATA[New comment by gregschoeninger in "How to train diffusion for text from scratch"]]></title><description><![CDATA[
<p>Hey all,<p>I thought the paper “Discrete Diffusion Modeling by Estimating the ratios of the Data Distribution” was a pretty cool idea, so decided to dive deep into the code, strip it down so I could understand it, then train some models from scratch. My findings are linked here:<p><a href="https://www.oxen.ai/blog/how-to-train-diffusion-for-text-from-scratch" rel="nofollow">https://www.oxen.ai/blog/how-to-train-diffusion-for-text-fro...</a><p>I find the diffusion papers a bit difficult to read and looking at the inputs and outputs of code really help me grok what’s going on.<p>Main takeaways are:<p>1) It is yet to be seen if these techniques will scale in both data and model size
2) Is an interesting technique in general, kind of wild that the Monte Carlo sampling and denoising works at all
3) The infilling isn’t a super big selling point as is because the context length is fixed during diffusion. You’d have to layer in some hacks to make it work well for code completion or other use cases.<p>Curious what you guys think about diffusion for text, and hopefully this gives people a jumping off point for understanding and implementing your own!<p>Props to @louaaron and his team at Stanford and Pika Labs for the initial paper and implementation.</p>
]]></description><pubDate>Tue, 30 Apr 2024 03:25:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=40206925</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=40206925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40206925</guid></item><item><title><![CDATA[How to train diffusion for text from scratch]]></title><description><![CDATA[
<p>Article URL: <a href="https://ghost.oxen.ai/how-to-train-diffusion-for-text-from-scratch/">https://ghost.oxen.ai/how-to-train-diffusion-for-text-from-scratch/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40206924">https://news.ycombinator.com/item?id=40206924</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 30 Apr 2024 03:25:22 +0000</pubDate><link>https://ghost.oxen.ai/how-to-train-diffusion-for-text-from-scratch/</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=40206924</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40206924</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Instruct-Tuning BitNet 1.58"]]></title><description><![CDATA[
<p>This is work done for our arxiv dive paper club where we dive into research papers and implement code to see how the models work in practice. We have some internal use cases for BitNets so thought we'd share the work as we go along. Enjoy!<p>Feel free to join us as we build:
<a href="https://oxen.ai/community" rel="nofollow">https://oxen.ai/community</a></p>
]]></description><pubDate>Mon, 08 Apr 2024 21:52:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=39974117</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39974117</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39974117</guid></item><item><title><![CDATA[Instruct-Tuning BitNet 1.58]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/Oxen-AI/BitNet-1.58-Instruct">https://github.com/Oxen-AI/BitNet-1.58-Instruct</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39974116">https://news.ycombinator.com/item?id=39974116</a></p>
<p>Points: 4</p>
<p># Comments: 2</p>
]]></description><pubDate>Mon, 08 Apr 2024 21:52:45 +0000</pubDate><link>https://github.com/Oxen-AI/BitNet-1.58-Instruct</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39974116</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39974116</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"]]></title><description><![CDATA[
<p>We used an A10 with 24GB of VRAM, this was enough for PEFT on Mistral-7B</p>
]]></description><pubDate>Fri, 15 Mar 2024 21:42:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=39721028</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39721028</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39721028</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"]]></title><description><![CDATA[
<p>The goal is to iteratively create training data and add it to its own training set. The LLM acts as its own judge and scores its own responses to decide if it should add the data. It’s expensive to have a human in the loop labeling preferences, so the folks at Meta showed you can have a clever prompt and fine tune the model to judge its own responses.</p>
]]></description><pubDate>Fri, 15 Mar 2024 21:41:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=39721019</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39721019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39721019</guid></item><item><title><![CDATA[New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"]]></title><description><![CDATA[
<p>Hey all,<p>After reading the Self-Rewarding Language Models paper by the team at Meta, it felt very approachable and reproducible, so we spent some time implementing it.<p>The scripts provided take any base model and put it in a loop of:<p>1) Supervised fine-tuning on an initial dataset<p>2) Generating new prompts using the SFT<p>3) Generating N responses per prompt<p>4) Scoring the generated responses 1-5<p>5) Running DPO on the rewards from the model itself.<p>We've run it through one loop starting with a Mistral-7b base model and the results are pretty encouraging so far.<p>Feel free to check it out or run it for yourself and let us know what you think:<p><a href="https://github.com/Oxen-AI/Self-Rewarding-Language-Models">https://github.com/Oxen-AI/Self-Rewarding-Language-Models</a></p>
]]></description><pubDate>Fri, 15 Mar 2024 20:46:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=39720537</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39720537</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39720537</guid></item><item><title><![CDATA[Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/Oxen-AI/Self-Rewarding-Language-Models">https://github.com/Oxen-AI/Self-Rewarding-Language-Models</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39720536">https://news.ycombinator.com/item?id=39720536</a></p>
<p>Points: 23</p>
<p># Comments: 5</p>
]]></description><pubDate>Fri, 15 Mar 2024 20:46:08 +0000</pubDate><link>https://github.com/Oxen-AI/Self-Rewarding-Language-Models</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39720536</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39720536</guid></item><item><title><![CDATA[New comment by gregschoeninger in ""Road to Sora" Paper Reading List"]]></title><description><![CDATA[
<p>Hey all,<p>Have been diving into the Sora technical report for our paper club on Friday, and decided it would be nice to have a reading list of the background papers need to fully grok everything that is going on in that technical report - each with a little description of the part of the pipeline it would be used for (or a previous state of the art technique that was referenced in the review).<p>We are going to pick a few of the top papers and go over them as a group in the coming Fridays, so join us if you'd like! It's at 10am PST on Fridays over Zoom.<p>Paper Reading List:<p><a href="https://www.oxen.ai/blog/road-to-sora-reading-list" rel="nofollow">https://www.oxen.ai/blog/road-to-sora-reading-list</a><p>Technical Report:<p><a href="https://openai.com/research/video-generation-models-as-world-simulators" rel="nofollow">https://openai.com/research/video-generation-models-as-world...</a><p>Join the paper club:<p><a href="https://lu.ma/oxenbookclub" rel="nofollow">https://lu.ma/oxenbookclub</a></p>
]]></description><pubDate>Tue, 05 Mar 2024 06:18:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=39600019</link><dc:creator>gregschoeninger</dc:creator><comments>https://news.ycombinator.com/item?id=39600019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39600019</guid></item></channel></rss>