Hacker News: gregschoeninger

New comment by gregschoeninger in "Lore – Open source version control system designed for scalability"

gregschoeninger — Wed, 17 Jun 2026 17:11:08 +0000

We're also working on an open source large asset versioning tool called "oxen" - https://github.com/Oxen-AI/Oxen

Would love any feedback on it or contributions if people are interested :)

New comment by gregschoeninger in "The future of version control"

gregschoeninger — Sun, 22 Mar 2026 18:46:38 +0000

We're working on this project to help with the non-text file and large file problem: https://github.com/Oxen-AI/Oxen

Started with the machine learning use case for datasets and model weights but seeing a lot of traction in gaming as well.

Always open for feedback and ideas to improve if you want to take it for a spin!

New comment by gregschoeninger in "[dead]"

gregschoeninger — Thu, 30 Jan 2025 05:01:11 +0000

Over the past ~1.5 years I've been running a research paper club where we dive into interesting/foundational papers in AI/ML. So we naturally have come across a lot of the papers that lead up to DeepSeek-R1. While diving into the DeepSeek papers this week, I decided to compile a list of papers that we've already gone over or I think would be good background reading to get a bigger picture of what's going on under the hood of DeepSeek.

Grab a cup of coffee and enjoy!

https://www.oxen.ai/blog/no-hype-deepseek-r1-reading-list

New comment by gregschoeninger in "[dead]"

gregschoeninger — Sun, 03 Nov 2024 23:46:30 +0000

Hey all,

If you haven't seen the Oxen project yet, we have been building an open source unstructured data version control tool.

We were inspired by the idea of making large machine learning datasets living & breathing assets that people can collaborate on, rather than the static ones of the past. Lately we have been working hard on optimizing the underlying Merkle Trees and data structures with in Oxen.ai and just released v0.19.4 which provides a bunch of performance upgrades and stability to the internal APIs.

To put it all to the test, we decided to benchmark the tool on the 1 million+ images in the classic ImageNet dataset.

The TLDR is Oxen.ai is faster than raw uploads to S3, 13x faster than git-lfs, and 5x faster than DVC. The full breakdown can be found here.

https://docs.oxen.ai/features/performance

If you are in the ML/AI community, or rust aficionados, would love to get your feedback on both the tool and the codebase. We would love some community contribution when it comes to different storage backends and integrations into other data tools.

New comment by gregschoeninger in "Data Version Control"

gregschoeninger — Sun, 20 Oct 2024 15:17:22 +0000

Maintainer of Oxen here, we initially built Oxen because DVC was pretty painfully slow to work with, and had a lot of extra bells and whistles that we didn’t need. Under the hood we optimized the merkle tree structure, hashing algorithms, network protocols, etc to make it speedy when it came to large datasets. We have a pretty nice front end at https://oxen.ai for viewing and querying the data as well.

Happy to answer any thoughts or questions!

New comment by gregschoeninger in "Paper Club: How Flux.1 models work under the hood"

gregschoeninger — Fri, 13 Sep 2024 03:26:34 +0000

Hey all,

With Black Forest Labs’ Flux.1 variants being the current state of the art for image gen, we’re doing a technical dive into a few paper that inspired the work, starting with: Scaling Rectified Flow Transformers for High-Resolution Image Synthesis (also known as the Stable Diffusion 3 paper).

If you’d like to join the community tomorrow 10 AM PST we’d love to have you. We do it live over zoom and anyone is welcome to join.

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis https://arxiv.org/abs/2403.03206

Join the paper club: https://lu.ma/arxivdive-27

Paper Club: How Flux.1 models work under the hood

gregschoeninger — Fri, 13 Sep 2024 03:26:34 +0000

Article URL: https://www.oxen.ai/community

Comments URL: https://news.ycombinator.com/item?id=41527825

Points: 2

# Comments: 1

New comment by gregschoeninger in "Using Llama3.1 405B to generate political synthetic data"

gregschoeninger — Fri, 02 Aug 2024 05:43:59 +0000

We thought it'd be interesting to see what political biases Llama 3.1 405B has by generating a bunch of "spam" or "ham" messages with it. We started with 5 hand crafted messages and let the LLM take it from there ending up with over 1k.

Full process was documented here:

https://www.oxen.ai/blog/create-your-own-synthetic-data-with...

Next up we are going to train a classifier on the outputs, as well as do some classical NLP (named entities, keywords, sentiment, etc) on it to see what we find.

Mainly a fun side project, but could have some interesting implications assuming candidates are using LLMs in the upcoming elections.

Using Llama3.1 405B to generate political synthetic data

gregschoeninger — Fri, 02 Aug 2024 05:43:59 +0000

Article URL: https://www.oxen.ai/Laurence/political-spam/file/main/texts.parquet?query_id=f6bbb123-1453-4e02-a477-4bebdc379b0e&utm_source=hackernews

Comments URL: https://news.ycombinator.com/item?id=41136393

Points: 5

# Comments: 3

New comment by gregschoeninger in "Fine Tuning a Diffusion Transformer (DiT) from a Single YouTube Video"

gregschoeninger — Fri, 31 May 2024 00:09:31 +0000

Hey all,

We were messing around with PixArt as a way to fine tune DiT's for image generation. I was pretty impressed with the results and thought I'd share.

https://www.oxen.ai/ox/PixArtTutorial

In this example I downloaded a video from YouTube (the trailer of Wes Anderson's Asteroid City) chopped up the frames, captioned them with LLaVA, and then trained the model to generate in the style of the video. It's only about 340 frames of data so pretty quick to generate and train.

I also compare against pure prompting, which the model did not have encoded in it's base parameters.

Using PEFT and LoRA, it took less than 3 hours on an A10 GPU on Lambda Labs. So cost about $3 in total. Pretty wild that it worked right out of the gate for that cheap.

Hopefully it inspires others for what they could build!

Fine Tuning a Diffusion Transformer (DiT) from a Single YouTube Video

gregschoeninger — Fri, 31 May 2024 00:09:31 +0000

Article URL: https://www.oxen.ai/ox/PixArtTutorial

Comments URL: https://news.ycombinator.com/item?id=40530186

Points: 4

# Comments: 2

New comment by gregschoeninger in "How to train diffusion for text from scratch"

gregschoeninger — Tue, 30 Apr 2024 03:25:22 +0000

Hey all,

I thought the paper “Discrete Diffusion Modeling by Estimating the ratios of the Data Distribution” was a pretty cool idea, so decided to dive deep into the code, strip it down so I could understand it, then train some models from scratch. My findings are linked here:

https://www.oxen.ai/blog/how-to-train-diffusion-for-text-fro...

I find the diffusion papers a bit difficult to read and looking at the inputs and outputs of code really help me grok what’s going on.

Main takeaways are:

1) It is yet to be seen if these techniques will scale in both data and model size 2) Is an interesting technique in general, kind of wild that the Monte Carlo sampling and denoising works at all 3) The infilling isn’t a super big selling point as is because the context length is fixed during diffusion. You’d have to layer in some hacks to make it work well for code completion or other use cases.

Curious what you guys think about diffusion for text, and hopefully this gives people a jumping off point for understanding and implementing your own!

Props to @louaaron and his team at Stanford and Pika Labs for the initial paper and implementation.

How to train diffusion for text from scratch

gregschoeninger — Tue, 30 Apr 2024 03:25:22 +0000

Article URL: https://ghost.oxen.ai/how-to-train-diffusion-for-text-from-scratch/

Comments URL: https://news.ycombinator.com/item?id=40206924

Points: 1

# Comments: 1

New comment by gregschoeninger in "Instruct-Tuning BitNet 1.58"

gregschoeninger — Mon, 08 Apr 2024 21:52:45 +0000

This is work done for our arxiv dive paper club where we dive into research papers and implement code to see how the models work in practice. We have some internal use cases for BitNets so thought we'd share the work as we go along. Enjoy!

Feel free to join us as we build: https://oxen.ai/community

Instruct-Tuning BitNet 1.58

gregschoeninger — Mon, 08 Apr 2024 21:52:45 +0000

Article URL: https://github.com/Oxen-AI/BitNet-1.58-Instruct

Comments URL: https://news.ycombinator.com/item?id=39974116

Points: 4

# Comments: 2

New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"

gregschoeninger — Fri, 15 Mar 2024 21:42:00 +0000

We used an A10 with 24GB of VRAM, this was enough for PEFT on Mistral-7B

New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"

gregschoeninger — Fri, 15 Mar 2024 21:41:08 +0000

The goal is to iteratively create training data and add it to its own training set. The LLM acts as its own judge and scores its own responses to decide if it should add the data. It’s expensive to have a human in the loop labeling preferences, so the folks at Meta showed you can have a clever prompt and fine tune the model to judge its own responses.

New comment by gregschoeninger in "Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI"

gregschoeninger — Fri, 15 Mar 2024 20:46:08 +0000

Hey all,

After reading the Self-Rewarding Language Models paper by the team at Meta, it felt very approachable and reproducible, so we spent some time implementing it.

The scripts provided take any base model and put it in a loop of:

1) Supervised fine-tuning on an initial dataset

2) Generating new prompts using the SFT

3) Generating N responses per prompt

4) Scoring the generated responses 1-5

5) Running DPO on the rewards from the model itself.

We've run it through one loop starting with a Mistral-7b base model and the results are pretty encouraging so far.

Feel free to check it out or run it for yourself and let us know what you think:

https://github.com/Oxen-AI/Self-Rewarding-Language-Models

Show HN: Implementation of the "Self-Rewarding Language Models" Paper by MetaAI

gregschoeninger — Fri, 15 Mar 2024 20:46:08 +0000

Article URL: https://github.com/Oxen-AI/Self-Rewarding-Language-Models

Comments URL: https://news.ycombinator.com/item?id=39720536

Points: 23

# Comments: 5

New comment by gregschoeninger in ""Road to Sora" Paper Reading List"

gregschoeninger — Tue, 05 Mar 2024 06:18:16 +0000

Hey all,

Have been diving into the Sora technical report for our paper club on Friday, and decided it would be nice to have a reading list of the background papers need to fully grok everything that is going on in that technical report - each with a little description of the part of the pipeline it would be used for (or a previous state of the art technique that was referenced in the review).

We are going to pick a few of the top papers and go over them as a group in the coming Fridays, so join us if you'd like! It's at 10am PST on Fridays over Zoom.

Paper Reading List:

https://www.oxen.ai/blog/road-to-sora-reading-list

Technical Report:

https://openai.com/research/video-generation-models-as-world...

Join the paper club:

https://lu.ma/oxenbookclub