We have trained some respectable models from scratch!
- Marin-8B-Base: beats Llama 3.1 8B on 14/19 benchmarks
- Marin-8B-Instruct: try it out on HuggingFace: https://huggingface.co/spaces/WillHeld/marin-8b-instruct-ChatUI
- Marin-8B-Base: beats Llama 3.1 8B on 14/19 benchmarks
- Marin-8B-Instruct: try it out on HuggingFace: https://huggingface.co/spaces/WillHeld/marin-8b-instruct-ChatUI
Comments
@dlwh.bsky.social
about the need for an open-source set of scaling law checkpoints!
Since then, I was lucky to play a (small) role in building Marin-8B. Check out the model (including intermediate checkpoints) here:
https://huggingface.co/marin-community/marin-8b-base
And about the Models we are releasing in @dlwh.bsky.social's training retro: https://marin.readthedocs.io/en/latest/reports/marin-8b-retro/