๐ 50,000+ annotations reached! The FineWeb2-C community is helping build better language models on annotation at a time.
๐ Current stats:
- 115 languages represented
- 419 amazing contributors
- 24 languages with complete datasets
But we're not done yet! ๐งต
๐ Current stats:
- 115 languages represented
- 419 amazing contributors
- 24 languages with complete datasets
But we're not done yet! ๐งต
Comments
- Kinyarwanda (91.2% done!)
- Tamil (80% done)
- Japanese (76% done)
- Tigrinya (74.5% done)
Join us in pushing them over the finish line! ๐