Spanish, Filipino, Amharic, French, German, Basque, Catalan, Galician, Guarani, Telugu, Italian, Pashto, Romanian, Tamil, Urdu, Danish... and many more! All included in the FineWeb2 Community Annotation Sprint! 🔥
💫 Join to build an impactful dataset for your language!
💫 Join to build an impactful dataset for your language!
Comments
- Coordinate with your Language Lead: https://huggingface.co/spaces/HuggingFaceFW/discussion. Or become one if it is missing: https://huggingface.co/spaces/nataliaElv/language-leads-dashboard
- Read the guidelines and start annotating according to the educational value: https://huggingface.co/spaces/data-is-better-together/fineweb-c