Help shape the future of multilingual Open Source AI!
Join the FineWeb 2 Community Annotation Sprint to create an open training dataset with full transparency and human validation in many languages.
Review datasets in your language and help identify the best sources for training.
Join the FineWeb 2 Community Annotation Sprint to create an open training dataset with full transparency and human validation in many languages.
Review datasets in your language and help identify the best sources for training.
Comments
https://huggingface.co/spaces/data-is-better-together/fineweb-c
Don't know how to start, want to discuss? Join:
https://huggingface.co/spaces/HuggingFaceFW/discussion