Excited to see Alibaba DAMO Academy release a multimodal dataset for vision-language pretraining on @hf.co 🔥
Dataset: https://huggingface.co/datasets/DAMO-NLP-SG/multimodal_textbook
Paper: https://huggingface.co/papers/2501.00958
✨ Apache 2.0
✨ 6.5M images + 0.8B text tokens from 22k hours of instructional videos