VideoLLaMA 3🔥multimodal foundation models for Image and Video Understanding by DAMO Alibaba

Paper: https://huggingface.co/papers/2501.13106
Model: https://huggingface.co/collections/DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ 2B/7B
✨ Apache2.0

Comments