
adinayakup.bsky.social • 36 days ago

VideoLLaMA 3 🔥 multimodal foundation models for Image and Video Understanding by DAMO Alibaba

Paper: https://huggingface.co/papers/2501.13106
Model: https://huggingface.co/collections/DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ 2B/7B
✨ Apache 2.0
