OmniVision-968M: a new local VLM for edge devices, fast & small but performant 👏 it's based on SigLIP-so-400M and Qwen-2.5-0.5B 💨 9x less image tokens, super efficient 📖 aligned with SFT and DPO for reducing hallucinations 🔥 Apache 2.0 license Demo hf.co/spaces/NexaAIDev/omnivlm-dpo-demo - ThreadSky

merve.bsky.social • 115 days ago

OmniVision-968M: a new local VLM for edge devices, fast & small but performant 👏

it's based on SigLIP-so-400M and Qwen-2.5-0.5B
💨 9x less image tokens, super efficient
📖 aligned with SFT and DPO for reducing hallucinations
🔥 Apache 2.0 license
Demo https://hf.co/spaces/NexaAIDev/omnivlm-dpo-demo

Comments

Posting Rules

Comments

Posting Rules

Reply