ThreadSky
About ThreadSky
Log In
kiran6d.bsky.social
•
90 days ago
ByteDance's UI-TARS, end-to-end GUI agent model based on VLM architecture. It processes screenshots as input and performs human-like interactions.
https://huggingface.co/papers/2501.12326
Your browser does not support HLS video playback.
Comments
Log in
with your Bluesky account to leave a comment
No comments yet
Posting Rules
Be respectful to others
No spam or self-promotion
Stay on topic
Follow Bluesky's terms of service
×
Reply
Post Reply
Comments