TACO, a training-free method using NMF to co-factorize audio and visual features from pre-trained models, achieved state-of-the-art unsupervised sound-prompted segmentation. - ThreadSky

arxiv-sound.bsky.social • 96 days ago

TACO, a training-free method using NMF to co-factorize audio and visual features from pre-trained models, achieved state-of-the-art unsupervised sound-prompted segmentation.

Comments

Posting Rules

Comments

Posting Rules

Reply