I presented our paper VETIM at #BMVC2023 three weeks ago!
🔍 We show that a single token representing a concept in Stable Diffusion can be learned with supervision only at the CLIP output, without mimicking visual features from sample images.
Check it out: https://ivrl.github.io/vetim/