Do Vision and Language Models Share Concepts?
We present an empirical evaluation and find that language models partially converge towards representations isomorphic to those of vision models. #EMNLP
Paper: https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00698/124631
🎯 We measure the alignment between vision models and LMs by learning a mapping between their vector spaces and evaluating retrieval precision on held-out data.
🧵(2/8)
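For readers who want to see what such a probe can look like, here is a minimal sketch (not the paper's code): it fits a least-squares linear map from LM embeddings to vision-model embeddings on a training split and reports precision@1 on held-out concepts. The synthetic embeddings, the plain least-squares map, and the precision@1 cutoff are all illustrative assumptions.

```python
# Minimal sketch of a vector-space alignment probe, using synthetic data.
# In a real setup, lm_emb / vm_emb would be embeddings of the same concepts
# extracted from a language model and a vision model.
import numpy as np

rng = np.random.default_rng(0)

lm_emb = rng.normal(size=(1000, 768))          # language-model space (toy)
vm_emb = lm_emb @ rng.normal(size=(768, 512))  # vision-model space (toy, partly isomorphic)
vm_emb += 0.1 * rng.normal(size=vm_emb.shape)  # add noise so alignment is imperfect

train, test = np.arange(800), np.arange(800, 1000)

# Fit a linear map W such that lm_emb[train] @ W approximates vm_emb[train].
W, *_ = np.linalg.lstsq(lm_emb[train], vm_emb[train], rcond=None)

def normalize(x):
    """L2-normalize rows so dot products become cosine similarities."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)

# Map held-out LM vectors into the vision space and retrieve the nearest
# vision vector for each; precision@1 is the fraction retrieved correctly.
pred = normalize(lm_emb[test] @ W)
gold = normalize(vm_emb[test])
nearest = np.argmax(pred @ gold.T, axis=1)
precision_at_1 = np.mean(nearest == np.arange(len(test)))
print(f"held-out precision@1: {precision_at_1:.2f}")
```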
✨ LMs converge toward the geometry of vision models as they get bigger and better.
🧵(3/8)
Our experiments show that the alignability of LMs and vision models is sensitive to image and language dispersion, polysemy, and frequency.
🧵(4/8)
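The thread doesn't define these factors, so the following is only a hedged illustration: one common way to operationalize "dispersion" is the mean pairwise cosine distance among a concept's embeddings (e.g., vision-model embeddings of several images of the same concept). The function and the toy data below are assumptions, not the paper's exact metric.

```python
# Hedged illustration: dispersion of a concept as the mean pairwise cosine
# distance among its embeddings. Tight clusters -> low dispersion.
import numpy as np

def dispersion(embeddings: np.ndarray) -> float:
    """Mean pairwise cosine distance among the rows of `embeddings`."""
    x = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = x @ x.T                     # pairwise cosine similarities
    iu = np.triu_indices(len(x), k=1)  # each unordered pair counted once
    return float(np.mean(1.0 - sims[iu]))

# Toy usage: a tight cluster scores low, a spread-out cluster scores high.
rng = np.random.default_rng(0)
tight = rng.normal(size=(10, 64)) * 0.05 + rng.normal(size=(1, 64))
spread = rng.normal(size=(10, 64))
print(dispersion(tight), dispersion(spread))
```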
🧵(5/8)
This has implications for:
- the LM understanding debate
- the study of emergent properties
- philosophy
🧵(6/8)
1. The representation spaces of LMs and VMs become more (though only partially) similar as model size grows.
2. Concepts with lower frequency, polysemy, and dispersion can be easier to align.
3. Shared concepts between LMs and VMs might extend beyond nouns.
🧵(7/8)
#NLP #NLProc