Tag: Visual Representation Learning
-
Understanding the visual knowledge of language models
Youโve likely heard that a picture is worth a thousand words, but can a large language model (LLM) get the picture if itโs never seen images before? As it turns out, language models that are trained purely on text have a solid understanding of the visual world. They can write image-rendering code to generate complex…