On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings
arXiv:2603.17246v1
Abstract: Vision-Language Models (VLMs) exhibit a characteristic "cone effect" in which nonlinear encoders map embeddings into highly concentrated regions of the …
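The cone effect named in the abstract can be illustrated with a toy experiment. The sketch below is not the paper's method or data: it assumes a hypothetical stand-in encoder (`random_relu_encoder`, a stack of random linear layers with ReLU) and uses mean pairwise cosine similarity as one plausible measure of how concentrated a set of embeddings is. All names, sizes, and the choice of metric are illustrative assumptions.

```python
import numpy as np


def mean_pairwise_cosine(embeddings: np.ndarray) -> float:
    """Average cosine similarity over all distinct pairs of rows."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    # Guard against an all-zero row (extremely unlikely at this width).
    unit = embeddings / np.clip(norms, 1e-12, None)
    sims = unit @ unit.T
    n = unit.shape[0]
    # Drop the diagonal: self-similarity carries no information here.
    return float((sims.sum() - np.trace(sims)) / (n * (n - 1)))


def random_relu_encoder(x: np.ndarray, depth: int, rng: np.random.Generator) -> np.ndarray:
    """Hypothetical stand-in encoder: `depth` random linear layers, each followed by ReLU."""
    h = x
    for _ in range(depth):
        width = h.shape[1]
        w = rng.normal(size=(width, width)) / np.sqrt(width)
        h = np.maximum(h @ w, 0.0)
    return h


rng = np.random.default_rng(0)
inputs = rng.normal(size=(200, 64))  # synthetic inputs, not medical data

raw_similarity = mean_pairwise_cosine(inputs)
encoded_similarity = mean_pairwise_cosine(random_relu_encoder(inputs, depth=3, rng=rng))

print(f"mean pairwise cosine, raw inputs:     {raw_similarity:.3f}")
print(f"mean pairwise cosine, after encoding: {encoded_similarity:.3f}")
```

In this toy setting the raw Gaussian inputs are close to orthogonal on average, while the encoded vectors share a clearly positive average cosine similarity, meaning they occupy a narrow cone of the embedding space. Whether trained medical VLM encoders behave the same way is the question the paper addresses, and this sketch does not speak to it.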