3 papers - avg viability 7.0
ACPO is a novel alignment mechanism for vision-language models that prevents hallucinations by asymmetrically constraining preference optimization, leading to improved performance on benchmark tasks.
GeoAlignCLIP enhances fine-grained vision-language alignment in remote sensing through multi-granular consistency learning.
Develop SOTAlign, a framework for aligning vision and language models using semi-supervised optimal transport.