Vision language models (VLMs)

Vision language models (VLMs) are a research field tracked in AI research papers.