multimodal large language models (MLLMs) is a research_field technology tracked in AI research papers.