VLM

VLM is a model in our research taxonomy.

Related papers

DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
Mechanisms of Prompt-Induced Hallucination in Vision-Language Models
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
OpenVTON-Bench: A Large-Scale High-Resolution Benchmark for Controllable Virtual Try-On Evaluation
Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning
Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation
V-CAGE: Context-Aware Generation and Verification for Scalable Long-Horizon Embodied Tasks