VLM
VLM is a model in our research taxonomy.
Related papers
- DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
- Mechanisms of Prompt-Induced Hallucination in Vision-Language Models
- DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
- OpenVTON-Bench: A Large-Scale High-Resolution Benchmark for Controllable Virtual Try-On Evaluation
- Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning
- Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation
- V-CAGE: Context-Aware Generation and Verification for Scalable Long-Horizon Embodied Tasks