Papers
1–3 of 3Research Paper·Mar 16, 2026
MER-Bench: A Comprehensive Benchmark for Multimodal Meme Reappraisal
Memes represent a tightly coupled, multimodal form of social expression, in which visual context and overlaid text jointly convey nuanced affect and commentary. Inspired by cognitive reappraisal in ps...
8.0 viability
Research Paper·Mar 6, 2026
Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models
Recently, unified multimodal models (UMMs) have made remarkable progress in integrating visual understanding and generation, demonstrating strong potential for complex text-to-image (T2I) tasks. Despi...
7.0 viability
Research Paper·Mar 10, 2026
Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization
Unified vision-language models have made significant progress in multimodal understanding and generation, yet they largely fall short in producing multimodal interleaved outputs, which is a crucial ca...
6.0 viability