Multimodal Generation

Trending

3papers

7.0viability

+100%30d

Papers

1–3 of 3

Research Paper·Mar 16, 2026

MER-Bench: A Comprehensive Benchmark for Multimodal Meme Reappraisal

Memes represent a tightly coupled, multimodal form of social expression, in which visual context and overlaid text jointly convey nuanced affect and commentary. Inspired by cognitive reappraisal in ps...

8.0 viability

Research Paper·Mar 6, 2026

Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Recently, unified multimodal models (UMMs) have made remarkable progress in integrating visual understanding and generation, demonstrating strong potential for complex text-to-image (T2I) tasks. Despi...

7.0 viability

Research Paper·Mar 10, 2026

Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization

Unified vision-language models have made significant progress in multimodal understanding and generation, yet they largely fall short in producing multimodal interleaved outputs, which is a crucial ca...

6.0 viability

Multimodal Generation

Papers

MER-Bench: A Comprehensive Benchmark for Multimodal Meme Reappraisal

Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization

Filters