3 papers - avg viability 5.7
A novel hybrid memory system for video world models that tracks dynamic subjects even when they are out of view, ensuring motion continuity and realistic simulation.
Pretrain action-conditioned video world models for zero-shot action transfer and efficient adaptation.
STEVO-Bench evaluates the limitations of video world models in decoupling state evolution from observation.