3 papers - avg viability 4.7
A novel multimodal framework for robust Facial Action Unit detection leveraging advanced alignment and state space modeling techniques.
A multimodal system for blended emotion recognition that leverages late fusion of specialized encoders, including a novel application of Gemini Embedding 2.0 for competitive accuracy with short video inputs.
HyDRA enhances multimodal emotion recognition through a novel reasoning architecture that reconciles diverse emotional cues.