Multimodal Emotion Recognition via Bi-directional Cross-Attention and Temporal Modeling

PDF Viewer

BUILDER'S SANDBOX

Build This Paper

Use an AI coding agent to implement this research.

OpenAI Codex
OpenAI CodexAI Agent

Lightweight coding agent in your terminal.

Claude Code
Claude CodeAI Agent

Agentic coding tool for terminal workflows.

AntiGravity IDE
AntiGravity IDEScaffolding

AI agent mindset installer and workflow scaffolder.

Cursor
CursorIDE

AI-first code editor built on VS Code.

VS Code
VS CodeIDE

Free, open-source editor by Microsoft.

Estimated $9K - $13K over 6-10 weeks.

See exactly what it costs to build this -- with 3 comparable funded startups.

7-day free trial. Cancel anytime.

Discover the researchers behind this paper and find similar experts.

7-day free trial. Cancel anytime.

References (37)

[1]
From Emotions to Violence: Multimodal Fine-Grained Behavior Analysis at the 9th ABAW
2025D. Kollias, S. Zafeiriou et al.
[2]
Advancements in Affective and Behavior Analysis: The 8th ABAW Workshop and Competition
2025Dimitrios Kollias, Panagiotis Tzirakis et al.
[3]
DVD: A Comprehensive Dataset for Advancing Violence Detection in Real-World Scenarios
2025D. Kollias, D. C. Senadeera et al.
[4]
Emotion Recognition with CLIP and Sequential Learning
2025Weiwei Zhou, Chenkun Ling et al.
[5]
Behaviour4All: in-the-wild Facial Behaviour Analysis Toolkit
2024Dimitrios Kollias, Chunchang Shao et al.
[6]
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
2024Joe Dhanith, Shravan Venkatraman et al.
[7]
Enhancing Facial Expression Recognition through Dual-Direction Attention Mixed Feature Networks: Application to 7th ABAW Challenge
2024Josep Cabacas-Maso, Elena Ortega-Beltr'an et al.
[8]
7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition
2024D. Kollias, S. Zafeiriou et al.
[9]
Affective Behaviour Analysis via Integrating Multi-Modal Knowledge
2024Wei Zhang, Feng Qiu et al.
[10]
The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition
2024D. Kollias, Panagiotis Tzirakis et al.
[11]
Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond
2024D. Kollias, V. Sharmanska et al.
[12]
Multi-Label Compound Expression Recognition: C-EXPR Database & Network
2023D. Kollias
[13]
Leveraging TCN and Transformer for effective visual-audio fusion in continuous emotion recognition
2023Weiwei Zhou, Jiada Lu et al.
[14]
ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Emotional Reaction Intensity Estimation Challenges
2023D. Kollias, Panagiotis Tzirakis et al.
[15]
ABAW: Learning from Synthetic Data & Multi-Task Learning Challenges
2022D. Kollias
[16]
NR-DFERNet: Noise-Robust Network for Dynamic Facial Expression Recognition
2022Hanting Li, Ming-Fa Sui et al.
[17]
Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
2022Fuyan Ma, Bin Sun et al.
[18]
Continuous Emotion Recognition using Visual-audio-linguistic Information: A Technical Report for ABAW3
2022Su Zhang, Ruyi An et al.
[19]
Conditional Prompt Learning for Vision-Language Models
2022Kaiyang Zhou, Jingkang Yang et al.
[20]
ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Multi-Task Learning Challenges
2022D. Kollias

Showing 20 of 37 references

Founder's Pitch

"A multimodal framework for robust emotion recognition in video data using cross-attention and temporal modeling."

Emotion RecognitionScore: 6View PDF ↗

Commercial Viability Breakdown

0-10 scale

High Potential

1/4 signals

2.5

Quick Build

1/4 signals

2.5

Series A Potential

0/4 signals

0

Sources used for this analysis

arXiv Paper

Full-text PDF analysis of the research paper

GitHub Repository

Code availability, stars, and contributor activity

Citation Network

Semantic Scholar citations and co-citation patterns

Community Predictions

Crowd-sourced unicorn probability assessments

Analysis model: GPT-4o · Last scored: 3/12/2026

Explore the full citation network and related research.

7-day free trial. Cancel anytime.

Understand the commercial significance and market impact.

7-day free trial. Cancel anytime.

Get detailed profiles of the research team.

7-day free trial. Cancel anytime.

Related Papers

Loading…