This repository provides training and evaluation code for the paper
*GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations*. [[Paper Link](https://arxiv.org/abs/2503.20919)]
Key contributions of GatedxLSTM include:
- CLAP-based Cross-modal Alignment: Incorporates Contrastive Language-Audio Pretraining (CLAP) for improved speech-text alignment.
- Gated Modality Fusion: A gating mechanism that emphasises emotionally salient utterances (an illustrative sketch follows this list).
- Dialogical Emotion Decoder (DED): Captures context-aware emotional transitions across conversation turns.
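To give a rough picture of the gating idea, the minimal PyTorch sketch below applies a sigmoid gate to per-utterance speech and text embeddings. The module name `GatedFusion`, the embedding dimension, and the layer choices are illustrative assumptions for this README, not the exact GatedxLSTM implementation from the paper.

```python
import torch
import torch.nn as nn


class GatedFusion(nn.Module):
    """Illustrative sigmoid-gated fusion of speech and text utterance embeddings.

    A generic sketch only; not the exact GatedxLSTM module from the paper.
    """

    def __init__(self, dim: int = 512):
        super().__init__()
        # One scalar gate per utterance, computed from both modalities.
        self.gate = nn.Sequential(
            nn.Linear(2 * dim, dim),
            nn.ReLU(),
            nn.Linear(dim, 1),
            nn.Sigmoid(),
        )

    def forward(self, speech: torch.Tensor, text: torch.Tensor) -> torch.Tensor:
        # speech, text: (batch, dim) CLAP-aligned utterance embeddings.
        g = self.gate(torch.cat([speech, text], dim=-1))  # (batch, 1)
        # The gate re-weights the two modality streams for each utterance.
        return g * speech + (1.0 - g) * text


if __name__ == "__main__":
    fusion = GatedFusion(dim=512)
    s, t = torch.randn(4, 512), torch.randn(4, 512)
    print(fusion(s, t).shape)  # torch.Size([4, 512])
```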
Download the IEMOCAP dataset and extract it to a local directory.
Install the required dependency packages.
Update the dataset path in ./data/preprocess.py, then run:
```bash
python ./data/preprocess.py
```
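For reference, the path to edit in ./data/preprocess.py will look something like the line below; the variable name and value here are hypothetical placeholders, so change whatever path variable the script actually defines to point at your extracted IEMOCAP folder.

```python
# Hypothetical example of the line to edit in ./data/preprocess.py;
# point it at the directory where IEMOCAP was extracted.
IEMOCAP_PATH = "/path/to/IEMOCAP_full_release"
```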
To train and evaluate the model, run:
```bash
python ./Dialogical-Emotion-Decoding/main.py
```
This work has been accepted at ACII 2025.
If you find this work useful, please cite our paper:
```bibtex
@article{li2025gatedxlstm,
  title={GatedxLSTM: A Multimodal Affective Computing Approach for Emotion Recognition in Conversations},
  author={Li, Yupei and Sun, Qiyang and Murthy, Sunil Munthumoduku Krishna and Alturki, Emran and Schuller, Bj{\"o}rn W},
  journal={arXiv preprint arXiv:2503.20919},
  year={2025}
}
```