Stress Estimation

Sep 1, 2021 min read

Feature late fusion for Valence.

valence

Feature late fusion for Arousal.

arousal

An overview of valence/arousal prediction system.

model

Sentiment analysis is an aspect momentous for application in the future. Especially, emotional stress estimation helps improve our life. In this paper, we solved the problem of stress sub-challenge of Multimodal Sentiment Analysis 2022 (MuSe2022). The aim of the stress task is to predict continuous levels of arousal and valence in continuous times based on audio, video, text and biological signals. In this work, we conducted two steps: feature late fusion and stress estimation. First, various features are provided by the sub-challenge. So we take the strategy of combining features to extract feature fusion. We combined DEEPSPECTRUM and eGeMAPS for arousal; DEEPSPECTRUM, FAUS and Physiological signals for valence by Gated Recurrent Unit (GRU) for multimodal fusion. Second, the valence and arousal prediction were based on combining three ingredients such as Local Attention, GRU, and Bayesian NetWork. The Concordance Correlation Coefficient (CCC) loss function was applied during model training.

Input: There are four types of features provided as Vision, Acoustic, Language, and biosignal.

Output: Predicts valence and arousal of stress in a time continuous manner.

Results: The paper was submitted at the SMA conference 2022 and got the best paper.

(1) E.-B. Choi, Nguyen, Hong-Hai, T. N. Nguyen, and S.-H. Kim, “Stress analysis based on feature-level late fusion,” in Proc. Int. Conf. Smart Media and Applications (SMA2022), Oct. 2022., 2022, pp. 110–114.