Paper
8 June 2023 Adversarial invariant-specific representations fusion network for multimodal sentiment analysis
Jing He, Binghui Su, Zhenwen Sheng, Changfan Zhang, Haonan Yang
Author Affiliations +
Proceedings Volume 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023); 127073R (2023) https://doi.org/10.1117/12.2680996
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 2023, Changsha, China
Abstract
Multimodal Sentiment Analysis (MSA) has achieved substantial progress as an open and practical research field. Several methods in MSA focus on exploring the methods for multimodal data fusion to improve the model performance. However, the heterogeneity gaps pose a significant challenge against the fusion interactions of the multimodal data. This work uses a new Multimodal Sentiment Analysis (MSA) model, the adversarial invariant-specific representations fusion (AISRF) network. AISRF is proposed to achieve modality-invariant representations by narrowing the distribution gaps among different modalities. At the same time, the integrity of the modality-specific representation is maintained. First, the heterogeneity gaps among the modalities are reduced by invoking an adversarial encoder-regressor, and thus the modality-invariant representations are obtained. Second, the decoders are employed to reconstruct the modality-invariant representations and obtain the modality-specific representations. Finally, the cross-modal attention method has been used to perform the cross-modal interactions on the invariant-specific representations from different modalities to perform efficient multimodal fusion. Experiments for comparison with other baseline models have been performed on the prevailing benchmark datasets, viz., CMU-MOSI and CMU-MOSEI, and the results demonstrate that the AISRF model is superior to the baseline models in the multiple evaluation indices.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jing He, Binghui Su, Zhenwen Sheng, Changfan Zhang, and Haonan Yang "Adversarial invariant-specific representations fusion network for multimodal sentiment analysis", Proc. SPIE 12707, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023), 127073R (8 June 2023); https://doi.org/10.1117/12.2680996
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Machine learning

Data modeling

Visualization

Adversarial training

Performance modeling

Data fusion

Feature extraction

Back to Top