Optimal Target-Oriented Knowledge Transportation For Aspect-Based Multimodal Sentiment Analysis.

Authors

  • Linhao Zhang Chinese Academy of Sciences.
  • Li Jin Chinese Academy of Sciences.
  • Guangluan Xu Chinese Academy of Sciences.
  • Xiaoyu Li Chinese Academy of Sciences.
  • Xian Sun Chinese Academy of Sciences.
  • Zequn Zhang Chinese Academy of Sciences.
  • Yanan Zhang Sichuan University.
  • Qi Li Beijing Normal University.

DOI:

https://doi.org/10.9781/ijimai.2024.02.005

Keywords:

Aspect-Based Multimodal Sentiment Analysis, Optimal Transport, Social Media Opinion Mining

Abstract

Aspect-based multimodal sentiment analysis under social media scenario aims to identify the sentiment polarities of each aspect term, which are mentioned in a piece of multimodal user-generated content. Previous approaches for this interdisciplinary multimodal task mainly rely on coarse-grained fusion mechanisms from the data-level or decision-level, which have the following three shortcomings:(1) ignoring the category knowledge of the sentiment target mentioned in the text) in visual information. (2) unable to assess the importance of maintaining target interaction during the unimodal encoding process, which results in indiscriminative representations considering various aspect terms. (3) suffering from the semantic gap between multiple modalities. To tackle the above challenging issues, we propose an optimal target-oriented knowledge transportation network (OtarNet) for this task. Firstly, the visual category knowledge is explicitly transported through input space translation and reformulation. Secondly, with the reformulated knowledge containing the target and category information, the target sensitivity is well maintained in the unimodal representations through a multistage target-oriented interaction mechanism. Finally, to eliminate the distributional modality gap by integrating complementary knowledge, the target-sensitive features of multiple modalities are implicitly transported based on the optimal transport interaction module. Our model achieves state-of-theart performance on three benchmark datasets: Twitter-15, Twitter-17 and Yelp, together with the extensive ablation study demonstrating the superiority and effectiveness of OtarNet.

Downloads

Download data is not yet available.

Downloads

Published

2025-08-29
Metrics
Views/Downloads
  • Abstract
    0
  • PDF
    0

How to Cite

Zhang, L., Jin, L., Xu, G., Li, X., Sun, X., Zhang, Z., … Li, Q. (2025). Optimal Target-Oriented Knowledge Transportation For Aspect-Based Multimodal Sentiment Analysis. International Journal of Interactive Multimedia and Artificial Intelligence, 9(4), 59–69. https://doi.org/10.9781/ijimai.2024.02.005