A short video sentiment analysis model based on multimodal feature fusion

Cited by: 0
Authors
Shi, Hongyu [1]
Affiliations
[1] Guangxi Technol Coll Machinery & Elect, Sch Cultural Tourism & Management, Nanning 530000, Peoples R China
Source
Keywords
Emotional analysis; Feature fusion; Multi-head attention mechanism; Short videos; Text; Voice; EMOTION RECOGNITION; PREDICTION; LSTM;
DOI
10.1016/j.sasc.2024.200148
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the development of the internet, the number of short video platform users has grown rapidly. Social entertainment has gradually shifted from text to short video, generating large amounts of multimodal data that traditional single-modal sentiment analysis can no longer fully handle. To address this issue, this study proposes a short video sentiment analysis model based on multimodal feature fusion. The model analyzes the text, speech, and visual content of a video and integrates the information from the three modalities through a multi-head attention mechanism to analyze and classify emotions. In the experiments, with a training set size of 500, the multimodal sentiment analysis model based on modal contribution recognition and multi-task learning achieved a recognition accuracy of 0.96, an F1 score of 98, and a mean absolute error of 0.21. With a validation set size of 400, its recognition time was 2.1 s; at 60 iterations, its recognition time was 0.9 s. These results indicate that the proposed model performs well and can accurately identify emotions in short videos.
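The fusion step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the author's implementation: the record gives no architectural details, so the feature size, the number of heads, the random weights, the mean pooling, and the helper names `init_params` and `multi_head_fusion` are all assumptions made for illustration. The sketch treats the text, speech, and visual feature vectors as a three-token sequence, applies multi-head self-attention across them, pools the result, and maps it to class probabilities.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def init_params(d_model, num_classes, rng):
    """Random projection weights (hypothetical; a real model would learn these)."""
    scale = 1.0 / np.sqrt(d_model)
    params = {
        name: rng.normal(0.0, scale, size=(d_model, d_model))
        for name in ("wq", "wk", "wv", "wo")
    }
    params["wc"] = rng.normal(0.0, scale, size=(d_model, num_classes))
    return params


def multi_head_fusion(text, speech, visual, params, num_heads):
    """Fuse three modality vectors with multi-head self-attention.

    Returns class probabilities and the per-head attention weights.
    """
    x = np.stack([text, speech, visual], axis=0)  # (3, d_model)
    n_tokens, d_model = x.shape
    assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
    d_head = d_model // num_heads

    def split_heads(t):
        # (3, d_model) -> (num_heads, 3, d_head)
        return t.reshape(n_tokens, num_heads, d_head).transpose(1, 0, 2)

    q = split_heads(x @ params["wq"])
    k = split_heads(x @ params["wk"])
    v = split_heads(x @ params["wv"])

    # Scaled dot-product attention: each modality attends to all three.
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)  # (heads, 3, 3)
    weights = softmax(scores, axis=-1)
    heads = weights @ v  # (heads, 3, d_head)

    # Concatenate heads and apply the output projection.
    merged = heads.transpose(1, 0, 2).reshape(n_tokens, d_model)
    attended = merged @ params["wo"]  # (3, d_model)

    # Assumption: mean-pool the three attended vectors into one fused vector.
    fused = attended.mean(axis=0)
    probs = softmax(fused @ params["wc"])  # (num_classes,)
    return probs, weights


rng = np.random.default_rng(0)
d_model, num_heads, num_classes = 16, 4, 3
params = init_params(d_model, num_classes, rng)

# Stand-ins for per-modality features produced by upstream encoders.
text, speech, visual = rng.normal(size=(3, d_model))

probs, weights = multi_head_fusion(text, speech, visual, params, num_heads)
print("class probabilities:", probs)
print("attention weights shape:", weights.shape)
```

Each row of `weights[h]` shows how strongly one modality attends to the other two in head `h`, which is one simple way to inspect how much each modality contributes to the fused representation.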
Pages: 9
Related papers
50 in total
  • [21] Movie Short-Text Reviews Sentiment Analysis Based on Multi-Feature Fusion
    Zhang, Shangqian
    Lvt, Xueqiang
    Tang, Yunzhong
    Dong, Zhian
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [22] Multimodal Feature Fusion Based Hypergraph Learning Model
    Yang, Zhe
    Xu, Liangkui
    Zhao, Lei
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [23] Special video classification based on multitask learning and multimodal feature fusion
    Wu X.-Y.
    Gu C.-N.
    Wang S.-J.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2020, 28 (05): : 1177 - 1186
  • [24] Linear Multimodal Fusion in Video Concept Analysis Based on Node Equilibrium Model
    Geng, Jie
    Miao, Zhenjiang
    Liang, Qinghua
    Wang, Shu
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 316 - 320
  • [25] Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
    Xie, Zhuyang
    Yang, Yan
    Wang, Jie
    Liu, Xiaorong
    Li, Xiaofan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7657 - 7670
  • [26] Few-shot Multimodal Sentiment Analysis Based on Multimodal Probabilistic Fusion Prompts
    Yang, Xiaocui
    Feng, Shi
    Wang, Daling
    Zhang, Yifei
    Poria, Soujanya
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6045 - 6053
  • [28] Attention fusion network for multimodal sentiment analysis
    Luo, Yuanyi
    Wu, Rui
    Liu, Jiafeng
    Tang, Xianglong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8207 - 8217
  • [29] Dynamic Dominant Fusion Multimodal Sentiment Analysis Method Based on Autoencoder
    Yang, Xi
    Guo, Junjun
    Yan, Haining
    Tan, Kaiwen
    Xiang, Yan
    Yu, Zhengtao
    Computer Engineering and Applications, 2024, 60 (06) : 180 - 187
  • [30] Implicit Sentiment Analysis for Chinese Texts Based on Multimodal Information Fusion
    Zhang, Huanxiang
    Li, Mengyun
    Zhang, Jing
    Computer Engineering and Applications, 61 (02): : 179 - 190