Sentiment-Oriented Transformer-Based Variational Autoencoder Network for Live Video Commenting

被引:3
|
作者
Fu, Fengyi [1 ]
Fang, Shancheng [1 ]
Chen, Weidong [1 ]
Mao, Zhendong [1 ]
机构
[1] Univ Sci & Technol China, 100 Fuxing Rd, Hefei 230000, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Automatic live video commenting; multi-modal learning; variational autoencoder; batch attention mechanism;
D O I
10.1145/3633334
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic live video commenting is getting increasing attention due to its significance in narration generation, topic explanation, etc. However, the diverse sentiment consideration of the generated comments is missing from current methods. Sentimental factors are critical in interactive commenting, and there has been lack of research so far. Thus, in this article, we propose a Sentiment-oriented Transformer-based Variational Autoencoder (So-TVAE) network, which consists of a sentiment-oriented diversity encoder module and a batch attention module, to achieve diverse video commenting with multiple sentiments and multiple semantics. Specifically, our sentiment-oriented diversity encoder elegantly combines a VAE and random mask mechanism to achieve semantic diversity under sentiment guidance, which is then fused with cross-modal features to generate live video comments. A batch attention module is also proposed in this article to alleviate the problem of missing sentimental samples, caused by the data imbalance that is common in live videos as the popularity of videos varies. Extensive experiments on Livebot and VideoIC datasets demonstrate that the proposed So-TVAE outperforms the state-of-the-art methods in terms of the quality and diversity of generated comments. Related code is available at https://github.com/fufy1024/So-TVAE.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] A Transformer-Based Variational Autoencoder for Sentence Generation
    Liu, Danyang
    Liu, Gongshen
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Zhuang, Xuqiang
    Liu, Fangai
    Hou, Jian
    Hao, Jianhua
    Cai, Xiaohong
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1943 - 1960
  • [3] Transformer-Based Interactive Multi-Modal Attention Network for Video Sentiment Detection
    Xuqiang Zhuang
    Fangai Liu
    Jian Hou
    Jianhua Hao
    Xiaohong Cai
    Neural Processing Letters, 2022, 54 : 1943 - 1960
  • [4] T-DVAE: A Transformer-Based Dynamical Variational Autoencoder for Speech
    Perschewski, Jan-Ole
    Stober, Sebastian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 33 - 46
  • [5] Transformer-Based Graph Convolutional Network for Sentiment Analysis
    AlBadani, Barakat
    Shi, Ronghua
    Dong, Jian
    Al-Sabri, Raeed
    Moctard, Oloulade Babatounde
    APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [6] Adaptive Transformer-Based Conditioned Variational Autoencoder for Incomplete Social Event Classification
    Li, Zhangming
    Qian, Shengsheng
    Cao, Jie
    Fang, Quan
    Xu, Changsheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1698 - 1707
  • [7] T-CVAE: Transformer-Based Conditioned Variational Autoencoder for Story Completion
    Wang, Tianming
    Wan, Xiaojun
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5233 - 5239
  • [8] Video Review Analysis via Transformer-based Sentiment Change Detection
    Wu, Zilong
    Huang, Siyuan
    Zhang, Rui
    Li, Lin
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 330 - 335
  • [9] Application and Study on Sentiment-oriented Analysis on Social Semantic Network
    Zhang, Dan
    Wang, Dongsheng
    Huang, Haiping
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 616 - 618
  • [10] Unsupervised Anomaly Detection in Multivariate Time Series through Transformer-based Variational Autoencoder
    Zhang, Hongwei
    Xia, Yuanqing
    Yan, Tijin
    Liu, Guiyang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 281 - 286