Cross-media web video topic detection based on heterogeneous interactive tensor learning

Cited by: 0
Authors
Zhang, Chengde [1]
Mei, Kai [1]
Xiao, Xia [2]
Affiliations
[1] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan 430073, Peoples R China
[2] Hubei Univ Educ, Inst Educ Sci, Wuhan 430205, Peoples R China
Keywords
Cross-media reasoning; Heterogeneous interaction tensor learning; Web video; Topic detection; Representation; Attention
DOI
10.1016/j.knosys.2023.111153
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Topic detection based on text reasoning has attracted widespread attention. Existing methods focus on inference from textual semantic cues. However, each web video is described with only a few words, so the textual reasoning cues are sparse. In this situation, it is difficult to recognize videos that belong to the same topic, which makes topic detection for web videos challenging. Fortunately, visual information contains far more detailed cues than textual information, such as colors, scenes, and objects. Cross-media joint reasoning combines the two modalities in a complementary manner and therefore provides more reasoning cues than text alone. In view of this, this paper extends topic detection from text reasoning to cross-media reasoning. A novel heterogeneous interactive tensor learning (HITL) method is proposed, which detects topics through cross-media joint inference. After local features of keyframes and textual information are extracted, the semantic correlation between visual and textual information is mined by constructing a keyframe-text interaction attention matrix. Then, a joint cue between textual and visual information is constructed in a cross-media heterogeneous interaction tensor space, thereby enriching the sparse textual cues through cross-media fusion. Finally, semantic features are extracted through cue interaction in the tensor space for topic detection.
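The pipeline outlined in the abstract (keyframe-text attention matrix, interaction tensor, pooled semantic feature) can be illustrated with a minimal sketch. This is not the authors' HITL implementation: the abstract does not give the formulas, so the scaled dot-product attention, the element-wise tensor construction, the sum pooling, and all function names and toy feature values below are assumptions chosen only to make the three stages concrete.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def interaction_attention(keyframes, words):
    """Assumed form: scaled dot-product attention.

    Returns one row per keyframe and one column per word;
    each row sums to 1.
    """
    scale = math.sqrt(len(words[0]))
    return [
        softmax([sum(k * w for k, w in zip(kf, wd)) / scale for wd in words])
        for kf in keyframes
    ]


def interaction_tensor(keyframes, words, attention):
    """Assumed form: T[i][j][d] = attention[i][j] * keyframes[i][d] * words[j][d]."""
    return [
        [
            [attention[i][j] * kf[d] * wd[d] for d in range(len(kf))]
            for j, wd in enumerate(words)
        ]
        for i, kf in enumerate(keyframes)
    ]


def pool(tensor):
    """Sum over keyframes and words to get one feature vector per video."""
    dim = len(tensor[0][0])
    return [
        sum(cell[d] for plane in tensor for cell in plane)
        for d in range(dim)
    ]


# Hypothetical toy features: 2 keyframes and 4 words in a shared 3-d space.
keyframes = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
words = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.5, 0.5, 0.0]]

attention = interaction_attention(keyframes, words)
tensor = interaction_tensor(keyframes, words, attention)
video_feature = pool(tensor)
```

In this sketch the resulting `video_feature` could be passed to any clustering or classification step for topic detection; a real system would use learned projections so that keyframe and word features share a common dimension.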
Pages: 15
Related papers
50 in total
  • [1] Hot Topic Detection of Web Video Based on Cross-Media Semantic Association Enhancement
    Zhang C.
    Liu Y.
    Xiao X.
    Mei K.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (11): 2624 - 2637
  • [2] RCE-HIL: Recognizing Cross-media Entailment with Heterogeneous Interactive Learning
    Huang, Xin
    Peng, Yuxin
    Wen, Zhang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (01)
  • [3] Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model
    Wang, Zhiyi
    Li, Liang
    Huang, Qingming
    [J]. MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015: 1171 - 1174
  • [4] Study of Cross-Media Topic Analysis Based on Visual Topic Model
    Zhou, Yipeng
    Liang, Meiyu
    Du, Junping
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012: 3467 - 3470
  • [5] Fusing cross-media for topic detection by dense keyword groups
    Zhang, Weigang
    Chen, Tianlong
    Li, Guorong
    Pang, Junbiao
    Huang, Qingming
    Gao, Wen
    [J]. NEUROCOMPUTING, 2015, 169 : 169 - 179
  • [6] Effective Multimodality Fusion Framework for Cross-Media Topic Detection
    Chu, Lingyang
    Zhang, Yanyan
    Li, Guorong
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (03) : 556 - 569
  • [7] Cross-media video event mining based on attention graph structure learning
    Zhang, Chengde
    Lei, Yu
    Xiao, Xia
    Chen, Xinzhong
    [J]. NEUROCOMPUTING, 2022, 502 : 148 - 158
  • [8] CROSS-MEDIA TOPIC DETECTION: A MULTI-MODALITY FUSION FRAMEWORK
    Zhang, Yanyan
    Li, Guorong
    Chu, Lingyang
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013
  • [9] Cross-media web video event mining based on multiple semantic-paths embedding
    Xiao, Xia
    Du, Mingyue
    Xu, Shuyu
    Liu, Guoying
    Zhang, Chengde
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 36 (2): 667 - 683
  • [10] Cross-media web video event mining based on multiple semantic-paths embedding
    Xia Xiao
    Mingyue Du
    Shuyu Xu
    Guoying Liu
    Chengde Zhang
    [J]. Neural Computing and Applications, 2024, 36 : 667 - 683