Cross-media web video topic detection based on heterogeneous interactive tensor learning

被引:0
|
作者
Zhang, Chengde [1 ]
Mei, Kai [1 ]
Xiao, Xia [2 ]
机构
[1] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan 430073, Peoples R China
[2] Hubei Univ Educ, Inst Educ Sci, Wuhan 430205, Peoples R China
关键词
Cross-media reasoning; Heterogeneous interaction tensor learning; Web video; Topic detection; REPRESENTATION; ATTENTION;
D O I
10.1016/j.knosys.2023.111153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic detection based on text reasoning has attracted widespread attention. Existing methods focus on inference based on textual semantic cues. However, each video is described with only a few words, resulting in sparse textual reasoning cues. In this situation, it is difficult to distinguish videos belonging to the same topic, making topic detection for web videos challenging. Fortunately, visual information contains many more detailed cues than textual information, such as colors, scenes, and objects. Cross-media joint reasoning provides more reasoning cues in a complementary manner than textual information. In view of this, this paper extends topic detection based on text reasoning to cross-media reasoning. A novel heterogeneous interactive tensor learning (HITL) method is proposed, which detects topics through cross-media joint inference. After extracting local features of keyframes and textual information, the semantic correlation between visual and textual information is mined by constructing a keyframe-text interaction attention matrix. Then, a joint cue between textual and visual information is constructed in a cross-media heterogeneous interaction tensor space, thereby achieving rich textual cues through cross-media fusion. Finally, semantic features are extracted through cue interaction in tensor space for topic detection.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Adaptation of cross-media surveys to heterogeneous target groups
    Lorz, Alexander
    [J]. ADAPTIVE HYPERMEDIA AND ADAPTIVE WEB-BASED SYSTEMS, PROCEEDINGS, 2006, 4018 : 182 - 191
  • [32] An Approach for Mining Heterogeneous Data for Cross-Media Retrieval
    Pavan, K. Madhu
    Ananthanarayana, V. S.
    [J]. 2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [33] Cross-media storytelling: from literature to video games
    Marsano, Martina
    [J]. AIB STUDI, 2019, 59 (1-2): : 307 - 308
  • [34] Learning semantic correlations for cross-media retrieval
    Wu, Fei
    Zhang, Hong
    Zhuang, Yueting
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 1465 - +
  • [35] Dictionary Learning based Supervised Discrete Hashing for Cross-Media Retrieval
    Wu, Ye
    Luo, Xin
    Xu, Xin-Shun
    Guo, Shanqing
    Shi, Yuliang
    [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 222 - 230
  • [36] A Cross-Media Retrieval Method Based on Semisupervised Learning and Alternate Optimization
    Li, Junzheng
    Zhu, Wei
    Yang, Yanchun
    Zheng, Xiyuan
    [J]. MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [37] Cross-media Hot Topic Auto-tracking Model Based on Semantics and Temporal Context
    LIANG Meiyu
    DU Junping
    ZHOU Yipeng
    [J]. Chinese Journal of Electronics, 2015, 24 (03) : 529 - 534
  • [38] Semi-Supervised Learning Based Semantic Cross-Media Retrieval
    Zheng, Xiyuan
    Zhu, Wei
    Yu, Zhenmei
    Zhang, Meijia
    [J]. IEEE ACCESS, 2021, 9 : 75049 - 75057
  • [39] Manifold learning based cross-media retrieval: A solution to media object complementary nature
    Zhuang, Yueting
    Yang, Yi
    Wu, Fei
    Pan, Yunhe
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2007, 46 (2-3): : 153 - 164
  • [40] Manifold Learning Based Cross-media Retrieval: A Solution to Media Object Complementary Nature
    Yueting Zhuang
    Yi Yang
    Fei Wu
    Yunhe Pan
    [J]. The Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, 2007, 46 : 153 - 164