Dual-Domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-Video Multi-Label Classification

被引:2
|
作者
Fan, Fugui [1 ]
Su, Yuting [1 ]
Nie, Liqiang [2 ]
Jing, Peiguang [1 ]
Hong, Daozheng [1 ]
Liu, Yu [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[3] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Correlation; Visualization; Task analysis; Matrix decomposition; Estimation; Training; Micro-video; multi-label classification; semantic alignment; deep matrix factorization;
D O I
10.1109/TMM.2023.3301224
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, with the growing popularity of micro-videos, multi-label learning has attracted increasing attention due to its potential commercial value in different scenarios. However, existing methods place more emphasis on the alignment between explicit semantics and visual features, while neglecting the exploration of interactions at fine-grained semantic levels. To address this problem, we propose a novel dual-domain aligned deep hierarchical matrix factorization (DADHMF) method for micro-video multi-label classification. Specifically, we construct a dual-stream deep matrix factorization framework to explore implicit hierarchical semantics and corresponding intrinsic feature representations in top-down and bottom-up ways, respectively. On this basis, we leverage the intralayer alignment strategy to narrow the semantic gap between label and instance domains by introducing adaptive semantic-aware embeddings. Moreover, we further utilize the inverse covariance estimation module to automatically capture latent semantic correlations, and project the structural information into the semantic-aware embeddings to ensure the stability of the intralayer alignment. Extensive experiments on two available micro-video multi-label datasets demonstrate that our proposed method outperforms the state-of-the-art methods.
引用
收藏
页码:2598 / 2607
页数:10
相关论文
共 47 条
  • [41] A multi-label text classification method via dynamic semantic representation model and deep neural network
    Tianshi Wang
    Li Liu
    Naiwen Liu
    Huaxiang Zhang
    Long Zhang
    Shanshan Feng
    Applied Intelligence, 2020, 50 : 2339 - 2351
  • [42] Hierarchical Graph Transformer-Based Deep Learning Model for Large-Scale Multi-Label Text Classification
    Gong, Jibing
    Teng, Zhiyong
    Teng, Qi
    Zhang, Hekai
    Du, Linfeng
    Chen, Shuai
    Bhuiyan, Md Zakirul Alam
    Li, Jianhua
    Liu, Mingsheng
    Ma, Hongyuan
    IEEE ACCESS, 2020, 8 : 30885 - 30896
  • [43] Satellite Video Multi-Label Scene Classification With Spatial and Temporal Feature Cooperative Encoding: A Benchmark Dataset and Method
    Guo, Weilong
    Li, Shengyang
    Chen, Feixiang
    Sun, Yuhan
    Gu, Yanfeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2238 - 2251
  • [44] A Dual-input Multi-label Classification Approach for Non-Intrusive Load Monitoring via Deep Learning
    Cimen, Halil
    Palacios-Garcia, Emilio J.
    Cetinkaya, Nurettin
    Vasquez, Juan C.
    Guerrero, Josep M.
    2020 ZOOMING INNOVATION IN CONSUMER TECHNOLOGIES CONFERENCE (ZINC), 2020, : 259 - 263
  • [45] Multi-label legal document classification: A deep learning-based approach with label-attention and domain-specific pre-training
    Song, Dezhao
    Vold, Andrew
    Madan, Kanika
    Schilder, Frank
    INFORMATION SYSTEMS, 2022, 106
  • [46] Deep learning based classification of multi-label chest X-ray images via dual-weighted metric loss
    Jin, Yufei
    Lu, Huijuan
    Zhu, Wenjie
    Huo, Wanli
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [47] Exploring Weighted Dual Graph Regularized Non-Negative Matrix Tri-Factorization Based Collaborative Filtering Framework for Multi-Label Annotation of Remote Sensing Images
    Zhang, Juli
    Zhang, Junyi
    Dai, Tao
    He, Zhanzhuang
    REMOTE SENSING, 2019, 11 (08)