Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification

被引:0
|
作者
Li, Zhitao [1 ]
Wang, Jianzong [1 ]
Cheng, Ning [1 ]
Xiao, Jing [1 ]
机构
[1] Ping An Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
关键词
Video Segment Classification; NeXtVLAD; Graph Convolutional Network; Graph; Word Embedding; Bidirectional Transformer;
D O I
10.1109/PAAP54281.2021.9720457
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Video classification is a challenging problem, video segment labels are sparse and expensive to get, and it is important to leverage as much available information as possible from labeled datasets. There have been several ways to capture video frame information but none of them have utilized the information hidden in labels correlation to increase classification accuracy. This work proposed a framework called Graph Convolution Semantic Network of aggregated descriptors (GCSN) which can extract neighboring information of related segment labels to increase video segmentation classification accuracy. Label relation graph was built by thresholding on cosine similarity computed from mutual embedding similarities, word embeddings were generated by Deep Bidirectional Transformers. The testing accuracy on Youtube-8m video segments classification dataset shows that our proposed GCSN outperforms NeXtVLAD baseline by considering additional labels relation information.
引用
收藏
页码:146 / 151
页数:6
相关论文
共 50 条
  • [31] Dual-view graph convolutional network for multi-label text classification
    Li, Xiaohong
    You, Ben
    Peng, Qixuan
    Feng, Shaojie
    [J]. APPLIED INTELLIGENCE, 2024, 54 (19) : 9363 - 9380
  • [32] Multi-label classification using hierarchical embedding
    Kumar, Vikas
    Pujari, Arun K.
    Padmanabhan, Vineet
    Sahu, Sandeep Kumar
    Kagita, Venkateswara Rao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 : 263 - 269
  • [33] MG-GCN: Multi-Granularity Graph Convolutional Neural Network for Multi-Label Classification in Multi-Label Information System
    Yu, Bin
    Xie, Hengjie
    Cai, Mingjie
    Ding, Weiping
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 288 - 299
  • [34] Knowledge Graph Constraints for Multi-label Graph Classification
    Ringsquandl, Martin
    Lamparter, Steffen
    Thon, Ingo
    Lepratti, Raffaello
    Kroeger, Peer
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 121 - 127
  • [35] Adaptive Graph Guided Embedding for Multi-label Annotation
    Wang, Lichen
    Ding, Zhengming
    Fu, Yun
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2798 - 2804
  • [36] Label Co-Occurrence Learning With Graph Convolutional Networks for Multi-Label Chest X-Ray Image Classification
    Chen, Bingzhi
    Li, Jinxing
    Lu, Guangming
    Yu, Hongbing
    Zhang, David
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (08) : 2292 - 2302
  • [37] Modeling global and local label correlation with graph convolutional networks for multi-label chest X-ray image classification
    Lanting Li
    Peng Cao
    Jinzhu Yang
    Osmar R. Zaiane
    [J]. Medical & Biological Engineering & Computing, 2022, 60 : 2567 - 2588
  • [38] Modeling global and local label correlation with graph convolutional networks for multi-label chest X-ray image classification
    Li, Lanting
    Cao, Peng
    Yang, Jinzhu
    Zaiane, Osmar R.
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (09) : 2567 - 2588
  • [39] Label Embedding for Multi-label Classification Via Dependence Maximization
    Yachong Li
    Youlong Yang
    [J]. Neural Processing Letters, 2020, 52 : 1651 - 1674
  • [40] Cost-sensitive label embedding for multi-label classification
    Kuan-Hao Huang
    Hsuan-Tien Lin
    [J]. Machine Learning, 2017, 106 : 1725 - 1746