Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification

被引:0
|
作者
Li, Zhitao [1 ]
Wang, Jianzong [1 ]
Cheng, Ning [1 ]
Xiao, Jing [1 ]
机构
[1] Ping An Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
关键词
Video Segment Classification; NeXtVLAD; Graph Convolutional Network; Graph; Word Embedding; Bidirectional Transformer;
D O I
10.1109/PAAP54281.2021.9720457
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Video classification is a challenging problem, video segment labels are sparse and expensive to get, and it is important to leverage as much available information as possible from labeled datasets. There have been several ways to capture video frame information but none of them have utilized the information hidden in labels correlation to increase classification accuracy. This work proposed a framework called Graph Convolution Semantic Network of aggregated descriptors (GCSN) which can extract neighboring information of related segment labels to increase video segmentation classification accuracy. Label relation graph was built by thresholding on cosine similarity computed from mutual embedding similarities, word embeddings were generated by Deep Bidirectional Transformers. The testing accuracy on Youtube-8m video segments classification dataset shows that our proposed GCSN outperforms NeXtVLAD baseline by considering additional labels relation information.
引用
收藏
页码:146 / 151
页数:6
相关论文
共 50 条
  • [1] Multiple Semantic Embedding with Graph Convolutional Networks for Multi-Label Image Classification
    Zhou, Tong
    Feng, Songhe
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 449 - 461
  • [2] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Dengdi Sun
    Leilei Ma
    Zhuanlian Ding
    Bin Luo
    [J]. Cognitive Computation, 2023, 15 : 1308 - 1319
  • [3] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Sun, Dengdi
    Ma, Leilei
    Ding, Zhuanlian
    Luo, Bin
    [J]. COGNITIVE COMPUTATION, 2023, 15 (04) : 1308 - 1319
  • [4] Active learning in multi-label image classification with graph convolutional network embedding
    Xie, Xiurui
    Tian, Maojun
    Luo, Guangchun
    Liu, Guisong
    Wu, Yizhe
    Qin, Ke
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 56 - 65
  • [5] Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification
    You, Renchun
    Guo, Zhiyao
    Cui, Lei
    Long, Xiang
    Bao, Yingze
    Wen, Shilei
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12709 - 12716
  • [6] Multi-label text classification based on semantic-sensitive graph convolutional network
    Zeng, Delong
    Zha, Enze
    Kuang, Jiayi
    Shen, Ying
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [7] Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration
    Lan, Ziwen
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    [J]. IEEE ACCESS, 2023, 11 : 35447 - 35456
  • [8] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [9] Multi-label text classification model based on semantic embedding
    Yan Danfeng
    Ke Nan
    Gu Chao
    Cui Jianfei
    Ding Yiqi
    [J]. The Journal of China Universities of Posts and Telecommunications, 2019, 26 (01) : 95 - 104
  • [10] Multi-Label Image Classification Based on Object Detection and Dynamic Graph Convolutional Networks
    Liu, Xiaoyu
    Hu, Yong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4413 - 4432