Joining datasets via data augmentation in the label space for neural networks

被引:0
|
作者
Zhao, Jake [1 ]
Ou, Mingfeng [2 ,3 ]
Xue, Linji [2 ]
Cui, Yunkai [2 ]
Wu, Sai [1 ]
Chen, Gang [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Graviti Inc, Shanghai, Peoples R China
[3] Tongji Univ, Dept Software Engn, Shanghai, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that are made of similar purposes. Unlike previous published works that ubiquitously conduct the dataset joining in the uninterpretable latent vectorial space, the core to our method is an augmentation procedure in the label space. The primary challenge to address the label space for dataset joining is the discrepancy between labels: non-overlapping label annotation sets, different labeling granularity or hierarchy and etc. Notably we propose a new technique leveraging artificially created knowledge graph, recurrent neural networks and policy gradient that successfully achieve the dataset joining in the label space. Empirical results on both image and text classification justify the validity of our approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Kernel-convoluted Deep Neural Networks with Data Augmentation
    Kim, Minjin
    Kim, Young-geun
    Kim, Dongha
    Kim, Yongdai
    Paik, Myunghee Cho
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8155 - 8162
  • [32] Supervised text data augmentation method for deep neural networks
    Seol, Jaehwan
    Jung, Jieun
    Choi, Yeonseok
    Choi, Yong-Seok
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2023, 30 (03) : 343 - 354
  • [33] SanitAIs: Unsupervised Data Augmentation to Sanitize Trojaned Neural Networks
    Karra, Kiran
    Ashcraft, Chace
    Costello, Cash
    Proceedings of the 2022 IEEE International Conference on Dependable, Autonomic and Secure Computing, International Conference on Pervasive Intelligence and Computing, International Conference on Cloud and Big Data Computing, International Conference on Cyber Science and Technology Congress, DASC/PiCom/CBDCom/CyberSciTech 2022, 2022,
  • [34] Self-paced data augmentation for training neural networks
    Takase, Tomoumi
    Karakida, Ryo
    Asoh, Hideki
    NEUROCOMPUTING, 2021, 442 : 296 - 306
  • [35] Posterior Uncertainty Quantification in Neural Networks using Data Augmentation
    Wu, Luhuan
    Williamson, Sinead A.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [36] Multichannel Adaptive Data Mixture Augmentation for Graph Neural Networks
    Ye, Zhonglin
    Zhou, Lin
    Li, Mingyuan
    Zhang, Wei
    Liu, Zhen
    Zhao, Haixing
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2024, 20 (01)
  • [37] Biomedical Data Augmentation Using Generative Adversarial Neural Networks
    Calimeri, Francesco
    Marzullo, Aldo
    Stamile, Claudio
    Terracina, Giorgio
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 626 - 634
  • [38] A survey on face data augmentation for the training of deep neural networks
    Wang, Xiang
    Wang, Kai
    Lian, Shiguo
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (19): : 15503 - 15531
  • [39] A survey on face data augmentation for the training of deep neural networks
    Xiang Wang
    Kai Wang
    Shiguo Lian
    Neural Computing and Applications, 2020, 32 : 15503 - 15531
  • [40] Metropolis-Hastings Data Augmentation for Graph Neural Networks
    Park, Hyeonjin
    Lee, Seunghun
    Kim, Sihyeon
    Park, Jinyoung
    Jeong, Jisu
    Kim, Kyung-Min
    Ha, Jung-Woo
    Kim, Hyunwoo J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34