Joining datasets via data augmentation in the label space for neural networks

被引:0
|
作者
Zhao, Jake [1 ]
Ou, Mingfeng [2 ,3 ]
Xue, Linji [2 ]
Cui, Yunkai [2 ]
Wu, Sai [1 ]
Chen, Gang [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Graviti Inc, Shanghai, Peoples R China
[3] Tongji Univ, Dept Software Engn, Shanghai, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that are made of similar purposes. Unlike previous published works that ubiquitously conduct the dataset joining in the uninterpretable latent vectorial space, the core to our method is an augmentation procedure in the label space. The primary challenge to address the label space for dataset joining is the discrepancy between labels: non-overlapping label annotation sets, different labeling granularity or hierarchy and etc. Notably we propose a new technique leveraging artificially created knowledge graph, recurrent neural networks and policy gradient that successfully achieve the dataset joining in the label space. Empirical results on both image and text classification justify the validity of our approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Rumour detection on benchmark twitter datasets using graph neural networks with data augmentation
    Patel, Shaswat
    Bansal, Prince
    Kaur, Preeti
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [2] Data Augmentation for Graph Neural Networks
    Zhao, Tong
    Liu, Yozen
    Neves, Leonardo
    Woodford, Oliver
    Jiang, Meng
    Shah, Neil
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11015 - 11023
  • [3] Hydranet: Data Augmentation for Regression Neural Networks
    Dubost, Florian
    Bortsova, Gerda
    Adams, Hieab
    Ikram, M. Arfan
    Niessen, Wiro
    Vernooij, Meike
    de Bruijne, Marleen
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 438 - 446
  • [4] Rationalizing Graph Neural Networks with Data Augmentation
    Liu, Gang
    Inae, Eric
    Luo, Tengfei
    Jiang, Meng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [5] An Imperceptible Data Augmentation Based Blackbox Clean-Label Backdoor Attack on Deep Neural Networks
    Xu, Chaohui
    Liu, Wenye
    Zheng, Yue
    Wang, Si
    Chang, Chip-Hong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (12) : 5011 - 5024
  • [6] Data-dependent Sample Complexity of Deep Neural Networks via Lipschitz Augmentation
    Wei, Colin
    Ma, Tengyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Data-dependent sample complexity of deep neural networks via lipschitz augmentation
    Wei, Colin
    Ma, Tengyu
    Advances in Neural Information Processing Systems, 2019, 32
  • [8] Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks
    Oviedo, Felipe
    Ren, Zekun
    Sun, Shijing
    Settens, Charles
    Liu, Zhe
    Hartono, Noor Titan Putri
    Ramasamy, Savitha
    DeCost, Brian L.
    Tian, Siyu I. P.
    Romano, Giuseppe
    Kusne, Aaron Gilad
    Buonassisi, Tonio
    NPJ COMPUTATIONAL MATERIALS, 2019, 5
  • [9] Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks
    Felipe Oviedo
    Zekun Ren
    Shijing Sun
    Charles Settens
    Zhe Liu
    Noor Titan Putri Hartono
    Savitha Ramasamy
    Brian L. DeCost
    Siyu I. P. Tian
    Giuseppe Romano
    Aaron Gilad Kusne
    Tonio Buonassisi
    npj Computational Materials, 5
  • [10] Data-Efficient Augmentation for Training Neural Networks
    Liu, Tian Yu
    Mirzasoleiman, Baharan
    Advances in Neural Information Processing Systems, 2022, 35