Joining datasets via data augmentation in the label space for neural networks

Cited by: 0
Authors
Zhao, Jake [1]
Ou, Mingfeng [2, 3]
Xue, Linji [2 ]
Cui, Yunkai [2 ]
Wu, Sai [1 ]
Chen, Gang [1 ]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Graviti Inc, Shanghai, Peoples R China
[3] Tongji Univ, Dept Software Engn, Shanghai, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that were built for similar purposes. Unlike previously published work, which ubiquitously conducts dataset joining in the uninterpretable latent vector space, the core of our method is an augmentation procedure in the label space. The primary challenge in using the label space for dataset joining is the discrepancy between labels: non-overlapping label annotation sets, different labeling granularity or hierarchy, and so on. Notably, we propose a new technique that leverages an artificially created knowledge graph, recurrent neural networks, and policy gradient to achieve dataset joining in the label space. Empirical results on both image and text classification support the validity of our approach.
Pages: 11