Joining datasets via data augmentation in the label space for neural networks

Cited by: 0

Authors:

Zhao, Jake ^{[1]}

Ou, Mingfeng ^{[2,3]}

Xue, Linji ^{[2]}

Cui, Yunkai ^{[2]}

Wu, Sai ^{[1]}

Chen, Gang ^{[1]}

Affiliations:

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China

[2] Graviti Inc, Shanghai, Peoples R China

[3] Tongji Univ, Dept Software Engn, Shanghai, Peoples R China

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021 / Vol. 139

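Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes: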

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that are made for similar purposes. Unlike previously published works that ubiquitously conduct the dataset joining in the uninterpretable latent vectorial space, the core of our method is an augmentation procedure in the label space. The primary challenge in addressing the label space for dataset joining is the discrepancy between labels: non-overlapping label annotation sets, different labeling granularity or hierarchy, etc. Notably, we propose a new technique leveraging an artificially created knowledge graph, recurrent neural networks, and policy gradient that successfully achieves the dataset joining in the label space. Empirical results on both image and text classification justify the validity of our approach.


Pages: 11
