On the Trade-off of Intra-/Inter-class Diversity for Supervised Pre-training

被引:0
|
作者
Zhang, Jieyu [1 ]
Wang, Bohan [2 ]
Hu, Zhengyu [3 ]
Koh, Pang Wei [1 ]
Ratner, Alexander [1 ,4 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] USTC, Hefei, Anhui, Peoples R China
[3] HKUST GZ, Hong Kong, Peoples R China
[4] Snorkel AI Inc, Redwood City, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-training datasets are critical for building state-of-the-art machine learning models, motivating rigorous study on their impact on downstream tasks. In this work, we study the impact of the trade-off between the intra-class diversity (the number of samples per class) and the inter-class diversity (the number of classes) of a supervised pre-training dataset. Empirically, given a fixed pre-training dataset size, we find that the best downstream performance comes with a balance on the intra-/inter-class diversity. To understand the underlying mechanism, we show theoretically that downstream performance depends monotonically on both types of diversity. Notably, our theory reveals that the optimal class-to-sample ratio (#classes/#samples per class), i.e., the ratio of the number of pre-training classes to the number of samples per class, is invariant to the size of the pre-training dataset, enabling the prediction of the optimal number of pre-training classes. We demonstrate the effectiveness of this application by an improvement of approximately 2 points on average on downstream tasks when pre-training on ImageNet.
引用
收藏
页数:20
相关论文
共 26 条
  • [1] Generating Intra- and Inter-Class Iris Images by Identity Contrast
    Wang, Chen
    He, Zhaofeng
    Wang, Caiyong
    Tian, Qing
    2022 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2022,
  • [2] An evolutionary approach to discover intra- and inter-class exceptions in databases
    Vashishtha, Jyoti
    Kumar, Dharminder
    Ratnoo, Saroj
    International Journal of Intelligent Systems Technologies and Applications, 2013, 12 (3-4) : 283 - 300
  • [3] Blessing of Class Diversity in Pre-training
    Zhao, Yulai
    Chen, Jianshu
    Du, Simon S.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206 : 283 - 305
  • [4] Improving Textual Emotion Recognition Based on Intra- and Inter-Class Variations
    Alhuzali, Hassan
    Ananiadou, Sophia
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1297 - 1307
  • [5] Necessary Conditions for Successful Application of Intra- and Inter-class Common Vector Classifiers
    Mehmet Koc
    Semih Ergin
    Mehmet Bilginer Gulmezoglu
    Mehmet Fidan
    Omer Nezih Gerek
    Atalay Barkana
    Arabian Journal for Science and Engineering, 2022, 47 : 10101 - 10113
  • [6] Softmax Dissection: Towards Understanding Intra- and Inter-Class Objective for Embedding Learning
    He, Lanqing
    Wang, Zhongdao
    Li, Yali
    Wang, Shengjin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10957 - 10964
  • [7] Intra- and Inter-Class Induced Discriminative Deep Dictionary Learning for Visual Recognition
    Gou, Jianping
    Yuan, Xia
    Yu, Baosheng
    Yu, Jiali
    Yi, Zhang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1575 - 1583
  • [8] Necessary Conditions for Successful Application of Intra- and Inter-class Common Vector Classifiers
    Koc, Mehmet
    Ergin, Semih
    Gulmezoglu, Mehmet Bilginer
    Fidan, Mehmet
    Gerek, Omer Nezih
    Barkana, Atalay
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 10101 - 10113
  • [9] The trade-off between intra- and intergenerational equity in climate policy
    Kverndokk, Snorre
    Naevdal, Eric
    Nostbakken, Linda
    EUROPEAN ECONOMIC REVIEW, 2014, 69 : 40 - 58
  • [10] Supervised graph regularization based cross media retrieval with intra and inter-class correlation
    Zhang, Meijia
    Zhang, Huaxiang
    Li, Junzheng
    Wang, Li
    Fang, Yixian
    Sun, Jiande
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 1 - 11