ADEQ: Adaptive Diversity Enhancement for Zero-Shot Quantization

被引:0
|
作者
Chen, Xinrui [1 ]
Yan, Renao [1 ]
Cheng, Junru [1 ]
Wang, Yizhi [1 ]
Fu, Yuqiu [1 ]
Chen, Yi [1 ]
Guan, Tian [1 ]
He, Yonghong [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot Quantization; Diversity Enhancement; Class-wise Adaptability; Layer-wise Adaptability; Inter-class Separability;
D O I
10.1007/978-981-99-8079-6_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot quantization (ZSQ) is an effective way to compress neural networks, especially when real training sets are inaccessible because of privacy and security issues. Most existing synthetic-data-driven zero-shot quantization methods introduce diversity enhancement to simulate the distribution of real samples. However, the adaptivity between the enhancement degree and network is neglected, i.e., whether the enhancement degree benefits different network layers and different classes, and whether it reaches the best match between the inter-class distance and intra-class diversity. Due to the absence of the metric for class-wise and layer-wise diversity, maladaptive enhancement degree run the vulnerability of mode collapse of the inter-class inseparability. To address this issue, we propose a novel zero-shot quantization method, ADEQ. For layer-wise and class-wise adaptivity, the enhancement degree of different layers is adaptively initialized with a diversity coefficient. For inter-class adaptivity, an incremental diversity enhancement strategy is proposed to achieve the trade-off between inter-class distance and intra-class diversity. Extensive experiments on the CIFAR-100 and ImageNet show that our ADEQ is observed to have advanced performance at low bit-width quantization. For example, when ResNet-18 is quantized to 3 bits, we improve top-1 accuracy by 17.78% on ImageNet compared to the advanced ARC. Code at https://github.com/dangsingrue/ADEQ.
引用
收藏
页码:53 / 64
页数:12
相关论文
共 50 条
  • [41] Semantic Diversity Learning for Zero-Shot Multi-label Classification
    Ben-Cohen, Avi
    Zamir, Nadav
    Ben Baruch, Emanuel
    Friedman, Itamar
    Zelnik-Manor, Lihi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 620 - 630
  • [42] Zero-Shot Machine Unlearning
    Chundawat, Vikram S.
    Tarun, Ayush K.
    Mandal, Murari
    Kankanhalli, Mohan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 2345 - 2354
  • [43] Zero-Shot Object Detection
    Bansal, Ankan
    Sikka, Karan
    Sharma, Gaurav
    Chellappa, Rama
    Divakaran, Ajay
    COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 397 - 414
  • [44] Zero-shot Metric Learning
    Xu, Xinyi
    Cao, Huanhuan
    Yang, Yanhua
    Yang, Erkun
    Deng, Cheng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3996 - 4002
  • [45] Active Zero-Shot Learning
    Xie, Sihong
    Wang, Shaoxiong
    Yu, Philip S.
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1889 - 1892
  • [46] Spherical Zero-Shot Learning
    Shen, Jiayi
    Xiao, Zehao
    Zhen, Xiantong
    Zhang, Lei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 634 - 645
  • [47] Zero-Shot Logit Adjustment
    Chen, Dubing
    Shen, Yuming
    Zhang, Haofeng
    Torr, Philip H. S.
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 813 - 819
  • [48] Zero-Shot Image Dehazing
    Li, Boyun
    Gou, Yuanbiao
    Liu, Jerry Zitao
    Zhu, Hongyuan
    Zhou, Joey Tianyi
    Peng, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8457 - 8466
  • [49] Rebalanced Zero-Shot Learning
    Ye, Zihan
    Yang, Guanyu
    Jin, Xiaobo
    Liu, Youfa
    Huang, Kaizhu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4185 - 4198
  • [50] Zero-Shot Visual Imitation
    Pathak, Deepak
    Mahmoudieh, Parsa
    Luo, Guanghao
    Agrawal, Pulkit
    Chen, Dian
    Shentu, Fred
    Shelhamer, Evan
    Malik, Jitendra
    Efros, Alexei A.
    Darrell, Trevor
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2131 - 2134