Unsupervised Layer-Wise Score Aggregation for Textual OOD Detection

被引:0
|
作者
Darrin, Maxime [1 ,2 ,3 ,4 ]
Staerman, Guillaume [4 ,6 ,9 ]
Gomes, Eduardo Dadalto Camara [4 ,5 ,6 ,7 ]
Gomes, Camara
Cheung, Jackie Ck [2 ,3 ,8 ]
Piantanida, Pablo [1 ,2 ,4 ,6 ]
Colombo, Pierre [2 ,4 ,7 ,10 ,11 ]
机构
[1] Int Lab Learning Syst, Gif Sur Yvette, France
[2] MILA Quebec AI Inst, Montreal, PQ, Canada
[3] McGill Univ, Montreal, PQ H3A 2T5, Canada
[4] Univ Paris Saclay, Paris, France
[5] Lab Signaux & Syst, Paris, France
[6] CNRS, Paris, France
[7] Cent Supelec, Paris, France
[8] Mila, Canada CIFAR Chair, Montreal, PQ, Canada
[9] INRIA, CEA, Paris, France
[10] Equal, Paris, France
[11] MICS, Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on anomaly scores (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending on the task and layer output. More importantly, we show that the usual choice (the last layer) is rarely the best one for OOD detection and that far better results can be achieved, provided that an oracle selects the best layer. We propose a data-driven, unsupervised method to leverage this observation to combine layer-wise anomaly scores. In addition, we extend classical textual OOD benchmarks by including classification tasks with a more significant number of classes (up to 150), which reflects more realistic settings. On this augmented benchmark, we show that the proposed post-aggregation methods achieve robust and consistent results comparable to using the best layer according to an oracle while removing manual feature selection altogether.
引用
下载
收藏
页码:17880 / 17888
页数:9
相关论文
共 50 条
  • [1] Layer-wise domain correction for unsupervised domain adaptation
    Shuang LI
    Shi-ji SONG
    Cheng WU
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 (01) : 91 - 103
  • [2] Layer-wise domain correction for unsupervised domain adaptation
    Shuang Li
    Shi-ji Song
    Cheng Wu
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 91 - 103
  • [3] Layer-wise domain correction for unsupervised domain adaptation
    Li, Shuang
    Song, Shi-ji
    Wu, Cheng
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (01) : 91 - 103
  • [4] Unsupervised Layer-Wise Model Selection in Deep Neural Networks
    Ludovic, Arnold
    Helene, Paugam-Moisy
    Michele, Sebag
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 915 - 920
  • [5] Unsupervised Discovery of Disentangled Interpretable Directions for Layer-Wise GAN
    Hu, Haotian
    Jiang, Bin
    Zhou, Xinjiao
    Huo, Xiaofei
    Zhang, Bolin
    BIG DATA, BIGDATA 2022, 2022, 1709 : 21 - 39
  • [6] Unsupervised Discovery of Disentangled Interpretable Directions for Layer-Wise GAN
    Hu, Haotian
    Jiang, Bin
    Zhou, Xinjiao
    Huo, Xiaofei
    Zhang, Bolin
    Communications in Computer and Information Science, 2022, 1709 CCIS : 21 - 39
  • [7] Adversarial Examples Detection and Analysis with Layer-wise Autoencoders
    Wojcik, Bartosz
    Morawiecki, Pawel
    Smieja, Marek
    Krzyzek, Tomasz
    Spurek, Przemyslaw
    Tabor, Jacek
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 1322 - 1326
  • [8] Layer-Wise Optimization of Contextual Neural Networks with Dynamic Field of Aggregation
    Jodlowiec, Marcin
    Albu, Adriana
    Wolk, Krzysztof
    Nguyen Thai-Nghe
    Karasinski, Adrian
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 302 - 312
  • [9] Efficient Federated Learning Using Layer-Wise Regulation and Momentum Aggregation*
    Zhang, Fan
    Fang, Zekuan
    Li, Yiming
    Chen, Mingsong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (18)
  • [10] YOGA: Adaptive Layer-Wise Model Aggregation for Decentralized Federated Learning
    Liu, Jun
    Liu, Jianchun
    Xu, Hongli
    Liao, Yunming
    Wang, Zhiyuan
    Ma, Qianpiao
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (02) : 1768 - 1780