AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing

被引:0
|
作者
Bejan, Matei [1 ]
Manolache, Andrei [2 ,3 ]
Popescu, Marius [1 ]
机构
[1] Univ Bucharest, Bucharest, Romania
[2] Bitdefender, Bucharest, Romania
[3] Univ Stuttgart, Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models have reignited the interest in Anomaly Detection research in recent years. Methods for Anomaly Detection in text have shown strong empirical results on ad-hoc anomaly setups that are usually made by downsampling some classes of a labeled dataset. This can lead to reproducibility issues and models that are biased toward detecting particular anomalies while failing to recognize them in more sophisticated scenarios. In the present work, we provide a unified benchmark for detecting various types of anomalies, focusing on problems that can be naturally formulated as Anomaly Detection in text, ranging from syntax to stylistics. In this way, we are hoping to facilitate research in Text Anomaly Detection. We also evaluate and analyze two strong shallow baselines, as well as two of the current state-of-the-art neural approaches, providing insights into the knowledge the neural models are learning when performing the anomaly detection task. We provide the code for evaluation, downloading, and preprocessing the dataset at https://github.com/mateibejan1/ad-nlp/.
引用
收藏
页码:10766 / 10778
页数:13
相关论文
共 50 条
  • [1] NLP (Natural Language Processing) for NLP (Natural Language Programming)
    Mihalcea, R
    Liu, H
    Lieberman, H
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 319 - 330
  • [2] Natural language processing (NLP) for personality disorder detection
    Jang, Jihee
    Yoon, Seowon
    Son, Gaeun
    Park, Soohyun
    Hwang, Jueun
    Choeh, Joon Yeon
    Choi, Kee-hong
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2024, 59 : 79 - 80
  • [3] NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
    Klyuchnikov, Nikita
    Trofimov, Ilya
    Artemova, Ekaterina
    Salnikov, Mikhail
    Fedorov, Maxim
    Filippov, Alexander
    Burnaev, Evgeny
    IEEE ACCESS, 2022, 10 : 45736 - 45747
  • [4] From NLP (Natural Language Processing) to MLP (Machine Language Processing)
    Institute for Applied Information Processing and Communications , Graz University of Technology, Austria
    不详
    不详
    Lect. Notes Comput. Sci., (256-269):
  • [5] From NLP (Natural Language Processing) to MLP (Machine Language Processing)
    Teufl, Peter
    Payer, Udo
    Lackner, Guenter
    COMPUTER NETWORK SECURITY, 2010, 6258 : 256 - +
  • [6] APPLICATION OF NATURAL LANGUAGE PROCESSING (NLP) IN EARLY DETECTION OF AMYLOIDOSIS: THE ALARM STUDY
    Altibi, Ahmed
    Elman, Miriam
    Volk, Hailey
    Masri, Ahmad
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 83 (13) : 346 - 346
  • [7] Natural Language Processing (NLP) Applied on Issue Trackers
    Ellmann, Mathias
    PROCEEDINGS OF THE 4TH ACM SIGSOFT INTERNATIONAL WORKSHOP ON NLP FOR SOFTWARE ENGINEERING (NL4SE '18), 2018, : 38 - 41
  • [9] A Systematic Literature Review on Natural Language Processing (NLP)
    Castanha, Jick
    Indrawati
    Pillai, Subhash K. B.
    Ramantoko, Gadang
    Widarmanti, Tri
    2022 INTERNATIONAL CONFERENCE ON ADVANCED CREATIVE NETWORKS AND INTELLIGENT SYSTEMS, ICACNIS, 2022, : 130 - 135
  • [10] Natural Language Processing-based Model for Log Anomaly Detection
    Li, Zezhou
    Zhang, Jing
    Zhang, Xianbo
    Lin, Feng
    Wang, Chao
    Cai, Xingye
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 129 - 134