AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing

被引：0

作者：

Bejan, Matei ^{[1
]}

Manolache, Andrei ^{[2
,3
]}

Popescu, Marius ^{[1
]}

机构：

[1] Univ Bucharest, Bucharest, Romania

[2] Bitdefender, Bucharest, Romania

[3] Univ Stuttgart, Stuttgart, Germany

来源：

2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning models have reignited the interest in Anomaly Detection research in recent years. Methods for Anomaly Detection in text have shown strong empirical results on ad-hoc anomaly setups that are usually made by downsampling some classes of a labeled dataset. This can lead to reproducibility issues and models that are biased toward detecting particular anomalies while failing to recognize them in more sophisticated scenarios. In the present work, we provide a unified benchmark for detecting various types of anomalies, focusing on problems that can be naturally formulated as Anomaly Detection in text, ranging from syntax to stylistics. In this way, we are hoping to facilitate research in Text Anomaly Detection. We also evaluate and analyze two strong shallow baselines, as well as two of the current state-of-the-art neural approaches, providing insights into the knowledge the neural models are learning when performing the anomaly detection task. We provide the code for evaluation, downloading, and preprocessing the dataset at https://github.com/mateibejan1/ad-nlp/.

引用

页码：10766 / 10778

页数：13

共 50 条

[1] NLP (Natural Language Processing) for NLP (Natural Language Programming)
Mihalcea, R
Liu, H
Lieberman, H
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 319 - 330
[2] Natural language processing (NLP) for personality disorder detection
Jang, Jihee
Yoon, Seowon
Son, Gaeun
Park, Soohyun
Hwang, Jueun
Choeh, Joon Yeon
Choi, Kee-hong
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2024, 59 : 79 - 80
[3] NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Klyuchnikov, Nikita
Trofimov, Ilya
Artemova, Ekaterina
Salnikov, Mikhail
Fedorov, Maxim
Filippov, Alexander
Burnaev, Evgeny
IEEE ACCESS, 2022, 10 : 45736 - 45747
[4] From NLP (Natural Language Processing) to MLP (Machine Language Processing)
Institute for Applied Information Processing and Communications , Graz University of Technology, Austria
不详
不详
Lect. Notes Comput. Sci., (256-269):
[5] From NLP (Natural Language Processing) to MLP (Machine Language Processing)
Teufl, Peter
Payer, Udo
Lackner, Guenter
COMPUTER NETWORK SECURITY, 2010, 6258 : 256 - +
[6] APPLICATION OF NATURAL LANGUAGE PROCESSING (NLP) IN EARLY DETECTION OF AMYLOIDOSIS: THE ALARM STUDY
Altibi, Ahmed
Elman, Miriam
Volk, Hailey
Masri, Ahmad
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 83 (13) : 346 - 346
[7] Natural Language Processing (NLP) Applied on Issue Trackers
Ellmann, Mathias
PROCEEDINGS OF THE 4TH ACM SIGSOFT INTERNATIONAL WORKSHOP ON NLP FOR SOFTWARE ENGINEERING (NL4SE '18), 2018, : 38 - 41
[8] Open Health Natural Language Processing (NLP) Consortium
不详
METHODS OF INFORMATION IN MEDICINE, 2009, 48 (03) : V - V
[9] A Systematic Literature Review on Natural Language Processing (NLP)
Castanha, Jick
Indrawati
Pillai, Subhash K. B.
Ramantoko, Gadang
Widarmanti, Tri
2022 INTERNATIONAL CONFERENCE ON ADVANCED CREATIVE NETWORKS AND INTELLIGENT SYSTEMS, ICACNIS, 2022, : 130 - 135
[10] Natural Language Processing-based Model for Log Anomaly Detection
Li, Zezhou
Zhang, Jing
Zhang, Xianbo
Lin, Feng
Wang, Chao
Cai, Xingye
2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 129 - 134

← 1 2 3 4 5 →