AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing

被引:0
|
作者
Bejan, Matei [1 ]
Manolache, Andrei [2 ,3 ]
Popescu, Marius [1 ]
机构
[1] Univ Bucharest, Bucharest, Romania
[2] Bitdefender, Bucharest, Romania
[3] Univ Stuttgart, Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models have reignited the interest in Anomaly Detection research in recent years. Methods for Anomaly Detection in text have shown strong empirical results on ad-hoc anomaly setups that are usually made by downsampling some classes of a labeled dataset. This can lead to reproducibility issues and models that are biased toward detecting particular anomalies while failing to recognize them in more sophisticated scenarios. In the present work, we provide a unified benchmark for detecting various types of anomalies, focusing on problems that can be naturally formulated as Anomaly Detection in text, ranging from syntax to stylistics. In this way, we are hoping to facilitate research in Text Anomaly Detection. We also evaluate and analyze two strong shallow baselines, as well as two of the current state-of-the-art neural approaches, providing insights into the knowledge the neural models are learning when performing the anomaly detection task. We provide the code for evaluation, downloading, and preprocessing the dataset at https://github.com/mateibejan1/ad-nlp/.
引用
收藏
页码:10766 / 10778
页数:13
相关论文
共 50 条
  • [41] Investigating Mood Instability in Psychotic Disorders Using Natural Language Processing (NLP)
    Patel, Rashmi
    Lloyd, Theodore
    Jackson, Richard
    Ball, Michael
    Shetty, Hitesh
    Broadbent, Matthew
    Geddes, John R.
    Stewart, Robert
    McGuire, Philip
    Taylor, Matthew
    EARLY INTERVENTION IN PSYCHIATRY, 2016, 10 : 106 - 106
  • [42] GluonCV and gluon NLP: Deep learning in computer vision and natural language processing
    Guo, Jian
    He, He
    He, Tong
    Lausen, Leonard
    Li, Mu
    Lin, Haibin
    Shi, Xingjian
    Wang, Chenguang
    Xie, Junyuan
    Zha, Sheng
    Zhang, Aston
    Zhang, Hang
    Zhang, Zhi
    Zhang, Zhongyue
    Zheng, Shuai
    Zhu, Yi
    Journal of Machine Learning Research, 2020, 21
  • [43] Applying Natural Language Processing (NLP) to Verbatim Patient-Reported Outcomes
    Purks, Jennifer L.
    Harris, Michael
    Anderson, Karen E.
    Shoulson, Ira
    ANNALS OF NEUROLOGY, 2016, 80 : S69 - S69
  • [44] ADBench: Anomaly Detection Benchmark
    Han, Songqiao
    Hu, Xiyang
    Huang, Hailiang
    Jiang, Minqi
    Zhao, Yue
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [45] UTILIZING NATURAL LANGUAGE PROCESSING (NLP) TO ACCURATELY IDENTIFY FATTY LIVER DISEASE
    Redman, Joseph S.
    Natarajan, Yamini
    Wang, Jingqi
    Hanif, Muzammil
    Feng, Hua
    Kramer, Jennifer R.
    Desiderio, Roxanne
    Xu, Hua
    El-Serag, Hashem B.
    Hou, Jason K.
    Kanwal, Fasiha
    GASTROENTEROLOGY, 2017, 152 (05) : S1115 - S1115
  • [46] Sarcasm detection in natural language processing
    Ashwitha, A.
    Shruthi, G.
    Shruthi, H. R.
    Upadhyaya, Makarand
    Ray, Abhra Pratip
    Manjunath, T. C.
    MATERIALS TODAY-PROCEEDINGS, 2021, 37 : 3324 - 3331
  • [47] Fraud detection with natural language processing
    Boulieris, Petros
    Pavlopoulos, John
    Xenos, Alexandros
    Vassalos, Vasilis
    MACHINE LEARNING, 2024, 113 (08) : 5087 - 5108
  • [48] Jurisprudence search in Colombia based on natural language processing (NLP) and Lynked Data
    Camilo Ordonez, Cristian
    Armando Ordonez, Jose
    Ordonez Eraso, Hugo Armando
    Urbano, Franco
    INGE CUC, 2020, 16 (02)
  • [49] Teaching Natural Language Processing (NLP) Using Ontology Based Education Design
    Rehman, Zobia
    Kifor, Stefania
    3RD INTERNATIONAL ENGINEERING AND TECHNOLOGY EDUCATION CONFERENCE & 7TH BALKAN REGION CONFERENCE ON ENGINEERING AND BUSINESS EDUCATION, 2015,
  • [50] Use of Natural Language Processing (NLP) Tools to Assess Digital Literacy Skills
    Rodriguez-Ruiz, Jorge
    Alvarez-Delgado, Alvaro
    Caratozzolo, Patricia
    2021 MACHINE LEARNING-DRIVEN DIGITAL TECHNOLOGIES FOR EDUCATIONAL INNOVATION WORKSHOP, 2021,