AD-NLP: A Benchmark for Anomaly Detection in Natural Language Processing

被引:0
|
作者
Bejan, Matei [1 ]
Manolache, Andrei [2 ,3 ]
Popescu, Marius [1 ]
机构
[1] Univ Bucharest, Bucharest, Romania
[2] Bitdefender, Bucharest, Romania
[3] Univ Stuttgart, Stuttgart, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models have reignited the interest in Anomaly Detection research in recent years. Methods for Anomaly Detection in text have shown strong empirical results on ad-hoc anomaly setups that are usually made by downsampling some classes of a labeled dataset. This can lead to reproducibility issues and models that are biased toward detecting particular anomalies while failing to recognize them in more sophisticated scenarios. In the present work, we provide a unified benchmark for detecting various types of anomalies, focusing on problems that can be naturally formulated as Anomaly Detection in text, ranging from syntax to stylistics. In this way, we are hoping to facilitate research in Text Anomaly Detection. We also evaluate and analyze two strong shallow baselines, as well as two of the current state-of-the-art neural approaches, providing insights into the knowledge the neural models are learning when performing the anomaly detection task. We provide the code for evaluation, downloading, and preprocessing the dataset at https://github.com/mateibejan1/ad-nlp/.
引用
收藏
页码:10766 / 10778
页数:13
相关论文
共 50 条
  • [31] Natural language processing (NLP) aided qualitative method in health research
    Cheligeer, Cheligeer
    Yang, Lin
    Nandi, Tannistha
    Doktorchik, Chelsea
    Quan, Hude
    Zeng, Yong
    Singh, Shaminder
    JOURNAL OF INTEGRATED DESIGN & PROCESS SCIENCE, 2023, 27 (01) : 41 - 58
  • [32] Sentiment Analysis of Multilingual Tweets Based on Natural Language Processing (NLP)
    Bera, Abhijit
    Ghose, Mrinal Kanti
    Pal, Dibyendu Kumar
    INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2021, 10 (04)
  • [33] A NATURAL LANGUAGE PROCESSING (NLP) APPROACH TO AUTOMATE PATIENTS TESTIMONIALS ANALYSIS
    Hayat, P.
    Clemente, C.
    Martenot, V
    Rollot, M.
    VALUE IN HEALTH, 2022, 25 (12) : S424 - S424
  • [34] NLP Scholar: An Interactive Visual Explorer for Natural Language Processing Literature
    Mohammad, Saif M.
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, 2020, : 232 - 255
  • [35] Improved neural machine translation using Natural Language Processing (NLP)
    Sk Hasane Ahammad
    Ruth Ramya Kalangi
    S. Nagendram
    Syed Inthiyaz
    P. Poorna Priya
    Osama S. Faragallah
    Alsharef Mohammad
    Mahmoud M. A. Eid
    Ahmed Nabih Zaki Rashed
    Multimedia Tools and Applications, 2024, 83 : 39335 - 39348
  • [36] Absorption of Natural Language Processing (NLP) Tasks by Information Science (IS): a literature review for tangibilizing the use of NLP by IS
    de Jesus Falcao, Luander Cipriano
    Lopes, Brenner
    Souza, Renato Rocha
    EM QUESTAO, 2022, 28 (01): : 13 - 34
  • [37] Targeting the Benchmark: On Methodology in Current Natural Language Processing Research
    Schlangen, David
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 670 - 674
  • [38] Text processing in an Integrated Development Environment (IDE): Integrating Natural Language Processing (NLP) techniques
    Deane, P.
    De, Hilster, D.
    Meyers, A.
    PC AI, 2001, 15 (05):
  • [39] Natural language processing (NLP) software use in the discovery of incidental lung cancers
    Johnson, Melissa Lynne
    Blakemore, Brook E.
    Baxter, Tammy M.
    Ashiq, Javed
    Moore, Sharon P.
    Smith, Priscilla G.
    Stults, Dawn Michelle
    Burris, Howard A.
    Spigel, David R.
    JOURNAL OF CLINICAL ONCOLOGY, 2016, 34 (15)
  • [40] Extracting Business Process Models using Natural Language Processing (NLP) Techniques
    Sintoris, Konstantinos
    Vergidis, Kostas
    2017 IEEE 19TH CONFERENCE ON BUSINESS INFORMATICS (CBI), VOL 1, 2017, 1 : 135 - 139