An Ensemble Approach to Cross-Domain Authorship Attribution

被引:7
|
作者
Custodio, Jose Eleandro [1 ]
Paraboni, Ivandre [1 ]
机构
[1] Univ Sao Paulo, Sch Arts Sci & Humanities EACH, Sao Paulo, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1007/978-3-030-28577-7_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an ensemble approach to cross-domain authorship attribution that combines predictions made by three independent classifiers, namely, standard character n-grams, character n-grams with non-diacritic distortion and word n-grams. Our proposal relies on variable-length n-gram models and multinomial logistic regression to select the prediction of highest probability among the three models as the output for the task. The present approach is compared against a number of baseline systems, and we report results based on both the PAN-CLEF 2018 test data, and on a new corpus of song lyrics in English and Portuguese.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 50 条
  • [31] The complex networks approach for authorship attribution of books
    Mehri, Ali
    Darooneh, Amir H.
    Shariati, Ashrafalsadat
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (07) : 2429 - 2437
  • [32] A Computational Approach for Authorship Attribution on Multiple Languages
    Varela, Paulo J.
    Albonico, Michel
    Justino, Edson J. R.
    Bortolozzi, Flavin
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [33] New approach to feature extraction in authorship attribution
    Gonzalez Brito, Omar
    Tapia Fabela, Jose Luis
    Salas Hernandez, Silvia
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2021, 12 (03): : 87 - 97
  • [34] Cross-Domain Disentanglement: A Novel Approach to Financial Market Prediction
    Abdulsahib, Humam M.
    Ghaderi, Foad
    IEEE ACCESS, 2024, 12 : 16255 - 16265
  • [35] A constrained optimization approach for cross-domain emotion distribution learning
    Qin, Xiaorui
    Chen, Yufu
    Rao, Yanghui
    Xie, Haoran
    Wong, Man Leung
    Wang, Fu Lee
    KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [36] An NLP approach for cross-domain ambiguity detection in requirements engineering
    Ferrari, Alessio
    Esuli, Andrea
    AUTOMATED SOFTWARE ENGINEERING, 2019, 26 (03) : 559 - 598
  • [37] CROSS-DOMAIN SENTIMENT CLASSIFICATION USING DEEP LEARNING APPROACH
    Sun, Miao
    Tan, Qi
    Ding, Runwei
    Liu, Hong
    2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 60 - 64
  • [38] A Realistic Approach to a Network-on-Chip Cross-Domain Pattern
    Larrucea, Asier
    Perez, Jon
    Nicolas, Carlos F.
    Ahmadian, Hamidreza
    Obermaisser, Roman
    19TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2016), 2016, : 396 - 403
  • [39] Automatic Driving Scenarios: A Cross-Domain Approach for Object Detection
    Liu, Shengheng
    Chen, Jiacheng
    Li, Lei
    Ma, Yahui
    Huang, Yongming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 40 - 52
  • [40] An NLP approach for cross-domain ambiguity detection in requirements engineering
    Alessio Ferrari
    Andrea Esuli
    Automated Software Engineering, 2019, 26 : 559 - 598