On exploiting transformers for detecting explicit song lyrics

被引:2
|
作者
Rospocher, Marco [1 ]
机构
[1] Univ Verona, Verona, Italy
关键词
Transformer-based language models; Convolutional neural networks; Text classification; Explicit content detection;
D O I
10.1016/j.entcom.2022.100508
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Determining if the lyrics of a given song could be hurtful or inappropriate for children is of utmost importance to prevent the reproduction of songs whose textual content is unsuitable for them. This problem can be computationally tackled as a binary classification task, and in the last couple of years various machine learning approaches have been applied to perform this task automatically. In this work, we investigate the automatic detection of explicit song lyrics by leveraging transformer-based language models, i.e., large language representations, unsupervisely built from huge textual corpora, that can be fine-tuned on various natural language processing tasks, such as text classification. We assess the performance of various transformer-based language model classifiers on a dataset consisting of more than 800K lyrics, marked with explicit information. The evaluation shows that while the classifiers built with these powerful tools achieve state-of-the-art performance, they do not outperform lighter and computationally less demanding approaches. We complement this empirical evaluation with further analyses, including an assessment of the performance of these classifiers in a few-shot learning scenario, where they are trained with just few thousands of samples.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] EXPLICIT AND IMPLICIT MEMORY FOR SONG LYRICS
    WALLACE, WT
    SCHULKIND, MD
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1992, 30 (06) : 457 - 457
  • [2] Explicit Language in English Song Lyrics: Should We Be Concerned?
    Bikic, Mila
    Bockaj, Valerija
    [J]. FORMALIZING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2023, 2024, 1816 : 153 - 164
  • [3] Detecting explicit lyrics: a case study in Italian music
    Marco Rospocher
    [J]. Language Resources and Evaluation, 2023, 57 : 849 - 867
  • [4] Detecting explicit lyrics: a case study in Italian music
    Rospocher, Marco
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 849 - 867
  • [6] Lyrics2Song: An Automatic Song Generator for Lyrics Input
    Liu, Aozhi
    Mei, Yaqi
    Zhu, Qingying
    Zhu, Zhaohua
    Cai, Zifeng
    Xie, Zongyang
    Zhang, Manzhi
    Zhang, Shuang
    Xiao, Jing
    [J]. THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 388 - 391
  • [7] WRITING SONG LYRICS
    COFFIN, LW
    [J]. ENGLISH JOURNAL, 1970, 59 (07): : 954 - 955
  • [8] Reading Song Lyrics
    Mergenthal, Silvia
    [J]. ZEITSCHRIFT FUR ANGLISTIK UND AMERIKANISTIK, 2011, 59 (01): : 97 - 98
  • [9] Fractured song lyrics
    Harwood, Stacey
    [J]. HUMOR-INTERNATIONAL JOURNAL OF HUMOR RESEARCH, 2009, 22 (03): : 361 - 370
  • [10] Southern song lyrics
    Brady, EG
    [J]. SOUTHERN CULTURES, 2002, 8 (04) : 2 - 2