Voice Detection in Traditionnal Tunisian Music using Audio Features and Supervised Learning Algorithms

被引:0
|
作者
Ziadi, Wissem [1 ]
机构
[1] ENIT, Tunisian Natl Sch Engn, Signal Images Informat Technol LR SITI, Tunis, Tunisia
关键词
Tunisian voice timbre; audio features extraction; singing voice detection; sung/instrumental discrimination; supervised learning algorithms;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The research presented in this paper aims to automatically detect the singing voice in traditional Tunisian music, taking into account the main characteristics of the sound of the voice in this particular music style. This means creating the possibility to automatically identify instrumental and singing sounds. Therefore different methods for the automatic classification of sounds using supervised learning algorithms were compared and evaluated. The research is divided into four successive stages. First, the extraction of features vectors from the audio tracks (through calculation of the parameters of sound perception) followed by the selection and transformation process of relevant features for singing/instrumental discrimination. Then, using learning algorithms, the instrumental and vocal classes were modeled from a manually annotated database. Finally, the evaluation of the decision-making process (indexing) was applied on the test part of the database. The musical databases used for this study consists of extracts from the national sound archives of Centre of Mediterranean and Arabic Music (CMAM) and recordings made especially for this research. The possibility to index audio data (classify/segment) into vocal and instrumental recognition allows for the retrieval of content-based information of musical databases.
引用
收藏
页码:26 / 31
页数:6
相关论文
共 50 条
  • [1] OPINION MINING AND CLASSIFICATION OF MUSIC LYRICS USING SUPERVISED LEARNING ALGORITHMS
    Ahuja, Mahesh
    Sangal, A. L.
    [J]. 2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 223 - 227
  • [2] Insider threat detection using supervised machine learning algorithms
    Manoharan, Phavithra
    Yin, Jiao
    Wang, Hua
    Zhang, Yanchun
    Ye, Wenjie
    [J]. TELECOMMUNICATION SYSTEMS, 2023, 87 (04) : 899 - 915
  • [3] Noise-Tolerant Self-Supervised Learning for Audio-Visual Voice Activity Detection
    Kim, Ui-Hyun
    [J]. INTERSPEECH 2021, 2021, : 326 - 330
  • [4] Supervised Contrastive Learning for Voice Activity Detection
    Heo, Youngjun
    Lee, Sunggu
    [J]. ELECTRONICS, 2023, 12 (03)
  • [5] Music Emotion Recognition with the Extraction of Audio Features Using Machine Learning Approaches
    Juthi, Jannatul Humayra
    Gomes, Anthony
    Bhuiyan, Touhid
    Mahmud, Imran
    [J]. PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 318 - 329
  • [6] Automatic hate speech detection in audio using machine learning algorithms
    Joan L. Imbwaga
    Nagatatna B. Chittaragi
    Shashidhar G. Koolagudi
    [J]. International Journal of Speech Technology, 2024, 27 (2) : 447 - 469
  • [7] Pathological voice detection and binary classification using MPEG-7 audio features
    Muhammad, Ghulam
    Melhem, Moutasem
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 11 : 1 - 9
  • [8] Target detection using supervised machine learning algorithms for GPR data
    Smitha, N.
    Singh, Vipula
    [J]. SENSING AND IMAGING, 2020, 21 (01):
  • [9] Target detection using supervised machine learning algorithms for GPR data
    N. Smitha
    Vipula Singh
    [J]. Sensing and Imaging, 2020, 21
  • [10] Fall Detection Using Supervised Machine Learning Algorithms: A Comparative Study
    Zerrouki, Nabil
    Harrou, Fouzi
    Houacine, Amrane
    Sun, Ying
    [J]. PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 665 - 670