An ensemble model of CNN with Bi-LSTM for automatic singer identification

被引:0
|
作者
Mukkamala S. N. V. Jitendra
Y. Radhika
机构
[1] GITAM School of Technology,Department of Computer Science and Engineering
[2] GITAM (Deemed-to-be University),undefined
来源
关键词
Bidirectional long short-term memory; CNN; Gender identification; LSTM-RNN; Music information retrieval; Singer identification; Spectrogram;
D O I
暂无
中图分类号
学科分类号
摘要
In the present-day scenario, gender detection has become significant in content-based multimedia systems. An automated mechanism for gender identification is mainly in demand to process the massive data. Singer identification is a popular topic in music information recommender systems that includes identifying the singer from the song based on the singer’s voice and other background key features like timbre and pitch. Many models like GMM, SVM, and MLP are broadly used for classification and singer identification. Moreover, most current models have limitations where vocals and instrumental music are separated manually, and only vocals are used to build and train the model. To deal with unstructured data like music, the deep learning techniques are very suitable and have exhibited exemplary performance in similar studies. In acoustic modeling, the Deep Neural Networks (DNN) models like convolutional neural networks (CNN) have played a promising role in classifying unstructured and poorly labeled data. In the current study, an ensemble model, a combination of a CNN model with bi-directional LSTM, is considered for singer identification from the spectrogram images generated from the audio clip. CNN models are proven to better handle variable-length input data by identifying the features. Bi-LSTM will yield better accuracy by remembering the essential features over time and addressing temporal contextual information. The experimentation is performed on the Indian songs and MIR-1 k data set, and it is observed that the proposed model has outperformed with a prediction accuracy of 97.4%. The performance of the proposed model is being compared against the existing models in the current study.
引用
收藏
页码:38853 / 38874
页数:21
相关论文
共 50 条
  • [1] An ensemble model of CNN with Bi-LSTM for automatic singer identification
    Jitendra, Mukkamala S. N. V.
    Radhika, Y.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38853 - 38874
  • [2] Automatic cardiac arrhythmia classification based on hybrid 1-D CNN and Bi-LSTM model
    Rahul, Jagdeep
    Sharma, Lakhan Dev
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (01) : 312 - 324
  • [3] Residual CNN + Bi-LSTM model to analyze GPR B scan images
    Özkaya, Umut
    Öztürk, Şaban
    Melgani, Farid
    Seyfi, Levent
    Automation in Construction, 2021, 123
  • [4] A hybrid model based on CNN and Bi-LSTM for urban water demand prediction
    Hu, Piao
    Tong, Jun
    Wang, Jingcheng
    Yang, Yue
    Turci, Luca de Oliveira
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 1088 - 1094
  • [5] A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN
    Zhang, He
    Nan, Zhixiong
    Yang, Tao
    Liu, Yifan
    Meng, Nanning
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 284 - 289
  • [6] DeepBio: A Deep CNN and Bi-LSTM Learning for Person Identification Using Ear Biometrics
    Mahajan, Anshul
    Singla, Sunil K.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (02): : 1623 - 1649
  • [7] Residual CNN plus Bi-LSTM model to analyze GPR B scan images
    Ozkaya, Umut
    Ozturk, Saban
    Melgani, Farid
    Seyfi, Levent
    AUTOMATION IN CONSTRUCTION, 2021, 123
  • [8] ABCNet: A comprehensive highway visibility prediction model based on attention, Bi-LSTM and CNN
    Li, Wen
    Yang, Xuekun
    Yuan, Guowu
    Xu, Dan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4397 - 4420
  • [9] Walking Pattern Identification of FMCW Radar Data based on a Combined CNN and bi-LSTM Approach
    Nocera, Antonio
    Senigagliesi, Linda
    Ciatiaglia, Giaiduca
    Gambi, Ermio
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 275 - 280
  • [10] CNN-Bi-LSTM: A Complex Environment-Oriented Cattle Behavior Classification Network Based on the Fusion of CNN and Bi-LSTM
    Gao, Guohong
    Wang, Chengchao
    Wang, Jianping
    Lv, Yingying
    Li, Qian
    Ma, Yuxin
    Zhang, Xueyan
    Li, Zhiyu
    Chen, Guanglan
    SENSORS, 2023, 23 (18)