An ensemble model of CNN with Bi-LSTM for automatic singer identification

被引:0
|
作者
Mukkamala S. N. V. Jitendra
Y. Radhika
机构
[1] GITAM School of Technology,Department of Computer Science and Engineering
[2] GITAM (Deemed-to-be University),undefined
来源
关键词
Bidirectional long short-term memory; CNN; Gender identification; LSTM-RNN; Music information retrieval; Singer identification; Spectrogram;
D O I
暂无
中图分类号
学科分类号
摘要
In the present-day scenario, gender detection has become significant in content-based multimedia systems. An automated mechanism for gender identification is mainly in demand to process the massive data. Singer identification is a popular topic in music information recommender systems that includes identifying the singer from the song based on the singer’s voice and other background key features like timbre and pitch. Many models like GMM, SVM, and MLP are broadly used for classification and singer identification. Moreover, most current models have limitations where vocals and instrumental music are separated manually, and only vocals are used to build and train the model. To deal with unstructured data like music, the deep learning techniques are very suitable and have exhibited exemplary performance in similar studies. In acoustic modeling, the Deep Neural Networks (DNN) models like convolutional neural networks (CNN) have played a promising role in classifying unstructured and poorly labeled data. In the current study, an ensemble model, a combination of a CNN model with bi-directional LSTM, is considered for singer identification from the spectrogram images generated from the audio clip. CNN models are proven to better handle variable-length input data by identifying the features. Bi-LSTM will yield better accuracy by remembering the essential features over time and addressing temporal contextual information. The experimentation is performed on the Indian songs and MIR-1 k data set, and it is observed that the proposed model has outperformed with a prediction accuracy of 97.4%. The performance of the proposed model is being compared against the existing models in the current study.
引用
收藏
页码:38853 / 38874
页数:21
相关论文
共 50 条
  • [41] Trajectory outlier detection algorithm based on Bi-LSTM model
    Han Z.
    Huang T.
    Ren W.
    Xu G.
    Journal of Radars, 2019, 8 (01) : 36 - 43
  • [42] Evaluating a Bi-LSTM Model for Metaphor Detection in TOEFL Essays
    Kuo, Kevin
    Carpuat, Marine
    FIGURATIVE LANGUAGE PROCESSING, 2020, : 192 - 196
  • [43] Ship Trajectory Prediction Model Based on Improved Bi-LSTM
    Li, Weifeng
    Lian, Yifan
    Liu, Yaochen
    Shi, Guoyou
    ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2024, 10 (03):
  • [44] A Novel Wireless Propagation Model Based on Bi-LSTM Algorithm
    Yang, Yu Lu
    Wan, Guo Chun
    Tong, Mei Song
    IEEE ACCESS, 2022, 10 : 43837 - 43847
  • [45] A Novel Model Combining Transformer and Bi-LSTM for News Categorization
    Liu, Yuanzhi
    He, Min
    Shi, Mengjia
    Jeon, Seunggil
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 4862 - 4869
  • [46] A Robust License Plate Recognition Model Based on Bi-LSTM
    Zou, Yongjie
    Zhang, Yongjun
    Yan, Jun
    Jiang, Xiaoxu
    Huang, Tengjie
    Fan, Haisheng
    Cui, Zhongwei
    IEEE ACCESS, 2020, 8 : 211630 - 211641
  • [47] Stacked BI-LSTM and E-Optimized CNN-A Hybrid Deep Learning Model for Stock Price Prediction
    Rath, Swarnalata
    Das, Nilima R.
    Pattanayak, Binod Kumar
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (02) : 102 - 120
  • [48] Gold price prediction by a CNN-Bi-LSTM model along with automatic parameter tuning
    Amini, Amirhossein
    Kalantari, Robab
    PLOS ONE, 2024, 19 (03):
  • [49] Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism
    Jang, Beakcheol
    Kim, Myeonghwi
    Harerimana, Gaspard
    Kang, Sang-ug
    Kim, Jong Wook
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [50] Multi-stations' Weather Prediction Based on Hybrid Model Using 1D CNN and Bi-LSTM
    Fu, Qi
    Niu, Dan
    Zang, Zengliang
    Huang, Junhao
    Diao, Li
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 3771 - 3775