Non-intrusive method for audio quality assessment of lossy-compressed music recordings using convolutional neural networks

Authors
Kasperuk, Aleksandra [1]
Zielinski, Slawomir Krzysztof [1]
Affiliations
[1] Bialystok Tech Univ, Fac Comp Sci, Bialystok, Poland
Keywords
objective audio quality assessment; non-intrusive audio quality evaluation; convolutional neural networks; MODEL
DOI
10.24425/ijet.2024.149549
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
Most of the existing algorithms for objective audio quality assessment are intrusive, as they require access to both an unimpaired reference recording and the evaluated signal. This requirement excludes them from many practical applications. In this paper, we introduce a non-intrusive audio quality assessment method. The proposed method is intended to account for audio artefacts arising from the lossy compression of music signals. During its development, 250 high-quality uncompressed music recordings were collated. They were subsequently processed using a selection of five popular audio codecs, resulting in a repository of 13,000 audio excerpts representing various levels of audio quality. The proposed non-intrusive method was trained with data obtained by employing a well-established intrusive model (ViSQOL v3). Next, the performance of the trained model was evaluated using the quality scores obtained in subjective listening tests undertaken remotely over the Internet. The listening tests were carried out in compliance with the MUSHRA recommendation (ITU-R BS.1534-3). In this study, the following three convolutional neural networks were compared: (1) a model employing 1D convolutional filters, (2) an Inception-based model, and (3) a VGG-based model. The last-mentioned model outperformed the model employing 1D convolutional filters in terms of predicting the scores from the listening tests, reaching [...] framework, recently introduced by Mumtaz et al. (2022).
Pages: 331-339
Number of pages: 9