Non-intrusive method for audio quality assessment of lossy-compressed music recordings using convolutional neural networks

被引:0
|
作者
Kasperuk, Aleksandra [1 ]
Zielinski, Slawomir Krzysztof [1 ]
机构
[1] Bialystok Tech Univ, Fac Comp Sci, Bialystok, Poland
关键词
- objective audio quality assessment; non-intrusive audio quality evaluation; convolutional neural networks; MODEL;
D O I
10.24425/ijet.2024.149549
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
of the existing algorithms for the objective audio quality assessment are intrusive, as they require access both to an unimpaired reference recording and an evaluated signal. This feature excludes them from many practical applications. In this paper, we introduce a non-intrusive audio quality assessment method. The proposed method is intended to account for audio artefacts arising from the lossy compression of music signals. During its development, 250 high-quality uncompressed music recordings were collated. They were subsequently processed using the selection of five popular audio codecs, resulting in the repository of 13,000 audio excerpts representing various levels of audio quality. The proposed non-intrusive method was trained with the data obtained employing a well-established intrusive model (ViSQOL v3). Next, the performance of the trained model was evaluated utilizing the quality scores obtained in the subjective listening tests undertaken remotely over the Internet. The listening tests were carried out in compliance with the MUSHRA recommendation (ITU-R BS.1534-3). In this study, the following three convolutional neural networks were compared: (1) a model employing 1D convolutional filters, (2) an Inception-based model, and (3) a VGG-based model. The last-mentioned model outperformed the model employing 1D convolutional filters in terms of predicting the scores from the listening tests, reaching framework, recently introduced by Mumtaz et al. (2022).
引用
收藏
页码:331 / 339
页数:9
相关论文
共 50 条
  • [41] A New Non-intrusive Assessment Method for VoLTE Quality Based on Extended E-Model
    Duy-Huy Nguyen
    Nguyen, Hang
    Renault, Eric
    MOBILE, SECURE, AND PROGRAMMABLE NETWORKING (MSPN 2016), 2016, 10026 : 122 - 136
  • [42] AUTOMATED QUALITY ASSESSMENT OF APPLES USING CONVOLUTIONAL NEURAL NETWORKS
    Iosif, Adrian
    Maican, Edmond
    Biris, Sorin
    Popa, Lucretia
    INMATEH-AGRICULTURAL ENGINEERING, 2023, 71 (03): : 483 - 498
  • [43] Non-intrusive quality assessment of noise-suppressed speech using unsupervised deep features
    Soni, Meet H.
    Patil, Hemant A.
    SPEECH COMMUNICATION, 2021, 130 : 27 - 44
  • [44] Non-intrusive Objective Speech Quality Assessment using a Combination of MFCC, PLP and LSF Features
    Dubey, Rajesh Kumar
    Kumar, Arun
    2013 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSC), 2013, : 297 - 302
  • [45] Towards Blind Audio Quality Assessment using a Convolutional-Recurrent Neural Network
    Zheng, Xiguang
    Zhang, Chen
    2021 13TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2021, : 91 - 96
  • [46] Farmland quality assessment using deep fully convolutional neural networks
    Junxiao Wang
    Xingong Li
    Xiaorui Wang
    Shenglu Zhou
    Yanjun Luo
    Environmental Monitoring and Assessment, 2023, 195
  • [47] Hierarchical aesthetic quality assessment using deep convolutional neural networks
    Kao, Yueying
    Huang, Kaiqi
    Maybank, Steve
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 500 - 510
  • [48] Farmland quality assessment using deep fully convolutional neural networks
    Wang, Junxiao
    Li, Xingong
    Wang, Xiaorui
    Zhou, Shenglu
    Luo, Yanjun
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (01)
  • [49] No reference image quality assessment using gabor convolutional neural networks
    Vadlamudi, Jyothisri
    Khan, Sameeulla
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (02):
  • [50] Image Quality Assessment Using Combination of Deep Convolutional Neural Networks
    Shikkenawis, Gitam
    Mitra, Suman K.
    Saxena, Ashutosh
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 3 - 11