Ontology-aware Learning and Evaluation for Audio Tagging

被引:0
|
作者
Liu, Haohe [1 ]
Kong, Qiuqiang [2 ]
Liu, Xubo [1 ]
Mei, Xinhao [1 ]
Wang, Wenwu [1 ]
Plumbley, Mark D. [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford, Surrey, England
[2] ByteDance, Speech Audio & Mus Intelligence SAMI Grp, Beijing, Peoples R China
来源
关键词
machine learning; audio tagging; ontology; evaluation metric; CLASSIFICATION;
D O I
10.21437/Interspeech.2023-979
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study defines a new evaluation metric for audio tagging tasks to alleviate the limitation of the mean average precision (mAP) metric. The mAP metric treats different kinds of sound as independent classes without considering their relations. The proposed metric, ontology-aware mean average precision (OmAP), addresses the weaknesses of mAP by utilizing additional ontology during evaluation. Specifically, we reweight the false positive events in the model prediction based on the AudioSet ontology graph distance to the target classes. The OmAP also provides insights into model performance by evaluating different coarse-grained levels in the ontology graph. We conduct a human assessment and show that OmAP is more consistent with human perception than mAP. We also propose an ontology-based loss function (OBCE) that reweights binary cross entropy (BCE) loss based on the ontology distance. Our experiment shows that OBCE can improve both mAP and OmAP metrics on the AudioSet tagging task.
引用
收藏
页码:3799 / 3803
页数:5
相关论文
共 50 条
  • [1] Learning ontology-aware classifiers
    Zhang, J
    Caragea, D
    Honavar, V
    DISCOVERY SCIENCE, PROCEEDINGS, 2005, 3735 : 308 - 321
  • [2] AN ONTOLOGY-AWARE FRAMEWORK FOR AUDIO EVENT CLASSIFICATION
    Sun, Yiwei
    Ghaffarzadegan, Shabnam
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 321 - 325
  • [3] Automatically disambiguating medical acronyms with ontology-aware deep learning
    Skreta, Marta
    Arbabi, Aryan
    Wang, Jixuan
    Drysdale, Erik
    Kelly, Jacob
    Singh, Devin
    Brudno, Michael
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [4] Automatically disambiguating medical acronyms with ontology-aware deep learning
    Marta Skreta
    Aryan Arbabi
    Jixuan Wang
    Erik Drysdale
    Jacob Kelly
    Devin Singh
    Michael Brudno
    Nature Communications, 12
  • [5] Towards a Complete Ontology-Aware Authoring Tool for Collaborative Learning
    Isotani, Seiji
    Mizoguchi, Riichiro
    SUPPORTING LEARNING FLOW THROUGH INTEGRATIVE TECHNOLOGIES, 2007, 162 : 647 - 648
  • [6] Ontology-Aware Overlapping Event Extraction
    Wu, Zhichen
    Zhang, Hongbin
    Cheng, Lianglun
    Wang, Tao
    Chen, Chong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14875 : 184 - 196
  • [7] Ontology-Aware Biomedical Relation Extraction
    Aghaebrahimian, Ahmad
    Anisimova, Maria
    Gil, Manuel
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 160 - 171
  • [8] Ontology-Aware Clinical Abstractive Summarization
    MacAvaney, Sean
    Sotudeh, Sajad
    Cohan, Arman
    Goharian, Nazli
    Talati, Ish
    Filice, Ross W.
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1013 - 1016
  • [9] Uncertain Ontology-Aware Knowledge Graph Embeddings
    Boutouhami, Khaoula
    Zhang, Jiatao
    Qi, Guilin
    Gao, Huan
    SEMANTIC TECHNOLOGY, JIST 2019, 2020, 1157 : 129 - 136
  • [10] Graph Node Embeddings for ontology-aware Sound Event Classification: an evaluation study
    Aironi, Carlo
    Cornell, Samuele
    Principi, Emanuele
    Squartini, Stefano
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 414 - 418