Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

被引:3
|
作者
Hou, Yuanbo [1 ]
Song, Siyang [2 ]
Yu, Chuang [3 ]
Wang, Wenwu [4 ]
Botteldooren, Dick [1 ]
机构
[1] Univ Ghent, WAVES Res Grp, B-9000 Ghent, Belgium
[2] Univ Leicester, Sch Comp & Math Sci, Leicester LE17RH, Leics, England
[3] UCL, UCL Interact Ctr, London WC1E 6EA, England
[4] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
关键词
Acoustic scene classification; event-relational graph; multi-dimensional edge; graph representation learning; NEURAL-NETWORKS;
D O I
10.1109/LSP.2023.3319233
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most deep learning-based acoustic scene classification (ASC) approaches identify scenes based on acoustic features converted from audio clips containing mixed information entangled by polyphonic audio events (AEs). However, these approaches have difficulties in explaining what cues they use to identify scenes. This letter conducts the first study on disclosing the relationship between real-life acoustic scenes and semantic embeddings from the most relevant AEs. Specifically, we propose an event-relational graph representation learning (ERGL) framework for ASC to classify scenes, and simultaneously answer clearly and straightly which cues are used in classifying. In the event-relational graph, embeddings of each event are treated as nodes, while relationship cues derived fromeach pair of nodes are described by multi-dimensional edge features. Experiments on a real-life ASC dataset show that the proposed ERGL achieves competitive performance on ASC by learning embeddings of only a limited number of AEs. The results show the feasibility of recognizing diverse acoustic scenes based on the audio event-relational graph.
引用
收藏
页码:1382 / 1386
页数:5
相关论文
共 50 条
  • [1] Supervised Representation Learning for Audio Scene Classification
    Rakotomamonjy, Alain
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1253 - 1265
  • [2] Learning audio sequence representations for acoustic event classification
    Zhang, Zixing
    Liu, Ding
    Han, Jing
    Qian, Kun
    Schuller, Bjorn W.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 178
  • [3] Graph-based Representation of Audio signals for Sound Event Classification
    Aironi, Carlo
    Cornell, Samuele
    Principi, Emanuele
    Squartini, Stefano
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 566 - 570
  • [4] Acoustic Scene Classification using Audio Tagging
    Jung, Jee-weon
    Shim, Hye-jin
    Kim, Ju-ho
    Kim, Seung-bin
    Yu, Ha-Jin
    [J]. INTERSPEECH 2020, 2020, : 1176 - 1180
  • [5] Learning Hierarchy Aware Embedding From Raw Audio for Acoustic Scene Classification
    Abrol, Vinayak
    Sharma, Pulkit
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1964 - 1973
  • [6] Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification
    Chaudhuri, Sourish
    Harvilla, Mark
    Raj, Bhiksha
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2276 - 2279
  • [7] An Audio Data Representation for Traffic Acoustic Scene Recognition
    Jiang, Dazhi
    Huang, Dongmin
    Song, Youyi
    Wu, Kaichao
    Lu, Huakang
    Liu, Quanquan
    Zhou, Teng
    [J]. IEEE ACCESS, 2020, 8 : 177863 - 177873
  • [8] Cooperative Scene-Event Modelling for Acoustic Scene Classification
    Hou, Yuanbo
    Kang, Bo
    Mitchell, Andrew
    Wang, Wenwu
    Kang, Jian
    Botteldooren, Dick
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (32) : 68 - 82
  • [9] Sparse Representation Frameworks for Acoustic Scene Classification
    Tyagi, Akansha
    Rajan, Padmanabhan
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 177 - 188
  • [10] Deep Learning Based Audio Scene Classification
    Sophiya, E.
    Jothilakshmi, S.
    [J]. COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS: MODELS AND TECHNIQUES FOR INTELLIGENT SYSTEMS AND AUTOMATION, 2018, 844 : 98 - 109