An Effective Hierarchical Graph Attention Network Modeling Approach for Pronunciation Assessment

被引:0
|
作者
Yan, Bi-Cheng [1 ]
Chen, Berlin [1 ]
机构
[1] Natl Taiwan Normal Univ, Dept Comp Sci & Informat Engn, Taipei 11677, Taiwan
关键词
Linguistics; Stress; Accuracy; Training; Feature extraction; Task analysis; Predictive models; Automatic pronunciation assessment (APA); computer-assisted pronunciation training; deep regression models; pre-training mechanism; MULTI-GRANULARITY; SPEECH RECOGNITION; PITCH;
D O I
10.1109/TASLP.2024.3449111
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic pronunciation assessment (APA) manages to quantify second language (L2) learners' pronunciation proficiency in a target language by providing fine-grained feedback with multiple aspect scores (e.g., accuracy, fluency, and completeness) at various linguistic levels (i.e., phone, word, and utterance). Most of the existing efforts commonly follow a parallel modeling framework, which takes a sequence of phone-level pronunciation feature embeddings of a learner's utterance as input and then predicts multiple aspect scores across various linguistic levels. However, these approaches neither take the hierarchy of linguistic units into account nor consider the relatedness among the pronunciation aspects in an explicit manner. In light of this, we put forward an effective modeling approach for APA, termed HierGAT, which is grounded on a hierarchical graph attention network. Our approach facilitates hierarchical modeling of the input utterance as a heterogeneous graph that contains linguistic nodes at various levels of granularity. On top of the tactfully designed hierarchical graph message passing mechanism, intricate interdependencies within and across different linguistic levels are encapsulated and the language hierarchy of an utterance is factored in as well. Furthermore, we also design a novel aspect attention module to encode relatedness among aspects. To our knowledge, we are the first to introduce multiple types of linguistic nodes into graph-based neural networks for APA and perform a comprehensive qualitative analysis to investigate their merits. A series of experiments conducted on the speechocean762 benchmark dataset suggests the feasibility and effectiveness of our approach in relation to several competitive baselines.
引用
收藏
页码:3974 / 3985
页数:12
相关论文
共 50 条
  • [21] Collective Link Prediction Oriented Network Embedding with Hierarchical Graph Attention
    Jiao, Yizhu
    Xiong, Yun
    Zhang, Jiawei
    Zhu, Yangyong
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 419 - 428
  • [22] Hierarchical graph attention network for miRNA-disease association prediction
    Li, Zhengwei
    Zhong, Tangbo
    Huang, Deshuang
    You, Zhu-Hong
    Nie, Ru
    MOLECULAR THERAPY, 2022, 30 (04) : 1775 - 1786
  • [23] A Hierarchical Attention Graph Convolutional Network for Traffic Incident Impact Forecasting
    Fu, Kaiqun
    Ji, Taoran
    Self, Nathan
    Chen, Zhiqian
    Lu, Chang-Tien
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1619 - 1624
  • [24] Hierarchical Neighbor Propagation With Bidirectional Graph Attention Network for Relation Prediction
    Xie, Zhiwen
    Zhu, Runjie
    Liu, Jin
    Zhou, Guangyou
    Huang, Jimmy Xiangji
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1762 - 1773
  • [25] Hierarchical Random Graph based Network Diversity Modeling for the Cloud
    Yuchi, Xuebiao
    Shetty, Sachin
    PROCEEDINGS 2016 IEEE WORLD CONGRESS ON SERVICES - SERVICES 2016, 2016, : 35 - 38
  • [26] Learning Effective Road Network Representation with Hierarchical Graph Neural Networks
    Wu, Ning
    Zhao, Wayne Xin
    Wang, Jingyuan
    Pan, Dayan
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 6 - 14
  • [27] Multi-Agent Actor-Critic with Hierarchical Graph Attention Network
    Ryu, Heechang
    Shin, Hayong
    Park, Jinkyoo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7236 - 7243
  • [28] GACaps-HTC: graph attention capsule network for hierarchical text classification
    Jinhyun Bang
    Jonghun Park
    Jonghyuk Park
    Applied Intelligence, 2023, 53 : 20577 - 20594
  • [29] A Multi-channel Hierarchical Graph Attention Network for Open Event Extraction
    Wan, Qizhi
    Wan, Changxuan
    Xiao, Keli
    Hu, Rong
    Liu, Dexi
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (01)
  • [30] BHGAttN: A Feature-Enhanced Hierarchical Graph Attention Network for Sentiment Analysis
    Zhang, Junjun
    Cui, Zhengyan
    Park, Hyun Jun
    Noh, Giseop
    ENTROPY, 2022, 24 (11)