An Effective Hierarchical Graph Attention Network Modeling Approach for Pronunciation Assessment

被引:0
|
作者
Yan, Bi-Cheng [1 ]
Chen, Berlin [1 ]
机构
[1] Natl Taiwan Normal Univ, Dept Comp Sci & Informat Engn, Taipei 11677, Taiwan
关键词
Linguistics; Stress; Accuracy; Training; Feature extraction; Task analysis; Predictive models; Automatic pronunciation assessment (APA); computer-assisted pronunciation training; deep regression models; pre-training mechanism; MULTI-GRANULARITY; SPEECH RECOGNITION; PITCH;
D O I
10.1109/TASLP.2024.3449111
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic pronunciation assessment (APA) manages to quantify second language (L2) learners' pronunciation proficiency in a target language by providing fine-grained feedback with multiple aspect scores (e.g., accuracy, fluency, and completeness) at various linguistic levels (i.e., phone, word, and utterance). Most of the existing efforts commonly follow a parallel modeling framework, which takes a sequence of phone-level pronunciation feature embeddings of a learner's utterance as input and then predicts multiple aspect scores across various linguistic levels. However, these approaches neither take the hierarchy of linguistic units into account nor consider the relatedness among the pronunciation aspects in an explicit manner. In light of this, we put forward an effective modeling approach for APA, termed HierGAT, which is grounded on a hierarchical graph attention network. Our approach facilitates hierarchical modeling of the input utterance as a heterogeneous graph that contains linguistic nodes at various levels of granularity. On top of the tactfully designed hierarchical graph message passing mechanism, intricate interdependencies within and across different linguistic levels are encapsulated and the language hierarchy of an utterance is factored in as well. Furthermore, we also design a novel aspect attention module to encode relatedness among aspects. To our knowledge, we are the first to introduce multiple types of linguistic nodes into graph-based neural networks for APA and perform a comprehensive qualitative analysis to investigate their merits. A series of experiments conducted on the speechocean762 benchmark dataset suggests the feasibility and effectiveness of our approach in relation to several competitive baselines.
引用
收藏
页码:3974 / 3985
页数:12
相关论文
共 50 条
  • [31] A Hierarchical Heterogeneous Graph Attention Network for Emotion-Cause Pair Extraction
    Yu, Jiaxin
    Liu, Wenyuan
    He, Yongjun
    Zhong, Bineng
    ELECTRONICS, 2022, 11 (18)
  • [32] GACaps-HTC: graph attention capsule network for hierarchical text classification
    Bang, Jinhyun
    Park, Jonghun
    Park, Jonghyuk
    APPLIED INTELLIGENCE, 2023, 53 (17) : 20577 - 20594
  • [33] Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits
    Sacha, Mikolaj
    Blaz, Mikolaj
    Byrski, Piotr
    Dabrowski-Tumanski, Pawel
    Chrominski, Mikolaj
    Loska, Rafal
    Wlodarczyk-Pruszynski, Pawel
    Jastrzebski, Stanislaw
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (07) : 3273 - 3284
  • [34] A Novel API Recommendation Approach By Using Graph Attention Network
    Chen, Zijie
    Zhang, Tao
    Peng, Xiao
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 726 - 737
  • [35] Explainable train delay propagation: A graph attention network approach
    Huang, Ping
    Guo, Jingwei
    Liu, Shu
    Corman, Francesco
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 184
  • [36] Deep Soft Error Propagation Modeling Using Graph Attention Network
    Junchi Ma
    Zongtao Duan
    Lei Tang
    Journal of Electronic Testing, 2022, 38 : 303 - 319
  • [37] A novel Graph Attention Network Architecture for modeling multimodal brain connectivity
    Filip, Alexandru-Catalin
    Azevedo, Tiago
    Passamonti, Luca
    Toschi, Nicola
    Lio, Pietro
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1071 - 1074
  • [38] Graph Attention Convolutional Network: Spatiotemporal Modeling for Urban Traffic Prediction
    Song, Qingyu
    Ming, RuiBo
    Hu, Jianming
    Niu, Haoyi
    Gao, Mingyang
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [39] Deep Soft Error Propagation Modeling Using Graph Attention Network
    Ma, Junchi
    Duan, Zongtao
    Tang, Lei
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2022, 38 (03): : 303 - 319
  • [40] Hierarchical Graph Representation Learning with Structural Attention for Graph Classification
    Yu, Bin
    Xu, Xinhang
    Wen, Chao
    Xie, Yu
    Zhang, Chen
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 473 - 484