Classification of Cleft Lip and Palate Speech Using Fine-Tuned Transformer Pretrained Models

被引:0
|
作者
Bhattacharjee, Susmita [1 ]
Shekhawat, H. S. [1 ]
Prasanna, S. R. M. [2 ]
机构
[1] Indian Inst Technol Guwahati, Gauhati, India
[2] Indian Inst Technol Dharwad, Dharwad, Karnataka, India
关键词
Cleft lip and palate speech; Wav2Vec2; SEW; SEW-D; UniSpeechSat; HuBERT; DistilHuBERT;
D O I
10.1007/978-3-031-53827-8_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cleft lip and palate speech (CLP) is a cranio-facial disorder which leads to spectro-temporal distortions in the speech of an individual. This makes accessibility of CLP speakers to speech enabled applications which require Human-computer interaction (HCI) such as voice assistants very challenging. Recently the availability of pretrained models have made the constraint of low resource language very convenient. Recent findings have proven that pretrained transformer models perform way ahead of traditional classifiers. In this paper, with an aim to achieve high end classification results, pretrained Transformer models fine-tuned on CLP data are used. The results obtained from the transformer models such as Wav2Vec2, SEW, SEW-D, UniSpeechSat, HuBERT, DistilHuBERT showed a comparative performance of the models and specially DistilHuBERT showed a significant improvement in the accuracy being close to 100%.
引用
收藏
页码:55 / 61
页数:7
相关论文
共 50 条
  • [1] FuNet-40: fundus disease/abnormality classification using ensemble of fine-tuned pretrained convolution models
    Mittal, Ansh
    2
    S., Gupta
    V., Srivastava
    A., Balodi
    M., Tolani
    [J]. Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization, 2024, 12 (01):
  • [2] Automatic Component Prediction for Issue Reports Using Fine-Tuned Pretrained Language Models
    Wang, Dae-Sung
    Lee, Chan-Gun
    [J]. IEEE ACCESS, 2022, 10 : 131456 - 131468
  • [3] Genealogical Relationship Extraction from Unstructured Text Using Fine-Tuned Transformer Models
    Parrolivelli, Carloangello
    Stanchev, Lubomir
    [J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 167 - 174
  • [4] Tissue and Tumor Epithelium Classification using Fine-tuned Deep CNN Models
    Anju, T. E.
    Vimala, S.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 306 - 314
  • [5] Speech after repair of isolated cleft palate and cleft lip and palate
    Timmons, MJ
    Wyatt, RA
    Murphy, T
    [J]. BRITISH JOURNAL OF PLASTIC SURGERY, 2001, 54 (05): : 377 - 384
  • [6] Facial grimace during speech in cleft lip and palate: a proposal for classification
    Scarmagnani, Rafaeli Higa
    Fukushiro, Ana Paula
    Yamashita, Renata Paciello
    [J]. CODAS, 2022, 34 (03):
  • [7] CLEFT-LIP AND PALATE CLASSIFICATION
    KERNAHAN, DA
    [J]. PLASTIC AND RECONSTRUCTIVE SURGERY, 1973, 51 (05) : 578 - 578
  • [8] A revised classification of the cleft lip and palate
    Khan, Mansoor
    Ullah, Hidayat
    Naz, Shazia
    Iqbal, Tariq
    Ullah, Tahmeed
    Tahir, Muhammad
    Ullah, Obaid
    [J]. CANADIAN JOURNAL OF PLASTIC SURGERY, 2013, 21 (01): : 48 - 50
  • [9] Classification and Etiology of Cleft Lip and Palate
    Essig, H.
    Kokemueller, H.
    Ruecker, M.
    Gellrich, N. -C.
    [J]. SPRACHE-STIMME-GEHOR, 2012, 36 (02): : 57 - 60
  • [10] Fine-Tuned Transformer Model for Sentiment Analysis
    Liu, Sishun
    Shuai, Pengju
    Zhang, Xiaowu
    Chen, Shuang
    Li, Li
    Liu, Ming
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 336 - 343