ICA-based hierarchical text classification for multi-domain text-to-speech synthesis

被引:0
|
作者
Sevillano, X [1 ]
Alías, F [1 ]
Socoró, JC [1 ]
机构
[1] Univ Ramon Llull, Dept Commun & Signal Theory, Barcelona 08022, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the framework of multi-domain Text-to-Speech synthesis it is essential to (i) design a hierarchically structured database for allowing several domains in the same speech corpus and (ii) include a text classification module that, at run time, assigns the input sentences to a domain or set of domains from the database. In this paper, we present a hierarchical text classifier based on Independent Component Analysis (ICA), which is capable of (i) organizing the contents of the corpus in a hierarchical manner and (ii) classifying the texts to be synthesized according to the learned structure. The document organization and classification performance of our ICA-based hierarchical classifier are evaluated in several encouraging experiments conducted on a journalistic-style text corpus for speech synthesis in Catalan.
引用
收藏
页码:697 / 700
页数:4
相关论文
共 50 条
  • [21] Margin Discrepancy-Based Adversarial Training for Multi-Domain Text Classification
    Wu, Yuan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 170 - 182
  • [22] MAXIMUM BATCH FROBENIUS NORM FOR MULTI-DOMAIN TEXT CLASSIFICATION
    Wu, Yuan
    Inkpen, Diana
    El-Roby, Ahmed
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3763 - 3767
  • [23] Learning Multi-Domain Adversarial Neural Networks for Text Classification
    Ding, Xiao
    Shi, Qiankun
    Cai, Bibo
    Liu, Ting
    Zhao, Yanyan
    Ye, Qiang
    IEEE ACCESS, 2019, 7 : 40323 - 40332
  • [24] A ROBUST CONTRASTIVE ALIGNMENT METHOD FOR MULTI-DOMAIN TEXT CLASSIFICATION
    Li, Xuefeng
    Lei, Hao
    Wang, Liwen
    Dong, Guanting
    Zhao, Jinzheng
    Liu, Jiachi
    Xu, Weiran
    Zhang, Chunyun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7827 - 7831
  • [25] MIXUP REGULARIZED ADVERSARIAL NETWORKS FOR MULTI-DOMAIN TEXT CLASSIFICATION
    Wu, Yuan
    Inkpen, Diana
    El-Roby, Ahmed
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7733 - 7737
  • [26] Prosody model in a Mandarin Text-to-Speech System based on a hierarchical approach
    Pan, NH
    Jen, WT
    Yu, SS
    Yu, MS
    Huang, SY
    Wu, MJ
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 448 - 451
  • [27] Residual-based speech modification algorithms for text-to-speech synthesis
    Edgington, M
    Lowry, A
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1425 - 1428
  • [28] Language modeling for multi-domain speech-driven text retrieval
    Itou, K
    Fujii, A
    Ishikawa, T
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 327 - 330
  • [29] CLUSTERING OF DURATION PATTERNS IN SPEECH FOR TEXT-TO-SPEECH SYNTHESIS
    Sreelekshmi, K. S.
    Gopinath, Deepa P.
    2012 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2012, : 1122 - 1127
  • [30] Slovenian Text-to-Speech Synthesis for Speech User Interfaces
    Gros, Jerneja Zganec
    Mihelic, Ales
    Pavesic, Nikola
    Zganec, Mario
    Gruden, Stanislav
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 5, 2005, 5 : 216 - 220