ICA-based hierarchical text classification for multi-domain text-to-speech synthesis

被引:0
|
作者
Sevillano, X [1 ]
Alías, F [1 ]
Socoró, JC [1 ]
机构
[1] Univ Ramon Llull, Dept Commun & Signal Theory, Barcelona 08022, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the framework of multi-domain Text-to-Speech synthesis it is essential to (i) design a hierarchically structured database for allowing several domains in the same speech corpus and (ii) include a text classification module that, at run time, assigns the input sentences to a domain or set of domains from the database. In this paper, we present a hierarchical text classifier based on Independent Component Analysis (ICA), which is capable of (i) organizing the contents of the corpus in a hierarchical manner and (ii) classifying the texts to be synthesized according to the learned structure. The document organization and classification performance of our ICA-based hierarchical classifier are evaluated in several encouraging experiments conducted on a journalistic-style text corpus for speech synthesis in Catalan.
引用
收藏
页码:697 / 700
页数:4
相关论文
共 50 条
  • [31] A Multi-domain Text Classification Method Based on Recurrent Convolution Multi-task Learning
    Xie Jinbao
    Li Jiahui
    Kang Shouqiang
    Wang Qingyan
    Wang Yujing
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (08) : 2395 - 2403
  • [32] DaCon: Multi-Domain Text Classification Using Domain Adversarial Contrastive Learning
    Dai, Yingjun
    El-Roby, Ahmed
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 40 - 52
  • [33] Corpus-based Malay Text-to-Speech Synthesis System
    Swee, Tan Tian
    Salleh, Sheikh Hussain Shaikh
    2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, (APCC), VOLS 1 AND 2, 2008, : 52 - 56
  • [34] [Invited] Generative Model-Based Text-to-Speech Synthesis
    Zen, Heiga
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 327 - 328
  • [35] A RULE BASED PROSODY MODEL FOR TURKISH TEXT-TO-SPEECH SYNTHESIS
    Uslu, Ibrahim Baran
    Ilk, Hakki Gokhan
    Yilmaz, Asim Egemen
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2013, 20 (02): : 217 - 223
  • [36] Continuity Metric for Unit Selection based Text-to-Speech Synthesis
    Lakkavalli, Vikram Ramesh
    Arulmozhi, P.
    Ramakrishnan, A. G.
    2010 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2010,
  • [37] An enhanced text analysis approach in text-to-speech synthesis for mandarin chinese
    Jiang, Wei
    Wang, Xiao-Long
    Guan, Yi
    Pang, Xiu-Li
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2007, : 410 - +
  • [38] HiFi-GAN based Text-to-Speech Synthesis in Serbian
    Suzic, Sinisa
    Pekar, Darko
    Secujski, Milan
    Nosek, Tijana
    Delic, Vlado
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 2231 - 2235
  • [39] HiFi-GAN based Text-to-Speech Synthesis in Serbian
    Suzic, Sinisa
    Pekar, Darko
    Secujski, Milan
    Nosek, Tijana
    Delic, Vlado
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1178 - 1182
  • [40] Rule-Based Storytelling Text-to-Speech (TTS) Synthesis
    Ramli, Izzad
    Seman, Noraini
    Ardi, Norizah
    Jamil, Nursuriati
    2016 3RD INTERNATIONAL CONFERENCE ON MECHANICS AND MECHATRONICS RESEARCH (ICMMR 2016), 2016, 77