ICA-based hierarchical text classification for multi-domain text-to-speech synthesis

Cited by: 0

Authors:

Sevillano, X [1]

Alías, F [1]

Socoró, JC [1]

Affiliations:

[1] Univ Ramon Llull, Dept Commun & Signal Theory, Barcelona 08022, Spain

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION | 2004

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the framework of multi-domain Text-to-Speech synthesis it is essential to (i) design a hierarchically structured database for allowing several domains in the same speech corpus and (ii) include a text classification module that, at run time, assigns the input sentences to a domain or set of domains from the database. In this paper, we present a hierarchical text classifier based on Independent Component Analysis (ICA), which is capable of (i) organizing the contents of the corpus in a hierarchical manner and (ii) classifying the texts to be synthesized according to the learned structure. The document organization and classification performance of our ICA-based hierarchical classifier are evaluated in several encouraging experiments conducted on a journalistic-style text corpus for speech synthesis in Catalan.


Pages: 697-700

Page count: 4
