Phoneme Segmentation using Deep Learning for Speech Synthesis

被引：2

作者：

Lee, Young Han ^{[1
]}

Yang, Jong-Yeol ^{[1
]}

Cho, Choongsang ^{[1
]}

Jung, Hyedong ^{[1
]}

机构：

[1] Korea Elect Technol Inst, Artificial Intelligent Res Ctr, Seongnam, South Korea

来源：

PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018) | 2018年

关键词：

Phoneme segmentation; Speech synthesis; Deep learning;

D O I：

10.1145/3264746.3264801

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we propose the phoneme segmentation method, which is one of the basic module that consist unit-selection-based speech synthesis, using deep learning algorithm. To enhance this, we apply the additional cross entropy loss into the Deep speech based speech recognition architecture. From this approach, we can get higher accuracy of phoneme boundary. In our experiments, the proposed method has 20.91 % boundary accuracy which is higher than the conventional phoneme segmentation.

引用

页码：59 / 61

页数：3

共 50 条

[31] Image Segmentation Using Deep Learning: A Survey
Minaee, Shervin
Boykov, Yuri Y.
Porikli, Fatih
Plaza, Antonio J.
Kehtarnavaz, Nasser
Terzopoulos, Demetri
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3523 - 3542
[32] Iris Segmentation Using Interactive Deep Learning
Sardar, Mousumi
Banerjee, Subhashis
Mitra, Sushmita
IEEE ACCESS, 2020, 8 : 219322 - 219330
[33] Using Phoneme Recognition and Text-dependent Speaker Verification to Improve Speaker Segmentation for Chinese Speech
Wang, Gang
Wu, Xiaojun
Zheng, Thomas Fang
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1457 - 1460
[34] Pediatric Sarcoma Segmentation using Deep Learning
Erum, Louise
Banke, Kirstine
Borgwardt, Lise
Hansen, Adam
Hejgaard, Liselotte
Andersen, Flemming
Ladefoged, Claes
JOURNAL OF NUCLEAR MEDICINE, 2019, 60
[35] SEMANTIC SEGMENTATION OF TEXT USING DEEP LEARNING
Lattisi, Tiziano
Farina, Davide
Ronchetti, Marco
COMPUTING AND INFORMATICS, 2022, 41 (01) : 78 - 97
[36] Aortic Valve Segmentation using Deep Learning
Lai, Khin Wee
Shoaib, Muhammad Ali
Chuah, Joon Huang
Nizar, Muhammad Hanif Ahmad
Anis, Shazia
Ching, Serena Low Woan
2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 528 - 532
[37] Scheimpflug Image Segmentation using Deep Learning
Morley, Dustin
Evans, Mike
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
[38] Speech Emotion Classification Using Deep Learning
Mishra, Siba Prasad
Warule, Pankaj
Deb, Suman
PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 19 - 31
[39] Korean speech recognition using deep learning
Lee, Suji
Han, Seokjin
Park, Sewon
Lee, Kyeongwon
Lee, Jaeyong
KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (02) : 213 - 227
[40] Persian speech recognition using deep learning
Veisi, Hadi
Haji Mani, Armita
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04) : 893 - 905

← 1 2 3 4 5 →