TRAED: Speech audio editing using imperfect transcripts

被引：0

作者：

Masoodian, Masood ^{[1
]}

Rogers, Bill ^{[1
]}

Ware, David ^{[1
]}

McKoy, Sam ^{[1
]}

机构：

[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand

来源：

12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS | 2006年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although digital recording, of speech is widespread, and an increasing range of applications allow recording and inclusion of speech data in documents, editing mid retrievol of speech audio remains generally a challenging task. We have previously developed a speech audio editing and browsing application which utilizes imperfect transcripts of speech os a mechanism for text-based editing and retrieval of speech audio documents. This paper presents a second prototype, called TRAED, which enhances the functionality provided by our earlier prototype, and further facilitates the task of speech audio editing and access.

引用

页码：454 / 459

页数：6

共 50 条

[21] A simple in vitro RNA editing assay for chloroplast transcripts using fluorescent dideoxynucleotides:: distinct types of sequence elements required for editing of ndh transcripts
Sasaki, Tadamasa
Yukawa, Yasushi
Wakasugi, Tatsuya
Yamada, Kyoji
Sugiura, Masahiro
PLANT JOURNAL, 2006, 47 (05): : 802 - 810
[22] Detecting dementia from speech and transcripts using transformers
Ilias, Loukas
Askounis, Dimitris
Psarras, John
COMPUTER SPEECH AND LANGUAGE, 2023, 79
[23] Noisy audio feature enhancement using audio-visual speech data
Goecke, R
Potamianos, G
Neti, C
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2025 - 2028
[24] ELECTRONIC AUDIO EDITING BY SIGHT
FALCONE, PF
SMPTE JOURNAL, 1979, 88 (01): : 32 - 32
[25] AUDIO EDITING - ANALOG TECHNIQUES
RELICH, N
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (7-8): : 591 - 591
[26] MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT
Yoon, Seunghyun
Byun, Seokhyun
Jung, Kyomin
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 112 - 118
[27] Audio-Speech Watermarking Using a Channel Equalizer
Shokri, Shervin
Ismail, Mahamod
Zainal, Nasharuddin
Moghaddasi, Majid
WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (04) : 4457 - 4476
[28] AUDIO SEGMENTATION FOR SPEECH RECOGNITION USING SEGMENT FEATURES
Rybach, David
Gollan, Christian
Schlueter, Ralf
Ney, Hermann
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4197 - 4200
[29] Using spatial audio cues from speech excitation for meeting speech segmentation
Cheng, Eva
Burnett, Ian
Ritz, Christian
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 3067 - +
[30] Automatic speech recognition using audio visual cues
Yashwanth, H
Mahendrakar, H
David, S
PROCEEDINGS OF THE IEEE INDICON 2004, 2004, : 166 - 169

← 1 2 3 4 5 →