TRAED: Speech audio editing using imperfect transcripts

Authors
Masoodian, Masood [1 ]
Rogers, Bill [1 ]
Ware, David [1 ]
McKoy, Sam [1 ]
Affiliations
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although digital recording of speech is widespread, and an increasing range of applications allow recording and inclusion of speech data in documents, editing and retrieval of speech audio remains generally a challenging task. We have previously developed a speech audio editing and browsing application which utilizes imperfect transcripts of speech as a mechanism for text-based editing and retrieval of speech audio documents. This paper presents a second prototype, called TRAED, which enhances the functionality provided by our earlier prototype, and further facilitates the task of speech audio editing and access.
Pages: 454-459
Number of pages: 6