Real-Time Synchronization of Live Speech with Its Transcription

Authors
Lertwongkhanakool, Nat [1]
Punyabukkana, Proadpran [1]
Suchato, Atiwong [1]
Affiliations
[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Spoken Language Syst Res Grp, Bangkok, Thailand
Keywords
Automatic speech-text synchronization; Syllable detection; Real-time alignment; Live speech and transcription alignment; Endpoint detection
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Most research on the synchronization of audio and text has focused on synchronization at the utterance level. However, generating audio books from live speech in an unstructured language such as Thai requires a finer level of synchronization. We propose an algorithm that synchronizes live speech with its corresponding transcription in real time at the syllable level. The proposed algorithm employs a syllable endpoint detection method and a syllable landmark detection method, with band-limited intensity as features. The experiment was conducted on the LOTUS datasets, and the results were compared with a baseline ASR-based syllable detection method. We evaluated the algorithm by its error aberration, defined as the difference between the actual number of syllables and the number of detected syllables in each phrase. The average total error aberration was 11.54 for the proposed method and 34.21 for the baseline, so the proposed method outperformed the baseline on this measure. The reference deviation of our method was also better than that of the baseline.
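The error aberration measure described in the abstract can be illustrated with a minimal sketch. The abstract does not say whether the per-phrase difference is signed or how the total is aggregated, so the code assumes an absolute difference per phrase and a sum over phrases. The function names and the sample syllable counts are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def error_aberration(actual: int, detected: int) -> int:
    """Per-phrase error aberration.

    Assumption: the absolute difference between the actual and the
    detected syllable counts (the abstract only says "difference").
    """
    return abs(actual - detected)


def total_error_aberration(
    actual_counts: Sequence[int], detected_counts: Sequence[int]
) -> int:
    """Sum of per-phrase error aberrations (assumed aggregation)."""
    if len(actual_counts) != len(detected_counts):
        raise ValueError("both sequences must cover the same phrases")
    return sum(
        error_aberration(a, d) for a, d in zip(actual_counts, detected_counts)
    )


if __name__ == "__main__":
    # Hypothetical syllable counts for three phrases, for illustration only.
    actual = [5, 7, 4]
    detected = [5, 6, 6]
    print(total_error_aberration(actual, detected))  # 0 + 1 + 2 = 3
```

Under these assumptions, a lower total means the detector's syllable counts track the transcription more closely, which matches the direction of the comparison reported in the abstract.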
Pages: 5