Language modeling and transcription of the TED corpus lectures

Cited by: 0

Authors:

Leeuwis, E [1]

Federico, M [1]

Cettolo, M [1]

Affiliations:

[1] Univ Twente, Dept Comp Sci, NL-7500 AE Enschede, Netherlands

Source:

2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003

Keywords:

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Transcribing lectures is a challenging task, both in acoustic and in language modeling. In this work, we present our first results on the automatic transcription of lectures from the TED corpus, recently released by ELRA and LDC. In particular, we concentrated our effort on language modeling. Baseline acoustic and language models were developed using respectively 8 hours of TED transcripts and various types of texts: conference proceedings, lecture transcripts, and conversational speech transcripts. Then, adaptation of the language model to single speakers was investigated by exploiting different kinds of information: automatic transcripts of the talk, the title of the talk, the abstract and, finally, the paper. In the last case, a 39.2% WER was achieved.
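The WER (word error rate) figure quoted in the abstract is conventionally computed as the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The following is a minimal sketch of that standard metric, assuming plain whitespace tokenization and no text normalization; the function name `word_error_rate` is illustrative, and the scoring setup actually used by the authors is not described in this record.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.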


Pages: 232-235

Page count: 4
