Using prosody for automatic sentence segmentation of multi-party meetings

Cited by: 0

Authors:

Kolar, Jachym [1]

Shriberg, Elizabeth

Liu, Yang

Affiliations:

[1] Int Comp Sci Inst, Berkeley, CA 94704 USA

[2] Univ W Bohemia, Dept Cybernet, Plzen, Czech Republic

[3] SRI Int, Menlo Pk, CA 94025 USA

[4] Univ Texas Dallas, Dallas, TX 75230 USA

Source:

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2006 / Vol. 4188

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art automatic speech recognizer (ASR). We also compare results using the lexical model plus a pause-only prosody model, versus results using additional prosodic features. Results show that (1) information from pauses is important, including pause duration both at the boundary and at the previous and following word boundaries; (2) adding duration, pitch, and energy features yields significant improvement over pause alone; (3) the integrated boosting-based model performs better than the HMM for ASR conditions; (4) training the boosting-based model on recognized words yields further improvement.

Pages: 629-636

Number of pages: 8
