Using prosody for automatic sentence segmentation of multi-party meetings

被引:0
|
作者
Kolar, Jachym [1 ]
Shriberg, Elizabeth
Liu, Yang
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
[2] Univ W Bohemia, Dept Cybernet, Plzen, Czech Republic
[3] SRI Int, Menlo Pk, CA 94025 USA
[4] Univ Texas Dallas, Dallas, TX 75230 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art automatic speech recognizer (ASR). We also compare results using the lexical model plus a pause-only prosody model, versus results using additional prosodic features. Results show that (1) information from pauses is important, including pause duration both at the boundary and at the previous and following word boundaries; (2) adding duration, pitch, and energy features yields significant improvement over pause alone; (3) the integrated boosting-based model performs better than the HMM for ASR conditions; (4) training the boosting-based model on recognized words yields further improvement.
引用
收藏
页码:629 / 636
页数:8
相关论文
共 50 条
  • [21] Threshold quantum secret sharing between multi-party and multi-party
    YuGuang Yang
    QiaoYan Wen
    Science in China Series G: Physics, Mechanics and Astronomy, 2008, 51 : 1308 - 1315
  • [22] Multi-Party Campaigning
    Koutecky, Martin
    Talmon, Nimrod
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5506 - 5513
  • [23] Chronemics of multi-party videoconference meetings. Notes on silence during the preparation phase
    Munoz, Arantxa Santos
    LENGUA Y HABLA, 2015, 19 : 56 - 76
  • [24] Predicting Next Speaker and Timing from Gaze Transition Patterns in Multi-Party Meetings
    Ishii, Ryo
    Otsuka, Kazuhiro
    Kumano, Shiro
    Matsuda, Masafumi
    Yamato, Junji
    ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 79 - 86
  • [25] CostCO: An automatic cost modeling framework for secure multi-party computation
    Fang, Vivian
    Brown, Lloyd
    Lin, William
    Zheng, Wenting
    Panda, Aurojit
    Popa, Raluca Ada
    2022 IEEE 7TH EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2022), 2022, : 140 - 153
  • [26] Optimally Efficient Multi-party Fair Exchange and Fair Secure Multi-party Computation
    Alper, Handan Kilinc
    Kupcu, Alptekin
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2022, 25 (01)
  • [27] Quantum secure multi-party computational geometry based on multi-party summation and multiplication
    Dou, Zhao
    Wang, Yifei
    Liu, Zhaoqian
    Bi, Jingguo
    Chen, Xiubo
    Li, Lixiang
    QUANTUM SCIENCE AND TECHNOLOGY, 2024, 9 (02)
  • [28] Dynamic Multi-Party to Multi-Party Quantum Secret Sharing based on Bell States
    Tian, Yuan
    Wang, Jialong
    Bian, Genqing
    Chang, Jinyong
    Li, Jian
    ADVANCED QUANTUM TECHNOLOGIES, 2024, 7 (07)
  • [29] Multi-Party Set Reconciliation Using Characteristic Polynomials
    Boral, Anudhyan
    Mitzenmacher, Michael
    2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 1182 - 1187
  • [30] Secure Multi-Party Computation Using Polarizing Cards
    Shinagawa, Kazumasa
    Mizuki, Takaaki
    Schuldt, Jacob
    Nuida, Koji
    Kanayama, Naoki
    Nishide, Takashi
    Hanaoka, Goichiro
    Okamoto, Eiji
    ADVANCES IN INFORMATION AND COMPUTER SECURITY (IWSEC 2015), 2015, 9241 : 281 - 297