Dynamic Conditional Random Fields for Joint Sentence Boundary and Punctuation Prediction

被引:0
|
作者
Wang, Xuancong
Ng, Hwee Tou
Sim, Khe Chai
机构
关键词
punctuation; dynamic conditional random fields; sentence boundary detection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of dynamic conditional random fields (DCRF) has been shown to outperform linear-chain conditional random fields (LCRF) for punctuation prediction on conversational speech texts [1]. In this paper, we combine lexical, prosodic, and modified n-gram score features into the DCRF framework for a joint sentence boundary and punctuation prediction task on TDT3 English broadcast news. We show that the joint prediction method outperforms the conventional two-stage method using LCRF or maximum entropy model (MaxEnt). We show the importance of various features using DCRF, LCRF, MaxEnt, and hidden-event n-gram model (HEN) respectively. In addition, we address the practical issue of feature explosion by introducing lexical pruning, which reduces model size and improves the F1-measure. We adopt incremental local training to overcome memory size limitation without incurring significant performance penalty. Our results show that adding prosodic and n-gram score features gives about 20% relative error reduction in all cases. Overall, DCRF gives the best accuracy, followed by LCRF, MaxEnt, and HEN.
引用
收藏
页码:1382 / 1385
页数:4
相关论文
共 50 条
  • [1] Punctuation Prediction for Vietnamese Texts Using Conditional Random Fields
    Pham, Quang H.
    Nguyen, Binh T.
    Nguyen Viet Cuong
    [J]. SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 322 - 327
  • [2] Discriminative sentence compression with conditional random fields
    Nomoto, Tadashi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1571 - 1587
  • [3] SENTENCE BOUNDARY DETECTION IN CHINESE BROADCAST NEWS USING CONDITIONAL RANDOM FIELDS AND PROSODIC FEATURES
    Xu, Chenglin
    Xie, Lei
    Fu, Zhonghua
    [J]. 2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 37 - 41
  • [4] Investigating syllabic prominence with Conditional Random Fields and Latent-Dynamic Conditional Random Fields
    Cutugno, Francesco
    Leone, Enrico
    Ludusan, Bogdan
    Origlia, Antonio
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2399 - 2402
  • [5] Conditional random fields for transmembrane helix prediction
    Lukov, L
    Chawla, S
    Church, WB
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 155 - 161
  • [6] Clause boundary identification using conditional random fields
    Ram, R. Vijay Sundar
    Devi, Sobha Lalitha
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 140 - 150
  • [7] Conrad: Gene prediction using conditional random fields
    DeCaprio, David
    Vinson, Jade P.
    Pearson, Matthew D.
    Montgomery, Philip
    Doherty, Matthew
    Galagan, James E.
    [J]. GENOME RESEARCH, 2007, 17 (09) : 1389 - 1398
  • [8] Web Page Prediction Based on Conditional Random Fields
    Guo, Yong Zhen
    Ramamohanarao, Kotagiri
    Park, Laurence A. F.
    [J]. ECAI 2008, PROCEEDINGS, 2008, 178 : 251 - +
  • [9] Context Based Pedestrian Intention Prediction using Factored Latent Dynamic Conditional Random Fields
    Neogi, Satyajit
    Hoy, Michael
    Weng Chaoqun
    Dauwels, Justin
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,
  • [10] Background Extraction Based on Joint Gaussian Conditional Random Fields
    Wang, Hong-Cyuan
    Lai, Yu-Chi
    Cheng, Wen-Huang
    Cheng, Chin-Yun
    Hua, Kai-Lung
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (11) : 3127 - 3140