SENTENCE BOUNDARY DETECTION IN CHINESE BROADCAST NEWS USING CONDITIONAL RANDOM FIELDS AND PROSODIC FEATURES

被引:0
|
作者
Xu, Chenglin [1 ]
Xie, Lei [1 ]
Fu, Zhonghua [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian, Peoples R China
关键词
sentence boundary detection; sentence segmentation; speech prosody; feature selection; conditional random field; SPEECH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the use of condition random fields (CRF) and prosodic features for sentence boundary detection in Chinese broadcast news. Previous approaches mostly use first-order CRF and ignore the important context and sequential information. In this paper, we explore high-order CRF models to fully make use of the contextual and sequential information. Moreover, we show the effectiveness of CRF in sentence boundary detection by comparing it with various competitive models. The prosodic feature set is usually designed to be as exhaustive as possible in previous approaches. As a result, features may be highly correlated and some of them may be not effective. In this paper, we use a correlation-based feature selection method to select a subset with the most useful features. Finally, the use of the prosodic features, e.g., pitch, in Chinese sentence segmentation deserves further investigation because the tonal aspect of Chinese may complicate the expressions of pitch features. In this paper, we study the effectiveness of the prosodic features and rank their importance by an analysis of feature usage.
引用
收藏
页码:37 / 41
页数:5
相关论文
共 50 条
  • [1] Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features
    Wang, Xiaoxuan
    Xie, Lei
    Lu, Mimi
    Ma, Bin
    Chng, Eng Siong
    Li, Haizhou
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05) : 1206 - 1215
  • [2] PROSODY-BASED SENTENCE BOUNDARY DETECTION IN CHINESE BROADCAST NEWS
    Xie, Lei
    Xu, Chenglin
    Wang, Xiaoxuan
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 261 - 265
  • [3] News monologue shot detection using conditional random fields
    Ji, Zhong
    Su, Yu-Ting
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 2657 - 2661
  • [4] Extracting the Prosodic Information for Turkish Broadcast News Data and Using on the Sentence Segmentation Task
    Dalva, Dogan
    Revidi, Izel D.
    Guz, Umit
    Gurkan, Hakan
    [J]. 2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1810 - 1813
  • [5] Dynamic Conditional Random Fields for Joint Sentence Boundary and Punctuation Prediction
    Wang, Xuancong
    Ng, Hwee Tou
    Sim, Khe Chai
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1382 - 1385
  • [6] Semi Supervised Learning for Prediction of Prosodic Phrase Boundaries in Chinese TTS Using Conditional Random Fields
    Zhao, Ziping
    Ma, Xirong
    Pei, Weidong
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 477 - 485
  • [7] Chinese Negation and Speculation Detection with Conditional Random Fields
    Chen, Zhancheng
    Zou, Bowei
    Zhu, Qiaoming
    Li, Peifeng
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 30 - 40
  • [8] Discriminative sentence compression with conditional random fields
    Nomoto, Tadashi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1571 - 1587
  • [9] Voice activity detection based on conditional random fields using multiple features
    Saito, Akira
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2086 - 2089
  • [10] Active Learning for the Prediction of Prosodic Phrase Boundaries in Chinese Speech Synthesis Systems Using Conditional Random Fields
    Zhao, Ziping
    Ma, Xirong
    [J]. 2015 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2015, : 199 - 203