An example-based study on Chinese word segmentation using critical fragments

被引:0
|
作者
Hu, QA [1 ]
Pan, HH [1 ]
Kit, C [1 ]
机构
[1] City Univ Hong Kong, Dept Chinese Translat & Linguist, Hong Kong, Peoples R China
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our study, sentences are represented as sequences of critical fragments, and critical fragments with more than one distinct resolution found in the training corpus are considered as being ambiguous. Different from other studies, the ambiguous critical fragments are disambiguated using an example-based system(1) in our study. The contexts, i.e. the adjacent characters, words and critical fragments, on either side of an ambiguous critical fragment, are used to measure the distance between training and testing examples. Two kinds of measures, overlap metric and chi-squared feature weighting, are employed, and our system achieves a precision of 93.65% and a recall of 96.56% in the open test.
引用
收藏
页码:714 / 722
页数:9
相关论文
共 50 条
  • [41] Example-Based Query Analysis Using Functional Conceptual Graphs
    Liu, Hui
    Chen, Yuquan
    ACTIVE MEDIA TECHNOLOGY, PROCEEDINGS, 2009, 5820 : 136 - +
  • [42] Creating various styles of animations using example-based filtering
    Hashimoto, R
    Johan, H
    Nishita, T
    COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 2003, : 312 - +
  • [43] A compression-based algorithm for Chinese word segmentation
    Teahan, WJ
    Wen, YY
    McNab, R
    Witten, IH
    COMPUTATIONAL LINGUISTICS, 2000, 26 (03) : 375 - 393
  • [44] An Optimization Algorithm of Chinese Word Segmentation Based on Dictionary
    Tang, Jun
    Wu, Qing
    Li, Yinghong
    2015 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2015, : 259 - 262
  • [45] Subjective Testing System Based on Chinese Word Segmentation]
    Jiao Cui-Ling
    Wang Jian-Ping
    INFORMATION COMPUTING AND APPLICATIONS, PT 1, 2012, 307 : 756 - +
  • [46] Chinese Word Segmentation Based on Improved Double Hashtable
    Shao, Hong
    Sun, Huayu
    Cui, Wencheng
    FIFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2012): COMPUTER VISION, IMAGE ANALYSIS AND PROCESSING, 2013, 8783
  • [47] Chinese integral word segmentation recognition based on reliability
    School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China
    不详
    不详
    J. Inf. Comput. Sci., 2009, 1 (533-542):
  • [48] Capsules Based Chinese Word Segmentation for Ancient Chinese Medical Books
    Li, Si
    Li, Mingzheng
    Xu, Yajing
    Bao, Zuyi
    Fu, Lu
    Zhu, Yan
    IEEE ACCESS, 2018, 6 : 70874 - 70883
  • [49] Context Information and Fragments Based Cross-Domain Word Segmentation
    Huang Degen
    Tong Deqin
    CHINA COMMUNICATIONS, 2012, 9 (03) : 49 - 57
  • [50] Cross-domain Chinese Word Segmentation Based on New Word Discovery
    Zhang Jun
    Lai Zhipeng
    Li Xue
    Ning Gengxin
    Yang Cui
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (09) : 3241 - 3248