Using directed graph based BDMM algorithm for Chinese word segmentation

被引:0
|
作者
Chen, YD [1 ]
Wang, T [1 ]
Chen, HW [1 ]
机构
[1] Natl Lab Parallel & Distributed Proc, Changsha 410073, Hunan, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word segmentation is a key problem for Chinese text analysis. In this paper, with the consideration of both word-coverage rate and sentence-coverage rate, based on the classic Bi-Directed Maximum Match (BDMM) segmentation method, a character Directed Graph with ambiguity mark is designed for searching multiple possible segmentation sequences. This method is compared with the classic Maximum Match algorithm and Omni-segmentation algorithm. The experiment result shows that Directed Graph based BDMM algorithm can achieve higher coverage rate and lower complexity.
引用
收藏
页码:214 / 217
页数:4
相关论文
共 50 条
  • [1] A compression-based algorithm for Chinese word segmentation
    Teahan, WJ
    Wen, YY
    McNab, R
    Witten, IH
    [J]. COMPUTATIONAL LINGUISTICS, 2000, 26 (03) : 375 - 393
  • [2] An Optimization Algorithm of Chinese Word Segmentation Based on Dictionary
    Tang, Jun
    Wu, Qing
    Li, Yinghong
    [J]. 2015 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2015, : 259 - 262
  • [3] Models and algorithm of Chinese word segmentation
    Wang, X
    Fu, G
    Yeung, DS
    Liu, JNK
    Luk, R
    [J]. IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 1279 - 1284
  • [4] Research of Chinese Word Knowledge Graph Based on SLPA Algorithm
    Qian, Xicheng
    Hu, Yuyang
    Pan, Jing-Chang
    [J]. 3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS, (ISMII 2017), 2017, : 84 - 88
  • [5] An ambiguity discovery algorithm on Chinese word segmentation based dictionary
    Sun, Tieli
    Liu, Yanji
    Yang, Lehua
    Li, Zhiying
    Liu, Zhenghong
    [J]. PROCEEDINGS OF THE 2009 SECOND PACIFIC-ASIA CONFERENCE ON WEB MINING AND WEB-BASED APPLICATION, 2009, : 39 - 42
  • [6] Research on Chinese Word Segmentation Algorithm Based on Special Identifiers
    Qun, Zhang
    Yu, Cheng
    [J]. COMPUTING AND INTELLIGENT SYSTEMS, PT III, 2011, 233 : 377 - 385
  • [7] Research on Chinese Word Segmentation Algorithm Based on Special Identifiers
    Zhang Qun
    Shen Haibo
    [J]. 2010 SECOND INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS, AND E-GOVERNMENT (EEEE 2010), VOL I, 2010, : 277 - 280
  • [8] A rule-based Chinese-word segmentation algorithm
    Fu, Shiguang
    Lin, Youfang
    [J]. RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 159 - 162
  • [9] A Graph-based Model for Joint Chinese Word Segmentation and Dependency Parsing
    Yan, Hang
    Qiu, Xipeng
    Huang, Xuanjing
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 78 - 92
  • [10] Improved fast algorithm for Chinese word segmentation
    Chen, Guilin
    Wang, Yongcheng
    Han, Kesong
    Wang, Gang
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (04): : 418 - 424