A RULE-BASED METHOD FOR COMMAS' DISAMBIGUATION IN CHINESE PATENT TEXT

被引:0
|
作者
Song, Qianqian [1 ,2 ]
Zhu, Yun [1 ,2 ]
Wang, Lixia [1 ,2 ]
Jin, Yaohong [1 ,2 ]
机构
[1] Beijing Normal Univ, Inst Chinese Informat Proc, Beijing 100875, Peoples R China
[2] Beijing Normal Univ, CPIC BNU Joint Lab Machine Translat, Beijing 100875, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
Rule-based method; Commas' disambiguation; Chinese patent text; MT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We described a rule-based method for disambiguating Chinese commas in patent text, which will be beneficial to the work on Chinese-English Patent MT. We annotated ten thousand sentences of patent text, and made a number of rules according to the annotated results. Experiments were conducted on 5 intact patent documents containing 1219 commas, and our model achieves an accuracy of over 90% overall.
引用
收藏
页码:1506 / 1510
页数:5
相关论文
共 50 条
  • [1] Rule-based Logical Reasoning Knowledge Extraction of Chinese Legislative Text
    Li, Li
    Wang, Houfeng
    Liu, Yang
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 445 - 450
  • [2] Rule-based perspective rectification for Chinese text in natural scene images
    Yang, Xieliu
    Yin, Chenyu
    Tian, Dake
    Liang, Wenfeng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 18243 - 18262
  • [3] A simple rule-based approach to organization name recognition in Chinese text
    Wang, HF
    Shi, WG
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 769 - 772
  • [4] Rule-based perspective rectification for Chinese text in natural scene images
    Xieliu Yang
    Chenyu Yin
    Dake Tian
    Wenfeng Liang
    [J]. Multimedia Tools and Applications, 2021, 80 : 18243 - 18262
  • [5] An Unsupervised Rule-Based Method to Populate Ontologies from Text
    Motta, Eduardo
    Siqueira, Sean
    Andreatta, Alexandre
    [J]. WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2010, 45 : 157 - 169
  • [6] TEXT COMPRESSION AS RULE-BASED PATTERN-RECOGNITION - TEXT COMPRESSION USING RULE-BASED ENCODER - COMMENT
    NGUYEN, K
    [J]. ELECTRONICS LETTERS, 1995, 31 (09) : 701 - 702
  • [7] A Rule-Based Method for Identifying Patterns in Old Chinese Sentences
    Liu, Youran
    Long, Dan
    [J]. CHINESE LEXICAL SEMANTICS, 2014, 8922 : 221 - 230
  • [8] A Rule-Based Method for Chinese Punctuations Processing in Sentences Segmentation
    Wang, Jing
    Zhu, Yun
    Jin, Yaohong
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2014), 2014, : 195 - 198
  • [9] A Rule-Based Method for Text Shortening in Vietnamese Sign Language Translation
    Thi Bich Diep Nguyen
    Trung-Nghia Phung
    Tat-Thang Vu
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 655 - 662
  • [10] Rule-based Text Mining of Traditional Chinese Medicine Patterns with Chinese Herbal Medicines and Formulae on Hypertension
    Zhou, Hongmei
    Yang, Jing
    Guo, Jinrui
    Wang, Yahong
    Zheng, Guang
    Guo, Hongtao
    Tan, Yong
    Ren, Xiaoxia
    Dong, Rongfen
    Zhang, Jinrong
    Cui, Zhaoli
    Lv, Aiping
    Jiang, Miao
    Wang, Yaoxian
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,