A Hybrid strategy for Chinese Domain-Specific terminology Extraction

被引:0
|
作者
Zhan, Qiang [1 ,2 ]
Wang, Chunhong
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Yuncheng Univ, Dept Comp Sci, Yuncheng, Shanxi, Peoples R China
关键词
nature Language Processing; term extraction; information extraction; conditional random fields;
D O I
10.1109/SKG.2015.39
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Term Extraction is an important issue in Natural Language Processing. This paper presents a new approach of terminology extraction combining with machine learning based on cascaded conditional random fields and corpus-based statistical model. In this approach, firstly, the low-layer and high-layer conditional random fields (CRFs) are used to extract the simple and compound terminologies respectively. Then, Domain Relevance (DR) and Domain Consensus (DC) degrees are calculated to acquire the final domain terminologies. Experimental results show that the precision, recall and F-score are 83.29%, 80.75%, 82.01% respectively. The comparison with CRFs and MI+T-value shows that the proposed method for extracting terminology is effective.
引用
收藏
页码:217 / 221
页数:5
相关论文
共 50 条
  • [1] A Hybrid strategy for Chinese Domain-Specific terminology Extraction
    Zhan, Qiang
    Wang, Chunhong
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [2] Arabic Terminology Extraction and Enrichment based on Domain-Specific Text Mining
    Lahbib, Wiem
    Bounhas, Ibrahim
    Slimani, Yahya
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 340 - 347
  • [3] Domain-specific keyphrase extraction
    Frank, E
    Paynter, GW
    Witten, IH
    Gutwin, C
    Nevill-Manning, CG
    [J]. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 668 - 673
  • [4] Domain-Specific Paraphrase Extraction
    Pavlick, Ellie
    Ganitkevitch, Juri
    Chan, Tsz Ping
    Yao, Xuchen
    Van Durme, Benjamin
    Callison-Burch, Chris
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 57 - 62
  • [5] HYBRID DOMAIN-SPECIFIC KITS
    GRISS, ML
    WENTZEL, KD
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 1995, 30 (03) : 213 - 230
  • [6] Domain-specific Chinese term extraction via word segmentation optimization
    Wei, Chuyuan
    Li, Fangfang
    Zhan, Qiang
    Zhang, Dakui
    Mao, Yu
    [J]. Journal of Information and Computational Science, 2015, 12 (17): : 6477 - 6490
  • [7] Domain-specific information extraction structures
    Lyons, S
    Smith, D
    [J]. 13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 80 - 84
  • [8] Prioritization of Domain-Specific Web Information Extraction
    Huang, Jian
    Yu, Cong
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1327 - 1333
  • [9] Software Keyphrase Extraction with Domain-specific Features
    Karnalim, Oscar
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2016, : 43 - 50
  • [10] Domain-specific optimization strategy for skeleton programs
    Emoto, Kento
    Matsuzaki, Kiminori
    Hu, Zhenjiang
    Takeichi, Masato
    [J]. EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 705 - +