Research on Automatic Chinese Multi-word Term Extraction Based on Term Component

被引:0
|
作者
Kang, Wei [1 ]
Sui, Zhifang [1 ]
机构
[1] Peking Univ, Inst Computat Linguisitcs, Beijing 100871, Peoples R China
关键词
Chinese terminology; Automatic terminology extraction; Term component; Unithood; Termhood;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an automatic Chinese multi-word term extraction method based on the unithood and the termhood measure. The unithood of the candidate term is measured by the strength of inner unity and marginal variety. Term component is taken into account to estimate the termhood. Inspired by the economical law of term generating, we propose two measures of a candidate term to be a true term: the first measure is based on domain speciality of term, and the second one is based on the similarity between a candidate and a template that contains structured information of terms. Experiments on I.T. domain and Medicine domain show that our method is effective and portable in different domains.
引用
收藏
页码:57 / 67
页数:11
相关论文
共 50 条
  • [1] Research on Automatic Chinese Multi-word Term Extraction Based on Integration of Web Information and Term Component
    Kang, Wei
    Sui, Zhifang
    Liu, Yao
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 3, 2009, : 267 - +
  • [2] Automatic Chinese Multi-Word Term Extraction
    Nari Song
    Feng, Zhiwei
    Kit, Chunyu
    [J]. ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 181 - 184
  • [3] Rule-based Automatic Multi-Word Term Extraction and Lemmatization
    Stankovic, Ranka
    Krstev, Cvetana
    Obradovic, Ivan
    Lazic, Biljana
    Trtovac, Aleksandra
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 507 - 514
  • [4] A multi-word term extraction system
    Chen, Jisong
    Yeh, Chung-Hsing
    Chau, Rowena
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 1160 - 1165
  • [5] Term Extraction For A Single & Multi-Word Based On Islamic Corpus English
    Abduljabbar, Waleed Khalid
    Tomah, Saadiyaa A.
    Ali, Ammar Abdulateef
    [J]. 2018 1ST ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION AND SCIENCES (AICIS 2018), 2018, : 107 - 111
  • [6] A hybrid Approach for Arabic Multi-Word Term Extraction
    Bounhas, Ibrahim
    Slimani, Yahya
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 429 - 436
  • [7] A multi-word term extraction program for Arabic language
    Boulaknadel, Siham
    Daille, Beatrice
    Aboutajdine, Driss
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1485 - 1488
  • [8] A Contrastive Approach to Multi-word Term Extraction from Domain Corpora
    Bonin, Francesca
    Dell'Orletta, Felice
    Venturi, Giulia
    Montemagni, Simonetta
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [9] A Study on Multi-word Extraction from Chinese Documents
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    [J]. ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, 2008, 4977 : 42 - +
  • [10] The Oil Field Multi-word Term Recognition Based on Hybrid Strategy
    Liang, Ying-hong
    Liang, Ying-hong
    Li, Jin-xiang
    Xian, Xue-feng
    Chen, Ke
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2013), 2013, : 395 - 398