Using n-gram method in the decomposition of compound medical diagnoses

被引:3
|
作者
Héja, G
Surján, G
机构
[1] Budapest Univ Technol & Econ, Dept Measurement & Informat Syst, H-1521 Budapest, Hungary
[2] Natl Inst & Lib Hlth Informat, Budapest, Hungary
关键词
computing methodologies; automatic data; processing terminology; medical records; documentation; natural language processing;
D O I
10.1016/S1386-5056(03)00049-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Our goal in this study was to find an easy to implement method to detect compound medical diagnosis in Hungarian medical language and decompose them into expressions referring to a single disease. Methods: A corpus of clinical diagnoses extracted form discharge reports (3079 expressions, each of them referring to only one disease) was represented in an n-gram tree (a series of n consecutive word). A matching algorithm was implemented in a software, which is able to identify sensible n-grams existing both in test expressions and in the n-gram tree. A test sample of another 92 diagnoses was decomposed by two independent humans and by the software. The decompositions were compared with measure the recall and the precision of the method. Results: There was not full agreement between the decompositions of the humans, (which underlines the relevance of the problem). A consensus was arrived in all disagreed point by a third opinion and open discussion. The resulting decomposition was used as a gold standard and compared with the decomposition produced by the computer. The recall was 82.6% the precision 37.2%. After correction of spelling errors in the test sample the recall increased to 88.6% while the precision slightly decreased to 36.7%. Conclusion: The proposed method seems to be useful in decomposition of compound diagnostic expressions and can improve quality of diagnostic coding of clinical cases. Other statistical methods (like vector space methods or neural networks) usually offer a ranked list of candidate codes either for single or compound expressions, and do not warn the user how many codes should be chosen. We propose our method especially in a situation where formal NLP techniques are not available, as it is the case with scarcely spoken languages like Hungarian. (C) 2003 Elsevier Science Ireland Ltd. All rights reserved.
引用
收藏
页码:229 / 236
页数:8
相关论文
共 50 条
  • [1] Using n-gram method in the decomposition of compound medical diagnoses
    Héja, G
    Surján, G
    [J]. HEALTH DATA IN THE INFORMATION SOCIETY, 2002, 90 : 455 - 459
  • [2] Arabic supervised learning method using N-gram
    Sanan, Majed
    Rammal, Mahmoud
    Zreik, Khaldoun
    [J]. INTERACTIVE TECHNOLOGY AND SMART EDUCATION, 2008, 5 (03) : 157 - +
  • [3] Malayalam Spell Checker Using N-Gram Method
    Hema, P. H.
    Sunitha, C.
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 : 217 - 225
  • [4] An efficient document retrieval method using n-gram indexing
    Ogawa, Yasushi
    Matsuda, Toru
    [J]. Systems and Computers in Japan, 2002, 33 (02) : 54 - 63
  • [5] Analysis of Historical Medical Phenomena Using Large N-Gram Corpora
    Kasac, Zdenko
    Schulz, Stefan
    [J]. MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 437 - 441
  • [6] Classification of facemarks using N-gram
    Yamada, Thichi
    Tsuchiya, Seiji
    Kuroiwa, Shiongo
    Ren, Fuji
    [J]. PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 322 - +
  • [7] Improving arabic information retrieval system using n-gram method
    Legal Informatics center, Lebanese University, Sami Solh Street-Bp5396/116, Lebanon
    不详
    不详
    [J]. WSEAS Trans. Comput., 1600, 4 (125-133):
  • [8] Evaluation of action prediction method using inductive learning with N-gram
    Xu, JA
    Itoh, T
    Araki, K
    Tochinai, K
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1605 - 1609
  • [9] Syntactic and semantic disambiguation of numeral strings using an n-gram method
    Min, KH
    Wilson, WH
    Moon, YJ
    [J]. AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 82 - 91
  • [10] N-gram Insight
    Prans, George
    [J]. AMERICAN SCIENTIST, 2011, 99 (05) : 356 - 357