Text mining at the term level

被引:0
|
作者
Feldman, R [1 ]
Fresko, M
Kinar, Y
Lindell, Y
Liphstat, O
Rajman, M
Schler, Y
Zamir, O
机构
[1] Bar Ilan Univ, Dept Math & Comp Sci, Ramat Gan, Israel
[2] Swiss Fed Inst Technol, Artificial Intelligence Lab, CH-1015 Lausanne, Switzerland
[3] Univ Washington, Dept Comp Sci, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge Discovery in Databases (KDD) focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns within them. While most work on KDD has been concerned with structured databases, there has been little work on handling the huge amount of information that is available only in unstructured textual form. Previous work in text mining focused at the word or the tag level. This paper presents an approach to performing text mining at the term level. The mining process starts by preprocessing the document collection and extracting terms from the documents. Each document is then represented by a set of terms and annotations characterizing the document. Terms and additional higher-level entities are then organized in a hierarchical taxonomy. In this paper we will describe the Term Extraction module of the Document Explorer system, and provide experimental evaluation performed on a set of 52,000 documents published by Reuters in the years 1995-1996.
引用
收藏
页码:65 / 73
页数:9
相关论文
共 50 条
  • [1] Text mining at detail level using conceptual graphs
    Montes-y-Gómez, M
    Gelbukh, A
    López-López, A
    [J]. CONCEPTUAL STRUCTURES: INTEGRATION AND INTERFACES, PROCEEDINGS, 2002, 2393 : 122 - 136
  • [2] Speculative text mining for document-level sentiment classification
    Wen, Jiahui
    Zhang, Guangda
    Zhang, Hongyun
    Yin, Wei
    Ma, Jingwei
    [J]. NEUROCOMPUTING, 2020, 412 : 52 - 62
  • [3] Constructing Term Thesaurus using Text Association Rule Mining
    Kongthon, Alisa
    Haruechaiyasak, Choochart
    Thaiprayoon, Santipong
    [J]. ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 137 - 140
  • [4] Mining term networks from text collections for crime investigation
    Tseng, Yuen-Hsien
    Ho, Zih-Ping
    Yang, Kai-Sheng
    Chen, Chun-Cheng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (11) : 10082 - 10090
  • [5] Text mining in fisheries scientific literature: A term coding approach
    Fytilakos, Ioannis
    [J]. ECOLOGICAL INFORMATICS, 2021, 61
  • [6] Text Mining for Procedure-Level Primitives in Human Reliability Analysis
    Ewing, Sarah M.
    Boring, Ronald L.
    Rasmussen, Martin
    Ulrich, Thomas
    [J]. ADVANCES IN HUMAN ERROR, RELIABILITY, RESILIENCE, AND PERFORMANCE, 2018, 589 : 239 - 249
  • [7] Intelligent Prediction of Vulnerability Severity level Based on Text Mining and XGBoost
    Wang, Peichao
    Zhou, Yun
    Sun, Baodan
    Zhang, Weiming
    [J]. 2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 72 - 77
  • [8] Multi-level text mining method to extract biological relationships
    Palakal, M
    Stephens, M
    Mukhopadhyay, S
    Raje, R
    Rhodes, S
    [J]. CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 97 - 108
  • [9] A web text mining approach for the evaluation of regional characteristics at the town level
    Wang, Shu
    Qian, Lang
    Zhu, Yunqiang
    Song, Jia
    Lu, Feng
    Zeng, Hongyun
    Chen, Pengfei
    Yuan, Wen
    Li, Weirong
    Geng, Wenguang
    [J]. TRANSACTIONS IN GIS, 2021, 25 (04) : 2074 - 2103
  • [10] Text mining
    Meier, M
    Beckh, M
    [J]. WIRTSCHAFTSINFORMATIK, 2000, 42 (02): : 165 - 167