Text mining at the term level

被引:0
|
作者
Feldman, R [1 ]
Fresko, M
Kinar, Y
Lindell, Y
Liphstat, O
Rajman, M
Schler, Y
Zamir, O
机构
[1] Bar Ilan Univ, Dept Math & Comp Sci, Ramat Gan, Israel
[2] Swiss Fed Inst Technol, Artificial Intelligence Lab, CH-1015 Lausanne, Switzerland
[3] Univ Washington, Dept Comp Sci, Seattle, WA 98195 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge Discovery in Databases (KDD) focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns within them. While most work on KDD has been concerned with structured databases, there has been little work on handling the huge amount of information that is available only in unstructured textual form. Previous work in text mining focused at the word or the tag level. This paper presents an approach to performing text mining at the term level. The mining process starts by preprocessing the document collection and extracting terms from the documents. Each document is then represented by a set of terms and annotations characterizing the document. Terms and additional higher-level entities are then organized in a hierarchical taxonomy. In this paper we will describe the Term Extraction module of the Document Explorer system, and provide experimental evaluation performed on a set of 52,000 documents published by Reuters in the years 1995-1996.
引用
收藏
页码:65 / 73
页数:9
相关论文
共 50 条
  • [41] Text mining in action!
    Mladenic, D
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 52 - 62
  • [42] Text Association Analysis and Ambiguity in Text Mining
    Bhonde, S. B.
    Paikrao, R. L.
    Rahane, K. U.
    [J]. INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN SCIENCE AND TECHNOLOGY (ICM2ST-10), 2010, 1324 : 204 - +
  • [43] Text Mining Technique for Data Mining Application
    Govindarajan, M.
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 544 - 549
  • [44] Integrated Text Mining and Chemoinformatics Analysis Associates Diet to Health Benefit at Molecular Level
    Jensen, Kasper
    Panagiotou, Gianni
    Kouskoumvekaki, Irene
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (01)
  • [45] Integrating IFC and CityGML Model at Schema Level by Using Linguistic and Text Mining Techniques
    Ding, Xiaohui
    Yang, Ji
    Liu, Lingjia
    Huang, Wumeng
    Wu, Peng
    [J]. IEEE ACCESS, 2020, 8 (08): : 56429 - 56440
  • [46] Pediatric literature trends: high-level analysis using text-mining
    Sarina Levy-Mendelovich
    Yiftach Barbash
    Ivan Budnik
    Daniella Levy-Erez
    Raz Somech
    Shelly Soffer
    Susan Furth
    Eyal Klang
    [J]. Pediatric Research, 2021, 90 : 212 - 215
  • [47] Automated Surgical Term Clustering: A Text Mining Approach for Unstructured Textual Surgery Descriptions
    Khaleghi, Tannaz
    Murat, Alper
    Arslanturk, Suzan
    Davies, Eric
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (07) : 2107 - 2118
  • [48] Long-term stock index forecasting based on text mining of regulatory disclosures
    Feuerriegel, Stefan
    Gordon, Julius
    [J]. DECISION SUPPORT SYSTEMS, 2018, 112 : 88 - 97
  • [49] Term Structure Models During the Global Financial Crisis: A Parsimonious Text Mining Approach
    Nishimura, Kiyohiko G.
    Sato, Seisho
    Takahashi, Akihiko
    [J]. ASIA-PACIFIC FINANCIAL MARKETS, 2019, 26 (03) : 297 - 337
  • [50] Term Structure Models During the Global Financial Crisis: A Parsimonious Text Mining Approach
    Kiyohiko G. Nishimura
    Seisho Sato
    Akihiko Takahashi
    [J]. Asia-Pacific Financial Markets, 2019, 26 : 297 - 337