Technology of text mining

Cited by: 0
Authors
Visa, A [1]
Affiliation
[1] Tampere Univ Technol, FIN-33101 Tampere, Finland
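Source
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION | 2001 / Vol. 2123
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract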
A large amount of information is stored in databases, on intranets, or on the Internet. This information is organised in documents or in text documents; the difference depends on whether pictures, tables, figures, and formulas are included. The common problem is to find the desired piece of information, a trend, or an undiscovered pattern in these sources. The problem is not a new one. Traditionally it has been considered under the title of information seeking, that is, the science of how to find a book in a library, and it has been solved either by classifying and accessing documents with the Dewey Decimal Classification system or by assigning a number of characteristic keywords. The difficulty is that nowadays there are many unclassified documents in company databases, on intranets, and on the Internet. First, some terms are defined. Text filtering means an information seeking process in which documents are selected from a dynamic text stream. Text mining is a process of analysing text to extract information from it for particular purposes. Text categorisation means the process of clustering similar documents from a large document set. All these terms overlap to a certain degree. Text mining, also known as document information mining, text data mining, or knowledge discovery in textual databases, is an emerging technology for analysing large collections of unstructured documents for the purpose of extracting interesting and non-trivial patterns or knowledge. Typical subproblems that have been solved are language identification, feature selection/extraction, clustering, natural language processing, summarisation, categorisation, search, indexing, and visualisation. These subproblems are discussed in detail and the most common approaches are given. Finally, some examples of current uses of text mining are given and some potential application areas are mentioned.
Pages: 1 / 11
Number of pages: 11