Technology of text mining

被引:0
|
作者
Visa, A [1 ]
机构
[1] Tampere Univ Technol, FIN-33101 Tampere, Finland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large amount of information is stored in databases, in intranets or in Internet. This information is organised in documents or in text documents. The difference depends on the fact if pictures, tables, figures, and formulas are included or not. The common problem is to find the desired piece of information, a trend, or an undiscovered pattern from these sources. The problem is not a new one. Traditionally the problem has been considered under the title of information seeking, this means the science how to find a book in the library. Traditionally the problem has been solved either by classifying and accessing documents by Dewey Decimal Classification system or by giving a number of characteristic keywords. The problem is that nowadays there axe lots of unclassified documents in company databases and in intranet or in Internet. First one defines some terms. Text filtering means an information seeking process in which documents are selected from a dynamic text stream. Text mining is a process of analysing text to extract information from it for particular purposes. Text categorisation means the process of clustering similar documents from a large document set. All these terms have a certain degree of overlapping. Text mining, also know as document information mining, text data mining, or knowledge discovery in textual databases is an merging technology for analysing large collections of unstructured documents for the purposes of extracting interesting and non-trivial patterns or knowledge. Typical subproblems that have been solved axe language identification, feature selection/extraction, clustering, natural language processing, summarisation, categorisation, search, indexing, and visualisation. These subproblems are discussed in detail and the most common approaches axe given. Finally some examples of current uses of text mining are given and some potential application areas are mentioned.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] Text mining for technology monitoring
    Teichert, T
    Mittermayer, MA
    [J]. IEMC-2002: IEEE INTERNATIONAL ENGINEERING MANAGEMENT CONFERENCE, VOLS I AND II, PROCEEDINGS: MANAGING TECHNOLOGY FOR THE NEW ECONOMY, 2002, : 596 - 601
  • [2] Research and Exploration of Text Mining Technology
    Cao Lijun
    Yu Hongkui
    Li Yuxiang
    Liu Xiyin
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 5, 2010, : 435 - 439
  • [3] A technology of text classification of data mining
    Yang, Bin
    Meng, Zhi-qing
    [J]. Xiangtan Daxue Ziran Kexue Xuebao, 2001, 23 (04): : 34 - 37
  • [4] A Variety of Text Mining Technology and Tools Research
    Jiang, Mingyang
    Fan, Xiaojing
    Zhang, Xinhong
    Jie, Lian
    Zhou, Yuxin
    Wang, QiangHu
    Zhang, ZhiFeng
    Pei, Zhili
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 918 - +
  • [5] Text mining - from technology to biological applications
    Koehler, J
    [J]. BRIEFINGS IN BIOINFORMATICS, 2005, 6 (03) : 220 - 221
  • [6] Morphology analysis for technology roadmapping: application of text mining
    Yoon, Byungun
    Phaal, Rob
    Probert, David
    [J]. R & D MANAGEMENT, 2008, 38 (01) : 51 - 68
  • [7] Blockchain technology forecasting by patent analytics and text mining
    Bamakan, Seyed Mojtaba Hosseini
    Bondarti, Alireza Babaei
    Bondarti, Parinaz Babaei
    Qu, Qiang
    [J]. BLOCKCHAIN-RESEARCH AND APPLICATIONS, 2021, 2 (02):
  • [8] TEXT MINING FOR TECHNOLOGY ROADMAPPING - THE STRATEGIC VALUE OF INFORMATION
    Kayser, Victoria
    Goluchowicz, Kerstin
    Bierwisch, Antje
    [J]. INTERNATIONAL JOURNAL OF INNOVATION MANAGEMENT, 2014, 18 (03)
  • [9] Towards Text Mining in Technology-Enhanced Learning
    Bayer, Jaroslav
    Cuhel, Matej
    Geryk, Jan
    Obsivac, Tomas
    Popelinsky, Lubos
    [J]. PROCEEDINGS OF THE 9TH EUROPEAN CONFERENCE ON E-LEARNING, VOL 1, 2010, : 67 - 71
  • [10] A study on technology monitoring based on text mining to support science and technology management
    Yuan, JP
    Zhu, DH
    Li, JF
    [J]. PROCEEDINGS OF THE WORLD ENGINEERS' CONVENTION 2004, VOL A, NETWORK ENGINEERING AND INFORMATION SOCIETY, 2004, : 47 - 51