Distributed data mining and its applications to intelligent textual information processing

被引:0
|
作者
Qiu, SB [1 ]
Qiu, M [1 ]
机构
[1] Univ New Mexico, Dept Elect & Comp Engn, Albuquerque, NM 87131 USA
关键词
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Textual information processing is of fundamental importance, due to the massive amount of documents, especially online textual information that we need to process every day. In this paper, we stud), data mining techniques applied to intelligent textual information processing in distributed environments, including text classification. information extraction (IE) and topic detection and tracking (TDT). These intelligent processing techniques will improve the quality and efficiency of information resource management and utilization. Their statistical models and computational algorithms challenge the researches in data mining and distributed/parallel computing. When successfully applied, they will help enhance and benefit applications in IT, digital library, and information retrieval. Specifically, we study the distributed computing of the following algorithms: naive Bayes classifier combined with expectation-maximization (EM) for text classification, hidden Markov model for information extraction, and deterministic annealing with EM for topic detection and tracking. We also study the performances of the proposed algorithms and experiment on the improvements.
引用
收藏
页码:366 / 370
页数:5
相关论文
共 50 条
  • [31] Advances in Information Processing and Its Applications
    Wang, Zheng
    Kao, Shao Ku
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2020, 13 (08) : 1108 - 1109
  • [32] Data mining techniques for information security applications
    Al-Shawi, Amany
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (03): : 221 - 229
  • [33] Integrated distributed intelligent multimedia system and its applications
    Rao, M
    Yang, H
    SMC '97 CONFERENCE PROCEEDINGS - 1997 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: CONFERENCE THEME: COMPUTATIONAL CYBERNETICS AND SIMULATION, 1997, : 129 - 134
  • [34] Mining ideas from textual information
    Thorleuchter, Dirk
    Van den Poel, Dirk
    Prinzie, Anita
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 7182 - 7188
  • [35] Task scheduling in Distributed Data Mining for medical applications
    Gantenbein, RE
    Sung, CO
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2003, : 250 - 253
  • [36] Shared State for Distributed Interactive Data Mining Applications
    Srinivasan Parthasarathy
    Sandhya Dwarkadas
    Distributed and Parallel Databases, 2002, 11 : 129 - 155
  • [37] Distributed data mining on grids: Services, tools, and applications
    Cannataro, M
    Congiusta, A
    Pugliese, A
    Talia, D
    Trunfio, P
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (06): : 2451 - 2465
  • [38] Shared state for distributed interactive data mining applications
    Parthasarathy, S
    Dwarkadas, S
    DISTRIBUTED AND PARALLEL DATABASES, 2002, 11 (02) : 129 - 155
  • [39] Lightweight clustering technique for distributed data mining applications
    Aouad, Lamine M.
    Le-Khac, Nhien-An
    Kechadi, Tahar M.
    ADVANCES IN DATA MINING: THEORETICAL ASPECTS AND APPLICATIONS, PROCEEDINGS, 2007, 4597 : 120 - +
  • [40] High Efficient Scheduler for Distributed Data Mining Applications
    Liu, Meiqun
    Gao, Kun
    CEA'09: PROCEEDINGS OF THE 3RD WSEAS INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS, 2009, : 87 - +