Text mining using database tomography and bibliometrics: A review

被引:107
|
作者
Kostoff, RN
Toothman, DR
Eberhart, HJ
Humenik, JA
机构
[1] Off Naval Res, Arlington, VA 22217 USA
[2] RSIS Inc, Mclean, VA 22102 USA
[3] NOESIS Inc, Manassas, VA USA
关键词
database tomography; text mining; bibliometrics; innovation; information retrieval; information extraction; cluster; taxonomies;
D O I
10.1016/S0040-1625(01)00133-0
中图分类号
F [经济];
学科分类号
02 ;
摘要
Database tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multiword phrase frequencies and phrase proximities (physical closeness of the multiword technical phrases) from any type of large textual database, to augment (2) interpretative capabilities of the expert human analyst. DT has been used to derive technical intelligence from a variety of textual database sources, most recently the published technical literature as exemplified by the Science Citation Index (SCI) and the Engineering Compendex (EC). Phrase frequency analysis (the occurrence frequency of multiword technical phrases) provides the pervasive technical themes of the topical databases of interest, and phrase proximity analysis provides the relationships among the pervasive technical themes. In the structured published literature databases, bibliometric analysis of the database records supplements the DT results by identifying the recent most prolific topical area authors; the journals that contain numerous topical area papers; the institutions that produce numerous topical area papers; the keywords specified most frequently by the topical area authors; the authors whose works are cited most frequently in the topical area papers; and the particular papers and journals cited most frequently in the topical area papers. This review paper summarizes: (1) the theory and background development of DT; (2) past published and unpublished literature study results; (3) present application activities; (4) potential expansion to new DT applications. In addition, application of DT to technology forecasting is addressed. (C) 2001 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:223 / 253
页数:31
相关论文
共 50 条
  • [41] Mining the ACCENT database: a review and update
    Renfro, Lindsay A.
    Shi, Qian
    Sargent, Daniel J.
    CHINESE CLINICAL ONCOLOGY, 2013, 2 (02)
  • [42] TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining
    Yu-Ching Fang
    Hsuan-Cheng Huang
    Hsin-Hsi Chen
    Hsueh-Fen Juan
    BMC Complementary and Alternative Medicine, 8
  • [43] TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining
    Fang, Yu-Ching
    Huang, Hsuan-Cheng
    Chen, Hsin-Hsi
    Juan, Hsueh-Fen
    BMC COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2008, 8 (1):
  • [44] Generating the Blood Exposome Database Usinga Comprehensive Text Mining and Database Fusion Approach
    Barupal, Dinesh Kumar
    Fiehn, Oliver
    ENVIRONMENTAL HEALTH PERSPECTIVES, 2019, 127 (09)
  • [45] Text Preprocessing for Text Mining in Organizational Research: Review and Recommendations
    Hickman, Louis
    Thapa, Stuti
    Tay, Louis
    Cao, Mengyang
    Srinivasan, Padmini
    ORGANIZATIONAL RESEARCH METHODS, 2022, 25 (01) : 114 - 146
  • [46] A review: Data mining and Text mining Tools in biological domain
    Gouider, Manel
    Hamdi, Ines
    Ben Ghezala, Henda
    VISION 2020: INNOVATION MANAGEMENT, DEVELOPMENT SUSTAINABILITY, AND COMPETITIVE ECONOMIC GROWTH, 2016, VOLS I - VII, 2016, : 2737 - 2746
  • [47] Using text mining and link analysis for software mining
    Grcar, Miha
    Grobehlik, Marko
    Mladenic, Dunja
    MINING COMPLEX DATA, 2008, 4944 : 1 - 12
  • [48] Marketing Management in the Hotel Industry: A Systematic Literature Review by Using Text Mining
    Horng, Jeou-Shyan
    Liu, Chih-Hsing
    Chou, Sheng-Fang
    Yu, Tai-Yi
    Hu, Da-Chian
    SUSTAINABILITY, 2022, 14 (04)
  • [49] A Review on Text Mining Techniques for the Prediction of Psychological Behaviour using Social Media
    Padminivalli, S. J. R. K., V
    Rao, M. V. P. Chandra Sekhara
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (03) : 2462 - 2467
  • [50] A Review of Digital Transformation on Supply Chain Process Management Using Text Mining
    Tavana, Madjid
    Shaabani, Akram
    Raeesi Vanani, Iman
    Kumar Gangadhari, Rajan
    PROCESSES, 2022, 10 (05)