Text mining using database tomography and bibliometrics: A review

被引:107
|
作者
Kostoff, RN
Toothman, DR
Eberhart, HJ
Humenik, JA
机构
[1] Off Naval Res, Arlington, VA 22217 USA
[2] RSIS Inc, Mclean, VA 22102 USA
[3] NOESIS Inc, Manassas, VA USA
关键词
database tomography; text mining; bibliometrics; innovation; information retrieval; information extraction; cluster; taxonomies;
D O I
10.1016/S0040-1625(01)00133-0
中图分类号
F [经济];
学科分类号
02 ;
摘要
Database tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multiword phrase frequencies and phrase proximities (physical closeness of the multiword technical phrases) from any type of large textual database, to augment (2) interpretative capabilities of the expert human analyst. DT has been used to derive technical intelligence from a variety of textual database sources, most recently the published technical literature as exemplified by the Science Citation Index (SCI) and the Engineering Compendex (EC). Phrase frequency analysis (the occurrence frequency of multiword technical phrases) provides the pervasive technical themes of the topical databases of interest, and phrase proximity analysis provides the relationships among the pervasive technical themes. In the structured published literature databases, bibliometric analysis of the database records supplements the DT results by identifying the recent most prolific topical area authors; the journals that contain numerous topical area papers; the institutions that produce numerous topical area papers; the keywords specified most frequently by the topical area authors; the authors whose works are cited most frequently in the topical area papers; and the particular papers and journals cited most frequently in the topical area papers. This review paper summarizes: (1) the theory and background development of DT; (2) past published and unpublished literature study results; (3) present application activities; (4) potential expansion to new DT applications. In addition, application of DT to technology forecasting is addressed. (C) 2001 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:223 / 253
页数:31
相关论文
共 50 条
  • [1] Fractals text mining using bibliometrics and database tomography
    Kostoff, RN
    Shlesinger, MF
    Malpohl, G
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2004, 12 (01) : 1 - 16
  • [2] Nonlinear dynamics text mining using bibliometrics and Database Tomography
    Kostoff, RN
    Shlesinger, MF
    Tshiteya, R
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2004, 14 (01): : 61 - 92
  • [3] Electrochemical power text mining using bibliometrics and database tomography
    Kostoff, RN
    Tshiteya, R
    Pfeil, KM
    Humenik, JA
    JOURNAL OF POWER SOURCES, 2002, 110 (01) : 163 - 176
  • [4] Fullerene data mining using bibliometrics and database tomography
    Kostoff, RN
    Braun, T
    Schubert, A
    Toothman, DR
    Humenik, JA
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (01): : 19 - 39
  • [5] Power source roadmaps using bibliometrics and database tomography
    Kostoff, RN
    Tshiteya, R
    Pfeil, KM
    Humenik, JA
    Karypis, G
    ENERGY, 2005, 30 (05) : 709 - 730
  • [6] Text Mining in Education-A Bibliometrics-Based Systematic Review
    Ahadi, Alireza
    Singh, Abhay
    Bower, Matt
    Garrett, Michael
    EDUCATION SCIENCES, 2022, 12 (03):
  • [7] Hypersonic and supersonic flow roadmaps using bibliometrics and database tomography
    Kostoff, RN
    Eberhart, HJ
    Toothman, DR
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (05): : 427 - 447
  • [8] Weighted Hybrid Clustering by Combining Text Mining and Bibliometrics on a Large-Scale Journal Database
    Liu, Xinhai
    Yu, Shi
    Janssens, Frizo
    Glanzel, Wolfgang
    Moreau, Yves
    De Moor, Bart
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (06): : 1105 - 1119
  • [9] Educational chatbot research: text mining and bibliometrics
    Chen, Xieling
    Zou, Di
    Xie, Haoran
    Wang, Fu Lee
    INTERACTIVE LEARNING ENVIRONMENTS, 2024,
  • [10] Citation mining:: Integrating text mining and bibliometrics for research user profiling
    Kostoff, RN
    del Río, JA
    Humenik, JA
    García, EO
    Ramírez, AM
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2001, 52 (13): : 1148 - 1156