Text mining using database tomography and bibliometrics: A review

被引:107
|
作者
Kostoff, RN
Toothman, DR
Eberhart, HJ
Humenik, JA
机构
[1] Off Naval Res, Arlington, VA 22217 USA
[2] RSIS Inc, Mclean, VA 22102 USA
[3] NOESIS Inc, Manassas, VA USA
关键词
database tomography; text mining; bibliometrics; innovation; information retrieval; information extraction; cluster; taxonomies;
D O I
10.1016/S0040-1625(01)00133-0
中图分类号
F [经济];
学科分类号
02 ;
摘要
Database tomography (DT) is a textual database analysis system consisting of two major components: (1) algorithms for extracting multiword phrase frequencies and phrase proximities (physical closeness of the multiword technical phrases) from any type of large textual database, to augment (2) interpretative capabilities of the expert human analyst. DT has been used to derive technical intelligence from a variety of textual database sources, most recently the published technical literature as exemplified by the Science Citation Index (SCI) and the Engineering Compendex (EC). Phrase frequency analysis (the occurrence frequency of multiword technical phrases) provides the pervasive technical themes of the topical databases of interest, and phrase proximity analysis provides the relationships among the pervasive technical themes. In the structured published literature databases, bibliometric analysis of the database records supplements the DT results by identifying the recent most prolific topical area authors; the journals that contain numerous topical area papers; the institutions that produce numerous topical area papers; the keywords specified most frequently by the topical area authors; the authors whose works are cited most frequently in the topical area papers; and the particular papers and journals cited most frequently in the topical area papers. This review paper summarizes: (1) the theory and background development of DT; (2) past published and unpublished literature study results; (3) present application activities; (4) potential expansion to new DT applications. In addition, application of DT to technology forecasting is addressed. (C) 2001 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:223 / 253
页数:31
相关论文
共 50 条
  • [31] Mining knowledge from text repositories using information extraction: A review
    SANDEEP R SIRSAT
    DR VINAY CHAVAN
    DR SHRINIVAS P DESHPANDE
    Sadhana, 2014, 39 : 53 - 62
  • [32] A Review on Social Audience Identification on Twitter using Text mining methods
    Dastanwala, Priyanka B.
    Patel, Vibha
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1917 - 1920
  • [33] Service Quality Evaluation Using Text Mining: A Systematic Literature Review
    Vencovsky, Filip
    PERSPECTIVES IN BUSINESS INFORMATICS RESEARCH, BIR 2020, 2020, 398 : 159 - 173
  • [34] The Identification of Marketing Performance Using Text Mining of Airline Review Data
    Hong, Jae-Won
    Park, Seung-Bae
    MOBILE INFORMATION SYSTEMS, 2019, 2019
  • [35] Text mining in a literature review of urothelial cancer using topic model
    Hsuan-Jen Lin
    Phillip C.-Y. Sheu
    Jeffrey J. P. Tsai
    Charles C. N. Wang
    Che-Yi Chou
    BMC Cancer, 20
  • [36] Text mining in a literature review of urothelial cancer using topic model
    Lin, Hsuan-Jen
    Sheu, Phillip C-Y
    Tsai, Jeffrey J. P.
    Wang, Charles C. N.
    Chou, Che-Yi
    BMC CANCER, 2020, 20 (01)
  • [37] Mining knowledge from text repositories using information extraction: A review
    Sirsat, Sandeep R.
    Chavan, Vinay
    Deshpande, Shrinivas P.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2014, 39 (01): : 53 - 62
  • [38] Trends of e-learning research from 2000 to 2008: Use of text mining and bibliometrics
    Hung, Jui-long
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2012, 43 (01) : 5 - 16
  • [39] A review on authorship attribution in text mining
    Zheng, Wanwan
    Jin, Mingzhe
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (02)
  • [40] An advanced review on text mining in medicine
    Luque, Carmen
    Luna, Jose M.
    Luque, Maria
    Ventura, Sebastian
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (03)