Text Mining for Type of Research Classification

Cited by: 1
Authors
Lowe, David B. [1 ]
Dollinger, Ian [2 ]
Koster, Tristan [3 ]
Herbert, Bruce E. [1 ]
Affiliations
[1] Texas A&M Univ, Univ Lib, Off Scholarly Commun, College Stn, TX 77843 USA
[2] Texas A&M Univ, Elect & Comp Engn, College Stn, TX USA
[3] Texas A&M Univ, Interdisciplinary Engn Data Sci, College Stn, TX USA
Funding
U.S. National Science Foundation;
Keywords
Text mining; type of research classification; cataloging for digital resources; BERT (Bidirectional Encoder Representations from Transformers); project-based learning;
DOI
10.1080/01639374.2021.1998281
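Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work];
Discipline classification code
1205 ; 120501 ;
Abstract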
This project brought together undergraduate students in Computer Science with librarians to mine abstracts of articles from the Texas A&M University Libraries' institutional repository, OAKTrust, in order to probe the creation of new metadata to improve discovery and use. The mining operation task consisted simply of classifying the articles into two categories of research type: basic research ("for understanding," "curiosity-based," or "knowledge-based") and applied research ("use-based"). These categories are fundamental especially for funders but are also important to researchers. The mining-to-classification steps took several iterations, but ultimately, we achieved good results with the toolkit BERT (Bidirectional Encoder Representations from Transformers). The project and its workflows represent a preview of what may lie ahead in the future of crafting metadata using text mining techniques to enhance discoverability.
Pages: 815-834
Page count: 20
Related papers
50 in total
  • [1] The research of classification technologies based on text mining
    Liu, LZ
    Li, PZ
    Chen, JJ
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 8517 - 8520
  • [2] Research article classification with text mining method
    Gurbuz, Tugba
    Uluyol, Celebi
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (01):
  • [3] The Logistics Policy Classification Research Based on Text Mining
    Zhang, Yong-an
    Qie, Hai-tuo
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON HUMANITY AND SOCIAL SCIENCE (ICHSS 2016), 2016, : 109 - 113
  • [4] Research on text classification mining based on Naive Bayes
    Liu, LZ
    Zhang, CL
    Chen, JJ
    [J]. ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 8521 - 8524
  • [5] Classification of News and Research Articles Using Text Pattern Mining
    Chaudhari, Sujit V.
    Lade, Shrikant
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (10): : 43 - 47
  • [6] Text Mining-based Research on Aircraft Faults Classification and Retrieval Model
    Xu, Xingxing
    Zhou, Shenghan
    Xiao, Yiyong
    Chang, Wenbing
    Wei, Fajie
    Yang, Ming
    [J]. 2020 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2020), 2020,
  • [7] Research on online shopping contextual cues: refining classification from text mining
    Wang, Lin
    Gao, Huaxia
    Zhao, Yang
    [J]. ASIA PACIFIC JOURNAL OF MARKETING AND LOGISTICS, 2023, 35 (11) : 2704 - 2726
  • [8] A SURVEY ON CLASSIFICATION TECHNIQUES FOR TEXT MINING
    Brindha, S.
    Sukumaran, S.
    Prabha, K.
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2016,
  • [9] Cyberbullying Classification using Text Mining
    Noviantho
    Isa, Sani Muhamad
    Ashianti, Livia
    [J]. 2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 241 - 245
  • [10] Text mining in the classification of digital documents
    Contreras Barrera, Marcial
    [J]. BIBLIOS-REVISTA DE BIBLIOTECOLOGIA Y CIENCIAS DE LA INFORMACION, 2016, (64): : 33 - 43