MetaEnhance: Metadata Quality Improvement for Electronic Theses and Dissertations of University Libraries

被引:0
|
作者
Choudhury, Muntabir Hasan [1 ]
Salsabil, Lamia [1 ]
Jayanetti, Himarsha R. [1 ]
Wu, Jian [1 ]
Ingram, William A. [2 ]
Fox, Edward A. [2 ]
机构
[1] Old Dominion Univ, Norfolk, VA 23529 USA
[2] Virginia Tech, Blacksburg, VA USA
关键词
Digital Libraries; Scholarly Big Data; ETD; Metadata Quality; Artificial Intelligence;
D O I
10.1109/JCDL57899.2023.00019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Metadata quality is crucial for discovering digital objects through digital library (DL) interfaces. However, due to various reasons, the metadata of digital objects often exhibits incomplete, inconsistent, and incorrect values. We investigate methods to automatically detect, correct, and canonicalize scholarly metadata, using seven key fields of electronic theses and dissertations (ETDs) as a case study. We propose MetaEnhance, a framework that utilizes state-of-the-art artificial intelligence (AI) methods to improve the quality of these fields. To evaluate MetaEnhance, we compiled a metadata quality evaluation benchmark containing 500 ETDs, by combining subsets sampled using multiple criteria. We evaluated MetaEnhance against this benchmark and found that the proposed methods achieved nearly perfect F1-scores in detecting errors and F1-scores ranging from 0.85 to 1.00 for correcting five of seven key metadata fields. The codes and data are publicly available on GitHub(1).
引用
收藏
页码:61 / 65
页数:5
相关论文
共 50 条
  • [1] Electronic theses and dissertations in Nigeria university libraries Status, challenges and strategies
    Ezema, Ifeanyi J.
    Ugwu, C. I.
    [J]. ELECTRONIC LIBRARY, 2013, 31 (04): : 493 - 507
  • [2] Electronic Theses and Dissertations at the University of Virginia
    Sharretts, CW
    Shieh, J
    French, JC
    [J]. ASIS 99: PROCEEDINGS OF THE 62ND ASIS ANNUAL MEETING, VOL 36, 1999: KNOWLEDGE: CREATION ORGANIZATION AND USE, 1999, 36 : 240 - 255
  • [3] Morphing metadata: maximizing access to electronic theses and dissertations
    McCutcheon, Sevim
    Kreyche, Michael
    Maurer, Margaret Beecher
    Nickerson, Joshua
    [J]. LIBRARY HI TECH, 2008, 26 (01) : 41 - 57
  • [4] Metadata matters: evaluating the quality of Electronic Theses and Dissertations (ETDs) descriptions in Malaysian institutional repositories
    Osman, R.
    Idaya, A. M. K. Yanti
    Abrizah, A.
    [J]. MALAYSIAN JOURNAL OF LIBRARY & INFORMATION SCIENCE, 2023, 28 (01) : 109 - 125
  • [5] Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories
    Phiri, Lighton
    [J]. International Journal of Metadata, Semantics and Ontologies, 2020, 14 (03): : 234 - 248
  • [6] An Analysis of Evolving Metadata Influences, Standards, and Practices in Electronic Theses and Dissertations
    Potvin, Sarah
    Thompson, Santi
    [J]. LIBRARY RESOURCES & TECHNICAL SERVICES, 2016, 60 (02): : 99 - 114
  • [7] Status of Electronic Theses and Dissertations (ETDs) in Academic Libraries in Zimbabwe
    Chisita, Collence Takaingenhamo
    Enakrire, Rexwhite Tega
    Muziringa, Masimba Clyde
    [J]. INTERNATIONAL JOURNAL OF E-COLLABORATION, 2020, 16 (03) : 96 - 108
  • [8] Communication channels and the adoption of digital libraries for electronic theses and dissertations
    Allard, S
    [J]. JCDL 2004: PROCEEDINGS OF THE FOURTH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES: GLOBAL REACH AND DIVERSE IMPACT, 2004, : 381 - 381
  • [9] Electronic theses and dissertations
    Fineman, Y
    [J]. PORTAL-LIBRARIES AND THE ACADEMY, 2003, 3 (02) : 219 - 227
  • [10] Electronic theses and dissertations in CRIS
    Schoepfel, Joachim
    Zendulkova, Danica
    Fatemi, Omid
    [J]. 12TH INTERNATIONAL CONFERENCE ON CURRENT RESEARCH INFORMATION SYSTEMS (CRIS 2014): MANAGING DATA INTENSIVE SCIENCE: THE ROLE OF RESEARCH INFORMATION SYSTEMS IN REALISING THE DIGITAL AGENDA, 2014, 33 : 110 - 117