Automated labeling of bibliographic data extracted from biomedical online journals

Cited by: 7
Authors
Kim, JW [1]
Le, DX [1]
Thoma, GR [1]
Affiliations
[1] Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD 20894 USA
Source
Keywords
HTML; online journals; labeling; fuzzy rule-based algorithm; statistics; WebMARS; NLM
DOI
10.1117/12.476047
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A prototype system has been designed to automate the extraction of bibliographic data (e.g., article title, authors, abstract, and affiliation) from online biomedical journals to populate the National Library of Medicine's MEDLINE® database. This paper describes a key module in this system: the labeling module, which employs statistics and fuzzy rule-based algorithms to identify segmented zones in an article's HTML pages as specific bibliographic data. Results are presented from experiments conducted with 1,149 medical articles from 47 journal issues.
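The abstract does not list the features or rules the labeling module uses, so the following Python sketch is only an illustration of the general idea of fuzzy rule-based zone labeling. The `Zone` fields, the membership thresholds, and the three rules are all assumptions made for this example, not the authors' design. Each rule combines fuzzy memberships with `min` (a common fuzzy AND), and the zone receives the label whose rule fires most strongly.

```python
from dataclasses import dataclass


@dataclass
class Zone:
    """A segmented HTML zone. All fields are hypothetical features."""
    text: str
    order: int        # position of the zone on the page, 0 = first
    font_size: float  # rendered font size in points


def ramp_up(x, lo, hi):
    """Membership that rises linearly from 0 at lo to 1 at hi."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


def ramp_down(x, lo, hi):
    """Membership that falls linearly from 1 at lo to 0 at hi."""
    return 1.0 - ramp_up(x, lo, hi)


def label_scores(zone):
    """Evaluate one illustrative fuzzy rule per label (thresholds are assumed)."""
    n_words = len(zone.text.split())
    large_font = ramp_up(zone.font_size, 12, 18)
    near_top = ramp_down(zone.order, 0, 5)
    short = ramp_down(n_words, 10, 30)
    long_text = ramp_up(n_words, 40, 120)
    comma_rich = ramp_up(zone.text.count(",") / max(n_words, 1), 0.05, 0.3)
    return {
        # IF font is large AND zone is near the top AND text is short THEN title
        "title": min(large_font, near_top, short),
        # IF text is comma-rich AND zone is near the top AND text is short THEN authors
        "authors": min(comma_rich, near_top, short),
        # IF text is long THEN abstract
        "abstract": long_text,
    }


def label_zone(zone, threshold=0.5):
    """Return the strongest label, or "other" if no rule fires strongly enough."""
    scores = label_scores(zone)
    best = max(scores, key=scores.get)
    return best if scores[best] >= threshold else "other"
```

Calling `label_zone(Zone("Kim JW, Le DX, Thoma GR", order=1, font_size=11))` would pick the `authors` rule under these assumed thresholds. A real system would presumably derive the membership functions from statistics over labeled journal pages rather than hand-picking them as done here.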
Pages: 47-56
Page count: 10