A Figure Search Engine Architecture for a Chemistry Digital Library

被引:0
|
作者
Choudhury, Sagnik Ray [1 ]
Tuarob, Suppawong [2 ]
Mitra, Prasenjit [1 ,2 ]
Rokach, Lior [3 ]
Kirk, Andi [4 ]
Szep, Silvia [4 ]
Pellegrino, Donald [4 ]
Jones, Sue [4 ]
Giles, C. Lee [1 ,2 ]
机构
[1] Penn State Univ, Informat Sci & Technol, University Pk, PA 16802 USA
[2] Penn State Univ, Comp Sci & Engn, University Pk, PA 16802 USA
[3] Ben Gurion Univ Negev, Informat Syst Engn, IL-84105 Beer Sheva, Israel
[4] Dow Chem Co USA, Spring House, PA 19477 USA
基金
美国国家科学基金会;
关键词
Information Extraction; Figure Search;
D O I
暂无
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Academic papers contain multiple figures representing important findings and experimental results; we present a search engine specifically focused on figures in academic documents. This search engine allows users to search on figures in approximately 150,000 chemistry journal articles though the method is easily extendable to other domains. Our system indexes figure caption and mentions extracted from the PDF in documents using a custom built extractor. Recall and precision performance of extracted figures is in the 80 to 90 % range. We give the frame work for the extraction algorithm, architecture and ranking function.
引用
收藏
页码:369 / 370
页数:2
相关论文
共 50 条
  • [1] CiteSeerX: AI in a Digital Library Search Engine
    Wu, Jian
    Williams, Kyle
    Chen, Hung-Hsuan
    Khabsa, Madian
    Caragea, Cornelia
    Ororbia, Alexander
    Jordan, Douglas
    Giles, C. Lee
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2930 - 2937
  • [2] CiteSeerX: AI in a Digital Library Search Engine
    Wu, Jian
    William, Kyle
    Chen, Hung-Hsuan
    Khabsa, Madian
    Caragea, Cornelia
    Tuarob, Suppawong
    Ororbia, Alexander
    Jordan, Douglas
    Mitra, Prasenjit
    Giles, C. Lee
    [J]. AI MAGAZINE, 2015, 36 (03) : 35 - 48
  • [3] The research about Intelligent Search Engine and in Digital Library personalization services
    Fu Junhui
    Lv Jingqiao
    Li Xueyong
    [J]. SMART MATERIALS AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2011, 143-144 : 333 - +
  • [4] Digital Library Engine: Adapting Digital Library for Cloud Computing
    Lu, Weiming
    Zheng, Liangju
    Shao, Jian
    Wei, Baogang
    Zhuang, Yueting
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 934 - 941
  • [5] The Alexandria Digital Library architecture
    J. Frew
    M. Freeston
    N. Freitas
    L. Hill
    G. Janée
    K. Lovette
    R. Nideffer
    T. Smith
    Q. Zheng
    [J]. International Journal on Digital Libraries, 2000, 2 (4) : 259 - 268
  • [6] An XQuery engine for digital library systems
    Kang, JH
    Kim, CS
    Ko, EJ
    [J]. 2003 JOINT CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS, 2003, : 400 - 400
  • [7] Digital object and repository architecture for digital library
    Bong, KW
    Chol, RG
    Chol, HS
    Min, KC
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON MAGNETIC INDUSTRY (ISMI'04) & FIRST INTERNATIONAL SYMPOSIUM ON PHYSICS AND IT INDUSTRY (ISITI'04), 2005, : 262 - 263
  • [8] A conceptual architecture for semantic search engine
    Ilyas, QM
    Kai, YZ
    Talib, MA
    [J]. INMIC 2004: 8th International Multitopic Conference, Proceedings, 2004, : 605 - 610
  • [9] Multi Agent Architecture for Search Engine
    Verma, Disha
    Kochar, Barjesh
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (03) : 224 - 229
  • [10] LIBRARY SEARCH IN ANALYTICAL-CHEMISTRY
    ZUPAN, J
    [J]. FRESENIUS ZEITSCHRIFT FUR ANALYTISCHE CHEMIE, 1982, 311 (04): : 317 - 317