VizioMetrix: A Platform for Analyzing the Visual Information in Big Scholarly Data

Cited by: 7
Authors
Lee, Po-Shen [1]
West, Jevin D. [2]
Howe, Bill [1]
Affiliations
[1] Univ Washington, 185 Stevens Way, Seattle, WA 98105 USA
[2] Univ Washington, Box 352840, Seattle, WA 98195 USA
Funding
National Science Foundation (USA)
Keywords
Figure Retrieval; Information Retrieval; Crowdsourcing; Opendata; Bibliometrics; Scientometrics; Viziometrics
DOI
10.1145/2872518.2890523
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
We present VizioMetrix, a platform that extracts visual information from the scientific literature and makes it available for use in new information retrieval applications and for studies that look at patterns of visual information across millions of papers. New ideas are conveyed visually in the scientific literature through figures (diagrams, photos, visualizations, tables), but these visual elements remain ensconced in the surrounding paper and are difficult to use directly to facilitate information discovery tasks or longitudinal analytics. Very few applications in information retrieval, academic search, or bibliometrics make direct use of the figures, and none attempt to recognize and exploit the type of figure, which can be used to augment interactions with a large corpus of scholarly literature. The VizioMetrix platform processes a corpus of documents, classifies the figures, organizes the results into a cloud-hosted database, and drives three distinct applications to support bibliometric analysis and information retrieval. The first application supports information retrieval tasks by allowing rapid browsing of classified figures. The second application supports longitudinal analysis of visual patterns in the literature and facilitates data mining of these figures. The third application supports crowdsourced tagging of figures to improve classification, augment search, and facilitate new kinds of analyses. Our initial corpus is the entirety of PubMed Central (PMC), and will be released to the public alongside this paper; we welcome other researchers to make use of these resources.
Pages: 413-418
Page count: 6