SourceTrac: Tracing Data Sources within Spreadsheets

Author
Asuncion, Hazeline U. [1]
Affiliation
[1] Univ Washington, Bothell, WA USA
Keywords
data provenance; spreadsheets; multiple sources
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Analyzing data from multiple sources is a common task in scientific research. In particular, spreadsheet data is often aggregated from a variety of sources to identify patterns and synthesize reports. Yet, techniques are lacking for automatically capturing the provenance of such data within spreadsheet environments like Excel. We present a novel approach for fine-grained tracing of tabular data that may have been obtained from files, databases, or the Web. Our approach provides relevant provenance information at both the micro-level (per cell) and the macro-level (per sheet). Initial results suggest that our approach is scalable and beneficial to data analysts.
Pages: 1 / 10
Page count: 10