Too Big to Mail: On the Way to Publish Large-scale Mobile Analytics Data

被引:0
|
作者
Peltonen, Ella [1 ]
Lagerspetz, Eemil [1 ]
Nurmi, Petteri [1 ,2 ]
Tarkoma, Sasu [1 ,2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, HIIT, Helsinki, Finland
[2] Univ Helsinki, Dept Comp Sci, POB 64, FI-00014 Helsinki, Finland
基金
芬兰科学院;
关键词
Big Data; Mobile Analytics; Energy-awareness;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Carat project started in 2012 has collected over 1.5 TB of data from over 850,000 mobile users all over the world. The project uses Apache Thrift to transmit data, and Apache Spark to run data analysis tasks, and the gist of the Carat analysis method has been published. While the Carat application code is open source, the data is much harder to share because of its size and privacy concerns. This paper outlines the challenges in sharing such a large-scale dataset with detailed information about smart devices, applications, and their users, and presents some solutions to these challenges.
引用
收藏
页码:2374 / 2377
页数:4
相关论文
共 50 条
  • [21] Node attributes and edge structure for large-scale big data network analytics and community detection
    Department of Computer Science and CSE, North Carolina AandT State University, Greensboro
    NC, United States
    [J]. IEEE Int. Symp. Technol. Homel. Secur., HST, 2015,
  • [22] Flexpath: Type-Based Publish/Subscribe System for Large-scale Science Analytics
    Dayal, Jai
    Bratcher, Drew
    Eisenhauer, Greg
    Schwan, Karsten
    Wolf, Matthew
    Zhang, Xuechen
    Abbasi, Hasan
    Klasky, Scott
    Podhorszki, Norbert
    [J]. 2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 246 - 255
  • [23] Visual Analytics of Large-Scale Climate Model Data
    Wong, Pak Chung
    Shen, Han-Wei
    Leung, Ruby
    Hagos, Samson
    Lee, Teng-Yok
    Tong, Xin
    Lu, Kewei
    [J]. 2014 IEEE 4TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2014, : 85 - 92
  • [24] Disco: A Computing Platform for Large-Scale Data Analytics
    Mundkur, Prashanth
    Tuulos, Ville
    Flatow, Jared
    [J]. ERLANG 11: PROCEEDINGS OF THE 2011 ACM SIGPLAN ERLANG WORKSHOP, 2011, : 84 - 89
  • [25] Scalable computing for large-scale multimedia data analytics
    Karuppiah, Marimuthu
    Chaudhry, Shehzad Ashraf
    Alsharif, Mohammed H.
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (03) : 601 - 603
  • [26] Visual Cascade Analytics of Large-Scale Spatiotemporal Data
    Deng, Zikun
    Weng, Di
    Liang, Yuxuan
    Bao, Jie
    Zheng, Yu
    Schreck, Tobias
    Xu, Mingliang
    Wu, Yingcai
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (06) : 2486 - 2499
  • [27] Anytime Large-Scale Analytics of Linked Open Data
    Soulet, Arnaud
    Suchanek, Fabian M.
    [J]. SEMANTIC WEB - ISWC 2019, PT I, 2019, 11778 : 576 - 592
  • [28] Aggregation and Multidimensional Analysis of Big Data for Large-Scale Scientific Applications: Models, Issues, Analytics, and Beyond
    Cuzzocrea, Alfredo
    [J]. PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [29] Big Data Analytics: Impacting Business in Big Way
    Sharma, Neha
    Sawai, Deepali
    Surve, Ganesh
    [J]. 2017 1ST IEEE INTERNATIONAL CONFERENCE ON DATA MANAGEMENT, ANALYTICS AND INNOVATION (ICDMAI), 2017, : 111 - 116
  • [30] MOBILE BIG DATA FOR URBAN ANALYTICS
    Hui, Pan
    Li, Yong
    Ott, Joerg
    Uhlig, Steve
    Han, Bo
    Tan, Kun
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (11) : 12 - 12