Sampling Operations on Big Data

被引:0
|
作者
Gadepally, Vijay [1 ]
Herr, Taylor [1 ]
Johnson, Luke [1 ]
Milechin, Lauren [1 ]
Milosavljevic, Maja [1 ]
Miller, Benjamin A. [1 ]
机构
[1] MIT, Lincoln Lab, Lexington, MA 02420 USA
关键词
LINK PREDICTION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The 3Vs - Volume, Velocity and Variety - of Big Data continues to be a large challenge for systems and algorithms designed to store, process and disseminate information for discovery and exploration under real-time constraints. Common signal processing operations such as sampling and filtering, which have been used for decades to compress signals are often undefined in data that is characterized by heterogeneity, high dimensionality, and lack of known structure. In this article, we describe and demonstrate an approach to sample large datasets such as social media data. We evaluate the effect of sampling on a common predictive analytic: link prediction. Our results indicate that greatly sampling a dataset can still yield meaningful link prediction results.
引用
收藏
页码:1515 / 1519
页数:5
相关论文
共 50 条
  • [41] BigDataStack: A holistic data-driven stack for big data applications and operations
    Kyriazis, Dimosthenis
    Doulkeridis, Christos
    Gouvas, Panagiotis
    Jimenez-Peris, Ricardo
    Ferrer, Ana Juan
    Kallipolitis, Leonidas
    Kranas, Pavlos
    Kousiouris, George
    Macdonald, Craig
    McCreadie, Richard
    Moatti, Yosef
    Papageorgiou, Apostolos
    Patino-Martinez, Marta
    Plitsos, Stathis
    Poulopoulos, Dimitris
    Paradell, Antonio
    Raouzaiou, Amaryllis
    Ta-Shma, Paula
    Vianello, Valerio
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 237 - 241
  • [42] Making sense of Big Data - can it transform operations management?
    Matthias, Olga
    Fouweather, Ian
    Gregory, Ian
    Vernon, Andy
    [J]. INTERNATIONAL JOURNAL OF OPERATIONS & PRODUCTION MANAGEMENT, 2017, 37 (01) : 37 - 55
  • [43] Big Data Perspective on Financial Operations Revenue Management Approach
    Yang, Jing
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [44] Modeling big data enablers for operations and supply chain management
    Lamba, Kuldeep
    Singh, Surya Prakash
    [J]. INTERNATIONAL JOURNAL OF LOGISTICS MANAGEMENT, 2018, 29 (02) : 629 - 658
  • [45] Big Data and Discrete Optimization for Electric Urban Bus Operations
    Wuertz, Samuel
    Bogenberger, Klaus
    Goehner, Ulrich
    [J]. TRANSPORTATION RESEARCH RECORD, 2023, 2677 (03) : 389 - 401
  • [46] Big Data Perspective on Financial Operations Revenue Management Approach
    Yang, Jing
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [47] Cognitive systems and operations research in big data and cloud computing
    Marek R. Ogiela
    Hoon Ko
    [J]. Annals of Operations Research, 2018, 265 : 183 - 186
  • [48] Taxonomy for Unsecure Big Data Processing in Security Operations Centers
    Miloslavskaya, Natalia
    Tolstoy, Alexander
    Zapechnikov, Sergey
    [J]. 2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 154 - 159
  • [49] The Role of Big Data in Industrial (Bio)chemical Process Operations
    Udugama, Isuru A.
    Gargalo, Carina L.
    Yamashita, Yoshiyuki
    Taube, Michael A.
    Palazoglu, Ahmet
    Young, Brent R.
    Gernaey, Krist, V
    Kulahci, Murat
    Bayer, Christoph
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2020, 59 (34) : 15283 - 15297
  • [50] Secure transmission for big data based on nested sampling and coprime sampling with spectrum efficiency
    Chen, Junjie
    Liang, Qilian
    Wang, Jie
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2015, 8 (14) : 2447 - 2456