Data Source Selection in Big Data Context

Cited by: 1
Authors
Safhi, Hicham Moad [1]
Frikh, Bouchra [1]
Ouhbi, Brahim [2]
Affiliations
[1] Sidi Mohammed Ben Abdellah Univ, LTTI Lab, Fes, Morocco
[2] Moulay Ismail Univ, ENSAM, LM2I Lab, Meknes, Morocco
Keywords
Big Data integration; Big Data Source Selection; Source reliability; Data quality; DISCOVERY
DOI
10.1145/3366030.3366121
Chinese Library Classification number
TP301 [Theory, methods]
Subject classification code
081202
Abstract
Big Data presents promising technological and economic opportunities. In fact, it has become the raw material of production for many organizations. Data is available in large quantities, and it continues to be generated abundantly. However, not all data carries valuable knowledge. Unreliable sources provide misleading and biased information, and even reliable sources can suffer from low data quality. In this paper, we propose a novel methodology for selecting data sources that considers both the presence and the absence of users' preferences. The proposed model integrates multiple factors that affect the reliability of data sources, including their quality, gain, cost and coverage. Experimental results on real-world datasets show its capability to find the subset of relevant and reliable sources with the lowest cost.
Pages: 611-616
Number of pages: 6