Unravelling Unstructured Data: A Wealth of Information in Big Data

被引:0
|
作者
Tanwar, Mona [1 ]
Duggal, Reena [1 ]
Khatri, Sunil Kumar [1 ]
机构
[1] Amity Univ Uttar Pradesh, Amity Inst Informat Technol, Noida, India
关键词
Big Data; Unstructured data; Text Analytics; Audio Analytics; Video Analytics; Social Media Analytics; CHALLENGES;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Big Data is data of high volume and high variety being produced or generated at high velocity which cannot be stored, managed, processed or analyzed using the existing traditional software tools, techniques and architectures. With big data many challenges such as scale, heterogeneity, speed and privacy are associated but there are opportunities as well. Potential information is locked in big data which if properly leveraged will make a huge difference to business. With the help of big data analytics, meaningful insights can be extracted from big data which is heterogeneous in nature comprising of structured, unstructured and semi-structured content. One prime challenge in big data analytics is that nearly 95% data is unstructured. This paper describes what big data and big data analytics is. A review of different techniques and approaches to analyze unstructured data is given. This paper emphasizes the importance of analysis of unstructured data along with structured data in business to extract holistic insights. The need for appropriate and efficient analytical methods for knowledge discovery from huge volumes of heterogeneous data in unstructured formats has been highlighted.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Unstructured Data Treatment for Big Data Solutions
    Sato, Shintaro
    Kayahara, Akihiro
    Imai, Shin-ichi
    [J]. INTERNATIONAL SYMPOSIUM ON SEMICONDUCTOR MANUFACTURING (ISSM) 2016 PROCEEDINGS OF TECHNICAL PAPERS, 2016,
  • [2] ExNav: An Interactive Big Data hxploration Framework for Big Unstructured Data
    Ge, Xiaoyu
    Zhang, Xiaozhong
    Chrysanthis, Panos K.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 503 - 512
  • [3] A Framework for Extracting Reliable Information from Unstructured Uncertain Big Data
    Singh, Sanjay Kumar
    Mani, Neel
    Singh, Bharat
    [J]. INTELLIGENT DECISION TECHNOLOGIES 2016, PT II, 2016, 57 : 175 - 185
  • [4] An analytical study of information extraction from unstructured and multidimensional big data
    Adnan, Kiran
    Akbar, Rehan
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [5] An analytical study of information extraction from unstructured and multidimensional big data
    Kiran Adnan
    Rehan Akbar
    [J]. Journal of Big Data, 6
  • [6] Limitations of information extraction methods and techniques for heterogeneous unstructured big data
    Adnan, Kiran
    Akbar, Rehan
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2019, 11
  • [7] Big Data Quality Assessment Model for Unstructured Data
    Taleb, Ikbal
    Serhani, Mohamed Adel
    Dssouli, Rachida
    [J]. PROCEEDINGS OF THE 2018 13TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2018, : 69 - 74
  • [8] Big Data Trend: Knowledge Discovery on the Unstructured Data
    Abu Muntalib, Shamsiah
    Sidi, Fatimah
    Jabar, Marzanah A.
    Ishak, Iskandar
    [J]. PROCEEDING OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2014, VOLS 1 AND 2, 2014, : 338 - 342
  • [9] Die Vorstandsperspektive: Big Data = Big Wealth?
    Christof Leng
    [J]. Informatik-Spektrum, 2014, 37 (2) : 88 - 89
  • [10] An Approach to Security for Unstructured Big Data
    Md. Ezazul Islam
    Md. Rafiqul Islam
    A B M Shawkat Ali
    [J]. The Review of Socionetwork Strategies, 2016, 10 (2) : 105 - 123