Data science in light of natural language processing: An overview

被引:13
|
作者
Zeroual, Imad [1 ]
Lakhouaja, Abdelhak [1 ]
机构
[1] Mohamed First Univ, Fac Sci, Av Med 6 BP 717, Oujda 60000, Morocco
关键词
Data science; Natural language processing; Data driven approches; Corpora; Machine learning;
D O I
10.1016/j.procs.2018.01.101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The focus of data scientists is essentially divided into three areas: collecting data, analyzing data, and inferring information from data. Each one of these tasks requires special personnel, takes time, and costs money. Yet, the next and the fastidious step is how to turn data into products. Therefore, this field grabs the attention of many research groups in academia as well as industry. In the last decades, data-driven approaches came into existence and gained more popularity because they require much less human effort. Natural Language Processing (NLP) is strongly among the fields influenced by data. The growth of data is behind the performance improvement of most NLP applications such as machine translation and automatic speech recognition. Consequently, many NLP applications are frequently moving from rule-based systems and knowledge-based methods to data driven approaches. However, collected data that are based on undefined design criteria or on technically unsuitable forms will be useless. Also, they will be neglected if the size is not enough to perform the required analysis and to infer the accurate information. The chief purpose of this overview is to shed some lights on the vital role of data in various fields and give a better understanding of data in light of NLP. Expressly, it describes what happen to data during its life-cycle: building, processing, analyzing, and exploring phases. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:82 / 91
页数:10
相关论文
共 50 条
  • [1] Arabic natural language processing: An overview
    Guellil, Imane
    Saadane, Houda
    Azouaou, Faical
    Gueni, Billel
    Nouvel, Damien
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (05) : 497 - 507
  • [2] Natural language processing in medicine: An overview
    Spyns, P
    [J]. METHODS OF INFORMATION IN MEDICINE, 1996, 35 (4-5) : 285 - 301
  • [3] An overview of empirical natural language processing
    Brill, E
    Mooney, RJ
    [J]. AI MAGAZINE, 1997, 18 (04) : 13 - 24
  • [4] Data Innovation for International Development: An overview of natural language processing for qualitative data analysis
    Broniecki, Philipp
    Hanchar, Anna
    [J]. 2017 INTERNATIONAL CONFERENCE ON THE FRONTIERS AND ADVANCES IN DATA SCIENCE (FADS), 2017, : 118 - 123
  • [5] Data Science and Natural Language Processing to Extract Information in Clinical Domain
    Vydiswaran, V. G. Vinod
    Zhao, Xinyan
    Yu, Deahan
    [J]. PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 352 - 353
  • [6] An Overview of Natural Language Processing for Indonesian and Malay
    Jiang, Shengyi
    Li, Shanshan
    Fu, Sihui
    Lin, Nankai
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (06): : 530 - 541
  • [7] Data Science and Natural Language Processing to Extract Information from Clinical Narratives
    Vydiswaran, V. G. Vinod
    Zhao, Xinyan
    Yu, Deahan
    [J]. CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 441 - 442
  • [8] Measuring Memberships in Collectives in Light of Developments in Cognitive Science and Natural- Language Processing
    Hannan, Michael T.
    [J]. SOCIOLOGICAL SCIENCE, 2022, 9 : 473 - 492
  • [9] Natural Language Processing and Big Data
    Monti, Johanna
    Monteleone, Mario
    di Buono, Maria Pia
    Marano, Federica
    [J]. 2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 725 - 731
  • [10] Natural Language Processing in Game Studies Research: An Overview
    Zagal, Jose P.
    Tomuro, Noriko
    Shepitsen, Andriy
    [J]. SIMULATION & GAMING, 2012, 43 (03) : 356 - 373