Data science in light of natural language processing: An overview

被引:13
|
作者
Zeroual, Imad [1 ]
Lakhouaja, Abdelhak [1 ]
机构
[1] Mohamed First Univ, Fac Sci, Av Med 6 BP 717, Oujda 60000, Morocco
关键词
Data science; Natural language processing; Data driven approches; Corpora; Machine learning;
D O I
10.1016/j.procs.2018.01.101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The focus of data scientists is essentially divided into three areas: collecting data, analyzing data, and inferring information from data. Each one of these tasks requires special personnel, takes time, and costs money. Yet, the next and the fastidious step is how to turn data into products. Therefore, this field grabs the attention of many research groups in academia as well as industry. In the last decades, data-driven approaches came into existence and gained more popularity because they require much less human effort. Natural Language Processing (NLP) is strongly among the fields influenced by data. The growth of data is behind the performance improvement of most NLP applications such as machine translation and automatic speech recognition. Consequently, many NLP applications are frequently moving from rule-based systems and knowledge-based methods to data driven approaches. However, collected data that are based on undefined design criteria or on technically unsuitable forms will be useless. Also, they will be neglected if the size is not enough to perform the required analysis and to infer the accurate information. The chief purpose of this overview is to shed some lights on the vital role of data in various fields and give a better understanding of data in light of NLP. Expressly, it describes what happen to data during its life-cycle: building, processing, analyzing, and exploring phases. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:82 / 91
页数:10
相关论文
共 50 条
  • [1] Arabic natural language processing: An overview
    Guellil, Imane
    Saadane, Houda
    Azouaou, Faical
    Gueni, Billel
    Nouvel, Damien
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (05) : 497 - 507
  • [2] An overview of empirical natural language processing
    Brill, E
    Mooney, RJ
    AI MAGAZINE, 1997, 18 (04) : 13 - 24
  • [3] Natural language processing in medicine: An overview
    Spyns, P
    METHODS OF INFORMATION IN MEDICINE, 1996, 35 (4-5) : 285 - 301
  • [4] Data Innovation for International Development: An overview of natural language processing for qualitative data analysis
    Broniecki, Philipp
    Hanchar, Anna
    2017 INTERNATIONAL CONFERENCE ON THE FRONTIERS AND ADVANCES IN DATA SCIENCE (FADS), 2017, : 118 - 123
  • [5] Using natural language processing to analyse text data in behavioural science
    Stefan Feuerriegel
    Abdurahman Maarouf
    Dominik Bär
    Dominique Geissler
    Jonas Schweisthal
    Nicolas Pröllochs
    Claire E. Robertson
    Steve Rathje
    Jochen Hartmann
    Saif M. Mohammad
    Oded Netzer
    Alexandra A. Siegel
    Barbara Plank
    Jay J. Van Bavel
    Nature Reviews Psychology, 2025, 4 (2): : 96 - 111
  • [6] Data Science and Natural Language Processing to Extract Information in Clinical Domain
    Vydiswaran, V. G. Vinod
    Zhao, Xinyan
    Yu, Deahan
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 352 - 353
  • [7] An Overview of Natural Language Processing for Indonesian and Malay
    Jiang S.
    Li S.
    Fu S.
    Lin N.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (06): : 530 - 541
  • [8] Data Science and Natural Language Processing to Extract Information from Clinical Narratives
    Vydiswaran, V. G. Vinod
    Zhao, Xinyan
    Yu, Deahan
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 441 - 442
  • [9] Measuring Memberships in Collectives in Light of Developments in Cognitive Science and Natural- Language Processing
    Hannan, Michael T.
    SOCIOLOGICAL SCIENCE, 2022, 9 : 473 - 492
  • [10] Natural language processing and cognitive science: Foreword
    Sharp, Bernadette
    Zock, Michael
    Natural Language Processing and Cognitive Science - Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - NLPCS 2009 In Conjunction with ICEIS 2009, 2009,