Natural language processing for social science research: A comprehensive review

Authors
Hou, Yuxin [1, 2]
Huang, Junming [3]
Affiliations
[1] Peking Univ, Ctr Social Res, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Educ, Beijing, Peoples R China
[3] Princeton Univ, Paul & Marcia Wythes Ctr Contemporary China, Princeton, NJ 08544 USA
Keywords
Big data/data science; language/linguistics; quantitative methods; natural language processing; text analysis; neural network; topic model; COMPUTERIZED TEXT ANALYSIS; MEDIA; CULTURE; TWITTER; CLASSIFICATION; COMMUNICATION; SENTIMENT; MICROBLOGS; CAMPAIGNS; FACEBOOK;
DOI
10.1177/2057150X241306780
Chinese Library Classification number
C91 [Sociology]
Discipline classification codes
030301; 1204
Abstract
Text data has long been a pivotal source for social science research, providing an informative lens across disciplines including sociology, psychology, and political science. Its salient role in research, combined with the difficulty of numerically digesting unstructured natural-language data, has driven growing demand for natural language processing techniques that extract meaningful insights from vast text data. Breakthrough advances in natural language processing have emerged with the recent expansion in data availability and computational resources, calling for an up-to-date, comprehensive review of these methodologies and their applications in social science research. This article reviews natural language processing techniques, detailing the procedure from representing unstructured text data to distilling semantic information, using expertise-based algorithms and unsupervised and supervised machine-learning methods. We then introduce their typical applications in producing research outcomes for sociology and political science. Keeping in mind challenges in data representativeness, interpretability, and bias, this review encourages the responsible and effective use of natural language processing techniques in social science research to improve the quantitative understanding of emerging text data.
Pages: 37