An analysis of the relationship between cohesion and clause combination in English discourse employing NLP and data mining approaches

被引:1
|
作者
Green, Clarence [1 ]
机构
[1] Univ Melbourne, Sch Languages & Linguist, Melbourne, Vic 3010, Australia
关键词
D O I
10.1093/llc/fqu012
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This study examines the relationship between the frequencies of clause combination and the distribution of discourse-pragmatic markers of cohesion in a sub-sample of the Susanne corpus. It addresses the theory that clause grammar constitutes a form of grammar-cued discourse coherence which functions as an integrated system with other methods of managing coherence in language. Evidence is sought for whether increased clause density in a corpus correlates with a reduction in explicit cohesive devices. To address this, a computational approach is outlined for the coding of cohesion in a corpus, using a semi-automated data mining procedure. To validate this approach, it is compared with cohesion measures on the same data using the NLP tool Coh-Metrix 3.0. The two approaches are shown to positively correlate on a series of measures, suggesting they significantly overlap in quantifying the cohesion construct. The final analysis of the tagged corpus indicates that as frequencies of clause combination increase in a text, the use of explicit lexical cohesive devices decrease. Also, higher frequencies of clause combination positively correlate with an increased use of grammatical cohesive devices. Findings are interpreted as generally aligning with the expectations of the theoretical framework known as the Adaptive Approach to Grammar.
引用
收藏
页码:326 / 343
页数:18
相关论文
共 28 条
  • [1] On the relationship between clause combination, grammatical hierarchy and discourse-pragmatic coherence
    Green, Clarence
    FUNCTIONS OF LANGUAGE, 2014, 21 (03) : 297 - 332
  • [2] The Causal Relationship Between Volunteering and Social Cohesion: A Large Scale Analysis of Secondary Longitudinal Data
    Davies, Ben
    Abrams, Dominic
    Horsham, Zoe
    Lalot, Fanny
    SOCIAL INDICATORS RESEARCH, 2024, 171 (03) : 809 - 825
  • [3] The Causal Relationship Between Volunteering and Social Cohesion: A Large Scale Analysis of Secondary Longitudinal Data
    Ben Davies
    Dominic Abrams
    Zoe Horsham
    Fanny Lalot
    Social Indicators Research, 2024, 171 : 809 - 825
  • [4] Analysis of the Relationship between Sleep Quality and Impact Indicators Based on Data Mining
    Sun, Yue
    Liu, Yongdong
    Yan, Jiechen
    Hu, Beibei
    Dong, Xianlei
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 303 - 306
  • [5] Data mining analysis for relationship between family support of nursing personnel and job
    Ren, Y., 1600, CESER Publications, Post Box No. 113, Roorkee, 247667, India (46):
  • [6] Emotional analysis of evaluation discourse in business English translation based on language big data mining of public health environment
    Liu, Song
    Chen, Yukun
    Xu, Kunpei
    Lin, Jiaxin
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [7] APPROACHES TO ANALYSIS OF DIETARY DATA - RELATIONSHIP BETWEEN PLANNED ANALYSES AND CHOICE OF METHODOLOGY
    BEATON, GH
    AMERICAN JOURNAL OF CLINICAL NUTRITION, 1994, 59 (01): : 253S - 261S
  • [8] Text Complexity Classification Data Mining Model Based on Dynamic Quantitative Relationship between Modality and English Context
    Zhang, Dan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [9] Mining the relationship between production and customer service data for failure analysis of industrial products
    Kang, Seokho
    Kim, Eunji
    Shim, Jaewoong
    Cho, Sungzoon
    Chang, Wonsang
    Kim, Junhwan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 106 : 137 - 146
  • [10] The Relationship between Mental Health and Poverty Assistance: An Empirical Analysis Based on Multivariate Data Mining
    Wang, Huan
    Zhang, Shuai
    Wang, Zhe
    Liu, Yang
    Peng, Yong
    Li, Qing
    Li, Jingwen
    INTERNATIONAL JOURNAL OF PSYCHIATRY IN MEDICINE, 2025, 60 (2_SUPPL): : 16S - 17S