An analysis of the relationship between cohesion and clause combination in English discourse employing NLP and data mining approaches

Cited by: 1
Author
Green, Clarence [1]
Affiliation
[1] Univ Melbourne, Sch Languages & Linguist, Melbourne, Vic 3010, Australia
DOI
10.1093/llc/fqu012
Chinese Library Classification (CLC) number
C [Social Sciences: General]
Subject classification codes
03; 0303
Abstract
This study examines the relationship between the frequencies of clause combination and the distribution of discourse-pragmatic markers of cohesion in a sub-sample of the Susanne corpus. It addresses the theory that clause grammar constitutes a form of grammar-cued discourse coherence which functions as an integrated system with other methods of managing coherence in language. Evidence is sought for whether increased clause density in a corpus correlates with a reduction in explicit cohesive devices. To address this, a computational approach is outlined for the coding of cohesion in a corpus, using a semi-automated data mining procedure. To validate this approach, it is compared with cohesion measures on the same data using the NLP tool Coh-Metrix 3.0. The two approaches are shown to correlate positively on a series of measures, suggesting that they overlap significantly in quantifying the cohesion construct. The final analysis of the tagged corpus indicates that as frequencies of clause combination increase in a text, the use of explicit lexical cohesive devices decreases. In addition, higher frequencies of clause combination correlate positively with an increased use of grammatical cohesive devices. Findings are interpreted as generally aligning with the expectations of the theoretical framework known as the Adaptive Approach to Grammar.
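As a rough illustration of the kind of analysis the abstract describes, the sketch below normalises raw counts to a per-1,000-word rate and computes a correlation between two per-text measures. The abstract does not say which correlation statistic, normalisation, or tooling the study used, so the choice of Pearson's r, the function names `per_thousand` and `pearson_r`, and the toy numbers are all assumptions made for illustration; none of the values are data from the paper.

```python
from math import sqrt


def per_thousand(count, n_words):
    """Normalise a raw count to a rate per 1,000 words (assumed normalisation)."""
    if n_words <= 0:
        raise ValueError("n_words must be positive")
    return 1000.0 * count / n_words


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (dev_x * dev_y)


# Hypothetical toy values, one entry per text (NOT data from the study).
word_counts = [2000, 2000, 2000, 2000]
clause_combinations = [20, 40, 60, 80]   # raw counts of combined clauses
lexical_devices = [16, 12, 8, 4]         # raw counts of lexical cohesive devices

clause_rate = [per_thousand(c, w) for c, w in zip(clause_combinations, word_counts)]
lexical_rate = [per_thousand(c, w) for c, w in zip(lexical_devices, word_counts)]

# A negative r here would correspond to the pattern the abstract reports:
# more clause combination, fewer explicit lexical cohesive devices.
r = pearson_r(clause_rate, lexical_rate)
print(f"r = {r:.3f}")
```

The same `pearson_r` helper could be applied to two sets of cohesion scores for the same texts, for example to compare a hand-coded measure against a tool-derived one, which is the shape of the validation step the abstract mentions.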
Pages: 326-343
Number of pages: 18