Language independent statistical software for Corpus exploration

被引:3
|
作者
Sinclair, J [1 ]
Mason, O [1 ]
Ball, J [1 ]
Barnbrook, G [1 ]
机构
[1] Univ Birmingham, Sch English, Corpus Res, Birmingham B15 2TT, W Midlands, England
来源
COMPUTERS AND THE HUMANITIES | 1997年 / 31卷 / 03期
关键词
collocation; concordance lines; language independent software; lexical statistics;
D O I
10.1023/A:1000911520943
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this report two programs for statistical analysis of concordance lines are described. The programs have been developed for analysing the lexical context of a given word. It is shown how different parameter settings influence the outcome of collocational analysis, and how the concept of collocation can be extended to allow the extraction of lines typical for a word from a set of concordance lines. Even though all the examples are for English, the software is completely language independent and only requires minimal linguistic resources.
引用
收藏
页码:229 / 255
页数:27
相关论文
共 50 条
  • [1] Language Independent Statistical Software for Corpus Exploration
    John Sinclair
    Oliver Mason
    Jackie Ball
    Geoff Barnbrook
    Computers and the Humanities, 1997, 31 : 229 - 255
  • [2] CORPUS SOFTWARE IN EFL TEACHING: EXAMINATION OF LANGUAGE EXPOSURE
    Kudryashova, A., V
    Rozanova, Ya, V
    Sidorenko, T., V
    OBRAZOVANIE I NAUKA-EDUCATION AND SCIENCE, 2020, 22 (04): : 131 - 145
  • [3] The Influence of Corpus Quality on Statistical Measurements on Language Resources
    Eckart, Thomas
    Quasthoff, Uwe
    Goldhahn, Dirk
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2318 - 2321
  • [4] Statistical Corpus and Language Comparison using Comparable Corpora
    Eckart, Thomas
    Quasthoff, Uwe
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 15 - 20
  • [5] A Corpus-Driven Exploration of Language Use in Religious Discourse
    Nofal, Mohammed
    JOURNAL OF RESEARCH IN APPLIED LINGUISTICS, 2023, 14 (01): : 41 - 60
  • [6] IS DESCRIBING LANGUAGE MERE BUTTERFLY COLLECTION? ON EPISTEMOLOGY, STATISTICAL LANGUAGE MODELS, AND CORPUS
    de Uzeda-Garrao, Milena
    12TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI2019), 2019, : 10900 - 10903
  • [7] A language-independent software renovation framework
    Di Penta, M
    Neteler, M
    Antoniol, G
    Merlo, E
    JOURNAL OF SYSTEMS AND SOFTWARE, 2005, 77 (03) : 225 - 240
  • [8] Language Independent Statistical Approach for Extracting Keywords
    Rahaman, Md. Mahfitzur
    Amin, Md. Ruhul
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 205 - 210
  • [9] Statistical Analysis of Multilingual Text Corpus and Development of Language Models
    Agrawal, Shyam S.
    Bansal, Abhimanue Shweta
    Mahajan, Minakshi
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2436 - 2440
  • [10] Statistical Analysis of Polish Language Corpus for Speech Recognition Application
    Klosowski, Piotr
    2016 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2016, : 304 - 309