State-of-the-art Text Linguistics: Corpus-Analysis Tools. A Practical Demonstration

Cited by: 0
Author
Postolea, Sorina [1, 2]
Affiliations
[1] Alexandru Ioan Cuza Univ, Iasi, Romania
[2] Petre Andrei Univ Iasi, Iasi, Romania
Source
PHILOLOGICA JASSYENSIA | 2014, Vol. 10, No. 1
Keywords
corpus linguistics; corpus-based analysis; corpus-analysis tools; lexicography; terminology
DOI
Not available
Chinese Library Classification
H [Language and Writing]
Discipline classification code
05
Abstract
In recent years, electronic corpora and the computer programs specifically designed for their analysis have been extensively used for various types of text analyses and in a wide array of applied language-related studies. Small-sized to mega-sized digitalized collections of texts and corpus-analysis tools are used nowadays to support research in such fields as general linguistics, lexicography, grammar studies, terminology, translation studies, or literary studies. Corpus linguistics, the discipline that deals with corpora and corpus tools, has developed exponentially in the Western world, to the point that most language-related studies are nowadays based on its principles and tenets. Yet, because the development of corpus-analysis tools specifically designed to support the peculiarities of Romanian as a language would require insight from interdisciplinary teams of researchers, i.e. at least from the fields of linguistics and natural language processing, corpus linguistics is still a tentative branch of research in Romania. Based on a corpus of English news articles that approach information and communication technology topics, this contribution aims to provide a practical demonstration of how the main types of corpus-analysis tools that are now available to Western researchers may be used to explore a collection of texts.
Pages: 51-59
Page count: 9