State-of-the-art Text Linguistics: Corpus-Analysis Tools. A Practical Demonstration

Author
Postolea, Sorina [1 ,2 ]
Affiliations
[1] Alexandru Ioan Cuza Univ, Iasi, Romania
[2] Petre Andrei Univ Iasi, Iasi, Romania
Source
PHILOLOGICA JASSYENSIA | 2014 / Vol. 10 / Issue 01
Keywords
corpus linguistics; corpus-based analysis; corpus-analysis tools; lexicography; terminology
DOI
Not available
Chinese Library Classification
H [Language, Writing]
Discipline classification code
05
Abstract
In recent years, electronic corpora and the computer programs specifically designed for their analysis have been extensively used for various types of text analyses and in a wide array of applied language-related studies. Small-sized to mega-sized digitalized collections of texts and corpus-analysis tools are used nowadays to support research in such fields as general linguistics, lexicography, grammar studies, terminology, translation studies, or literary studies. Corpus linguistics, the discipline that deals with corpora and corpus tools, has developed exponentially in the Western world, to the point that most language-related studies are nowadays based on its principles and tenets. Yet, because the development of corpus-analysis tools specifically designed to support the peculiarities of Romanian as a language would require insight from interdisciplinary teams of researchers, i.e. at least from the fields of linguistics and natural language processing, corpus linguistics is still a tentative branch of research in Romania. Based on a corpus of English news articles that approach information and communication technology topics, this contribution aims to provide a practical demonstration of how the main types of corpus-analysis tools that are now available to Western researchers may be used to explore a collection of texts.
Pages: 51-59
Number of pages: 9