Text mining in different languages

被引:0
|
作者
Lebart, L [1 ]
机构
[1] Ecole Natl Super Telecommun, CNRS, F-75013 Paris, France
来源
关键词
Text Mining; text categorization; language independent methods; discriminant analysis;
D O I
暂无
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and proceed to predictions. The field of application concerns Information Retrieval, processing responses to open-ended questions in sample surveys as well as processing textual corpora of a more general nature. At the intersection of Corpora Linguistics and Exploratory Statistical Analysis, a series of language independent tools and methods can perform most of the previously mentioned tasks, including the assessment and validation of the obtained results, be it visualization or categorization. Multiple confusion matrices calculated on test-samples characterize the quality of the prediction as well as the structure of errors of prediction. In the case of multinational surveys and corpora, they allow us to proceed to comparisons among several countries, in spite of the very heterogeneous character of the basic information (texts in different languages). Copyright (C) 1998 John Wiley & Sons, Ltd.
引用
收藏
页码:323 / 334
页数:12
相关论文
共 50 条
  • [1] DNA AND NATURAL LANGUAGES Text Mining
    Bel-Enguix, Gemma
    Dahl, Veronica
    Dolores Jimenez-Lopez, M.
    KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 140 - 145
  • [2] Detecting the same text in different languages
    Koroutchev, Kostadin
    Cebrian, Manuel
    PROCEEDINGS OF 2006 IEEE INFORMATION THEORY WORKSHOP, 2006, : 337 - +
  • [3] Generation of Original Text with Text Mining and Deep Learning Methods for Turkish and Other Languages
    Dogan, Emre
    Kaya, Buket
    Mungen, Ahmet
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [4] Study of automatic text summarization approaches in different languages
    Yogesh Kumar
    Komalpreet Kaur
    Sukhpreet Kaur
    Artificial Intelligence Review, 2021, 54 : 5897 - 5929
  • [5] Study of automatic text summarization approaches in different languages
    Kumar, Yogesh
    Kaur, Komalpreet
    Kaur, Sukhpreet
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) : 5897 - 5929
  • [6] TEXT MINING-BASED FORMATION OF DICTIONARIES EXPRESSING OPINIONS IN NATURAL LANGUAGES
    Darena, Frantisek
    Zizka, Jan
    MENDEL 2011 - 17TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, 2011, : 374 - 381
  • [7] Introduction: Developing discourse stance in different text types and languages
    Berman, RA
    JOURNAL OF PRAGMATICS, 2005, 37 (02) : 105 - 124
  • [8] Dealing with different languages and old profiles in keystroke analysis of free text
    Gunetti, D
    Picardi, C
    Ruffo, G
    AI*IA2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3673 : 347 - 358
  • [9] Different Data Mining Approaches Based Medical Text Data
    Xiao, Wenke
    Jing, Lijia
    Xu, Yaxin
    Zheng, Shichao
    Gan, Yanxiong
    Wen, Chuanbiao
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [10] Grid-based Support for Different Text Mining Tasks
    Sarnovsky, Martin
    Butka, Peter
    Paralic, Jan
    ACTA POLYTECHNICA HUNGARICA, 2009, 6 (04) : 5 - 27