Text mining in different languages

Authors
Lebart, L [1]
Affiliation
[1] Ecole Natl Super Telecommun, CNRS, F-75013 Paris, France
Source
Keywords
Text Mining; text categorization; language-independent methods; discriminant analysis
DOI
Not available
Chinese Library Classification
C93 [Management Science]; O22 [Operations Research]
Discipline classification codes
070105; 12; 1201; 1202; 120202
Abstract
The purpose of Text Mining is to describe and explore textual data, to uncover structural traits, and to make predictions. Its fields of application include Information Retrieval, the processing of responses to open-ended questions in sample surveys, and the processing of textual corpora of a more general nature. At the intersection of Corpus Linguistics and Exploratory Statistical Analysis, a series of language-independent tools and methods can perform most of these tasks, including the assessment and validation of the results obtained, whether visualizations or categorizations. Multiple confusion matrices calculated on test samples characterize both the quality of the prediction and the structure of the prediction errors. In the case of multinational surveys and corpora, they allow comparisons among several countries, despite the very heterogeneous character of the basic information (texts in different languages). Copyright (C) 1998 John Wiley & Sons, Ltd.
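As a rough illustration of the validation step mentioned in the abstract, the sketch below builds one confusion matrix per test sample and reads off both the overall quality (the diagonal) and the structure of the errors (the off-diagonal cells). It is not the paper's procedure: the category names, the sample names, and the helper functions `confusion_matrix` and `accuracy` are hypothetical, and the labels are invented toy data.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels, categories):
    """Rows are true categories, columns are predicted categories."""
    counts = Counter(zip(true_labels, predicted_labels))
    return [[counts[(t, p)] for p in categories] for t in categories]


def accuracy(matrix):
    """Share of test documents on the diagonal (correctly categorized)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


# Hypothetical toy data: (true categories, predicted categories) per test sample.
categories = ["A", "B", "C"]
test_samples = {
    "sample_1": (["A", "A", "B", "C", "C"], ["A", "B", "B", "C", "A"]),
    "sample_2": (["A", "B", "B", "C", "C"], ["A", "B", "C", "C", "C"]),
}

# One matrix per test sample, so error patterns can be compared across samples.
matrices = {
    name: confusion_matrix(true, pred, categories)
    for name, (true, pred) in test_samples.items()
}

for name, matrix in matrices.items():
    print(name, matrix, accuracy(matrix))
```

Because the matrices depend only on category labels, the same comparison could in principle be run on samples whose texts are in different languages, provided the categories are shared.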
Pages: 323-334
Number of pages: 12