Study on Unknown Term Translation Mining from Google Snippets

Cited by: 2
Authors
Li, Bin [1]
Yao, Jianmin [2]
Affiliations
[1] Anhui Open Univ, Sch Informat Engn, Hefei 230041, Anhui, Peoples R China
[2] Soochow Univ, Prov Key Lab Comp Informat Proc Technol, Suzhou 215006, Peoples R China
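Funding
National Natural Science Foundation of China
Keywords
unknown term; translation mining; web mining; Google snippets
DOI
10.3390/info10090267
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract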
Bilingual web pages are widely used to mine translations of unknown terms. This study focused on three parts of that task: obtaining relevant web pages, extracting translations with correct lexical boundaries, and ranking the translation candidates. Co-occurrence information was used to identify subject terms, and the source query was then expanded with the translations of those subject terms to collect effective bilingual search engine snippets. Valid candidates were then extracted from the resulting small, noisy bilingual corpora using an improved frequency change measurement that incorporates adjacent information. To select an appropriate translation, the method considers surface patterns, frequency-distance, and phonetic features. The experimental results indicated that the proposed method performed well for mining translations of unknown terms.
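The ranking step can be illustrated with a minimal sketch of a frequency-distance score. This is not the authors' formula: the weighting `1 / (1 + distance)`, the function names `candidate_scores` and `rank`, the n-gram length limits, and the sample snippets are all assumptions made for illustration. The sketch also ignores surface patterns, phonetic features, and the frequency change measurement used for lexical boundaries.

```python
import re
from collections import defaultdict

# Runs of CJK Unified Ideographs (basic block only; an assumption for this sketch).
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")


def candidate_scores(term, snippets, max_len=4):
    """Score each Chinese n-gram by summing 1 / (1 + distance) over occurrences.

    `distance` is the character gap between the n-gram and the first
    occurrence of the source term in the same snippet.
    """
    scores = defaultdict(float)
    term_lower = term.lower()
    for snippet in snippets:
        pos = snippet.lower().find(term_lower)
        if pos < 0:
            continue  # snippet does not mention the source term
        term_end = pos + len(term)
        for run in CJK_RUN.finditer(snippet):
            text, start = run.group(), run.start()
            for n in range(2, max_len + 1):
                for i in range(len(text) - n + 1):
                    gram_start = start + i
                    gram_end = gram_start + n
                    if gram_end <= pos:        # n-gram precedes the term
                        dist = pos - gram_end
                    elif gram_start >= term_end:  # n-gram follows the term
                        dist = gram_start - term_end
                    else:
                        dist = 0
                    scores[text[i:i + n]] += 1.0 / (1.0 + dist)
    return dict(scores)


def rank(term, snippets, top=3):
    """Return the `top` candidates, highest score first (ties broken by text)."""
    scores = candidate_scores(term, snippets)
    return sorted(scores, key=lambda c: (-scores[c], c))[:top]


# Hypothetical snippets, invented for this example.
snippets = [
    "机器翻译 (machine translation) 是一个研究方向",
    "machine translation 机器翻译 系统",
    "关于 machine translation 即 机器翻译",
]
print(rank("machine translation", snippets))
```

Candidates that occur often and close to the source term accumulate the highest scores. Substrings of the correct translation still score highly here, which is the lexical-boundary problem the abstract attributes to the candidate extraction step.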
Pages: 12