N-Gram Based Paraphrase Generator from Large text Document

Cited by: 0
Authors
Gadag, Ashwini I. [1]
Sagar, B. M. [1]
Affiliation
[1] RVCE, Dept ISE, Bengaluru, Karnataka, India
Keywords
N-gram; candidate paraphrase; reference paraphrase; paraphrase generator
DOI
Not available
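Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809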
Abstract
This paper describes paraphrase generation based on an n-gram approach. N-grams are sequences of relevant words from a text document that can be applied in a range of Natural Language Processing (NLP) applications. Candidate paraphrases are generated using a trigram approach. The reference paraphrases (keyphrases) are the set of relevant paraphrases, which act as a training data set for generating candidate paraphrases. The task of paraphrase generation is similar to machine translation; hence we used machine translation evaluation metrics. The R-precision evaluation metric is used to find the number of common words between candidate and reference paraphrases.
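To make the pipeline in the abstract concrete, the following is a minimal sketch of trigram-based candidate extraction and R-precision scoring. It is an illustration and not the authors' implementation: the function names (`word_trigrams`, `top_candidates`, `r_precision`), the whitespace tokenization, and the frequency-based ranking are all assumptions. The abstract does not give a formula for R-precision, so the sketch uses the common exact-match definition (matches among the top R candidates, divided by R, where R is the number of reference phrases).

```python
from collections import Counter


def word_trigrams(text):
    """Return all word trigrams in `text` (lowercased, whitespace-tokenized)."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)]


def top_candidates(text, k):
    """Rank trigrams by frequency and return the k most frequent ones.

    Frequency ranking is an assumption made for this sketch; the abstract
    does not say how candidates are selected.
    """
    counts = Counter(word_trigrams(text))
    return [gram for gram, _ in counts.most_common(k)]


def r_precision(candidates, references):
    """Exact-match R-precision: share of the top-R candidates that also
    appear in the references, where R = len(references)."""
    r = len(references)
    if r == 0:
        return 0.0
    reference_set = set(references)
    hits = sum(1 for cand in candidates[:r] if cand in reference_set)
    return hits / r
```

A call such as `r_precision(top_candidates(document, 10), reference_trigrams)` would then score a document's candidates against a hand-built reference list; both `document` and `reference_trigrams` are placeholders here.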
Pages: 91-94
Page count: 4