The linguistic patterns and rhetorical structure of citation context: an approach using n-grams

被引:39
|
作者
Bertin, Marc [1 ]
Atanassova, Iana [2 ]
Sugimoto, Cassidy R. [3 ]
Lariviere, Vincent [4 ,5 ]
机构
[1] Univ Quebec, CIRST, Succ Ctr Ville, CP 8888, Montreal, PQ H3C 3P8, Canada
[2] Univ Bourgogne Franche Comte, Ctr Rech Linguist & Traitement Automat Langues Lu, F-25000 Besancon, France
[3] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
[4] Univ Montreal, Ecole Bibliothecon & Sci Informat, Succ Ctr Ville, CP 6128, Montreal, PQ H3C 3J7, Canada
[5] Univ Quebec, CIRST, OST, Succ Ctr Ville, CP 8888, Montreal, PQ H3C 3P8, Canada
关键词
Content citation analysis; IMRaD; Discursive patterns; Citation function; Rhetorical structure; n-grams; SCIENTIFIC DISCOVERY; REFERENCES; DISCOURSE; SOCIOLOGY; SCIENCE; CHAPTER;
D O I
10.1007/s11192-016-2134-8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Using the full-text corpus of more than 75,000 research articles published by seven PLOS journals, this paper proposes a natural language processing approach for identifying the function of citations. Citation contexts are assigned based on the frequency of n-gram co-occurrences located near the citations. Results show that the most frequent linguistic patterns found in the citation contexts of papers vary according to their location in the IMRaD structure of scientific articles. The presence of negative citations is also dependent on this structure. This methodology offers new perspectives to locate these discursive forms according to the rhetorical structure of scientific articles, and will lead to a better understanding of the use of citations in scientific articles.
引用
收藏
页码:1417 / 1434
页数:18
相关论文
共 50 条
  • [1] The linguistic patterns and rhetorical structure of citation context: an approach using n-grams
    Marc Bertin
    Iana Atanassova
    Cassidy R. Sugimoto
    Vincent Lariviere
    [J]. Scientometrics, 2016, 109 : 1417 - 1434
  • [2] Using word n-grams to identify authors and idiolects A corpus approach to a forensic linguistic problem
    Wright, David
    [J]. INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2017, 22 (02) : 212 - 241
  • [3] Language Distance using Common N-Grams Approach
    Kosmajac, Dijana
    Keselj, Vlado
    [J]. 2020 19TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2020,
  • [4] Understanding the Linguistic Characteristics of Network Signaling for the "Internet of Things" Using n-Grams
    Emmons, Stephen P.
    Kamangar, Farhad
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 219 - 227
  • [5] A first approach to CLIR using character n-grams alignment
    Vilares, Jesus
    Oakes, Michael P.
    Tait, John I.
    [J]. EVALUATION OF MULTILINGUAL AND MULTI-MODAL INFORMATION RETRIEVAL, 2007, 4730 : 111 - +
  • [6] Modeling documents for structure recognition using generalized N-grams
    Brugger, R
    Zramdini, A
    Ingold, R
    [J]. PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 56 - 60
  • [7] Automated citation sentiment analysis using high order n-grams: a preliminary investigation
    Ikram, Muhammad Touseef
    Afzal, Muhammad Tanvir
    Butt, Naveed Anwer
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2018, 26 (04) : 1922 - 1932
  • [8] Linguistic segmentation of tuples for the modeling of stochastic translation by n-grams
    de Gispert, Adria
    Marino, Jose B.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 241 - 248
  • [9] SPEECH RECOGNITION USING FUNCTION-WORD N-GRAMS AND CONTENT-WORD N-GRAMS
    ISOTANI, R
    MATSUNAGA, S
    SAGAYAMA, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 692 - 697
  • [10] Plagiarism Detection Using Stopword n-grams
    Stamatatos, Efstathios
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (12): : 2512 - 2527