An Automatic Method for Extracting Citations From Google Books

被引:37
|
作者
Kousha, Kayvan [1 ]
Thelwall, Mike [1 ]
机构
[1] Wolverhampton Univ, Sch Technol, Stat Cybermetr Res Grp, Wolverhampton WV1 1LY, W Midlands, England
关键词
citation analysis; experiments; SOCIAL-SCIENCES; HUMANITIES; MONOGRAPHS; CHAPTERS; IMPACT; OUTPUT;
D O I
10.1002/asi.23170
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent studies have shown that counting citations from books can help scholarly impact assessment and that Google Books (GB) is a useful source of such citation counts, despite its lack of a public citation index. Searching GB for citations produces approximate matches, however, and so its raw results need time-consuming human filtering. In response, this article introduces a method to automatically remove false and irrelevant matches from GB citation searches in addition to introducing refinements to a previous GB manual citation extraction method. The method was evaluated by manual checking of sampled GB results and comparing citations to about 14,500 monographs in the Thomson Reuters Book Citation Index (BKCI) against automatically extracted citations from GB across 24 subject areas. GB citations were 103% to 137% as numerous as BKCI citations in the humanities, except for tourism (72%) and linguistics (91%), 46% to 85% in social sciences, but only 8% to 53% in the sciences. In all cases, however, GB had substantially more citing books than did BKCI, with BKCI's results coming predominantly from journal articles. Moderate correlations between the GB and BKCI citation counts in social sciences and humanities, with most BKCI results coming from journal articles rather than books, suggests that they could measure the different aspects of impact, however.
引用
收藏
页码:309 / 320
页数:12
相关论文
共 50 条
  • [31] THE ELEPHANTINE GOOGLE BOOKS SETTLEMENT
    Grimmelmann, James
    [J]. JOURNAL OF THE COPYRIGHT SOCIETY OF THE USA, 2011, 58 (03): : 497 - 520
  • [32] Google & the Future of Books: An Exchange
    Lewis, Anthony
    [J]. NEW YORK REVIEW OF BOOKS, 2010, 57 (01) : 64 - 64
  • [33] Google & Books: An Exchange Reply
    不详
    [J]. NEW YORK REVIEW OF BOOKS, 2009, 56 (05) : 49 - 50
  • [34] A method for automatic analysis Table of Contents in Chinese books
    Chen, Jing
    Lu, Quan
    [J]. LIBRARY HI TECH, 2015, 33 (03) : 424 - 438
  • [35] An Automatic Product Features Extracting Method in Chinese Customer Reviews
    Yu, Zhenzhi
    Zheng, Ning
    Xu, Ming
    [J]. 2012 7TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2012, : 455 - 459
  • [36] Automatic method for extracting and classifying defect in optical photomask images
    Ha, Youngmin
    Jeong, Hong
    [J]. MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 710 - +
  • [37] Boyle's Books: the evidence of his citations
    Knight, David
    [J]. SEVENTEENTH CENTURY, 2013, 28 (01): : 104 - 106
  • [38] AN AUTOMATIC METHOD FOR EXTRACTING SIGNIFICANT PHRASES IN SCIENTIFIC OR TECHNICAL DOCUMENTS
    MAEDA, T
    MOMOUCHI, Y
    SAWAMURA, H
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1980, 16 (03) : 119 - 127
  • [39] An Automatic Method for Extracting Innovative Ideas Based on the Scopus® Database
    Chen, Lielei
    Fang, Hui
    [J]. KNOWLEDGE ORGANIZATION, 2019, 46 (03): : 171 - 186
  • [40] Method for automatic extracting process models from criminal case records with business process model
    Zhang, Yuan
    Zou, Wentao
    Yuan, Hao
    Li, Chuanyi
    Ge, Jidong
    Luo, Bin
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (08): : 2968 - 2980