High-Precision Extraction of Emerging Concepts from Scientific Literature

Cited by: 5
Authors
King, Daniel [1]
Downey, Doug [1]
Weld, Daniel S. [1, 2]
Affiliations
[1] Allen Inst AI, Seattle, WA 98103 USA
[2] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA USA
Funding
U.S. National Science Foundation
Keywords
Concept extraction; scientific literature; citation graph
DOI
10.1145/3397271.3401235
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Identification of new concepts in scientific literature can help power faceted search, scientific trend analysis, knowledge-base construction, and more, but current methods are lacking. Manual identification can't keep up with the torrent of new publications, while the precision of existing automatic techniques is too low for many applications. We present an unsupervised concept extraction method for scientific literature that achieves much higher precision than previous work. Our approach relies on a simple but novel intuition: each scientific concept is likely to be introduced or popularized by a single paper that is disproportionately cited by subsequent papers mentioning the concept. From a corpus of computer science papers on arXiv, we find that our method achieves a Precision@1000 of 99%, compared to 86% for prior work, and a substantially better precision-yield trade-off across the top 15,000 extractions. To stimulate research in this area, we release our code and data.
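The intuition in the abstract can be illustrated with a small sketch. This is not the authors' released code: the function names, the toy corpus layout (a dict of paper id to text and cited ids), and the substring-based phrase matching are all assumptions made for illustration. For a candidate phrase, it finds the single paper most often cited by papers mentioning the phrase and reports what share of those papers cite it. A second helper shows how a Precision@k figure such as the one quoted above is conventionally computed.

```python
from collections import Counter


def citation_concentration(phrase, papers):
    """Return (most_cited_id, share) among papers whose text mentions `phrase`.

    `papers` is a hypothetical structure: paper id -> {"text": str, "cites": set}.
    `share` is the fraction of mentioning papers that cite the top paper.
    """
    needle = phrase.lower()
    mentioning = [p for p in papers.values() if needle in p["text"].lower()]
    if not mentioning:
        return None, 0.0
    # Count how many mentioning papers cite each paper.
    counts = Counter(cited for p in mentioning for cited in p["cites"])
    if not counts:
        return None, 0.0
    top_id, top_count = counts.most_common(1)[0]
    return top_id, top_count / len(mentioning)


def precision_at_k(ranked, correct, k):
    """Fraction of the top-k ranked extractions judged correct."""
    top = ranked[:k]
    if not top:
        return 0.0
    return sum(1 for item in top if item in correct) / len(top)


# Toy corpus (invented for illustration only).
papers = {
    "A": {"text": "We propose the widget transformer.", "cites": set()},
    "B": {"text": "We extend the widget transformer.", "cites": {"A"}},
    "C": {"text": "Widget transformer baselines.", "cites": {"A", "X"}},
    "D": {"text": "A study of neural networks.", "cites": {"X"}},
    "E": {"text": "Neural networks for vision.", "cites": {"Y"}},
    "F": {"text": "Neural networks again.", "cites": {"Z"}},
}

concept = citation_concentration("widget transformer", papers)
generic = citation_concentration("neural networks", papers)
```

In this toy corpus, citations from papers mentioning "widget transformer" concentrate on paper A, while citations from papers mentioning the generic phrase "neural networks" are spread across several papers, so the first phrase receives the higher share. A real system would also need candidate phrase generation, normalization, and a ranking threshold, none of which is specified in this record.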
Pages: 1549-1552
Page count: 4