Searching for Evidence of Scientific News in Scholarly Big Data

被引:1
|
作者
Ul Hoque, Md Reshad [1 ]
Bradley, Dash [2 ]
Kwan, Chiman [3 ]
Chiatti, Agnese [4 ]
Li, Jiang [1 ]
Wu, Jian [2 ]
机构
[1] Old Dominion Univ, Elect & Comp Engn, Norfolk, VA 23529 USA
[2] Old Dominion Univ, Comp Sci, Norfolk, VA USA
[3] Appl Res LLC, Rockville, MD USA
[4] Open Univ, Knowledge Media Inst, Milton Keynes, Bucks, England
关键词
Fake news; Domain knowledge entity; Web API; Embedding;
D O I
10.1145/3360901.3364438
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Public digital media can often mix factual information with fake scientific news, which is typically difficult to pinpoint, especially for non-professionals. These scientific news articles create illusions and misconceptions, thus ultimately influence the public opinion, with serious consequences at a broader social scale. Yet, existing solutions aiming at automatically verifying the credibility of news articles are still unsatisfactory. We propose to verify scientific news by retrieving and analyzing its most relevant source papers from an academic digital library (DL), e.g., arXiv. Instead of querying keywords or regular named entities extracted from news articles, we query domain knowledge entities (DKEs) extracted from the text. By querying each DKE, we retrieve a list of candidate scholarly papers. We then design a function to rank them and select the most relevant scholarly paper. After exploring various representations, experiments indicate that the term frequency-inverse document frequency (TF-IDF) representation with cosine similarity outperforms baseline models based on word embedding. This result demonstrates the efficacy of using DKEs to retrieve scientific papers which are relevant to a specific news article. It also indicates that word embedding may not be the best document representation for domain specific document retrieval tasks. Our method is fully automated and can be effectively applied to facilitating fake and misinformed news detection across many scientific domains.
引用
收藏
页码:251 / 254
页数:4
相关论文
共 50 条
  • [1] Algo_Seer: System for Extracting and Searching Algorithms in Scholarly Big Data
    Sangam, M. Biradar
    Shekhar, R.
    Reddy, Pranayanath
    [J]. INTELLIGENT COMMUNICATION TECHNOLOGIES AND VIRTUAL MOBILE NETWORKS, ICICV 2019, 2020, 33 : 116 - 126
  • [2] Career Age-Aware Scientific Collaborator Recommendation in Scholarly Big Data
    Sun, Na
    Lu, Yong
    Cao, Yongcun
    [J]. IEEE ACCESS, 2019, 7 : 136036 - 136045
  • [3] Guest Editorial: Scholarly Big Data
    Xia, Feng
    Giles, C. Lee
    Liu, Huan
    Wang, Kuansan
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (01) : 200 - 203
  • [4] Scholarly Big Data Knowledge and Semantics
    Giles, C. Lee
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 371 - 371
  • [5] Searching Inter-disciplinary Scientific Big Data based on Latent Correlation Analysis
    Gonzales, Eloy
    Ong, Bun Theang
    Zettsu, Koji
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [6] Scholarly Big Data: Information Extraction and Data Mining
    Giles, C. Lee
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1 - 1
  • [7] A survey on scholarly data: From big data perspective
    Khan, Samiya
    Liu, Xiufeng
    Shakil, Kashish A.
    Alam, Mansaf
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (04) : 923 - 944
  • [8] Scholarly searching
    不详
    [J]. INDUSTRIAL ENGINEER, 2005, 37 (03): : 16 - 16
  • [9] Preprocessing framework for scholarly big data management
    Khan, Samiya
    Alam, Mansaf
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 39719 - 39743
  • [10] Preprocessing framework for scholarly big data management
    Samiya Khan
    Mansaf Alam
    [J]. Multimedia Tools and Applications, 2023, 82 : 39719 - 39743