Searching for Evidence of Scientific News in Scholarly Big Data

被引:1
|
作者
Ul Hoque, Md Reshad [1 ]
Bradley, Dash [2 ]
Kwan, Chiman [3 ]
Chiatti, Agnese [4 ]
Li, Jiang [1 ]
Wu, Jian [2 ]
机构
[1] Old Dominion Univ, Elect & Comp Engn, Norfolk, VA 23529 USA
[2] Old Dominion Univ, Comp Sci, Norfolk, VA USA
[3] Appl Res LLC, Rockville, MD USA
[4] Open Univ, Knowledge Media Inst, Milton Keynes, Bucks, England
关键词
Fake news; Domain knowledge entity; Web API; Embedding;
D O I
10.1145/3360901.3364438
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Public digital media can often mix factual information with fake scientific news, which is typically difficult to pinpoint, especially for non-professionals. These scientific news articles create illusions and misconceptions, thus ultimately influence the public opinion, with serious consequences at a broader social scale. Yet, existing solutions aiming at automatically verifying the credibility of news articles are still unsatisfactory. We propose to verify scientific news by retrieving and analyzing its most relevant source papers from an academic digital library (DL), e.g., arXiv. Instead of querying keywords or regular named entities extracted from news articles, we query domain knowledge entities (DKEs) extracted from the text. By querying each DKE, we retrieve a list of candidate scholarly papers. We then design a function to rank them and select the most relevant scholarly paper. After exploring various representations, experiments indicate that the term frequency-inverse document frequency (TF-IDF) representation with cosine similarity outperforms baseline models based on word embedding. This result demonstrates the efficacy of using DKEs to retrieve scientific papers which are relevant to a specific news article. It also indicates that word embedding may not be the best document representation for domain specific document retrieval tasks. Our method is fully automated and can be effectively applied to facilitating fake and misinformed news detection across many scientific domains.
引用
收藏
页码:251 / 254
页数:4
相关论文
共 50 条
  • [21] Scholarly Data Share: A Model for Sharing Big Data in Academic Research
    Chapman, Katie
    Ruan, Guangchen
    Tuna, M. Esen
    Walsh, Alan
    Wernert, Eric
    [J]. PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING 2022, 2022,
  • [22] Unified Searching Service for Electric Big Data
    Gao, LiFang
    Li, QiMeng
    Lian, YangYang
    Lv, PengPeng
    Zhou, WenFang
    [J]. 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 : 123 - 130
  • [23] Scholarly Data Mining: Making Sense of Scientific Literature
    Saggion, Horacio
    Ronzano, Francesco
    [J]. 2017 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2017), 2017, : 346 - 347
  • [24] Searching for big data How incumbents explore a possible adoption of big data technologies
    Caesarius, Leon Michael
    Hohenthal, Jukka
    [J]. SCANDINAVIAN JOURNAL OF MANAGEMENT, 2018, 34 (02) : 129 - 140
  • [25] Faster big data searching by utilizing the statistical model of the data
    Shibuya, Tetsuo
    [J]. Journal of the Institute of Electronics, Information and Communication Engineers, 2014, 97 (05): : 384 - 387
  • [26] Scientific big data and Digital Earth
    Guo, Huadong
    Wang, Lizhe
    Chen, Fang
    Liang, Dong
    [J]. CHINESE SCIENCE BULLETIN, 2014, 59 (35): : 5066 - 5073
  • [27] Big data: the end of the scientific method?
    Succi, Sauro
    Coveney, Peter V.
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2019, 377 (2142):
  • [28] Machine learning and big scientific data
    Hey, Tony
    Butler, Keith
    Jackson, Sam
    Thiyagalingam, Jeyarajan
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2020, 378 (2166):
  • [29] Big data for scientific research and discovery
    Guo, Huadong
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2015, 8 (01) : 1 - 2
  • [30] Scientific big data and Digital Earth
    Huadong Guo
    Lizhe Wang
    Fang Chen
    Dong Liang
    [J]. Science Bulletin, 2014, (35) : 5066 - 5073