More extreme duplication in FDA Adverse Event Reporting System detected by literature reference normalization and fuzzy string matching

被引:6
|
作者
Hung, Eric [1 ,4 ]
Hauben, Manfred [1 ]
Essex, Henry [1 ]
Zou, Chen [2 ]
Bright, Steve [3 ]
机构
[1] Pfizer Inc, Safety Surveillance & Risk Management, New York, NY USA
[2] Pfizer Inc, Safety Surveillance & Risk Management, Shanghai, Peoples R China
[3] Oracle Hlth Sci, Las Vegas, NV USA
[4] Pfizer Inc, New York, NY 10017 USA
关键词
addictovigilance; duplicate detection; spontaneous reports; string matching;
D O I
10.1002/pds.5555
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
PurposeLiterature reports of adverse drug events can be replicated across multiple companies, resulting in extreme duplication (defined as a majority of reports being duplicates) in the FDA Adverse Event Reporting System (FAERS) database because they can escape legacy duplicate detection algorithms routinely deployed on that data source. Literature reference field, added to in 2014, could potentially be utilized to identify replicated reports. FAERS does not enforce adherence to the Vancouver referencing convention, thus the same article may be referenced differently leading to duplication. The objective of this analysis is to determine if variations of the same literature references observed in FAERS can be resolved with text normalization and fuzzy string matching. MethodsWe normalized the literature references recorded in the FAERS database through the first quarter of 2021 with a rule-based algorithm so that they better conform to the Vancouver convention. Levenshtein distance was then utilized to merge sufficiently similar normalized literature references together. ResultsNormalization of literature references increases the percentage that can be parsed into author, title, and journal from 61.74% to 93.93%. We observe that about 98% of pairs within groups do have a Levenshtein similarity of the title above the threshold. The extreme duplication ranged from 66% to 87% with a median of 72% of reports being duplicates and often involved addictovigilance scenarios. ConclusionsWe have shown that these normalized references can be merged via fuzzy string matching to improve enumeration of all the individual case safety reports that refer to the same article. Inclusion of the PubMed ID and adherence to the Vancouver convention could facilitate identification of duplicates in the FAERS dataset. Awareness of this phenomenon may improve disproportionality analysis, especially in areas such as addictovigilance.
引用
收藏
页码:387 / 391
页数:5
相关论文
共 20 条
  • [1] ‘Extreme Duplication’ in the US FDA Adverse Events Reporting System Database
    Manfred Hauben
    Lester Reich
    James De Micco
    Katherine Kim
    [J]. Drug Safety, 2007, 30 : 551 - 554
  • [2] Adverse Events Associated with Fosfomycin Use: Review of the Literature and Analyses of the FDA Adverse Event Reporting System Database
    Iarikov D.
    Wassel R.
    Farley J.
    Nambiar S.
    [J]. Infectious Diseases and Therapy, 2015, 4 (4) : 433 - 458
  • [3] Cases of Benzocaine-Associated Methemoglobinemia Identified in the FDA Adverse Event Reporting System and the Literature
    Lardieri, Allison B.
    Crew, Page E.
    McCulley, Lynda
    Kim, Ivone E.
    Waldron, Peter
    Diak, Ida-Lina
    [J]. ANNALS OF PHARMACOTHERAPY, 2019, 53 (04) : 437 - 438
  • [4] Eosinophilic Pneumonia in Patients Treated with DaptomycinReview of the Literature and US FDA Adverse Event Reporting System Reports
    Peter W. Kim
    Alfred F. Sorbello
    Ronald T. Wassel
    Tracy M. Pham
    Joseph M. Tonning
    Sumathi Nambiar
    [J]. Drug Safety, 2012, 35 : 447 - 457
  • [5] Eosinophilic Pneumonia in Patients Treated with Daptomycin Review of the Literature and US FDA Adverse Event Reporting System Reports
    Kim, Peter W.
    Sorbello, Alfred F.
    Wassel, Ronald T.
    Pham, Tracy M.
    Tonning, Joseph M.
    Nambiar, Sumathi
    [J]. DRUG SAFETY, 2012, 35 (06) : 447 - 457
  • [6] Drug Interaction Between Febuxostat and Thiopurine Antimetabolites: A Review of the FDA Adverse Event Reporting System and Medical Literature
    Logan, Jill K.
    Yapa, Shalini Wickramaratne Senarath
    Harinstein, Lisa
    Saluja, Bhawana
    Munoz, Monica
    Sahajwalla, Chandrahas
    Neuner, Rosemarie
    Seymour, Sally
    [J]. PHARMACOTHERAPY, 2020, 40 (02): : 125 - 132
  • [7] Cyclosporine-induced alopecia:a case report, FDA adverse event reporting system analysis and literature assessment
    Wang, Ying
    Wang, Youhong
    Xu, Ping
    [J]. FRONTIERS IN PHARMACOLOGY, 2024, 15
  • [8] Dropped head syndrome: a rare adverse drug reaction identified in the FDA adverse event reporting system and review of case reports in the literature
    Liu, Xin
    Zhao, Xiaoyue
    He, Yangyang
    Tang, Yan
    Yan, Xue-Lian
    Zhao, Bin
    Dai, Yi
    [J]. EXPERT OPINION ON DRUG SAFETY, 2022, 21 (10) : 1329 - 1336
  • [9] An evaluation of a data mining signal for amyotrophic lateral sclerosis and statins detected in FDA's spontaneous adverse event reporting system
    Colman, Eric
    Szarfman, Ana
    PharmD, Jo Wyeth
    Mosholder, Andrew
    Jillapalli, Devanand
    Levine, Jonathan
    Avigan, Mark
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2008, 17 (11) : 1068 - 1076
  • [10] Asparaginase-related diabetic ketoacidosis: Analysis of the FDA Adverse Event Reporting System (FAERS) data and literature review
    Li, Dongxuan
    Gou, Jinghui
    Dong, Jie
    Dong, Yuzhu
    Xi, Xin
    Chen, Cheng
    Du, Qian
    Liu, Songqing
    [J]. JOURNAL OF CLINICAL PHARMACY AND THERAPEUTICS, 2022, 47 (12) : 2176 - 2181