Examining the merits of feature-specific similarity functions in the news domain using human judgments

Cited by: 0
Authors
Starke, Alain D. [1, 2]
Solberg, Vegard R. [2]
Overhaug, Sebastian [2]
Trattner, Christoph [2]
Affiliations
[1] Univ Amsterdam, Amsterdam Sch Commun Res, POB 15791, NL-1001 NG Amsterdam, Netherlands
[2] Univ Bergen, Dept Informat Sci & Media Studies, MediaFutures, Lars Hilles Gate 30, N-5008 Bergen, Norway
Keywords
News; Similarity; Similar-item retrieval; Recommender systems; Human judgment
DOI
10.1007/s11257-024-09412-2
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Online news article recommendations are typically of the 'more like this' type, generated by similarity functions. Across three studies, we examined the representativeness of different similarity functions for news item retrieval, by comparing them to human judgments of similarity. In Study 1 (N = 401), participants assessed the overall similarity of ten randomly paired news articles on politics and compared their judgments to different feature-specific similarity functions (e.g., based on body text or images). In Study 2, we checked for domain differences in a mixed-methods survey (N = 45), surfacing evidence that the effectiveness of similarity functions differs across different news categories ('Recent Events', 'Sport'). In Study 3 (N = 173), we improved the design of Study 1, by controlling for how news articles were matched, differentiating between dissimilar news articles and articles that were matched on a shared topic, named entities, and/or date of publication, across 'Recent Events' and 'Sport' categories. Across all studies, we found that users mostly used text-based features (e.g., body text, title) for their similarity judgments, while BodyText:TF-IDF was found to be the most representative for their judgments. Moreover, the strength of similarity judgments by humans and similarity scores by feature-specific functions was strongly affected by how news article pairs were matched. We show that humans and similarity functions are better aligned when two news articles are more alike, such as in a news recommendation scenario.
Pages: 995 - 1042
Page count: 48
Related papers
50 in total
  • [31] Re-examining User Burden in Human-AI Interaction: Focusing on a Domain-Specific Approach
    Park, Hyanghee
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [32] Stock Trend Prediction Using Multi-attention Network on Domain-specific and Domain-general Features in News Headline
    Soon, Phaik Ching
    Tan, Tien-Ping
    Chan, Huah Yong
    Gan, Keng Hoon
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2025, 33 (02) : 823 - 843
  • [33] Weakly supervised Medulloblastoma tumor classification using domain specific patch-level feature extraction
    Maack, Lennart
    Bhattacharya, Debayan
    Behrendt, Finn
    Bockmayr, Michael
    Schlaefer, Alexander
    DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
  • [34] Time Domain Multi-Feature Extraction and Classification of Human Hand Movements Using Surface EMG
    Bhattacharya, Avik
    Sarkar, Anasua
    Basak, Piyali
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2017,
  • [35] Using words as lexical basis functions for automatically indexing face images in a manner that correlates with human perception of similarity
    Phielipp, Mariano
    Black, John A., Jr.
    Panchanathan, Sethuraman
    HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
  • [36] Revealing long noncoding RNA architecture and functions using domain-specific chromatin isolation by RNA purification
    Quinn, Jeffrey J.
    Ilik, Ibrahim A.
    Qu, Kun
    Georgiev, Plamen
    Chu, Ci
    Akhtar, Asifa
    Chang, Howard Y.
    NATURE BIOTECHNOLOGY, 2014, 32 (09) : 933 - 940
  • [37] Revealing long noncoding RNA architecture and functions using domain-specific chromatin isolation by RNA purification
    Jeffrey J Quinn
    Ibrahim A Ilik
    Kun Qu
    Plamen Georgiev
    Ci Chu
    Asifa Akhtar
    Howard Y Chang
    Nature Biotechnology, 2014, 32 : 933 - 940
  • [38] Design of a Specific Colonic Mucus Marker Using a Human Commensal Bacterium Cell Surface Domain
    Coic, Yves-Marie
    Baleux, Francoise
    Poyraz, Oemer
    Thibeaux, Roman
    Labruyere, Elisabeth
    Chretien, Fabrice
    Sobhani, Iradj
    Lazure, Thierry
    Wyplosz, Benjamin
    Schneider, Gunter
    Mulard, Laurence
    Sansonetti, Philippe J.
    Marteyn, Benoit S.
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2012, 287 (19) : 15916 - 15922
  • [39] Human Multi-Activities Classification Using mmWave Radar: Feature Fusion in Time-Domain and PCANet
    Lin, Yier
    Li, Haobo
    Faccio, Daniele
    SENSORS, 2024, 24 (16)
  • [40] Examining the Predictive Validity of the Grit Scale-Short (Grit-S) Using Domain-General and Domain-Specific Approaches With Student-Athletes
    Rumbold, James L.
    Dunn, John G. H.
    Olusoga, Peter
    FRONTIERS IN PSYCHOLOGY, 2022, 13