Examining the merits of feature-specific similarity functions in the news domain using human judgments

被引:0
|
作者
Starke, Alain D. [1 ,2 ]
Solberg, Vegard R. [2 ]
Overhaug, Sebastian [2 ]
Trattner, Christoph [2 ]
机构
[1] Univ Amsterdam, Amsterdam Sch Commun Res, POB 15791, NL-1001 NG Amsterdam, Netherlands
[2] Univ Bergen, Dept Informat Sci & Media Studies, MediaFutures, Lars Hilles Gate 30, N-5008 Bergen, Norway
关键词
News; Similarity; Similar-item retrieval; Recommender systems; Human judgment;
D O I
10.1007/s11257-024-09412-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online news article recommendations are typically of the 'more like this' type, generated by similarity functions. Across three studies, we examined the representativeness of different similarity functions for news item retrieval, by comparing them to human judgments of similarity. In Study 1 (N=401\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=401$$\end{document}), participants assessed the overall similarity of ten randomly paired news articles on politics and compared their judgments to different feature-specific similarity functions (e.g., based on body text or images). In Study 2, we checked for domain differences in a mixed-methods survey (N=45\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=45$$\end{document}), surfacing evidence that the effectiveness of similarity functions differs across different news categories ('Recent Events', 'Sport'). In Study 3 (N=173\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N=173$$\end{document}), we improved the design of Study 1, by controlling for how news articles were matched, differentiating between dissimilar news articles and articles that were matched on a shared topic, named entities, and/or date of publication, across 'Recent Events' and 'Sport' categories. Across all studies, we found that users mostly used text-based features (e.g., body text, title) for their similarity judgments, while BodyText:TF-IDF was found to be the most representative for their judgments. Moreover, the strength of similarity judgments by humans and similarity scores by feature-specific functions was strongly affected by how news article pairs were matched. We show that humans and similarity functions are better aligned when two news articles are more alike, such as in a news recommendation scenario.
引用
收藏
页码:995 / 1042
页数:48
相关论文
共 50 条