Probabilistic Multigraph Modeling for Improving the Quality of Crowdsourced Affective Data

Authors
Ye, Jianbo [1]
Li, Jia [2]
Newman, Michelle G. [3]
Adams, Reginald B., Jr. [3]
Wang, James Z. [1]
Affiliations
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[3] Penn State Univ, Dept Psychol, University Pk, PA 16802 USA
Funding
National Science Foundation (USA);
Keywords
Emotions; human subjects; crowdsourcing; probabilistic graphical model; visual stimuli;
DOI
10.1109/TAFFC.2017.2678472
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a probabilistic approach to jointly modeling participants' reliability and humans' regularity in crowdsourced affective studies. Reliability measures how likely a subject is to respond to a question seriously; regularity measures how often a human agrees with other seriously entered responses from a targeted population. Crowdsourcing-based studies or experiments that rely on human self-reported affect pose additional challenges compared with typical crowdsourcing studies that attempt to acquire concrete, non-affective labels of objects. The reliability of participants has been studied extensively for typical non-affective crowdsourcing studies, whereas the regularity of humans in an affective experiment has not been thoroughly considered in its own right. It has often been observed that different individuals report different feelings about the same test question, which has no single correct response in the first place. High reliability of responses from one individual therefore does not imply high consensus across individuals. Instead, investigators are interested in testing the consensus of a population globally. Built upon the agreement multigraph among tasks and workers, our probabilistic model differentiates subject regularity from population reliability. We demonstrate the method's effectiveness for in-depth, robust analysis of large-scale crowdsourced affective data, including emotion and aesthetic assessments collected by presenting visual stimuli to human subjects.
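To make the idea of an agreement multigraph among tasks and workers more concrete, the following is a minimal sketch in Python. It is an illustration under stated assumptions, not the authors' construction or their probabilistic model: the function names, the numeric rating scale, and the tolerance `tol` used to decide that two responses "agree" are all hypothetical. Each task contributes its own edge between any two workers whose ratings agree, so a pair of workers can be joined by several parallel edges, one per task.

```python
from collections import defaultdict
from itertools import combinations


def build_agreement_multigraph(responses, tol=1.0):
    """Return a list of (worker_a, worker_b, task) edges.

    responses: iterable of (worker, task, rating) tuples.
    tol: hypothetical threshold; two ratings "agree" if they differ by <= tol.
    """
    by_task = defaultdict(list)
    for worker, task, rating in responses:
        by_task[task].append((worker, rating))

    edges = []
    for task, answers in by_task.items():
        # One potential edge per pair of workers who answered the same task.
        for (w1, r1), (w2, r2) in combinations(answers, 2):
            if abs(r1 - r2) <= tol:
                edges.append((w1, w2, task))
    return edges


def agreement_rates(responses, tol=1.0):
    """Fraction of co-answered pairs in which each worker agrees with a peer.

    This raw rate is only a descriptive statistic (an assumption of this
    sketch); it does not separate reliability from regularity.
    """
    by_task = defaultdict(list)
    for worker, task, rating in responses:
        by_task[task].append((worker, rating))

    agreed = defaultdict(int)
    total = defaultdict(int)
    for answers in by_task.values():
        for (w1, r1), (w2, r2) in combinations(answers, 2):
            hit = abs(r1 - r2) <= tol
            for w in (w1, w2):
                total[w] += 1
                agreed[w] += int(hit)
    return {w: agreed[w] / total[w] for w in total}


if __name__ == "__main__":
    demo = [
        ("a", "t1", 5), ("b", "t1", 5), ("c", "t1", 1),
        ("a", "t2", 3), ("b", "t2", 3.5),
    ]
    print(build_agreement_multigraph(demo))
    print(agreement_rates(demo))
```

A raw agreement rate like this cannot tell a careless worker from a serious worker whose feelings simply differ from the majority, which is the distinction the abstract draws between reliability and regularity.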
Pages: 115-128
Number of pages: 14