Human-in-the-loop Exploration of Composite Items

Cited by: 0
Author
Roy, Senjuti Basu [1]
Affiliation
[1] New Jersey Inst Technol, Newark, NJ 07102 USA
Source
PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD | 2019
Funding
National Science Foundation (USA)
DOI
10.1145/3297001.3297065
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Human-in-the-loop data exploration is seeing renewed interest in our community. With the rise of big data analytics, this area is growing to encompass not only approaches and algorithms for finding the next best data items to explore but also interactivity, i.e., accounting for feedback from the data scientist during the exploration. Interactivity is essential to account for evolving needs during the exploration and to customize the discovery process. In this tutorial, we focus on exploration of Composite Items (CIs) that require repeated interaction with human users. CIs address complex information needs and are prevalent in online shopping, where products are bundled together to provide discounts; in travel itinerary recommendation, where points of interest in a city are combined into a single travel package; and in task assignment in crowdsourcing, where personalized micro-tasks are composed and recommended to workers. CI formation is usually expressed as a constrained optimization problem. For instance, in online shopping, package retrieval can retrieve the cheapest smartphones (optimization objective) with compatible accessories (constraints). Similarly, a city tour must be the most popular and conform to a total time and cost budget. A data scientist interested in exploring a variety of CIs has to repeatedly reformulate optimization problems with new constraints and objectives. In this tutorial, we investigate the applicability of interactive data exploration approaches to CI formation. The tutorial will have the following parts:
  • 15 minutes: We first review CI applications and shapes that are applicable in different domains. This part will gather different examples and attempt to unify them.
  • 60 minutes: We then discuss three big research questions: (i) existing algorithms for CI formation, (ii) human-in-the-loop CIs, and (iii) optimization opportunities.
  • 15 minutes: We conclude with ongoing and future research directions.
The tutorial targets theoreticians and practitioners interested in the development of data science applications. It should be of particular interest to database researchers, applied machine learners, and data scientists in industrial research settings who want to learn how different domains, such as product recommendation, scientific simulation, or team formation in the social sciences and crowdsourcing, have been developing their "siloed" definitions of CIs. The research direction presented in the tutorial will help bring these domain-specific ideas together and create an overarching generic framework. Tutorial attendees are expected to have basic knowledge of algorithms and data management. Knowledge of constrained optimization is not necessary. The proposed tutorial is timely. It brings together several related efforts and addresses unsolved questions in the emerging area of human-in-the-loop exploration of complex information needs. The tutorial is relevant to the general area of data science and more specifically to Scalable Analytics, Data Mining, Clustering and Knowledge Discovery, Indexing, Query Processing and Optimization, and Crowdsourcing. The technical topics covered are constrained optimization, ranking semantics, clustering, algorithms, and empirical evaluations.
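The city tour example in the abstract (most popular tour within a total time and cost budget) can be sketched as a small constrained optimization. This is only an illustration under stated assumptions, not an algorithm from the tutorial: the points of interest, their popularity scores, durations, and costs are made-up values, the function name `best_tour` is hypothetical, and exhaustive enumeration stands in for whatever CI formation algorithm a real system would use.

```python
from itertools import combinations

# Hypothetical points of interest: (name, popularity, hours, cost).
# All values are invented for illustration.
POIS = [
    ("museum", 8, 3.0, 20.0),
    ("tower", 9, 2.0, 30.0),
    ("park", 6, 1.5, 0.0),
    ("market", 7, 2.0, 10.0),
    ("cruise", 5, 2.5, 40.0),
]


def best_tour(pois, max_hours, max_cost):
    """Return (total popularity, names) of the most popular feasible tour.

    Objective: maximize summed popularity.
    Constraints: total hours <= max_hours and total cost <= max_cost.
    Brute force over all subsets, so only suitable for tiny inputs.
    """
    best = (0, ())
    for size in range(1, len(pois) + 1):
        for combo in combinations(pois, size):
            hours = sum(p[2] for p in combo)
            cost = sum(p[3] for p in combo)
            if hours <= max_hours and cost <= max_cost:
                score = sum(p[1] for p in combo)
                if score > best[0]:
                    best = (score, tuple(p[0] for p in combo))
    return best


# First formulation: 5 hours and a cost budget of 40.
first = best_tour(POIS, max_hours=5.0, max_cost=40.0)
# Reformulation after feedback: the user relaxes the time budget to 6 hours.
second = best_tour(POIS, max_hours=6.0, max_cost=40.0)
```

Calling `best_tour` again with a relaxed time budget mirrors, in a very simplified way, the repeated reformulation of constraints that the abstract describes: each round of user feedback changes the budgets or the objective, and the problem is solved again.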
Pages: 367 / 367
Page count: 1