WebCiteS: Attributed Query-Focused Summarization on ChineseWeb Search Results with Citations

被引:0
|
作者
Deng, Haolin [1 ]
Wang, Chang [3 ]
Li, Xin [3 ]
Yuan, Dezhang [3 ]
Zhan, Junlang [3 ]
Zhou, Tianhua [3 ]
Ma, Jin [4 ]
Gao, Jun [1 ]
Xu, Ruifeng [1 ,2 ,5 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Tencent Inc, Shenzhen, Peoples R China
[4] Univ Sci & Technol China, Hefei, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence T, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enhancing the attribution in large language models (LLMs) is a crucial task. One feasible approach is to enable LLMs to cite external sources that support their generations. However, existing datasets and evaluation methods in this domain still exhibit notable limitations. In this work, we formulate the task of attributed query-focused summarization (AQFS) and present WebCiteS, a Chinese dataset featuring 7k human-annotated summaries with citations. WebCiteS derives from real-world user queries and web search results, offering a valuable resource for model training and evaluation. Prior works in attribution evaluation do not differentiate between groundedness errors and citation errors. They also fall short in automatically verifying sentences that draw partial support from multiple sources. We tackle these issues by developing detailed metrics and enabling the automatic evaluator to decompose the sentences into sub-claims for fine-grained verification. Our comprehensive evaluation of both open-source and proprietary models on WebCiteS highlights the challenge LLMs face in correctly citing sources, underscoring the necessity for further improvement.(1)
引用
收藏
页码:15095 / 15114
页数:20
相关论文
共 50 条
  • [21] A Compare-Aggregate Model with External Knowledge for Query-Focused Summarization
    Ya, Jing
    Liu, Tingwen
    Guo, Li
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT II, 2020, 12343 : 68 - 83
  • [22] Query-focused Multi-document Summarization Using Cloud Model
    Chen, Jinguang
    He, Tingting
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (03): : 951 - 956
  • [23] Query-focused Multi-documents Summarization Using Genetic Algorithm
    Tang, Jun
    Li, Jichu
    COMPONENTS, PACKAGING AND MANUFACTURING TECHNOLOGY, 2011, 460-461 : 48 - 53
  • [24] Query-Biased Self-Attentive Network for Query-Focused Video Summarization
    Xiao, Shuwen
    Zhao, Zhou
    Zhang, Zijian
    Guan, Ziyu
    Cai, Deng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5889 - 5899
  • [25] An indicator-based multi-objective variable neighborhood search approach for query-focused summarization
    Sanchez-Gomez, Jesus M.
    Vega-Rodriguez, Miguel A.
    Perez, Carlos J.
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
  • [26] Data Augmentation for Abstractive Query-Focused Multi-Document Summarization
    Pasunuru, Ramakanth
    Celikyilmaz, Asli
    Galley, Michel
    Xiong, Chenyan
    Zhang, Yizhe
    Bansal, Mohit
    Gao, Jianfeng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 13666 - 13674
  • [27] Query-focused summarization with the context-graph information fusion transformer
    Park, Choongwon
    Ko, Youngjoong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [28] Exploring heterogeneous features for query-focused summarization of categorized community answers
    Wei, Wei
    Ming, ZhaoYan
    Nie, Liqiang
    Li, Guohui
    Li, Jianjun
    Zhu, Feida
    Shang, Tianfeng
    Luo, Changyin
    INFORMATION SCIENCES, 2016, 330 : 403 - 423
  • [29] Query-Focused Multi-document Summarization Based on Concept Importance
    Zheng, Hai-Tao
    Guo, Ji-Min
    Jiang, Yong
    Xia, Shu-Tao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 443 - 453
  • [30] Applying regression models to query-focused multi-document summarization
    Ouyang, You
    Li, Wenjie
    Li, Sujian
    Lu, Qin
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (02) : 227 - 237