WebCiteS: Attributed Query-Focused Summarization on ChineseWeb Search Results with Citations

被引:0
|
作者
Deng, Haolin [1 ]
Wang, Chang [3 ]
Li, Xin [3 ]
Yuan, Dezhang [3 ]
Zhan, Junlang [3 ]
Zhou, Tianhua [3 ]
Ma, Jin [4 ]
Gao, Jun [1 ]
Xu, Ruifeng [1 ,2 ,5 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Tencent Inc, Shenzhen, Peoples R China
[4] Univ Sci & Technol China, Hefei, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence T, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enhancing the attribution in large language models (LLMs) is a crucial task. One feasible approach is to enable LLMs to cite external sources that support their generations. However, existing datasets and evaluation methods in this domain still exhibit notable limitations. In this work, we formulate the task of attributed query-focused summarization (AQFS) and present WebCiteS, a Chinese dataset featuring 7k human-annotated summaries with citations. WebCiteS derives from real-world user queries and web search results, offering a valuable resource for model training and evaluation. Prior works in attribution evaluation do not differentiate between groundedness errors and citation errors. They also fall short in automatically verifying sentences that draw partial support from multiple sources. We tackle these issues by developing detailed metrics and enabling the automatic evaluator to decompose the sentences into sub-claims for fine-grained verification. Our comprehensive evaluation of both open-source and proprietary models on WebCiteS highlights the challenge LLMs face in correctly citing sources, underscoring the necessity for further improvement.(1)
引用
收藏
页码:15095 / 15114
页数:20
相关论文
共 50 条
  • [31] Hierarchical Variational Network for User-Diversified & Query-Focused Video Summarization
    Jiang, Pin
    Han, Yahong
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 202 - 206
  • [32] Exploring actor–object relationships for query-focused multi-document summarization
    Mohammadreza Valizadeh
    Pavel Brazdil
    Soft Computing, 2015, 19 : 3109 - 3121
  • [33] QuerySum: A Multi-Document Query-Focused Summarization Dataset Augmented with Similar Query Clusters
    Liu, Yushan
    Wang, Zili
    Yuan, Ruifeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18725 - 18732
  • [34] Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach
    Sharghi, Aidean
    Laurel, Jacob S.
    Gong, Boqing
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2127 - 2136
  • [35] Constructing Query-Focused Summarization Dataset AMTQFSum Based on ChatGPT and Prompt Engineering
    Jinling, Shang
    Jianyong, Zhang
    Data Analysis and Knowledge Discovery, 2024, 8 (8-9) : 122 - 132
  • [36] SDbQfSum: Query-focused summarization framework based on diversity and text semantic analysis
    Mohamed, Muhidin
    Oussalah, Mourad
    Chang, Victor
    EXPERT SYSTEMS, 2024, 41 (01)
  • [37] A Novel Contextual Topic Model for Query-focused Multi-document Summarization
    Yang, Guangbing
    2014 IEEE 26TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2014, : 576 - 583
  • [38] Lexical Similarity Based Query-Focused Summarization Using Artificial Immune Systems
    Katiyar, Sulabh
    Borgohain, Samir
    ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 : 287 - 296
  • [39] Query-focused multi-document text summarization using fuzzy inference
    Agarwal, Raksha
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4641 - 4652
  • [40] Review on Query-focused Multi-document Summarization (QMDS) with Comparative Analysis
    Roy, Prasenjeet
    Kundu, Suman
    ACM COMPUTING SURVEYS, 2024, 56 (01)