WebCiteS: Attributed Query-Focused Summarization on ChineseWeb Search Results with Citations

被引:0
|
作者
Deng, Haolin [1 ]
Wang, Chang [3 ]
Li, Xin [3 ]
Yuan, Dezhang [3 ]
Zhan, Junlang [3 ]
Zhou, Tianhua [3 ]
Ma, Jin [4 ]
Gao, Jun [1 ]
Xu, Ruifeng [1 ,2 ,5 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Tencent Inc, Shenzhen, Peoples R China
[4] Univ Sci & Technol China, Hefei, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence T, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Enhancing the attribution in large language models (LLMs) is a crucial task. One feasible approach is to enable LLMs to cite external sources that support their generations. However, existing datasets and evaluation methods in this domain still exhibit notable limitations. In this work, we formulate the task of attributed query-focused summarization (AQFS) and present WebCiteS, a Chinese dataset featuring 7k human-annotated summaries with citations. WebCiteS derives from real-world user queries and web search results, offering a valuable resource for model training and evaluation. Prior works in attribution evaluation do not differentiate between groundedness errors and citation errors. They also fall short in automatically verifying sentences that draw partial support from multiple sources. We tackle these issues by developing detailed metrics and enabling the automatic evaluator to decompose the sentences into sub-claims for fine-grained verification. Our comprehensive evaluation of both open-source and proprietary models on WebCiteS highlights the challenge LLMs face in correctly citing sources, underscoring the necessity for further improvement.(1)
引用
收藏
页码:15095 / 15114
页数:20
相关论文
共 50 条
  • [1] Bayesian Query-Focused Summarization
    Daume, Hal, III
    Marcu, Daniel
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 305 - 312
  • [2] Query-Focused Extractive Video Summarization
    Sharghi, Aidean
    Gong, Boqing
    Shah, Mubarak
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 3 - 19
  • [3] A Query-Focused Summarization Method that Guarantees the Inclusion of Query Words
    Yasuda, Norihito
    Nishino, Masaaki
    Hirao, Tsutomu
    Suzuki, Jun
    Kataoka, Ryoji
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 126 - 130
  • [4] QTSUMM: Query-Focused Summarization over Tabular Data
    Zhao, Yilun
    Qi, Zhenting
    Nan, Linyong
    Mi, Boyu
    Liu, Yixin
    Zou, Weijin
    Han, Simeng
    Chen, Ruizhe
    Tang, Xiangru
    Xu, Yumo
    Radev, Dragomir
    Cohan, Arman
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 1157 - 1172
  • [5] Query-Focused EHR Summarization to Aid Imaging Diagnosis
    McInerney, Denis Jered
    Dabiri, Borna
    Touret, Anne-Sophie
    Young, Geoffrey
    van de Meent, Jan-Willem
    Wallace, Byron C.
    MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 126, 2020, 126 : 632 - 658
  • [6] A Lightweight Constrained Generation Alternative for Query-focused Summarization
    Xu, Zhichao
    Cohen, Daniel
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1745 - 1749
  • [7] Query-Focused Multi-document Summarization Survey
    Alanzi, Entesar
    Alballaa, Safa
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 822 - 833
  • [8] Learning to Rank Utterances for Query-Focused Meeting Summarization
    Liu, Xingxian
    Xu, Yajing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8496 - 8505
  • [9] Transforming Wikipedia Into Augmented Data for Query-Focused Summarization
    Zhu, Haichao
    Dong, Li
    Wei, Furu
    Qin, Bing
    Liu, Ting
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2357 - 2367
  • [10] Grasping Both Query Relevance and Essential Content for Query-focused Summarization
    Xiong, Ye
    Kamigaito, Hidetaka
    Murakami, Soichiro
    Zhang, Peinan
    Takamura, Hiroya
    Okumura, Manabu
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2452 - 2456