JEC-QA: A Legal-Domain Question Answering Dataset

被引:0
|
作者
Zhong, Haoxi [1 ,2 ]
Xiao, Chaojun [1 ,2 ]
Tu, Cunchao [1 ,2 ]
Zhang, Tianyang [3 ]
Liu, Zhiyuan [1 ,2 ]
Sun, Maosong [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Inst Artificial Intelligence, Beijing, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Beijing Powerlaw Intelligent Technol Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present JEC-QA, the largest question answering dataset in the legal domain, collected from the National Judicial Examination of China. The examination is a comprehensive evaluation of professional skills for legal practitioners. College students are required to pass the examination to be certified as a lawyer or a judge. The dataset is challenging for existing question answering methods, because both retrieving relevant materials and answering questions require the ability of logic reasoning. Due to the high demand of multiple reasoning abilities to answer legal questions, the state-of-the-art models can only achieve about 28% accuracy on JEC-QA, while skilled humans and unskilled humans can reach 81% and 64% accuracy respectively, which indicates a huge gap between humans and machines on this task. We will release JEC-QA and our baselines to help improve the reasoning ability of machine comprehension models. You can access the dataset from http://jecqa.thunlp.org/.
引用
收藏
页码:9701 / 9708
页数:8
相关论文
共 50 条
  • [1] ECG-QA: A Comprehensive Question Answering Dataset Combined With Electrocardiogram
    Oh, Jungwoo
    Lee, Gyubok
    Bae, Seongsu
    Kwon, Joon-Myoung
    Choi, Edward
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] DISFL-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering
    Gupta, Aditya
    Xu, Jiacheng
    Upadhyay, Shyam
    Yang, Diyi
    Faruqui, Manaal
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3309 - 3319
  • [3] Transformer Models for Question Answering on Autism Spectrum Disorder QA Dataset
    Firsanova, Victoria
    [J]. DIGITAL TRANSFORMATION AND GLOBAL SOCIETY, DTGS 2021, 2022, 1503 : 122 - 133
  • [4] AILA: A Question Answering System in the Legal Domain
    Huang, Weiyi
    Jiang, Jiahao
    Qu, Qiang
    Yang, Min
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 5258 - 5260
  • [5] Improvisation of Dataset Efficiency in Visual Question Answering Domain
    Mohamed, Sheerin Sitara Noor
    Srinivasan, Kavitha
    [J]. STATISTICS AND APPLICATIONS, 2022, 20 (02): : 279 - 289
  • [6] QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension
    Rogers, Anna
    Gardner, Matt
    Augenstein, Isabelle
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (10)
  • [7] Non-factoid Question Answering in the Legal Domain
    McElvain, Gayle
    Sanchez, George
    Teo, Don
    Custis, Tonya
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1395 - 1396
  • [8] A dataset and baselines for sequential open-domain question answering
    Elgohary, Ahmed
    Zhao, Chen
    Boyd-Graber, Jordan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1077 - 1083
  • [9] ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
    Lee, Chia-Hsuan
    Wang, Shang-Ming
    Chang, Huan-Cheng
    Lee, Hung-Yi
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 949 - 956
  • [10] MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset
    Li, Jing
    Zhong, Shangping
    Chen, Kaizhi
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8862 - 8874