CJRC:A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

来源 :第十八届中国计算语言学大会暨中国中文信息学会2019学术年会 | 被引量 : 0次 | 上传用户:fangfang200805
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  We present a Chinese judicial reading comprehension(CJRC)dataset which contains approximately 10K documents and almost 50K questions with answers.The documents come from judgment documents and the questions are annotated by law experts.The CJRC dataset can help researchers extract elements by reading comprehension technology.Element extraction is an important task in the legal field.However,it is difficult to predefine the element types completely due to the diversity of document types and causes of action.By contrast,machine reading comprehension technology can quickly extract elements by answering various questions from the long document.We build two strong baseline models based on BERT and BiDAF.The experimental results show that there is enough space for improvement compared to human annotators.
其他文献
Knowledge base question answering aims to answer natural language questions by querying external knowledge base,which has been widely applied to many real-world systems.Most existing methods are templ
Multiple-choice reading comprehension task has seen a recent surge of popularity,aiming at choosing the correct option from candidate options for the question referring to a related passage.Previous w
学位
学位
学位
In the e-commerce websites,such as Taobao and Amazon,interactive question-answering(QA)style reviews usually carry rich aspect information of products.To well automatically analyze the aspect informat
Natural Language Inference(NLI),which is also known as Recognizing Textual Entailment(RTE),aims to identify the logical relationship between a premise and a hypothesis.In this paper,a DCAE(Directly-Co
The neural components in deep learning framework are crucial for the performance of many natural language processing tasks.So far there is no systematic work to investigate the influence of neural com
Legal Cause Prediction(LCP)aims to determine the charges in criminal cases or types of disputes in civil cases according to the fact descriptions.The research to date takes LCP as a text classificatio
会议
Natural language inference(NLI)aims to predict whether a premise sentence can infer another hypothesis sentence.Models based on tree structures have shown promising results on this task,but the perfor