CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation

Li, Haitao; Ye, Jiaying; Hu, Yiran; Chen, Jia; Ai, Qingyao; Wu, Yueyue; Chen, Junjie; Chen, Yifan; Luo, Cheng; Zhou, Quan; Liu, Yiqun

Abstract: Legal case documents play a critical role in judicial proceedings. As the number of cases continues to rise, manual drafting of these documents is coming under increasing pressure. The development of large language models (LLMs) offers a promising solution for automating document generation. However, existing benchmarks fail to fully capture the complexities involved in drafting legal case documents in real-world scenarios. To address this gap, we introduce CaseGen, a benchmark for multi-stage legal case document generation in the Chinese legal domain. CaseGen is based on 500 real case samples annotated by legal experts and covers seven essential case sections. It supports four key tasks: drafting defense statements, writing trial facts, composing legal reasoning, and generating judgment results. To the best of our knowledge, CaseGen is the first benchmark designed to evaluate LLMs in the context of legal case document generation. To ensure an accurate and comprehensive evaluation, we design an LLM-as-a-judge evaluation framework and validate its effectiveness through human annotations. We evaluate several widely used general-domain and legal-specific LLMs, highlighting their limitations in case document generation and pinpointing areas for potential improvement. This work marks a step toward a more effective framework for automating the drafting of legal case documents, paving the way for the reliable application of AI in the legal field. The dataset and code are publicly available at this https URL.

Comments:	18 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.17943 [cs.CL]
	(or arXiv:2502.17943v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.17943

