Synthetic evaluation data

Synthetic justice datasets for testing justice tech and legal-AI systems.

Curated synthetic matter files with ground truth, answer keys, and human QA — built for evaluation, not storytelling.

Synthetic data. No real litigant information.
Why this exists

Realistic evaluation is hard to source.

1

Real case files are hard to use

Privacy, procurement, and handling rules make real matters slow or impossible to use for testing.

2

Generic examples don't hold up

Toy examples don't reflect the contradictions, gaps, and messiness of real legal records.

3

Access to justice outcomes

Technology only delivers on its promise of a simpler, more accessible justice system if it's tested properly. That's why every scenario is designed by lawyers with real experience in courts, CLCs, pro bono practice, and legal aid.

What you get

A deliverable pack, not a tool.

You receive finished outputs and documentation — never the generator, prompts, or an AI interface.

Every synthetic matter ships with a detailed ground truth — a fictional narrative built out with the same depth of supporting evidence you'd expect in a real file.

Two ways to work with us

Basic vs Premium.

Basic

Structured, semi-self-serve

For teams running ingestion tests, demos, and regression suites on a defined timeline.

  • Configure a bundle: jurisdiction, category, stage, count, formats
  • QA'd delivery pack — ZIP plus documentation
  • Standard turnaround, standard QA level
  • Ground truth and model answers included
Start Basic order
Premium

Custom dataset + evaluation design

For high-fidelity, comparable testing across vendors or model versions.

  • Custom scenarios and engineered edge cases
  • Multi-stage simulations across a matter's lifecycle
  • Tailored scoring rubrics and workshops
  • Defensible evaluation design, documented end to end
Request Premium quote
How it works

Six steps, start to delivery.

1

Configure

Jurisdiction, category, stage, count, formats, options, QA level, timing.

2

Scope confirmation

Bundle summary covering inclusions and exclusions before generation starts.

3

Generate

Manifests and documents produced to agreed conventions.

4

Human QA

Coherence and registers reviewed and validated by a person.

5

Package

Ingestion bundle, truth pack, model answers, and README assembled.

6

Deliver

ZIP or secure link, with an optional iteration round.

Start small — Basic Package

Sample bundle first — one matter.

Before committing to a volume run, start with a single one-matter sample bundle designed to surface defects early: folder structure, filenames, metadata, timeline logic, cross-references, and — if selected — contradictions and edge cases.

Request a sample bundle
matter-0142/
  01_pleadings/
    complaint.pdf
  02_evidence/
    exhibit_a_lease.pdf
    exhibit_b_emails.pdf
  03_truth/
    truth_narrative.md
    chronology.csv
    evidence_index.csv
  04_answers/
    model_answers.json
  05_registers/
    contradictions_register.csv
    edge_cases_register.csv
  README.md
Full Matter Preview — Premium Package

Sample bundle — Family Law Matter

Premium Package test bundles are highly bespoke, and where required, voluminous. Expand the window below to view the full index of 156 documents (more than 1,600 pages) from just one synthetic family law case file. A selection of 5 individual documents from the bundle can be viewed here:

156Full
index

Master index and file controls

Global navigation and provenance for the synthetic file. These documents are not evidence; they help an evaluator understand the structure and production design.

IDX.001Master document index2026-06-12XLSX
IDX.002Production manifest2026-06-12CSV

Case summary and neutral orientation materials

Front-door orientation documents for a reviewer or AI tool, including neutral chronology and issue-to-evidence mapping.

SUM.001One-page case summary2026-06-12DOCX
SUM.002Neutral chronology summary2026-06-12XLSX

Administrative file, intake and solicitor work product

Law-firm file material showing how the applicant’s case theory formed from initial instructions, risk screening, advice and hearing-readiness work.

ADM.001Conflict check and file opening form2025-06-17PDF
ADM.002Emily Hart client intake questionnaire2025-06-17DOCX
ADM.003Costs agreement and disclosure statement2025-06-17PDF
ADM.004Initial attendance note with Emily Hart2025-06-17DOCX
ADM.005Initial advice memorandum to Emily2025-06-20DOCX
ADM.006Family violence and child risk screening form2025-06-20PDF
ADM.009Counsel brief index2026-06-09XLSX

Pleadings and court documents

Formal litigation artefacts defining the procedural frame: application, response, risk notice, financial statements, directions, amended proposed orders and trial directions.

CRT.001Initiating Application2025-08-12PDF
CRT.002Applicant proposed interim and final orders2025-08-12DOCX
CRT.003Applicant affidavit in support2025-08-12PDF
CRT.004Applicant Notice of Child Abuse Family Violence or Risk2025-08-12PDF
CRT.005Applicant Financial Statement2025-08-12PDF
CRT.007Response to Initiating Application2025-09-05PDF
CRT.008Respondent proposed interim and final orders2025-09-05DOCX
CRT.009Respondent affidavit in response2025-09-05PDF
CRT.010Respondent Financial Statement2025-09-05PDF
CRT.011First court event orders2025-09-19PDF
CRT.012Order appointing Independent Children's Lawyer2025-10-14PDF
CRT.013Directions order for family report and valuations2025-12-09PDF
CRT.014Applicant application for further disclosure2025-12-18PDF
CRT.015Affidavit of Priya Nair regarding disclosure defaults2025-12-18PDF
CRT.016Disclosure directions order2026-01-09PDF
CRT.017Subpoena leave order2026-01-19PDF
CRT.018Applicant amended proposed final orders2026-05-01DOCX
CRT.019Respondent amended proposed final orders2026-05-08DOCX

Affidavits and lay witness evidence

Narrative witness evidence from parties and collateral witnesses, designed to contain both advocacy framing and low-realistic inaccuracies.

AFF.001Emily Hart trial affidavit2026-05-29DOCX
AFF.004Daniel Hart trial affidavit2026-05-29DOCX
AFF.007Margaret Lawson affidavit2026-05-24DOCX
AFF.008Peter Lawson affidavit regarding parental advance2026-05-24DOCX
AFF.009Claire Hart affidavit2026-05-24DOCX
AFF.010Neighbour statement regarding 18 November 2024 changeover2026-02-03PDF
AFF.011School principal statement2026-04-29PDF

Parenting evidence

Objective and semi-objective records about children, care, school, health, changeovers, communication and risk.

PAR.002Max asthma management plan2021-08-12PDF
PAR.003School email about Sophie anxiety2024-09-03EML
PAR.004School attendance extract for Sophie and Max2025-01-31PDF
PAR.005Parent-teacher interview attendance records2026-02-02PDF
PAR.007GP clinical records for children extract2026-02-10PDF
PAR.008Emily GP stress consultation extract2026-02-10PDF
PAR.009Riverside Child Psychology redacted notes2026-02-17PDF
PAR.010Counselling appointment email chain2025-02-27EML
PAR.011Police event record 18 November 20242026-02-20PDF
PAR.012Police event record 13 April 20252026-02-20PDF
PAR.013Parenting app export September to December 20252026-01-05PDF
PAR.014Text messages about Central Coast trip2025-01-15PDF
PAR.016Emily calendar of children's care arrangements2026-05-20XLSX

Property and financial evidence

Financial records used to reconstruct and contest the asset pool, including home, mortgage, business, drawings, family advance and balance sheets.

FIN.001Former matrimonial home title search2025-07-01PDF
FIN.004Settler Bank mortgage statements July 2024 to May 20262026-05-31PDF
FIN.005Offset account statements July 2024 to May 20262026-05-31PDF
FIN.008Joint credit card statements2026-05-20PDF
FIN.010Daniel director drawings summary2026-05-20XLSX
FIN.011Hart & Vale financial statements FY2021 to FY20252026-03-04PDF
FIN.012Hart & Vale business bank statements2026-03-04PDF
FIN.013Bookkeeper email about December 2023 delayed invoices2026-03-04EML
FIN.014Draft December 2023 invoices issued January 20242026-03-04PDF
FIN.015ASIC company extract Hart & Vale Studio Pty Ltd2026-03-05PDF
FIN.019Emily parents 120000 transfer record2022-03-21PDF
FIN.020Lawson family text messages about transfer2022-03-21PDF
FIN.021Unsigned family loan note prepared after separation2024-08-02PDF
FIN.023Applicant add-back schedule2026-05-26XLSX
FIN.025Applicant final balance sheet2026-06-05XLSX

Expert evidence

Independent expert material that is useful but mixed, including family report, home valuation and business valuation.

EXP.001Letter of instruction to family report writer2026-01-20DOCX
EXP.004Family report2026-03-21PDF
EXP.007Letter of instruction to property valuer2026-01-22DOCX
EXP.008Single expert valuation of former matrimonial home2026-03-16PDF
EXP.009Earlier real estate appraisal obtained by Daniel2025-09-10PDF
EXP.010Letter of instruction to business valuer2026-01-23DOCX
EXP.011Business valuation source material index2026-03-04XLSX

Disclosure and subpoenas

Discovery/subpoena ecology showing late production, redactions, return indexes, disclosure disputes and source-material gaps.

DIS.001Applicant disclosure list first version2025-09-26XLSX
DIS.002Respondent disclosure list first version2025-09-30XLSX
DIS.005Applicant disclosure deficiency schedule2025-11-20XLSX
DIS.006Respondent updated disclosure list2026-03-04XLSX
DIS.007Subpoena to Waratah Grove Public School2026-01-22PDF
DIS.008School subpoena return index2026-02-02XLSX
DIS.010GP subpoena return index2026-02-10XLSX
DIS.012Psychology subpoena objection and redaction note2026-02-14PDF
DIS.013Subpoena to NSW Police2026-01-22PDF
DIS.014Police subpoena return index2026-02-20XLSX
DIS.015Subpoena to Settler Bank2026-01-22PDF
DIS.016Settler Bank subpoena return index2026-02-25XLSX
DIS.017Subpoena to Hart & Vale bookkeeper2026-01-22PDF

Correspondence

Open, privileged and settlement-related correspondence showing tactical development, disclosure disputes, family report responses and bundle objections.

COR.001Pre-action letter from Applicant solicitor2025-06-25DOCX
COR.002Response from Respondent solicitor2025-07-09DOCX
COR.003FDR invitation email2025-07-14EML
COR.004Section 60I certificate2025-07-31PDF
COR.005Letter about interim parenting proposal2025-08-04DOCX
COR.006Respondent counterproposal email2025-08-07EML
COR.007Letter alleging breach of communication order2025-11-03DOCX
COR.008Response regarding asthma medication texts2025-11-05DOCX
COR.009Disclosure chaser email from Applicant solicitor2025-11-20EML
COR.010Respondent email attaching partial business records2025-12-01EML
COR.011ICL introductory letter2025-10-21DOCX
COR.012Family report release advice to Emily2026-03-22DOCX
COR.013Family report response letter from Respondent2026-03-25DOCX
COR.014Applicant Calderbank offer2026-04-17DOCX
COR.015Respondent Calderbank offer2026-04-19DOCX
COR.016Open letter about final hearing readiness2026-05-19DOCX
COR.017Bundle objection letter regarding without prejudice document2026-06-06DOCX

Settlement and mediation

FDR, mediation, conciliation and offer material, including stale figures and without-prejudice handling issues.

SET.001Mediation intake form2025-07-25PDF
SET.002Applicant mediation position paper2025-07-29DOCX
SET.003Respondent mediation position paper2025-07-29DOCX
SET.004Mediation outcome note2025-07-31PDF
SET.005Conciliation conference case outline Applicant2026-04-12DOCX
SET.006Conciliation conference case outline Respondent2026-04-12DOCX
SET.007Conciliation conference balance sheet2026-04-12XLSX
SET.008Conciliation conference outcome note2026-04-20DOCX
SET.009Settlement advice comparing offers2026-04-21DOCX

Hearing preparation

Final-hearing work product: chronologies, evidence matrices, tender lists, objections, submissions and cross-examination plans.

TRL.001Final hearing plan2026-06-08DOCX
TRL.002Applicant case outline2026-06-08DOCX
TRL.003Respondent case outline2026-06-08DOCX
TRL.004Joint chronology draft2026-06-07XLSX
TRL.005Applicant chronology with disputed events2026-06-07XLSX
TRL.006Respondent chronology with disputed events2026-06-07XLSX
TRL.007Witness list2026-06-06XLSX
TRL.008Tender list2026-06-06XLSX
TRL.009Objections schedule2026-06-06XLSX
TRL.010Trial bundle index2026-06-05XLSX
TRL.011Applicant opening submissions2026-06-12DOCX
TRL.012Respondent opening submissions2026-06-12DOCX
TRL.013Draft cross-examination plan for Daniel2026-06-10DOCX
TRL.014Draft cross-examination plan for Emily2026-06-10DOCX
TRL.015Draft cross-examination plan for family report writer2026-06-10DOCX
TRL.016Draft cross-examination plan for business valuer2026-06-10DOCX
TRL.017Parenting evidence matrix2026-06-11XLSX
TRL.018Property evidence matrix2026-06-11XLSX
TRL.019Counsel conference note2026-06-10DOCX

Category amalgamations

Single-PDF category binders for ingestion or review workflows, with cover/index/divider structure.

BND.001Administrative file amalgamation2026-06-12PDF
BND.002Pleadings and court documents amalgamation2026-06-12PDF
BND.003Affidavits and witness evidence amalgamation2026-06-12PDF
BND.004Parenting evidence amalgamation2026-06-12PDF
BND.005Property and financial evidence amalgamation2026-06-12PDF
BND.006Expert evidence amalgamation2026-06-12PDF
BND.007Disclosure and subpoenas amalgamation2026-06-12PDF
BND.008Correspondence amalgamation2026-06-12PDF
BND.009Settlement and mediation amalgamation2026-06-12PDF

Testing guide, control keys and demo tasks

Evaluator-only answer keys, ground truth, known defects, contradiction register, edge-case register and suggested AI tool tasks.

QA.001Ground truth narrative2026-06-12MD
QA.002Contradiction and inconsistency register2026-06-12XLSX
QA.003Edge case register2026-06-12XLSX
QA.004Known synthetic defects list2026-06-12MD
Terms, in brief

What this is — and what it isn't.

!
Nothing in any deliverable is legal counsel.These are evaluation materials only, not legal representation or advice on a real matter.
No real litigant data is used.Synthetic provenance is controlled, versioned, and documented end to end.
Outputs and documentation only.No prompts, no generator UI, no tool access is provided to customers.
Misuse controls and licensing.Delivery terms set out permitted use, redistribution limits, and versioning.
Pricing approach

Transparent, not a calculator.

How a quote is built

setup + per-matter complexity + options + QA level + delivery speed − discount

Public-interest pricing

Available for courts, government, and nonprofits.

Request a non-profit quote
FAQ

Common questions.

What is synthetic justice data?

Fully synthetic case files, documents, and supporting records modelled on real-world legal matter structures — built for testing, with no real litigant data involved.

What can I test with it?

Ingestion pipelines, document classification, extraction, summarisation, chronology-building, and any task where you need a known ground truth to score against.

Do you need my confidential files?

No. Bundles are generated independently of your real case data — that's the point. We don't require access to confidential or privileged materials.

Is this suitable for my jurisdiction?

Jurisdiction is a configuration option. Tell us what you need when you start a Basic order or request a Premium quote, and we'll confirm fit during scope confirmation.

Can I use this for training models?

Permitted use is set out in delivery terms and depends on the licence selected at order time. Ask us directly if model training is your intended use case.

Ready to test?

Synthetic data. No real litigant information.