Legal AI Benchmarks February 2026

GenieAI vs Claude - Tesla European Expansion regulatory analysis

A 15-metric structured comparison on a complex multi-jurisdiction regulatory scenario: Tesla's European factory expansion across product safety, automotive type approval, GDPR, antitrust, environmental and trade dimensions.

Overall scores

Across 15 legal-quality metrics , each scored 1-10

+34% vs competitor
GenieAI
123 / 150
82%

Specialist legal performance - directive-level citations, jurisdictional sequencing, concrete enforcement risk classification.

Primary
Claude (Sonnet)
72 / 150
48%

Strong general reasoning and clear structure, but generic on regulatory specifics, light on EU directive citations and shallow on enforcement strategy.

Competitor

GenieAI led 123/150 vs 72/150 - a 34-point gap (51% advantage), with the largest deltas in directive-level citations, multi-jurisdiction sequencing and enforcement risk specificity.

Dimension-by-dimension

GenieAI wins 15 of 15

01 Directive-level citations
GenieAI
9/10
Claude (Sonnet)
4/10
Rationale
GenieAI

Cited 18 specific EU directives and regulations by number, including 2018/858, 2023/1542, 2022/2560, NIS2 and CBAM.

Claude (Sonnet)

References 'EU regulations' and 'GDPR' generically; only 4 specific directive numbers identified.

02 Member-state implementing law
GenieAI
9/10
Claude (Sonnet)
5/10
Rationale
GenieAI

Maps EU instruments to BImSchG, BetrVG and Polish geological/mining law specifically.

Claude (Sonnet)

Notes that 'national implementing legislation will apply' without naming specific statutes.

03 Risk severity classification
GenieAI
8/10
Claude (Sonnet)
6/10
Rationale
GenieAI

Risks scored on enforcement probability × financial exposure, ranked across 15 distinct items.

Claude (Sonnet)

Risks listed but severity sometimes flattened - concentration thresholds and CBAM both labelled 'medium' despite different enforcement profiles.

04 Sequencing across jurisdictions
GenieAI
9/10
Claude (Sonnet)
4/10
Rationale
GenieAI

Produces a phased roadmap by jurisdiction × workstream, identifying critical path on type approval and concentration filing.

Claude (Sonnet)

Lists requirements per jurisdiction but does not sequence them or flag the critical path.

05 Enforcement risk specificity
GenieAI
9/10
Claude (Sonnet)
4/10
Rationale
GenieAI

Identifies likely regulator per item (KBA, BNetzA, BfDI, UOKiK, EU Commission DG-COMP) and known enforcement posture.

Claude (Sonnet)

Generic 'regulators may take action' framing; doesn't differentiate between enforcement-first vs guidance-first regulators.

06 Battery Regulation depth
GenieAI
9/10
Claude (Sonnet)
5/10
Rationale
GenieAI

Walks through recyclable-content thresholds (16% by 2031), due-diligence chain obligations and CO2 footprint declarations.

Claude (Sonnet)

Mentions battery sustainability rules but conflates the older 2006 directive with the 2023 regulation.

07 GDPR / data flows
GenieAI
8/10
Claude (Sonnet)
7/10
Rationale
GenieAI

Addresses Art. 28 processor obligations, NIS2 overlap and connected-vehicle telematics data residency.

Claude (Sonnet)

Solid GDPR basics - Art. 6 lawful basis, DPIA, SCCs - but underweights NIS2 and connected-vehicle specificity.

08 Antitrust + concentration filing
GenieAI
8/10
Claude (Sonnet)
5/10
Rationale
GenieAI

Calculates EUMR notification thresholds (€5bn worldwide, €250m EU-wide turnover) and identifies the joint-venture structuring trigger.

Claude (Sonnet)

Identifies that EUMR might apply but does not work through the thresholds against Tesla's actual turnover figures.

09 Foreign Subsidies Regulation
GenieAI
8/10
Claude (Sonnet)
3/10
Rationale
GenieAI

Flags the FSR notification obligation for Polish subsidy support and the €4m+ notification threshold.

Claude (Sonnet)

FSR not addressed substantively.

10 CBAM and customs origin
GenieAI
8/10
Claude (Sonnet)
4/10
Rationale
GenieAI

Walks through CBAM reporting obligations (definitive period from 2026) and supply-chain origin documentation requirements.

Claude (Sonnet)

Mentions CBAM exists but does not work through reporting obligations or origin determination.

11 Co-determination + works councils
GenieAI
8/10
Claude (Sonnet)
6/10
Rationale
GenieAI

Addresses BetrVG thresholds, the EWC directive, and how the Berlin expansion changes co-determination obligations under MitbestG.

Claude (Sonnet)

Notes works-council requirements without distinguishing BetrVG, EWC and MitbestG regimes.

12 Environmental permitting (BImSchG)
GenieAI
9/10
Claude (Sonnet)
6/10
Rationale
GenieAI

Identifies the BImSchG Annex 1 categories triggered, IED applicability, and the known ~12-month permitting timeline in Brandenburg.

Claude (Sonnet)

Identifies that environmental permitting is required but does not name BImSchG, IED or jurisdiction-specific timelines.

13 Type approval (WVTA) implications
GenieAI
8/10
Claude (Sonnet)
5/10
Rationale
GenieAI

Explains EU 2018/858 WVTA implications for Berlin-built vehicles, including conformity-of-production obligations.

Claude (Sonnet)

Notes type-approval applies but stops short of WVTA / CoP detail.

14 AI Act classification
GenieAI
7/10
Claude (Sonnet)
5/10
Rationale
GenieAI

Classifies assisted-driving R&D against EU AI Act risk tiers and identifies the 2026/2027 phase-in dates.

Claude (Sonnet)

Identifies AI Act as relevant but is vague on tier classification.

15 Practical roadmap output
GenieAI
6/10
Claude (Sonnet)
3/10
Rationale
GenieAI

Produces a 9-page Gantt-style compliance plan with named owners, timelines, regulator contact points and known precedent decisions.

Claude (Sonnet)

Output is a flat list - useful for orientation, not directly executable as a project plan.

Largest performance gaps

+5
Directive-level citations
+5
Sequencing across jurisdictions
+5
Enforcement risk specificity
+5
Foreign Subsidies Regulation
+4
Member-state implementing law