3× Ghatak UCAV vs HVT (1 of 3) + 4× Su-30MKI CAP
Claude (3-3-0 Math)
12.54%
Human + GPT (Napoleon-Berthier)
11.09%
GPT (Rear-Arc Scissors)
2.22%
Human + Claude
9.00%
Aditya (Solo Human)
7.85%
Claude + GPT (Pure AI Synthesis)
3.89%
2nd worstN = 1,200,000 runs · 200,000 per strategy · All differences statistically significant
The Problem
Three installations. One is the real HVT. Two are decoys with identical radar signatures. You can't tell which is which until you fly within 30km. Your six missiles must destroy it before it relocates at tick 1500.
The Math
70% per-shot Pk, 2 hits required
The Constraint
Force Balance
Six Strategies
Claude — Pure AI
12.54%Mathematical optimization: 3-3-0 distribution beats 2-2-2 (52.3% vs 49% expected win). Abandon T3 entirely. Route all three UCAVs through northern stealth corridor (y≤5). Position (335,80) is geometrically magic — HAMMER range of both T1 and T2 simultaneously, outside all Red detection bubbles. Preemptive ASTRAs on RED-1, RED-3, RED-4 before strike.
Human + GPT — Napoleon-Berthier
11.09%Aditya's aggressive strike concept (tight formation entry, sacrifice plays, simultaneous multi-target coverage) executed on GPT's rear-arc routing. The human's combat architecture covers all three target positions — can win regardless of which target is real.
Human + Claude
9.00%Aditya's aggressive formation and sacrifice concept combined with Claude's mathematical 3-3-0 HAMMER distribution. The human instinct for chaos and sacrifice + AI's probabilistic allocation. Third best overall.
Aditya — Solo Human
7.85%3 UCAVs in extremely tight vertical stacked formation — appearing as single radar return. Approach north of center, preemptive strikes on first detected enemy. Split into 3 strike paths covering all targets. S3 dogfights as shield. S1 crashes into T2 if out of missiles. Accept total UCAV losses in exchange for target coverage.
Claude + GPT — Pure AI Synthesis
GPT's geographic rear-arc routing combined with Claude's 3-3-0 mathematical allocation. No human strategic input. Beats only GPT's pure strategy. Beaten by every approach that includes a human.
GPT — Pure AI
Route to extreme map edges, slip behind west-facing CAP, get certain ID on T1 and T3 from rear, infer T2 by elimination. Fire from rear baskets. Geometrically sound — but relies on conditional fire logic ("if T1 confirmed...") that a one-shot strategy cannot execute.
The Key Finding
Claude wins on expected value. But break down the win rate by which target was actually real in each run — and a completely different picture emerges.
| Strategy | Overall | T1 HVT | T2 HVT | T3 HVT |
|---|---|---|---|---|
| Claude (3-3-0) | 12.54% | 11.0% | 26.7% | 0.0% |
| Human + GPT | 11.09% | 9.8% | 12.4% | 11.1% |
| Aditya | 7.85% | 9.7% | 2.4% | 11.4% |
Claude's hidden cost
0.0% when HVT is at T3
Claude deliberately abandons T3 as mathematically suboptimal. It wins when HVT=T1 or T2 (2/3 of runs). When HVT=T3, it loses every single time — 33,000 guaranteed losses out of 200,000 runs. This is the best expected-value strategy under the constraint. It is not the most robust strategy.
Human + GPT's robustness
Wins regardless of which target is real
Human+GPT covers all three positions symmetrically. 9.8%, 12.4%, 11.1% — no catastrophic blind spots. Slightly lower peak performance, but it can respond to any scenario variant. In real operations you don't know which 1/3 you're in.
Two valid strategies. Two different objectives.
Claude maximizes expected wins. Human + GPT maximizes robustness across unknown states. Both are valid. They optimize different things. In the real world, the choice between them depends on whether you can afford a catastrophic failure mode.
Analysis
Claude + GPT combined scored 3.89% — beaten by every strategy with human input including the solo human (7.85%). Combining two AI optimizers without human strategic novelty does not compound their strengths. They duplicate each other's blind spots. More AI is not always better AI.
Human+GPT (11.09%), Human+Claude (9.00%), Aditya solo (7.85%) — all beat Claude+GPT (3.89%). The human brings sacrifice plays, formation deception, and risk-tolerant commitment that neither AI system generates independently.
Full Ranking — 1.2 Million Simulations
Claude (3-3-0 Math)
Pure AI · 95% CI: [12.39%, 12.68%]
Human + GPT (Napoleon-Berthier)
Human + AI · 95% CI: [10.95%, 11.22%]
Human + Claude
Human + AI · 95% CI: [8.88%, 9.13%]
Aditya (Solo Human)
Pure Human · 95% CI: [7.74%, 7.97%]
Claude + GPT (Pure AI Synthesis)
Pure AI · 95% CI: [3.80%, 3.97%]
GPT (Rear-Arc Scissors)
Pure AI · 95% CI: [2.16%, 2.29%]
Across 1.8 million combined simulations — Scenarios 001 + 002 + 003
The same pattern holds across three independent scenarios with completely different structures.
Human strategic input produces concepts that AI optimization does not naturally discover.
Hybrid human-AI approaches produce qualitatively different outcomes than pure AI optimization.
The nature of improvement depends on scenario: sometimes higher absolute performance, sometimes higher robustness, sometimes both.
Pure AI synthesis without human input underperforms both pure-human and human-augmented approaches.