Sample size: planned number of observations
AI decision-maker: 40–80 AI runs per preregistered persona × condition cell (Stage 1: N=40; Stage 2 adds N=40 if stopping-rule criteria are met), with outcomes recorded at the run level.
Human study: 100–150 participants (US-based, recruited via Prolific), each providing 4 scenario-level observations (within-subject 2×2), for a total of ~400–600 scenario evaluations.
3.7.2026
Human decision-maker study: 80 runs (hidden-info) and 40 runs (full-info) for each of Canonical Payoffs, Expensive Fairness, and Increased Harm.
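As an arithmetic check on this allocation, the sketch below tallies the planned runs. It assumes the three payoff conditions are each run under both information conditions and that no runs are excluded. The variable and function names are illustrative only and are not part of the study materials.

```python
# Planned runs per information condition, as stated in the entry above.
RUNS_PER_INFO_CONDITION = {"hidden-info": 80, "full-info": 40}

# Payoff conditions named in the entry above.
PAYOFF_CONDITIONS = ["Canonical Payoffs", "Expensive Fairness", "Increased Harm"]


def planned_runs():
    """Return (runs per payoff condition, total runs across all payoff conditions)."""
    per_condition = sum(RUNS_PER_INFO_CONDITION.values())
    total = per_condition * len(PAYOFF_CONDITIONS)
    return per_condition, total


per_condition, total = planned_runs()
print(f"{per_condition} runs per payoff condition, {total} runs in total")
```

Under these assumptions the plan comes to 120 runs per payoff condition and 360 runs overall.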