Experimental Design Details
The experiment consists of four parts. In the first part, workers earn real money through a real-effort task. In the second part, spectators decide how to redistribute the earnings between a randomly drawn pair of workers. In the third part, reviewers observe the spectator's decision and determine whether they want to intervene and modify the earnings allocation. In the fourth part, workers receive payments based on the final redistribution determined by spectators and reviewers. This study primarily focuses on reviewers' decisions, while workers and spectators establish a real economic setting with tangible consequences.
Workers:
600 workers are recruited on Prolific. When recruited, workers are promised a participation fee of 0.50 USD, and they are told that they could earn additional money, depending on the actions they and others will take in the experiment. After completing a real-effort assignment, workers are randomly paired. In each pair, one worker receives an additional reward of 6 USD, while the other receives nothing. The assignment follows one of two possible criteria:
- Merit: The worker with the higher performance in the pair receives 6 USD.
- Luck: The worker who receives 6 USD is randomly selected.
The criterion for each pair is determined randomly with a 50/50 probability. Thus, half of the workers (150 pairs) have their initial earnings assigned based on performance, while the other half receive earnings based on luck. For clarity, I henceforth refer to the worker who receives 6 USD as Blue Worker, regardless of the adopted criterion—merit or luck. Workers are informed about the allocation mechanism, but do not know which criterion was used in their specific case. After completing the effort task, they are told that a third party—the spectator—will observe the initial distribution of earnings and the criterion (merit or luck) that determined the earnings. The spectator will then have the opportunity to redistribute the earnings between the two workers in the pair.
Spectators:
There are two types of spectators:
- Human spectators: 300 participants, recruited online via Prolific. They receive a fixed payment for participation and do not overlap with the workers' sample. The choice they have to make will have consequences for a real-life situation and is therefore incentive-compatible (this incentive assumes that spectators care about the earnings of others; otherwise, purely selfish spectators would never intervene).
- AI spectators: 300 artificial agents represented by ChatGPT-4.1, receiving as a prompt the same instructions that the human spectators see.
Each unique pair of workers is assigned to both a human spectator and an AI spectator. The spectators decide whether and how to redistribute the initial earnings. Spectators are fully informed about the effort task completed by workers, the criterion used to assign initial earnings (Merit or Luck), and the fact that workers were unaware that their performance would be observed for redistribution purposes. Each spectator completes the assignment for two pairs of workers: one in the Merit condition and one in the Luck condition. The order of these conditions is randomized to control for order effects. We elicit 1200 redistributions, but there are only 300 unique pairs of workers. Therefore, there is a 25% probability that a redistribution choice made by the spectator is actually implemented and evaluated by a reviewer.
Reviewers:
The same 300 subjects who participated as human spectators are invited to a follow-up session one week later, where they act as reviewers. Their task is to evaluate and, if desired, revise a redistribution decision made by a spectator. Initially, reviewers have incomplete information: they only observe the final earnings of both workers after the spectator's redistribution, without knowing whether the initial allocation of earnings was determined by merit or luck. They are, however, informed about the nature of the spectator (human or AI) responsible for the decision. Reviewers have two possible choices:
- No intervention: They accept the current earnings allocation of workers.
- Intervention: They pay a small but non-negligible fee (0.50 USD) deducted from their own earnings to reveal the original payoff criterion (Merit or Luck). If they pay the fee, they are also allowed to modify the earnings allocation as they prefer.
The decision is elicited using the strategy method. Each reviewer observes all four allocations {(6,0), (5,1), (4,2), (3,3)} and, for each, decides whether to intervene and potentially redistribute the earnings or not. Reviewers complete this task for both a human spectator and an AI spectator. The sequence of these two treatment conditions is randomized to mitigate order effects. Hence, each reviewer faces the decision task eight times: (AI or Human) X {(6,0), (5,1), (4,2), (3,3)}. One of these choices is payoff-relevant for a pair of workers and the reviewer herself, but the reviewers do not know which one is, as they make their decisions. To prevent any strategic behavior, all participants remain fully anonymous, and reviewers are never matched with decisions they previously made as spectators.
Belief Elicitation:
I elicit reviewers' beliefs about the source of inequality, conditional on redistribution choices and the type of spectator. After completing their redistribution decisions as spectators, subjects are presented with a redistribution made by another spectator (human or AI) and must estimate the probability that the workers were in the Merit condition (pM) or the Luck condition (pL), given the spectator's type and the observed allocation.
Each subject evaluates eight different scenarios: (AI or Human) X {(6,0), (5,1), (4,2), (3,3)}. To incentivize truthful reporting, I use the binarized scoring rule. I do not provide explicit details about the scoring mechanism to minimize distortions in reported beliefs. Participants are simply informed that their best estimate maximizes their expected earnings, with further details on the payment rule available in a clickable link that opens a PDF containing a description of the scoring rule. I randomize the order of the spectator's type (AI or Human) that subjects will evaluate.