Inconsistent Risk Preferences (Online)

Last registered on May 11, 2026

View Trial History

Pre-Trial

Trial Information

General Information

Title

Inconsistent Risk Preferences (Online)

RCT ID

AEARCTR-0018584

Initial registration date

May 09, 2026

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published

May 11, 2026, 9:27 AM EDT

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Locations

Country

United Kingdom of Great Britain and Northern Ireland

Region

Primary Investigator

Name

Sascha Füllbrunn

Affiliation

Radboud University

Contact Primary Investigator

Other Primary Investigator(s)

Additional Trial Information

Status

In development

Start date

2026-05-08

End date

2026-05-13

Keywords

Behavior, Finance & Microfinance

Additional Keywords

Risk Preferences, Experiment, Behavioral Finance

JEL code(s)

Secondary IDs

Prior work

This trial does not extend or rely on any prior RCTs.

Abstract

This study tests whether risk preferences elicited via the Gneezy–Potters investment task are consistent across different expected value conditions. In a within-subjects design, participants make investment decisions under three scenarios that vary only in expected returns while keeping the overall structure constant. The experiment randomizes condition order and pays out only one condition to limit hedging behavior. By comparing decisions across conditions, the design evaluates whether standard classifications of risk preferences remain stable. The results provide a direct assessment of the reliability of commonly used experimental risk measures.

External Link(s)

Registration Citation

Citation

Füllbrunn, Sascha. 2026. "Inconsistent Risk Preferences (Online)." AEA RCT Registry. May 11. https://doi.org/10.1257/rct.18584-1.0

Sponsors & Partners

Experimental Details

Interventions

Intervention(s)

We will vary the expected value of a risky asset (positive, zero, negative) using a Gneezy-Potters (1997) experimental design within subjects. Between subjects, we vary the order in which the different treatments are presented (positive-zero-negative, negative-zero, positive) and also the winning probability (1/3 vs 1/6 probability).

Intervention (Hidden)

Subjects receive an endowment of 150 pence and bet in a lottery. When successful (probability p), the lottery pays 150+k*bet, otherwise, the payment is 150-bet. They simultaneously decide for three different lotteries using a payment generator. For each lottery, they simulate the outcomes yielding a table. Then they select one row for each lottery that is finally implemented.

The parameter set (k_pos, k_zero, k_neg, p, order) of the four treatments is
i) (2.5 ; 2 ; 1.5 ; 1/3 ; pos-zero-neg)
ii) (2.5 ; 2 ; 1.5 ; 1/3 ; neg-zero-pos)
iii) (8 ; 5 ; 2 ; 1/6 ; pos-zero-neg)
iii) (8 ; 5 ; 2 ; 1/6 ; neg-zero-pos)

Hence, we consider a within subject design for the variation in k, and a between subject design in the variation of order and probability (2x2).

Intervention Start Date

2026-05-09

Intervention End Date

2026-05-11

Primary Outcomes

Primary Outcomes (end points)

The decision maker has an endowment to invest. The amount invested in the three environments POS, ZERO, and NEG are the three outcome variable.

We have the following hypotheses:
1. Incentive Compatibility: Investment increases with expected value (Inv(POS)>INV(ZERO)>INV(NEG)
2. Stability of Risk Preferences: Risk-preference classifications inferred in the POS environment remain stable across environments. That is, if a participant is classified as risk-averse, risk-neutral, or risk-seeking in POS, the same classification should apply in ZERO and NEG.
3. Rank-Order Persistence: If individual risk-preference classifications are not stable across environments, the relative ordering of participants’ investment levels should still persist across POS, ZERO, and NEG.

We test all those in the high and low winning probability environment.

Primary Outcomes (explanation)

Secondary Outcomes

Secondary Outcomes (end points)

Secondary Outcomes (explanation)

Experimental Design

Prolific participants randomly enter one of the four conditions. They get a welcome screen and then decide as described above. In addition, they answer questions on Age, Sex, General Risk, and understanding of the task.

For each participant, a single virtual die roll determines the outcome for all three lotteries. Eventually, only one lottery will be randomly selected for their bonus payment, with each lottery equally likely to be chosen. The participants final payment is the fixed fee plus the bonus payment.

After this experiment, there is a second in the Prolific session which is about grocery shopping decisions. That one is unrelated to the risk taking experiment as it was added after the risky decision.

Experimental Design Details

Randomization Method

Prolific participants from the UK, aged 20-50, will be randomly allocated to the treatments with a 50:50 sex split.

Randomization Unit

individual

Was the treatment clustered?

Experiment Characteristics

Sample size: planned number of clusters

240 individuals

Sample size: planned number of observations

240

Sample size (or number of clusters) by treatment arms

60 observations for each treatment

Minimum detectable effect size for main outcomes (accounting for sample design and clustering)

H1) Trend Test on Pos>Zero>Neg we code as a variable with 1,0,-1. We estimate mixed-effects regressions with participant-level random effects and interactions between the trend variable and the treatment conditions. The power analysis is based on standard deviations from a previous classroom experiment (SDs ranging from 25 to 50) and high within-subject correlations across environments (approximately 0.8–0.9). Assuming 240 participants, 80% power, and a 5% significance level, the approximate minimum detectable effects are: Trend: 2.5–3; Trend × treatment interaction: 5–6; Three-way interaction: 10–12, measured on the 0–150 decision scale. Overall, the design is well powered to detect small trend effects and moderate treatment differences in trends. H3) Rank preservation: To test this hypothesis, participants are ranked separately within each environment, and Kendall’s coefficient of concordance W is computed across the three rankings. Kendall’s W ranges from 0 (no rank agreement) to 1 (perfect rank preservation). The power analysis for Kendall’s W uses a simulation-based approach with: N=240, three repeated environments, 5,000 simulation replications, and a two-sided significance level of 5%. The simulations assume latent correlations across environments ranging from weak to strong dependence. The minimum detectable concordance effect with 80% power corresponds approximately to ρ≈0.15 which reflects relatively weak rank stability. Pilot data from a previous classroom experiment suggest substantially stronger associations, with observed pairwise correlations around 0.8 or higher. Under such conditions, statistical power is expected to be substantially above 80%, implying that the planned sample size is sufficient to detect even moderate deviations from perfect rank preservation.

Supporting Documents and Materials

IRB