AI Assistance and Student Problem Solving: The Role of Timing and Accuracy

Last registered on April 24, 2026

View Trial History

Pre-Trial

Trial Information

General Information

Title

AI Assistance and Student Problem Solving: The Role of Timing and Accuracy

RCT ID

AEARCTR-0018411

Initial registration date

April 20, 2026

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published

April 24, 2026, 8:56 AM EDT

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Locations

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

Primary Investigator

Name

Ipek Mumcu

Affiliation

University of Exeter

Contact Primary Investigator

Other Primary Investigator(s)

PI Name

Elif Kubilay

PI Affiliation

University of Essex

Contact Investigator

Additional Trial Information

Status

In development

Start date

2026-04-20

End date

2026-06-26

Keywords

Behavior, Education

Additional Keywords

AI, problem-solving, education

JEL code(s)

C93, D83, I20

Secondary IDs

Prior work

This trial does not extend or rely on any prior RCTs.

Abstract

This study explores how access to AI-generated solutions influences problem-solving behavior and performance in 16–18-year-old students. Participants are randomly assigned to different conditions based on when they receive AI help and whether the AI solution is correct or incorrect, plus a no-AI control group. They complete an incentivized task with AI access and a second task without it to measure effects on independent performance.

External Link(s)

Registration Citation

Citation

Kubilay, Elif and Ipek Mumcu. 2026. "AI Assistance and Student Problem Solving: The Role of Timing and Accuracy." AEA RCT Registry. April 24. https://doi.org/10.1257/rct.18411-1.0

Sponsors & Partners

Experimental Details

Interventions

Intervention(s)

The intervention consists of providing participants with access to an AI-generated solution while completing a simple problem-solving task. The intervention varies along two dimensions: the timing of access to the AI-generated solution and the accuracy of the solution provided. Participants may receive access either immediately or delayed, and the AI-generated solution may be either correct or incorrect. Participants in the pure control group receive no AI assistance while solving the problem.

Intervention Start Date

2026-04-20

Intervention End Date

2026-06-26

Primary Outcomes

Primary Outcomes (end points)

The primary outcomes are binary indicators of whether the participant submitted a correct answer in the first stage and in the second stage.

Primary Outcomes (explanation)

Secondary Outcomes

Secondary Outcomes (end points)

From the first stage of the task, the following are used as secondary outcomes:
⁃ AI usage behavior
⁃ Task completion time
⁃ An indicator for whether the participant entered a response prior to viewing the AI-generated solution
⁃ An indicator for whether the participant revised their answer after viewing the AI-generated solution
⁃ The time elapsed between the appearance of the AI-generated solution and the first subsequent edit to the answer
⁃ In cases where the participant’s answer differed from the AI-generated solution, an indicator for whether the participant subsequently revised their answer
In addition, participants’ reported confidence in their submitted answers in both stages is included as a secondary outcome.

Secondary Outcomes (explanation)

Experimental Design

The experimental design follows a 2×2 between-subjects framework with an additional pure control condition. Participants are randomly assigned to conditions that vary along two dimensions: the timing of AI access (immediate versus delayed) and the accuracy of the AI-provided solution (correct versus incorrect), or to a control condition with no AI access. Participants are randomly assigned at the individual level with equal probability across all five conditions. Participants complete a simple, incentivized problem-solving task. In the immediate access condition, participants can request an AI-generated solution at any point from the start of the task. In the delayed access condition, the AI-generated solution is not immediately accessible and becomes available during the course of the task, with the point of access standardized across all participants in this condition. Within each timing condition, participants are randomly assigned to receive either a correct or an incorrect AI-generated solution upon request. Participants in the control condition are not given any access to AI assistance and are required to solve the task independently. This results in five groups defined by the combination of access timing, solution accuracy, and the absence of AI access.

Group 1: Pure control (no AI access)
Group 2: Immediate access – correct solution
Group 3: Immediate access – incorrect solution
Group 4: Delayed access – correct solution
Group 5: Delayed access – incorrect solution

All participants are first presented with a problem and instructed to solve it within a fixed time limit. After completing the first task, all participants proceed to a second problem. In this second stage, all participants are treated identically regardless of their initial assignment. They are given a fixed time limit to solve the problem and are not provided with any AI assistance. Both stages are incentivized.

Experimental Design Details

Not available

Randomization Method

Randomization at the individual level via lottery. We implement stratified randomization by session to ensure balanced allocation of participants across treatment arms within each classroom.

Randomization Unit

individual level

Was the treatment clustered?

Experiment Characteristics

Sample size: planned number of clusters

Around 2500 high school students

Sample size: planned number of observations

Around 2500 high school students

Sample size (or number of clusters) by treatment arms

Around 500 high school students in each of the 5 treatment arms.

Minimum detectable effect size for main outcomes (accounting for sample design and clustering)

We can detect a minimum effect size of approximately 8.8 percentage points in the accuracy rate between any two experimental groups with 500 participants per group (N = 2,500 total), assuming a baseline accuracy rate of 50%, α = 0.05, and power = 0.80. With 600 participants per group (N = 3,000 total), the corresponding minimum detectable effect is approximately 8.1 percentage points.

Supporting Documents and Materials

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

IRB

Institutional Review Boards (IRBs)

IRB Name

FESE UEBS Ethics Committee

IRB Approval Date

2026-04-08

IRB Approval Number

12199543

Analysis Plan

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information