Experimental Design
The evaluation is a three-arm individually randomized controlled trial conducted in a single large undergraduate economics course, AAEC 1006 (Principles of Macroeconomics), Spring 2026 at Virginia Tech. Random assignment is at the student level. The arms are:
Group A â Retrieval, No AI (Lockdown Browser, no external resources)
Group B â Retrieval, With AI (No Lockdown Browser, Microsoft Copilot permitted)
Group C â Placebo Control (Lockdown Browser, reads passage and answers a different question)
Participation eligibility is determined by enrolment in AAEC 1006 in Spring 2026 and completion of the Canvas consent quiz. Out of 154 enrolled students, 141 gave affirmative consent. All 141 were randomly assigned to three arms, each arm with 41 students each. Participation is incentivized by extra-credit points (not part of normal grading) tied to the post-treatment graded exercise.
The comparison between A vs. C: identifies the effect of retrieval practice. B vs. C: identifies the effect of AI-supported practice. B vs. A: isolates the marginal effect of AI access conditional on engaging in practice exercise.
Stratification and randomization: Students were sorted in ascending order by Test 1 score (the pre-randomization exam) and divided into four ability strata of sizes 36, 36, 36, and 33. These sizes were chosen so that each stratum is divisible by three and the four strata sum to the full sample of 141, guaranteeing an exact 47/47/47 split across arms. Within each stratum, treatment was assigned by block randomization: an arm list with each label (A, B, C) repeated in equal proportion (12 times in the first three strata and 11 times in the fourth) was randomly shuffled, and students were paired with the shuffled list element-wise. Pre-treatment Test 1 scores did not differ significantly across arms (one-way ANOVA F = 0.18, p = 0.833; all pairwise Welch t-tests p > 0.5).