Generative AI and the Restructuring of Workplace Collaboration

Trial Information

General Information

Title

RCT ID

AEARCTR-0018958

Initial registration date

June 17, 2026

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published

June 23, 2026, 8:25 AM EDT

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Locations

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

Primary Investigator

Name

Mariana Oseguera

Affiliation

Georgetown University

Contact Primary Investigator

Other Primary Investigator(s)

PI Name

Federica De Stefano

PI Affiliation

Oxford

Contact Investigator

Additional Trial Information

Status

In development

Start date

2026-07-15

End date

2027-02-28

Keywords

Behavior, Labor

Additional Keywords

JEL code(s)

Secondary IDs

Prior work

This trial does not extend or rely on any prior RCTs.

Abstract

This study examines how access to generative AI changes the way workers undertake collaborative tasks with peers and supervisors, and the downstream consequences for creative output and learning. In a field experiment crossing team structure (Individual vs. Team) with technology access (No AI vs. AI), we will test how AI access changes (1) the frequency with which workers consult teammates and supervisors, capturing whether they reallocate time toward or away from collaboration; (2) the timing of those consultations within their own work process, revealing whether they brainstorm collaboratively up front or work independently first to avoid collaboration's costs; and (3) the content of collaboration — its frictions and its balance of routine and non-routine cognitive tasks — capturing what collaboration is used for. We will also estimate effects on productivity, output quality, and learning.

External Link(s)

Registration Citation

Citation

De Stefano, Federica and Mariana Oseguera. 2026. "Generative AI and the Restructuring of Workplace Collaboration." AEA RCT Registry. June 23. https://doi.org/10.1257/rct.18958-1.0

Sponsors & Partners

Experimental Details

Interventions

Intervention(s)

This is a 2×2 design that crosses team structure (Individual vs. Team) with technology access (No AI vs. AI Access), giving four conditions: Individual with no AI, Individual with AI, Team with no AI, and Team with AI. Our main interest is the effect of AI access within the Team conditions. Within those Team conditions we also randomize the basis of the performance bonus, rewarding workers either on expert ratings of their output or on their collaborator's evaluation. Workers are recruited through Upwork's normal hiring process against set skill and eligibility criteria, complete a real deliverable on a platform we built, and can consult a supervisor who holds firm-specific information. We log all communication and AI use, and independent expert raters evaluate the output.
Beyond the main 2×2 contrast, we will run three secondary analyses. First, supervisor consultation across the AI and No AI conditions, which identifies whether AI substitutes for the firm-specific knowledge the supervisor holds or changes the cost of consulting them (H1, H2). Second, the bonus basis within the Team conditions, which examines whether the return to collaboration drives AI's effect on collaborative behavior, comparing pairs rewarded on their collaborator's evaluation to those rewarded on expert ratings (H1). Third, worker status, which investigates whether AI's effect on collaboration differs across the status distribution (H3), using the baseline characteristics on which we stratify. Because Upwork makes worker ratings and experience public, we have a platform-validated measure of status to draw on here.

Intervention Start Date

2026-07-22

Intervention End Date

2026-11-30

Primary Outcomes

Primary Outcomes (end points)

Our main outcomes fall into two groups. The first is how workers collaborate: how often a worker consults peers and the supervisor; when in their process those consultations happen, and the content of the collaboration, both its routine versus non-routine cognitive content and its communicative function. The second is the work itself: the quality, novelty, and firm-specific integration of the output, judged by independent expert raters at the component and the full-deliverable level.

Primary Outcomes (explanation)

The collaboration measures come from the platform's logs of communication between teammates and with the supervisor. Frequency is a count of the consultations a worker initiates. Timing is where those consultations fall across the stages of the task, summarized as the share that happen before versus after the worker drafts their own component. Content is built by classifying each exchange two ways: whether it is routine or non-routine cognitive work (Autor, Levy & Murnane 2003), and what function it serves (explanatory talk, complex problem-solving, integrative critique, help-seeking, coaching, or task-information sharing). We automate this classification and have trained research assistants cross-validate it. Output quality, novelty, and firm-specific integration come from independent expert evaluations collected on Prolific, where each deliverable is rated by several raters who do not know which condition produced it.

Secondary Outcomes

Secondary Outcomes (end points)

How workers use AI, measured the same way as peer collaboration: the frequency, timing, and content of their AI queries. How often workers consult the supervisor across the AI and No AI conditions. Workers' ratings of their teammate after the task. And post-task self-reports on how they used firm-specific information, whether they found the collaboration and the AI useful, whether they would take on a similar project again, and which working arrangement they would prefer.

Secondary Outcomes (explanation)

The AI and supervisor measures come from the same platform logs as the peer-collaboration measures. The teammate ratings and self-reports come from a short post-task survey. Together these support the secondary analyses: whether AI substitutes for the firm-specific knowledge the supervisor holds, whether the return to collaboration drives AI's effect on how workers collaborate, and how AI's effect on collaboration differs across the worker status distribution.

Experimental Design

This is a 2×2 design that crosses team structure (Individual vs. Team) with technology access (No AI vs. AI Access), giving four conditions: Individual with no AI, Individual with AI, Team with no AI, and Team with AI. Our main interest is the effect of AI access within the Team conditions. Within those Team conditions we also randomize the basis of the performance bonus, rewarding workers either on expert ratings of their output or on their collaborator's evaluation. Workers are recruited through Upwork's normal hiring process against set skill and eligibility criteria, complete a real deliverable on a platform we built, and can consult a supervisor who holds firm-specific information. We log all communication and AI use, and independent expert raters evaluate the output.

Experimental Design Details

Not available

Randomization Method

. We sort workers into blocks defined by baseline characteristics we observe before assignment — platform tenure, prior work experience, and demographics — and randomize them to conditions within each block.

Randomization Unit

The unit assigned to a condition is the individual worker in the Individual conditions and the worker pair in the Team conditions. The bonus-basis treatment is randomized within the Team conditions at the pair level.

Was the treatment clustered?

Yes

Experiment Characteristics

Sample size: planned number of clusters

About 110 individual workers in the Individual conditions plus 110 worker pairs in the Team conditions — roughly 220 randomized units.

Sample size: planned number of observations

330 workers: 110 in the Individual conditions and 220 in the Team conditions.

Sample size (or number of clusters) by treatment arms

Individual conditions: about 55 workers with no AI and 55 with AI. Team conditions: about 110 workers (55 pairs) with no AI and 110 workers (55 pairs) with AI, with the bonus-basis treatment randomized within each Team arm.

Minimum detectable effect size for main outcomes (accounting for sample design and clustering)

The design detects a minimum effect of about 0.40 standard deviations on the main collaboration outcome at 80% power and two-sided α = 0.05, following Athey & Imbens (2017). This is in line with effects from recent field experiments on generative AI and teamwork, which fall around 0.2 to 0.45 standard deviations on output and quality measures (Noy & Zhang 2023; Dell'Acqua et al. 2025), and is comparable to the teamwork effects found in online labor markets (Lyons 2017).

Supporting Documents and Materials

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

IRB