Human-AI collaboration – experimental evidence

Last registered on November 02, 2024

View Trial History

Pre-Trial

Trial Information

General Information

Title

Human-AI collaboration – experimental evidence

RCT ID

AEARCTR-0012135

Initial registration date

September 17, 2023

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published

September 20, 2023, 10:56 AM EDT

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Last updated

November 02, 2024, 9:32 PM EDT

Last updated is the most recent time when changes to the trial's registration were published.

Locations

Country

China

Region

Beijing

Primary Investigator

Name

Jinwen Xia

Affiliation

BNU

Contact Primary Investigator

Other Primary Investigator(s)

PI Name

Haoran He

PI Affiliation

BNU

Additional Trial Information

Status

In development

Start date

2023-08-28

End date

2025-12-31

Keywords

Behavior

Additional Keywords

Human-AI collaboration, Field experiment

JEL code(s)

C93, I11

Secondary IDs

Prior work

This trial does not extend or rely on any prior RCTs.

Abstract

We conduct an audit study in the online healthcare market to analyze the effect of information with various characteristics on physicians’ behavior and the quality of their healthcare.

External Link(s)

Registration Citation

Citation

He, Haoran and Jinwen Xia. 2024. "Human-AI collaboration – experimental evidence." AEA RCT Registry. November 02. https://doi.org/10.1257/rct.12135-2.0

Sponsors & Partners

Experimental Details

Interventions

Intervention(s)

N/A

Intervention (Hidden)

We use an online audit experiment to study the impact of AI-assisted advice under various conditions (independent variables) on the quality of healthcare (outcome) in the “Chunyu Doctor” platform, where online physician consultations are common and the availability of AI consultation for patients has grown recently. Additionally, attending physicians constitute the foundation of the platform’s online consultation service.

The unit of analysis is the physician who conducts the online consultation on the platform. We use the Internet to send a number of standard patients to physicians, completing the consultation and collecting behavioral data on physicians’ responses to various treatments. The experimental interventions varying in (1) the source of information (AI or human), (2) the subjective norm (induced or not), and (3) the key items for diagnosing the unstable angina (include or not). Three more questions are added after the experiment, so we can get physicians’ attitudes about using the AI tool. Due to attrition, the additional dataset will probably be relatively small.

Intervention Start Date

2023-10-15

Intervention End Date

2023-12-31

Primary Outcomes

Primary Outcomes (end points)

Diagnosis: whether the diagnosis of physician is correct or not.
Effort: including the number of recommended items and the number of important items, and the IRT (Item Response Theory) score.

Primary Outcomes (explanation)

The IRT score reflects the quality of items, and its construction can be seen in Das et al. (2016).

Secondary Outcomes

Secondary Outcomes (end points)

Treatment (if the dataset is available): whether the treatment given by physicians is correct or unnecessary/harmful.
Subjective trust (if the dataset is available): physicians’ subjective trust towards the AI tool.

Secondary Outcomes (explanation)

Experimental Design

We conduct an audit study in the online healthcare market to analyze the effect of information with various characteristics on physicians’ behavior and the quality of their healthcare.

Experimental Design Details

1. Case
As a common chronic disease, unstable angina is used as the study’s case since it is appropriate for the online consultation setting. And several audit studies (Sylvia et al., 2015; Das et al., 2016; Si et al., 2023) have used it as a common case. According to these studies and guidelines for the diagnosis and management of unstable angina, we construct the professional script for standard patients and the checklist with both the recommend items and important items for diagnosing this disease.

2. Sample
The experiment will be conducted in September 2023 with a sample of 300 physicians (if all physicians accept orders sent from standard patients). If we assume that the proportion of physicians with correct diagnosis in the control group is 15% (Das et al., 2016) and α=0.05, and the number of samples across all treatments is the same, we should need a sample of 198 to have an estimated power of 0.9. Therefore, the statistical power of the experiment should be sufficient.

3. Treatments
Please refer to the intervention section above for specific treatment conditions. We implement these interventions by sending consultation orders with different information, five treatments’ contents are displayed as follows.
(1)The source of information (AI or human)
Treatment 1 (T1): Doctor, I'm a little tired, and my chest hurts. Previous AI consultation with Chunyu Huiwen indicated that I may have cardiovascular disease. Could you help me see what is wrong with me? And how should it to be treated?
Treatment 2 (T2): Doctor, I'm a little tired, and my chest hurts. I consulted another doctor on Chunyu Doctor before and he said that I might have a cardiovascular disease. Could you help me see what is wrong with me? And how should it to be treated?

(2)The subjective norm (induced or not)
Treatment 3 (T3): Doctor, I'm a little tired, and my chest hurts. I participated in a series of activities organized by Chunyu Doctor with the slogan "AI in the New Journey, Walk in the Medical Road Together", and many doctors there suggested Chunyu Huiwen to me. I used it for AI consultation, and the result said that I might have the angina. Could you help me see what is wrong with me? And how should it to be treated?
Treatment 4 (T4): Doctor, I'm a little tired, and my chest hurts. Previous AI consultation with Chunyu Huiwen indicated that I may have the angina. Could you help me see what is wrong with me? And how should it to be treated?

(3)The key items for diagnosing the unstable angina (include or not)
Treatment 5 (T5): Doctor, I'm a little tired, and my chest hurts. As can be seen in the screenshot below, previous AI consultation with Chunyu Huiwen indicated that I may have cardiovascular disease. Could you help me see what is wrong with me? And how should it to be treated?
Treatment 6 (T6): Doctor, I'm a little tired, and my chest hurts. As can be seen in the screenshot below, my previous AI consultation with Chunyu Huiwen indicated that I may have cardiovascular disease (Note: the screenshot does not mention any important items of diagnosing unstable angina). Could you help me see what is wrong with me? And how should it to be treated?

We also add several questions after the experiments, which enable us to get physicians’ attitudes about using the AI tool. These questions are displayed as follows.
1. Doctor, I’m not sure whether to use the AI consultation tool or not, do you think the prediction of AI-assisted advice is reliable or not?
2. Doctor, I received a diagnosis with confidence (probability) from some AI tools. What is the confidence (probability) that I need to go to the hospital for a checkup?
3. Doctor, would you reduce the use of AI tool because of patients’ distrust towards it even if you believed that the AI tool was useful?

Randomization Method

We made a stratified randomization based on physicians’ department and title.

Randomization Unit

The unit of randomization is at the individual physician level.

Was the treatment clustered?

Experiment Characteristics

Sample size: planned number of clusters

N/A

Sample size: planned number of observations

300 physicians.

Sample size (or number of clusters) by treatment arms

300/6

Minimum detectable effect size for main outcomes (accounting for sample design and clustering)

The statistical power of the experiment should be sufficient, please refer to the sample section above for an explanation.

Supporting Documents and Materials

IRB

Institutional Review Boards (IRBs)

IRB Name

Business School, Beijing Normal University

IRB Approval Date

2023-09-05

IRB Approval Number

BNU-BS-IRB 2023-026

Analysis Plan

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

Post-Trial

Post Trial Information

Study Withdrawal

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

Intervention

Is the intervention completed?

Data Collection Complete

Data Publication

Is public data available?

Program Files

Reports, Papers & Other Materials

Human-AI collaboration – experimental evidence

Pre-Trial

General Information

Locations

Primary Investigator

Other Primary Investigator(s)

Additional Trial Information

Registration Citation

Interventions

Primary Outcomes

Secondary Outcomes

Experimental Design

Experiment Characteristics

Institutional Review Boards (IRBs)

Post-Trial

Study Withdrawal

Intervention

Data Publication

Program Files

Relevant Paper(s)

Reports & Other Materials