Experimental Design
Design. This study employs a discrete choice experiment (DCE) with a between and within experimental design.
All participants are exposed to the same vignette (hypothetical scenario): they need to imagine being responsible for developing a policy proposal within a three-month deadline addressing a highly relevant issue in their area of competence. The have a fixed budget for this task, a part of which they can allocate to the production of synthesis of the available scientific evidence for the policy problem.
Choice task structure. Participants complete 8 binary choice tasks, each presenting two alternative scientific evidence syntheses. The two synthesis options are presented side-by-side on desktop computers or vertically stacked on mobile devices, with attributes displayed using visual icons and standardized descriptions. In each task, participants must select their preferred option to support policy development by clicking the corresponding button. Choice tasks are presented one at a time and are self-paced, but must be completed in a single session. Participants cannot return to previous tasks once submitted.
Attribute specification. Each synthesis option is characterized by three attributes with the following levels:
1) Production method:
• Expert-generated: Experts (academics or consultants with experience on the topic) produce a document that synthesizes the scientific evidence
• AI-generated: A generative artificial intelligence (AI) model automatically produces a document that synthesizes the scientific evidence
• AI-generated with expert supervision: A generative artificial intelligence (AI) model automatically produces a document that synthesizes the scientific evidence, whose content is reviewed by experts
2) Delivery time:
• For the Expert-generated production method: 2 weeks, 1 month, 2 months;
• For the AI-generated production method: 1 day, 1 week, 2 weeks;
• For the AI-generated with expert supervision production method: 1 week, 2 weeks, 1 month.
3) Cost:
• For the Expert-generated production method: 1000 € , 1500 €, 2000 €, 3000 €, 4000 €;
• For the AI-generated production method: 250 €, 500 €, 1000 €, 1500 €, 2000 €;
• For the AI-generated with expert supervision production method: 500 €, 1000 €, 1500 €, 2000 €, 3000 €.
Treatment manipulation. Participants are randomly assigned to one of two between-subject conditions that differ in the type of AI used in AI-generated options:
- Treatment group (validated generative AI): the AI model is specifically validated for scientific study synthesis. Participants are informed that "validated" means the AI has been tested by comparing its outputs with those generated by human experts to verify accuracy and reliability.
- Control group (standard generative AI): the AI is a generic generative AI model. Participants are provided with a brief definition of standard generative AI.
Both groups receive identical descriptions for expert-generated syntheses.
The choice set are generated through a constrained randomized design. In each choice set, the main attribute (production method) varies between the two choices. The values of the other two attributes are allowed to vary randomly in each choice set, under the constraints reported above, chosen to ensure realism of the choice.
The data collection will be stagered, we will invite participants gradually, spanning the invitations across several weeks. If the response rate for the first 5000 invitations sent is lower than 3%, we will use the collected data to generate priors and change the design to a D-efficient fractional design, to ensure a higher efficiency for the estimation of the main effects. The intial pool of respondents will be assigned to a separate block and we will constrol for it in the analysis.