Experimental Design Details
We will conduct an online randomized control trial to assess the impact of GenAI on upskilling in data science. Initially, we will invite Boston Consulting Group (BCG) associates, and consultants to join a GenAI study via email. Associates and consultants who register will be randomly assigned to either a Control or a ChatGPT experimental condition group. In parallel, we plan to recruit data scientists to participate in a similar exercise, serving as a benchmark for the typical performance of a data scientist.
In the recruitment phase, we sent a survey to BCG's associates, and consultants to gauge their interest in the study, offering participation as a contribution towards their career development. This survey collected information on demographics, programming and ChatGPT skills, technology openness, creativity, and learning orientation (Agarwal & Prasad, 1998; Miron, Erez, & Naveh, 2004; Jha & Bhattacharyya, 2013). Details of the survey are available in the Appendix (Registration survey).
Associates and consultants will be randomly assigned to a Control or ChatGPT experimental condition. We plan to stratify our sample across multiple dimensions including their, gender, location, role (i.e., associate or consultant), coding skills, college degree (i.e., bachelors, masters, Ph.D.), and experience with ChatGPT for coding. Prior to participation in the experiment, all subjects will be asked to consent to participate in the experiment. We indicate that participation in the study is voluntary, and the time will count as an "office contribution" to their career development committee to reflect our appreciation for their efforts. We also provide additional incentives to encourage an ‘honest effort’ in the tasks. Top performers in each group will receive recognition among BCG leadership, an invitation to a small group chat with OpenAI and OpenAI merchandise.
After the consent, participants will complete pre- and post-experiment surveys and engage in data science tasks designed to evaluate their knowledge. Control group members will not use ChatGPT or similar tools to complete these tasks, although they may use Google or other resources. Conversely, the ChatGPT group will be provided a brief ChatGPT training (15-20 minutes) and asked to use ChatGPT to assist with their responses.
The experiment includes four stages, starting with a pre-experiment survey on subjective coding skills, GenAI usage, professional identity, and career aspirations (Pre-survey). We developed three independent tasks to test the knowledge in data science - a coding task (Coding task), a problem-solving task (Problem-solving task) and statistical knowledge task (Statistical knowledge task). However, we will randomly assign each participant, two randomly selected tasks from the three due to the effort they require to complete (~90 minutes each), with task order randomized to prevent ordering effects. Following task completion, a post-survey similar to the pre-survey will measure any change in participants' perceptions (Post-survey).
Following participation in the experiment, we will conduct qualitative interviews with a selective sample of participants. Particularly, we will target individuals who over-performed and under-performed in the data science tasks to better understand the underlying factors that influenced their outcomes.