Experimental Design Details
Kartu Prakerja is an ideal research context because eligible applicants to are offered the program through a lottery which randomizes eligible applicants, at the individual level, to receive the program (treatment group) or not (control group). Our study employs Kartu Prakerja’s lottery randomization, and multiple data sources to generate evidence, focusing on eligible applicants to the 2020 and 2021 enrollment cohorts.
Given Kartu Prakerja’s lottery system, we can compare the outcomes of eligible applicants selected to receive the program to eligible applicants not selected to the program, using the outcome data described below, to accurately calculate the program’s impacts. The enrollment lottery is conducted in waves, and applicants are screened for eligibility prior to randomization. Eligible applicants who were not selected in a given lottery wave are allowed to reapply and participate in a subsequent wave.
To account for this design, we will use an instrumental variable approach, where winning the lottery is an instrument for receiving Kartu Prakerja. The first stage regression will predict ultimate receipt of the Kartu Prakerja program by the time the outcome data was collected based on being selected in a given lottery wave. To analyze all lottery waves together, we simply stack the data as if we were running each wave one by one, and run a joint version above the instrumental variable equations with lottery wave fixed effects, clustering standard errors by individual to take into account the fact that a given individual can enter a lottery multiple times, but only has a single outcome variable in a given dataset. Using current program data that we have received from PMO on the number of eligible applicants and beneficiaries selected in each wave from PMO, we have already assessed that this instrumental variable approach yields a strong first stage.
We can extend the impact evaluation analysis to understand determinants of program take-up. First, we can explore the characteristics of those who apply, are selected, and engage most in training; whether treatment effects differ for different types of beneficiaries; how different types of beneficiaries use the cash transfers; and the characteristics of those who obtain the highest benefit from Kartu Prakerja. Using regression interactions and machine learning techniques, we will analyze heterogeneity by baseline demographic characteristics, as well as competency and motivation metrics from the application form.
Kartu Prakerja’s lottery also presents a second strategy for studying take-up decisions. Given the lottery is conducted in waves, and applicants not selected in a given wave can reapply and participate in the next lottery wave, by examining the number of lottery waves an eligible applicant attempts, we can explore whether these “hurdles” are an effective screening device for ensuring that the highest-potential applicants enter the program, whether they screen out those most in need, or whether selection effects vary by outcome (different implications for labor vs. consumption vs. financial inclusion impacts). Since multiple family members can apply, and who is chosen is random, we can also use this to study the differential impacts of the program for different types of family members.
(2) Data
We study eligible program applicants from 2020 and 2021 using 1) Kartu Prakerja administrative data, 2) Indonesian national sample surveys, and 3) program-specific surveys. Administrative data contains baseline characteristics and which trainings participants choose. The SUSENAS and SAKERNAS national sample surveys are administered twice a year by the GoI to a random sample of households across Indonesia, including some individuals who have applied for Kartu Prakerja. SAKERNAS asks about labor outcomes (employment, wages, hours), SUSENAS focuses on consumption (spending patterns, food security), and both contain measures relevant to financial inclusion. We are also conducting additional surveys for more detailed information. As part of this research, we are expanding the scope and coverage of these surveys to include additional questions and also provide comparable data for the control group.