Experimental Design
Sampling
Our sample of respondents is drawn from 51 subdistricts that are comparable to agricultural extension
service offices1, located in five regencies in the Yogyakarta special region of Indonesia. The 51 subdis-
tricts cover almost the entire area under analysis. The covered area is incomplete as five offices have
been excluded from the analysis since they reach at most two MFs. The reasons for the exclusion are
implementation efficiency and cost-benefit.
The planned proportional random sample consists of 839 MFs and 171 EWs drawn from a sampling
frame of 1,624 MFs and 303 EWs. The sampling frame comes from administrative data collected by
the agriculture and food security office of the local government2 and consists of governmental EWs and
registered MFs. We selected the sample to have a similar number of respondents across treatment arms
to increase statistical power. Therefore, we first considered 50% of the individuals to be distributed
across the three treatment arms (two treatment groups and one control), such that
_______________________________
where RT is the ideal number of respondent per treatment group, and N is the total number of
individuals in the sampling frame. In a second step, we select the number of respondent per subdistrict
as explained in the following equation:
_______________________________
In particular, the number of selected respondents R in a subdistrict S within a treatment arm T is
rounded up integer of the proportion of individuals in a subdistrict of a given treatment arm NT S
over the total population of that treatment arm NT , times the total number of respondents in a given
treatment arm RT . We rounded up all decimal points to select the number of respondents within each
subdistrict, to ensure at least the ideal number of respondents per treatment arm to be reached.
During the baseline data collection (February 2025), we realized that the reliability of the MFs’
data was low, as many the potential respondent only attended one or more events held by BPSDMP,
therefore, we requested additional data from the local agricultural department (DINAS). After ran-
domizing the potential replacements from this list, we obtained a final number of respondent of 784
MFs and 170 EWs. As we plan two additional waves, we aim at obtaining 2,862 observations from
954 respondents.
11.2 Data collection
Data collection is planned in three phases: Baseline, midline, and endline. These three phases will
occur at the same time for both respondent types. Baseline data collection ended in March 2025.
Midline data collection is planned for October 2025, and endline is planned for February and March
2026.
Randomization procedure:
We applied a stratified random assignment with misfit corrections based on the distribution of EWs
across subdistricts. First, we considered the distribution of EWs in our sampling frame and created
strata based on whether the number of EWs in a subdistrict is above the median number of EWs per
subdistrict. Then, we randomly assigned one-third of the subdistricts (17) to each treatment arm (two
treatment groups and one control), making sure that misfit corrections were globally applied based on
the distribution of EWs in the subdistricts.
Spillovers
We do not expect major spillovers in terms of training participation, as the intervention is based on
subdistrict level and is only offered to the selected respondents (due to budget constraints), preventing
EWs from sharing their freshly acquired knowledge with MFs outside of their working subdistrict area.
Moreover, access to self-paced training will be first offered only to selected respondents4. Still, given
that digital platforms are nonexclusive5, the respondents in the control group could access the platform
themselves. Therefore, the control group can become aware of the platform and access its general
content, learning about those agricultural and business practices that might affect their behavior. As
we plan to analyze intent-to-treat effects, the impact of the training effect on these outcomes will only
be underestimated.
Multiple hypothesis testing
In our study, there is a high probability of falsely rejecting the null hypothesis due to the large number
of outcomes. Therefore, to ensure that our findings are robust to p-value adjustments for multiple
hypothesis testing, we will control for the False Discovery Rates as in Benjamini and Hochberg (1995).
Specifically, we will calculate sharpened q-values within each family of outcomes.
Cost-benefit analysis
We plan to perform a cost-benefit analysis to inform the local government about the value of a potential
scale-up of the project.