Experimental Design
Measuring enumerator honesty
We will conduct a lab-in-the-field experiment with enumerators involved in survey data collection in Ethiopia and Kenya. The design follows the approaches in experimental literature on dishonesty (e.g., Barfort et al., 2019; Fischbacher and F¨ollmi-Heusi, 2013) to measure enumerators’ propensity for dishonesty, and is adapted to our context. For logistical simplicity, we will implement a coin-toss task instead of a dice-rolling task, consistent with prior studies (e.g., Abeler et al., 2014; Cohn et al., 2014; Janezic and Gallego, 2020).
Approximately 80–90 enumerators, including supervisors, will participate across the two countries. Participants will be informed that the task is designed to understand decision-making among survey enumerators engaged in data collection activities and to test how well they guess in situations characterized by randomness. They will also be informed that they earn money based on their reported outcomes. Participants will be explicitly informed that the research team cannot observe the outcome of any coin toss and will be instructed to toss the
coin privately, without the presence of anyone else.
Participants will complete six practice rounds to familiarize themselves with the procedure before starting the actual incentive-compatible exercise. The main coin-tossing exercise and tasks consist of 30 independent rounds. In each round, participants privately toss a coin and
then report their outcome.
The experiment uses local currency. To standardize the task across participants and minimize concerns about coin availability or biased flips from worn or damaged coins, all participants will receive a new coin of the corresponding denomination obtained from a bank. In Ethiopia, participants receive a one-Birr coin, with the lion side defined as heads and the weighing-scale side as tails. In Kenya, participants receive a 20-shilling coin, with the African bush elephant (Ndovu) side defined as heads and the Kenya Coat of Arms (National Crest) side as tails (see Fig. A.1).
Reporting heads pays 50 Birr per round in Ethiopia and 80 KES per round in Kenya, whereas reporting tails pays zero. The Kenya payment is designed to be broadly comparable to the Ethiopian payoff in purchasing-power-parity terms: 50 Birr ≈ 80 KES.2 Since payment
depends only on the reported outcome, participants have a monetary incentive to misreport the realized outcomes by overstating the number of heads.
The true outcomes are not observed by the research team, so this task creates a clear incentive to misreport. Cumulative monetary payments increase with the number of reported heads. Prior studies show that (dis)honesty measured through this approach predicts real-world unethical behavior and corruption (Fischbacher and F¨ollmi-Heusi, 2013; Barfort et al., 2019; Olsen et al., 2019; Cohn and Mar´echal, 2018).
Under truthful reporting, the probability of reporting heads in any given round is 50 percent. Enumerator honesty is measured by comparing the distribution of reported heads to the distribution implied by truthful reporting under random chance. Deviations from this benchmark
distribution will be used to construct measures of enumerators’ propensity to misreport.
Measuring enumerator effort
We measure enumerator effort using an incentivized real-effort task, following Abeler et al. (2011) and Koch and Nafziger (2020). Enumerators count the number of zeros in a sequence of tables consisting of randomly ordered zeros and ones. The sequence of tables is randomly generated for each enumerator, and each table appears one at a time in the CAPI instrument. For each table, enumerators enter the number of zeros and can proceed to the next table only after submitting the correct answer. Incorrect submissions are not counted as completed tables; instead, the enumerator remains on the same table and must try again.
The task lasts for a maximum of 90 minutes, the minimum time needed to complete the field surveys. A countdown timer displayed in the CAPI instrument shows the time remaining. After each correctly completed table, enumerators choose whether to continue counting zeros
in the tables or stop. Enumerators earn 15 Birr for each correctly completed table. To reduce potential confounding, including peer effects, enumerators were randomly assigned to rooms, seated apart from one another, and given staggered start times. These procedures limited
the scope for enumerators to observe others’ progress or stopping decisions. We record the session, room, date, start time, and number of enumerators present, and account for session-level variation in the empirical analysis.
Our primary measure of effort is the number of correctly completed tables before the enumerator stops or reaches the time limit. This measure captures correct work rather than attempted work. The relatively long duration of the task also allows us to examine the pattern of effort over time. We construct additional measures of effort and performance using the task data. As alternative measures of effort provision, we use the total time spent working and accumulated earnings up to the point at which the enumerator stops. As a measure of productivity, we use the average time per correctly completed table.
The task is well-suited for measuring effort in this setting. It requires no prior knowledge, measures performance objectively, and has a limited scope for learning. But the task is tedious and repetitive, and completing additional tables involves costly effort for enumerators, as do conventional surveys.