Experimental Design Details
In a laboratory experiment with real effort tasks, we measure whether decisions for extra work are narrowly bracketed: whether people make decisions for extra work by thinking only about the direct disutility incurred from doing the extra work, or whether they also take into account the indirect effects of this extra work on other work they already have to complete. Specifically, subjects will be asked to complete a fixed and given amount of work and then be asked to do additional work. However, there will be 5 treatments:
- BEFORE ONLY: Subjects are asked for their WTW for additional tasks when there are no required tasks.
- NARROW UNSPECIFIED: Subjects are asked for their WTW for additional tasks when they know there are required tasks. They are not told whether these tasks are done before or after the main tasks.
- NARROW BEFORE: Subjects are asked for their WTW for additional tasks before doing some required tasks.
- NARROW AFTER: Subjects are asked for their WTW for additional tasks after doing some required tasks.
- BROAD: Subjects are asked for their WTW for additional tasks when it is made clear that they are in addition to the required tasks.
In all choices except in the BEFORE only treatment, the choice offered allows to choose exactly the same amount of work for exactly the same amount of money -- the choice set is fixed, including no extra requirements or benefits from working fast or slow. A person who brackets narrowly may nonetheless act differently, since they may perceive the extra tasks differently if they are framed as having to be done before or after the required work or if there is no active reminder that there are required tasks to do. We consider as a control treatment the BROAD treatment, in which subjects are told that they choose between the required work (say 30 tasks) or the required work plus extra work (40 tasks). Thus is the most transparent choice, and the one that economic theory would say is 'the right' framing, under standard assumptions on utility over work.
If people bracket narrowly and find the first 10 tasks easier than the last 10 tasks (increasing marginal disutility), then our hypotheses are the following:
- NARROW BEFORE: A person who brackets narrowly should choose as if (or more closely towards) BEFORE ONLY, since they are thinking only of the 10 tasks, not about how it makes the other 30 tasks harder.
- NARROW UNSPECIFIED: Similarly to NARROW BEFORE. Since we don't remind people of the required tasks in this treatment, the effect may be stronger in this treatment (although it may be weaker, as some people might naturally think of doing the work AFTER the required work).
- NARROW AFTER: A person who brackets narrowly and thinks of doing work after 30 tasks should choose as they would in BROAD. However, it may be that the reminder of the required tasks is ignore and not integrated with this choice, and thus it should be between NARROW UNSPECIFIED and BROAD.
Overview of the main experiment:
• Experiment based on the transcription task similar to the one used by Augenblick and Rabin (2015).
• Two parts: the first part will be conducted online (via Lioness Lab, Arechar et al., 2018), the second in the laboratory.
• PART 1 Subjects are invited to participate to the first part of the experiment online. Subjects read the instructions online, telling them that the experiment is made of two parts and that earnings are accumulated in both parts and are paid at the end of the experiment.
o PHASE 1: Subjects practice with the transcription task. They are rewarded a fixed amount (participation fee), for performing this task for 10 minutes.
o PHASE 2: Subjects are told that the week after they will perform this task in the lab. They will book the slot where they can participate to the experiment and told that for that session they will be asked to complete (say) 30 of these tasks to receive XX Euros.
o PHASE 3: depending on treatment, they will be given the opportunity to do YY extra tasks. These tasks will be done alone (BEFORE ONLY), before the main fixed tasks (NARROW BEFORE), after the main fixed tasks (NARROW AFTER), at an unspecificed time (NARROW UNSPECIFIED) or clearly in addition to the main 30 tasks (BROAD). Subjects will be asked to state the minimum amount of money they would be willing to do this task. For the elicitation we will use two different elicitation methods, randomized across the two treatments.
The two methods of eliciting the minimal payment (the WTW) are:
- A slider to select the minimum acceptable payment in order to perform a fix amount of work for a fixed amount of money (e.g. 13 tasks for $3.00)
- A set of multiple questions eliciting the minimum acceptable piece rate payment (13 tasks at $0.20/task, $0.40/task, $0.60/task...)
• PART 2
o PHASE 1: One of the choices made during the PHASE 3 will be selected randomly and implemented.
o PHASE 2: Subjects will work and will be rewarded according to the schedule.
o PHASE 3: At the end of the working part, subjects will be asked to answer to a series of incentivized questions, replicating Rabin and Weizsäcker (2009) with low stakes.
For all treatments except BEFORE ONLY, we will follow them up (in the first session) by a within-subjects treatment where we ask them the BROAD question too for non-BROAD treatments and one of the NARROW treatments for the BROAD treatment. Thus these subjects will be asked both a NARROW and a BROAD question, allowing us to see how narrowly they themselves bracket (since answers should be the same), as well as whether there is more or less narrow bracketing depending on the order of the questions. The subjects who receive both allow us to potentially estimate narrow bracketing at the individual level; or to see if people are less likely to narrowly bracket if they receive the NARROW question after the BROAD treatment, as this may draw their attention to the identical nature of the choices. (See "Secondary Outcomes (Explanation)").
Pilot Description
Without narrow bracketing, all choices except BEFORE ONLY should be identical. However, BEFORE ONLY and BROAD are identical, then our problem is that we have no power to identify narrow bracketing, as both narrow bracketing and broad bracketing give exactly the same answer: all answers should be the same, no matter whether subjects bracket broadly or narrowly. Why might this happen? It can happen if the first 10 tasks are exactly as painful as the next 10, and as the next 10, and so on. In that case narrow bracketing doesn't lead to a mistake. Another reason this can happen is that people *think* that 10 tasks are always equally painful (even if it turns out that they are not).
For this reason we run the following pilot to identify whether subjects think that the task gets harder (as well as whether they end up believing that). Note that what truly matters is the *beliefs* people have at the time they make the choices, not whether it actually ends up being more tedious.
In the pilot, we have three treatments, low-, medium-, and high-effort, denoted W10, W40, and W70 based on the total number of required tasks they'll have to do in each treatment. In each treatment, subjects do 10 tasks first. Then they are asked for their willingness to work after the remaining tasks, which is 0 for the W10 group (they already did 10 tasks), 30 for the W40 group, and 60 for the W70 group. It is important to ask them at this point, since that is the point at which we also ask the subjects in the main study. We ask them for their WTW by telling them now for their minimal payment for doing additional work after the required work. We do so by asking them the broadly framed question - that from BROAD treatment. We moreover ask them the same question as in the main study regarding whether they expect work to become more or less tedious (a question on a 10-point scale).
We will use the slider and price-list (randomized) and choose the more precise of these methods.
Based on the outcomes of this pilot, we will decide whether to use the sliders or the price list, and whether to use 30 or 60 tasks in the main experiment based on which of the 4 combinations would give us the largest power in the main experiment if the effect sizes and variances in the main study were identical (with identical variance) as in the pilot. This power computation will be done under the assumption of a fixed budget.
Arechar, A.A., Gächter, S., & Molleman, L. (2018). Conducting interactive experiments online. Experimental economics, 21(1), 99-131.
Augenblick, N., & Rabin, M. (2015). An experiment on time preference and misprediction in unpleasant tasks. The Review of Economic Studies.
Rabin, M., & Weizsäcker, G. (2009). Narrow bracketing and dominated choices. American Economic Review, 99(4), 1508-43.