Back to History

Fields Changed

Registration

Field Before After
Trial Status in_development completed
Last Published November 24, 2025 09:06 AM December 12, 2025 02:07 PM
Experimental Design (Public) Participants will be recruited via Prolific. We will require that participants have completed 100+ previous studies with 95% or greater approval rating. In addition, participants must be 18 or older and located in the United States. They must also self-identity as either white or Black/African-American on Prolific. We will use Prolific’s screening criteria to ensure adequate sampling of Black participants. Our recruited sample will be ¾ white and ¼ Black. In the first part of the experiment, participants answer brief demographic questions, provide information on their risk preferences and social preferences, and build an avatar that resembles them in terms of race and gender. They are also introduced to the “slider task,” which is a real-effort task that involves dragging a slider to a pre-specified position on a scale. They complete practice sliders and then make decisions about how willing they would be to complete additional sliders for pay. In the second part, participants are randomly assigned to pairs. They see the avatar of their assigned partner. One partner within the pair will be randomly assigned to play the role of the manager and the other will be assigned to play the role of the worker for 2 rounds of interactive play. In each round, the worker has the opportunity to complete slider tasks in order to increase the chances of a good payoff for both the manager and themselves. Here is how the round works. There are two possible outcomes: a good outcome and a bad outcome. In the good outcome manager and worker both earn 10 tokens. In the bad outcome, the worker will earn 10 tokens but the manager will earn 0 tokens. Whether there is a good outcome or a bad outcome is partly determined by chance. At the start of the round, the chance of a good outcome is 5%. In order to increase the chances of a good outcome, the worker can complete slider tasks during a work period. For each slider they choose to complete during the work period, the likelihood of the good outcome goes up by 1 percentage point. If they successfully complete all 50 sliders, the likelihood of the good outcome increases to 55%. During the work period, the worker completes as many sliders as they wish (up to 50). The manager is asked how many sliders they think their worker should complete and how many sliders they believe their worker will complete. The answers to these questions are not revealed to the worker. After the work period, the computer will select the outcome based upon the likelihoods determined by the number of sliders completed. Both parties are told whether the outcome was good or bad and the associated payoffs. Then, a second round is played, identical to the first. After the second round, the worker has the choice of whether to play a third round. If they elect to play a third round, a third round is played, identical to the second round. There are two randomized treatment variations: no punishment versus punishment; hidden effort versus revealed effort. Each worker-manager pair is randomly assigned to one of the four possible cells with equal likelihood. No punishment versus Punishment In the “No punishment” treatments, the manager cannot punish the worker. In the “Punishment” treatments, the manager can levy a punishment on the worker. Punishment is costless. The manager can choose to deduct up to 10 experimental tokens from the worker in each round. They make the punishment decision after learning the outcome. The punishment decision is immediately communicated to the worker prior to the start of the next round of play. Each worker will learn at the beginning of the first round of interactive play whether their assigned manager has the ability to punish them. Note that in the first round, every manager is told that they have a 50% chance of being able to punish their worker. We ask all managers to make a punishment decision in the first round, and then we reveal to the manager whether they had been assigned to the “No punishment” treatment (in which case the punishment is not implemented) or the “Punishment” treatment (in which case the punishment is implemented). In the remaining rounds, only managers in the Punishment treatment make punishment decisions. Each worker-manager pair remains in the same treatment for all remaining rounds. Hidden Effort versus Revealed Effort This treatment is cross-randomized with the punishment treatments. We vary whether the manager learns how many sliders the worker completed during the work period in addition to learning the outcome. In the punishment treatments, effort is revealed prior to the manager’s punishment decision in each round. Note that both workers and managers are told about whether the worker’s effort (number of sliders) will be revealed or not at the start of the first round. In the Hidden Effort treatment, we ask managers after making their punishment decision but prior to continuing to the next round how many sliders they believe their worker completed. Note that in the first round, every worker is told that there is a 50% chance that the manager observed how many sliders the worker completed. After the first round, we reveal to the worker whether they had been assigned to the “Hidden Effort” treatment (in which case the number of sliders completed was not revealed) or the “Revealed” treatment (in which case the manager sees the number of sliders the worker completed). In the remaining rounds, only managers in the Revealed Effort treatment observe the number of sliders completed. Again, each worker-manager pair remains in the same treatment for all remaining rounds. Race and Gender determination We will have three options for the avatar’s skin tones: one dark brown, one light brown, and one pale. For the purposes of analysis, we will categorize workers as Black if they chose either the dark or the light brown skin tones. For the purposes of analysis, gender will be determined by the participant’s answer to the gender question on the demographic survey at the beginning of the study. (Their answer to this question determines the set of avatars available for them to select from. Male avatars have a blue background, female avatars have a pink background and non-binary have yellow backgrounds. We will use the respective pronouns referring to the worker during the study, to highlight their gender to the manager.) We plan to exclude workers who choose non-binary gender from the analysis due to anticipated limited statistical power. Other Analysis Notes: We have the following exclusion criteria: -- We will exclude from analysis any participant who dropped out during the study. -- We will exclude from analysis any participant whose partner dropped out during the study. -- We will exclude from analysis any participant who failed any of the following mandatory attention checks: The first attention check occurs during the survey on demographic characteristics. The second attention check occurs also during the first survey portion of the study, embedded in questions on risk attitudes. For every round, the manager will have an attention check embedded in the expectation portion of the survey. Additionally, all participants will have a last attention check occurring immediately after the interactive portion of the study, embedded in questions asking them about the avatar of their partner. Every participant who fails either of the two mandatory attention checks that appears in the demographic survey will be dropped from the study before being paired with another participant. Participants will be recruited via Prolific. We will require that participants have completed 100+ previous studies with 95% or greater approval rating. In addition, participants must be 18 or older and located in the United States. They must also self-identity as either white or Black/African-American on Prolific. We will use Prolific’s screening criteria to ensure adequate sampling of Black participants. Our recruited sample will be ¾ white and ¼ Black. In the first part of the experiment, participants answer brief demographic questions, provide information on their risk preferences and social preferences, and build an avatar that resembles them in terms of race and gender. They are also introduced to the “slider task,” which is a real-effort task that involves dragging a slider to a pre-specified position on a scale. They complete practice sliders and then make decisions about how willing they would be to complete additional sliders for pay. In the second part, participants are randomly assigned to pairs. They see the avatar of their assigned partner. One partner within the pair will be randomly assigned to play the role of the manager and the other will be assigned to play the role of the worker for 2 rounds of interactive play. In each round, the worker has the opportunity to complete slider tasks in order to increase the chances of a good payoff for both the manager and themselves. Here is how the round works. There are two possible outcomes: a good outcome and a bad outcome. In the good outcome manager and worker both earn 10 tokens. In the bad outcome, the worker will earn 10 tokens but the manager will earn 0 tokens. Whether there is a good outcome or a bad outcome is partly determined by chance. At the start of the round, the chance of a good outcome is 5%. In order to increase the chances of a good outcome, the worker can complete slider tasks during a work period. For each slider they choose to complete during the work period, the likelihood of the good outcome goes up by 1 percentage point. If they successfully complete all 50 sliders, the likelihood of the good outcome increases to 55%. During the work period, the worker completes as many sliders as they wish (up to 50). The manager is asked how many sliders they think their worker should complete and how many sliders they believe their worker will complete. The answers to these questions are not revealed to the worker. After the work period, the computer will select the outcome based upon the likelihoods determined by the number of sliders completed. Both parties are told whether the outcome was good or bad and the associated payoffs. Then, a second round is played, identical to the first. After the second round, the worker has the choice of whether to play a third round. If they elect to play a third round, a third round is played, identical to the second round. There are two randomized treatment variations: no punishment versus punishment; hidden effort versus revealed effort. Each worker-manager pair is randomly assigned to one of the four possible cells with equal likelihood. No punishment versus Punishment In the “No punishment” treatments, the manager cannot punish the worker. In the “Punishment” treatments, the manager can levy a punishment on the worker. Punishment is costless. The manager can choose to deduct up to 10 experimental tokens from the worker in each round. They make the punishment decision after learning the outcome. The punishment decision is immediately communicated to the worker prior to the start of the next round of play. Each worker will learn at the beginning of the first round of interactive play whether their assigned manager has the ability to punish them. Note that in the first round, every manager is told that they have a 50% chance of being able to punish their worker. We ask all managers to make a punishment decision in the first round, and then we reveal to the manager whether they had been assigned to the “No punishment” treatment (in which case the punishment is not implemented) or the “Punishment” treatment (in which case the punishment is implemented). In the remaining rounds, only managers in the Punishment treatment make punishment decisions. Each worker-manager pair remains in the same treatment for all remaining rounds. Hidden Effort versus Revealed Effort This treatment is cross-randomized with the punishment treatments. We vary whether the manager learns how many sliders the worker completed during the work period in addition to learning the outcome. In the punishment treatments, effort is revealed prior to the manager’s punishment decision in each round. Note that both workers and managers are told about whether the worker’s effort (number of sliders) will be revealed or not at the start of the first round. In the Hidden Effort treatment, we ask managers after making their punishment decision but prior to continuing to the next round how many sliders they believe their worker completed. Note that in the first round, every worker is told that there is a 50% chance that the manager observed how many sliders the worker completed. After the first round, we reveal to the worker whether they had been assigned to the “Hidden Effort” treatment (in which case the number of sliders completed was not revealed) or the “Revealed” treatment (in which case the manager sees the number of sliders the worker completed). In the remaining rounds, only managers in the Revealed Effort treatment observe the number of sliders completed. Again, each worker-manager pair remains in the same treatment for all remaining rounds. Race and Gender determination We will have three options for the avatar’s skin tones: one dark brown, one light brown, and one pale. For the purposes of analysis, we will categorize workers as Black if they chose either the dark or the light brown skin tones. For the purposes of analysis, gender will be determined by the participant’s answer to the gender question on the demographic survey at the beginning of the study. (Their answer to this question determines the set of avatars available for them to select from. Male avatars have a blue background, female avatars have a pink background and non-binary have yellow backgrounds. We will use the respective pronouns referring to the worker during the study, to highlight their gender to the manager.) We plan to exclude workers who choose non-binary gender from the analysis due to anticipated limited statistical power. Other Analysis Notes: We have the following exclusion criteria: -- We will exclude from analysis any participant who dropped out during the study. -- We will exclude from analysis any participant whose partner dropped out during the study. -- We will exclude from analysis any participant who failed any of the following mandatory attention checks: The first attention check occurs during the survey on demographic characteristics. The second attention check occurs also during the first survey portion of the study, embedded in questions on risk attitudes. For every round, the manager will have an attention check embedded in the expectation portion of the survey. Additionally, all participants will have a last attention check occurring immediately after the interactive portion of the study, embedded in questions asking them about the avatar of their partner. Every participant who fails either of the two mandatory attention checks that appears in the demographic survey will be dropped from the study before being paired with another participant. MODIFICATION (12/12/25): This modification impacts our plan for exclusions. We realized after starting the data collection that the oTree code and the pre-registration were inconsistent in how they treated participants who failed attention checks. Our original pre-registration planned to drop any participant who failed any attention check. However, the code as run dropped a participant only after they failed their second attention check. We are modifying our plan for exclusions to match the implemented code. We will drop any participant who failed two attention checks. This better aligns with Prolific's guidance of using two or more attention checks for surveys of 5 or more minutes and it reduces "drop" rates from the collected data, helping with budget and partner matching. This modification is occurring after data collection has begun. However, we have not cleaned or analyzed data, and we have collected only approximately 1/3 of our intended sample size. We will do a number of supplementary checks to make sure our results are robust to this decision. In particular, we will do supplementary analysis that drops any participant who (i) failed only one attention check AND (ii) participated before this modification was registered. We will also check the robustness of our results to dropping all participants who failed one attention, though we expect to be under-powered for that analysis given expected failure rates.
Back to top