Experimental Design
### Treatments
We create two **baseline** treatments. One vertical (Task 1 is easy and Task 2 is hard) and one horizontal (Task 1 is young and Task 2 is old). For each, we recruit 200 participants who select a task and give a rating. The average rating from previous participants is never disclosed. This allows us to create a baseline and see how much welfare participants are able to generate **without** a rating system. (Note, we do not need participants to arrive sequentially since ratings are never displayed)
We then create two **rating** treatments where the average rating from previous participants is available at the buying stage. For the reasons described above, for each treatment, we generate 14 groups of 30 participants who enter the marketplace sequentially and where the average ratings are updated at every passing. This allows us to see how much additional welfare is generated by the introduction of a rating system.
We believe that rating systems are less effective when goods are horizontally differentiated. For this reason, we study two variations of rating systems that are used by some online platforms and could help improve welfare for the horizontal markets.
1. Filtering: This treatment is similar to “Rating Horizontal” but rather than displaying the average (and number) of ratings from previous participants, it is the average (and number) of ratings for each type (18-30 and 50+) that is displayed.
2. Freezing: This treatment is similar to “Rating Horizontal” but each task needs at least 5 ratings for the average (and number) or ratings to be disclosed.
### Additional exercises
In addition, for all participants and before the buying stage, we measure self-confidence in performance in celebrity quizzes. Participants have to answer the following question: *Imagine taking a quiz about celebrities. Out of 100 randomly selected people who also took the same quiz, how many do you think would perform worse than you?* This allows us to explore the effect of initial expectations on buying decisions and ratings.
Finally, we also measure risk aversion by asking the following “Are you generally a person who is fully prepared to take risks or do you try to avoid taking risks? [Scale from 0 to 10]]”.
**********************
## Additional treatments: November 2024
#Treatment 1: Algorithm-Based Recommendation
In this treatment, we introduce an ‘algorithm’ designed to recommend the most profitable option based on participants’ individual characteristics. The algorithm is trained on data from previous participants in this experiment. The algorithm is very simple as it will suggest participants to buy the age matching task (i.e. task old for old participants and task young for young participants). The algorithm’s suggestion will be presented as: ”Based on previous performance of participants similar to you, we suggest you buy task [age group].”
#Procedure:
Participants first complete the same introductory questionnaire as in previous treatments (including demographic information and confidence). During the buying stage, they are presented with the algorithm’s recommendation, which is tailored to their specific characteristics. Participants then choose a task or opt to count zeros, complete the chosen task, rate it, and exit the experiment.
#Sample:
We recruit 200 participants for this treatment (100 old and 100 young). Each
participant makes a decision independently (without sequential interdependence of
ratings).
#Treatment 2: Combined Rating and Algorithm-
This treatment combines the existing ”Rating Horizontal” system with the algorithm-based recommendation. Participants are provided with both the average ratings of previous participants (without filtering) and the algorithm’s recommendation during the buying stage.
#Procedure:
As in previous treatments, participants answer the introductory questionnaire and are exposed to the buying stage, where they see both the algorithm’s recommendation and the average rating (along with the number of ratings) from previous participants. After making their choice, they complete the task, rate it, and exit the experiment.
This treatment follows the same 14x30 sequential structure as in the ”Rating Horizontal” treatment, with ratings evolving across groups of participants. In each sequence, 15 old and 15 young participants will be randomly assigned a position in the sequence.
#Sample:
We recruit 14 groups of 30 participants (total 420 participants) who enter the
marketplace sequentially and observe the ratings of previous participants.