The Value of Rating Systems in Credence Goods Markets

Last registered on November 18, 2023

Pre-Trial

Trial Information

General Information

Title
The Value of Rating Systems in Credence Goods Markets
RCT ID
AEARCTR-0012483
Initial registration date
November 09, 2023

Initial registration date is when the trial was registered.

It corresponds to when the registration was submitted to the Registry to be reviewed for publication.

First published
November 18, 2023, 5:50 PM EST

First published corresponds to when the trial was first made public on the Registry after being reviewed.

Locations

Region

Primary Investigator

Affiliation
Technical University of Munich, School of Management

Other Primary Investigator(s)

PI Affiliation
University of Innsbruck
PI Affiliation
ESCP Business School
PI Affiliation
UMIT Tirol
PI Affiliation
ETH Zürich

Additional Trial Information

Status
In development
Start date
2023-11-13
End date
2024-01-26
Secondary IDs
Prior work
This trial does not extend or rely on any prior RCTs.
Abstract
In this project, we replicate the four main treatments in Angerer et al. (2022) using a neutral frame (instead of a healthcare frame).

We experimentally investigate the effect of public consumer ratings on market outcomes in credence goods markets. Contrary to search or experience goods, consumers cannot evaluate all dimensions of trade for credence goods, which may inhibit the information and reputation-building value of public rating systems. We implement a expert market in which experts have an informational advantage over consumers with respect to the apppropriate treatment. The rating system takes the form of a five-star rating system as is common on online rating websites. The value of this rating system is compared in two different expert market settings: First, one in which consumers cannot rely on information from personal experience with the expert, reflecting markets in which consumer-expert interactions are often first-time and infrequent. Second, one in which consumers have personal experience with the expert, reflecting markets in which consumer-expert interactions are frequent and repeated.

Reference:
Angerer, S., Glätzle-Rützler, D., Mimra, W., Rittmannsberger, T., & Waibel, C. (2022). The value of rating systems in healthcare credence goods markets. Available at SSRN. https://doi.org/10.2139/ssrn. 3965318
External Link(s)

Registration Citation

Citation
Angerer, Silvia et al. 2023. "The Value of Rating Systems in Credence Goods Markets." AEA RCT Registry. November 18. https://doi.org/10.1257/rct.12483-1.0
Sponsors & Partners

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information
Experimental Details

Interventions

Intervention(s)
We replicate the four main treatments in Angerer et al. (2022) with a neutral frame.
We employ a 2x2 factorial design to experimentally investigate the effect of a public rating system in two different expert markets: A market in which consumers can rely on their personal experience, and one in which they cannot. These two different market environments are implemented in the experiment as follows:

In the conditions without personal experience, in each period consumers choose one expert from a list of four without being able to identify them. All players are informed beforehand that consumers have no means of identifying experts from previous periods. Thus, although consumers observe their payoffs in each period and can partially infer expert behavior, they cannot attribute it to a particular expert and therefore cannot build up personal experience with a particular expert. In experimental conditions with personal experience, consumers can identify experts by a fixed ID and decide whether to interact with a particular identified expert. Over the 16 periods of play, they can thus learn from their personal experience (payoffs) with a particular identified expert.

In the conditions with the public rating system, consumers can choose to rate interactions with experts on a five-star rating scale after receiving their payoff in a given period. This rating is shown to the respective expert at the end of the period. Subsequently, ratings for each expert over all treated consumers are aggregated, averaged, and displayed to consumers. Consumers see these public ratings for all experts when they decide whether to interact and which expert to choose starting in period 5. In the condition without personal experience, as highlighted before, consumers cannot identify a particular expert and only see the public ratings. The public ratings of all experts are displayed to experts when they decide on the type of treatment and which price to charge in a given period.
In total, we run four experimental conditions: BASELINE (no experience, no rating), EXPERIENCE (experience, no rating), RATING (no experience, rating), EXP-RATING (experience, rating).

Reference:
Angerer, S., Glätzle-Rützler, D., Mimra, W., Rittmannsberger, T., & Waibel, C. (2022). The value of rating systems in healthcare credence goods markets. Available at SSRN. https://doi.org/10.2139/ssrn. 3965318
Intervention (Hidden)
The experiment will be conducted at the Econ Lab at the University of Innsbruck. We build our experimental design on a slightly adopted credence goods framework of Dulleck and Kerschbamer (2006).

Experiment
The basic set-up and parameterization:
In our basic set-up, consumers and experts are grouped in a market of 8 subjects (4 consumers & 4 experts). Consumers suffer from a major problem with probability h = 0.5 and a minor one with probability (1-h). The probability h=0.5 is common knowledge. Consumers choose a expert knowing that they suffer from some problem in every period. They do not get information about the severity of their problem. Experts diagnose their consumers’ problem with certainty and at zero costs. They provide one of two treatments, a minor (qL) or an major (qH)treatment. The cost for the expert to provide the major treatment (cH) is 6 ECU (Experimental Currency Unit). The cost for the minor treatment (cL) is 2 ECU.
Treatment prices, paid by the consumers, are either 3 ECU (pL) or 8 ECU (pH). The major treatment cures both, the major and the minor health problem, while the minor treatment only cures the minor one. Consumers obtain 10 ECU (v) if cured, and zero if treated insufficiently. The payoff for Consumers consulting a expert is the difference between the obtained value v and the price charged (pL or pH). For experts, the payoff is the spread between the price charged (pL or pH) and the cost for the chosen treatment (cL or cH). In case a consumer decides against consulting any expert, the consumer receives an outside option of (-4) ECU (OPAT). Experts receive OEXP =0 if they do not interact with any consumer in a given period.
The structure of the stage-game is as follows:

1) For each consumer, nature draws the type of health problem. With probability h consumers have a major problem, and with probability (1-h) consumers have a minor health problem.

2) Consumers decide whether to consult an expert. If consumers decide not to visit an expert, the period ends. Otherwise, they choose one expert from a list of four.

3) Experts costlessly diagnose the problem, provide a treatment (q_L or q_H), and charge a price (p_L or p_H).
Consumers and experts observe their payoff in the respective period.

4) In the conditions with a public rating system: After learning the payoff for the respective period, consumers decide whether to rate the interaction with the expert. If they decide to rate the interaction, they choose the rating on a scale between 0 and 5 stars which is shown to the expert afterward.

[Treatment Variation]
As explained above, we plan to run four treatments:
[Experimental Condition 1] — No feedback-mechanism, No personal experience (BASELINE)
[Experimental Condition 2] — No feedback-mechanism, personal experience (EXPERIENCE)
[Experimental Condition 3] — Feedback-mechanism, No personal experience (RATING)
[Experimental Condition 4] —Feedback-mechanism, personal experience (EXP-RATING)
Intervention Start Date
2023-11-13
Intervention End Date
2024-01-26

Primary Outcomes

Primary Outcomes (end points)
Undertreatment, and overcharging-rates
Primary Outcomes (explanation)
Undertreatment is defined as the consumer needing the major treatment q_H but the expert providing the minor treatment q_L. An expert might have incentives to do so since the costs for the major treatment are higher (6 ECU versus 2 ECU) and the expert can always charge the price of the major treatment (8 ECU). In the results section, undertreatment will be reported in percent of expert-consumer interactions in which consumers need the major treatment. In terms of information, consumers can detect undertreatment in a period ex-post via their payoff, as the problem is not solved. In particular, if the expert charged p_H, the consumer payoff from undertreatment is -8 ECU

Overcharging is defined as the expert charging the price of the major treatment p_H while only providing the minor treatment to a consumer who has a minor problem. Overcharging is accordingly reported in percent of expert-consumer interactions in which consumers need minor treatment. In terms of information, consumers cannot infer ex post whether they have been overcharged, as they might have had a major problem requiring the major treatment charged at p_H. Thus, an expert can 'hide' behind a major treatment problem when overcharging.

Secondary Outcomes

Secondary Outcomes (end points)
Consumer outcomes: Interactin rates, giving feedback, and rating.
Market outcomes: Market efficiency and consumer surplus.
Secondary Outcomes (explanation)
On the consumer side, we record whether they choose to interact, and in the rating conditions whether they choose to provide a rating (captured by the variable feedback) and what the rating is (captured by variable rating).

We use two measures of market outcomes, overall market efficiency and consumer surplus. Market efficiency is driven by interaction (allowing surplus generation) and whether there is undertreatment, as undertreatment does not generate consumer value. Given our parametrization, we expect high levels of interactions, such that market efficiency is primarily determined by undertreatment. We normalize market efficiency, with 0% for no interaction and 100% for an interaction with the correct treatment.

Consumer surplus incorporates the prices paid by consumers and is thereby influenced by overcharging, which is not the case for market efficiency. Consumer surplus is reported in absolute value.

Experimental Design

Experimental Design
We plan to use a student sample from the University of Innsbruck and run each experimental condition with 48 subjects (as suggested by our power analysis). Therefore, we plan to run two sessions with 24 subjects each in every experimental condition. All sessions are run computerized using z-Tree and students are recruited using hroot. Participants do not know which experiment they are going to participate in when they register. They only receive information about the expected duration of the experiment (1:45h).
Our experiment is structured as follows for all our conditions:

Stage 1: The experimenter explains the experiment and participants read the instructions.
Stage 2: Participants answer several control questions to ensure they understood the game.
Stage 3: The computer randomly assigns roles and markets to participants.
Stage 4: Participants play the game for 16 periods.
Stage 5: Participants participate in additional games: an individual risk preference task, a dictator game, a lying task, and a trust game.
Stage 6: Participants fill out a questionnaire.
Experimental Design Details
Randomization Method
Randomization is carried out in the experiment by a computer.
Randomization Unit
at the session level
Was the treatment clustered?
Yes

Experiment Characteristics

Sample size: planned number of clusters
6 clusters á 8 individuals per experimental condition.
Sample size: planned number of observations
48 (6 x 8) individuals per experimental condition
Sample size (or number of clusters) by treatment arms
192 (4 x 48) individuals (students at the University of Innsbruck).
Minimum detectable effect size for main outcomes (accounting for sample design and clustering)
Based on previous findings, we performed a power calculation, indicating that we need six clusters á 8 subjects per experimental condition when aiming for a power of 80%.
Supporting Documents and Materials

Documents

Document Name
The Value of Rating Systems in Healthcare Credence Goods Markets
Document Type
other
Document Description
Original Manuscript with health frame.
File
The Value of Rating Systems in Healthcare Credence Goods Markets

MD5: ce0b534262f4a3aa04c19c3f6e22fee3

SHA1: 0c8c5c3d158a408d07c9243bb4f2cb528206717c

Uploaded At: November 09, 2023

IRB

Institutional Review Boards (IRBs)

IRB Name
Leopold-Franzens-Universität Innsbruck, Certificate of good standing.
IRB Approval Date
2017-10-18
IRB Approval Number
40/2017

Post-Trial

Post Trial Information

Study Withdrawal

There is information in this trial unavailable to the public. Use the button below to request access.

Request Information

Intervention

Is the intervention completed?
No
Data Collection Complete
Data Publication

Data Publication

Is public data available?
No

Program Files

Program Files
Reports, Papers & Other Materials

Relevant Paper(s)

Reports & Other Materials