Intervention (Hidden)
E's current business model uses a machine learning algorithm trained on applicant characteristics as well as unconventional smartphone data in order to make a credit decision within 15 minutes of an application being made. The lending process works as follows: On the company's proprietary mobile application, an applicant fills out a questionnaire that asks for standard information on employment status, income, etc. (hereafter referred to as "hard" or "limited" information). In addition to these hard data points, E also has access to "soft" information from other, past borrowers' smartphone data. Soft information includes metrics such as social media usage, the time of day individuals make phone calls, etc. In order to make a new lending decision, E feeds in both hard and soft data into a machine learning algorithm that creates a user specific credit score, which is then used to determine credit approval, loan size, and interest rate.
Once approved for a loan, an applicant must provide additional information (hereafter referred to as "identifying" information), including name, age, gender, a picture and their location in order to satisfy E's Know Your Client (KYC) requirements. Importantly, the applicant agrees that E can access her own smartphone data if she accepts the loan. Thus, through this iterative process, the algorithm uses soft data from past borrowers - who share observable characteristics with a current applicant - in order to inform new lending decisions.
We have formed a research partnership with a leading microfinance bank in Pakistan, hereafter referred to as H, in order to recruit loan officers for the proposed experiment. These loan officers will spend two days with the research team, where they will evaluate loan applications that E has already received and approved in the past. Given that these loans have already been administered, I can observe their repayment outcomes ex-ante. The officers will have access to all the information that was entered by an applicant on E's app when they applied for a loan. Officers will not be informed of the fact that loan decisions have already been made. Instead, they will be told that their help is being solicited to screen digital loan applications, and that their approval or rejection of a loan is "real", in that funds will be disbursed if they choose to approve an application. All credit decisions will be made on a custom-made web interface.
There might be concern that I am only able to evaluate those loans that were approved by E. However, when E was testing its product, it randomly approved around 60%-65% of loan applications that would otherwise have been rejected by its algorithm. This unique feature allows me to observe default outcomes across the full support of the applicant pool rather than for only those applicants that were approved by the algorithm.