8.10 MLR: Donors Choose Overview: You are a manager at DonorsChoose.org and
8.10 MLR: Donors Choose
Overview:
You are a manager at DonorsChoose.org and you want to find a way to improve the quantity and amount of donations given to schools, teachers, and projects. But to do this, you must first understand what causes people to donate and what determines how much they donate. Therefore, you have extracted a sizable dataset for a single county of all donations given to any school/project/teacher in that county. Use the dataset provided to learn about the causes of donations and to answer all questions below.
Data Source:
CSV file containing 44740 records of actual donations given through the DonorsChoose.org website for an anonymous county in the United States.
Features about the donation
Donation Amount: the amount of the donation in US dollars; use this as the label
Optional Donation: whether the included an amount above the standard value requested per donation
Weekday: day of the week that the donation was made
Month: month of the year that the donation was made
Features about the donor
Donor Is Teacher: whether the donation was made by a teacher
Features about the project
Project Current Status: whether the project is fully funded, expired, or live
Project Type: Teacher-Led, Student-Led, Professional Development
Project Subject Category Tree: the school category of the project; there can be one or two values assigned to this category
Project Grade Level: grade level of the students targeted for this project
Project Cost: the entire proposed/requested cost of the project
Days Live: the number of days that the project was live at the time the donation was made
Features about the teacher
Teacher Prefix: Ms. Mrs., Mr. Teacher
Features about the school
School Metro Type: urban, suburban, unknown, rural, town
School Percentage Free Lunch: the percent of students who were getting free lunch when the donation was made
Features about the resources that the donation is intended to purchase
Resource Category: category of the resource that money is being raised to purchase
Resource Quantity: the number of resources needing to be purchased
Resource Unit Price: the cost per resource
Requirements:
Answer the questions below by completing the tasks they require and/or generating an appropriate analysis to answer the question. These questions may require skills you learned about both exploratory data analyses and modeling.
Single best model: your goal is to select the variables that will give you the highest R squared score possible to predict Donation Amount. Remember that Excel can only include up to 16 variables.
Your grade on this portion is based entirely on your R squared score. It will not be based on a curve, but rather, on your ability to meet certain thresholds. These thresholds will not be shared until after all submissions are received.
You must follow these rules:
Use every row in the dataset.
Do not arbitrarily change any of the data.
Use only Excel.
Every feature retained in the model must have a p-value below 0.25.
You can clean or transform the features in any way you want as long as the formula is consistent across all rows.
For example, you can create dummy codes or non-linear transformations.
Deliverables:
The Excel data file including your final model.
Answer the questions below.
There are two attempts on this assessment and the highest score will keep.
Points: 100
Resource Files:1 file
donors_choose.xlsx
