Uncategorized

8.10 MLR: Donors Choose Overview: You are a manager at DonorsChoose.org and

8.10 MLR: Donors Choose

Overview:

You are a manager at DonorsChoose.org and you want to find a way to improve the quantity and amount of donations given to schools, teachers, and projects. But to do this, you must first understand what causes people to donate and what determines how much they donate. Therefore, you have extracted a sizable dataset for a single county of all donations given to any school/project/teacher in that county. Use the dataset provided to learn about the causes of donations and to answer all questions below.

Data Source:

CSV file containing 44740 records of actual donations given through the DonorsChoose.org website for an anonymous county in the United States.

Features about the donation

Donation Amount: the amount of the donation in US dollars; use this as the label

Optional Donation: whether the included an amount above the standard value requested per donation

Weekday: day of the week that the donation was made

Month: month of the year that the donation was made

Features about the donor

Donor Is Teacher: whether the donation was made by a teacher

Features about the project

Project Current Status: whether the project is fully funded, expired, or live

Project Type: Teacher-Led, Student-Led, Professional Development

Project Subject Category Tree: the school category of the project; there can be one or two values assigned to this category

Project Grade Level: grade level of the students targeted for this project

Project Cost: the entire proposed/requested cost of the project

Days Live: the number of days that the project was live at the time the donation was made

Features about the teacher

Teacher Prefix: Ms. Mrs., Mr. Teacher

Features about the school

School Metro Type: urban, suburban, unknown, rural, town

School Percentage Free Lunch: the percent of students who were getting free lunch when the donation was made

Features about the resources that the donation is intended to purchase

Resource Category: category of the resource that money is being raised to purchase

Resource Quantity: the number of resources needing to be purchased

Resource Unit Price: the cost per resource

Requirements:

Answer the questions below by completing the tasks they require and/or generating an appropriate analysis to answer the question. These questions may require skills you learned about both exploratory data analyses and modeling.

Single best model: your goal is to select the variables that will give you the highest R squared score possible to predict Donation Amount. Remember that Excel can only include up to 16 variables.

Your grade on this portion is based entirely on your R squared score. It will not be based on a curve, but rather, on your ability to meet certain thresholds. These thresholds will not be shared until after all submissions are received.

You must follow these rules:

Use every row in the dataset.

Do not arbitrarily change any of the data.

Use only Excel.

Every feature retained in the model must have a p-value below 0.25.

You can clean or transform the features in any way you want as long as the formula is consistent across all rows.

For example, you can create dummy codes or non-linear transformations.

Deliverables:

The Excel data file including your final model.

Answer the questions below.

There are two attempts on this assessment and the highest score will keep.

Points: 100

Resource Files:1 file

donors_choose.xlsx