Austin Animal Shelter

by Anton Haugen and Ignacio Ruiz

Sources

-Database
https://www.kaggle.com/aaronschlegel/austin-animal-center-shelter-outcomes-and
-Webscrape
https://docs.thedogapi.com/api-reference/models/breed

Introduction

The goal of this project is to help the Austin Animal Shelter predict how long a dog is likely to spend in the shelter. We used a version of Austin Animal Shelter's data provided on Kaggle by Aaron Schlegel as our base. To complement this data, we used the Dog API and web-scraped the American Kennel Club for breed statistics. In doing so, we hoped to provide a predictive timeline for this and other shelters across the country. Although Austin Animal Shelter does not suffer from these problems itself, we hoped our model's predictions would help shelters prepare for and avoid issues such as inadequate living conditions and avoidable expense.

Model

The Data

While the data we had was a good base, we needed more continuous variables to build a Linear Regression model. Most of the values in our data were categorical, so we binned them. We then created dummy variables for features such as intake_month, age_bins, and color_bins so that these string variables could contribute to our modelling process.
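The dummy-variable step described above can be sketched with pandas. This is a minimal illustration, not the notebook's actual code: the column names intake_month, age_bins, and color_bins come from the text, while the rows, the category labels, and the days_in_shelter column are made up.

```python
import pandas as pd

# Hypothetical toy rows; only the three binned column names come from the README.
df = pd.DataFrame({
    "intake_month": ["Jan", "Feb", "Jan", "Mar"],
    "age_bins": ["puppy", "adult", "senior", "adult"],
    "color_bins": ["dark", "light", "dark", "mixed"],
    "days_in_shelter": [3, 12, 40, 7],  # assumed name for the target
})

# One 0/1 indicator column per category. drop_first=True drops one level per
# variable so the dummies are not perfectly collinear with an intercept.
dummies = pd.get_dummies(
    df,
    columns=["intake_month", "age_bins", "color_bins"],
    drop_first=True,
    dtype=int,
)

print(dummies.columns.tolist())
```

Dropping the first level is a design choice for linear models with an intercept; regularized models such as Lasso can also work with the full set of dummies.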

While analyzing our data, we found that dogs tend to be adopted more on weekends than on any other day.
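A check like the weekend observation above could be done by grouping adoptions by day of week. The sketch below uses made-up rows, and the column names outcome_datetime and outcome_type are assumptions rather than confirmed names from our cleaned data.

```python
import pandas as pd

# Hypothetical outcomes; column names and values are assumed for illustration.
outcomes = pd.DataFrame({
    "outcome_datetime": pd.to_datetime(
        ["2017-03-04", "2017-03-05", "2017-03-06", "2017-03-11", "2017-03-08"]
    ),
    "outcome_type": ["Adoption", "Adoption", "Transfer", "Adoption", "Adoption"],
})

adoptions = outcomes[outcomes["outcome_type"] == "Adoption"]

# Count adoptions per weekday name.
by_day = adoptions["outcome_datetime"].dt.day_name().value_counts()

# dayofweek is 0 for Monday through 6 for Sunday, so >= 5 marks the weekend.
weekend_share = adoptions["outcome_datetime"].dt.dayofweek.ge(5).mean()

print(by_day)
print(f"weekend share of adoptions: {weekend_share:.2f}")
```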

We discovered through data cleaning that, aside from strays, most dogs were surrendered by their owners, which gives some insight into what intakes the center can expect.

Modeling

Since our data is not normally distributed, we decided to also explore a Poisson distribution for our predictive model in addition to our Linear Regressions.

We started with linear regression. Without polynomial interactions, our linear regression model produced a quite low R².

Comparing feature selection methods (K-best, T score, and Lasso), we found that K-best and T score were better fits when using polynomial interactions.
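The comparison above can be sketched with scikit-learn on synthetic data. Everything here is an assumption made for illustration: the data is random, k=5 is arbitrary, and the pipelines are not the ones in Final_Linear_Models.ipynb. The T score selection is not shown.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.linear_model import LassoCV, LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

# Synthetic stand-in data: the target depends on one feature and one interaction.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = 2.0 * X[:, 0] + 3.0 * X[:, 1] * X[:, 2] + rng.normal(scale=0.5, size=300)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)


def interactions():
    # Pairwise products of features, without squared terms or a bias column.
    return PolynomialFeatures(degree=2, interaction_only=True, include_bias=False)


models = {
    # Baseline: linear regression on the raw features only.
    "plain": make_pipeline(StandardScaler(), LinearRegression()),
    # Interactions, then keep the 5 features with the highest F-statistic.
    "kbest": make_pipeline(
        interactions(), StandardScaler(), SelectKBest(f_regression, k=5), LinearRegression()
    ),
    # Interactions, then let the L1 penalty shrink unhelpful coefficients to zero.
    "lasso": make_pipeline(interactions(), StandardScaler(), LassoCV(cv=5, random_state=0)),
}

scores = {name: m.fit(X_train, y_train).score(X_test, y_test) for name, m in models.items()}
for name, r2 in scores.items():
    print(f"{name}: test R^2 = {r2:.3f}")
```

Scoring on a held-out split rather than the training data is what makes overfitting visible when many interaction terms are added.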

The model tended to overfit, but the best option we found was to use Lasso for the final model. When we tried our Poisson model, we realized that models based on our features were not capturing extreme values, even when the dependent variable was capped at 60.
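A Poisson fit with a capped target can be sketched as follows. This assumes scikit-learn's PoissonRegressor and synthetic data; the README does not state which library was used for the Poisson model, and only the cap of 60 comes from the text.

```python
import numpy as np
from sklearn.linear_model import PoissonRegressor

# Synthetic counts standing in for days in shelter (values are made up).
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 3))
days = rng.poisson(lam=np.exp(2.0 + 0.4 * X[:, 0]))

# Cap the dependent variable at 60, as described in the text.
days_capped = np.clip(days, 0, 60)

# alpha=0.0 turns off the default L2 penalty so this is a plain Poisson GLM.
model = PoissonRegressor(alpha=0.0, max_iter=1000).fit(X, days_capped)
pred = model.predict(X)

# Compare observed and predicted values among the longest observed stays.
tail = days_capped >= np.quantile(days_capped, 0.95)
tail_gap = (days_capped[tail] - pred[tail]).mean()
print(f"mean under-prediction in the top 5% of stays: {tail_gap:.1f} days")
```

Inspecting the residuals in the upper tail is one way to see whether a model is missing extreme values.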

We performed a secondary EDA and found that most of the dogs with these extreme values were Pit Bulls.

Through a one-way ANOVA, we found that the mean number of days for Pit Bulls was statistically significantly different from that of other breeds.
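A one-way ANOVA of this kind can be run with scipy.stats.f_oneway. The sketch below uses synthetic stay lengths; the comparison breeds and all numbers are made up and do not reflect the shelter data.

```python
import numpy as np
from scipy.stats import f_oneway

# Synthetic days-in-shelter samples per breed group (hypothetical values).
rng = np.random.default_rng(2)
pit_bull = rng.poisson(lam=25, size=200)
labrador = rng.poisson(lam=10, size=200)
chihuahua = rng.poisson(lam=9, size=200)

# Null hypothesis: all groups share the same mean number of days.
f_stat, p_value = f_oneway(pit_bull, labrador, chihuahua)
print(f"F = {f_stat:.1f}, p = {p_value:.3g}")
```

A small p-value only says that at least one group mean differs; identifying which group would need a follow-up comparison.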

So we reengineered our models to introduce an 'is_pitbull' feature, which improved our models.
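One way to build such a flag is a case-insensitive substring match on the breed text. The sketch assumes a breed column with free-text labels; the matching rule and the sample rows are illustrative, and only the feature name is_pitbull comes from the text.

```python
import pandas as pd

# Hypothetical breed labels; the column name "breed" is an assumption.
dogs = pd.DataFrame({
    "breed": [
        "Pit Bull Mix",
        "Labrador Retriever",
        "American Pit Bull Terrier",
        "Chihuahua Shorthair/Pit Bull",
        None,
    ]
})

# 1 if the breed text mentions "pit bull" (any casing), 0 otherwise.
# na=False treats missing breeds as non-matches.
dogs["is_pitbull"] = (
    dogs["breed"].str.contains("pit bull", case=False, na=False).astype(int)
)

print(dogs)
```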

Though the Poisson model provided great insights, our best predictive model was the one built with Lasso feature selection, because it captured interactions between features that our Poisson model could not.

Conclusions

While we initially believed that adopters' breed preferences would cancel each other out, the EDA for our final model showed us how significant cultural factors were to dog selection. For a future predictive model, we would like to find other factors contributing to extreme values that are not represented in the data set. Considering how Austin's demographics are changing, we would also like to find ways to better account for adopter preferences and cultural milieu compared to those of former owners.
