The final project for BC1016 provides an opportunity to bring together, apply, and communicate your knowledge of data science and statistics from this course. You will work in groups of 2 to choose one of 4 provided datasets to analyze and submit a writeup of your analysis and conclusions in the form of a Jupyter Notebook.
Your team will pick one of four possible datasets to explore:
food
- A dataset with info on the results of NYC restaurant inspections, including the name of establishments, borough, cuisine, inspection grade, and types of violations [🔗 starter notebook & dataset]public health
- Data from a 1987 National Indonesia Contraceptive Prevalence survey which looks at the relationships between a mother and partner’s age, education, religious beliefs, and income level as well as contraceptive use and number of children [🔗 starter notebook & dataset]music
- A dataset of top streamed tracks from 2023, which includes information like a track’s artist and album, release date, and audio characteristics (like danceability, energy, and liveliness) [🔗 starter notebook & dataset]pets
- Data on registered pets (largely cats and dogs) in Seattle, including breed, pet owner’s zip code and income, and local parks by zip code [🔗 starter notebook & dataset]We plan to release the starter notebooks for these datasets by Wednesday 4/9
Group Declaration - due Fri 4/11
at 11:59pm
Project Proposal - due Mon 4/14
at 11:59pm
firstPersonLastName-secondPersonLastName.pdf
)Progress Report - due Fri 4/25
at 11:59
both a PDF and your Jupyter Notebook file (.ipynb)
to Courseworks (each person on the team should submit) using the file name convention firstPersonLastName-secondPersonLastName.zip/.ipynb
Final Project Report - Due Fri 5/9
at 11:59
both a PDF and your Jupyter Notebook file (.ipynb)
to Courseworks (each person on the team should submit) using the file name convention firstPersonLastName-secondPersonLastName.zip/.ipynb
Note: Peer review is required. Failure to complete the 2 steps of the peer review will lead to an automatic deduction of 20 points from your individual grade on the final project
.Grading Breakdown