An basic repository with an environment and .gitignore
https://www.thesquirrelcensus.com
The survey data was originally collected and stored in to files available at this link:
https://www.thesquirrelcensus.com/data
For this exercise, we will use an integrated .csv file that combines data and is a little more structured than the raw .csv. This file was downloaded from kaggle.com at this link:
The downloaded file is already added to the class repository in the data/ folder as as 2018_Central_Park_Squirrel_Census_-_Squirrel_Data.csv. You can copy this folder to your new repository on your local machine
-
Create a short name for your group and a new repository for this exercise. Clone the repo to your local machines.
-
Load the combined datafile (either using the url or from your repository's
../datadirectory) usingpd.read_csv(). -
Use standard
df.info(),df.head(),df.describe()to explore the data. -
Read over the user guide here to understand what the various columns contain.
Additional information on the survey and stories are available here:
https://www.thesquirrelcensus.com/data
- Work together to make a new
.ipynbin your repo that contains an analysis and simple visualization of the data!
