I found this project to be much easier than the hurricane analysis. I’ve worked with Pandas in the past doing some NFL analysis but didn’t have a ton of experience with dicts so that was a challenge.
This project took me about 30 minutes to complete.
My Github Repo
Congrats on completing the project.
A few things…It might be a good idea to add a bit more descriptive stats to your EDA.
For example, you can get basic stats by:
df.describe(include='all')
Or, you can do the same with a single column.
df['column_name'].mean()
You’ve already broken out the dataframe by regions so you could compare the different columns by region as well. It’s a suggestion. Data analysts are also storytellers–giving voice to what’s in the data. So, one’s notebook of EDA should tell a story—use the text or markdown cells to explain what you’re doing at different points of the EDA & what you’ve discovered.
Check out the Pandas User Guide too if you ever get stuck.
1 Like