U.S. Medical Insurance Costs - Cost By Region

I found this project to be much easier than the hurricane analysis. I’ve worked with Pandas in the past doing some NFL analysis but didn’t have a ton of experience with dicts so that was a challenge.

This project took me about 30 minutes to complete.

My Github Repo

Congrats on completing the project.

A few things…It might be a good idea to add a bit more descriptive stats to your EDA.
For example, you can get basic stats by:

df.describe(include='all')

Or, you can do the same with a single column.

df['column_name'].mean()

You’ve already broken out the dataframe by regions so you could compare the different columns by region as well. It’s a suggestion. Data analysts are also storytellers–giving voice to what’s in the data. So, one’s notebook of EDA should tell a story—use the text or markdown cells to explain what you’re doing at different points of the EDA & what you’ve discovered.
Check out the Pandas User Guide too if you ever get stuck.

1 Like