Hello everyone, I am happy to share with you my recent project called “U.S. Medical Insurance Costs”, I will wait for your comments. thank you so much
Some considerations:
-
You kind of just jump into the code w/o a brief intro first. One or two sentences will suffice here. The notebook should have an intro (data citation/origins, initial questions), then move into the body: descriptive stats analysis/vizzes, & finally, conclusions/next steps.
-
I would also be mindful of using subjective language when it comes to BMI. Terms like “underweight”, “normal”, “obese” are all subjective. In fact, I’d avoid any discussions of it (bmi) at all b/c it’s archaic and not an accurate health indicator (just google it). There are other variables one can focus on that might have a connection to charges: region, children v. no children, men v. women, smoker v. non.
-
You might want to look at some basic, descriptive stats in the data first. How many people in each region, how many men v. women, smokers v. non, etc. And, rather focus on the mean of the column
charges
, look at the median. There are outliers in the dataset that pull the mean. -
You have a good grasp of function writing. Is there a way for you to limit the output so one doesn’t have to view a wall of text?