In this project, I created a regression model on insurance cost based on the parameters provided for each patient.
I used both linear regression (Ridge) and non-linear regression (Random Forest). I used the mean absolute error as the scoring parameter. What could be done to improve the accuracy of the model? Should I use outlier detection?
Do you agree with my findings regarding the importance of each feature?
Thank you very much for your time.