Hi all,
I just finished my first portfolio project on CodeCademy and I would like to see what you think of my work, and how it could be improved. I published it here on GibHub.
I have a basic understanding of Python but not very much experience. I studied economics in uni, so I did already have knowledge of and experience with statistics and research methods. Performing OLS regression analysis in Python was a first for me though. It took me quite a while working on this, in part because I kept on expanding/reviewing my work. Iād say it probably took me a total of 24 hours to complete.
I started off by cleaning the data and putting it in a dataframe, to make it easier to work with. I then ran an initial model, and checked whether the OLS assumptions were satisfied. I then added some more variables and interaction effects, improving the model. Finally, I checked for the OLS assumptions again, based on the final model.
TLDR: step 1, 2 and 8 are the most important sections in my notebook file.
I find it hard to say whether this project was difficult or easy, since I scoped it myself. I used Google and sometimes Microsoft Copilot for help with code, and found this source very helpful for checking OLS assumptions in Python.
I would love it if you could take a look at my project and give me some feedback. Of course, if you have any questions about what I did and why, let me know
Thanks a lot!
Jordy