I’m so satisfied!
Submitting this post is like crossing the finish line of my run.
Some thoughts about this experience with Codecademy:
My journey on the Data Scientist Career path was fruitful.
In addition to completing projects (which are a great motivator) and improving my skills, I built a simple SQLite database at work to automate some tasks.
I already knew Python and some of the packages I used before subscribing to Codecademy, but I first encountered SQL in this course, and being able to put what I learned into practice immediately was very satisfying.
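For readers curious what a small SQLite setup driven from Python can look like, here is a minimal sketch. It is not the database from work: the `tasks` table, its columns, and the `pending_tasks` helper are made up for illustration, and it uses an in-memory database so nothing is written to disk.

```python
import sqlite3

# In-memory database for illustration; a real setup would pass a file path.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: one table tracking recurring tasks.
conn.execute(
    "CREATE TABLE tasks ("
    "id INTEGER PRIMARY KEY, "
    "name TEXT NOT NULL, "
    "done INTEGER NOT NULL DEFAULT 0)"
)

# Parameterized inserts keep values separate from the SQL text.
conn.executemany(
    "INSERT INTO tasks (name, done) VALUES (?, ?)",
    [("export report", 1), ("send reminder", 0), ("archive logs", 0)],
)
conn.commit()


def pending_tasks(connection):
    """Return the names of tasks not yet done, in insertion order."""
    cur = connection.execute("SELECT name FROM tasks WHERE done = 0 ORDER BY id")
    return [row[0] for row in cur.fetchall()]


pending = pending_tasks(conn)
print(pending)
```

A script like this can then be scheduled, which is one common way a small database ends up automating routine work.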
And now, ladies and gentlemen, it’s time to introduce my final project …
I was listening to Iron Maiden’s Run to the Hills when I had the idea for this project and I found it to be an appropriate title, given the main topic.
I’ll leave it to you to find out what it’s all about by reading the story on Medium.
- I had to work with XML files, a new challenge that led me to develop a dedicated Python module to better handle the dataset and a notebook describing some details of this process.
This second notebook, which explains data cleaning and EDA, is not included in the Medium story, but you will find it on GitHub.
- I applied some domain knowledge I learned at my job to a different subject. It was engaging!
- It took almost every evening for two weeks to complete this project. I spent most of the time (approx. 60-70% overall) converting, cleaning and exploring the dataset. The last days were dedicated to data visualization and storytelling.
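To give an idea of the XML conversion step mentioned above, here is a minimal sketch using the standard library's `xml.etree.ElementTree`. It is not the module from the repo: the `<record>` layout, the tag names, and the `parse_records` function are assumptions made up for this example, and the real files may look quite different.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; the real files and tag names in the project may differ.
SAMPLE_XML = """
<records>
  <record id="1"><name>alpha</name><value>3.5</value></record>
  <record id="2"><name>beta</name><value></value></record>
</records>
"""


def parse_records(xml_text):
    """Return one dict per <record>, converting types and mapping empty values to None."""
    root = ET.fromstring(xml_text)
    rows = []
    for rec in root.iter("record"):
        raw_value = rec.findtext("value")  # "" for an empty tag, None if the tag is absent
        rows.append(
            {
                "id": int(rec.get("id")),
                "name": rec.findtext("name"),
                "value": float(raw_value) if raw_value else None,
            }
        )
    return rows


rows = parse_records(SAMPLE_XML)
print(rows)
```

A list of flat dicts like this can be handed straight to `pandas.DataFrame(rows)` for the cleaning and EDA steps.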
One last thing worth adding: I used a package that is not covered in the lessons.
I made friends with Plotly several months ago.
I used this tool extensively in this project, and I hope you will find the interactive figures as interesting as I do.
Feedback is welcome.
CODECADEMY WALKTHROUGH: RUN TO THE HILLS
Link to repo at the end of the story