I would just like to express the sentiment that I saw from some other users, this seems out of order for the Data Scientist Career Path, I will come back to this, but since we haven’t covered Pandas at all, I have no idea what is happening at all. This segment on Data Scraping should probably be elsewhere in this Career Path.
Agree! And all that fully refers to the Data Analyst Career Path It seems like someone intentionally ‘forgot’ this section there to make a beautiful soup out of your brains!
This is what I came here to find out. I have completed a few skill paths and reading books while I complete this path, so I have some experience with Pandas and Regex, but this stuff definitely wasn’t covered prior to being asked to clean up this data. I was making my way through this lesson just fine, really starting to understand the capabilities of Python, and then I ran into that wall where I was being asked to do things that were never covered. I managed to figure it out with what I knew + some reading, but they should really consider ordering these lessons better so as to avoid he discouragement that inevitably follows when you feel completely lost.
I finished the assignment with a slightly different method from the class, and the result DataFrame in the end seems correct except that I don’t know how to add the turtle_name column name.
same here. i have no experience with panda, html and css stuff. i went through the class pretty struggle. trying to understand each concept took me hours. i completed the task by looking through each hints>>>>>>>>>> pretty sad. but i will keep up learning. may be a tutorial on Pandas is a good starting point for me too.
I’ve gone through separate Pandas, Regex and Matplotlib courses before this one and it still didn’t help much because there were no methods that are needed here, not to mention the sacred knowledge of CSS selectors that have been used in this lesson but haven’t been explained one bit anywhere. I’ve gone through this lesson with obscenities on nearly every paragraph and with the help of Senior dev with 1000 years of exp. So we all are in the same boat here and none has been done about it since 2019.
Hi! @codecademy
It has been over a year and I still suffer from the problem a lot of people mentioned here.
There is not enough lesson in Data Science Career Path about data cleaning, data frames, Pandas, etc! I even do not know what is going on here. Looking then solutions here and I am lost. This makes me unsafe about the platform, and the material you provide. How could I trust that everything I need is given here or the education here has good quality content?
I kindly ask you to revise this section. It is really discouraging for new beginners.
Hi! Thank you so much for the feedback!! My name is Michelle and I’m the Domain Manager for the Data Science Domain – which means that I have the power to fix this!!
The fix might not be immediate, but I want to be sure that we are preparing you with everything you need to be successful. If you are willing, could you tell me what version of the Career Path you are in? This will better help me figure out where to address my attention and make sure we are talking about the same thing.
Secondly, I want to be transparent that I will be out of office next week, so might be a little slow to respond.
I am glad to hear that you can fix this! I have been enrolled in Data Scientist 2021 Version.
Thanks for the reply. I will be waiting for your response when you get back.
Enjoy your vacay!
Hi Hande,
First, thank you so much for flagging this (sorry for the delay – I’m finally catching up to myself)! The 2021 version of the Career Path is a bit outdated (and is not really available for new people to enroll). I’m sure you noticed that we launched a new version with a new set of content items. This new version has a lot more about cleaning data, working with variable types, and handling missing data. If you are open to it, I’d recommend checking out the Data Scientist: Machine Learning Specialist Career Path, to see what your relative completion would look like. It covers a lot more foundational ground.
I also want to let you know that we are currently building an all-new, hands-on course with Jupyter notebooks “Python for Data Science” (end of this year), which is really data science-focused, highlighting the iterative process, and putting data first in all activities.
I’m also personally working on revising the Handling Missing Data course, which will go even deeper into techniques to work with messy datasets.
I hope that this can address some of your concerns. If not, please do reply here because I want to be sure that we are preparing you (and all our learners) to feel confident about your skills as a Data Scientist.