FAQ: Naive Bayes Classifier - Formatting the Data for scikit-learn

This community-built FAQ covers the “Formatting the Data for scikit-learn” exercise from the lesson “Naive Bayes Classifier”.

Paths and Courses
This exercise can be found in the following Codecademy content:

Data Science

FAQs on the exercise Formatting the Data for scikit-learn

Join the Discussion. Help a fellow learner on their journey.

Ask or answer a question about this exercise by clicking reply (reply) below!

Agree with a comment or answer? Like (like) to up-vote the contribution!

Need broader help or resources? Head here.

Looking for motivation to keep learning? Join our wider discussions.

Learn more about how to use this guide.

Found a bug? Report it!

Have a question about your account or billing? Reach out to our customer support team!

None of the above? Find out where to ask other questions here!

Hi, I am new to this method.

from sklearn.feature_extraction.text import CountVectorizer

After .fit(neg_list,pos_list), why the result would be different between CV.transform(pos_list+neg_list) and CV.transform(neg_list+pos_list)?