FAQ: Getting Started with Natural Language Processing - Topic Models

This community-built FAQ covers the “Topic Models” exercise from the lesson “Getting Started with Natural Language Processing”.

Paths and Courses
This exercise can be found in the following Codecademy content:

Natural Language Processing

FAQs on the exercise Topic Models

There are currently no frequently asked questions associated with this exercise – that’s where you come in! You can contribute to this section by offering your own questions, answers, or clarifications on this exercise. Ask or answer a question by clicking reply (reply) below.

If you’ve had an “aha” moment about the concepts, formatting, syntax, or anything else with this exercise, consider sharing those insights! Teaching others and answering their questions is one of the best ways to learn and stay sharp.

Join the Discussion. Help a fellow learner on their journey.

Ask or answer a question about this exercise by clicking reply (reply) below!

Agree with a comment or answer? Like (like) to up-vote the contribution!

Need broader help or resources? Head here.

Looking for motivation to keep learning? Join our wider discussions.

Learn more about how to use this guide.

Found a bug? Report it!

Have a question about your account or billing? Reach out to our customer support team!

None of the above? Find out where to ask other questions here!

Why do words in the stopwords list still appear in the Topics found by tf-idf LDA ?

1 Like

Why does every run give a new, different set of topic words in both versions?

I’m a bit lost on this exercise, as well. Would it be possible for a moderator or instructional lead to break down what’s happening in the code? In previous exercises, those comments in the code have been incredibly helpful.

1 Like

Hi everyone!
I am confused about how to read the trees. I can’t make much sense out of it. It just looks like a bunch of words and lines that follow no particular order. Are we going to be taught how to understand it later on in the course?

Also in an unrelated question:
I am a Jamaican. In Jamaica we speak an English Dialect called Patois. If I wanted to create a chatbot that was best suited for my Jamaican Patios speaking audience, would I follow these same rules, functions and models or would I have to do something else? Looking forward to hear your response!

Completely agree. The exercises are meant to help us understand how the tools work, but without additional instruction, I don’t really know what’s happening.

Hello! Because bag of words LDA topics based on filtered_for_stops while tf-idf LDA topics based on preprocessed_corpus. Also you can compare strings 41 and 47 and can find that different variables were used: bag_of_words_creator and tfidf_creator.