FAQ: Multiple Linear Regression - Multiple Linear Regression: Scikit-Learn

This community-built FAQ covers the “Multiple Linear Regression: Scikit-Learn” exercise from the lesson “Multiple Linear Regression”.

Paths and Courses
This exercise can be found in the following Codecademy content:

Machine Learning

FAQs on the exercise Multiple Linear Regression: Scikit-Learn


In the exercise, the predicted rent for Sonny's apartment is given as 2071, but I calculated a predicted rent of 2451.48. What could be the reason for this difference?

I got the same answer and I confirmed through the hints that my code matches up. My best guess is that this is another example of a continuity error in the course… maybe the data we are using was updated since they first wrote the exercise… definitely confusing when this happens though.

Guys, is there a typo here?

y_predicted = mlr.predict(x_text)

takes values calculated by .fit() and the x values, plugs them into the multiple linear regression equation, and calculates the predicted y values.

I think x_text should be x_test?
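
For anyone who wants to check the flow described in that sentence, here is a minimal sketch of fit followed by predict, using the x_test name. The data is synthetic (an assumption made so the snippet runs on its own, not the exercise's dataset), and the random_state value is arbitrary.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (an assumption, not the exercise's dataset):
# two features and a target that is an exact linear function of them.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=(100, 2))
y = 3.0 * x[:, 0] + 2.0 * x[:, 1] + 5.0

x_train, x_test, y_train, y_test = train_test_split(
    x, y, train_size=0.8, test_size=0.2, random_state=6
)

mlr = LinearRegression()
mlr.fit(x_train, y_train)          # learns the coefficients and the intercept

# predict() plugs the held-out x values into the fitted equation.
y_predicted = mlr.predict(x_test)  # x_test, not x_text
```

With the name spelled x_text, Python would raise a NameError unless a variable of that name happened to exist, so the misspelling is easy to spot when the line is actually run.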

I’m under the impression that the URL used for this exercise is wrong.
We are training a model to later test on an apartment in Brooklyn, but the URL used is for the Manhattan data. This gives a wrong prediction, yet the hint says that prediction is the correct one.

In my case, the hint says that the final prediction should be $2393.58, but this is only true when using the Manhattan URL.
If I use the Brooklyn URL, the prediction is $2059.95.
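
To illustrate why the choice of training data changes the answer, here is a rough sketch with two made-up data sets. The feature names, coefficients, and the apartment below are all hypothetical and are not taken from the exercise's CSV files; the only point is that the same apartment gets a different prediction depending on which data the model was fitted to.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)

# Hypothetical features: size in square feet and number of bedrooms.
size_sqft = rng.uniform(400, 1500, 200)
bedrooms = rng.integers(0, 4, 200)
features = np.column_stack([size_sqft, bedrooms])

# Two made-up markets with different price structures (assumed numbers).
rent_a = 1000 + 3.0 * size_sqft + 200 * bedrooms
rent_b = 800 + 2.0 * size_sqft + 150 * bedrooms

model_a = LinearRegression().fit(features, rent_a)
model_b = LinearRegression().fit(features, rent_b)

# The same hypothetical apartment, scored by both models.
apartment = [[620, 1]]
pred_a = model_a.predict(apartment)[0]
pred_b = model_b.predict(apartment)[0]

print(f"model fitted on market A: {pred_a:.2f}")
print(f"model fitted on market B: {pred_b:.2f}")
```

If the exercise's hint was produced with one file and your code loads another, a gap like the one reported above is what you would expect to see.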

Hi,
This article, https://www.codecademy.com/article/training-set-vs-validation-set-vs-test-set, describes the three splits of a data set in ML.

I am not sure I understand it clearly.
When we use x_train, x_test, y_train, y_test = train_test_split(x, y, train_size, test_size) in https://www.codecademy.com/courses/machine-learning/lessons/multiple-linear-regression-streeteasy/exercises/scikit-learn
the “training split” is used to build the model
the “test split” is used to test the model
Where is the “validation process” in this exercise? Is the test split used for validation? If so, what then serves as the test split?
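
A single call to train_test_split, like the one quoted above, produces only two splits, so there is no separate validation set in that call. One common way to get all three is to call train_test_split twice. The sketch below assumes synthetic data and a 60/20/20 split; the proportions and the random_state value are illustrative choices, not something the exercise prescribes.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (an assumption, not the exercise's dataset).
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=(100, 3))
y = x @ np.array([1.5, -2.0, 0.5]) + rng.normal(0, 1, 100)

# First split: hold out 20% of all rows as the final test set.
x_rest, x_test, y_rest, y_test = train_test_split(
    x, y, test_size=0.2, random_state=6
)

# Second split: 25% of the remaining 80% (20% of the total) for validation.
x_train, x_val, y_train, y_val = train_test_split(
    x_rest, y_rest, test_size=0.25, random_state=6
)

model = LinearRegression().fit(x_train, y_train)

# The validation score is what you look at while comparing or tuning models.
val_score = model.score(x_val, y_val)
# The test score is checked once at the end, on data the model never saw.
test_score = model.score(x_test, y_test)

print(len(x_train), len(x_val), len(x_test))
```

When there is nothing to tune, as with a plain linear regression fitted once, a two-way train/test split is often considered enough, which may be why the exercise stops there.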