FAQ: Hypothesis Testing: Associations - Introduction

This community-built FAQ covers the “Introduction” exercise from the lesson “Hypothesis Testing: Associations”.

Paths and Courses
This exercise can be found in the following Codecademy content:

Data Scientist: Machine Learning Specialist
Master Statistics with Python
Data Scientist: Inference Specialist
Machine Learning Foundations

Hypothesis Testing with Python
Hypothesis Testing: Associations

FAQs on the exercise Introduction

There are currently no frequently asked questions associated with this exercise – that’s where you come in! You can contribute to this section by offering your own questions, answers, or clarifications on this exercise. Ask or answer a question by clicking reply (reply) below.

If you’ve had an “aha” moment about the concepts, formatting, syntax, or anything else with this exercise, consider sharing those insights! Teaching others and answering their questions is one of the best ways to learn and stay sharp.

Join the Discussion. Help a fellow learner on their journey.

Ask or answer a question about this exercise by clicking reply (reply) below!
You can also find further discussion and get answers to your questions over in Language Help.

Agree with a comment or answer? Like (like) to up-vote the contribution!

Need broader help or resources? Head to Language Help and Tips and Resources. If you are wanting feedback or inspiration for a project, check out Projects.

Looking for motivation to keep learning? Join our wider discussions in Community

Learn more about how to use this guide.

Found a bug? Report it online, or post in Bug Reporting

Have a question about your account or billing? Reach out to our customer support team!

None of the above? Find out where to ask other questions here!

I am reading the Introduction to Hypothesis Testing (Simulating a One-Sample T-Test) and am confused about something and was hoping this can be cleared up.

It says in Option 1:
“Alternative Hypothesis: The sample of 100 scores earned by Statistics Academy students came from a population with an average score that is greater than 29.92.
In this case, we want to know the probability of observing a sample average greater than or equal to 31.16 given that the null hypothesis is true. Visually, this is the proportion of the null distribution that is greater than or equal to 31.16 (shaded in red below). Here, the red region makes up about 3.1% of the total distribution. This proportion, which is usually written in decimal form (i.e., 0.031), is called a p-value.”

From my understanding, does this mean that there is only a 3.1% chance of a Statistics Academy student getting greater or equal to 31.16.

And then towards the end of the lesson it says:
“Impact of the Alternative Hypothesis
Note that different alternative hypotheses can lead to different conclusions. For example, the one-sided test described above (p = 0.031) would lead a data scientist to reject the null at a 0.05 significance level. Meanwhile, a two-sided test on the same data leads to a p-value of 0.062, which is greater than the 0.05 threshold. Thus, for the two-sided test, a data scientist could not reject the null hypothesis. This highlights the importance of choosing an alternative hypothesis ahead of time!”

Why would I reject the NULL hypothesis test if there was only a 3.1% chance of 100 Statistics Academy students getting a score greater than or equal to 31.16?