In your previous courses, you learned about the following topics:
Exploratory data analysis: Exploring data for insights and patterns
Inferential statistics: Making inferences about the population using the sample data
These methods help you formulate a basic idea or conclusion about the population. Such assumptions are called “hypotheses.” But how do you confirm these conclusions or hypotheses? Let’s see.
Let’s understand the basic difference between inferential statistics and hypothesis testing.
Inferential statistics is used to find some population parameter (mostly population mean) when you have no initial number to start with. So, you start with the sampling activity and find out the sample mean. Then, you estimate the population mean from the sample mean using the confidence interval.
Hypothesis testing is used to confirm your conclusion (or hypothesis) about the population parameter (which you know from EDA or your intuition). Through hypothesis testing, you can determine whether there is enough evidence to conclude if the hypothesis about the population parameter is true or not.
Both these modules have a few similar concepts, so don’t confuse terminology used in hypothesis testing with inferential statistics.
Let’s get started by understanding the basics of hypothesis testing.
Hypothesis Testing starts with the formulation of these two hypotheses:
Null hypothesis (H₀): The status quo
Alternate hypothesis (H₁): The challenge to the status quo
Now, having got a brief idea about what hypothesis testing is, in the next page, we will look at its different aspects in detail, starting with the formulation of the null and alternate hypotheses.