Inferential Statistics
Inferential statistics is a branch of statistics that makes the use of various analytical tools to draw inferences about the population data from sample data. Apart from inferential statistics, descriptive statistics forms another branch of statistics. Inferential statistics help to draw conclusions about the population while descriptive statistics summarizes the features of the data set.
There are two main types of inferential statistics - hypothesis testing and regression analysis. The samples chosen in inferential statistics need to be representative of the entire population. In this article, we will learn more about inferential statistics, its types, examples, and see the important formulas.
1. | What is Inferential Statistics? |
2. | Types of Inferential Statistics |
3. | Inferential Statistics Examples |
4. | Inferential Statistics vs Descriptive Statistics |
5. | FAQs on Inferential Statistics |
What is Inferential Statistics?
Inferential statistics helps to develop a good understanding of the population data by analyzing the samples obtained from it. It helps in making generalizations about the population by using various analytical tests and tools. In order to pick out random samples that will represent the population accurately many sampling techniques are used. Some of the important methods are simple random sampling, stratified sampling, cluster sampling, and systematic sampling techniques.
Inferential Statistics Definition
Inferential statistics can be defined as a field of statistics that uses analytical tools for drawing conclusions about a population by examining random samples. The goal of inferential statistics is to make generalizations about a population. In inferential statistics, a statistic is taken from the sample data (e.g., the sample mean) that used to make inferences about the population parameter (e.g., the population mean).
Types of Inferential Statistics
Inferential statistics can be classified into hypothesis testing and regression analysis. Hypothesis testing also includes the use of confidence intervals to test the parameters of a population. Given below are the different types of inferential statistics.
Hypothesis Testing
Hypothesis testing is a type of inferential statistics that is used to test assumptions and draw conclusions about the population from the available sample data. It involves setting up a null hypothesis and an alternative hypothesis followed by conducting a statistical test of significance. A conclusion is drawn based on the value of the test statistic, the critical value, and the confidence intervals. A hypothesis test can be left-tailed, right-tailed, and two-tailed. Given below are certain important hypothesis tests that are used in inferential statistics.
Z Test: A z test is used on data that follows a normal distribution and has a sample size greater than or equal to 30. It is used to test if the means of the sample and population are equal when the population variance is known. The right tailed hypothesis can be set up as follows:
Null Hypothesis: \(H_{0}\) : \(\mu = \mu_{0}\)
Alternate Hypothesis: \(H_{1}\) : \(\mu > \mu_{0}\)
Test Statistic: z = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\). \(\overline{x}\) is the sample mean, \(\mu\) is the population mean, \(\sigma\) is the population standard deviation and n is the sample size.
Decision Criteria: If the z statistic > z critical value then reject the null hypothesis.
T Test: A t test is used when the data follows a student t distribution and the sample size is lesser than 30. It is used to compare the sample and population mean when the population variance is unknown. The hypothesis test for inferential statistics is given as follows:
Null Hypothesis: \(H_{0}\) : \(\mu = \mu_{0}\)
Alternate Hypothesis: \(H_{1}\) : \(\mu > \mu_{0}\)
Test Statistics: t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\)
Decision Criteria: If the t statistic > t critical value then reject the null hypothesis.
F Test: An f test is used to check if there is a difference between the variances of two samples or populations. The right tailed f hypothesis test can be set up as follows:
Null Hypothesis: \(H_{0}\) : \(\sigma_{1}^{2} = \sigma_{2}^{2}\)
Alternate Hypothesis: \(H_{1}\) : \(\sigma_{1}^{2} > \sigma_{2}^{2}\)
Test Statistic: f = \(\frac{\sigma_{1}^{2}}{\sigma_{2}^{2}}\), where \(\sigma_{1}^{2}\) is the variance of the first population and \(\sigma_{2}^{2}\) is the variance of the second population.
Decision Criteria: If the f test statistic > f test critical value then reject the null hypothesis.
Confidence Interval: A confidence interval helps in estimating the parameters of a population. For example, a 95% confidence interval indicates that if a test is conducted 100 times with new samples under the same conditions then the estimate can be expected to lie within the given interval 95 times. Furthermore, a confidence interval is also useful in calculating the critical value in hypothesis testing.
Apart from these tests, other tests used in inferential statistics are the ANOVA test, Wilcoxon signed-rank test, Mann-Whitney U test, Kruskal-Wallis H test, etc.
Regression Analysis
Regression analysis is used to quantify how one variable will change with respect to another variable. There are many types of regressions available such as simple linear, multiple linear, nominal, logistic, and ordinal regression. The most commonly used regression in inferential statistics is linear regression. Linear regression checks the effect of a unit change of the independent variable in the dependent variable. Some important formulas used in inferential statistics for regression analysis are as follows:
The straight line equation is given as y = \(\alpha\) + \(\beta x\), where \(\alpha\) and \(\beta\) are regression coefficients.
\(\beta = \frac{\sum_{1}^{n}\left ( x_{i}-\overline{x} \right )\left ( y_{i}-\overline{y} \right )}{\sum_{1}^{n}\left ( x_{i}-\overline{x} \right )^{2}}\)
\(\beta = r_{xy}\frac{\sigma_{y}}{\sigma_{x}}\)
\(\alpha = \overline{y}-\beta \overline{x}\)
Here, \(\overline{x}\) is the mean, and \(\sigma_{x}\) is the standard deviation of the first data set. Similarly, \(\overline{y}\) is the mean, and \(\sigma_{y}\) is the standard deviation of the second data set.
Inferential Statistics Examples
Inferential statistics is very useful and cost-effective as it can make inferences about the population without collecting the complete data. Some inferential statistics examples are given below:
- Suppose the mean marks of 100 students in a particular country are known. Using this sample information the mean marks of students in the country can be approximated using inferential statistics.
- Suppose a coach wants to find out how many average cartwheels sophomores at his college can do without stopping. A sample of a few students will be asked to perform cartwheels and the average will be calculated. Inferential statistics will use this data to make a conclusion regarding how many cartwheel sophomores can perform on average.
Inferential Statistics vs Descriptive Statistics
Descriptive and inferential statistics are used to describe data and make generalizations about the population from samples. The table given below lists the differences between inferential statistics and descriptive statistics.
Inferential Statistics | Descriptive Statistics |
---|---|
Inferential statistics are used to make conclusions about the population by using analytical tools on the sample data. | Descriptive statistics are used to quantify the characteristics of the data. |
Hypothesis testing and regression analysis are the analytical tools used. | Measures of central tendency and measures of dispersion are the important tools used. |
It is used to make inferences about an unknown population | It is used to describe the characteristics of a known sample or population. |
Measures of inferential statistics are t-test, z test, linear regression, etc. | Measures of descriptive statistics are variance, range, mean, median, etc. |
Related Articles:
Important Notes on Inferential Statistics
- Inferential statistics makes use of analytical tools to draw statistical conclusions regarding the population data from a sample.
- Hypothesis testing and regression analysis are the types of inferential statistics.
- Sampling techniques are used in inferential statistics to determine representative samples of the entire population.
- Z test, t-test, linear regression are the analytical tools used in inferential statistics.
Examples on Inferential Statistics
-
Example 1: After a new sales training is given to employees the average sale goes up to $150 (a sample of 25 employees was examined) with a standard deviation of $12. Before the training, the average sale was $100. Check if the training helped at \(\alpha\) = 0.05.
Solution: The t test in inferential statistics is used to solve this problem.
\(\overline{x}\) = 150, \(\mu\) = 100, s = 12, n = 25
\(H_{0}\) : \(\mu = 100\)
\(H_{1}\) : \(\mu > 100\)
t = \(\frac{\overline{x}-\mu}{\frac{s}{\sqrt{n}}}\)
= 20.83
The degrees of freedom is given by 25 - 1 = 24
Using the t table at \(\alpha\) = 0.05, the critical value is T(0.05, 24) = 1.71
As 20.83 > 1.71 thus, the null hypothesis is rejected and it is concluded that the training helped in increasing the average sales.
Answer: Reject Null Hypothesis.
-
Example 2: A test was conducted with the variance = 108 and n = 8. Certain changes were made in the test and it was again conducted with variance = 72 and n = 6. At a 0.05 significance level was there any improvement in the test results?
Solution: The f test in inferential statistics will be used
\(H_{0}\) : \(s_{1}^{2} = s_{2}^{2}\)
\(H_{1}\) : \(s_{1}^{2} > s_{2}^{2}\)
\(n_{1}\) = 8, \(n_{2}\) = 6
\(df_{1}\) = 8 - 1 = 7
\(df_{2}\) = 6 - 1 = 5
\(s_{1}^{2}\) = 108, \(s_{2}^{2}\) = 72
The f test formula is given as follows:
F = \(\frac{s_{1}^{2}}{s_{2}^{2}}\) = 106 / 72
F = 1.5
Now from the F table the critical value F(0.05, 7, 5) = 4.88
As 4.88 < 1.5, thus, we fail to reject the null hypothesis and conclude that there is not enough evidence to suggest that the test results improved.
Answer: Fail to reject the null hypothesis.
-
Example 3: After a new sales training is given to employees the average sale goes up to $150 (a sample of 49 employees was examined). Before the training, the average sale was $100 with a standard deviation of $12. Check if the training helped at \(\alpha\) = 0.05.
Solution: This is similar to example 1. However, as the sample size is 49 and the population standard deviation is known, thus, the z test in inferential statistics is used.
\(\overline{x}\) = 150, \(\mu\) = 100, \(\sigma\) = 12, n = 49
\(H_{0}\) : \(\mu = 100\)
\(H_{1}\) : \(\mu > 100\)
t = \(\frac{\overline{x}-\mu}{\frac{\sigma}{\sqrt{n}}}\)
= 29.2
From the z table at \(\alpha\) = 0.05, the critical value is 1.645.
As 29.2 > 1.645 thus, the null hypothesis is rejected and it is concluded that the training was useful in increasing the average sales.
Answer: Reject the null hypothesis.
FAQs on Inferential Statistics
What is the Meaning of Inferential Statistics?
Inferential statistics is a field of statistics that uses several analytical tools to draw inferences and make generalizations about population data from sample data.
What are the Types of Inferential Statistics?
There are two main types of inferential statistics that use different methods to draw conclusions about the population data. These are regression analysis and hypothesis testing.
What are the Different Sampling Methods Used in Inferential Statistics?
It is necessary to choose the correct sample from the population so as to represent it accurately. Some important sampling strategies used in inferential statistics are simple random sampling, stratified sampling, cluster sampling, and systematic sampling.
What are the Different Types of Hypothesis Tests In Inferential Statistics?
The most frequently used hypothesis tests in inferential statistics are parametric tests such as z test, f test, ANOVA test, t test as well as certain non-parametric tests such as Wilcoxon signed-rank test.
What is Inferential Statistics Used For?
Inferential statistics is used for comparing the parameters of two or more samples and makes generalizations about the larger population based on these samples.
Is Z Score a Part of Inferential Statistics?
Yes, z score is a fundamental part of inferential statistics as it determines whether a sample is representative of its population or not. Furthermore, it is also indirectly used in the z test.
What is the Difference Between Descriptive and Inferential Statistics?
Descriptive statistics is used to describe the features of some known dataset whereas inferential statistics analyzes a sample in order to draw conclusions regarding the population.
visual curriculum