1 added
2 removed
Original
2026-01-01
Modified
2026-02-28
1
-
<p>1242 Learners</p>
1
+
<p>1268 Learners</p>
2
<p>Last updated on<strong>November 26, 2025</strong></p>
2
<p>Last updated on<strong>November 26, 2025</strong></p>
3
<p>Correlation tells us how strongly two variables move together from -1 to +1, while regression helps to predict one variable using the other. Together, they let us explore and forecast the relationships, like how a child’s study time might engagingly influence test scores. In this article, we will learn in detail.</p>
3
<p>Correlation tells us how strongly two variables move together from -1 to +1, while regression helps to predict one variable using the other. Together, they let us explore and forecast the relationships, like how a child’s study time might engagingly influence test scores. In this article, we will learn in detail.</p>
4
<h2>What is Correlation?</h2>
4
<h2>What is Correlation?</h2>
5
<p>Correlation explains how two<a>variables</a>are related and whether they change in the same or opposite direction. This link is measured by the correlation<a>coefficient</a>from -1 to +1.</p>
5
<p>Correlation explains how two<a>variables</a>are related and whether they change in the same or opposite direction. This link is measured by the correlation<a>coefficient</a>from -1 to +1.</p>
6
<p>Here is the meaning<a>of</a>the values:</p>
6
<p>Here is the meaning<a>of</a>the values:</p>
7
<ul><li>Positive Correlation: Both variables rise or fall together. </li>
7
<ul><li>Positive Correlation: Both variables rise or fall together. </li>
8
<li>Negative Correlation: One goes up while the other goes down. </li>
8
<li>Negative Correlation: One goes up while the other goes down. </li>
9
<li>No Correlation: The variables show there is no clear relationship. </li>
9
<li>No Correlation: The variables show there is no clear relationship. </li>
10
</ul><p><strong>For example:</strong></p>
10
</ul><p><strong>For example:</strong></p>
11
<p>Imagine you are observing your child’s daily study hours and their marks in tests:</p>
11
<p>Imagine you are observing your child’s daily study hours and their marks in tests:</p>
12
<ul><li>If students who study more tend to score higher, that shows a positive correlation. </li>
12
<ul><li>If students who study more tend to score higher, that shows a positive correlation. </li>
13
<li>If spending more time on video games usually results in lower marks, that shows the negative correlation. </li>
13
<li>If spending more time on video games usually results in lower marks, that shows the negative correlation. </li>
14
<li>If the shoe size has no pattern or connection with marks, that shows no correlation. </li>
14
<li>If the shoe size has no pattern or connection with marks, that shows no correlation. </li>
15
</ul><p>This helps us understand how two things are related and how they change together.</p>
15
</ul><p>This helps us understand how two things are related and how they change together.</p>
16
<h2>Types of Correlation Coefficients</h2>
16
<h2>Types of Correlation Coefficients</h2>
17
<p>There are many types of correlation<a>coefficients</a>. Let us now see the most commonly used types of correlation coefficients:</p>
17
<p>There are many types of correlation<a>coefficients</a>. Let us now see the most commonly used types of correlation coefficients:</p>
18
<p><strong>Pearson’s Correlation Coefficient: </strong>This type of correlation coefficient measures the linear relationship between two continuous variables. The coefficient of values ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation), with 0 being no correlation. This type of correlation coefficient assumes all the variables are distributed normally.</p>
18
<p><strong>Pearson’s Correlation Coefficient: </strong>This type of correlation coefficient measures the linear relationship between two continuous variables. The coefficient of values ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation), with 0 being no correlation. This type of correlation coefficient assumes all the variables are distributed normally.</p>
19
<p><strong>Spearman’s Rank Coefficient:</strong> The Spearman’s rank coefficient measures the strength and the direction of a relationship between two variables. We use this type for ranked<a>data</a>or when the assumption of normality is violated. It is based on ranking rather than the actual values.</p>
19
<p><strong>Spearman’s Rank Coefficient:</strong> The Spearman’s rank coefficient measures the strength and the direction of a relationship between two variables. We use this type for ranked<a>data</a>or when the assumption of normality is violated. It is based on ranking rather than the actual values.</p>
20
<p><strong>Kendall’s Tau:</strong> This type of correlation coefficient measures the ranked association between two variables. We use it when the data has ties or small sample sizes. It compares a<a>number</a>of concordant and discordant pairs in ranking.</p>
20
<p><strong>Kendall’s Tau:</strong> This type of correlation coefficient measures the ranked association between two variables. We use it when the data has ties or small sample sizes. It compares a<a>number</a>of concordant and discordant pairs in ranking.</p>
21
<h2>What is Regression?</h2>
21
<h2>What is Regression?</h2>
22
<p>Regression is a statistical tool that helps us understand how one variable changes as another changes. It also allows us to predict future values of a dependent variable using information from one or more independent variables.</p>
22
<p>Regression is a statistical tool that helps us understand how one variable changes as another changes. It also allows us to predict future values of a dependent variable using information from one or more independent variables.</p>
23
<p><strong>Here is an example:</strong></p>
23
<p><strong>Here is an example:</strong></p>
24
<p>Imagine you want to estimate your child’s exam score based on their study hours:</p>
24
<p>Imagine you want to estimate your child’s exam score based on their study hours:</p>
25
<ul><li>Dependent Variable: Exam score (because it depends on study time) </li>
25
<ul><li>Dependent Variable: Exam score (because it depends on study time) </li>
26
<li>Independent Variable: Study hours </li>
26
<li>Independent Variable: Study hours </li>
27
</ul><p>If regression shows that more study hours usually lead to higher marks, you can use this pattern to predict your child’s score based on how much they study. Regression turns simple observations into meaningful, real-life predictions.</p>
27
</ul><p>If regression shows that more study hours usually lead to higher marks, you can use this pattern to predict your child’s score based on how much they study. Regression turns simple observations into meaningful, real-life predictions.</p>
28
<h3>Explore Our Programs</h3>
28
<h3>Explore Our Programs</h3>
29
-
<p>No Courses Available</p>
30
<h2>Types of Regression</h2>
29
<h2>Types of Regression</h2>
31
<p>There are several types of regression. Let us see some main types of regression mentioned below:</p>
30
<p>There are several types of regression. Let us see some main types of regression mentioned below:</p>
32
<p><strong>Linear Regression: </strong>We use this type of regression to model the relationship between one independent and one dependent variable using a straight line. It looks at two variables and tries to find if one variable makes a contribution to the other through putting it in a place on a<a>linear equation</a>.</p>
31
<p><strong>Linear Regression: </strong>We use this type of regression to model the relationship between one independent and one dependent variable using a straight line. It looks at two variables and tries to find if one variable makes a contribution to the other through putting it in a place on a<a>linear equation</a>.</p>
33
<p><strong>Multiple Linear Regression:</strong> This type of regression looks at the effect of two or more independent variables on one dependent variable. This extends the linear regression to<a>multiple</a>independent variables.</p>
32
<p><strong>Multiple Linear Regression:</strong> This type of regression looks at the effect of two or more independent variables on one dependent variable. This extends the linear regression to<a>multiple</a>independent variables.</p>
34
<p><strong>Polynomial Regression:</strong> This type of regression shows the non-linear relationships by<a>adding polynomial</a>terms like squared, cubed, etc. It plans the connection between the dependent and independent variables by a polynomial<a>function</a>.</p>
33
<p><strong>Polynomial Regression:</strong> This type of regression shows the non-linear relationships by<a>adding polynomial</a>terms like squared, cubed, etc. It plans the connection between the dependent and independent variables by a polynomial<a>function</a>.</p>
35
<p><strong>Logistic Regression:</strong>We use this type of regression for binary<a>classification</a>like yes/no or 0/1 outcomes. We also use a logistic function instead of a straight line. Likewise, we apply this type of regression where the dependent variable is qualitative.</p>
34
<p><strong>Logistic Regression:</strong>We use this type of regression for binary<a>classification</a>like yes/no or 0/1 outcomes. We also use a logistic function instead of a straight line. Likewise, we apply this type of regression where the dependent variable is qualitative.</p>
36
<h2>What is the Formula for Correlation and Regression</h2>
35
<h2>What is the Formula for Correlation and Regression</h2>
37
<p>The<a>formulas</a>and the explanation about each formula of correlation and regression, respectively, are mentioned below:</p>
36
<p>The<a>formulas</a>and the explanation about each formula of correlation and regression, respectively, are mentioned below:</p>
38
<p><strong>Correlation Formula:</strong> The formula most commonly used for correlation is the Pearson’s correlation coefficient formula:</p>
37
<p><strong>Correlation Formula:</strong> The formula most commonly used for correlation is the Pearson’s correlation coefficient formula:</p>
39
<p> r = \(\frac{\sum (X - \bar{X})(Y - \bar{Y})} {\sqrt{\sum (X - \bar{X})^{2} \; \sum (Y - \bar{Y})^{2}}}\)</p>
38
<p> r = \(\frac{\sum (X - \bar{X})(Y - \bar{Y})} {\sqrt{\sum (X - \bar{X})^{2} \; \sum (Y - \bar{Y})^{2}}}\)</p>
40
<p>Where,</p>
39
<p>Where,</p>
41
<p>r = Pearson’s correlation coefficient (ranges for -1 to +1)</p>
40
<p>r = Pearson’s correlation coefficient (ranges for -1 to +1)</p>
42
<p>X, Y = Individual data point for variables X and Y</p>
41
<p>X, Y = Individual data point for variables X and Y</p>
43
<p>X, Y = Mean (<a>average</a>) of X and Y. = Summation Symbol.</p>
42
<p>X, Y = Mean (<a>average</a>) of X and Y. = Summation Symbol.</p>
44
<p>The<a>numerator</a>shows the covariance between X and Y.</p>
43
<p>The<a>numerator</a>shows the covariance between X and Y.</p>
45
<p>The<a>denominator</a>shows the<a>product</a>of the standard deviations of X and Y.</p>
44
<p>The<a>denominator</a>shows the<a>product</a>of the standard deviations of X and Y.</p>
46
<p><strong>Regression Formula:</strong>The most commonly used regression formula is the linear regression formula, which is mentioned below:</p>
45
<p><strong>Regression Formula:</strong>The most commonly used regression formula is the linear regression formula, which is mentioned below:</p>
47
<p> Y = a + bX</p>
46
<p> Y = a + bX</p>
48
<p> Where,</p>
47
<p> Where,</p>
49
<p> a = Intercept</p>
48
<p> a = Intercept</p>
50
<p> b = Slope</p>
49
<p> b = Slope</p>
51
<p> Y = Dependent variable</p>
50
<p> Y = Dependent variable</p>
52
<p> X = Independent Variable.</p>
51
<p> X = Independent Variable.</p>
53
<h2>Difference Between Correlation and Regression</h2>
52
<h2>Difference Between Correlation and Regression</h2>
54
<p>There are a lot of differences between correlation and regression. Let us see the differences of correlation and regression in the given table below:</p>
53
<p>There are a lot of differences between correlation and regression. Let us see the differences of correlation and regression in the given table below:</p>
55
<strong>Correlation</strong><strong>Regression</strong>It measures the strength and direction of the relationship between two variables. It models the relationship between independent and dependent variables, which allows for predictions The range of the values lie in between -1 and +1. Regression has no fixed range. The purpose is to show how strongly two variables are related. It establishes a cause and effect relationship and predicts one variable based on the other. We use it in<a>statistics</a>, economics, finance, psychology, and research We use it for forecasting, data modelling, risk analysis and machine learning.<h2>Correlation and Regression Analysis</h2>
54
<strong>Correlation</strong><strong>Regression</strong>It measures the strength and direction of the relationship between two variables. It models the relationship between independent and dependent variables, which allows for predictions The range of the values lie in between -1 and +1. Regression has no fixed range. The purpose is to show how strongly two variables are related. It establishes a cause and effect relationship and predicts one variable based on the other. We use it in<a>statistics</a>, economics, finance, psychology, and research We use it for forecasting, data modelling, risk analysis and machine learning.<h2>Correlation and Regression Analysis</h2>
56
<p>Correlation Analysis</p>
55
<p>Correlation Analysis</p>
57
<p>Correlation analysis helps us to understand whether two variables are connected and how strong that connection is. We can use a correlation coefficient, such as Pearson’s correlation, to obtain a value between -1 and +1 that indicates both the direction and strength of the relationship. A<a>scatter plot</a>is often used to visually display how two variables (x and y) move together. </p>
56
<p>Correlation analysis helps us to understand whether two variables are connected and how strong that connection is. We can use a correlation coefficient, such as Pearson’s correlation, to obtain a value between -1 and +1 that indicates both the direction and strength of the relationship. A<a>scatter plot</a>is often used to visually display how two variables (x and y) move together. </p>
58
<p>Regression Analysis</p>
57
<p>Regression Analysis</p>
59
<p>Regression analysis goes beyond the correlation. It explains the exact relationship between two variables and helps us to predict the value of one variable using the other. In linear regression, we fit a straight line to the data points. This line clearly shows how x and y are related when plotted on a graph.</p>
58
<p>Regression analysis goes beyond the correlation. It explains the exact relationship between two variables and helps us to predict the value of one variable using the other. In linear regression, we fit a straight line to the data points. This line clearly shows how x and y are related when plotted on a graph.</p>
60
<h2>Correlation and Regression in Statistics</h2>
59
<h2>Correlation and Regression in Statistics</h2>
61
<p>In statistics, the strength and direction of the relationship between two variables are measured using “r”, the correlation coefficient. This value shows how closely the variables move together. When the relationship is not straight or linear, more advanced techniques are used to capture and represent the curved pattern between the variables.</p>
60
<p>In statistics, the strength and direction of the relationship between two variables are measured using “r”, the correlation coefficient. This value shows how closely the variables move together. When the relationship is not straight or linear, more advanced techniques are used to capture and represent the curved pattern between the variables.</p>
62
<h2>Tips and Tricks to Master Correlation and Regression</h2>
61
<h2>Tips and Tricks to Master Correlation and Regression</h2>
63
<p>Correlation and regression are two complex mathematical concepts and to get a better understanding of them, some tips and tricks are mentioned below: </p>
62
<p>Correlation and regression are two complex mathematical concepts and to get a better understanding of them, some tips and tricks are mentioned below: </p>
64
<ul><li>Understand that correlation measures the strength and direction of a relationship between two variables, while regression predicts one variable based on another. </li>
63
<ul><li>Understand that correlation measures the strength and direction of a relationship between two variables, while regression predicts one variable based on another. </li>
65
<li>Memorize key formulas such as the correlation coefficient and the regression line<a>equation</a>to perform calculations quickly and accurately. </li>
64
<li>Memorize key formulas such as the correlation coefficient and the regression line<a>equation</a>to perform calculations quickly and accurately. </li>
66
<li>Plot a scatter diagram first to visualize data, identify trends, and detect any outliers before performing calculations. </li>
65
<li>Plot a scatter diagram first to visualize data, identify trends, and detect any outliers before performing calculations. </li>
67
<li>Identify the type of correlation, positive, negative, or zero so you can interpret results correctly. </li>
66
<li>Identify the type of correlation, positive, negative, or zero so you can interpret results correctly. </li>
68
<li>Focus on interpreting results, not just calculating them, to translate statistical outcomes into real-world insights. </li>
67
<li>Focus on interpreting results, not just calculating them, to translate statistical outcomes into real-world insights. </li>
69
<li>Use the scatter plots to show how two variables move together. Visual learning helps the children clearly see positive, negative, or no correlation. </li>
68
<li>Use the scatter plots to show how two variables move together. Visual learning helps the children clearly see positive, negative, or no correlation. </li>
70
<li>Allow students to explore the data using a correlation and regression<a>calculator</a>. Entering numbers and instantly seeing graphs and results makes learning interactive and engaging. </li>
69
<li>Allow students to explore the data using a correlation and regression<a>calculator</a>. Entering numbers and instantly seeing graphs and results makes learning interactive and engaging. </li>
71
<li>Explain correlation and regression using everyday situations, like how study time affects marks or how practice improves performance. Real examples make learning easier for children. </li>
70
<li>Explain correlation and regression using everyday situations, like how study time affects marks or how practice improves performance. Real examples make learning easier for children. </li>
72
</ul><h2>Common mistakes and How to Avoid Them on Correlation and Regression</h2>
71
</ul><h2>Common mistakes and How to Avoid Them on Correlation and Regression</h2>
73
<p>Students tend to make mistakes when they solve problems related to correlation and regression. Let us now see the common mistakes they make and the solutions to avoid them:</p>
72
<p>Students tend to make mistakes when they solve problems related to correlation and regression. Let us now see the common mistakes they make and the solutions to avoid them:</p>
74
<h2>Real Life Applications on Correlation and Regression</h2>
73
<h2>Real Life Applications on Correlation and Regression</h2>
75
<p>There are a lot of applications of correlation and regression. Let us now see the different uses of correlation and regression in different fields:</p>
74
<p>There are a lot of applications of correlation and regression. Let us now see the different uses of correlation and regression in different fields:</p>
76
<p><strong>Economics and Finance:</strong> We use correlation in economics and finance, where it is used to analyze the relationship between economic indicators. We use regression in economics and finance to predict the future trends, use GDP growth to forecast employment rates.</p>
75
<p><strong>Economics and Finance:</strong> We use correlation in economics and finance, where it is used to analyze the relationship between economic indicators. We use regression in economics and finance to predict the future trends, use GDP growth to forecast employment rates.</p>
77
<p><strong>Stock Market:</strong> We use correlation and regression in stock markets, where correlation is used for the study of the correlation between stock prices. Regression is used to forecast the stock prices.</p>
76
<p><strong>Stock Market:</strong> We use correlation and regression in stock markets, where correlation is used for the study of the correlation between stock prices. Regression is used to forecast the stock prices.</p>
78
<p><strong>Healthcare: </strong>We use correlation and regression in stock markets, where correlation is used to find the risk<a>factors</a>for diseases. We use regression to make medical predictions. </p>
77
<p><strong>Healthcare: </strong>We use correlation and regression in stock markets, where correlation is used to find the risk<a>factors</a>for diseases. We use regression to make medical predictions. </p>
79
<p><strong>Agriculture:</strong>In agriculture, correlation is used to find the relationship between rainfall and crop yield. Regression is applied to predict future crop production based on soil fertility, fertilizer use, and weather conditions.</p>
78
<p><strong>Agriculture:</strong>In agriculture, correlation is used to find the relationship between rainfall and crop yield. Regression is applied to predict future crop production based on soil fertility, fertilizer use, and weather conditions.</p>
80
<p><strong>Marketing and Business:</strong>Companies use correlation to identify the relationship between advertising expenditure and sales revenue. Regression analysis helps businesses forecast future sales based on factors like price, promotions, and customer demand.</p>
79
<p><strong>Marketing and Business:</strong>Companies use correlation to identify the relationship between advertising expenditure and sales revenue. Regression analysis helps businesses forecast future sales based on factors like price, promotions, and customer demand.</p>
81
<h3>Problem 1</h3>
80
<h3>Problem 1</h3>
82
<p>Compute the Pearson correlation coefficient for the paired data: X = [1, 2, 3, 4, 5] and Y = [2, 4, 5, 4, 5]</p>
81
<p>Compute the Pearson correlation coefficient for the paired data: X = [1, 2, 3, 4, 5] and Y = [2, 4, 5, 4, 5]</p>
83
<p>Okay, lets begin</p>
82
<p>Okay, lets begin</p>
84
<p> r 0.78</p>
83
<p> r 0.78</p>
85
<h3>Explanation</h3>
84
<h3>Explanation</h3>
86
<p>Compute the means:</p>
85
<p>Compute the means:</p>
87
<p>X = \(\frac{1 + 2 + 3 + 4 + 5}{5}\) = 3.</p>
86
<p>X = \(\frac{1 + 2 + 3 + 4 + 5}{5}\) = 3.</p>
88
<p>Y = \(\frac{2 + 4 + 5 + 4 + 5}{5}\) = 4</p>
87
<p>Y = \(\frac{2 + 4 + 5 + 4 + 5}{5}\) = 4</p>
89
<p>Compute the deviations and their products:</p>
88
<p>Compute the deviations and their products:</p>
90
X Y X - X Y - Y (X - X)(Y - Y) (X - X)2 (Y - Y)2 1 2 -2 -2 4 4 4 2 4 -1 0 0 1 0 3 5 0 1 0 0 1 4 4 1 0 0 1 0 5 5 2 1 2 4 1<p>Sum the Products and squares:</p>
89
X Y X - X Y - Y (X - X)(Y - Y) (X - X)2 (Y - Y)2 1 2 -2 -2 4 4 4 2 4 -1 0 0 1 0 3 5 0 1 0 0 1 4 4 1 0 0 1 0 5 5 2 1 2 4 1<p>Sum the Products and squares:</p>
91
<p>(X - X)(Y - Y) = \(4 + 0 + 0 + 0 + 2 = 6\)</p>
90
<p>(X - X)(Y - Y) = \(4 + 0 + 0 + 0 + 2 = 6\)</p>
92
<p>(X - X)2 = \(4 + 1 + 0 + 1+ 4 = 10\)</p>
91
<p>(X - X)2 = \(4 + 1 + 0 + 1+ 4 = 10\)</p>
93
<p>(Y - Y)2 = \(4 + 0 + 1 + 0 + 1 = 6\)</p>
92
<p>(Y - Y)2 = \(4 + 0 + 1 + 0 + 1 = 6\)</p>
94
<p>Apply the Pearson formula: </p>
93
<p>Apply the Pearson formula: </p>
95
<p> r \(= \frac{6}{10} \times 6 = 6.60 = \frac{6}{7.746} = 0.775\)</p>
94
<p> r \(= \frac{6}{10} \times 6 = 6.60 = \frac{6}{7.746} = 0.775\)</p>
96
<p>Well explained 👍</p>
95
<p>Well explained 👍</p>
97
<h3>Problem 2</h3>
96
<h3>Problem 2</h3>
98
<p>For the ranked data X = [1, 2, 3, 4, 5] and Y = [2, 3, 5, 4, 1], compute Spearman’s rank correlation coefficient.</p>
97
<p>For the ranked data X = [1, 2, 3, 4, 5] and Y = [2, 3, 5, 4, 1], compute Spearman’s rank correlation coefficient.</p>
99
<p>Okay, lets begin</p>
98
<p>Okay, lets begin</p>
100
<p>ρ = 0.9</p>
99
<p>ρ = 0.9</p>
101
<h3>Explanation</h3>
100
<h3>Explanation</h3>
102
<p>Assign Ranks:</p>
101
<p>Assign Ranks:</p>
103
<p>Y ranks:</p>
102
<p>Y ranks:</p>
104
<p>Y = 2 → Rank 1</p>
103
<p>Y = 2 → Rank 1</p>
105
<p>Y = 3 → Rank 2</p>
104
<p>Y = 3 → Rank 2</p>
106
<p>Y = 5 → Rank 3</p>
105
<p>Y = 5 → Rank 3</p>
107
<p>Y = 4 → Rank 4</p>
106
<p>Y = 4 → Rank 4</p>
108
<p>Y = 1 → Rank 5</p>
107
<p>Y = 1 → Rank 5</p>
109
<p>Calculate the differences between the ranks of X and Y</p>
108
<p>Calculate the differences between the ranks of X and Y</p>
110
Observation Rank X Rank Y d = Rank X - Rank Y d2 1 1 1 0 0 2 2 2 0 0 3 3 4 -1 1 4 4 3 1 1 5 5 5 0 0<p>Sum the squared differences:</p>
109
Observation Rank X Rank Y d = Rank X - Rank Y d2 1 1 1 0 0 2 2 2 0 0 3 3 4 -1 1 4 4 3 1 1 5 5 5 0 0<p>Sum the squared differences:</p>
111
<p>\(\sum d^{2} = 0 + 0 + 1 + 1 + 0 = 2\)</p>
110
<p>\(\sum d^{2} = 0 + 0 + 1 + 1 + 0 = 2\)</p>
112
<p>Applying the Spearman’s formula:</p>
111
<p>Applying the Spearman’s formula:</p>
113
<p>\(\rho = 1 - \frac{6\sum d^{2}}{n(n^{2} - 1)} = 1 - \frac{6 \times 2}{5(5^{2} - 1)} = 1 - \frac{12}{5(24)} = 1 - \frac{12}{120} = 1 - 0.1 = 0.9\) </p>
112
<p>\(\rho = 1 - \frac{6\sum d^{2}}{n(n^{2} - 1)} = 1 - \frac{6 \times 2}{5(5^{2} - 1)} = 1 - \frac{12}{5(24)} = 1 - \frac{12}{120} = 1 - 0.1 = 0.9\) </p>
114
<p>Well explained 👍</p>
113
<p>Well explained 👍</p>
115
<h3>Problem 3</h3>
114
<h3>Problem 3</h3>
116
<p>For the dataset X = [2, 4, 6, 8, 10] and Y = [20, 16, 12, 8, 4], find the Pearson correlation coefficient.</p>
115
<p>For the dataset X = [2, 4, 6, 8, 10] and Y = [20, 16, 12, 8, 4], find the Pearson correlation coefficient.</p>
117
<p>Okay, lets begin</p>
116
<p>Okay, lets begin</p>
118
<p>r = -1</p>
117
<p>r = -1</p>
119
<h3>Explanation</h3>
118
<h3>Explanation</h3>
120
<p>Means: </p>
119
<p>Means: </p>
121
<p>X = \(\frac{2 + 4 + 6 + 8 + 10}{5} = 6\)</p>
120
<p>X = \(\frac{2 + 4 + 6 + 8 + 10}{5} = 6\)</p>
122
<p>Y = \(\frac{20 + 16 + 12 + 8 + 4}{5} = 12\)</p>
121
<p>Y = \(\frac{20 + 16 + 12 + 8 + 4}{5} = 12\)</p>
123
<p>Deviations and products:</p>
122
<p>Deviations and products:</p>
124
X Y X - X Y - Y Product 2 20 -4 8 -32 4 16 -2 4 -8 6 12 0 0 0 8 8 2 -4 -8 10 4 4 -8 -32<p>∑(X - X)(Y - Y) =\( -32 - 8 + 0 - 8 - 32 = -80\)</p>
123
X Y X - X Y - Y Product 2 20 -4 8 -32 4 16 -2 4 -8 6 12 0 0 0 8 8 2 -4 -8 10 4 4 -8 -32<p>∑(X - X)(Y - Y) =\( -32 - 8 + 0 - 8 - 32 = -80\)</p>
125
<p>Sums of Squares:</p>
124
<p>Sums of Squares:</p>
126
<p>\(\sum (X - \bar{X})^{2} = (-4)^{2} + (-2)^{2} + 0^{2} + 2^{2} + 4^{2} = 16 + 4 + 0 + 4 + 16 = 40\)</p>
125
<p>\(\sum (X - \bar{X})^{2} = (-4)^{2} + (-2)^{2} + 0^{2} + 2^{2} + 4^{2} = 16 + 4 + 0 + 4 + 16 = 40\)</p>
127
<p>\(\sum (Y - \bar{Y})^{2} = 8^{2} + 4^{2} + 0^{2} + (-4)^{2} + (-8)^{2} = 64 + 16 + 0 + 16 + 64 = 160\)</p>
126
<p>\(\sum (Y - \bar{Y})^{2} = 8^{2} + 4^{2} + 0^{2} + (-4)^{2} + (-8)^{2} = 64 + 16 + 0 + 16 + 64 = 160\)</p>
128
<p>Pearson’s r:</p>
127
<p>Pearson’s r:</p>
129
<p>\(r = \frac{-8040}{160} = -50.25 \)</p>
128
<p>\(r = \frac{-8040}{160} = -50.25 \)</p>
130
<p>Well explained 👍</p>
129
<p>Well explained 👍</p>
131
<h3>Problem 4</h3>
130
<h3>Problem 4</h3>
132
<p>Using the dataset X = [1, 2, 3, 4, 5] and Y = [2, 4, 6, 8, 10], determine the regression line and predict Y when X = 7.</p>
131
<p>Using the dataset X = [1, 2, 3, 4, 5] and Y = [2, 4, 6, 8, 10], determine the regression line and predict Y when X = 7.</p>
133
<p>Okay, lets begin</p>
132
<p>Okay, lets begin</p>
134
<p>Y = 2X and predicted Y(7) = 14</p>
133
<p>Y = 2X and predicted Y(7) = 14</p>
135
<h3>Explanation</h3>
134
<h3>Explanation</h3>
136
<p>Find the Means:</p>
135
<p>Find the Means:</p>
137
<p>X = 3 and Y = 6</p>
136
<p>X = 3 and Y = 6</p>
138
<p>Compute the slope:</p>
137
<p>Compute the slope:</p>
139
<p>b = Change in Y/Change in X = \(\frac{10 - 2}{5 - 1} = \frac{8}{4} = 2\)</p>
138
<p>b = Change in Y/Change in X = \(\frac{10 - 2}{5 - 1} = \frac{8}{4} = 2\)</p>
140
<p>Intercept: </p>
139
<p>Intercept: </p>
141
<p>\(a = Y - bX = 6 - 2 \times 3 = 0\)</p>
140
<p>\(a = Y - bX = 6 - 2 \times 3 = 0\)</p>
142
<p>Prediction:</p>
141
<p>Prediction:</p>
143
<p>Regression equation: Y = 2X</p>
142
<p>Regression equation: Y = 2X</p>
144
<p>For X = 7: Y = 2 x 7 = 14.</p>
143
<p>For X = 7: Y = 2 x 7 = 14.</p>
145
<p>Well explained 👍</p>
144
<p>Well explained 👍</p>
146
<h3>Problem 5</h3>
145
<h3>Problem 5</h3>
147
<p>Given the regression equation, Y = 5 + 3X, interpret the slope and intercept.</p>
146
<p>Given the regression equation, Y = 5 + 3X, interpret the slope and intercept.</p>
148
<p>Okay, lets begin</p>
147
<p>Okay, lets begin</p>
149
<p>Intercept = 5 and Slope = 3</p>
148
<p>Intercept = 5 and Slope = 3</p>
150
<h3>Explanation</h3>
149
<h3>Explanation</h3>
151
<p>Intercept:</p>
150
<p>Intercept:</p>
152
<p>When X = 0, the predicted Y is 5. This is the baseline value.</p>
151
<p>When X = 0, the predicted Y is 5. This is the baseline value.</p>
153
<p>Slope: </p>
152
<p>Slope: </p>
154
<p>For each unit increase in X, Y increases by 3 units.</p>
153
<p>For each unit increase in X, Y increases by 3 units.</p>
155
<p>Well explained 👍</p>
154
<p>Well explained 👍</p>
156
<h2>FAQs on Correlation and Coefficient</h2>
155
<h2>FAQs on Correlation and Coefficient</h2>
157
<h3>1.What is correlation?</h3>
156
<h3>1.What is correlation?</h3>
158
<p>Correlation is a statistics tool which we use to measure the strength and direction of any two or more given variables.</p>
157
<p>Correlation is a statistics tool which we use to measure the strength and direction of any two or more given variables.</p>
159
<h3>2.What is regression?</h3>
158
<h3>2.What is regression?</h3>
160
<p>Regression is a statistical methodology that is used to model a relationship between an independent variable and a dependent variable.</p>
159
<p>Regression is a statistical methodology that is used to model a relationship between an independent variable and a dependent variable.</p>
161
<h3>3.What are the assumptions behind Pearson’s correlation?</h3>
160
<h3>3.What are the assumptions behind Pearson’s correlation?</h3>
162
<p>The assumptions behind the Pearson correlation are the relationship between the variables is linear, both the variables are normally distributed, and the data is free of any significant outliers.</p>
161
<p>The assumptions behind the Pearson correlation are the relationship between the variables is linear, both the variables are normally distributed, and the data is free of any significant outliers.</p>
163
<h3>4.What is simple linear regression?</h3>
162
<h3>4.What is simple linear regression?</h3>
164
<p>Simple linear regression is a type of regression which has one independent variable and is expressed in the form of; Y = a + bX + ε.</p>
163
<p>Simple linear regression is a type of regression which has one independent variable and is expressed in the form of; Y = a + bX + ε.</p>
165
<h3>5.What are the common mistakes that occur when interpreting correlation?</h3>
164
<h3>5.What are the common mistakes that occur when interpreting correlation?</h3>
166
<p>Some of the common mistakes that occur while interpreting correlation are confusing correlation with causation, overlooking non-linear relationships or the influence of outliers.</p>
165
<p>Some of the common mistakes that occur while interpreting correlation are confusing correlation with causation, overlooking non-linear relationships or the influence of outliers.</p>
167
<h2>Jaipreet Kour Wazir</h2>
166
<h2>Jaipreet Kour Wazir</h2>
168
<h3>About the Author</h3>
167
<h3>About the Author</h3>
169
<p>Jaipreet Kour Wazir is a data wizard with over 5 years of expertise in simplifying complex data concepts. From crunching numbers to crafting insightful visualizations, she turns raw data into compelling stories. Her journey from analytics to education ref</p>
168
<p>Jaipreet Kour Wazir is a data wizard with over 5 years of expertise in simplifying complex data concepts. From crunching numbers to crafting insightful visualizations, she turns raw data into compelling stories. Her journey from analytics to education ref</p>
170
<h3>Fun Fact</h3>
169
<h3>Fun Fact</h3>
171
<p>: She compares datasets to puzzle games-the more you play with them, the clearer the picture becomes!</p>
170
<p>: She compares datasets to puzzle games-the more you play with them, the clearer the picture becomes!</p>