Correlation Coefficient Formula

Last Updated : 23 May, 2026

The correlation coefficient is a numerical measure that shows the strength and direction of the relationship between two variables. Its value lies between -1 and +1.

Types of Correlation

There are mainly 3 types of correlation.

pearson_correlation_coefficient
  • Positive Correlation: Both variables increase or decrease together.
  • Zero Correlation: No relationship between the variables.
  • Negative Correlation: One variable increases while the other decreases.

Types of Correlation Coefficient Formula

Different formulas are used to calculate the correlation coefficient depending on the data (sample or population).

The main types are shown below:

formula_for_correlation_coefficient

Solved Problems

Problem 1: Calculate the correlation coefficient from the following table:

SUBJECTAGE (X)GLUCOSE LEVEL (Y)
14298
22368
32273
44779
55088
66082

Solution:

Make a table from the given data and add three more columns of XY, X², and Y².

SUBJECT AGE (X)GLUCOSE LEVEL (Y)XY    X²
14298411617649604
2236815645294624
3227316064845329
44779371322096241
55088440025007744
66082498036006724
∑  244488203791108640266

∑xy = 20379

∑x = 244

∑y = 488

∑x² = 11086

∑y² = 40266

n = 6.

Put all the values in the Pearson's correlation coefficient formula:

 R= \frac{n(∑xy) - (∑x)(∑y)}{\sqrt{[n∑x²-(∑x)²][n∑y²-(∑y)²}}                                                            

R = 6(20379) - (244)(488) / √[6(11086)-(244)²][6(40266)-(488)² ]                                                 

R = 3202 / √[6980][3452]        

R = 3202/4972.238

R = 0.6439

It shows that the relationship between the variables of the data is a strong positive relationship.

Problem 2: Calculate the correlation coefficient from the following table:

SUBJECT   AGE (X)Weight (Y)
14099
22579
32269
45489

Solution:

Make a table from the given data and add three more columns of XY,  X², and Y².

SUBJECT AGE (X) Weight (Y)   XY X²  Y²
14099396016009801
2257919756256241
3226915184844761
45489480629167921
∑ 15133612259562528724

∑xy = 12258

∑x = 151

∑y = 336

∑x² = 5625

∑y² 28724

n = 4

Put all the values in the Pearson's correlation coefficient formula:

 R= \frac{n(∑xy) - (∑x)(∑y)}{\sqrt{[n∑x²-(∑x)²][n∑y²-(∑y)²}}             

R = 4(12258) - (151)(336) / √[4(5625)-(151)²][4(28724)-(336)²]        

R = -1704 / √[-301][-2000]        

R=-1704/775.886

R=-2.1961

It shows that the relationship between the variables of the data is a very strong negative relationship.

Problem 3:  Calculate the correlation coefficient for the following data:

X = 7,9,14 and Y = 17,19,21

Solution:

Given variables are,

X = 7,9,14

and,

Y = 17,19,21

To, find the correlation coefficient of the following variables Firstly a table is to be constructed as follows, to get the values required in the formula.

XYXY X² Y²
7171194936
91917181361
1421294196441
∑ 30∑ 57∑ 584∑ 326∑ 838

∑xy = 584

∑x = 30

∑y = 57

∑x² = 326

∑y² = 838

n = 3

Put all the values in the Pearson's correlation coefficient formula:

 R= \frac{n(∑xy) - (∑x)(∑y)}{\sqrt{[n∑x²-(∑x)²][n∑y²-(∑y)²}}              

R = 3(584) - (30)(57) / √[3(326)-(30)²][3(838)-(57)²]        

R = 42 / √[78][-735]        

R = 42/-239.43

R = -0.1754

It shows that the relationship between the variables of the data is negligible relationship

Problem 4: Calculate the correlation coefficient for the following data:

X = 21, 31, 25, 40, 47, 38 and Y = 70,55,60,78,66,80

Solution:

Given variables are,

X = 21,31,25,40,47,38

And,

Y = 70,55,60,78,66,80

To, find the correlation coefficient of the following variables Firstly a table is to be constructed as follows, to get the values required in the formula.

XYXY  X²   Y²
217014704414900
315517059613025
256015006253600
4078312016006094
4766310222094356
3880304014446400
∑202∑409∑13937∑7280∑28265

 ∑xy = 13937

∑x = 202

∑y = 409

∑x² = 7280

∑y² = 28265

n = 6

Put all the values in the Pearson's correlation coefficient formula:

 R= \frac{n(∑xy) - (∑x)(∑y)}{\sqrt{[n∑x²-(∑x)²][n∑y²-(∑y)²}}             

R = 6(13937) - (202)(409) / √[6(7280) - (202)²][6(28265) - (409)²]        

R = 1004 /√[2876][2909]        

R = 1004 / 2892.452938

R = 0.3471

It shows that the relationship between the variables of the data is a moderate positive relationship.

Problem 5: Calculate the correlation coefficient for the following data?

X = 5 ,9 ,14, 16 and Y = 6, 10, 16, 20 .

Solution:

Given variables are,

X = 5 ,9 ,14, 16

And

Y = 6, 10, 16, 20.

To, find the correlation coefficient of the following variables Firstly a table is to be constructed as follows, to get the values required in the formula add all the values in the columns to get the values used in the formula

XYXY
56302536
9109081100
1416224196256
1620320256400
∑44∑52∑664∑558∑792

∑xy = 664

∑x = 44

∑y = 52

∑x² = 558

∑y² = 792

n = 4

Put all the values in the Pearson's correlation coefficient formula:

 R= n(∑xy) - (∑x)(∑y) / √[n∑x²-(∑x)²][n∑y²-(∑y)² 

R = 4(664) - (44)(52) / √[4(558) - (44)²][4(792) - (52)²]

R = 368 / √[296][464]

R = 368/370.599

R = 0.9930

It shows that the relationship between the variables of the data is a very strong positive relationship.

Comment

Explore