Problem 55225. Simpson's Paradox - Calculate correlation coefficients for groups of data
Simpson's Paradox is a statistical phenomenon in which a trend that holds within each group of a data set reverses when the groups are combined. In the example below, both groups have a negative correlation between x and y, but the combined data have a positive correlation.
Write a function that takes three vectors as input: x, y, and g. The vector g contains only the values 1 and 2. The function should return three outputs, the Pearson correlation coefficients for three groupings of the data: (1) all x and y, (2) the x and y elements for which g is 1, and (3) the x and y elements for which g is 2.
[c,c1,c2] = groupcorr(x,y,g)
c =
0.8800
c1 =
-0.6800
c2 =
-0.4396
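One possible way to approach this is sketched below; it is not the reference solution. It assumes x, y, and g are vectors of equal length and uses MATLAB's built-in corrcoef, which returns a 2-by-2 correlation matrix whose off-diagonal entry is the Pearson coefficient. The helper name pearson is hypothetical and introduced only for illustration.

```matlab
function [c, c1, c2] = groupcorr(x, y, g)
% GROUPCORR  Pearson correlation for all data and for each of two groups.
%   Sketch only: assumes x, y, g are equal-length vectors and that
%   g contains only the values 1 and 2.
    c  = pearson(x, y);                     % all points together
    c1 = pearson(x(g == 1), y(g == 1));     % points where g is 1
    c2 = pearson(x(g == 2), y(g == 2));     % points where g is 2
end

function r = pearson(x, y)
% Hypothetical helper: extract the off-diagonal entry of the
% 2-by-2 matrix returned by corrcoef.
    R = corrcoef(x(:), y(:));               % force column vectors
    r = R(1, 2);
end
```

Logical indexing with g == 1 and g == 2 selects each group without loops. The overall coefficient can have the opposite sign from both group coefficients when the group means are offset along a direction that opposes the within-group trend.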