Maths encyclopedia and lessons  
Search

Mathematics Encyclopedia and Lessons

 
     
 

Lessons

Popular
Subjects

algebra
arithmetic
calculus
equations
geometry
differential equations
trigonometry
number theory
probability theory
more
 

References

applied mathematics
mathematical games
mathematicians
more
 
 

Canonical correlation

In statistics, canonical correlation analysis, introduced by Harold Hotelling, is a way of making sense of cross-covariance matrices.

Given two column vectors X = (X1, ..., Xn)′ and Y = (Y1, ..., Ym)′ of random variables with finite second moments, one may define the cross-covariance cov(X, Y) to be the n×m matrix whose ij entry is the covariance cov(Xi, Yj). (Sometimes this is called simply the covariance between X and Y. But sometimes one speaks of the "covariance" of X, intending the n×n matrix of covariances between the pairs of scalar components of X. Sometimes the latter matrix is called the variance of X.)

Canonical correlation analysis seeks vectors a and b such that the real random variables aX and bY (where the row-vector a′ is the transpose of the column-vector a) maximize the correlation ρ(aX, bY ). The random vectors U = aX and V = bY are the first pair of canonical variables. Then one seeks vectors maximizing the same correlation subject to the constraint that they are to be uncorrelated with the first pair of canonical variables; this gives the second pair of canonical variables, etc.

01-04-2007 01:18:14
The contents of this article are licensed from Wikipedia.org
under the GNU Free Documentation License. How to see transparent copy