Pearson Correlation and the Dot Product

Given two vectors X and Y, where

The Dot Product is defined as:

The Pearson Correlation (PCor) is generally defined as:

Pearson's Correlation can be directly related to the Dot Product ONLY IF every vector is first centered to 0, by subtracting its mean:

After that transformation it is possible to write:

Therefore, after the transformation, the Pearson Correlation can be regarded as a weighted Dot Product.
The weighting enables every vector couple to have the same "leverage", notwithstanding the different norm (which can be regarded as the "magnitude").

As the Pearson Correlation requires the mean and standard deviation estimation, it is sensitive to a limited number of points.
In the following paper the minimal number of points required to satisfactorily estimate Pearson's Correlation is estimated as 5.
Accuracy of Correlation Coefficient with Limited Number of Points

For more details on Pearson Correlation, and its relation to the Dot Product, refer to: Wikipedia, Pearson's Correlation

DanieleMerico/HowtoDirectory/PearsonDotP (last edited 2009-07-07 00:48:04 by localhost)

MoinMoin Appliance - Powered by TurnKey Linux