Page 175 - DCOM203_DMGT204_QUANTITATIVE_TECHNIQUES_I
P. 175

Quantitative Techniques – I




                    Notes          8.1.5 Karl Pearson’s Coefficient of Linear Correlation

                                   Let us assume, again, that we have data on two variables X and Y denoted by the pairs (X , Y ),
                                                                                                           i  i
                                   i = 1,2, ...... n. Further, let the scatter diagram of the data be as shown in figure 8.3.
                                                         Figure 8.3:  Scatter  Diagram  of the  Data

















                                   Let X  and Y  be the arithmetic means of X and Y respectively. Draw two lines X = X  and Y = Y
                                   on  the  scatter  diagram.  These  two lines,  intersect  at  the  point  (X ,Y )  and  are  mutually
                                   perpendicular, divide the whole diagram into four parts, termed as I, II, III and IV quadrants, as
                                   shown.
                                   As mentioned earlier, the correlation between X and Y will be positive if low (high) values of X
                                   are associated with low (high) values of Y. In terms of the above figure, we can say that when
                                   values of X that are greater (less) than  X  are generally associated with values of  Y that are
                                   greater (less) than Y , the correlation between X and Y will be positive. This implies that there
                                   will be a  general tendency  of points  to concentrate in I  and III  quadrants. Similarly, when
                                   correlation between X and Y is negative, the point of the scatter diagram will have a general
                                   tendency to concentrate in II and IV quadrants.

                                   Further, if we consider deviations of values from their means,  i.e.,  X i  X  and  Y i  Y , we
                                   note that:

                                   1.  Both  X i  X  and  Y i  Y will be positive for all points in quadrant I.

                                   2.    X i  X  will be negative and  Y i  Y  will be positive for all points in quadrant II.

                                   3.  Both  X i  X and  Y i  Y   will be negative for all points in quadrant III.


                                   4.    X i  X will be positive and  Y i  Y   will be negative for all points in quadrant IV.

                                   It is obvious from the above that the product of deviations, i.e.,  X  i  X Y i  Y will be positive
                                   for points in quadrants I and III and negative for points in quadrants II and IV.
                                   Since, for positive correlation, the points will tend to concentrate more in  I and III quadrants
                                   than in II and IV, the sum of positive products of deviations will outweigh the sum of negative
                                   products of deviations. Thus,   X i  X Y i  Y will be positive for all the n observations.

                                   Similarly, when correlation is negative, the points will tend to concentrate more in II and IV
                                   quadrants than in I and III. Thus, the sum of negative products of deviations will outweigh the




          170                               LOVELY PROFESSIONAL UNIVERSITY
   170   171   172   173   174   175   176   177   178   179   180