Page 174 - DMGT404 RESEARCH_METHODOLOGY
P. 174

Research Methodology




                    Notes          This unit deals only with linear association between the two variables X and Y. We shall measure
                                   the degree of linear  association by  the Karl  Pearson’s formula  for the coefficient of  linear
                                   correlation.

                                   9.1.2 Karl Pearson’s Coefficient of Linear Correlation

                                   Let us assume, again, that we have data on two variables X and Y denoted by the pairs (X , Y ),
                                                                                                           i  i
                                   i = 1, 2, ...... n. Further, let the scatter diagram of the data be as shown in Figure.
                                   Let  X  and  Y  be the arithmetic means of X and Y respectively. Draw two lines X   X  and  Y   Y
                                   on  the  scatter  diagram.  These  two  lines,  intersect  at  the  point  ( , )X Y   and  are  mutually
                                   perpendicular, divide the whole diagram into four parts, termed as I, II, III and IV quadrants, as
                                   shown.

                                                                     Figure  9.3

                                                          Y
                                                            II     X =  X           I



                                                                                  Y =  Y
                                                                           Y
                                                                         X,
                                                             III         (   )       IV

                                                          O                            X

                                   As mentioned earlier, the correlation between X and Y will be positive if low (high) values of X
                                   are associated with low (high) values of Y. In terms of the above Figure, we can say that when
                                   values of X that are greater (less) than  X  are generally associated  with values of  Y that are
                                   greater (less) than  Y  , the correlation between X and Y will be positive. This implies that there
                                   will be a general tendency  of points to concentrate  in I  and III  quadrants. Similarly,  when
                                   correlation between X and Y is negative, the point of the scatter diagram will have a general
                                   tendency to concentrate in II and IV quadrants.

                                                                                            X
                                   Further, if we consider deviations of values from their means, i.e.,  (X  – )  and  (Y  – ),  we note
                                                                                                      Y
                                                                                          i         i
                                   that:
                                                          Y
                                                X
                                   1.  Both  (X  – )   and  (Y  – )  will be positive for all points in quadrant I.
                                              i         i
                                   2.   (X  – )  will be negative and  (Y  – ) will be positive for all points in quadrant II.
                                                                   Y
                                           X
                                         i                       i
                                   3.  Both  (X  – )  and  (Y  – )  will be negative for all points in quadrant III.
                                                X
                                                         Y
                                              i         i
                                   4.   (X  – )  will be positive and  (Y  – )  will be negative for all points in quadrant IV.
                                                                 Y
                                           X
                                         i                      i
                                                                                         X
                                   It is obvious from the above that the product of deviations, i.e., (X  – )(Y  – )  will be positive for
                                                                                              Y
                                                                                       i    i
                                   points in quadrants I and III and negative for points in quadrants II and IV.
                                     Notes  Since, for positive correlation, the points will tend to concentrate more in I and III
                                     quadrants than in II and IV, the sum of positive products of deviations will outweigh the
                                     sum of negative products of deviations. Thus,  (X  – )(Y  – )  will be positive for all the n
                                                                                    Y
                                                                               X
                                                                             i    i
                                     observations.
                                                                                                         Contd...
          168                               LOVELY PROFESSIONAL UNIVERSITY
   169   170   171   172   173   174   175   176   177   178   179