Page 186 - DMGT404 RESEARCH_METHODOLOGY
P. 186

Research Methodology




                    Notes          9.1.7 Spearman’s Rank Correlation

                                   This is a crude method of computing correlation between two characteristics. In this method,
                                   various items are assigned ranks according to the two characteristics and a correlation is computed
                                   between these ranks. This method is often used in the following circumstances:
                                   1.  When the quantitative measurements of the characteristics are not possible, e.g., the results
                                       of a beauty contest where various individuals can only be ranked.
                                   2.  Even when the characteristics is measurable, it is desirable to avoid such measurements
                                       due to shortage of time, money, complexities of calculations due to large data, etc.
                                   3.  When the given data consist of some extreme observations, the value of Karl Pearson’s
                                       coefficient is likely to be unduly affected. In such a situation the computation of the rank
                                       correlation is preferred because it will give less importance to the extreme observations.
                                   4.  It is used as a measure  of the degree of  association in situations where the nature of
                                       population, from which data are collected, is not known.
                                   The coefficient of correlation obtained on the basis of ranks is called ‘Spearman’s Rank Correlation’
                                   or simply the ‘Rank Correlation’. This correlation is denoted by  (rho).
                                   Let X  be the rank of ith individual according to the characteristics X and Y  be its rank according
                                       i                                                      i
                                   to the characteristics Y. If there are n individuals, there would be n pairs of ranks (X , Y ), i = 1, 2,
                                                                                                     i  i
                                   ...... n. We assume here that  there are no ties, i.e., no two or more individuals are  tied to  a
                                                             s
                                                       s
                                   particular rank. Thus, X ’  and Y ’  are simply integers from 1 to n, appearing in any order.
                                                      i     i
                                                                   +
                                                                       +
                                                                +
                                   The means of X and Y, i.e.,  X  Y   1 2 ..... n    ( n n + 1)    n +  1  .
                                                                    n        2n      2
                                                                              é
                                                      1
                                                                +
                                                            2
                                                        2
                                         2
                                                                              ê
                                                   2
                                   Also, s  =    s  =  [1 +  2 + ... n  2  ]– (n +  1)  2    1 n (n +  1)(2n +  1)ù ú  – (n +  1) 2    n  2  – 1
                                        x         y   n                 4    n ë     6     û    4     12
                                   Let d  be the difference in ranks of the ith individual, i.e.,
                                       i
                                                             d  = X  – Y   (X  -  X -  Y - Y )( X   Y )
                                                              i   i  i    i  ) ( i
                                   Squaring both sides and taking sum over all the observations, we get
                                                                                 )
                                                            d  = å  é (X -  X -  Y - Y ù  2
                                                              2
                                                              i    ë  i  ) ( i   û
                                                                 å
                                                                                    2
                                                                         2
                                                               = (X -  X ) + å ( i  Y ) - å ( X -  X Y -  ) Y
                                                                               Y -
                                                                                      2
                                                                                           i
                                                                     i
                                                                                               )( i
                                   Dividing both sides by n, we get
                                                  1 å d  å (X -  X ) + å  Y -  Y ) - å ( X - X Y -  Y )
                                                                                2
                                                                     1
                                                         1
                                                                   2
                                                                              2
                                                      2
                                                  n   i  n    i      n  ( i     n     i   )( i
                                                                                                )(
                                                                                 )
                                                               = s +  s - 2Cov ( ,X Y   2s -  2Cov ( ,X Y   s   s 2  )
                                                                                      2
                                                                  2
                                                                      2
                                                                                                    2
                                                                                                    X
                                                                                     X
                                                                                                        Y
                                                                      Y
                                                                  X
                                                             1 å  d  2
                                   From this, we can write  1 -    ´  i
                                                             n  2s X 2
                                                                    1 å  d  2  1 å  d  2          6å d 2
                                                                 1 -  ´   i    1 -  ´  i  ´  12    1 -  i
                                   or                         =         2               2          2
                                                                    n  2s X    n    2   n - 1     ( n n -  ) 1
          180                               LOVELY PROFESSIONAL UNIVERSITY
   181   182   183   184   185   186   187   188   189   190   191