Page 169 - DECO504_STATISTICAL_METHODS_IN_ECONOMICS_ENGLISH
P. 169

Unit 10: Correlation: Scatter Diagram Method, Karl Pearson's Coefficient of Correlation


            The relationship between r and r  may be noted—as the value of r decreases from its maximum value  Notes
                                     2
            of 1, the value of r  decreases much more rapidly. r will, of course, always be larger than r , unless r 2
                                                                                  2
                          2
            = 0 or 1.
                                        r                        r 2
                                       0.90                     0.81
                                       0.80                     0.64
                                       0.70                     0.49
                                       0.60                     0.36
                                       0.50                     0.25
            Thus the coefficient of correlation is 0.707 when just half the variance in Y is due to X.
            It should be clearly noted that the fact that a correlation between two variables has a value of r = 0.60
            and the correlation between two other variables has a value of r = 0.30 does not demonstrate that the
            first correlation is twice as strong as the second. The relationship between the two given values of r
                                                                                         2
            can better be understood by computing the value of r . When r = 0.6, r  = 0.36 and when r = 0.30, r  =
                                                                   2
                                                      2
            0.09.
            The coefficient of determination is a highly useful measure. However, it is often misinterpreted. The
            term itself may be misleading in that it implies that the variable X stands in a determining or causal
            relationship to the variable Y. The statistical evidence itself never establishes the existence of such
            causality. All that statistical evidence can do is to define covariation, that term being used in a perfectly
            neutral sense. Whether causality is present or not, and which way it runs if it is present, must be
            determined on the basis of evidence other than the quantitative observations.
            Properties of the Coefficient of Correlation

            The following are the important properties of the correlation coefficient r:
            1.  The coefficient of correlation lies between – 1 and + 1. Symbolically, –1 ≤≤ +1  or || 1r  ≤ .
                                                                          r
            2.  The coefficient of correlation is independent of change of scale and origin of the variables X and
                Y.
            3.  The coefficient of correlation is the geometric mean of two regression coefficients.
                Symbolically,
                                        r =  b xy  ×b yx
            Self-Assessment

            1. Indicate whether the following statements are True or False:
               (i) There are no limits to the value of r.
               (ii) If r is negative both the variable are decreasing.
              (iii) If the values of X variable are 1, 2, 3, 4, 5 and those of Y 4, 6, 8, 10, 12 the Karl Pearson and the
                  Rank method would give the same answer.
              (iv) Pearsonian coefficient is the best under all situations.
               (v) Karl Pearson’s coefficient of correlation always lies between 0 and + 1.
            10.3 Summary

            •   A scatter diagram is used to show the relationship between two kinds of data. It could be the
                relationship between a cause and an effect, between one cause and another, or even between
                one cause and two others.
            •   In statistics, the Pearson product-moment correlation coefficient (r) is a common measure of the
                correlation between two variables X and Y. When measured in a population the Pearson Product




                                             LOVELY PROFESSIONAL UNIVERSITY                                      163
   164   165   166   167   168   169   170   171   172   173   174