Page 204 - DMGT404 RESEARCH_METHODOLOGY
P. 204

Research Methodology




                    Notes             Straight-line (linear) relationships are particularly important because a straight line is a
                                       simple pattern that is quite common.
                                      The correlation measures the direction and strength of the linear relationship.
                                      The least-squares regression line is the line  that makes the sum of the squares of  the
                                       vertical distances of the data points from the line as small as possible.

                                      Non-parametric regression analysis traces the dependence of a response variable on one
                                       or several predictors without specifying in advance the function that relates the response
                                       to the predictors.

                                   9.6 Keywords


                                   Correlation: It is an analysis of covariation between two or more variables.
                                   Correlation Coefficient: It is a numerical measure of the degree of association between two or
                                   more variables.

                                   Kernel Estimation: The kernel regression is a non-parametric technique in statistics to estimate
                                   the conditional expectation of a random variable.
                                   Regression Equation: If the coefficient of correlation calculated for bivariate data (Xi, Yi), i = 1, 2,
                                   ...... n, is reasonably high and a cause and effect type of relation is also believed to be existing
                                   between them, the next logical step is to obtain a functional relation between these variables.
                                   This functional relation is known as regression equation in statistics.
                                   Smoothing Splines: It is a method of fitting a smooth curve to a set of noisy observations using
                                   a spline function.
                                   9.7 Review Questions


                                   1.  Obtain the two lines of regression from the following data and estimate the blood pressure
                                       when age is 50 years. Can we also estimate the blood pressure of a person aged 20 years on
                                       the basis of this regression equation? Discuss.

                                    Age (X)  (in years)   56   42   72   39   63   47   52   49   40   42   68   60
                                    Blood Pressure (Y)   127   112   140   118   129   116   130   125   115   120   135   133

                                   2.  Show that the coefficient of correlation, r, is independent of change of origin and scale.
                                   3.  Prove that the coefficient of correlation lies between – 1 and + 1.
                                   4.  “If two variables are independent the correlation between them is zero, but the converse
                                       is not always true”. Explain the meaning of this statement.
                                   5.  What is Spearman’s rank correlation? What are the advantages of the coefficient of rank
                                       correlation over Karl Pearson’s coefficient of correlation?
                                   6.  Distinguish between  correlation and regression. Discuss least square method of  fitting
                                       regression.
                                   7.  What do you understand by linear regression?  Why there are two lines of  regression?
                                       Under what condition(s) can there be only one line?
                                   8.  What do you think as the reason behind the two lines of regression being different?

                                   9.  For a bivariate data, which variable can we have as independent? Why?
                                   10.  What can you conclude on the basis of the fact that the correlation between body weight
                                       and annual income were high and positive?





          198                               LOVELY PROFESSIONAL UNIVERSITY
   199   200   201   202   203   204   205   206   207   208   209