Page 88 - DEDU504_EDUCATIONAL_MEASUREMENT_AND_EVALUATION_ENGLISH
P. 88

Educational Measurement and Evaluation


                   Notes              find out their reliability, they are administered to certain group of students or individuals,
                                      and then the scores obtained on these two parallel tests are used to calculate correlation
                                      coefficient (r). This correlation multiple indicates their reliability.
                                      Under this method, individual differences of experience, practice and students do not influence
                                      reliability, because the material in the two tests is different. But if there is much similarity
                                      between the questions and materials of the two tests, then their reliability will be enhanced.
                                      Generally, if two tests are administered with a sufficient time interval, then experience,
                                      memory and other factors do not influence scores. It is suitable to have a time interval of
                                      four weeks between the two tests.
                                      Following points should be kept in view while preparing parallel formats :
                                       (a)  The items should be distributed equally from the standpoint of difficulty.
                                       (b)  The items should be homogeneous.
                                        (c)  The administration and scoring methods of the two tests should be equal.
                                       (d)  The number of items in the two formats should be equal.
                                       (e)  Type of items, content, difficulty level and samples should be equal in the two formats.
                                      Limitations
                                       (a)  Tests have to be conducted twice.
                                       (b)  This testing produces the problem of standardization, which in itself is a very complex
                                           and expensive process.
                                        (c)  If the time interval between the administration of two tests is longer, then the error
                                           of testing-retesting method will be repeated in this too.
                                       (d)  Exercise has its influence in this method, because the form of items is almost equal or
                                           similar.
                                       (e)  It is a difficult task to prepare two equal and similar formats for testing.
                                      Merits
                                       (a)  This method consumes far less time as compared to testing-retesting method.
                                       (b)  This method is the amended form of testing-retesting method.
                                        (c)  This has least influence of exercise and memory.
                                       (d) This method can be used for follow-up purpose too.
                                       (e)  This method can be used to ascertain reliability of speed test too.
                                  2.  Method of Rational Equivalence or K-R Formula : Kuder and Richardson presented a method
                                      to ascertain reliability, which is called method of rational equivalence or K-R formula.
                                      Under this method, the limitations of all other methods have been eliminated. Under it, the
                                      test has to be administered only once, and correlation between question items is found out
                                      in order to see similarity between them. So, the reliability coefficient obtained from this
                                      method is also called coefficient of internal consistency. The chief characteristic of internal
                                      consistence is that different items of the test have high correlation with each other. The chief
                                      assumption of the application of this method is that all items included in this test should be
                                      homogeneous, else reliability coefficient will be less than the split-half method. It entails
                                      that a test has the same type of questions, because the aim of all these questions is to
                                      measure the elements which influence ability and personality.
                                      Under this method, two points are given special importance, these are inter-correlation of
                                      the items and correlation of items with the while test.
                                      By inter-correlation of questions is meant to measure the consistency of reactions in a
                                      subject which occur for different items. That is, how consistent are one statement in relation



         82                                 LOVELY PROFESSIONAL UNIVERSITY
   83   84   85   86   87   88   89   90   91   92   93