Page 130 - DLIS402_INFORMATION_ANALYSIS_AND_REPACKAGING
P. 130

Unit 6: Information Retrieval Model and Search Strategies




            systems than those which have been characteristically assumed in information retrieval research.  Notes
            Two paradigms have been distinguished in information retrieval research, the cognitive and the
            physical, but they share the assumption of the value of delivering relevant records. For the purposes
            of the discussion here, they can be considered as a single, if heterogeneous, paradigm, linked if not
            united, by this common assumption.
            The contrasting paradigm implicitly embodied in classification and indexing may finally be
            incommensurable with that of information retrieval research, with disputes not logically resolvable
            within either paradigm. The approach taken in this paper will be suggested:
            That an alternative principle involving discriminatory power has been held, implicitly and explicitly,
            in a number of largely separate discourses;
            That the cumulative effect of recognising this is to indicate more viable and productive criteria for
            designing, using and evaluating information retrieval systems; and, finally, that the classical tradition
            of information retrieval research can itself be assimilated to the new model.
            In this final respect, the development proposed here is an exemplar of scientific development in
            which discarded paradigms are absorbed into developing ones, as special cases.
            The discourses in which an alternative principle for the design and evaluation of information retrieval
            systems can be discovered and which are to be covered here are:
            An emerging, although rather isolated and discontinuous, strand of information retrieval research;
            Accepted discussions of the principles of classification and indexing;
            Giambattista Vico’s critique of Aristotelian principles and categories for classification;
            and, most crucially, ordinary language discussion of information systems.


            Information Retrieval Research

            Information retrieval research, particularly in the experimental tradition emerging in the 1950s in
            Britain and North America, has taken as its founding assumption the principle that an ideal information
            system should deliver all (and possibly only all) the records relevant to a given information need. In
            order to evaluate information systems in relation to this desired end, or variations on it, various steps
            were taken: relevance was stabilised and quantified, sometimes being reduced to a binary or
            dichotomous variable; and measures of precision and recall, which depend on the prior stabilisation
            of relevance, were developed.
            More recent research has questioned the validity of aspects of this paradigm, although more
            frequently with reference to its subsidiary concepts (relevance and information need) and derived
            measures (precision and recall) than with regard to its founding assumption.
            The adequacy of the concept of relevance employed information retrieval research has been
            questioned. Experiments substitute a measurable phenomenon, relevance as constructed under
            artificial conditions, for an immeasurable one, relevance under operational conditions, but fail to
            demonstrate that there is an adequate correlation between the two. Most disturbingly, it has been
            suggested that operational relevance is fluid, influenced by intention, context, and other documents
            seen or read, and simply not amenable to stabilisation or, further, quantification.
            The classical measures of precision and recall are also rendered increasingly artificial by the high
            degree of interactivity enabled by recent information technology developments. How, when
            searching a CD-ROM database, is the final set of records to be isolated except by a process whose
            very arbitrariness invalidates it as a component of a measure of system performance? High
            interactivity, and unmediated searching, also reduces the need for a query to be fully articulated in
            advance of searching.







                                             LOVELY PROFESSIONAL UNIVERSITY                                   125
   125   126   127   128   129   130   131   132   133   134   135