Page 165 - DLIS402_INFORMATION_ANALYSIS_AND_REPACKAGING
P. 165

Information Analysis and Repackaging



                   Notes         Post-co-ordinated indexing where terms are combined at the time of searching would reduce this
                                 effect but the onus would be on the searcher to link appropriate terms as opposed to the information
                                 professional. In addition terms that occur infrequently may be highly significant for example a new
                                 drug may be mentioned infrequently but the novelty of the subject makes any reference significant.





                                          One method for allowing rarer terms to be included and common words to be excluded
                                          by automated techniques would be a relative frequency approach were frequency of
                                          a word in a document is compared to frequency in the database as a whole.

                                 Therefore a term that occurs more often in a document than might be expected-based on the rest of
                                 the database could then be used as an index term, and terms that occur equally frequently throughout
                                 will be excluded. Another problem with automated extraction is that it does not recognise when a
                                 concept is discussed but is not identified in the text by an indexable keyword.

                                 Assignment Indexing

                                 An alternative is assignment indexing where index terms are taken from a controlled vocabulary.
                                 This has the advantage of controlling for synonyms as the preferred term is indexed and synonyms
                                 or related terms direct the user to the preferred term. This means the user can find articles regardless
                                 of the specific term used by the author and saves the user from having to know and check all possible
                                 synonyms . It also removes any confusion caused by homographs by inclusion of a qualifying term.
                                 A third advantage is that it allows the linking of related terms whether they are linked by hierarchy
                                 or association, e.g., an index entry for an oral medication may list other oral medications as related
                                 terms on the same level of the hierarchy but would also link to broader terms such as treatment.
                                 Assignment indexing is used in manual indexing to improve inter-indexer consistency as different
                                 indexers will have a controlled set of terms to choose from. Controlled vocabularies do not completely
                                 remove inconsistencies as two indexers may still interpret the subject differently.

                                 Index Presentation

                                 The final phase of indexing is to present the entries in a systematic order. This may involve linking
                                 entries. In a pre-coordinated index the indexer determines the order in which terms are linked in an
                                 entry by considering how a user may formulate their search. In a post-coordinated index, the entries
                                 are presented singly and the user can link the entries through searches, most commonly carried out
                                 by computer software. Post-coordination results in a loss of precision in comparison to pre-
                                 coordination.

                                 Depth of Indexing

                                 Indexers must make decisions about what entries should be included and how many entries an index
                                 should incorporate. The depth of indexing describes the thoroughness of the indexing process with
                                 reference to exhaustively and specificity.

                                 Exhaustive Index
                                 An exhaustive index is one which lists all possible index terms. Greater exhaustive content gives a
                                 higher recall, or more likelihood of all the relevant articles being retrieved, however, this occurs at the
                                 expense of precision. This means that the user may retrieve a larger number of irrelevant documents
                                 or documents which only deal with the subject in little depth.





            160                              LOVELY PROFESSIONAL UNIVERSITY
   160   161   162   163   164   165   166   167   168   169   170