Page 233 - DLIS402_INFORMATION_ANALYSIS_AND

Page 233 - DLIS402_INFORMATION_ANALYSIS_AND_REPACKAGING

P. 233

Information Analysis and Repackaging

Notes on the same fields as the books that they index. Indexers can keep in mind that any given book has
a spectrum of users. A technical book that is aimed at professional computer programmers, for
instance, may also be referred to by student, academic, or hobby programmers. A professional
programmer might look up a function by its known name, where as a student may be looking for a
function that fulfills a role. Both options for look-up should be present for important functions.
Importance is the main criteria for helping the user. But there is no doubt that users sometimes look
up trivia. If the length of the index allows for it, certainly trivia can be indexed. But to index trivia
rather than deeply indexing important subjects is a mistake.
Machine generated indexes, in their present state, are more helpful to users than having no index at
all. Amateur created indexes (usually by the author of the book) are usually at least as useful as
machine-generated ones. If the amateur knows the subject materials but not professional indexing
style rules, their mistakes tend to be largely stylistic. If a professional indexer does not understand
the material being indexed, the indexing errors tend to concern the choices of entries. Professionally
produced indexes are usually far better than machine-generated or amateur-created indexes. “Better”
here means that users can find the information they seek with minimal effort.

Author/Indexer/User Knowledge Relationships

Non-fiction book authors are consciously trying to convey a body of knowledge to the readers of
their books. The author and indexer each enter the enterprise with a body of knowledge. Their
knowledge bases may or may not be very similar. At minimum both the author and indexer are
proficient in the language used for the book. We might expect, therefor, that a machine indexer
must be proficient in the language to do a good job of indexing. The author has a body of knowledge
which he distills [note MI’s: example of metaphorical use of “distill”] into book form.

The indexer also has a body of knowledge. Reading the book (which would be impossible without
a pre-existing body of knowledge) the indexer, like other readers, learns what the author conveys.
It may be a particular arrangement of knowledge the indexer already has, but typically the indexer
does some, perhaps a lot, of learning while indexing. The index reflects both the knowledge of the
author and of the indexer. The author does not have to know how to create an index as he writes the
book.
Both the author and the indexer have a knowledge of the knowledge likely to be present already in
the book’s target readers (and index users). These readers might fall into classes: students with no
prior knowledge for whom the book is course work; workers in the field who use it only for reference;
etc., depending on the book.
If the author has written an introductory text aimed at initially ignorant readers, it is possible that
the indexer may do a good job without knowing much subject matter. If the author has written a
more advanced text, an indexer with no knowledge of the subject matter is likely to produce a poor
(not that helpful to users) index. Often the indexer will have a good general knowledge of a subject
such as computer science, and will be indexing a book on a relatively narrow subject (say C++
programming, or graphics algorithms).
Given that poor indexes are produced by human indexers who do not understand the subject matter,
we might expect that an MI will produce a poor index if it does not understand the subject matter.
“Understand” is, admittedly, a difficult to define precisely. We can also say the MI needs to begin
with a knowledge base similar to a human indexers, and needs to be able to learn (add to its
knowledge) just as a human indexer does.
A professional indexer who indexes a book on a subject that is similar to a prior book they indexed
will have a general index framework in mind. Such an indexer might think thoughts such as “ahah,
my first entry on the subject of Internet browsers,” or “oh no, she’s writing about event delegates,
I’ve had trouble understanding that in the past.”

228 LOVELY PROFESSIONAL UNIVERSITY

228 229 230 231 232 233 234 235 236 237 238