Key word indexing models, Humanities


KEY WORD INDEXING MODELS 

Computers began to be used to aid information retrieval systems in the 1950s. Their first use in information retrieval was the production of indexes. The Central Intelligence Agency (CIA) of the USA is said to be the first organisation to have used a machine-produced keywords-from-title index, from 1952. H P Luhn and his associates produced permuted title indexes at the International Conference on Scientific Information held in Washington in 1958. Luhn named his index the "Key-Word-in-Context" (KWIC) index and reported its method of generation in a paper in 1959. The success of KWIC was established after its adoption by the American Chemical Society in 1961 in its publication "Chemical Titles".

"Keyword" means a subject-denoting word, chosen mainly from the title and sometimes from the abstract or text of a document for the purpose of indexing. The terms chosen may be single words, multi-word terms or even phrases that convey the content of the document. Luhn's system, however, drew its keywords only from the words in the title of a document. Several keywords may be chosen for a document to provide access from the different approaches of users. Since keyword indexing is based on the natural language terminology of the documents, this system is also known as a "Natural Language Indexing System".

The KWIC index, developed by H P Luhn, is said to be one of the earliest and most successful computer-generated keyword indexes. In his method, he suggested selecting words from the title, excluding the unwanted or insignificant words. Each selected word forms an index term, and the other words in the title are, in his words, "wrapped around it". These other words serve as the context. The KWIC indexing system thus uses natural language terminology to generate the index entries.

All of the words in the titles of a batch of documents for which an index is required are matched by a computer against a stop-list. This stop-list, or stop-word list, is a record of words which are insignificant in an index. It includes articles and auxiliary verbs, together with such general words as "aspect", "different", "method", "very", etc. Each major search system has defined its own list of stop-words according to its subject orientation. Some words which might be feasible access points in a general index prove worthless in an index devoted to a special subject area. Indiscriminately marking articles, prepositions, etc. as stop-words may create problems, because such words form part of important scientific and technical terms such as "Vitamin A" and "On line". In view of this, the words to be included in the stop-list must be selected in the light of the subject orientation of the index. Stop-words do not appear as entry words, but they are displayed in the titles in the index in order to provide the context of the document.

No controlled vocabulary is required for keyword indexing. Indexing terms are selected from the natural language of the documents. In addition to the KWIC index, a number of varieties of keyword index have been developed over the years. The two most important versions are Key Word Out of Context (KWOC) and Key Word And Context (KWAC). They differ only in format; the indexing principles and techniques remain more or less the same.
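The stop-list matching and "wrap-around" steps described above can be illustrated with a short Python sketch. This is not Luhn's original program: the function names, the sample stop-list, the document identifier "D1" and the use of "/" to mark the end of the title are all assumptions made for illustration, and the rotated layout is a simplification of the printed KWIC format.

```python
# Illustrative stop-list only (an assumption); a real index tunes this
# list to the subject orientation of the collection.
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "for", "and", "to", "by",
              "is", "are", "aspect", "different", "method", "very"}


def keyword_positions(title, stop_words=STOP_WORDS):
    """Return the positions of title words that are not on the stop-list."""
    positions = []
    for i, word in enumerate(title.split()):
        if word.strip(".,;:()\"'").lower() not in stop_words:
            positions.append(i)
    return positions


def kwic_entries(title, doc_id, stop_words=STOP_WORDS):
    """Build one entry per keyword: (keyword, rotated title, doc_id).

    The title is rotated so the keyword comes first; "/" marks where the
    original title ends and the preceding words wrap around.
    """
    words = title.split()
    entries = []
    for i in keyword_positions(title, stop_words):
        rotated = words[i:] + ["/"] + words[:i]
        keyword = words[i].strip(".,;:()\"'").lower()
        entries.append((keyword, " ".join(rotated), doc_id))
    return entries


def kwic_index(documents, stop_words=STOP_WORDS):
    """Merge the entries of all titles and sort them by keyword."""
    entries = []
    for doc_id, title in documents.items():
        entries.extend(kwic_entries(title, doc_id, stop_words))
    return sorted(entries)


if __name__ == "__main__":
    # Hypothetical one-document batch.
    for keyword, context, doc_id in kwic_index(
            {"D1": "Indexing of Chemical Literature"}):
        print(f"{context:<40} {doc_id}")
```

In this sketch the stop-word "of" never becomes an entry word but still appears in every rotated title, so the context is preserved. The same stop-list would also suppress the "A" of "Vitamin A" as an entry word, which is why the list needs to be chosen with the subject area in mind.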




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd