SMART Information Retrieval System
The SMART Information Retrieval System is an information retrieval system developed at Cornell University in the 1960s. Many important concepts in information retrieval were developed as part of research on the SMART system, including the vector [space model], relevance feedback, and Rocchio classification.
Gerard Salton led the group that developed SMART. Other contributors included Mike Lesk.
The SMART system also provides a set of corpora, queries and reference rankings, taken from different subjects, notably
- Association for [Information Science and Technology|ADI]: publications from information science reviews
- Computer science
- Cranfield collection: publications from aeronautic reviews
- Forensic science: library science
- MEDLARS collection: publications from medical reviews
- Time magazine collection: archives of the generalist review Time in 1963
ddd.qqq, where the first three letters represents the term weighting of the collection document vector and the second three letters represents the term weighting for the query document vector. For example, ltc.lnn represents the ltc weighting applied to a collection document and the lnn weighting applied to a query document.The following tables establish the SMART notation:
The gray letters in the first, fifth, and ninth columns are the scheme used by Salton and Buckley in their 1988 paper. The bold letters in the second, sixth, and tenth columns are the scheme used in experiments reported thereafter.