Indexing is the most vital part of any Information Retrieval System. It is a process in which the documents required by the users are transformed into searchable data structures. Indexing can be also referred to as the process of extraction rather than analysis of particular content. It creates a core functionality of the IR process since it is the first step in IR and assists in efficient information retrieval.
In the process, first, the document surrogates are created to represent each document. Secondly, it requires analysis of original documents that include simple (identifying meta-information e.g., author, title, subject etc.) and complex (linguistic analysis of content) data. Indexes are the data structures that are used to make the search faster.
Evaluation in Information Retrieval is the process of systematically determining a subject’s merit, worth, and significance by using certain criteria that are governed by a set of standards.
Issues in Information Retrieval :
The main issues of the Information Retrieval (IR) are Document and Query Indexing, Query Evaluation, and System Evaluation.
- Document and Query Indexing –
Main goal of Document and Query Indexing is to find important meanings and creating an internal representation. The factors to be considered are accuracy to represent semantics, exhaustiveness, and facility for a computer to manipulate.
- Query Evaluation –
In the retrieval model how can a document be represented with the selected keywords and how are documents and query representations compared to calculate a score. Information Retrieval (IR) deals with issues like uncertainty and vagueness in information systems.
- Uncertainty :
The available representation does not typically reflect true semantics of objects such as images, videos etc.
- Vagueness :
The information that the user requires lacks clarity, is only vaguely expressed in a query, feedback or user action.
- Uncertainty :
- System Evaluation –
System Evaluation tells about the importance of determining the impact of information given on user achievement. Here, we see if the efficiency of the particular system related to time and space.
Attention reader! Don’t stop learning now. Get hold of all the important CS Theory concepts for SDE interviews with the CS Theory Course at a student-friendly price and become industry ready.
- Web Information Retrieval | Vector Space Model
- Federated database management system issues
- Data Management issues in Mobile database
- Future Works in Geographic Information System
- What is EII(Enterprise Information Integration)?
- Checkpoints in DBMS
- Partial, Unique, Secondary, Composite and Surrogate keys in DBMS
- Model Planning for Data Analytics
- Attribute Closure Algorithm and its Utilization
- Difference between Entity constraints, Referential constraints and Semantic constraints
- RPAD() Function in MySQL
- Classification of Data
- Stored Procedure for prime numbers in MYSQL
- Storage Definition Languages (SDL)
If you like GeeksforGeeks and would like to contribute, you can also write an article using contribute.geeksforgeeks.org or mail your article to email@example.com. See your article appearing on the GeeksforGeeks main page and help other Geeks.
Please Improve this article if you find anything incorrect by clicking on the "Improve Article" button below.