Syllabus

Course Code: MT-CSE-20-31    Course Name: Elective-V - (iii) Information Retrieval System

MODULE NO / UNIT COURSE SYLLABUS CONTENTS OF MODULE NOTES
1 Introduction: Text analysis, Types of text analysis, Information retrieval, IR system architecture: Text processing (Text format, Tokenization, stemming, lemmatization, Language modelling), Indexes and query matching.
Informational Retrieval: Query processing models. Probabilistic models (Binary independence model, Robertson/Spark Jones weighting formula, Two-Poisson model), Relevance feedback (Term selection, Pseudo relevance feedback); language models: Unigram, Bigram language models, Generating queries from documents, Language models and smoothing, Ranking with language models, KullbackLeibler divergence, Divergence from randomness, Passage retrieval and ranking.
2 Management of Information Retrieval Systems: Knowledge management, Information management, Digital asset management, Network management, Search engine optimization, Records compliance and risk management, Version control, Information system failure.
Types of information retrieval systems: Web retrieval and mining, Semantic web, XML information retrieval, Recommender systems and expert locators, Knowledge management systems, Decision support systems, Geographic information system(GIS).
3 Indexing: Inverted indices, Index components and Index life cycle, Interleaving Dictionary and Postings lists, Index construction, Query processing for ranked retrieval, Compression: General purpose data compression, Symbol-wise data compression, Compressing posting lists, Compressing the dictionary.
Information categorization and filtering: Classification, Probabilistic classifiers, linear classifiers, Similarity-based classifiers, Multi category ranking and classification, learning to rank, Introduction to the clustering problem, Partitioning methods, Clustering versus classification, Reduced dimensionality/spectral methods.
4 Sentiment Analysis: Introduction to sentiment analysis, Document-level sentiment analysis, Sentence-level sentiment analysis, Aspect-based sentiment analysis, Comparative sentiment analysis, baseline algorithm, Lexicons, Corpora , Tools of Sentiment analysis, Applications.
Copyright © 2020 Kurukshetra University, Kurukshetra. All Rights Reserved.