Syllabus
Course Code: MT-CSE-20-31 Course Name: Elective-V - (iii) Information Retrieval System |
||
MODULE NO / UNIT | COURSE SYLLABUS CONTENTS OF MODULE | NOTES |
---|---|---|
1 | Introduction: Text analysis, Types of text analysis, Information retrieval, IR system architecture: Text processing (Text format, Tokenization, stemming, lemmatization, Language modelling), Indexes and query matching. Informational Retrieval: Query processing models. Probabilistic models (Binary independence model, Robertson/Spark Jones weighting formula, Two-Poisson model), Relevance feedback (Term selection, Pseudo relevance feedback); language models: Unigram, Bigram language models, Generating queries from documents, Language models and smoothing, Ranking with language models, KullbackLeibler divergence, Divergence from randomness, Passage retrieval and ranking. |
|
2 | Management of Information Retrieval Systems: Knowledge management, Information management, Digital asset management, Network management, Search engine optimization, Records compliance and risk management, Version control, Information system failure. Types of information retrieval systems: Web retrieval and mining, Semantic web, XML information retrieval, Recommender systems and expert locators, Knowledge management systems, Decision support systems, Geographic information system(GIS). |
|
3 | Indexing: Inverted indices, Index components and Index life cycle, Interleaving Dictionary and Postings lists, Index construction, Query processing for ranked retrieval, Compression: General purpose data compression, Symbol-wise data compression, Compressing posting lists, Compressing the dictionary. Information categorization and filtering: Classification, Probabilistic classifiers, linear classifiers, Similarity-based classifiers, Multi category ranking and classification, learning to rank, Introduction to the clustering problem, Partitioning methods, Clustering versus classification, Reduced dimensionality/spectral methods. |
|
4 | Sentiment Analysis: Introduction to sentiment analysis, Document-level sentiment analysis, Sentence-level sentiment analysis, Aspect-based sentiment analysis, Comparative sentiment analysis, baseline algorithm, Lexicons, Corpora , Tools of Sentiment analysis, Applications. |