
VALLIAMMAI ENGINEERING COLLEGE

SRM Nagar, Kattankulathur – 603 203

DEPARTMENT OF
COMPUTER SCIENCE AND ENGINEERING

QUESTION BANK

VII SEMESTER

CS6007-INFORMATION RETRIEVAL

Regulation – 2013

Academic Year 2017 – 18

Prepared by

Ms. G. SANGEETHA, Assistant Professor/CSE

Ms. C. PABITHA, Assistant Professor/CSE



SUBJECT: CS6007 - INFORMATION RETRIEVAL

SEM/YEAR: VII/IV

UNIT I - INTRODUCTION
Introduction – History of IR – Components of IR – Issues – Open source Search engine Frameworks – The impact of
the web on IR – The role of artificial intelligence (AI) in IR – IR Versus Web Search – Components of a Search
engine – Characterizing the web.
PART-A
Q.No Questions BT Level Competence
1 Discuss the process of stemming. BTL 2 Understand
2 Identify the need for Information Retrieval. BTL 4 Analyze
3 List and explain the components of the IR block diagram. BTL 1 Remember
4 List the fundamental concepts in IR. BTL 1 Remember
5 Express the need for tiered indexes. BTL 2 Understand
6 Interpret the role of Artificial Intelligence (AI) in IR. BTL 2 Understand
7 Differentiate data retrieval and information retrieval. BTL 4 Analyze
8 Give the components of Search Engine. BTL 2 Understand
9 What is an extractor? BTL 1 Remember
10 Show the issues that affect IR. BTL 3 Apply
11 Give the purpose of Query Interface. BTL 6 Create
12 Summarize the queries of IR. BTL 5 Evaluate
13 Design the IR architecture diagram. BTL 6 Create
14 State the impact of WEB on IR. BTL 1 Remember
15 Show the type of natural language technology used in information BTL 3 Apply
retrieval.
16 Define Information Retrieval. BTL 1 Remember
17 What is a search engine? BTL 1 Remember
18 Compare IR vs Web Search. BTL 4 Analyze
19 Illustrate the function of an Information Retrieval System. BTL 3 Apply
20 Summarize on text acquisition. BTL 5 Evaluate
PART B
1 i) Summarize the history of IR. (8) BTL 5 Evaluate
ii) Explain the purpose of Information Retrieval System. (8)
2 Describe the various components of Information Retrieval System with BTL 1 Remember
neat diagram. (16)
3 i) Define Information Retrieval system and its features. (4) BTL 1 Remember
ii) Describe the different stages of IR system. (12)
4 i) Identify the various issues in IR system. (8) BTL 1 Remember
ii) Examine the various impacts of the Web on IR. (8)
5 Discuss in detail about the framework of Open Source Search engine BTL 2 Understand
with necessary diagrams. (16)
6 i) Compare in detail Information Retrieval and Web Search with BTL 4 Analyze
examples.(10)
ii) Analyze the fundamental concepts involved in IR system. (6)
7 Demonstrate the role of Artificial Intelligence in Information BTL 3 Apply
Retrieval Systems. (16)
8 i) Describe the various components of a Search Engine. (10) BTL 2 Understand
ii) Express the various Search Engines available in the current world. (6)
9 i) Formulate the working of a Search Engine. (8) BTL 6 Create
ii) Generalize the process of a Search Engine in detail. (8)
10 i) Demonstrate the working of IR architecture with a diagram.(8) BTL 3 Apply
ii) Infer how designing parsing and scoring functions works in detail. (8) BTL 4 Analyze
11 i) Define Information Retrieval. (2) BTL 1 Remember
ii) Describe in detail the IR system, Fundamental concepts, need and
purpose of the system.(4+4+6)
12 Explain how to characterize the web in detail. (16) BTL 4 Analyze
13 Explain the different types of computer software used in computer BTL 4 Analyze
architecture.(16)
14 i) Differentiate database and Information Retrieval with an example. (4) BTL 2 Understand
ii) Summarize the functions and features of Information Retrieval
Systems. (12)
UNIT II - INFORMATION RETRIEVAL

Boolean and vector-space retrieval models – Term weighting – TF-IDF weighting – cosine similarity – Preprocessing
– Inverted indices – efficient processing with sparse vectors – Language Model based IR – Probabilistic IR – Latent
Semantic Indexing – Relevance feedback and query expansion.
PART-A
Q.No Questions BT Level Competence
1 Demonstrate probabilistic Information Retrieval. BTL 3 Apply
2 Analyze the Boolean model. BTL 4 Analyze
3 Construct the Multiplication hardware diagram. BTL 3 Apply
4 List the classes of retrieval model. BTL 1 Remember
5 Define Retrieval model. BTL 1 Remember
6 Express language modelling with example. BTL 2 Understand
7 Illustrate similarity measure. BTL 3 Apply
8 Analyze the problems in lexical semantics. BTL 4 Analyze
9 Differentiate language model and Naïve Bayes. BTL 2 Understand
10 Formulate the Bayesian rule. BTL 6 Create
11 What is meant by a sparse vector? BTL 1 Remember
12 Design an Inverted file with an example. BTL 6 Create
13 Evaluate the goals of LSI. BTL 5 Evaluate
14 What is smoothing? BTL 1 Remember
15 Give probabilistic approaches to IR. BTL 2 Understand
16 What is meant by an inverted file? BTL 1 Remember
17 Interpret cosine similarity measure. BTL 2 Understand
18 Analyze relevance feedback. BTL 4 Analyze
19 List the steps involved in preprocessing. BTL 1 Remember
20 Generalize on why distance is not preferred compared to angle. BTL 5 Evaluate
PART-B
1 i) Express what the Boolean retrieval model is. (4) BTL 2 Understand
ii) Discuss Boolean retrieval in detail with a diagram. (12)
2 Illustrate the Vector space retrieval model with an example. (16) BTL 3 Apply

3 Describe the basic concepts of Cosine similarity. (16) BTL 1 Remember


4 Develop an example to implement term weighting (min docs = 5). (16) BTL 6 Create
5 i) Tabulate the common preprocessing steps. (4) BTL 1 Remember
ii) Describe the document preprocessing steps in detail. (12)
6 i) Discuss in detail term frequency and Inverse Document BTL 2 Understand
Frequency. (8)
ii) Compute TF-IDF, given a document containing terms with the given
frequencies A(3), B(2), C(1). Assume the document collection contains 10,000
documents and the document frequencies of these terms are A(50), B(1300), C(250). (8)
7 i) Explain Latent Semantic Indexing and latent semantic space with an BTL 4 Analyze
illustration. (12)
ii) Analyze the use of LSI in Information Retrieval. What is its need in
synonymy and semantic relatedness? (4)
8 i) Examine how to form a binary term-document incidence matrix. (8) BTL 1 Remember
ii) Give an example for the above. (8)
9 Describe document preprocessing and its stages in detail. (16) BTL 1 Remember
10 i) Discuss the structure of inverted indices. (8) BTL 2 Understand
ii) Discuss the searching process in an inverted file. (8)
11 i) Why do we need sparse vectors? (4) BTL 5 Evaluate
ii) Explain sparse vectors and their efficiency with a diagram. (12)

12 i) Analyze the language model based IR and its probabilistic BTL 4 Analyze
representation. (8)
ii) Compare Language model vs Naive Bayes and Language model vs
Vector space model. (8)

13 Differentiate the various query expansion methods with relevance BTL 4 Analyze
feedback.(16)
14 Apply how Probabilistic approaches to Information Retrieval are done. BTL 3 Apply
(16)
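
As a worked illustration of the arithmetic in Part-B question 6(ii) above, the sketch below computes TF-IDF weights for terms A, B and C from the figures given in that question. The question does not fix a weighting scheme, so this assumes raw term frequency and a base-10 logarithm for IDF, that is, weight = tf × log10(N / df). Other common variants (natural logarithm, log-scaled tf) give different numbers. The function name tf_idf and the variable names are illustrative only.

```python
import math

# Figures taken from question 6(ii).
N = 10_000                                  # documents in the collection
term_freq = {"A": 3, "B": 2, "C": 1}        # term counts in the document
doc_freq = {"A": 50, "B": 1300, "C": 250}   # documents containing each term


def tf_idf(tf: int, df: int, n_docs: int) -> float:
    """Assumed scheme: raw term frequency times log10(N / df)."""
    return tf * math.log10(n_docs / df)


weights = {t: tf_idf(term_freq[t], doc_freq[t], N) for t in term_freq}

for term, weight in weights.items():
    print(f"{term}: {weight:.3f}")
```

Under these assumptions the weights come out to roughly 6.903 for A, 1.772 for B and 1.602 for C. Term A dominates because it is both the most frequent in the document and the rarest in the collection.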

UNIT III - WEB SEARCH ENGINE – INTRODUCTION AND CRAWLING


Web search overview, web structure, the user, paid placement, search engine optimization/spam. Web size
measurement – search engine optimization/spam – Web Search Architectures – crawling – meta-crawlers – Focused
Crawling – web indexes – Near-duplicate detection – Index Compression – XML retrieval.
PART-A
Q.No Questions BT Level Competence
1 Express the basics of web search with a neat diagram. BTL 2 Understand
2 Define Pay for Placement. BTL 2 Understand
3 What is meant by Search Engine Optimization? BTL 1 Remember
4 List the need of web search engine. BTL 1 Remember
5 Draw the architecture of search engine. BTL 2 Understand
6 Distinguish parallel crawler and meta crawler. BTL 2 Understand
7 List the SPAM Techniques. BTL 1 Remember
8 Evaluate the use of Full text indexing and human indexing. BTL 5 Evaluate
9 State the issues in search engines. BTL 1 Remember
10 Design the standard web crawler. BTL 6 Create
11 Classify the ways to identify duplication. BTL 4 Analyze
12 How to apply duplicate detection to web pages? BTL 3 Apply
13 Assess the need for keyword stuffing. BTL 5 Evaluate
14 What are the challenges in data traversing by search engines? BTL 1 Remember
15 Show the applications of search engines. BTL 3 Apply
16 Point out the use of Web indexing. BTL 4 Analyze
17 What is a focused crawler? BTL 1 Remember
18 Illustrate the hashing technique with example. BTL 3 Apply
19 Classify the types of search engines. BTL 4 Analyze
20 Generalize on XML Retrieval. BTL 6 Create
PART-B
1 Discuss the Search Engine Optimization/SPAM in detail. (16) BTL 2 Understand
2 i) Describe in detail XML Retrieval. (12) BTL 1 Remember
ii) What is Structured and Unstructured Retrieval? (4)
3 i) List the types of Search Engine and explain them. (8) BTL 1 Remember
ii) Describe the working of a Search Engine. (8)
4 Design and develop a Web search Architecture and the components of BTL 6 Create
search engine and its issues.(16)
5 i) What is P4P? Elaborate on Paid Placement. (8) BTL 1 Remember
ii) What is the purpose of Web indexing? (8)
6 i) Summarize on the working of a Web crawler with its diagram. (10) BTL 2 Understand
ii) Distinguish visual vs programmatic crawler. (6)
7 i) Differentiate meta crawler and focused crawler. (8) BTL 4 Analyze
ii) Analyze URL normalization. (8)
8 Recommend the need for Near-Duplication Detection by the ways to BTL 5 Evaluate
identify the duplication. (16)
9 Examine the behavior of web crawler and the outcome of crawling BTL 3 Apply
policies.(16)
10 i) Explain the overview of Web search. (8) BTL 4 Analyze
ii) Describe the structure of the Web and its characteristics. (8)
11 Discuss the process of index compression in detail.(16) BTL 2 Understand
12 i) Explain the need for a Web Search Engine. (6) BTL 4 Analyze
ii) Point out the challenges in data traversing by search engines and how you will
overcome them. (10)
13 Describe the following with example. (4*4) BTL 1 Remember
i) Bag of Words
ii) Shingling
iii) Hashing
iv) MinHash and SimHash
14 Based on the Application of Search Engines, How will you categorize BTL 3 Apply
them and what are the issues faced by them? (16)
UNIT IV - WEB SEARCH – LINK ANALYSIS AND SPECIALIZED SEARCH
Link Analysis – hubs and authorities – Page Rank and HITS algorithms – Searching and Ranking – Relevance Scoring
and ranking for Web – Similarity – Hadoop & Map Reduce – Evaluation – Personalized search – Collaborative
filtering and content-based recommendation of documents and products – handling “invisible” Web – Snippet
generation, Summarization, Question Answering, Cross-Lingual Retrieval.
PART-A
Q.No Questions BT Level Competence
1 Describe the main idea of Link Analysis. BTL 2 Understand
2 Illustrate the web as a directed graph. BTL 3 Apply
3 List the issues of page rank algorithm. BTL 1 Remember
4 Analyze how citation analysis is done. BTL 4 Analyze
5 Quote the importance of Anchor text and indexing. BTL 1 Remember
6 Define Hub. BTL 1 Remember
7 What is meant by Query independent ordering? BTL 2 Understand
8 State the aim of question answering. BTL 1 Remember
9 Differentiate between citations and links. BTL 2 Understand
10 Show the working of random walks in Graphs. BTL 3 Apply
11 Evaluate on Recommender System. BTL 5 Evaluate
12 Define Lossy compression mechanisms. BTL 1 Remember
13 Integrate the ideas of HITS Algorithm. BTL 6 Create
14 Assess on the parts of Search engine. BTL 5 Evaluate
15 What is MapReduce? BTL 1 Remember
16 Express Recall at rank and Precision at rank. BTL 2 Understand
17 Formulate the examples for Boolean queries. BTL 6 Create
18 Categorize the modules of Hadoop Framework. BTL 4 Analyze
19 Analyze the Collaborative filtering and challenges. BTL 4 Analyze
20 Demonstrate Bayesian Inferencing. BTL 3 Apply
PART-B
1 i) Define Link Analysis and explain in detail. (8) BTL 1 Remember
ii) Describe in detail Hubs and Authorities. (8)
2 BTL 2 Understand
i) Give the concept of Page Ranking in detail. (8)
ii) Summarize the preprocessing and query processing of Page Rank along
with its issues. (8)
3 Discuss in detail about HITS Algorithm with necessary examples.(16) BTL 2 Understand

4 BTL 3 Apply
Illustrate the abstract search engine. How will you speed up snippet
generation? Explain with an algorithm. (16)
5 BTL 1 Remember
Describe the aim and purpose of Question Answering in detail. (16)
6 i) Point out the stages of summarization. (8) BTL 4 Analyze
ii) Analyze how Handling Invisible Web is done. (8)
7 Evaluate the concept of Personalized Search and its methodology (16) BTL 5 Evaluate

8 i) Analyze content-based recommendations of documents and products. (8) BTL 4 Analyze
ii) Analyze the process of cross-lingual retrieval. (8)
9 Formulate the working of HADOOP and Map Reduce in detail. (16) BTL 6 Create
10 i) Define contextual computing and discuss Personalized search. (12) BTL 1 Remember
ii) Describe how to solve privacy problems. (4)
11 BTL 4 Analyze
i) Explain the working of collaborative filtering by analyzing any two case
studies. (10)
ii) Give the challenges of Collaborative filtering. (6)

12 Describe the Searching and Ranking process in detail with necessary BTL 1 Remember
examples. (16)

13 BTL 3 Apply
i) Show the performance of TREC Systems. (8)
ii) Illustrate the CLIR Approaches (8)
14 Describe in detail Snippet Generation along with an BTL 2 Understand
example. (16)
UNIT V - DOCUMENT TEXT MINING
Information filtering; organization and relevance feedback – Text Mining – Text classification and clustering –
Categorization algorithms: naive Bayes; decision trees; and nearest neighbor – Clustering algorithms: agglomerative
clustering; k-means; expectation maximization (EM).
PART-A
Q.No Questions BT Level Competence
1 Distinguish IF vs IR. BTL 2 Understand
2 Define the general features of Filtering. BTL 1 Remember
3 Give the idea of filtering rules and attributes. BTL 2 Understand
4 Compare Automatic vs Social Filtering. BTL 4 Analyze
5 What is the need of Filtering against spamming? BTL 1 Remember
6 Give some examples of EM. BTL 1 Remember
7 What is Text Mining? BTL 1 Remember
8 Evaluate the process of Text Mining. BTL 5 Evaluate
9 Formulate the estimation of Multinomial document model and BTL 6 Create
Bernoulli document model.
10 Analyze the types of filters. BTL 4 Analyze
11 Integrate the problems of k-means method. BTL 6 Create
12 State positive and negative feedback. BTL 1 Remember
13 Summarize relevance feedback with an example. BTL 2 Understand
14 What are the types of data in clustering analysis? BTL 1 Remember
15 Point out the advantages and disadvantages of Decision Tree algorithm. BTL 4 Analyze
16 Show the applications of text mining. BTL 3 Apply
17 Illustrate the advantages of Naive Bayes. BTL 3 Apply
18 Assess how to measure the distance between clusters. BTL 5 Evaluate
19 Distinguish Supervised learning and Unsupervised Learning. BTL 2 Understand
20 Summarize the major clustering approaches. BTL 3 Apply
PART-B
1 i) List the general features of Filtering, rules and its attributes. (8) BTL 1 Remember
ii) Describe the filtering using IR in detail. (8)
2 Describe the various types of filters, Profiling and filtering BTL 1 Remember
technologies in detail. (16)
3 i) Give the examples of EM method. (6) BTL 2 Understand
ii) Summarize the profiling and Filtering Technologies. (10)
4 i) Express the process of Text Mining. (8) BTL 2 Understand
ii) Explain the challenges and applications of Text Mining. (8)
5 Analyze the procedure involved in Expectation Maximization along
with the steps involved in it. (16) BTL 4 Analyze

6 i) Define Topic detection and tracking, Clustering in TDT. (6) BTL 1 Remember
ii) Examine in detail Cluster Analysis in Text Clustering. (10)
7 Illustrate in detail with examples about Organization and Relevance BTL 3 Apply
feedback. (16)
8 i) Evaluate Agglomerative Clustering and HAC in detail. (8) BTL 5 Evaluate
ii) Evaluate the various classification methods of Text. (8)
9 BTL 6 Create
Summarize on Clustering Algorithms and types of data in cluster
analysis. (16)
10 i)Analyze the working of Nearest Neighbor algorithm along with one BTL 4 Analyze
representation. (8)
ii) Analyze the K-Means Clustering method and the problems in it. (8)

11 Analyze the Decision Tree Algorithm with an illustration. (16) BTL 4 Analyze

12 Describe in detail about Text Mining and its process with detailed BTL 1 Remember
example. (16)
13 Discuss in detail about Text Classification and Text Clustering.(16) BTL 2 Understand
14 Apply Naive Bayes Algorithm for an example and demonstrate its BTL 3 Apply
working in detail. (16)
