Paper /Subject Code: 87011 /Information Retrieval (R-2023-24)
(2/, Hours)
(Total Marks: 75]
N.B. 1) Allqucstions are
compulsory.
2) Figurcs to the right indicate marks.
3) Illustrations, in-depth ánswers and
diagrams will be appreciated.3 : 2
4) Mixing of sub-questions is not allowed.
1. Attempt any four of the following: 20
a. What is information retrieval èxample?What are the characteristics of
retrieval. information
b What are the components and Whatare the major challenges facedin Information
3YASEB32N63I
Retrieval.
C
What is edit distance, and how.is it used in'measuring, string similárity with
suitable
example NSEB3?N63!Y85E
d 32X63!Y8S)
Explain the process of consiructing anjniverted index. How.döes it facilitate B32
efficient'
information retrieval?
e. What is relevance feedback in the context of retrieval models.
f Explain Vector space,model. Discuss TF-ID cosine sinilarity.
EB32X63|YS5E
-N63!YSSEB32X63iYS A(tempt any four of the following : 20
Define textkçätegorization and explain its importance in informationetrieval systems.
b. How canclustering be utilized för query expansion andresult grouping in information
retrieval'systems.
Explain the effectiveness ofk-means and hierarchical clusterig in text data analysis.
Explain the architecture of aweb search engine-What are thÃcomponÇnts involved
in crawling and indexing web pages.
What is the role of supervised Jeaíning techñiques in learning to rank and their
YNSEB32N63 impact on search ngine result quality.
Di_cuss the difference between the PagèRank and HITS algorithms.
3. Attempt any four of the following': 20
Explain breadth-first and depthfirst Web page crawling Techniques?
Define near-duplicate page detection and its significánce in web search. Explain the
challenges associted withidentifying 'ear-duplicle pages.
Describe common techniques used in èxtractive text summarization.
d What are Challenges associated with question answering.
e Define collaborative-filtering and content-þased filtering in recommender systens.
Explain different approaches to machine translation, inchuding rule-based, statistical,
and neural machine
332N63:yxSR:2 translation models.
53384 Page 1 of2
X631Y85EB32X63I Y8SEB32X63 IY8SEB32X631 Y8SEB32
Paper /Subject Code: 87011/ Information Retrieval (R-2023-24)
1S
Attempt any five of the following:
Discuss the steps involved in the Soundex Algorithm for phonetic matching.
Construct 2-gram, 3-gram and 4-gram index for the following terms:
a. banana
b. pineapple
c. computer Ss1
Discuss the Naive Bayes alporithm for lext classification. How does it work, and what
851B32X63|Y8SEB32X631Y8SE
are its assumptions.
KoitV
! 3s4
521NSth2X63!Y85EB32X631YSSEB32X63iY8S
6:!N
d Discuss how link analysis can beused in social network analysis and
recommendation systems.
No31851:B32X63Discuss challenges in abstractive text suñmarization:
Y8%
ER32N63!)S5E43:NG1Y5EB32X63!ysstB2N63!Y8SUD:2N63IV8SEB:631
1YSSERIN6IYSSEB32X63!VSSEB: YS5B32
5 1 : B 3Y85EB32X63!Y85FB32X63|Y8SEB32X63
2X63!
Describe the role. of test collections and benchmarking datasets in evaluating
VSSFB32NA3!Y8SEB32K63
Y8
1SYES
B 53
E /N
BG32
3:0Xsttf85E%
3\)1 3 ! y S S E
h3
Y'SS\B32N63!Y
B32X63!Y8B2X
32
h2N
h7
Y t
8iS
iY
NE8
2BS3
E
Y%5EB32X6s1YNSEB3
2B
X362
3X!Y68
3S!E B 3 2 N h 3 ! Y N
systems.
I iFB32K63|
EB32X631Y'S5EB32X63
YNSIB32N63!YS5EB32X63IYN
32X63iY8SEB32X631)
i)NSLB
33.N63:YS5TB32X63i
EB:2X63
785AD32Y63i1