UNIT 3 - Part 1 Google Docs


‭UNIT-3‬

‭1. Statistical Indexing‬


‭●‬ ‭Explanation:‬
○ Uses statistical methods to analyze how often words occur and
how they are distributed within and across documents, in order
to decide which terms best represent each document.
‭●‬ ‭Example:‬
○ If the word "climate" appears frequently in a document,
statistical indexing identifies it as a key term. It might
also consider which other terms, such as "change" or
"policy," appear near "climate."
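
The counting idea in the example above can be sketched in a few lines of Python. This is only an illustration, not a full indexer: the sample sentence, the function names `term_frequencies` and `cooccurring_terms`, and the two-word context window are all assumptions made for this sketch.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_frequencies(text):
    """Count how often each term occurs in the text."""
    return Counter(tokenize(text))


def cooccurring_terms(text, target, window=2):
    """Count terms found within `window` positions of each `target` occurrence."""
    tokens = tokenize(text)
    neighbours = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                neighbours[tokens[j]] += 1
    return neighbours


# Hypothetical sample document, invented for this sketch.
doc = "Climate change drives climate policy. Climate policy shapes energy use."
tf = term_frequencies(doc)
context = cooccurring_terms(doc, "climate")

print(tf.most_common(3))
print(context.most_common(3))
```

Here "climate" is the most frequent term, and "policy" and "change" show up among its neighbours, which matches the intuition in the example.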

‭1. Probabilistic Weighting‬

‭●‬ ‭Explanation:‬
‭○‬ ‭Assigns weights to terms based on their probability of‬
‭relevance to a search query. The idea is that some‬
‭terms are more likely to be relevant than others.‬
‭●‬ ‭Example:‬
○ In a search for "renewable energy," a term such as "solar"
might receive a higher probability weight than "energy"
because it is more specific: it occurs in fewer documents
overall, and those documents are more likely to be relevant.
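
One common way to turn this idea into a number is the Robertson/Sparck Jones relevance weight, sketched below. The notes do not name a specific formula, so treating this one as the weighting scheme is an assumption, and all document counts are hypothetical values chosen only to mirror the "solar" versus "energy" example.

```python
import math


def rsj_weight(N, R, n, r):
    """Robertson/Sparck Jones relevance weight with 0.5 smoothing.

    N: total documents in the collection
    R: documents known to be relevant to the query
    n: documents containing the term
    r: relevant documents containing the term
    """
    odds_in_relevant = (r + 0.5) / (R - r + 0.5)
    odds_in_non_relevant = (n - r + 0.5) / (N - n - R + r + 0.5)
    return math.log(odds_in_relevant / odds_in_non_relevant)


# Hypothetical counts: 1000 documents, 10 judged relevant to "renewable energy".
N, R = 1000, 10
solar_weight = rsj_weight(N, R, n=20, r=8)     # rare term, mostly in relevant docs
energy_weight = rsj_weight(N, R, n=400, r=9)   # common term, found everywhere

print(f"solar:  {solar_weight:.2f}")
print(f"energy: {energy_weight:.2f}")
```

With these made-up counts "solar" ends up with the larger weight, because it is concentrated in the relevant documents while "energy" is spread across the whole collection.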

2. Vector Weighting

‭●‬ ‭Explanation:‬
○ Represents documents and queries as vectors in a
multi-dimensional space. Relevance is calculated by
measuring the angle (for example, with cosine similarity)
or the distance between the document and query vectors.
‭●‬ ‭Example:‬
○ Think of each distinct term in the collection as a
dimension. The query "machine learning" and every
document are each turned into a vector of term weights.
The document whose vector is closest to the query vector
is considered most relevant.
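
The ranking described above might look like the following minimal sketch. It assumes raw term counts as the vector weights and cosine similarity as the closeness measure; the three sample documents and the helper names `to_vector` and `cosine_similarity` are invented for illustration.

```python
import math
from collections import Counter


def to_vector(text):
    """Represent text as a sparse vector of raw term counts."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector matches nothing
    return dot / (norm_a * norm_b)


# Hypothetical mini-collection, invented for this sketch.
docs = {
    "d1": "machine learning models learn from data",
    "d2": "the washing machine is broken",
    "d3": "cooking recipes for dinner",
}

query = to_vector("machine learning")
scores = {name: cosine_similarity(query, to_vector(text)) for name, text in docs.items()}
ranked = sorted(scores, key=scores.get, reverse=True)

print(ranked)
```

The document sharing both query terms ranks first, the one sharing only "machine" ranks second, and the unrelated one scores zero. Practical systems usually replace the raw counts with weights such as TF-IDF before comparing vectors.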
