What is Query Breadth?
Query breadth is a measure of how general or narrow a search query is. Broad queries typically:
- Return a high number of results.
- Often consist of fewer search terms or are used very frequently.
Key Concepts
- Related Document Count
  What it means: The number of documents matching a query.
  High breadth: Many results. Example: "shoes" (broad term).
  Low breadth: Few results. Example: "red leather boots size 8" (highly specific).
- Adjacent Words
  What it means: Phrases or word pairs that appear together.
  Example: "New York" vs. "York New" (incorrect order).
  Query: "best Italian restaurant" → Narrower breadth than "best restaurant."
- Unknown Entities
  What it means: Terms that aren't recognized by the system (e.g., slang, misspellings).
  Example: Query: "How to fix a zommbie phone?"
  "Zommbie" is flagged as an unknown entity.
- Adding more words = Narrower query.
  Example:
  Broad: "apple" → Could mean the fruit, company, or movie.
  Narrow: "Apple iPhone 15 release date" → Specific intent.
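The concepts above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the toy `CORPUS`, the `KNOWN_TERMS` vocabulary, and every function name are hypothetical, and a real system would query a search index rather than scan a list of strings.

```python
# Hypothetical toy corpus standing in for a search index.
CORPUS = [
    "red leather boots size 8 for women",
    "running shoes for men",
    "leather shoes on sale",
    "kids shoes and boots",
    "apple iphone 15 release date",
]

# Hypothetical vocabulary: every word that occurs in the corpus.
KNOWN_TERMS = {word for doc in CORPUS for word in doc.split()}


def matching_document_count(query, corpus=CORPUS):
    """Related document count: documents containing every query term."""
    terms = query.lower().split()
    return sum(all(term in doc.split() for term in terms) for doc in corpus)


def phrase_document_count(phrase, corpus=CORPUS):
    """Adjacent words: documents containing the terms side by side, in order."""
    padded = f" {phrase.lower()} "
    return sum(padded in f" {doc} " for doc in corpus)


def unknown_entities(query, known_terms=KNOWN_TERMS):
    """Unknown entities: query terms missing from the vocabulary."""
    return [term for term in query.lower().split() if term not in known_terms]


def breadth_score(query, corpus=CORPUS):
    """Assumed breadth measure: fraction of the corpus the query matches."""
    return matching_document_count(query, corpus) / len(corpus)


if __name__ == "__main__":
    for q in ["shoes", "leather shoes", "red leather boots size 8"]:
        print(q, matching_document_count(q), breadth_score(q))
    print(unknown_entities("red zommbie boots"))
```

In this toy setup, adding terms to a query can only keep or shrink the set of matching documents, which mirrors the "more words = narrower query" rule.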
Applications
- Named Entity Recognition (NER)
  What it does: Identifies specific names.
  Example: "Elon Musk" → Recognized as a named entity.
- Triple Creation
  What it does: Builds structured data in a subject-predicate-object format.
  Example:
  [Subject: Paris] → [Predicate: capital of] → [Object: France]
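The two applications above can be illustrated with a minimal sketch. It assumes a hypothetical lookup table (`KNOWN_ENTITIES`) in place of a trained NER model and a single hand-written pattern (`CAPITAL_PATTERN`) in place of a real relation extractor; none of these names come from the source.

```python
import re
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    """A subject-predicate-object record."""
    subject: str
    predicate: str
    obj: str


# Hypothetical gazetteer; a production system would use a trained NER model.
KNOWN_ENTITIES = {
    "elon musk": "PERSON",
    "paris": "LOCATION",
    "france": "LOCATION",
}

# Hypothetical pattern covering only sentences like "X is the capital of Y".
CAPITAL_PATTERN = re.compile(
    r"^(?P<subject>.+?) is the (?P<predicate>capital of) (?P<obj>.+?)\.?$"
)


def find_entities(text):
    """Return (name, label) pairs for known entities found as whole words."""
    lowered = text.lower()
    found = []
    for name, label in KNOWN_ENTITIES.items():
        if re.search(rf"\b{re.escape(name)}\b", lowered):
            found.append((name, label))
    return found


def extract_triple(sentence: str) -> Optional[Triple]:
    """Build a Triple from a matching sentence, or return None."""
    match = CAPITAL_PATTERN.match(sentence.strip())
    if match is None:
        return None
    return Triple(match["subject"], match["predicate"], match["obj"])


if __name__ == "__main__":
    print(find_entities("Elon Musk news"))
    print(extract_triple("Paris is the capital of France."))
```

A sentence that does not fit the pattern returns `None`, so callers can skip it rather than store a malformed triple.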
Process Flow
- Step 1: Retrieve Query/Clicked Result Pair (402)
  What happens? Tracks which results users click for a query.
  Example:
  Query: "healthy recipes"
  User clicks: A blog about salads.
- Step 2: Determine Popularity Measure (404)
  What happens? Calculates how frequently a result is clicked for that query.
- Step 3: Determine Query Breadth (406)
  What happens? Checks if the query is broad or specific.
  Example:
  Broad: "coffee shops"
  Specific: "Starbucks near me"
- Step 4: De-weight Popularity Measure (408)
  What happens? Reduces the bias of overly popular results for broad queries.
  Example:
  Query: "shoes" has millions of results.
  Fix: Prevent one generic page from dominating.
- Step 5: Store Adjusted Popularity (410)
  What happens? Saves the updated weights to improve future search accuracy.
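The five steps above can be sketched end to end as follows. The source does not give a de-weighting formula, so dividing the click count by `1 + log10(1 + result_count)` is purely an assumption chosen for illustration. The click log, the result counts, the `.example` hostnames, and all function names are likewise hypothetical.

```python
import math
from collections import Counter

# (402) Hypothetical click log of (query, clicked result) pairs.
CLICK_LOG = [
    ("shoes", "generic-shoe-store.example"),
    ("shoes", "generic-shoe-store.example"),
    ("shoes", "generic-shoe-store.example"),
    ("shoes", "running-shoes.example"),
    ("red leather boots size 8", "boots.example"),
    ("red leather boots size 8", "boots.example"),
]

# Hypothetical matching-document counts, used here as the breadth signal.
RESULT_COUNTS = {"shoes": 1_000_000, "red leather boots size 8": 120}


def popularity(click_log):
    """(404) Count how often each result was clicked for each query."""
    return Counter(click_log)


def query_breadth(query, result_counts):
    """(406) Assumed breadth measure: log-scaled result count."""
    return math.log10(1 + result_counts.get(query, 0))


def adjusted_popularity(click_log, result_counts):
    """(408, 410) De-weight clicks by breadth and return the stored table."""
    store = {}
    for (query, result), clicks in popularity(click_log).items():
        # Assumed formula: broader queries get a larger divisor.
        breadth = query_breadth(query, result_counts)
        store[(query, result)] = clicks / (1 + breadth)
    return store


if __name__ == "__main__":
    for pair, score in adjusted_popularity(CLICK_LOG, RESULT_COUNTS).items():
        print(pair, round(score, 3))
```

With these made-up numbers, the generic page clicked three times for "shoes" ends up with a lower adjusted score than the page clicked twice for the specific boots query, which is the effect step 4 describes.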