Mid Page Query Refinements

๐Ÿ”Ž Search engines try to provide the most promising results in response to queries, but they can limit what they can return based on the queries used.

โš ๏ธ Some search queries can be too ambiguous, too general, or too specific to provide good results.

๐Ÿ“ Examples:

๐Ÿ’ก This query refinement patent application is an attempt to provide suggestions to address these problems and better match the searcher's intent. โœ…

An example of how these Query Refinements work:

๐Ÿ”Ž A searcher may try to find information on Google by entering the word โ€œjaguarโ€ into the search box and hitting enter.

๐Ÿ“„ The results (webpages containing information) might fit into several meaningful groups:

๐Ÿ” How Google Understands and Refines Searches

๐Ÿ—๏ธ Step 1: Finding Initial Results

Google first finds many search results related to "jaguar." Then, it picks the top 100 most relevant results (this number can change).

๐Ÿ”„ Step 2: Grouping the Results

Google organizes these 100 results into clusters (groups of similar documents).

Example:

Each document in these groups is matched with stored information from past searches to find related search terms.

๐Ÿ“‚ What Does "Stored Information from Past Searches" Mean?

Google remembers past searches ๐Ÿ“š and stores information about how people search and what they click on. This helps Google improve future searches!

Letโ€™s say millions of people have searched for "Jaguar" before. Google has already collected data on:

๐Ÿ’ก Example:

Now, when you search "jaguar," Google uses this stored knowledge to help suggest better searches!

๐Ÿ”„ How Does Google Match Documents with Past Search Data?

When Google finds 100 top results for "jaguar," it compares them to information from past searches (association database).

  1. Step 1: Google finds which documents (webpages) are most relevant ๐Ÿ“„
    Is it about cars? ๐Ÿš—
    Is it about the animal? ๐Ÿ†
    Is it about the Mac OS? ๐Ÿ’ป
  2. Step 2: Google checks past user searches to see how similar documents were searched before.
    If many past users searched "Jaguar car" and clicked car-related results ๐Ÿš—, then Google knows that a Jaguar car cluster exists.
    If other users searched "Jaguar animal" and clicked animal-related results ๐Ÿ†, then Google knows a Jaguar animal cluster exists.
  3. Step 3: Google finds common keywords in those groups ๐Ÿท๏ธ
    In the car cluster, the words "car," "automobile," and "vehicle" show up a lot. ๐Ÿš—
    In the animal cluster, the words "wild," "cat," and "species" appear frequently. ๐Ÿ†

๐Ÿ’ก Real-Life Example:
Imagine Netflix ๐Ÿฟ. If you watch a lot of action movies, Netflix learns from your past choices and suggests more action movies ๐ŸŽฌ.
Similarly, Google learns from past searches and suggests better search terms based on what others have looked for before!

๐Ÿ“Š Step 3: Finding Important Words

For each group, Google finds important words (called term vectors ๐Ÿ“Œ) that appear most often in the documents.

Example for Jaguar cars ๐Ÿš—:
jaguar
automobile
car
USA
UK

Example for Mac OS X Jaguar ๐Ÿ’ป:
jaguar
X
Mac
OS

๐ŸŽฏ Step 4: Suggesting Better Search Queries

Now, Google suggests better searches (called query refinements ๐Ÿ› ๏ธ) to help users find what they really want.

Examples:

๐Ÿ“Š Step 5: Ranking the Suggestions

Google ranks these suggested queries based on:

So, a popular topic like "jaguar car" might appear higher in the suggestions than "jaguar cat" ๐Ÿ†.

โž– Bonus: Excluding Irrelevant Results
Google can also suggest excluding certain meanings using a minus sign (-).
Example: If you only want results about the animal Jaguar ๐Ÿ†, you can search:
๐Ÿ” "jaguar -car -mac-os-x -racing"

๐Ÿ”ฎ Step 6: Learning from Past Searches

Google also remembers past searches and prepares suggestions in advance based on what people have searched before.

Example: If many people searched for "Jaguar car" after searching "Jaguar," Google learns that most people looking for "Jaguar" actually mean the car. ๐Ÿš—

๐ŸŒ What is a Cluster Centroid?

A cluster centroid is like the "center" or "average" of a group (cluster) of documents that are all related to the same topic. Itโ€™s the ideal or most typical point that represents all the documents in that group.

๐Ÿ” Imagine you have a group of people, and you want to find the average height of everyone in the group. You would measure the height of each person and find the "middle" height that represents the whole group. That middle height is the centroid of the group.

In Googleโ€™s case, a cluster of documents (like a group about "Jaguar cars" ๐Ÿš—) will have a centroidโ€”the most common, central ideas of all the documents in that group.

๐Ÿง  How Does the Cluster Centroid Work in Googleโ€™s Search Process?

When Google groups documents, it uses important words (term vectors ๐Ÿ“Œ) to define the center of each cluster. Then, it calculates the centroid to find the most relevant and central words for that cluster.

Letโ€™s break it down using an example:

๐Ÿš— Example: Cluster Centroid for "Jaguar Cars"
Imagine Google has a cluster of documents about Jaguar cars ๐Ÿš—. These documents might include words like:

The cluster centroid would focus on the central or most important words in this group. It might be something like this:
Jaguar
Car
Automobile
Engine

What Are Associations? ๐Ÿค

Stored Query (๐Ÿ”Ž):
This is a search term or phrase that has been saved in the system.
Example: "Best pizza recipes".

Stored Document (๐Ÿ“„):
This is a document, webpage, or piece of content that is saved and might be relevant to a stored query.
Example: An article titled "10 Amazing Pizza Recipes".

Association (๐Ÿ”—):
An association is the link between the stored query and the stored document. It indicates that the document is relevant to the query.
Example: The system recognizes that the article "10 Amazing Pizza Recipes" is related to the query "Best pizza recipes".

๐Ÿš€ What the Precomputation System Does

  1. Builds a Database ๐Ÿ—‚๏ธ
    Stores queries, documents, associations, and weights in an association database ๐Ÿ“ฆ
  2. Creates Associations ๐Ÿ”—
    Matches stored queries with stored documents
    Assigns a weight (importance score) to each pair
  3. Uses Cached Data & Logs ๐Ÿ“
    References query logs, cached queries, and cached documents to improve accuracy
  4. Four Key Modules ๐Ÿ—๏ธ
    Associator ๐Ÿค โ†’ Connects queries to documents & assigns weights
    Selector ๐ŸŽฏ โ†’ Picks documents based on search results
    Regenerator ๐Ÿ”„ โ†’ Reuses past queries to refine searches
    Inverter ๐Ÿ”ƒ โ†’ Swaps stored document-query pairings for efficiency

What It Does:

Selector: Choosing Stored Documents Based on Issued Search ๐Ÿ”

What It Does: โœ…

Selects Stored Documents: ๐Ÿ“‚
When a search is issued, the selector picks one or more stored documents for the associated stored query.
๐Ÿ” Example: After you search for "Easy vegan desserts," the selector might pick a few stored documents like blog posts, recipes, or articles that match this query.

Uses Two Methods: ๐Ÿ› ๏ธ

3. Regenerator: Utilizing the Query Log ๐Ÿ”„

๐Ÿ”„ What It Does

4. Inverter: Working with Cached Data ๐Ÿ”„๐Ÿ’พ

๐Ÿ”„ What It Does

Step 1: Pre-Processing (Before You Even Search) ๐Ÿ”โš™๏ธ

Before you even type a query (search term), the system does some prep work using a precomputation engine (a system that prepares data in advance). This engine has four key parts:

Step 2: Query Refinement (Improving Search Suggestions) ๐Ÿ› ๏ธ

Once a user enters a search query, the query refinement system works to improve it. This system also has four parts:

Step 2: Query Refinement (Improving Search Suggestions) ๐Ÿ› ๏ธ

3. The Scorer analyzes each cluster and calculates a centroid (the "center" of a cluster, representing the most important words).
๐Ÿ”น Search queries are then scored based on:

โœ… Example: If the โ€œbest affordable hiking bootsโ€ cluster has a lot of web pages and closely matches other user searches, it gets a higher score than a less relevant query.

Step 4: Presenter ๐ŸŽค

A presenter takes the highest scoring search queries and shows them to you as suggestions for refining your original search. The system keeps track of how the refinements were created but only shows the final suggestions.

Example: If you search โ€œcoffee brewing,โ€ the system might suggest:

2017 Updates to Patent:

More Topics