Multi-Stage Query Processing
This patent outlines a method for improving search engines by analyzing and refining user queries in multiple stages.
Key Components of the Process
- Context of Words: Understanding how the words in a query relate to each other.
  Example:
  - Query: "Apple phone price"
  - Context: The system recognizes that "Apple" refers to the brand, not the fruit, because of the word "phone."
- Stop Words Removal: Removing common words such as "and," "the," or "is" that don't affect search results.
  Example:
  - Original query: "What is the best restaurant near me?"
  - After removal: "best restaurant near me"
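- Stemming Concrete Words: Reducing words to their root form.
  Example:
  - "running" → "run", "better" → "good" (strictly, the second mapping is lemmatization, since suffix stripping alone cannot turn "better" into "good")
  - Query: "How to improve running speed?" → "improve run speed" (with stop words also removed)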
- Expanding with Synonyms & Co-occurrence:
  - Synonyms: Adding words with the same meaning.
    Example: "Car" → "vehicle," "automobile."
  - Co-occurrence: Including words that frequently appear together.
    Example: "Coffee" → "espresso," "caffeine," "brew."
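The stop-word removal, stemming, and expansion stages listed above can be sketched as a small pipeline. This is only an illustration under stated assumptions: the word lists, the function names (`process_query`, `stem`, `expand`), and the crude suffix-stripping rule are hypothetical and are not taken from the patent. A real system would use a proper stemmer and a much larger vocabulary.

```python
import re

# Illustrative data only: these tiny tables are assumptions, not from the patent.
STOP_WORDS = {"what", "is", "the", "how", "to", "and", "a", "of"}
SYNONYMS = {"car": ["vehicle", "automobile"]}
CO_OCCURRING = {"coffee": ["espresso", "caffeine", "brew"]}


def tokenize(query):
    """Lowercase the query and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", query.lower())


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def stem(token):
    """Very naive suffix stripping; it only handles the "-ing" ending."""
    if token.endswith("ing") and len(token) > 5:
        base = token[:-3]
        # Undo consonant doubling: "runn" -> "run".
        if len(base) >= 2 and base[-1] == base[-2]:
            base = base[:-1]
        return base
    return token


def expand(tokens):
    """Append synonyms and co-occurring words, skipping duplicates."""
    expanded = list(tokens)
    for t in tokens:
        for extra in SYNONYMS.get(t, []) + CO_OCCURRING.get(t, []):
            if extra not in expanded:
                expanded.append(extra)
    return expanded


def process_query(query):
    """Run the stages in order: tokenize, remove stop words, stem, expand."""
    tokens = remove_stop_words(tokenize(query))
    tokens = [stem(t) for t in tokens]
    return expand(tokens)


print(process_query("How to improve running speed?"))
print(process_query("coffee"))
```

Running the stages in a fixed order keeps each one simple to test in isolation. The order chosen here (expansion last) is a design choice for the sketch, not something the summary specifies.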
Criteria Used in Processing
- Absent Queries: Handling queries whose key terms are absent from a document, so that relevant results are still returned.
  Example: "Affordable Paris flights" might still show results for "cheap flights to France."
- Boolean Logic: Using operators such as AND, OR, and NOT to refine searches.
- Term Weights: Prioritizing certain words.
  Example: "urgent plumber near me" → "urgent" gets higher priority.
- Document Popularity: Ranking results based on how often a page is referenced.
- Word Proximity: Checking how close the keywords are to each other in a document.
  Example: "climate change effects" vs. "effects of climate change."
- Word Adjacency: Checking that keywords appear next to each other, in the same order as in the query (an exact phrase match).
  Example: "black dress" vs. "dress black."
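Several of the criteria above (Boolean logic, term weights, proximity, and adjacency) can be expressed as small functions over a tokenized document. The sketch below is a rough illustration, not the patent's scoring method: the function names, the weight values, and the "shortest window" definition of proximity are all assumptions made for the example.

```python
def tokenize(text):
    """Lowercase and split on whitespace (punctuation handling is omitted)."""
    return text.lower().split()


def matches_boolean(tokens, required=(), excluded=()):
    """AND over `required` terms, NOT over `excluded` terms."""
    present = set(tokens)
    return (all(t in present for t in required)
            and not any(t in present for t in excluded))


def weighted_score(tokens, weights):
    """Sum the weight of every query term found in the document.

    A document missing some terms still gets a partial score.
    """
    present = set(tokens)
    return sum(w for term, w in weights.items() if term in present)


def has_adjacent_phrase(tokens, phrase):
    """True if the phrase words occur consecutively and in order."""
    n = len(phrase)
    return any(tokens[i:i + n] == list(phrase)
               for i in range(len(tokens) - n + 1))


def min_span(tokens, terms):
    """Length of the shortest token window containing every term, or None."""
    terms = set(terms)
    best = None
    for start in range(len(tokens)):
        if tokens[start] not in terms:
            continue
        seen = set()
        for end in range(start, len(tokens)):
            if tokens[end] in terms:
                seen.add(tokens[end])
            if seen == terms:
                span = end - start + 1
                if best is None or span < best:
                    best = span
                break
    return best


query = ["climate", "change", "effects"]
print(min_span(tokenize("climate change effects"), query))
print(min_span(tokenize("effects of climate change"), query))
print(has_adjacent_phrase(tokenize("dress black"), ["black", "dress"]))
```

A smaller span from `min_span` means the keywords sit closer together. How the individual signals would be combined into one ranking score is left open here, since the summary does not say.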
Technologies Used
- VIPS (Vision-based Page Segmentation): Analyzes a webpage's visual layout (headers, footers, images) to prioritize content.
  Example: Extracting the main article text from a cluttered blog page.
- Web Page Layout: Uses HTML structure to understand content hierarchy.
  Example: Recognizing titles, subtitles, and body text to rank relevant sections.
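The HTML-structure idea can be sketched with Python's standard `html.parser` module. Note that this is not VIPS: VIPS works from the rendered visual layout, while this example only looks at tag names. The tag weights, the list of skipped tags, and the names `LayoutParser` and `section_scores` are hypothetical choices made for illustration.

```python
from html.parser import HTMLParser

# Hypothetical weights: title and heading text counts more than body text.
TAG_WEIGHTS = {"title": 3.0, "h1": 2.5, "h2": 2.0, "p": 1.0}
# Regions treated as page clutter in this sketch.
SKIPPED_TAGS = {"header", "footer", "nav", "script", "style"}
# Tags without a closing tag must not be pushed onto the open-tag stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class LayoutParser(HTMLParser):
    """Collects (tag, text) pairs for weighted tags outside skipped regions."""

    def __init__(self):
        super().__init__()
        self.sections = []  # (tag, text) pairs in document order
        self._stack = []    # currently open tags

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self._stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating sloppy markup.
        if tag in self._stack:
            while self._stack and self._stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if not text or not self._stack:
            return
        if any(t in SKIPPED_TAGS for t in self._stack):
            return
        tag = self._stack[-1]
        if tag in TAG_WEIGHTS:
            self.sections.append((tag, text))


def section_scores(html, query_terms):
    """Score each section as (query-term hits) x (weight of its tag)."""
    parser = LayoutParser()
    parser.feed(html)
    terms = {t.lower() for t in query_terms}
    scored = []
    for tag, text in parser.sections:
        hits = sum(1 for w in text.lower().split() if w.strip(".,!?") in terms)
        scored.append((tag, text, hits * TAG_WEIGHTS[tag]))
    return scored


page = ("<html><head><title>Climate change</title></head><body>"
        "<nav>Home</nav><h1>Effects of climate change</h1>"
        "<p>Sea levels rise.</p><footer>Contact</footer></body></html>")
for tag, text, score in section_scores(page, ["climate", "change"]):
    print(tag, text, score)
```

Text inside `nav` and `footer` is ignored, so only the title, heading, and paragraph are scored.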