WO2006047407A3 - Method of indexing categories for efficient searching and ranking - Google Patents

Method of indexing categories for efficient searching and ranking

Publication number
WO2006047407A3
Authority
WO
WIPO (PCT)
Prior art keywords
categories
similarity distance
indexing
ranking
Prior art date
Application number
PCT/US2005/038167
Other languages
French (fr)
Other versions
WO2006047407A2 (en)
Inventor
Eric Leu
Original Assignee
Yahoo Inc
Eric Leu
Priority date
2004-10-26
Filing date
2005-10-20
Publication date
2007-06-21
Application filed by Yahoo Inc, Eric Leu
Publication of WO2006047407A2
Publication of WO2006047407A3

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00: Commerce
    • G06Q30/02: Marketing; Price estimation or determination; Fundraising

Abstract

A document management system indexes categories for efficient retrieval based on a category sequence. The category sequence is generated such that for any first, second and third categories appearing in the sequence in ascending order, a similarity distance between the first and second categories is less than or equal to a similarity distance between the first and third categories, and a similarity distance between the second and third categories is also less than or equal to the similarity distance between the first and third categories. A category index implemented in this manner significantly reduces the number of similarity distance computations that are performed when searching for categories and documents that are most relevant to content presented on a web page.
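The ordering property in the abstract can be illustrated with a short sketch. This is not the patent's implementation: the category names, the 1-D coordinates, the `distance` function, and both helper functions are hypothetical, chosen only because points sorted on a line satisfy the stated inequalities. The first helper checks the property for every ordered triple; the second shows one plausible way such a sequence could reduce distance computations, by walking outward from a known position and comparing only the two frontier candidates.

```python
from itertools import combinations


def satisfies_sequence_property(seq, dist):
    """Check the abstract's condition for every triple (a, b, c) in sequence
    order: dist(a, b) <= dist(a, c) and dist(b, c) <= dist(a, c)."""
    for a, b, c in combinations(seq, 3):  # combinations keeps input order
        if dist(a, b) > dist(a, c) or dist(b, c) > dist(a, c):
            return False
    return True


def nearest_in_sequence(seq, start, k, dist):
    """Return up to k categories closest to seq[start], nearest first.

    If the sequence property holds, distances from seq[start] never
    decrease when moving away from it in either direction, so only the
    immediate left and right candidates need to be compared at each step.
    """
    anchor = seq[start]
    left, right = start - 1, start + 1
    result = []
    while len(result) < k and (left >= 0 or right < len(seq)):
        d_left = dist(anchor, seq[left]) if left >= 0 else float("inf")
        d_right = dist(anchor, seq[right]) if right < len(seq) else float("inf")
        if d_left <= d_right:
            result.append(seq[left])
            left -= 1
        else:
            result.append(seq[right])
            right += 1
    return result


# Hypothetical toy data: each category is placed at a 1-D coordinate.
positions = {
    "autos": 0.0,
    "trucks": 1.0,
    "motorcycles": 2.5,
    "bicycles": 3.5,
    "hiking": 7.0,
}


def distance(a, b):
    """Assumed similarity distance: absolute difference of coordinates."""
    return abs(positions[a] - positions[b])


# Sorting by coordinate yields a sequence that satisfies the property.
sequence = sorted(positions, key=positions.get)

if __name__ == "__main__":
    print(satisfies_sequence_property(sequence, distance))
    print(nearest_in_sequence(sequence, 2, 2, distance))
```

In this sketch the walk evaluates at most two distances per returned category, rather than one distance per category in the whole index. How the patent itself builds the sequence, and how it handles similarity measures that do not embed on a line, is not stated in the abstract.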

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US97351404A 2004-10-26 2004-10-26
US10/973,514 2004-10-26

Publications (2)

Publication Number Publication Date
WO2006047407A2 (en) 2006-05-04
WO2006047407A3 (en) 2007-06-21

Family

ID=36228324

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/038167 WO2006047407A2 (en) 2004-10-26 2005-10-20 Method of indexing categories for efficient searching and ranking

Country Status (1)

Country Link
WO (1) WO2006047407A2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9922303B2 (en) * 2012-08-30 2018-03-20 Oracle International Corporation Method and system for implementing product group mappings
US10223697B2 (en) 2012-08-30 2019-03-05 Oracle International Corporation Method and system for implementing a CRM quote and order capture context service
US9953353B2 (en) 2012-08-30 2018-04-24 Oracle International Corporation Method and system for implementing an architecture for a sales catalog

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5819258A (en) * 1997-03-07 1998-10-06 Digital Equipment Corporation Method and apparatus for automatically generating hierarchical categories from large document collections
US6122628A (en) * 1997-10-31 2000-09-19 International Business Machines Corporation Multidimensional data clustering and dimension reduction for indexing and searching
US6167397A (en) * 1997-09-23 2000-12-26 At&T Corporation Method of clustering electronic documents in response to a search query
US6185550B1 (en) * 1997-06-13 2001-02-06 Sun Microsystems, Inc. Method and apparatus for classifying documents within a class hierarchy creating term vector, term file and relevance ranking
US20020078044A1 (en) * 2000-12-19 2002-06-20 Jong-Cheol Song System for automatically classifying documents by category learning using a genetic algorithm and a term cluster and method thereof
US20030233350A1 (en) * 2002-06-12 2003-12-18 Zycus Infotech Pvt. Ltd. System and method for electronic catalog classification using a hybrid of rule based and statistical method
US20040267725A1 (en) * 2003-06-30 2004-12-30 Harik Georges R Serving advertisements using a search of advertiser Web information
US20050267872A1 (en) * 2004-06-01 2005-12-01 Yaron Galai System and method for automated mapping of items to documents
US7028024B1 (en) * 2001-07-20 2006-04-11 Vignette Corporation Information retrieval from a collection of information objects tagged with hierarchical keywords
US7124093B1 (en) * 1997-12-22 2006-10-17 Ricoh Company, Ltd. Method, system and computer code for content based web advertising

Also Published As

Publication number Publication date
WO2006047407A2 (en) 2006-05-04

Similar Documents

Publication Publication Date Title
WO2006041950A3 (en) Classification-expanded indexing and retrieval of classified documents
WO2003079234A3 (en) Knowledge management using text classification
DE602005026609D1 (en) Identification of expressions in an information retrieval system
WO2006081325A3 (en) Multiple index based information retrieval system
ATE521947T1 (en) PHRASE BASED INDEXING IN AN INFORMATION REQUEST SYSTEM
WO2006028953A3 (en) Query-based document composition
CA2677307A1 (en) Searching structured geographical data
SG142158A1 (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
NO20053640L (en) Phrase-based browsing in an information retrieval system
EP1524610A3 (en) Systems and methods for performing electronic information retrieval
WO2006113597A3 (en) Method for information retrieval
WO2001082114A3 (en) System for fulfilling an information need
ATE529811T1 (en) PHRASE-BASED GENERATION OF DOCUMENT DESCRIPTIONS
WO2000067159A3 (en) System and method for searching and recommending documents in a collection using shared bookmarks
EP1043665A3 (en) Methods and apparatus for retrieving audio information using content and speaker information
WO2006133252A3 (en) Doubly ranked information retrieval and area search
WO2008031062A3 (en) System and method for building and retrieving a full text index
WO2007087379A3 (en) Data access using multilevel selectors and contextual assistance
Yerra et al. A sentence-based copy detection approach for web documents
WO2004114149A3 (en) Annotating a digital object
WO2005081809A3 (en) Document conversion and integration system
CN102402561A (en) Searching method and device
CN110970112B (en) Knowledge graph construction method and system for nutrition and health
WO2006047407A3 (en) Method of indexing categories for efficient searching and ranking
TWI266213B (en) Sequence based indexing and retrieval method for text documents

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV LY MD MG MK MN MW MX MZ NA NG NO NZ OM PG PH PL PT RO RU SC SD SG SK SL SM SY TJ TM TN TR TT TZ UG US UZ VC VN YU ZA ZM

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SZ TZ UG ZM ZW AM AZ BY KG MD RU TJ TM AT BE BG CH CY DE DK EE ES FI FR GB GR HU IE IS IT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 05812508

Country of ref document: EP

Kind code of ref document: A2