WO2002057961A3 - Method and system of ranking and clustering for document indexing and retrieval - Google Patents

Method and system of ranking and clustering for document indexing and retrieval Download PDF

Info

Publication number
WO2002057961A3
WO2002057961A3 PCT/US2002/000402 US0200402W WO02057961A3 WO 2002057961 A3 WO2002057961 A3 WO 2002057961A3 US 0200402 W US0200402 W US 0200402W WO 02057961 A3 WO02057961 A3 WO 02057961A3
Authority
WO
WIPO (PCT)
Prior art keywords
document
query
predicate
structures
relevance
Prior art date
Application number
PCT/US2002/000402
Other languages
French (fr)
Other versions
WO2002057961A2 (en
Inventor
Maureen Caudill
Jason Chun-Ming Tseng
Lei Wang
Original Assignee
Science Applic Int Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Science Applic Int Corp filed Critical Science Applic Int Corp
Priority to AU2002235314A priority Critical patent/AU2002235314A1/en
Publication of WO2002057961A2 publication Critical patent/WO2002057961A2/en
Publication of WO2002057961A3 publication Critical patent/WO2002057961A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access

Abstract

A relevancy ranking and clustering method and system that determines the relevance of a document relative to a user's query using a similarity comparison process. Input queries are parsed into one or more query predicate structures using an ontological parser. The ontological parser parses a set of known documents to generate one or more document predicate structures. A comparison of each query predicate structure with each document predicate structure is performed to determine a matching degree, represented by a real number. A multilevel modifier strategy is implemented to assign different relevance values to the different parts of each predicate structure match to calculate the predicate structure's matching degree. The relevance of a document to a user's query is determined by calculating a similarity coefficient, based on the structures of each pair of query predicates and document predicates. Documents are autonomously clustered using a self-organizing neural network that provides a coordinate system that makes judgments in a non-subjective fashion.
PCT/US2002/000402 2001-01-18 2002-01-10 Method and system of ranking and clustering for document indexing and retrieval WO2002057961A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2002235314A AU2002235314A1 (en) 2001-01-18 2002-01-10 Method and system of ranking and clustering for document indexing and retrieval

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/761,188 2001-01-18
US09/761,188 US6766316B2 (en) 2001-01-18 2001-01-18 Method and system of ranking and clustering for document indexing and retrieval

Publications (2)

Publication Number Publication Date
WO2002057961A2 WO2002057961A2 (en) 2002-07-25
WO2002057961A3 true WO2002057961A3 (en) 2003-10-09

Family

ID=25061435

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/000402 WO2002057961A2 (en) 2001-01-18 2002-01-10 Method and system of ranking and clustering for document indexing and retrieval

Country Status (3)

Country Link
US (2) US6766316B2 (en)
AU (1) AU2002235314A1 (en)
WO (1) WO2002057961A2 (en)

Families Citing this family (325)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1049030A1 (en) * 1999-04-28 2000-11-02 SER Systeme AG Produkte und Anwendungen der Datenverarbeitung Classification method and apparatus
US6883135B1 (en) 2000-01-28 2005-04-19 Microsoft Corporation Proxy server using a statistical model
DE60005293T2 (en) * 2000-02-23 2004-07-01 Ser Solutions Inc. Method and device for processing electronic documents
AU2013205566B2 (en) * 2000-08-18 2016-09-15 Kofax International Switzerland Sàrl Associative memory
EP1182577A1 (en) * 2000-08-18 2002-02-27 SER Systeme AG Produkte und Anwendungen der Datenverarbeitung Associative memory
US9177828B2 (en) 2011-02-10 2015-11-03 Micron Technology, Inc. External gettering method and device
AUPR208000A0 (en) * 2000-12-15 2001-01-11 80-20 Software Pty Limited Method of document searching
US7174453B2 (en) 2000-12-29 2007-02-06 America Online, Inc. Message screening system
US6785676B2 (en) 2001-02-07 2004-08-31 International Business Machines Corporation Customer self service subsystem for response set ordering and annotation
US6778193B2 (en) 2001-02-07 2004-08-17 International Business Machines Corporation Customer self service iconic interface for portal entry and search specification
US6873990B2 (en) 2001-02-07 2005-03-29 International Business Machines Corporation Customer self service subsystem for context cluster discovery and validation
US6643639B2 (en) * 2001-02-07 2003-11-04 International Business Machines Corporation Customer self service subsystem for adaptive indexing of resource solutions and resource lookup
US6853998B2 (en) 2001-02-07 2005-02-08 International Business Machines Corporation Customer self service subsystem for classifying user contexts
US7231381B2 (en) * 2001-03-13 2007-06-12 Microsoft Corporation Media content search engine incorporating text content and user log mining
US6748398B2 (en) 2001-03-30 2004-06-08 Microsoft Corporation Relevance maximizing, iteration minimizing, relevance-feedback, content-based image retrieval (CBIR)
US7043500B2 (en) * 2001-04-25 2006-05-09 Board Of Regents, The University Of Texas Syxtem Subtractive clustering for use in analysis of data
USRE46973E1 (en) 2001-05-07 2018-07-31 Ureveal, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US6970881B1 (en) 2001-05-07 2005-11-29 Intelligenxia, Inc. Concept-based method and system for dynamically analyzing unstructured information
US7627588B1 (en) 2001-05-07 2009-12-01 Ixreveal, Inc. System and method for concept based analysis of unstructured data
US7536413B1 (en) 2001-05-07 2009-05-19 Ixreveal, Inc. Concept-based categorization of unstructured objects
US7194483B1 (en) 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US7269546B2 (en) * 2001-05-09 2007-09-11 International Business Machines Corporation System and method of finding documents related to other documents and of finding related words in response to a query to refine a search
US6735578B2 (en) * 2001-05-10 2004-05-11 Honeywell International Inc. Indexing of knowledge base in multilayer self-organizing maps with hessian and perturbation induced fast learning
US20020169872A1 (en) * 2001-05-14 2002-11-14 Hiroshi Nomiyama Method for arranging information, information processing apparatus, storage media and program tranmission apparatus
US6993532B1 (en) * 2001-05-30 2006-01-31 Microsoft Corporation Auto playlist generator
US7266545B2 (en) * 2001-08-07 2007-09-04 International Business Machines Corporation Methods and apparatus for indexing in a database and for retrieving data from a database in accordance with queries using example sets
US20050216456A1 (en) * 2001-08-10 2005-09-29 T-Systemev, Ltd. Method for entering, recording, distributing and reporting data
ES2375403T3 (en) * 2001-08-27 2012-02-29 BDGB Enterprise Software Sàrl A METHOD FOR THE AUTOMATIC INDEXATION OF DOCUMENTS.
US7363311B2 (en) * 2001-11-16 2008-04-22 Nippon Telegraph And Telephone Corporation Method of, apparatus for, and computer program for mapping contents having meta-information
US7283992B2 (en) * 2001-11-30 2007-10-16 Microsoft Corporation Media agent to suggest contextually related media content
US7333966B2 (en) * 2001-12-21 2008-02-19 Thomson Global Resources Systems, methods, and software for hyperlinking names
US6963869B2 (en) * 2002-01-07 2005-11-08 Hewlett-Packard Development Company, L.P. System and method for search, index, parsing document database including subject document having nested fields associated start and end meta words where each meta word identify location and nesting level
US7356461B1 (en) * 2002-01-14 2008-04-08 Nstein Technologies Inc. Text categorization method and apparatus
JP2003216650A (en) * 2002-01-28 2003-07-31 Sony Corp Graphical user interface for information intermediation system
US6952691B2 (en) 2002-02-01 2005-10-04 International Business Machines Corporation Method and system for searching a multi-lingual database
US8589413B1 (en) 2002-03-01 2013-11-19 Ixreveal, Inc. Concept-based method and system for dynamically analyzing results from search engines
JP4255239B2 (en) * 2002-03-29 2009-04-15 富士通株式会社 Document search method
US7197460B1 (en) * 2002-04-23 2007-03-27 At&T Corp. System for handling frequently asked questions in a natural language dialog service
US7039631B1 (en) * 2002-05-24 2006-05-02 Microsoft Corporation System and method for providing search results with configurable scoring formula
JP2005535007A (en) * 2002-05-28 2005-11-17 ナシプニイ、ウラジミル・ウラジミロビッチ Synthesizing method of self-learning system for knowledge extraction for document retrieval system
JP4005425B2 (en) * 2002-06-28 2007-11-07 富士通株式会社 Search result ranking change processing program, search result ranking change processing program recording medium, and content search processing method
US20040054520A1 (en) * 2002-07-05 2004-03-18 Dehlinger Peter J. Text-searching code, system and method
US7493253B1 (en) * 2002-07-12 2009-02-17 Language And Computing, Inc. Conceptual world representation natural language understanding system and method
US7249312B2 (en) * 2002-09-11 2007-07-24 Intelligent Results Attribute scoring for unstructured content
US20040064447A1 (en) * 2002-09-27 2004-04-01 Simske Steven J. System and method for management of synonymic searching
US7158957B2 (en) * 2002-11-21 2007-01-02 Honeywell International Inc. Supervised self organizing maps with fuzzy error correction
US7197503B2 (en) * 2002-11-26 2007-03-27 Honeywell International Inc. Intelligent retrieval and classification of information from a product manual
GB2395805A (en) * 2002-11-27 2004-06-02 Sony Uk Ltd Information retrieval
US7103591B2 (en) * 2002-12-02 2006-09-05 International Business Machines Corporation Method of describing business and technology information for utilization
JP3974511B2 (en) * 2002-12-19 2007-09-12 インターナショナル・ビジネス・マシーンズ・コーポレーション Computer system for generating data structure for information retrieval, method therefor, computer-executable program for generating data structure for information retrieval, computer-executable program for generating data structure for information retrieval Stored computer-readable storage medium, information retrieval system, and graphical user interface system
US7640336B1 (en) 2002-12-30 2009-12-29 Aol Llc Supervising user interaction with online services
US20040153305A1 (en) * 2003-02-03 2004-08-05 Enescu Mircea Gabriel Method and system for automated matching of text based electronic messages
US7606714B2 (en) * 2003-02-11 2009-10-20 Microsoft Corporation Natural language classification within an automated response system
US20040162724A1 (en) * 2003-02-11 2004-08-19 Jeffrey Hill Management of conversations
JP4238616B2 (en) * 2003-03-28 2009-03-18 株式会社日立製作所 Similar document search method and similar document search device
US7483877B2 (en) * 2003-04-11 2009-01-27 International Business Machines Corporation Dynamic comparison of search systems in a controlled environment
US7668801B1 (en) * 2003-04-21 2010-02-23 At&T Corp. Method and apparatus for optimizing queries under parametric aggregation constraints
US7403939B1 (en) 2003-05-30 2008-07-22 Aol Llc Resolving queries based on automatic determination of requestor geographic location
US7739313B2 (en) * 2003-05-30 2010-06-15 Hewlett-Packard Development Company, L.P. Method and system for finding conjunctive clusters
US7854009B2 (en) * 2003-06-12 2010-12-14 International Business Machines Corporation Method of securing access to IP LANs
GB2403636A (en) * 2003-07-02 2005-01-05 Sony Uk Ltd Information retrieval using an array of nodes
US8090678B1 (en) * 2003-07-23 2012-01-03 Shopping.Com Systems and methods for extracting information from structured documents
US7836010B2 (en) 2003-07-30 2010-11-16 Northwestern University Method and system for assessing relevant properties of work contexts for use by information services
GB0320205D0 (en) * 2003-08-28 2003-10-01 British Telecomm Method and apparatus for storing and retrieving data
US8548995B1 (en) * 2003-09-10 2013-10-01 Google Inc. Ranking of documents based on analysis of related documents
US8014997B2 (en) * 2003-09-20 2011-09-06 International Business Machines Corporation Method of search content enhancement
JP2005115628A (en) * 2003-10-07 2005-04-28 Hewlett-Packard Development Co Lp Document classification apparatus using stereotyped expression, method, program
US7539690B2 (en) * 2003-10-27 2009-05-26 Hewlett-Packard Development Company, L.P. Data mining method and system using regression clustering
US20050149546A1 (en) * 2003-11-03 2005-07-07 Prakash Vipul V. Methods and apparatuses for determining and designating classifications of electronic documents
US7519565B2 (en) * 2003-11-03 2009-04-14 Cloudmark, Inc. Methods and apparatuses for classifying electronic documents
US7240049B2 (en) * 2003-11-12 2007-07-03 Yahoo! Inc. Systems and methods for search query processing using trend analysis
US20050144158A1 (en) * 2003-11-18 2005-06-30 Capper Liesl J. Computer network search engine
US20050154701A1 (en) * 2003-12-01 2005-07-14 Parunak H. Van D. Dynamic information extraction with self-organizing evidence construction
US7376638B2 (en) * 2003-12-24 2008-05-20 International Business Machines Corporation System and method for addressing inefficient query processing
US8375048B1 (en) * 2004-01-20 2013-02-12 Microsoft Corporation Query augmentation
US7499913B2 (en) 2004-01-26 2009-03-03 International Business Machines Corporation Method for handling anchor text
US8296304B2 (en) 2004-01-26 2012-10-23 International Business Machines Corporation Method, system, and program for handling redirects in a search engine
US7424467B2 (en) * 2004-01-26 2008-09-09 International Business Machines Corporation Architecture for an indexer with fixed width sort and variable width sort
US7293005B2 (en) 2004-01-26 2007-11-06 International Business Machines Corporation Pipelined architecture for global analysis and index building
US7584221B2 (en) * 2004-03-18 2009-09-01 Microsoft Corporation Field weighting in text searching
US7840571B2 (en) * 2004-04-29 2010-11-23 Hewlett-Packard Development Company, L.P. System and method for information management using handwritten identifiers
US20050246353A1 (en) * 2004-05-03 2005-11-03 Yoav Ezer Automated transformation of unstructured data
US11409812B1 (en) * 2004-05-10 2022-08-09 Google Llc Method and system for mining image searches to associate images with concepts
BE1016079A6 (en) * 2004-06-17 2006-02-07 Vartec Nv METHOD FOR INDEXING AND RECOVERING DOCUMENTS, COMPUTER PROGRAM THAT IS APPLIED AND INFORMATION CARRIER PROVIDED WITH THE ABOVE COMPUTER PROGRAM.
US7562069B1 (en) 2004-07-01 2009-07-14 Aol Llc Query disambiguation
US20060047656A1 (en) * 2004-09-01 2006-03-02 Dehlinger Peter J Code, system, and method for retrieving text material from a library of documents
US7461064B2 (en) 2004-09-24 2008-12-02 International Buiness Machines Corporation Method for searching documents for ranges of numeric values
US7606793B2 (en) 2004-09-27 2009-10-20 Microsoft Corporation System and method for scoping searches using index keys
US20060074900A1 (en) * 2004-09-30 2006-04-06 Nanavati Amit A Selecting keywords representative of a document
US7761448B2 (en) 2004-09-30 2010-07-20 Microsoft Corporation System and method for ranking search results using click distance
US7739277B2 (en) * 2004-09-30 2010-06-15 Microsoft Corporation System and method for incorporating anchor text into ranking search results
US7827181B2 (en) 2004-09-30 2010-11-02 Microsoft Corporation Click distance determination
US7805446B2 (en) * 2004-10-12 2010-09-28 Ut-Battelle Llc Agent-based method for distributed clustering of textual information
US8335753B2 (en) * 2004-11-03 2012-12-18 Microsoft Corporation Domain knowledge-assisted information processing
US7542969B1 (en) * 2004-11-03 2009-06-02 Microsoft Corporation Domain knowledge-assisted information processing
US7716209B1 (en) 2004-11-03 2010-05-11 Microsoft Corporation Automated advertisement publisher identification and selection
US7428533B2 (en) 2004-12-06 2008-09-23 Yahoo! Inc. Automatic generation of taxonomies for categorizing queries and search query processing using taxonomies
US7620628B2 (en) * 2004-12-06 2009-11-17 Yahoo! Inc. Search processing with automatic categorization of queries
US7716198B2 (en) * 2004-12-21 2010-05-11 Microsoft Corporation Ranking search results using feature extraction
US7818314B2 (en) * 2004-12-29 2010-10-19 Aol Inc. Search fusion
US7571157B2 (en) * 2004-12-29 2009-08-04 Aol Llc Filtering search results
US7272597B2 (en) 2004-12-29 2007-09-18 Aol Llc Domain expert search
US7349896B2 (en) * 2004-12-29 2008-03-25 Aol Llc Query routing
EP1854030A2 (en) 2005-01-28 2007-11-14 Aol Llc Web query classification
US7725418B2 (en) * 2005-01-28 2010-05-25 Honda Motor Co., Ltd. Responding to situations using multidimensional semantic net and Bayes inference
WO2006096260A2 (en) * 2005-01-31 2006-09-14 Musgrove Technology Enterprises, Llc System and method for generating an interlinked taxonomy structure
US7792811B2 (en) * 2005-02-16 2010-09-07 Transaxtions Llc Intelligent search with guiding info
US7689520B2 (en) * 2005-02-25 2010-03-30 Microsoft Corporation Machine learning system and method for ranking sets of data using a pairing cost function
US7792833B2 (en) * 2005-03-03 2010-09-07 Microsoft Corporation Ranking search results using language types
US20060200460A1 (en) * 2005-03-03 2006-09-07 Microsoft Corporation System and method for ranking search results using file types
US7680772B2 (en) * 2005-03-09 2010-03-16 Intuit Inc. Search quality detection
JP4185500B2 (en) * 2005-03-14 2008-11-26 株式会社東芝 Document search system, document search method and program
US8719244B1 (en) * 2005-03-23 2014-05-06 Google Inc. Methods and systems for retrieval of information items and associated sentence fragments
JP2008537225A (en) * 2005-04-11 2008-09-11 テキストディガー,インコーポレイテッド Search system and method for queries
US7832003B2 (en) * 2005-04-28 2010-11-09 Microsoft Corporation Walled gardens
US7401073B2 (en) * 2005-04-28 2008-07-15 International Business Machines Corporation Term-statistics modification for category-based search
US7529736B2 (en) * 2005-05-06 2009-05-05 Microsoft Corporation Performant relevance improvements in search query results
US7630976B2 (en) * 2005-05-10 2009-12-08 Microsoft Corporation Method and system for adapting search results to personal information needs
US7502810B2 (en) 2005-05-24 2009-03-10 International Business Machines Corporation Tagging of facet elements in a facet tree
US7774383B2 (en) * 2005-05-24 2010-08-10 International Business Machines Corporation Displaying facet tree elements and logging facet element item counts to a sequence document
US7739104B2 (en) * 2005-05-27 2010-06-15 Hakia, Inc. System and method for natural language processing and using ontological searches
US20110153509A1 (en) 2005-05-27 2011-06-23 Ip Development Venture Method and apparatus for cross-referencing important ip relationships
US8249344B2 (en) * 2005-07-01 2012-08-21 Microsoft Corporation Grammatical parsing of document visual structures
US8019713B2 (en) * 2005-07-08 2011-09-13 Honda Motor Co., Ltd. Commonsense reasoning about task instructions
US7370022B2 (en) * 2005-07-08 2008-05-06 Honda Motor Co. Building plans for household tasks from distributed knowledge
US8417693B2 (en) 2005-07-14 2013-04-09 International Business Machines Corporation Enforcing native access control to indexed documents
US7853618B2 (en) 2005-07-21 2010-12-14 The Boeing Company Methods and apparatus for generic semantic access to information systems
JP2009503657A (en) * 2005-07-22 2009-01-29 エル ターラー、ステフエン Rating system using neural network
US7725485B1 (en) 2005-08-01 2010-05-25 Google Inc. Generating query suggestions using contextual information
US7599917B2 (en) * 2005-08-15 2009-10-06 Microsoft Corporation Ranking search results using biased click distance
US7747618B2 (en) * 2005-09-08 2010-06-29 Microsoft Corporation Augmenting user, query, and document triplets using singular value decomposition
US20070067268A1 (en) * 2005-09-22 2007-03-22 Microsoft Corporation Navigation of structured data
WO2007038713A2 (en) * 2005-09-28 2007-04-05 Epacris Inc. Search engine determining results based on probabilistic scoring of relevance
EP1952280B8 (en) * 2005-10-11 2016-11-30 Ureveal, Inc. System, method&computer program product for concept based searching&analysis
US20070112738A1 (en) * 2005-11-14 2007-05-17 Aol Llc Displaying User Relevance Feedback for Search Results
US20070112898A1 (en) * 2005-11-15 2007-05-17 Clairvoyance Corporation Methods and apparatus for probe-based clustering
US20070112867A1 (en) * 2005-11-15 2007-05-17 Clairvoyance Corporation Methods and apparatus for rank-based response set clustering
US7657503B1 (en) * 2005-11-29 2010-02-02 At&T Corp. System and method for generating statistical descriptors for a data stream
US8832064B2 (en) * 2005-11-30 2014-09-09 At&T Intellectual Property Ii, L.P. Answer determination for natural language questioning
US7739279B2 (en) * 2005-12-12 2010-06-15 Fuji Xerox Co., Ltd. Systems and methods for determining relevant information based on document structure
US7660786B2 (en) * 2005-12-14 2010-02-09 Microsoft Corporation Data independent relevance evaluation utilizing cognitive concept relationship
WO2007081681A2 (en) 2006-01-03 2007-07-19 Textdigger, Inc. Search system with query refinement and search method
US7676485B2 (en) * 2006-01-20 2010-03-09 Ixreveal, Inc. Method and computer program product for converting ontologies into concept semantic networks
EP1920366A1 (en) 2006-01-20 2008-05-14 Glenbrook Associates, Inc. System and method for context-rich database optimized for processing of concepts
US7814099B2 (en) * 2006-01-31 2010-10-12 Louis S. Wang Method for ranking and sorting electronic documents in a search result list based on relevance
US7603330B2 (en) * 2006-02-01 2009-10-13 Honda Motor Co., Ltd. Meta learning for question classification
US8509563B2 (en) * 2006-02-02 2013-08-13 Microsoft Corporation Generation of documents from images
US20090112828A1 (en) * 2006-03-13 2009-04-30 Answers Corporation Method and system for answer extraction
US20070233679A1 (en) * 2006-04-03 2007-10-04 Microsoft Corporation Learning a document ranking function using query-level error measurements
US8862573B2 (en) 2006-04-04 2014-10-14 Textdigger, Inc. Search system and method with text function tagging
JP4201012B2 (en) * 2006-04-06 2008-12-24 ソニー株式会社 Data processing apparatus, data processing method, and program
US7890533B2 (en) * 2006-05-17 2011-02-15 Noblis, Inc. Method and system for information extraction and modeling
US8463810B1 (en) * 2006-06-01 2013-06-11 Monster Worldwide, Inc. Scoring concepts for contextual personalized information retrieval
US7730060B2 (en) * 2006-06-09 2010-06-01 Microsoft Corporation Efficient evaluation of object finder queries
US7593934B2 (en) * 2006-07-28 2009-09-22 Microsoft Corporation Learning a document ranking using a loss function with a rank pair or a query parameter
US8117197B1 (en) 2008-06-10 2012-02-14 Surf Canyon, Inc. Adaptive user interface for real-time search relevance feedback
US20100036797A1 (en) * 2006-08-31 2010-02-11 The Regents Of The University Of California Semantic search engine
US20080256026A1 (en) * 2006-10-17 2008-10-16 Michael Glen Hays Method For Optimizing And Executing A Query Using Ontological Metadata
TWI337712B (en) * 2006-10-30 2011-02-21 Inst Information Industry Systems and methods for measuring behavior characteristics, and machine readable medium thereof
WO2008055034A2 (en) * 2006-10-30 2008-05-08 Noblis, Inc. Method and system for personal information extraction and modeling with fully generalized extraction contexts
US20080114750A1 (en) * 2006-11-14 2008-05-15 Microsoft Corporation Retrieval and ranking of items utilizing similarity
US7650317B2 (en) * 2006-12-06 2010-01-19 Microsoft Corporation Active learning framework for automatic field extraction from network traffic
US7890536B2 (en) * 2006-12-21 2011-02-15 International Business Machines Corporation Generating templates of nodes to structure content objects and steps to process the content objects
US20080183691A1 (en) * 2007-01-30 2008-07-31 International Business Machines Corporation Method for a networked knowledge based document retrieval and ranking utilizing extracted document metadata and content
US8332209B2 (en) * 2007-04-24 2012-12-11 Zinovy D. Grinblat Method and system for text compression and decompression
US7818320B2 (en) * 2007-05-31 2010-10-19 Yahoo! Inc. Enhanced search results based on user feedback relating to search result abstracts
US8612445B2 (en) * 2009-05-13 2013-12-17 Hamid Hatami-Hanza System and method for a unified semantic ranking of compositions of ontological subjects and the applications thereof
US9613138B2 (en) * 2008-09-03 2017-04-04 Hamid Hatami-Hanza Unified semantic scoring of compositions of ontological subjects
US20090070322A1 (en) * 2007-08-31 2009-03-12 Powerset, Inc. Browsing knowledge on the basis of semantic relations
US8346756B2 (en) * 2007-08-31 2013-01-01 Microsoft Corporation Calculating valence of expressions within documents for searching a document index
US8229970B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Efficient storage and retrieval of posting lists
US8712758B2 (en) * 2007-08-31 2014-04-29 Microsoft Corporation Coreference resolution in an ambiguity-sensitive natural language processing system
US8868562B2 (en) * 2007-08-31 2014-10-21 Microsoft Corporation Identification of semantic relationships within reported speech
US7984032B2 (en) * 2007-08-31 2011-07-19 Microsoft Corporation Iterators for applying term occurrence-level constraints in natural language searching
US8463593B2 (en) * 2007-08-31 2013-06-11 Microsoft Corporation Natural language hypernym weighting for word sense disambiguation
US8639708B2 (en) * 2007-08-31 2014-01-28 Microsoft Corporation Fact-based indexing for natural language search
US8280721B2 (en) * 2007-08-31 2012-10-02 Microsoft Corporation Efficiently representing word sense probabilities
US8229730B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Indexing role hierarchies for words in a search index
US8041697B2 (en) * 2007-08-31 2011-10-18 Microsoft Corporation Semi-automatic example-based induction of semantic translation rules to support natural language search
US8316036B2 (en) 2007-08-31 2012-11-20 Microsoft Corporation Checkpointing iterators during search
WO2009038788A1 (en) 2007-09-21 2009-03-26 Noblis, Inc. Method and system for active learning screening process with dynamic information modeling
US8280892B2 (en) 2007-10-05 2012-10-02 Fujitsu Limited Selecting tags for a document by analyzing paragraphs of the document
US9081852B2 (en) * 2007-10-05 2015-07-14 Fujitsu Limited Recommending terms to specify ontology space
US7840569B2 (en) 2007-10-18 2010-11-23 Microsoft Corporation Enterprise relevancy ranking using a neural network
US9348912B2 (en) 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US20090106221A1 (en) * 2007-10-18 2009-04-23 Microsoft Corporation Ranking and Providing Search Results Based In Part On A Number Of Click-Through Features
US8140535B2 (en) * 2007-10-23 2012-03-20 International Business Machines Corporation Ontology-based network search engine
US8041702B2 (en) * 2007-10-25 2011-10-18 International Business Machines Corporation Ontology-based network search engine
US20090112841A1 (en) * 2007-10-29 2009-04-30 International Business Machines Corporation Document searching using contextual information leverage and insights
US20090254540A1 (en) * 2007-11-01 2009-10-08 Textdigger, Inc. Method and apparatus for automated tag generation for digital content
US20090144242A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation Indexer for searching research data
US20090144243A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation User interface for searching research data
US20090144265A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation Search engine for searching research data
US20090144318A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation System for searching research data
US20090144241A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation Search term parser for searching research data
US20090144222A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation Chart generator for searching research data
US20090144317A1 (en) * 2007-12-03 2009-06-04 Chartsource, Inc., A Delaware Corporation Data search markup language for searching research data
US8144841B2 (en) * 2007-12-05 2012-03-27 Microsoft Corporation Multimedia spam determination using speech conversion
US8413060B1 (en) 2007-12-18 2013-04-02 Aol Inc. Methods and systems for visually distinguishing user attribute similarities and differences
US20090226872A1 (en) * 2008-01-16 2009-09-10 Nicholas Langdon Gunther Electronic grading system
US8812493B2 (en) 2008-04-11 2014-08-19 Microsoft Corporation Search results ranking using editing distance and document information
US8069179B2 (en) * 2008-04-24 2011-11-29 Microsoft Corporation Preference judgements for relevance
US20090287668A1 (en) * 2008-05-16 2009-11-19 Justsystems Evans Research, Inc. Methods and apparatus for interactive document clustering
US20100042589A1 (en) * 2008-08-15 2010-02-18 Smyros Athena A Systems and methods for topical searching
US20100131513A1 (en) 2008-10-23 2010-05-27 Lundberg Steven W Patent mapping
US8290961B2 (en) * 2009-01-13 2012-10-16 Sandia Corporation Technique for information retrieval using enhanced latent semantic analysis generating rank approximation matrix by factorizing the weighted morpheme-by-document matrix
US9836538B2 (en) * 2009-03-03 2017-12-05 Microsoft Technology Licensing, Llc Domain-based ranking in document search
US8097823B2 (en) * 2009-03-11 2012-01-17 Geoffrey Bilder Wiring standard selector switch and universal network cable
JP2012520528A (en) * 2009-03-13 2012-09-06 インベンション マシーン コーポレーション System and method for automatic semantic labeling of natural language text
US9245243B2 (en) 2009-04-14 2016-01-26 Ureveal, Inc. Concept-based analysis of structured and unstructured data using concept inheritance
TW201044330A (en) * 2009-06-08 2010-12-16 Ind Tech Res Inst Teaching material auto expanding method and learning material expanding system using the same, and machine readable medium thereof
US8832131B2 (en) * 2009-07-08 2014-09-09 International Business Machines Corporation System, method, and apparatus for replicating a portion of a content repository using behavioral patterns
US8843506B2 (en) * 2009-07-08 2014-09-23 International Business Machines Corporation System, method, and apparatus for replicating a portion of a content repository
JP5289573B2 (en) * 2009-07-27 2013-09-11 株式会社東芝 Relevance presentation device, method and program
US8321357B2 (en) * 2009-09-30 2012-11-27 Lapir Gennady Method and system for extraction
US9158833B2 (en) * 2009-11-02 2015-10-13 Harry Urbschat System and method for obtaining document information
US9152883B2 (en) * 2009-11-02 2015-10-06 Harry Urbschat System and method for increasing the accuracy of optical character recognition (OCR)
US9213756B2 (en) * 2009-11-02 2015-12-15 Harry Urbschat System and method of using dynamic variance networks
US8954893B2 (en) * 2009-11-06 2015-02-10 Hewlett-Packard Development Company, L.P. Visually representing a hierarchy of category nodes
CN102063432A (en) 2009-11-12 2011-05-18 阿里巴巴集团控股有限公司 Retrieval method and retrieval system
US8161073B2 (en) 2010-05-05 2012-04-17 Holovisions, LLC Context-driven search
US8738635B2 (en) 2010-06-01 2014-05-27 Microsoft Corporation Detection of junk in search result ranking
US20120089421A1 (en) 2010-10-08 2012-04-12 Cerner Innovation, Inc. Multi-site clinical decision support for sepsis
US11398310B1 (en) 2010-10-01 2022-07-26 Cerner Innovation, Inc. Clinical decision support for sepsis
US10734115B1 (en) 2012-08-09 2020-08-04 Cerner Innovation, Inc Clinical decision support for sepsis
US10431336B1 (en) 2010-10-01 2019-10-01 Cerner Innovation, Inc. Computerized systems and methods for facilitating clinical decision making
US9424351B2 (en) 2010-11-22 2016-08-23 Microsoft Technology Licensing, Llc Hybrid-distribution model for search engine indexes
US9342582B2 (en) 2010-11-22 2016-05-17 Microsoft Technology Licensing, Llc Selection of atoms for search engine retrieval
US9195745B2 (en) 2010-11-22 2015-11-24 Microsoft Technology Licensing, Llc Dynamic query master agent for query execution
US8620907B2 (en) 2010-11-22 2013-12-31 Microsoft Corporation Matching funnel for large document index
US8478704B2 (en) * 2010-11-22 2013-07-02 Microsoft Corporation Decomposable ranking for efficient precomputing that selects preliminary ranking features comprising static ranking features and dynamic atom-isolated components
US9529908B2 (en) 2010-11-22 2016-12-27 Microsoft Technology Licensing, Llc Tiering of posting lists in search engine index
US8713024B2 (en) 2010-11-22 2014-04-29 Microsoft Corporation Efficient forward ranking in a search engine
US9009065B2 (en) * 2010-12-17 2015-04-14 Google Inc. Promoting content from an activity stream
US9245022B2 (en) * 2010-12-30 2016-01-26 Google Inc. Context-based person search
US10628553B1 (en) 2010-12-30 2020-04-21 Cerner Innovation, Inc. Health information transformation system
US9904726B2 (en) * 2011-05-04 2018-02-27 Black Hills IP Holdings, LLC. Apparatus and method for automated and assisted patent claim mapping and expense planning
JP5699789B2 (en) * 2011-05-10 2015-04-15 ソニー株式会社 Information processing apparatus, information processing method, program, and information processing system
US9679064B2 (en) * 2011-06-30 2017-06-13 Nokia Technologies Oy Method and apparatus for providing user-corrected search results
US9940363B2 (en) 2011-10-03 2018-04-10 Black Hills Ip Holdings, Llc Systems, methods and user interfaces in a patent management system
US8856156B1 (en) 2011-10-07 2014-10-07 Cerner Innovation, Inc. Ontology mapper
CN103092856B (en) * 2011-10-31 2015-09-23 阿里巴巴集团控股有限公司 Search result ordering method and equipment, searching method and equipment
US9117194B2 (en) 2011-12-06 2015-08-25 Nuance Communications, Inc. Method and apparatus for operating a frequently asked questions (FAQ)-based system
US8886651B1 (en) 2011-12-22 2014-11-11 Reputation.Com, Inc. Thematic clustering
US9436758B1 (en) 2011-12-27 2016-09-06 Google Inc. Methods and systems for partitioning documents having customer feedback and support content
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US20130226906A1 (en) * 2012-02-15 2013-08-29 Nuance Communications, Inc. System And Method For A Self-Configuring Question Answering System
US8595022B1 (en) 2012-03-05 2013-11-26 Reputation.Com, Inc. Follow-up determination
US10636041B1 (en) 2012-03-05 2020-04-28 Reputation.Com, Inc. Enterprise reputation evaluation
US20130275461A1 (en) * 2012-04-11 2013-10-17 Educational Testing Service Computer-Implemented Systems and Methods for Identifying Factual Information in a Written Document
CN107066509B (en) 2012-04-26 2020-12-04 阿里巴巴集团控股有限公司 Information providing method, processing server and merge server
US10249385B1 (en) 2012-05-01 2019-04-02 Cerner Innovation, Inc. System and method for record linkage
US9275135B2 (en) * 2012-05-29 2016-03-01 International Business Machines Corporation Annotating entities using cross-document signals
US11093984B1 (en) 2012-06-29 2021-08-17 Reputation.Com, Inc. Determining themes
WO2014040263A1 (en) * 2012-09-14 2014-03-20 Microsoft Corporation Semantic ranking using a forward index
US8898154B2 (en) 2012-09-19 2014-11-25 International Business Machines Corporation Ranking answers to a conceptual query
US20140120513A1 (en) * 2012-10-25 2014-05-01 International Business Machines Corporation Question and Answer System Providing Indications of Information Gaps
US9015097B2 (en) 2012-12-19 2015-04-21 Nuance Communications, Inc. System and method for learning answers to frequently asked questions from a semi-structured data source
US8805699B1 (en) 2012-12-21 2014-08-12 Reputation.Com, Inc. Reputation report with score
US8744866B1 (en) 2012-12-21 2014-06-03 Reputation.Com, Inc. Reputation report with recommendation
US11894117B1 (en) 2013-02-07 2024-02-06 Cerner Innovation, Inc. Discovering context-specific complexity and utilization sequences
US10769241B1 (en) 2013-02-07 2020-09-08 Cerner Innovation, Inc. Discovering context-specific complexity and utilization sequences
US10946311B1 (en) 2013-02-07 2021-03-16 Cerner Innovation, Inc. Discovering context-specific serial health trajectories
US8925099B1 (en) 2013-03-14 2014-12-30 Reputation.Com, Inc. Privacy scoring
US9064001B2 (en) 2013-03-15 2015-06-23 Nuance Communications, Inc. Method and apparatus for a frequently-asked questions portal workflow
US20140350961A1 (en) * 2013-05-21 2014-11-27 Xerox Corporation Targeted summarization of medical data based on implicit queries
US10483003B1 (en) 2013-08-12 2019-11-19 Cerner Innovation, Inc. Dynamically determining risk of clinical condition
US11581092B1 (en) 2013-08-12 2023-02-14 Cerner Innovation, Inc. Dynamic assessment for decision support
GB2517212B (en) 2013-08-16 2018-04-25 Toshiba Res Europe Limited A Computer Generated Emulation of a subject
US9519859B2 (en) 2013-09-06 2016-12-13 Microsoft Technology Licensing, Llc Deep structured semantic model produced using click-through data
US9535960B2 (en) * 2014-04-14 2017-01-03 Microsoft Corporation Context-sensitive search using a deep learning model
US20150325133A1 (en) * 2014-05-06 2015-11-12 Knowledge Diffusion Inc. Intelligent delivery of educational resources
US9576048B2 (en) 2014-06-26 2017-02-21 International Business Machines Corporation Complex service network ranking and clustering
US10089580B2 (en) 2014-08-11 2018-10-02 Microsoft Technology Licensing, Llc Generating and using a knowledge-enhanced model
US9990116B2 (en) * 2014-08-29 2018-06-05 Sap Se Systems and methods for self-learning dynamic interfaces
US9424298B2 (en) * 2014-10-07 2016-08-23 International Business Machines Corporation Preserving conceptual distance within unstructured documents
WO2016068455A1 (en) * 2014-10-30 2016-05-06 주식회사 플런티코리아 Method and system for providing adaptive keyboard interface and response input method using adaptive keyboard linked with conversation content
US10716517B1 (en) * 2014-11-26 2020-07-21 Cerner Innovation, Inc. Biomechanics abnormality identification
WO2016117920A1 (en) * 2015-01-20 2016-07-28 한국과학기술원 Knowledge represention expansion method and apparatus
CN104573070B (en) * 2015-01-26 2018-06-15 清华大学 A kind of Text Clustering Method for mixing length text set
US9940354B2 (en) 2015-03-09 2018-04-10 International Business Machines Corporation Providing answers to questions having both rankable and probabilistic components
JP6596848B2 (en) * 2015-03-10 2019-10-30 富士ゼロックス株式会社 Access right estimation apparatus and access right estimation program
US10628735B2 (en) 2015-06-05 2020-04-21 Deepmind Technologies Limited Reading comprehension neural networks
MX364165B (en) * 2015-06-19 2019-04-15 Tata Consultancy Services Ltd Methods and systems for searching logical patterns.
US10331675B2 (en) * 2015-08-06 2019-06-25 Clarifai, Inc. Systems and methods for learning new trained concepts used to retrieve content relevant to the concepts learned
US10382366B2 (en) * 2015-11-11 2019-08-13 Kik Interactive Inc. Method, system and apparatus for autonomous message generation
US10909450B2 (en) 2016-03-29 2021-02-02 Microsoft Technology Licensing, Llc Multiple-action computational model training and operation
CN107402912B (en) * 2016-05-19 2019-12-31 北京京东尚科信息技术有限公司 Method and device for analyzing semantics
CN106202042B (en) * 2016-07-06 2019-07-02 中央民族大学 A kind of keyword abstraction method based on figure
US10572221B2 (en) 2016-10-20 2020-02-25 Cortical.Io Ag Methods and systems for identifying a level of similarity between a plurality of data representations
CN106446287A (en) * 2016-11-08 2017-02-22 北京邮电大学 Answer aggregation method and system facing crowdsourcing scene question-answering system
US10275514B2 (en) 2016-11-22 2019-04-30 International Business Machines Corporation System and method for generating improved search queries from natural language questions
US10614109B2 (en) * 2017-03-29 2020-04-07 International Business Machines Corporation Natural language processing keyword analysis
US10467122B1 (en) 2017-04-27 2019-11-05 Intuit Inc. Methods, systems, and computer program product for capturing and classification of real-time data and performing post-classification tasks
US10528329B1 (en) 2017-04-27 2020-01-07 Intuit Inc. Methods, systems, and computer program product for automatic generation of software application code
US10467261B1 (en) 2017-04-27 2019-11-05 Intuit Inc. Methods, systems, and computer program product for implementing real-time classification and recommendations
US10705796B1 (en) * 2017-04-27 2020-07-07 Intuit Inc. Methods, systems, and computer program product for implementing real-time or near real-time classification of digital data
US10963501B1 (en) * 2017-04-29 2021-03-30 Veritas Technologies Llc Systems and methods for generating a topic tree for digital information
US10963503B2 (en) * 2017-06-06 2021-03-30 SparkCognition, Inc. Generation of document classifiers
US11157829B2 (en) 2017-07-18 2021-10-26 International Business Machines Corporation Method to leverage similarity and hierarchy of documents in NN training
CN107491425A (en) * 2017-07-26 2017-12-19 合肥美的智能科技有限公司 Determine method, determining device, computer installation and computer-readable recording medium
US10839157B2 (en) * 2017-10-09 2020-11-17 Talentful Technology Inc. Candidate identification and matching
US10691734B2 (en) * 2017-11-21 2020-06-23 International Business Machines Corporation Searching multilingual documents based on document structure extraction
CN107862070B (en) * 2017-11-22 2021-08-10 华南理工大学 Online classroom discussion short text instant grouping method and system based on text clustering
WO2019118864A1 (en) * 2017-12-15 2019-06-20 Google Llc Training and/or using an encoder model to determine responsive action(s) for natural language input
CN108170684B (en) 2018-01-22 2020-06-05 京东方科技集团股份有限公司 Text similarity calculation method and system, data query system and computer product
US10325022B1 (en) * 2018-03-13 2019-06-18 Appian Corporation Automated expression parallelization
US11144783B2 (en) * 2018-05-21 2021-10-12 Naver Corporation Servers, non-transitory computer-readable media and methods for providing articles
US11500914B2 (en) * 2018-09-05 2022-11-15 International Business Machines Corporation Query recommendation to locate an application programming interface
US11157477B2 (en) 2018-11-28 2021-10-26 International Business Machines Corporation Handling queries in document systems using segment differential based document text-index modelling
RU2744028C2 (en) * 2018-12-26 2021-03-02 Общество С Ограниченной Ответственностью "Яндекс" Method and system for storing multiple documents
CN109871428B (en) 2019-01-30 2022-02-18 北京百度网讯科技有限公司 Method, apparatus, device and medium for determining text relevance
US11397776B2 (en) 2019-01-31 2022-07-26 At&T Intellectual Property I, L.P. Systems and methods for automated information retrieval
US11461788B2 (en) 2019-06-26 2022-10-04 International Business Machines Corporation Matching a customer and customer representative dynamically based on a customer representative's past performance
US11227250B2 (en) 2019-06-26 2022-01-18 International Business Machines Corporation Rating customer representatives based on past chat transcripts
US11210677B2 (en) 2019-06-26 2021-12-28 International Business Machines Corporation Measuring the effectiveness of individual customer representative responses in historical chat transcripts
KR102324196B1 (en) * 2019-09-18 2021-11-11 주식회사 솔트룩스 System and method for consolidating knowledge base
CN110941743B (en) * 2019-10-14 2023-09-15 广西壮族自治区科学技术情报研究所 Scientific and technological project duplicate checking method for automatically realizing field weight distribution based on deep learning algorithm
US11730420B2 (en) 2019-12-17 2023-08-22 Cerner Innovation, Inc. Maternal-fetal sepsis indicator
US11663273B2 (en) 2020-06-30 2023-05-30 International Business Machines Corporation Cognitive horizon surveillance
US11520782B2 (en) 2020-10-13 2022-12-06 Oracle International Corporation Techniques for utilizing patterns and logical entities
US11620453B2 (en) * 2020-10-16 2023-04-04 CS Disco, Inc. System and method for artificial intelligence driven document analysis, including searching, indexing, comparing or associating datasets based on learned representations
US11734332B2 (en) 2020-11-19 2023-08-22 Cortical.Io Ag Methods and systems for reuse of data item fingerprints in generation of semantic maps
CN112395854B (en) * 2020-12-02 2022-11-22 中国标准化研究院 Standard element consistency inspection method
US11409800B1 (en) 2021-07-23 2022-08-09 Bank Of America Corporation Generating search queries for database searching

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992004681A1 (en) * 1990-08-29 1992-03-19 Gte Laboratories Incorporated Adaptive ranking system for information retrieval
US5870740A (en) * 1996-09-30 1999-02-09 Apple Computer, Inc. System and method for improving the ranking of information retrieval results for short queries
US5878386A (en) * 1996-06-28 1999-03-02 Microsoft Corporation Natural language parser with dictionary-based part-of-speech probabilities
US6070158A (en) * 1996-08-14 2000-05-30 Infoseek Corporation Real-time document collection search engine with phrase indexing

Family Cites Families (65)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4270182A (en) 1974-12-30 1981-05-26 Asija Satya P Automated information input, storage, and retrieval system
US4887212A (en) 1986-10-29 1989-12-12 International Business Machines Corporation Parser for natural language text
US4864502A (en) 1987-10-07 1989-09-05 Houghton Mifflin Company Sentence analyzer
US4914590A (en) 1988-05-18 1990-04-03 Emhart Industries, Inc. Natural language understanding system
US4984178A (en) 1989-02-21 1991-01-08 Texas Instruments Incorporated Chart parser for stochastic unification grammar
SE466029B (en) 1989-03-06 1991-12-02 Ibm Svenska Ab DEVICE AND PROCEDURE FOR ANALYSIS OF NATURAL LANGUAGES IN A COMPUTER-BASED INFORMATION PROCESSING SYSTEM
JPH02240769A (en) 1989-03-14 1990-09-25 Canon Inc Device for preparing natural language sentence
US5056021A (en) 1989-06-08 1991-10-08 Carolyn Ausborn Method and apparatus for abstracting concepts from natural language
JPH03129472A (en) 1989-07-31 1991-06-03 Ricoh Co Ltd Processing method for document retrieving device
US5146406A (en) 1989-08-16 1992-09-08 International Business Machines Corporation Computer method for identifying predicate-argument structures in natural language text
US5404295A (en) 1990-08-16 1995-04-04 Katz; Boris Method and apparatus for utilizing annotations to facilitate computer retrieval of database material
US5309359A (en) 1990-08-16 1994-05-03 Boris Katz Method and apparatus for generating and utlizing annotations to facilitate computer text retrieval
EP0473864A1 (en) 1990-09-04 1992-03-11 International Business Machines Corporation Method and apparatus for paraphrasing information contained in logical forms
US5317507A (en) 1990-11-07 1994-05-31 Gallant Stephen I Method for document retrieval and for word sense disambiguation using neural networks
JP2943447B2 (en) 1991-01-30 1999-08-30 三菱電機株式会社 Text information extraction device, text similarity matching device, text search system, text information extraction method, text similarity matching method, and question analysis device
US5680627A (en) 1991-02-15 1997-10-21 Texas Instruments Incorporated Method and apparatus for character preprocessing which translates textual description into numeric form for input to a neural network
US5265065A (en) 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
US5446891A (en) 1992-02-26 1995-08-29 International Business Machines Corporation System for adjusting hypertext links with weighed user goals and activities
GB9217886D0 (en) 1992-08-21 1992-10-07 Canon Res Ct Europe Ltd Method and apparatus for parsing natural language
US6055531A (en) 1993-03-24 2000-04-25 Engate Incorporated Down-line transcription system having context sensitive searching capability
US5475588A (en) 1993-06-18 1995-12-12 Mitsubishi Electric Research Laboratories, Inc. System for decreasing the time required to parse a sentence
US5331556A (en) 1993-06-28 1994-07-19 General Electric Company Method for natural language data processing using morphological and part-of-speech information
US5619709A (en) 1993-09-20 1997-04-08 Hnc, Inc. System and method of context vector generation and retrieval
US5873056A (en) 1993-10-12 1999-02-16 The Syracuse University Natural language processing system for semantic vector representation which accounts for lexical ambiguity
US5576954A (en) 1993-11-05 1996-11-19 University Of Central Florida Process for determination of text relevancy
JP3476237B2 (en) 1993-12-28 2003-12-10 富士通株式会社 Parser
US5706497A (en) 1994-08-15 1998-01-06 Nec Research Institute, Inc. Document retrieval using fuzzy-logic inference
JPH0877010A (en) 1994-09-07 1996-03-22 Hitachi Ltd Method and device for data analysis
US5790754A (en) 1994-10-21 1998-08-04 Sensory Circuits, Inc. Speech recognition apparatus for consumer electronic applications
US5758257A (en) 1994-11-29 1998-05-26 Herz; Frederick System and method for scheduling broadcast of and access to video programs and other data using customer profiles
US5642502A (en) 1994-12-06 1997-06-24 University Of Central Florida Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text
US5794050A (en) 1995-01-04 1998-08-11 Intelligent Text Processing, Inc. Natural language understanding system
US5694523A (en) 1995-05-31 1997-12-02 Oracle Corporation Content processing system for discourse
US5675710A (en) 1995-06-07 1997-10-07 Lucent Technologies, Inc. Method and apparatus for training a text classifier
US5721938A (en) 1995-06-07 1998-02-24 Stuckey; Barbara K. Method and device for parsing and analyzing natural language sentences and text
US6026388A (en) 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US5963940A (en) 1995-08-16 1999-10-05 Syracuse University Natural language information retrieval system and method
US6006221A (en) 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
US5721902A (en) 1995-09-15 1998-02-24 Infonautics Corporation Restricted expansion of query terms using part of speech tagging
US5974455A (en) 1995-12-13 1999-10-26 Digital Equipment Corporation System for adding new entry to web page table upon receiving web page including link to another web page not having corresponding entry in web page table
US5864855A (en) 1996-02-26 1999-01-26 The United States Of America As Represented By The Secretary Of The Army Parallel document clustering process
SG49804A1 (en) 1996-03-20 1998-06-15 Government Of Singapore Repres Parsing and translating natural language sentences automatically
US5802515A (en) 1996-06-11 1998-09-01 Massachusetts Institute Of Technology Randomized query generation and document relevance ranking for robust information retrieval from a database
US5915249A (en) 1996-06-14 1999-06-22 Excite, Inc. System and method for accelerated query evaluation of very large full-text databases
US5864863A (en) 1996-08-09 1999-01-26 Digital Equipment Corporation Method for parsing, indexing and searching world-wide-web pages
US6076051A (en) 1997-03-07 2000-06-13 Microsoft Corporation Information retrieval utilizing semantic representation of text
US6049799A (en) 1997-05-12 2000-04-11 Novell, Inc. Document link management using directory services
US6038560A (en) 1997-05-21 2000-03-14 Oracle Corporation Concept knowledge base search and retrieval system
US5940821A (en) 1997-05-21 1999-08-17 Oracle Corporation Information presentation in a knowledge base search and retrieval system
US6047277A (en) 1997-06-19 2000-04-04 Parry; Michael H. Self-organizing neural network for plain text categorization
US6012053A (en) 1997-06-23 2000-01-04 Lycos, Inc. Computer system with user-controlled relevance ranking of search results
US6233575B1 (en) 1997-06-24 2001-05-15 International Business Machines Corporation Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
US5933822A (en) 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
US5960384A (en) 1997-09-03 1999-09-28 Brash; Douglas E. Method and device for parsing natural language sentences and other sequential symbolic expressions
US5974412A (en) 1997-09-24 1999-10-26 Sapient Health Network Intelligent query system for automatically indexing information in a database and automatically categorizing users
US5953718A (en) 1997-11-12 1999-09-14 Oracle Corporation Research mode for a knowledge base search and retrieval system
CA2363017C (en) 1999-02-19 2011-04-19 The Trustees Of Columbia University In The City Of New York Multi-document summarization system and method
US6760720B1 (en) * 2000-02-25 2004-07-06 Pedestrian Concepts, Inc. Search-on-the-fly/sort-on-the-fly search engine for searching databases
US6961728B2 (en) * 2000-11-28 2005-11-01 Centerboard, Inc. System and methods for highly distributed wide-area data management of a network of data sources through a database interface
US7178100B2 (en) * 2000-12-15 2007-02-13 Call Charles G Methods and apparatus for storing and manipulating variable length and fixed length data elements as a sequence of fixed length integers
US20030101182A1 (en) * 2001-07-18 2003-05-29 Omri Govrin Method and system for smart search engine and other applications
US7225183B2 (en) * 2002-01-28 2007-05-29 Ipxl, Inc. Ontology-based information management system and method
JP4142881B2 (en) * 2002-03-07 2008-09-03 富士通株式会社 Document similarity calculation device, clustering device, and document extraction device
JP2003345810A (en) * 2002-05-28 2003-12-05 Hitachi Ltd Method and system for document retrieval and document retrieval result display system
US7039897B2 (en) * 2002-07-12 2006-05-02 Hewlett-Packard Development Company, L.P. Modeling a target system by interpolating

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992004681A1 (en) * 1990-08-29 1992-03-19 Gte Laboratories Incorporated Adaptive ranking system for information retrieval
US5878386A (en) * 1996-06-28 1999-03-02 Microsoft Corporation Natural language parser with dictionary-based part-of-speech probabilities
US6070158A (en) * 1996-08-14 2000-05-30 Infoseek Corporation Real-time document collection search engine with phrase indexing
US5870740A (en) * 1996-09-30 1999-02-09 Apple Computer, Inc. System and method for improving the ranking of information retrieval results for short queries

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BRIN S ET AL: "The anatomy of a large-scale hypertextual Web search engine", COMPUTER NETWORKS AND ISDN SYSTEMS, NORTH HOLLAND PUBLISHING. AMSTERDAM, NL, vol. 30, no. 1-7, 1 April 1998 (1998-04-01), pages 107 - 117, XP004121435, ISSN: 0169-7552 *
FEINSTEIN Y Z ET AL: "Relevancy ranking of Web pages using shallow parsing", PADD97 PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON THE PRACTICAL APPLICATION OF KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON THE PRACTICAL APPLICATION OF KNOWLEDGE DISCOVERY AND DATA MINING PADD 9, 1997, Blackpool, UK, Practical Application Co, UK, pages 125 - 135, XP008020004, ISBN: 0-9525554-7-6 *

Also Published As

Publication number Publication date
US20040111408A1 (en) 2004-06-10
AU2002235314A1 (en) 2002-07-30
US7496561B2 (en) 2009-02-24
WO2002057961A2 (en) 2002-07-25
US20020129015A1 (en) 2002-09-12
US6766316B2 (en) 2004-07-20

Similar Documents

Publication Publication Date Title
WO2002057961A3 (en) Method and system of ranking and clustering for document indexing and retrieval
US6167397A (en) Method of clustering electronic documents in response to a search query
US6980976B2 (en) Combined database index of unstructured and structured columns
WO2005036351A3 (en) Systems and methods for search processing using superunits
Turtle Text retrieval in the legal world
Lee et al. Object matching for information integration: A profiler-based approach
WO2000067159A3 (en) System and method for searching and recommending documents in a collection using shared bookmarks
AU3092099A (en) Perspective transformations on relational database tables
WO1998039697A3 (en) System and method for accessing heterogeneous databases
WO1997008604A3 (en) Multilingual document retrieval system and method using semantic vector matching
WO2001009747A3 (en) Apparatus and methods for collaboratively searching knowledge databases
EP2562663A3 (en) Parent-child query indexing for XML databases
Yang et al. A semi-structured document model for text mining
WO2006118659A2 (en) Database reverse query matching
Wang et al. Use link-based clustering to improve web search results
WO2002089004A3 (en) Search data management
WO2003042780A3 (en) System and method for storage and analysis of gene expression data
Doan et al. Profile-based object matching for information integration
WO2001088656A3 (en) Apparatus and method for performing transformation-based indexing of high-dimensional data
Xu et al. Effective top-k keyword search in relational databases considering query semantics
Vazirgiannis et al. Uncertainty handling in the data mining process with fuzzy logic
Chandranegara et al. Ontology Alignment using combined similarity method and matching method
Abulaish et al. Information extraction and imprecise query answering from web documents
WO2001001294A3 (en) Biological data processing
Sudarshan et al. Keyword searching and browsing in databases using BANKS

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP