WO1997008686A3 - Method and system for pattern recognition based on tree organised probability densities

Info

Publication number
WO1997008686A3
Authority
WO
WIPO (PCT)
Prior art keywords
tree
pattern
input
probability densities
corresponds
Prior art date
Application number
PCT/IB1996/000860
Other languages
French (fr)
Other versions
WO1997008686A2 (en)
Inventor
Frank Seide
Original Assignee
Philips Electronics Nv
Philips Norden Ab
Philips Patentverwaltung
Priority date
Filing date
Publication date
Application filed by Philips Electronics Nv, Philips Norden Ab, Philips Patentverwaltung
Priority to DE69613338T (DE69613338T2)
Priority to JP51005797A (JP3948747B2)
Priority to EP96926547A (EP0788649B1)
Publication of WO1997008686A2
Publication of WO1997008686A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]

Abstract

The method and system are used for recognising a time-sequential input pattern (20), which is derived from a continual physical quantity, such as speech. The system comprises input means (30), which accesses the physical quantity and therefrom generates a sequence of input observation vectors. The input observation vectors represent the input pattern. A reference pattern database (40) is used for storing reference patterns, which consist of a sequence of reference units. Each reference unit is represented by associated reference probability densities. A tree builder (60) represents for each reference unit the set of associated reference probability densities as a tree structure. Each leaf node of the tree corresponds to a reference probability density. Each non-leaf node corresponds to a cluster probability density, which is derived from all reference probability densities corresponding to leaf nodes in branches below the non-leaf node. A localizer (50) is used for locating among the reference patterns stored in the reference pattern database (40) a recognised reference pattern, which corresponds to the input pattern.
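The tree organisation described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say how cluster densities are computed or how the tree is searched, so everything below is an assumption made for illustration: one-dimensional Gaussian densities, equal weights, moment matching to form each cluster density, a binary split on sorted means, and a greedy descent that follows the more likely child at each level. The names `Gaussian`, `Node`, `merge`, `build_tree` and `search` are hypothetical.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class Gaussian:
    """One-dimensional Gaussian density (an assumed density form)."""
    mean: float
    var: float

    def log_pdf(self, x: float) -> float:
        return -0.5 * (math.log(2 * math.pi * self.var)
                       + (x - self.mean) ** 2 / self.var)


@dataclass
class Node:
    """Tree node: a leaf holds a reference density, a non-leaf a cluster density."""
    density: Gaussian
    children: List["Node"] = field(default_factory=list)

    @property
    def is_leaf(self) -> bool:
        return not self.children


def merge(densities: List[Gaussian]) -> Gaussian:
    """Moment-match equally weighted Gaussians into one cluster Gaussian (assumption)."""
    n = len(densities)
    mean = sum(d.mean for d in densities) / n
    var = sum(d.var + (d.mean - mean) ** 2 for d in densities) / n
    return Gaussian(mean, var)


def build_tree(densities: List[Gaussian]) -> Node:
    """Build a binary tree; each non-leaf density is derived from all leaves below it."""
    ordered = sorted(densities, key=lambda d: d.mean)
    if len(ordered) == 1:
        return Node(ordered[0])
    mid = len(ordered) // 2
    children = [build_tree(ordered[:mid]), build_tree(ordered[mid:])]
    return Node(merge(ordered), children)


def search(root: Node, x: float) -> Gaussian:
    """Greedy descent (assumption): at each level follow the most likely child."""
    node = root
    while not node.is_leaf:
        node = max(node.children, key=lambda c: c.density.log_pdf(x))
    return node.density


# Four reference densities for one hypothetical reference unit.
reference = [Gaussian(0.0, 1.0), Gaussian(1.0, 1.0),
             Gaussian(5.0, 1.0), Gaussian(6.0, 1.0)]
root = build_tree(reference)
best = search(root, 5.2)  # evaluates 4 densities on the way down
```

In this sketch an observation is scored against one path of the tree rather than against every leaf, which is the kind of saving a tree organisation makes possible. A greedy descent can miss the globally best leaf, so it should be read as an approximation.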

Priority Applications (3)

Application Number Priority Date Filing Date Title
DE69613338T DE69613338T2 (en) 1995-08-28 1996-08-26 METHOD AND SYSTEM FOR PATTERN RECOGNITION USING TREE-STRUCTURED PROBABILITY DENSITIES
JP51005797A JP3948747B2 (en) 1995-08-28 1996-08-26 Pattern recognition method and system based on tree configuration probability density
EP96926547A EP0788649B1 (en) 1995-08-28 1996-08-26 Method and system for pattern recognition based on tree organised probability densities

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP95202318 1995-08-28
EP95202318.2 1995-08-28

Publications (2)

Publication Number Publication Date
WO1997008686A2 (en) 1997-03-06
WO1997008686A3 (en) 1997-04-03

Family

ID=8220590

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1996/000860 WO1997008686A2 (en) 1995-08-28 1996-08-26 Method and system for pattern recognition based on tree organised probability densities

Country Status (5)

Country Link
US (1) US5857169A (en)
EP (1) EP0788649B1 (en)
JP (1) JP3948747B2 (en)
DE (1) DE69613338T2 (en)
WO (1) WO1997008686A2 (en)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6260013B1 (en) * 1997-03-14 2001-07-10 Lernout & Hauspie Speech Products N.V. Speech recognition system employing discriminatively trained models
US6292797B1 (en) * 1997-08-07 2001-09-18 New York University Method for determining actionable patterns in a database
CA2216224A1 (en) * 1997-09-19 1999-03-19 Peter R. Stubley Block algorithm for pattern recognition
EP0979497A1 (en) * 1997-10-08 2000-02-16 Koninklijke Philips Electronics N.V. Vocabulary and/or language model training
US5983180A (en) * 1997-10-23 1999-11-09 Softsound Limited Recognition of sequential data using finite state sequence models organized in a tree structure
US6151574A (en) * 1997-12-05 2000-11-21 Lucent Technologies Inc. Technique for adaptation of hidden markov models for speech recognition
US6148295A (en) * 1997-12-30 2000-11-14 International Business Machines Corporation Method for computing near neighbors of a query point in a database
JP4004619B2 (en) * 1998-01-08 2007-11-07 富士通株式会社 Inventory management device capable of automatic inventory allocation
US6269334B1 (en) * 1998-06-25 2001-07-31 International Business Machines Corporation Nongaussian density estimation for the classification of acoustic feature vectors in speech recognition
US6721759B1 (en) * 1998-12-24 2004-04-13 Sony Corporation Techniques for spatial representation of data and browsing based on similarity
US6684186B2 (en) * 1999-01-26 2004-01-27 International Business Machines Corporation Speaker recognition using a hierarchical speaker model tree
US6594392B2 (en) * 1999-05-17 2003-07-15 Intel Corporation Pattern recognition based on piecewise linear probability density function
US6421668B1 (en) * 1999-08-05 2002-07-16 Agilent Technologies, Inc. Method and system for partitioning data into subsets of related data
US6662184B1 (en) * 1999-09-23 2003-12-09 International Business Machines Corporation Lock-free wild card search data structure and method
US7072833B2 (en) * 2000-06-02 2006-07-04 Canon Kabushiki Kaisha Speech processing system
US6789063B1 (en) * 2000-09-01 2004-09-07 Intel Corporation Acoustic modeling using a two-level decision tree in a speech recognition system
US6978239B2 (en) * 2000-12-04 2005-12-20 Microsoft Corporation Method and apparatus for speech synthesis without prosody modification
US6845357B2 (en) * 2001-07-24 2005-01-18 Honeywell International Inc. Pattern recognition using an observable operator model
US6757651B2 (en) * 2001-08-28 2004-06-29 Intellisist, Llc Speech detection system and method
US20050228661A1 (en) * 2002-05-06 2005-10-13 Josep Prous Blancafort Voice recognition method
EP1387232A1 (en) * 2002-07-29 2004-02-04 Centre National De La Recherche Scientifique Method for determining a value to be allocated to different parameters of a system
US7788096B2 (en) * 2002-09-03 2010-08-31 Microsoft Corporation Method and apparatus for generating decision tree questions for speech processing
US7571097B2 (en) * 2003-03-13 2009-08-04 Microsoft Corporation Method for training of subspace coded gaussian models
US7496498B2 (en) * 2003-03-24 2009-02-24 Microsoft Corporation Front-end architecture for a multi-lingual text-to-speech system
GB2409750B (en) * 2004-01-05 2006-03-15 Toshiba Res Europ Ltd Speech recognition system and technique
US7542949B2 (en) * 2004-05-12 2009-06-02 Mitsubishi Electric Research Laboratories, Inc. Determining temporal patterns in sensed data sequences by hierarchical decomposition of hidden Markov models
KR100703697B1 (en) * 2005-02-02 2007-04-05 삼성전자주식회사 Method and Apparatus for recognizing lexicon using lexicon group tree
US20060235698A1 (en) * 2005-04-13 2006-10-19 Cane David A Apparatus for controlling a home theater system by speech commands
US7805301B2 (en) * 2005-07-01 2010-09-28 Microsoft Corporation Covariance estimation for pattern recognition
US7877255B2 (en) * 2006-03-31 2011-01-25 Voice Signal Technologies, Inc. Speech recognition using channel verification
JP5088030B2 (en) * 2007-07-26 2012-12-05 ヤマハ株式会社 Method, apparatus and program for evaluating similarity of performance sound
JP2010140383A (en) * 2008-12-15 2010-06-24 Sony Corp Information processor and method, and program
US20100185672A1 (en) * 2009-01-21 2010-07-22 Rising Iii Hawley K Techniques for spatial representation of data and browsing based on similarity
EP2583252B1 (en) 2010-06-16 2023-11-01 Yale University Forest inventory assessment using remote sensing data
US20140047089A1 (en) * 2012-08-10 2014-02-13 International Business Machines Corporation System and method for supervised network clustering
JP6246636B2 (en) * 2014-03-20 2017-12-13 株式会社東芝 PATTERN IDENTIFICATION DEVICE, PATTERN IDENTIFICATION METHOD, AND PROGRAM
CN106297775B (en) * 2015-06-02 2019-11-19 富泰华工业(深圳)有限公司 Speech recognition equipment and method
CN105096955B (en) * 2015-09-06 2019-02-01 广东外语外贸大学 A kind of speaker's method for quickly identifying and system based on model growth cluster
US10482196B2 (en) * 2016-02-26 2019-11-19 Nvidia Corporation Modeling point cloud data using hierarchies of Gaussian mixture models
CN107293298B (en) * 2016-04-05 2021-02-19 富泰华工业(深圳)有限公司 Voice control system and method
KR101902882B1 (en) * 2016-07-14 2018-11-13 연세대학교 산학협력단 A method for tracking a coronary artery in three dimensional coronary computed tomography angiography using a random tree walk algorithm
US20210035025A1 (en) * 2019-07-29 2021-02-04 Oracle International Corporation Systems and methods for optimizing machine learning models by summarizing list characteristics based on multi-dimensional feature vectors
US11615428B1 (en) 2022-01-04 2023-03-28 Natural Capital Exchange, Inc. On-demand estimation of potential carbon credit production for a forested area

Citations (3)

Publication number Priority date Publication date Assignee Title
EP0627726A1 (en) * 1993-06-03 1994-12-07 Nec Corporation Pattern recognition with a tree structure used for reference pattern feature vectors or for HMM
WO1996013830A1 (en) * 1994-10-26 1996-05-09 Dictaphone Corporation (U.S.) Decision tree classifier designed using hidden markov models
US5528701A (en) * 1994-09-02 1996-06-18 Panasonic Technologies, Inc. Trie based method for indexing handwritten databases

Non-Patent Citations (1)

Title
ICASSP'91, Volume 1, 1991, (Toronto), L.R. BAHL et al., "Decision Trees for Phonological Rules in Continuous Speech", pages 187-188. *

Also Published As

Publication number Publication date
DE69613338T2 (en) 2002-05-29
JP3948747B2 (en) 2007-07-25
EP0788649B1 (en) 2001-06-13
EP0788649A2 (en) 1997-08-13
JPH10508392A (en) 1998-08-18
US5857169A (en) 1999-01-05
WO1997008686A2 (en) 1997-03-06
DE69613338D1 (en) 2001-07-19

Similar Documents

Publication Publication Date Title
WO1997008686A3 (en) Method and system for pattern recognition based on tree organised probability densities
WO1997008685A3 (en) Method and system for pattern recognition based on dynamically constructing a subset of reference vectors
US10971140B2 (en) Speech recognition circuit using parallel processors
ES2085428T3 (en) A SET AND A METHOD FOR THE TREATMENT OF DATA COMPRESSION BY VECTOR QUANTIFICATION OF SEARCH IN BINARY TREE.
EP0375307A3 (en) Structure for and method of arranging recursively derived data in a database
EP0313975A3 (en) Design and construction of a binary-tree system for language modelling
EP1052576A3 (en) Method for searching in large databases of automatically recognized text
CA2216224A1 (en) Block algorithm for pattern recognition
CA2124906A1 (en) Pattern recognition with a tree structure used for reference pattern feature vectors or for hmm
CA2246948A1 (en) Method and means for encoding storing and retrieving hierarchical data processing information for a computer system
EP0085545A3 (en) Pattern recognition apparatus and method for making same
EP0335739A3 (en) Pattern recognition system
Kliemann A stochastic dynamical model for the characterization of the geometrical structure of dendritic processes
EP0523347B1 (en) A fast algorithm for deriving acoustic prototypes for automatic speech recognition
Fritsch et al. Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees
Chen Identification of contextual factors for pronunciation networks
Stevenson Linguistic Research in the Nuba Mountains-i
JP2002507009A (en) Method for automatically controlling an electronic music device by real-time configuration and search of a multi-level data structure
EP0731447A3 (en) Reference pattern training system and speech recognition system using the same
Chen et al. Automatic discovery of contextual factors describing phonological variation
ATE227903T1 (en) DATA COMPRESSION SYSTEM AND METHOD
Padmanabhan et al. Decision-tree based quantization of the feature space of a speech recognizer.
Chan et al. Pruning of state-tying tree using Bayesian information criterion with multiple mixtures
GB9308240D0 (en) Natural language processing system
Ostendorf et al. Segment-Based Acoustic Models for Continuous Speech Recognition.

Legal Events

Date Code Title Description
AK Designated states (Kind code of ref document: A2; Designated state(s): JP)
AL Designated countries for regional patents (Kind code of ref document: A2; Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE)
WWE WIPO information: entry into national phase (Ref document number: 1996926547; Country of ref document: EP)
AK Designated states (Kind code of ref document: A3; Designated state(s): JP)
AL Designated countries for regional patents (Kind code of ref document: A3; Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE)
121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWP WIPO information: published in national office (Ref document number: 1996926547; Country of ref document: EP)
WWG WIPO information: grant in national office (Ref document number: 1996926547; Country of ref document: EP)