WO2003100769A1 - Method of determining uncertainty associated with noise reduction
- Publication number
- WO2003100769A1 (PCT/US2003/016032)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- noise
- computer
- signal
- uncertainty
- component
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03731299A EP1506542A1 (en) | 2002-05-20 | 2003-05-20 | Method of determining uncertainty associated with noise reduction |
AU2003241553A AU2003241553A1 (en) | 2002-05-20 | 2003-05-20 | Method of determining uncertainty associated with noise reduction |
KR10-2004-7018410A KR20050000541A (en) | 2002-05-20 | 2003-05-20 | Method of determining uncertainty associated with noise reduction |
JP2004508336A JP2005527002A (en) | 2002-05-20 | 2003-05-20 | Method for determining uncertainty associated with noise reduction |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/152,127 | 2002-05-20 | ||
US10/152,143 US7107210B2 (en) | 2002-05-20 | 2002-05-20 | Method of noise reduction based on dynamic aspects of speech |
US10/152,127 US7103540B2 (en) | 2002-05-20 | 2002-05-20 | Method of pattern recognition using noise reduction uncertainty |
US10/152,143 | 2002-05-20 | ||
US10/236,042 | 2002-09-05 | ||
US10/236,042 US7174292B2 (en) | 2002-05-20 | 2002-09-05 | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2003100769A1 (en) | 2003-12-04 |
Family
ID=29587546
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2003/016032 WO2003100769A1 (en) | 2002-05-20 | 2003-05-20 | Method of determining uncertainty associated with noise reduction |
Country Status (7)
Country | Link |
---|---|
US (2) | US7174292B2 (en) |
EP (1) | EP1506542A1 (en) |
JP (1) | JP2005527002A (en) |
KR (1) | KR20050000541A (en) |
CN (1) | CN1653520A (en) |
AU (1) | AU2003241553A1 (en) |
WO (1) | WO2003100769A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2334429A1 (en) * | 2009-09-24 | 2010-03-09 | Universidad Politecnica De Madrid | System and procedure of detection and identification of sounds in real time produced by specific sources sources. (Machine-translation by Google Translate, not legally binding) |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
US7107210B2 (en) * | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
US7174292B2 (en) * | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
KR100558391B1 (en) * | 2003-10-16 | 2006-03-10 | 삼성전자주식회사 | Display apparatus and control method thereof |
DE102004008225B4 (en) * | 2004-02-19 | 2006-02-16 | Infineon Technologies Ag | Method and device for determining feature vectors from a signal for pattern recognition, method and device for pattern recognition and computer-readable storage media |
JP4765461B2 (en) * | 2005-07-27 | 2011-09-07 | 日本電気株式会社 | Noise suppression system, method and program |
US20070219796A1 (en) * | 2006-03-20 | 2007-09-20 | Microsoft Corporation | Weighted likelihood ratio for pattern recognition |
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US8209175B2 (en) * | 2006-06-08 | 2012-06-26 | Microsoft Corporation | Uncertainty interval content sensing within communications |
KR100908121B1 (en) * | 2006-12-15 | 2009-07-16 | 삼성전자주식회사 | Speech feature vector conversion method and apparatus |
US8195453B2 (en) * | 2007-09-13 | 2012-06-05 | Qnx Software Systems Limited | Distributed intelligibility testing system |
KR100919223B1 (en) * | 2007-09-19 | 2009-09-28 | 한국전자통신연구원 | The method and apparatus for speech recognition using uncertainty information in noise environment |
US8140330B2 (en) * | 2008-06-13 | 2012-03-20 | Robert Bosch Gmbh | System and method for detecting repeated patterns in dialog systems |
US8145488B2 (en) * | 2008-09-16 | 2012-03-27 | Microsoft Corporation | Parameter clustering and sharing for variable-parameter hidden markov models |
US8160878B2 (en) * | 2008-09-16 | 2012-04-17 | Microsoft Corporation | Piecewise-based variable-parameter Hidden Markov Models and the training thereof |
US20110178800A1 (en) | 2010-01-19 | 2011-07-21 | Lloyd Watts | Distortion Measurement for Noise Suppression System |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
US8874441B2 (en) * | 2011-01-19 | 2014-10-28 | Broadcom Corporation | Noise suppression using multiple sensors of a communication device |
AU2012217606A1 (en) * | 2011-02-16 | 2013-05-09 | Visa International Service Association | Snap mobile payment apparatuses, methods and systems |
US10586227B2 (en) | 2011-02-16 | 2020-03-10 | Visa International Service Association | Snap mobile payment apparatuses, methods and systems |
BR112013021057A2 (en) | 2011-02-22 | 2020-11-10 | Visa International Service Association | universal electronic payment devices, methods and systems |
US9582598B2 (en) | 2011-07-05 | 2017-02-28 | Visa International Service Association | Hybrid applications utilizing distributed models and views apparatuses, methods and systems |
US10121129B2 (en) | 2011-07-05 | 2018-11-06 | Visa International Service Association | Electronic wallet checkout platform apparatuses, methods and systems |
US9355393B2 (en) | 2011-08-18 | 2016-05-31 | Visa International Service Association | Multi-directional wallet connector apparatuses, methods and systems |
US9710807B2 (en) | 2011-08-18 | 2017-07-18 | Visa International Service Association | Third-party value added wallet features and interfaces apparatuses, methods and systems |
US10825001B2 (en) | 2011-08-18 | 2020-11-03 | Visa International Service Association | Multi-directional wallet connector apparatuses, methods and systems |
US10242358B2 (en) | 2011-08-18 | 2019-03-26 | Visa International Service Association | Remote decoupled application persistent state apparatuses, methods and systems |
US10223730B2 (en) | 2011-09-23 | 2019-03-05 | Visa International Service Association | E-wallet store injection search apparatuses, methods and systems |
AU2013214801B2 (en) | 2012-02-02 | 2018-06-21 | Visa International Service Association | Multi-source, multi-dimensional, cross-entity, multimedia database platform apparatuses, methods and systems |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
CN105359210B (en) | 2013-06-21 | 2019-06-14 | 弗朗霍夫应用科学研究促进协会 | MDCT frequency spectrum is declined to the device and method of white noise using preceding realization by FDNS |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
US9437212B1 (en) * | 2013-12-16 | 2016-09-06 | Marvell International Ltd. | Systems and methods for suppressing noise in an audio signal for subbands in a frequency domain based on a closed-form solution |
US20150336786A1 (en) * | 2014-05-20 | 2015-11-26 | General Electric Company | Refrigerators for providing dispensing in response to voice commands |
WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
WO2016040885A1 (en) * | 2014-09-12 | 2016-03-17 | Audience, Inc. | Systems and methods for restoration of speech components |
JPWO2017037830A1 (en) * | 2015-08-31 | 2017-11-24 | 三菱電機株式会社 | Speech recognition apparatus and speech recognition processing method |
US11514314B2 (en) | 2019-11-25 | 2022-11-29 | International Business Machines Corporation | Modeling environment noise for training neural networks |
Family Cites Families (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4897878A (en) | 1985-08-26 | 1990-01-30 | Itt Corporation | Noise compensation in speech recognition apparatus |
GB8608289D0 (en) * | 1986-04-04 | 1986-05-08 | Pa Consulting Services | Noise compensation in speech recognition |
US5148489A (en) | 1990-02-28 | 1992-09-15 | Sri International | Method for spectral estimation to improve noise robustness for speech recognition |
US5604839A (en) | 1994-07-29 | 1997-02-18 | Microsoft Corporation | Method and system for improving speech recognition through front-end normalization of feature vectors |
US5924065A (en) | 1997-06-16 | 1999-07-13 | Digital Equipment Corporation | Environmently compensated speech processing |
US6633842B1 (en) * | 1999-10-22 | 2003-10-14 | Texas Instruments Incorporated | Speech recognition front-end feature extraction for noisy speech |
US6098040A (en) | 1997-11-07 | 2000-08-01 | Nortel Networks Corporation | Method and apparatus for providing an improved feature set in speech recognition by performing noise cancellation and background masking |
US6202047B1 (en) * | 1998-03-30 | 2001-03-13 | At&T Corp. | Method and apparatus for speech recognition using second order statistics and linear estimation of cepstral coefficients |
AU8804698A (en) * | 1998-06-29 | 2000-01-17 | Nokia Networks Oy | Symbol estimation using soft-output algorithm and feedback |
US6980952B1 (en) * | 1998-08-15 | 2005-12-27 | Texas Instruments Incorporated | Source normalization training for HMM modeling of speech |
US6173258B1 (en) | 1998-09-09 | 2001-01-09 | Sony Corporation | Method for reducing noise distortions in a speech recognition system |
US6418411B1 (en) | 1999-03-12 | 2002-07-09 | Texas Instruments Incorporated | Method and system for adaptive speech recognition in a noisy environment |
US6577997B1 (en) | 1999-05-28 | 2003-06-10 | Texas Instruments Incorporated | System and method of noise-dependent classification |
WO2001003113A1 (en) | 1999-07-01 | 2001-01-11 | Koninklijke Philips Electronics N.V. | Robust speech processing from noisy speech models |
US6633843B2 (en) | 2000-06-08 | 2003-10-14 | Texas Instruments Incorporated | Log-spectral compensation of PMC Gaussian mean vectors for noisy speech recognition using log-max assumption |
US6898566B1 (en) | 2000-08-16 | 2005-05-24 | Mindspeed Technologies, Inc. | Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal |
US6876966B1 (en) | 2000-10-16 | 2005-04-05 | Microsoft Corporation | Pattern recognition training method and apparatus using inserted noise followed by noise reduction |
US7003455B1 (en) | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech |
US6985858B2 (en) | 2001-03-20 | 2006-01-10 | Microsoft Corporation | Method and apparatus for removing noise from feature vectors |
US20030055640A1 (en) * | 2001-05-01 | 2003-03-20 | Ramot University Authority For Applied Research & Industrial Development Ltd. | System and method for parameter estimation for pattern recognition |
US7158933B2 (en) | 2001-05-11 | 2007-01-02 | Siemens Corporate Research, Inc. | Multi-channel speech enhancement system and method based on psychoacoustic masking effects |
US6915259B2 (en) | 2001-05-24 | 2005-07-05 | Matsushita Electric Industrial Co., Ltd. | Speaker and environment adaptation based on linear separation of variability sources |
US6959276B2 (en) | 2001-09-27 | 2005-10-25 | Microsoft Corporation | Including the category of environmental noise when processing speech signals |
US6990447B2 (en) * | 2001-11-15 | 2006-01-24 | Microsoft Corporation | Method and apparatus for denoising and deverberation using variational inference and strong speech models |
US6944590B2 (en) | 2002-04-05 | 2005-09-13 | Microsoft Corporation | Method of iterative noise estimation in a recursive framework |
US7117148B2 (en) | 2002-04-05 | 2006-10-03 | Microsoft Corporation | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
US7174292B2 (en) * | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
US7107210B2 (en) * | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
US7050975B2 (en) * | 2002-07-23 | 2006-05-23 | Microsoft Corporation | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes |
US7200557B2 (en) * | 2002-11-27 | 2007-04-03 | Microsoft Corporation | Method of reducing index sizes used to represent spectral content vectors |
2002
- 2002-09-05 US US10/236,042 patent/US7174292B2/en not_active Expired - Fee Related
2003
- 2003-05-20 AU AU2003241553A patent/AU2003241553A1/en not_active Abandoned
- 2003-05-20 WO PCT/US2003/016032 patent/WO2003100769A1/en not_active Application Discontinuation
- 2003-05-20 CN CNA038114038A patent/CN1653520A/en active Pending
- 2003-05-20 EP EP03731299A patent/EP1506542A1/en not_active Withdrawn
- 2003-05-20 KR KR10-2004-7018410A patent/KR20050000541A/en not_active Application Discontinuation
- 2003-05-20 JP JP2004508336A patent/JP2005527002A/en not_active Withdrawn
2006
- 2006-12-20 US US11/642,389 patent/US7289955B2/en not_active Expired - Fee Related
Non-Patent Citations (3)
Title |
---|
DROPPO J ET AL: "Uncertainty decoding with SPLICE for noise robust speech recognition", PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (CAT. NO.02CH37334) (ICASSP'02), ORLANDO, FL, USA, 13-17 MAY 2002, Piscataway, NJ, USA, IEEE, USA, pages I - 57-60 vol.1, XP002254998, ISBN: 0-7803-7402-9 * |
JASHA DROPPO ET AL: "Evaluation of the SPLICE Algorithm on the Aurora2 Database", 7TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY (EUROSPEECH 2001). PROCEEDINGS OF EUROSPEECH 2001, AALBORG, DENMARK, vol. 1, 3 September 2001 (2001-09-03) - 7 September 2001 (2001-09-07), pages 217 - 220, XP007004815 * |
LI DENG ET AL: "A Bayesian approach to speech feature enhancement using the dynamic cepstral prior", PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (CAT. NO.02CH37334) (ICASSP'02), ORLANDO, FL, USA, 13-17 MAY 2002, Piscataway, NJ, USA, IEEE, USA, pages I - 829-32 vol.1, XP002254999, ISBN: 0-7803-7402-9 * |
Also Published As
Publication number | Publication date |
---|---|
EP1506542A1 (en) | 2005-02-16 |
AU2003241553A1 (en) | 2003-12-12 |
US7174292B2 (en) | 2007-02-06 |
JP2005527002A (en) | 2005-09-08 |
US7289955B2 (en) | 2007-10-30 |
CN1653520A (en) | 2005-08-10 |
KR20050000541A (en) | 2005-01-05 |
US20030225577A1 (en) | 2003-12-04 |
US20070106504A1 (en) | 2007-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7289955B2 (en) | Method of determining uncertainty associated with acoustic distortion-based noise reduction | |
US7617098B2 (en) | Method of noise reduction based on dynamic aspects of speech | |
EP1398762B1 (en) | Non-linear model for removing noise from corrupted signals | |
US7769582B2 (en) | Method of pattern recognition using noise reduction uncertainty | |
JP4824286B2 (en) | A method for noise estimation using incremental Bayesian learning | |
EP1396845B1 (en) | Method of iterative noise estimation in a recursive framework | |
EP1508893B1 (en) | Method of noise reduction using instantaneous signal-to-noise ratio as the Principal quantity for optimal estimation | |
US7254536B2 (en) | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech | |
US6944590B2 (en) | Method of iterative noise estimation in a recursive framework |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| AK | Designated states | Kind code of ref document: A1. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A1. Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
| 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
| WWE | WIPO information: entry into national phase | Ref document number: 3373/DELNP/2004, country of ref document: IN. Ref document number: 2003731299, country of ref document: EP |
| WWE | WIPO information: entry into national phase | Ref document number: 1020047018410, country of ref document: KR |
| WWE | WIPO information: entry into national phase | Ref document number: 2004508336, country of ref document: JP. Ref document number: 20038114038, country of ref document: CN |
| WWP | WIPO information: published in national office | Ref document number: 1020047018410, country of ref document: KR |
| WWP | WIPO information: published in national office | Ref document number: 2003731299, country of ref document: EP |
| WWW | WIPO information: withdrawn in national office | Ref document number: 2003731299, country of ref document: EP |