DE60100637D1 - Method for noise adaptation in automatic speech recognition using transformed matrices

Method for noise adaptation in automatic speech recognition using transformed matrices

Info

Publication number
DE60100637D1
Authority
DE
Germany
Prior art keywords
speech recognition
automatic speech
noise adaptation
transformed matrices
matrices
Prior art date
Legal status
Expired - Fee Related
Application number
DE60100637T
Other languages
English (en)
Other versions
DE60100637T2 (de)
Inventor
Christophe Cerisara
Luca Rigazio
Robert Boman
Jean-Claude Junqua
Current Assignee
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd
Application granted
Publication of DE60100637D1
Publication of DE60100637T2
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Circuit For Audible Band Transducer (AREA)
DE60100637T 2000-04-18 2001-04-18 Method for noise adaptation in automatic speech recognition using transformed matrices Expired - Fee Related DE60100637T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/551,001 US6529872B1 (en) 2000-04-18 2000-04-18 Method for noise adaptation in automatic speech recognition using transformed matrices
US551001 2000-04-18

Publications (2)

Publication Number Publication Date
DE60100637D1 (de) 2003-10-02
DE60100637T2 (de) 2004-06-17

Family

ID=24199418

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60100637T Expired - Fee Related DE60100637T2 (de) 2000-04-18 2001-04-18 Method for noise adaptation in automatic speech recognition using transformed matrices

Country Status (4)

Country Link
US (2) US6529872B1 (de)
EP (1) EP1148471B1 (de)
JP (1) JP3848845B2 (de)
DE (1) DE60100637T2 (de)

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7387253B1 (en) 1996-09-03 2008-06-17 Hand Held Products, Inc. Optical reader system comprising local host processor and optical reader
ATE336776T1 (de) * 2000-02-25 2006-09-15 Koninkl Philips Electronics Nv Device for speech recognition with reference transformation means
US6631348B1 (en) * 2000-08-08 2003-10-07 Intel Corporation Dynamic speech recognition pattern switching for enhanced speech recognition accuracy
US7457750B2 (en) * 2000-10-13 2008-11-25 At&T Corp. Systems and methods for dynamic re-configurable speech recognition
US7003455B1 (en) 2000-10-16 2006-02-21 Microsoft Corporation Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech
US6876966B1 (en) * 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction
US20020087306A1 (en) * 2000-12-29 2002-07-04 Lee Victor Wai Leung Computer-implemented noise normalization method and system
EP1229516A1 (de) * 2001-01-26 2002-08-07 Telefonaktiebolaget L M Ericsson (Publ) Method, device, terminal and system for the automatic recognition of distorted speech data
US7062433B2 (en) * 2001-03-14 2006-06-13 Texas Instruments Incorporated Method of speech recognition with compensation for both channel distortion and background noise
US6985858B2 (en) * 2001-03-20 2006-01-10 Microsoft Corporation Method and apparatus for removing noise from feature vectors
US6912497B2 (en) * 2001-03-28 2005-06-28 Texas Instruments Incorporated Calibration of speech data acquisition path
US7165028B2 (en) * 2001-12-12 2007-01-16 Texas Instruments Incorporated Method of speech recognition resistant to convolutive distortion and additive distortion
US7117148B2 (en) * 2002-04-05 2006-10-03 Microsoft Corporation Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
GB2389217A (en) * 2002-05-27 2003-12-03 Canon Kk Speech recognition system
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
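JP4033299B2 (ja) * 2003-03-12 2008-01-16 株式会社エヌ・ティ・ティ・ドコモ Noise adaptation system for speech models, noise adaptation method, and speech recognition noise adaptation program
JP4333369B2 (ja) * 2004-01-07 2009-09-16 株式会社デンソー Noise removal device, speech recognition device, and car navigation device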
US7729909B2 (en) * 2005-03-04 2010-06-01 Panasonic Corporation Block-diagonal covariance joint subspace tying and model compensation for noise robust automatic speech recognition
US7729908B2 (en) * 2005-03-04 2010-06-01 Panasonic Corporation Joint signal and model based noise matching noise robustness method for automatic speech recognition
US7693713B2 (en) * 2005-06-17 2010-04-06 Microsoft Corporation Speech models generated using competitive training, asymmetric training, and data boosting
US7584097B2 (en) * 2005-08-03 2009-09-01 Texas Instruments Incorporated System and method for noisy automatic speech recognition employing joint compensation of additive and convolutive distortions
US20070033027A1 (en) * 2005-08-03 2007-02-08 Texas Instruments, Incorporated Systems and methods employing stochastic bias compensation and bayesian joint additive/convolutive compensation in automatic speech recognition
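JP2007114413A (ja) * 2005-10-19 2007-05-10 Toshiba Corp Speech/non-speech discrimination device, speech segment detection device, speech/non-speech discrimination method, speech segment detection method, speech/non-speech discrimination program, and speech segment detection program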
US7877255B2 (en) * 2006-03-31 2011-01-25 Voice Signal Technologies, Inc. Speech recognition using channel verification
WO2007131530A1 (en) * 2006-05-16 2007-11-22 Loquendo S.P.A. Intersession variability compensation for automatic extraction of information from voice
JP4282704B2 (ja) * 2006-09-27 2009-06-24 株式会社東芝 Speech segment detection device and program
US8180637B2 (en) * 2007-12-03 2012-05-15 Microsoft Corporation High performance HMM adaptation with joint compensation of additive and convolutive distortions
JP4950930B2 (ja) * 2008-04-03 2012-06-13 株式会社東芝 Device, method, and program for determining speech/non-speech
US8214215B2 (en) * 2008-09-24 2012-07-03 Microsoft Corporation Phase sensitive model adaptation for noisy speech recognition
KR101239318B1 (ko) * 2008-12-22 2013-03-05 한국전자통신연구원 Sound quality enhancement apparatus and speech recognition system and method
US8433564B2 (en) * 2009-07-02 2013-04-30 Alon Konchitsky Method for wind noise reduction
KR20120054845A (ko) * 2010-11-22 2012-05-31 삼성전자주식회사 Speech recognition method for a robot
JP5966689B2 (ja) * 2012-07-04 2016-08-10 日本電気株式会社 Acoustic model adaptation device, acoustic model adaptation method, and acoustic model adaptation program
US9898723B2 (en) 2012-12-19 2018-02-20 Visa International Service Association System and method for voice authentication
US8949224B2 (en) 2013-01-15 2015-02-03 Amazon Technologies, Inc. Efficient query processing using histograms in a columnar database
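CN103903630A (zh) * 2014-03-18 2014-07-02 北京捷通华声语音技术有限公司 Method and device for eliminating sparse noise
JP6464650B2 (ja) * 2014-10-03 2019-02-06 日本電気株式会社 Speech processing device, speech processing method, and program
CN106384588B (zh) * 2016-09-08 2019-09-10 河海大学 Joint compensation method for additive noise and short-time reverberation based on vector Taylor series
JP6767326B2 (ja) * 2017-09-08 2020-10-14 日本電信電話株式会社 Sensor signal processing method, sensor signal processing device, and program
CN110570845B (zh) * 2019-08-15 2021-10-22 武汉理工大学 Speech recognition method based on domain-invariant features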
US11335329B2 (en) * 2019-08-28 2022-05-17 Tata Consultancy Services Limited Method and system for generating synthetic multi-conditioned data sets for robust automatic speech recognition
CN113223505B (zh) * 2021-04-30 2023-12-08 珠海格力电器股份有限公司 Model training and data processing method, apparatus, electronic device, and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5226092A (en) * 1991-06-28 1993-07-06 Digital Equipment Corporation Method and apparatus for learning in a neural network
US6026359A (en) * 1996-09-20 2000-02-15 Nippon Telegraph And Telephone Corporation Scheme for model adaptation in pattern recognition based on Taylor expansion
US6182270B1 (en) * 1996-12-04 2001-01-30 Lucent Technologies Inc. Low-displacement rank preconditioners for simplified non-linear analysis of circuits and other devices
US6154716A (en) * 1998-07-29 2000-11-28 Lucent Technologies - Inc. System and method for simulating electronic circuits

Also Published As

Publication number Publication date
US6691091B1 (en) 2004-02-10
DE60100637T2 (de) 2004-06-17
JP3848845B2 (ja) 2006-11-22
EP1148471B1 (de) 2003-08-27
EP1148471A1 (de) 2001-10-24
JP2001356791A (ja) 2001-12-26
US6529872B1 (en) 2003-03-04

Similar Documents

Publication Publication Date Title
DE60100637D1 (de) Method for noise adaptation in automatic speech recognition using transformed matrices
DE60024506D1 (de) Method for multi-stage speech recognition using a confidence measure
DE60316912D1 (de) Method for speech recognition
DE60120048D1 (de) Method for selecting an object
DE60136901D1 (de) Method for manufacturing a multifunctional acoustic device
DE60309822D1 (de) Method and device for speech recognition
ATE339484T1 (de) Method for producing Fischer-Tropsch waxes
DE602004022130D1 (de) Method for character recognition
DE60144508D1 (de) Method for preparing samples
DE60221699D1 (de) Method for coloring molded articles
DE60108373D1 (de) Method for detecting emotions in speech signals using speaker identification
DE60107308D1 (de) Method for generating a watermark for audio signals
ATE300520T1 (de) Method for producing amlodipine maleate
DE602004023364D1 (de) Device and method for speech recognition
DE60028219D1 (de) Method for speech recognition
DE60212725D1 (de) Method for automatic speech recognition
DE60124884D1 (de) Method for improving photomask geometry
DE60134650D1 (de) Method for producing honeycomb structures
DE602004028008D1 (de) Method for statistical language modeling in speech recognition
DE60032776D1 (de) Method for speech recognition
DE602004014675D1 (de) Method and device for speech recognition
DE60125906D1 (de) Method for improving performance
DE60108104D1 (de) Method for speaker identification
DE50109658D1 (de) Device and method for voice control
DE69808339D1 (de) Method for speech coding in the presence of background noise

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee
8370 Indication of lapse of patent is to be deleted
8339 Ceased/non-payment of the annual fee