WO2005098817A3 - System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode - Google Patents
System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode Download PDFInfo
- Publication number
- WO2005098817A3 WO2005098817A3 PCT/US2005/009385 US2005009385W WO2005098817A3 WO 2005098817 A3 WO2005098817 A3 WO 2005098817A3 US 2005009385 W US2005009385 W US 2005009385W WO 2005098817 A3 WO2005098817 A3 WO 2005098817A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- dictation
- speak
- constrained
- text
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05730114A EP1743325A4 (en) | 2004-03-25 | 2005-03-21 | System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US55629604P | 2004-03-25 | 2004-03-25 | |
US60/556,296 | 2004-03-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2005098817A2 WO2005098817A2 (en) | 2005-10-20 |
WO2005098817A3 true WO2005098817A3 (en) | 2006-11-23 |
Family
ID=35125762
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2005/009385 WO2005098817A2 (en) | 2004-03-25 | 2005-03-21 | System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode |
Country Status (3)
Country | Link |
---|---|
US (1) | US7676364B2 (en) |
EP (1) | EP1743325A4 (en) |
WO (1) | WO2005098817A2 (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW200614010A (en) * | 2004-10-28 | 2006-05-01 | Xcome Technology Co Ltd | Instant messenger system with transformation model and implementation method |
US20060173680A1 (en) * | 2005-01-12 | 2006-08-03 | Jan Verhasselt | Partial spelling in speech recognition |
US7831431B2 (en) | 2006-10-31 | 2010-11-09 | Honda Motor Co., Ltd. | Voice recognition updates via remote broadcast signal |
US8416927B2 (en) * | 2007-04-12 | 2013-04-09 | Ditech Networks, Inc. | System and method for limiting voicemail transcription |
EP2308042B1 (en) | 2008-06-27 | 2011-11-02 | Koninklijke Philips Electronics N.V. | Method and device for generating vocabulary entries from acoustic data |
US9111546B2 (en) * | 2013-03-06 | 2015-08-18 | Nuance Communications, Inc. | Speech recognition and interpretation system |
US9202459B2 (en) | 2013-04-19 | 2015-12-01 | GM Global Technology Operations LLC | Methods and systems for managing dialog of speech systems |
US20180358004A1 (en) * | 2017-06-07 | 2018-12-13 | Lenovo (Singapore) Pte. Ltd. | Apparatus, method, and program product for spelling words |
US20190279623A1 (en) * | 2018-03-08 | 2019-09-12 | Kika Tech (Cayman) Holdings Co., Limited | Method for speech recognition dictation and correction by spelling input, system and storage medium |
CN110827799B (en) * | 2019-11-21 | 2022-06-10 | 百度在线网络技术(北京)有限公司 | Method, apparatus, device and medium for processing voice signal |
US11356492B2 (en) * | 2020-09-16 | 2022-06-07 | Kyndryl, Inc. | Preventing audio dropout |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4914704A (en) * | 1984-10-30 | 1990-04-03 | International Business Machines Corporation | Text editor for speech input |
US6912498B2 (en) * | 2000-05-02 | 2005-06-28 | Scansoft, Inc. | Error correction in speech recognition by correcting text around selected area |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5231670A (en) * | 1987-06-01 | 1993-07-27 | Kurzweil Applied Intelligence, Inc. | Voice controlled system and method for generating text from a voice controlled input |
US5210689A (en) * | 1990-12-28 | 1993-05-11 | Semantic Compaction Systems | System and method for automatically selecting among a plurality of input modes |
US5444768A (en) * | 1991-12-31 | 1995-08-22 | International Business Machines Corporation | Portable computer device for audible processing of remotely stored messages |
US5855000A (en) * | 1995-09-08 | 1998-12-29 | Carnegie Mellon University | Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input |
US5991720A (en) * | 1996-05-06 | 1999-11-23 | Matsushita Electric Industrial Co., Ltd. | Speech recognition system employing multiple grammar networks |
US6487532B1 (en) * | 1997-09-24 | 2002-11-26 | Scansoft, Inc. | Apparatus and method for distinguishing similar-sounding utterances speech recognition |
US6064963A (en) * | 1997-12-17 | 2000-05-16 | Opus Telecom, L.L.C. | Automatic key word or phrase speech recognition for the corrections industry |
US6198808B1 (en) * | 1997-12-31 | 2001-03-06 | Weblink Wireless, Inc. | Controller for use with communications systems for converting a voice message to a text message |
US6067514A (en) * | 1998-06-23 | 2000-05-23 | International Business Machines Corporation | Method for automatically punctuating a speech utterance in a continuous speech recognition system |
US6965863B1 (en) * | 1998-11-12 | 2005-11-15 | Microsoft Corporation | Speech recognition user interface |
US6327566B1 (en) * | 1999-06-16 | 2001-12-04 | International Business Machines Corporation | Method and apparatus for correcting misinterpreted voice commands in a speech recognition system |
US6615131B1 (en) * | 1999-12-21 | 2003-09-02 | Televigation, Inc. | Method and system for an efficient operating environment in a real-time navigation system |
US6694296B1 (en) * | 2000-07-20 | 2004-02-17 | Microsoft Corporation | Method and apparatus for the recognition of spelled spoken words |
US7526431B2 (en) * | 2001-09-05 | 2009-04-28 | Voice Signal Technologies, Inc. | Speech recognition using ambiguous or phone key spelling and/or filtering |
US7143037B1 (en) * | 2002-06-12 | 2006-11-28 | Cisco Technology, Inc. | Spelling words using an arbitrary phonetic alphabet |
-
2005
- 2005-03-21 WO PCT/US2005/009385 patent/WO2005098817A2/en active Application Filing
- 2005-03-21 US US11/084,964 patent/US7676364B2/en not_active Expired - Fee Related
- 2005-03-21 EP EP05730114A patent/EP1743325A4/en not_active Withdrawn
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4914704A (en) * | 1984-10-30 | 1990-04-03 | International Business Machines Corporation | Text editor for speech input |
US6912498B2 (en) * | 2000-05-02 | 2005-06-28 | Scansoft, Inc. | Error correction in speech recognition by correcting text around selected area |
Also Published As
Publication number | Publication date |
---|---|
US20050216272A1 (en) | 2005-09-29 |
EP1743325A4 (en) | 2008-05-14 |
EP1743325A2 (en) | 2007-01-17 |
WO2005098817A2 (en) | 2005-10-20 |
US7676364B2 (en) | 2010-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2005098817A3 (en) | System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode | |
US7676371B2 (en) | Oral modification of an ASR lexicon of an ASR engine | |
Aleksic et al. | Bringing contextual information to google speech recognition. | |
Deng et al. | Challenges in adopting speech recognition | |
CN102237088B (en) | Device and method for acquiring speech recognition multi-information text | |
WO2008067562A3 (en) | Multimodal speech recognition system | |
WO2009016631A3 (en) | Automatic context sensitive language correction and enhancement using an internet corpus | |
WO2004100638A3 (en) | Source-dependent text-to-speech system | |
WO2008073850A3 (en) | Method and apparatus for reading education | |
AU2003215239A1 (en) | Voice-controlled user interfaces | |
AU2003215226A1 (en) | Voice-controlled data entry | |
WO2007118020A3 (en) | Method and system for managing pronunciation dictionaries in a speech application | |
AU2002214658A1 (en) | Speech recognition using word-in-phrase command | |
TW200802306A (en) | Voice modifier for speech processing systems | |
ATE325413T1 (en) | METHOD AND DEVICE FOR CONVERTING SPOKEN TEXTS INTO WRITTEN AND CORRECTING THE RECOGNIZED TEXTS | |
WO2009026270A3 (en) | Hmm-based bilingual (mandarin-english) tts techniques | |
WO2006070373A3 (en) | A system and a method for representing unrecognized words in speech to text conversions as syllables | |
CA2486125A1 (en) | A system and method of using meta-data in speech-processing | |
WO2008005711A3 (en) | Non-enrolled continuous dictation | |
WO2020175810A1 (en) | Electronic apparatus and method for controlling thereof | |
Fellbaum et al. | Principles of electronic speech processing with applications for people with disabilities | |
JP2004053742A (en) | Speech recognition device | |
WO2005015546A8 (en) | Speech input interface for dialog systems | |
JP2006259641A (en) | Voice recognition device and program | |
WO2003025787A1 (en) | Sentence creation apparatus and creation method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 5056/DELNP/2006 Country of ref document: IN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005730114 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2005730114 Country of ref document: EP |