WO2005098817A3 - System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode

Info

Publication number
WO2005098817A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
dictation
speak
constrained
text
Application number
PCT/US2005/009385
Other languages
French (fr)
Other versions
WO2005098817A2 (en)
Inventor
Ashwin Rao
Original Assignee
Ashwin Rao
Priority date
2004-03-25
Filing date
2005-03-21
Publication date
2006-11-23
Application filed by Ashwin Rao
Priority to EP05730114A (EP1743325A4)
Publication of WO2005098817A2
Publication of WO2005098817A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193 Formal grammars, e.g. finite state automata, context free grammars or word networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Abstract

To improve the accuracy of a speech recognition system in the specific task of speech-to-text (dictation-style) conversion, a constrained dictation methodology using a speak-and-spell mode is disclosed. The invention is well suited to "text-messaging" applications, in which the number of words being dictated is very small (limited by the 140-160 character message length constraint). Additionally, the invention constrains the way users interact with the machine, which makes the speech recognition task easier and improves system accuracy.
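
As an illustration only, the sketch below shows one way a spelled letter sequence could constrain a recognizer's word hypotheses, and how a message length limit could be enforced. The abstract does not describe the mechanism at this level of detail. The `Hypothesis` type, its scores, the `pick_word` and `dictate` functions, the fallback to the spelled letters, and the choice of 160 as the limit are all assumptions made for this example.

```python
from dataclasses import dataclass

# Assumed limit: the upper end of the 140-160 character range in the abstract.
MAX_MESSAGE_CHARS = 160


@dataclass
class Hypothesis:
    """Hypothetical recognizer output: a candidate word and its confidence."""
    word: str
    score: float  # higher is better


def pick_word(hypotheses, spelled_letters):
    """Return the best-scoring hypothesis whose spelling matches the spelled letters.

    Falls back to the spelled letters themselves when no hypothesis matches
    (an assumption of this sketch, not something stated in the source).
    """
    spelled = "".join(spelled_letters).lower()
    matching = [h for h in hypotheses if h.word.lower() == spelled]
    if matching:
        return max(matching, key=lambda h: h.score).word
    return spelled


def dictate(turns, max_chars=MAX_MESSAGE_CHARS):
    """Build a message from (hypotheses, spelled_letters) turns.

    Stops adding words as soon as the next word would exceed max_chars.
    """
    words = []
    length = 0
    for hypotheses, letters in turns:
        word = pick_word(hypotheses, letters)
        extra = len(word) + (1 if words else 0)  # one space between words
        if length + extra > max_chars:
            break
        words.append(word)
        length += extra
    return " ".join(words)
```

In this sketch, if the hypothetical recognizer scores "meat" above "meet", spelling m-e-e-t still selects "meet", because only hypotheses that agree with the spelling are considered.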
PCT/US2005/009385 2004-03-25 2005-03-21 System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode WO2005098817A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP05730114A EP1743325A4 (en) 2004-03-25 2005-03-21 System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US55629604P 2004-03-25 2004-03-25
US60/556,296 2004-03-25

Publications (2)

Publication Number Publication Date
WO2005098817A2 WO2005098817A2 (en) 2005-10-20
WO2005098817A3 WO2005098817A3 (en) 2006-11-23

Family

ID=35125762

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
PCT/US2005/009385 WO2005098817A2 (en) 2004-03-25 2005-03-21 System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode

Country Status (3)

Country Link
US (1) US7676364B2 (en)
EP (1) EP1743325A4 (en)
WO (1) WO2005098817A2 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200614010A (en) * 2004-10-28 2006-05-01 Xcome Technology Co Ltd Instant messenger system with transformation model and implementation method
US20060173680A1 (en) * 2005-01-12 2006-08-03 Jan Verhasselt Partial spelling in speech recognition
US7831431B2 (en) 2006-10-31 2010-11-09 Honda Motor Co., Ltd. Voice recognition updates via remote broadcast signal
US8416927B2 (en) * 2007-04-12 2013-04-09 Ditech Networks, Inc. System and method for limiting voicemail transcription
EP2308042B1 (en) 2008-06-27 2011-11-02 Koninklijke Philips Electronics N.V. Method and device for generating vocabulary entries from acoustic data
US9111546B2 (en) * 2013-03-06 2015-08-18 Nuance Communications, Inc. Speech recognition and interpretation system
US9202459B2 (en) 2013-04-19 2015-12-01 GM Global Technology Operations LLC Methods and systems for managing dialog of speech systems
US20180358004A1 (en) * 2017-06-07 2018-12-13 Lenovo (Singapore) Pte. Ltd. Apparatus, method, and program product for spelling words
US20190279623A1 (en) * 2018-03-08 2019-09-12 Kika Tech (Cayman) Holdings Co., Limited Method for speech recognition dictation and correction by spelling input, system and storage medium
CN110827799B (en) * 2019-11-21 2022-06-10 百度在线网络技术(北京)有限公司 Method, apparatus, device and medium for processing voice signal
US11356492B2 (en) * 2020-09-16 2022-06-07 Kyndryl, Inc. Preventing audio dropout

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
US6912498B2 (en) * 2000-05-02 2005-06-28 Scansoft, Inc. Error correction in speech recognition by correcting text around selected area

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5231670A (en) * 1987-06-01 1993-07-27 Kurzweil Applied Intelligence, Inc. Voice controlled system and method for generating text from a voice controlled input
US5210689A (en) * 1990-12-28 1993-05-11 Semantic Compaction Systems System and method for automatically selecting among a plurality of input modes
US5444768A (en) * 1991-12-31 1995-08-22 International Business Machines Corporation Portable computer device for audible processing of remotely stored messages
US5855000A (en) * 1995-09-08 1998-12-29 Carnegie Mellon University Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input
US5991720A (en) * 1996-05-06 1999-11-23 Matsushita Electric Industrial Co., Ltd. Speech recognition system employing multiple grammar networks
US6487532B1 (en) * 1997-09-24 2002-11-26 Scansoft, Inc. Apparatus and method for distinguishing similar-sounding utterances speech recognition
US6064963A (en) * 1997-12-17 2000-05-16 Opus Telecom, L.L.C. Automatic key word or phrase speech recognition for the corrections industry
US6198808B1 (en) * 1997-12-31 2001-03-06 Weblink Wireless, Inc. Controller for use with communications systems for converting a voice message to a text message
US6067514A (en) * 1998-06-23 2000-05-23 International Business Machines Corporation Method for automatically punctuating a speech utterance in a continuous speech recognition system
US6965863B1 (en) * 1998-11-12 2005-11-15 Microsoft Corporation Speech recognition user interface
US6327566B1 (en) * 1999-06-16 2001-12-04 International Business Machines Corporation Method and apparatus for correcting misinterpreted voice commands in a speech recognition system
US6615131B1 (en) * 1999-12-21 2003-09-02 Televigation, Inc. Method and system for an efficient operating environment in a real-time navigation system
US6694296B1 (en) * 2000-07-20 2004-02-17 Microsoft Corporation Method and apparatus for the recognition of spelled spoken words
US7526431B2 (en) * 2001-09-05 2009-04-28 Voice Signal Technologies, Inc. Speech recognition using ambiguous or phone key spelling and/or filtering
US7143037B1 (en) * 2002-06-12 2006-11-28 Cisco Technology, Inc. Spelling words using an arbitrary phonetic alphabet

Also Published As

Publication number Publication date
US20050216272A1 (en) 2005-09-29
EP1743325A4 (en) 2008-05-14
EP1743325A2 (en) 2007-01-17
WO2005098817A2 (en) 2005-10-20
US7676364B2 (en) 2010-03-09

Similar Documents

Publication Publication Date Title
WO2005098817A3 (en) System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode
US7676371B2 (en) Oral modification of an ASR lexicon of an ASR engine
Aleksic et al. Bringing contextual information to google speech recognition.
Deng et al. Challenges in adopting speech recognition
CN102237088B (en) Device and method for acquiring speech recognition multi-information text
WO2008067562A3 (en) Multimodal speech recognition system
WO2009016631A3 (en) Automatic context sensitive language correction and enhancement using an internet corpus
WO2004100638A3 (en) Source-dependent text-to-speech system
WO2008073850A3 (en) Method and apparatus for reading education
AU2003215239A1 (en) Voice-controlled user interfaces
AU2003215226A1 (en) Voice-controlled data entry
WO2007118020A3 (en) Method and system for managing pronunciation dictionaries in a speech application
AU2002214658A1 (en) Speech recognition using word-in-phrase command
TW200802306A (en) Voice modifier for speech processing systems
ATE325413T1 (en) METHOD AND DEVICE FOR CONVERTING SPOKEN TEXTS INTO WRITTEN AND CORRECTING THE RECOGNIZED TEXTS
WO2009026270A3 (en) Hmm-based bilingual (mandarin-english) tts techniques
WO2006070373A3 (en) A system and a method for representing unrecognized words in speech to text conversions as syllables
CA2486125A1 (en) A system and method of using meta-data in speech-processing
WO2008005711A3 (en) Non-enrolled continuous dictation
WO2020175810A1 (en) Electronic apparatus and method for controlling thereof
Fellbaum et al. Principles of electronic speech processing with applications for people with disabilities
JP2004053742A (en) Speech recognition device
WO2005015546A8 (en) Speech input interface for dialog systems
JP2006259641A (en) Voice recognition device and program
WO2003025787A1 (en) Sentence creation apparatus and creation method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase

Ref document number: 5056/DELNP/2006

Country of ref document: IN

NENP Non-entry into the national phase

Ref country code: DE

WWW WIPO information: withdrawn in national office

Ref document number: DE

WWE WIPO information: entry into national phase

Ref document number: 2005730114

Country of ref document: EP

WWP WIPO information: published in national office

Ref document number: 2005730114

Country of ref document: EP