US9530415B2 - System and method of providing speech processing in user interface - Google Patents
System and method of providing speech processing in user interface Download PDFInfo
- Publication number
- US9530415B2 US9530415B2 US14/928,193 US201514928193A US9530415B2 US 9530415 B2 US9530415 B2 US 9530415B2 US 201514928193 A US201514928193 A US 201514928193A US 9530415 B2 US9530415 B2 US 9530415B2
- Authority
- US
- United States
- Prior art keywords
- speech
- user
- specific field
- transcription
- indication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 51
- 238000012545 processing Methods 0.000 title claims abstract description 30
- 230000008569 process Effects 0.000 claims abstract description 22
- 230000011664 signaling Effects 0.000 claims abstract description 7
- 238000012546 transfer Methods 0.000 claims description 5
- 230000009471 action Effects 0.000 claims description 3
- 238000013518 transcription Methods 0.000 claims 15
- 230000035897 transcription Effects 0.000 claims 15
- 230000000977 initiatory effect Effects 0.000 claims 2
- 238000005516 engineering process Methods 0.000 abstract description 16
- 230000026676 system process Effects 0.000 abstract description 2
- 235000006085 Vigna mungo var mungo Nutrition 0.000 description 22
- 240000005616 Vigna mungo var. mungo Species 0.000 description 22
- 238000004891 communication Methods 0.000 description 9
- 230000015654 memory Effects 0.000 description 9
- 230000008901 benefit Effects 0.000 description 7
- 230000004044 response Effects 0.000 description 7
- 239000008186 active pharmaceutical agent Substances 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000004888 barrier function Effects 0.000 description 5
- 230000003993 interaction Effects 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000009118 appropriate response Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000013497 data interchange Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000000135 prohibitive effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000007723 transport mechanism Effects 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/041—Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means
- G06F3/0416—Control or interface arrangements specially adapted for digitisers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/162—Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Abstract
Description
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/928,193 US9530415B2 (en) | 2008-01-22 | 2015-10-30 | System and method of providing speech processing in user interface |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US2266808P | 2008-01-22 | 2008-01-22 | |
US12/128,345 US9177551B2 (en) | 2008-01-22 | 2008-05-28 | System and method of providing speech processing in user interface |
US14/928,193 US9530415B2 (en) | 2008-01-22 | 2015-10-30 | System and method of providing speech processing in user interface |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/128,345 Continuation US9177551B2 (en) | 2008-01-22 | 2008-05-28 | System and method of providing speech processing in user interface |
Publications (2)
Publication Number | Publication Date |
---|---|
US20160049151A1 US20160049151A1 (en) | 2016-02-18 |
US9530415B2 true US9530415B2 (en) | 2016-12-27 |
Family
ID=40877145
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/128,345 Active 2032-01-14 US9177551B2 (en) | 2008-01-22 | 2008-05-28 | System and method of providing speech processing in user interface |
US14/928,193 Active US9530415B2 (en) | 2008-01-22 | 2015-10-30 | System and method of providing speech processing in user interface |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/128,345 Active 2032-01-14 US9177551B2 (en) | 2008-01-22 | 2008-05-28 | System and method of providing speech processing in user interface |
Country Status (1)
Country | Link |
---|---|
US (2) | US9177551B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6747061B2 (en) | 2000-03-21 | 2004-06-08 | Atherogenics, Inc. | N-substituted dithiocarbamates for the treatment of biological disorders |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9009797B1 (en) * | 2008-06-13 | 2015-04-14 | West Corporation | MRCP resource access control mechanism for mobile devices |
US9008618B1 (en) * | 2008-06-13 | 2015-04-14 | West Corporation | MRCP gateway for mobile devices |
US8700008B2 (en) | 2008-06-27 | 2014-04-15 | Microsoft Corporation | Providing data service options in push-to-talk using voice recognition |
US8577685B2 (en) * | 2008-10-24 | 2013-11-05 | At&T Intellectual Property I, L.P. | System and method for targeted advertising |
US8473595B2 (en) * | 2009-12-30 | 2013-06-25 | Bmc Software, Inc. | Method and system to automatically adapt web services from one protocol/idiom to another protocol/idiom |
CN110347834A (en) * | 2010-02-18 | 2019-10-18 | 株式会社尼康 | Information processing unit, mancarried device and information processing system |
US20110257958A1 (en) * | 2010-04-15 | 2011-10-20 | Michael Rogler Kildevaeld | Virtual smart phone |
US20120059655A1 (en) * | 2010-09-08 | 2012-03-08 | Nuance Communications, Inc. | Methods and apparatus for providing input to a speech-enabled application program |
WO2012090196A1 (en) * | 2010-12-30 | 2012-07-05 | Melamed Gal | Method and system for processing content |
EP2986014A1 (en) | 2011-08-05 | 2016-02-17 | Samsung Electronics Co., Ltd. | Method for controlling electronic apparatus based on voice recognition and motion recognition, and electronic apparatus applying the same |
WO2013022218A2 (en) * | 2011-08-05 | 2013-02-14 | Samsung Electronics Co., Ltd. | Electronic apparatus and method for providing user interface thereof |
US8930189B2 (en) * | 2011-10-28 | 2015-01-06 | Microsoft Corporation | Distributed user input to text generated by a speech to text transcription service |
US9129605B2 (en) | 2012-03-30 | 2015-09-08 | Src, Inc. | Automated voice and speech labeling |
CN104487932B (en) * | 2012-05-07 | 2017-10-10 | 思杰系统有限公司 | Speech recognition for remote application and desktop is supported |
US10026394B1 (en) * | 2012-08-31 | 2018-07-17 | Amazon Technologies, Inc. | Managing dialogs on a speech recognition platform |
RU2530268C2 (en) | 2012-11-28 | 2014-10-10 | Общество с ограниченной ответственностью "Спиктуит" | Method for user training of information dialogue system |
CN103076893B (en) * | 2012-12-31 | 2016-08-17 | 百度在线网络技术(北京)有限公司 | A kind of method and apparatus for realizing phonetic entry |
US10135904B2 (en) | 2015-01-27 | 2018-11-20 | Stealth Security, Inc. | Network attack detection on a mobile API of a web service |
US9966073B2 (en) * | 2015-05-27 | 2018-05-08 | Google Llc | Context-sensitive dynamic update of voice to text model in a voice-enabled electronic device |
US10083697B2 (en) | 2015-05-27 | 2018-09-25 | Google Llc | Local persisting of data for selectively offline capable voice action in a voice-enabled electronic device |
US10635505B2 (en) * | 2015-06-30 | 2020-04-28 | Coursera, Inc. | Automated batch application programming interfaces |
US10248452B2 (en) | 2016-05-20 | 2019-04-02 | Microsoft Technology Licensing, Llc | Interaction framework for executing user instructions with online services |
US10735954B2 (en) * | 2016-09-02 | 2020-08-04 | Blackberry Limited | Method and device for facilitating authentication over a wireless network |
US11295735B1 (en) * | 2017-12-13 | 2022-04-05 | Amazon Technologies, Inc. | Customizing voice-control for developer devices |
US10192554B1 (en) | 2018-02-26 | 2019-01-29 | Sorenson Ip Holdings, Llc | Transcription of communications using multiple speech recognition systems |
US10573312B1 (en) | 2018-12-04 | 2020-02-25 | Sorenson Ip Holdings, Llc | Transcription generation from multiple speech recognition systems |
US11170761B2 (en) | 2018-12-04 | 2021-11-09 | Sorenson Ip Holdings, Llc | Training of speech recognition systems |
US11017778B1 (en) | 2018-12-04 | 2021-05-25 | Sorenson Ip Holdings, Llc | Switching between speech recognition systems |
US10388272B1 (en) | 2018-12-04 | 2019-08-20 | Sorenson Ip Holdings, Llc | Training speech recognition systems using word sequences |
US11176942B2 (en) * | 2019-11-26 | 2021-11-16 | Vui, Inc. | Multi-modal conversational agent platform |
US11488604B2 (en) | 2020-08-19 | 2022-11-01 | Sorenson Ip Holdings, Llc | Transcription of audio |
US20220215056A1 (en) * | 2021-01-04 | 2022-07-07 | Oracle International Corporation | Drill back to original audio clip in virtual assistant initiated lists and reminders |
Citations (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5865626A (en) | 1996-08-30 | 1999-02-02 | Gte Internetworking Incorporated | Multi-dialect speech recognition method and apparatus |
US6023676A (en) | 1996-12-12 | 2000-02-08 | Dspc Israel, Ltd. | Keyword recognition system and method |
US20020004746A1 (en) | 2000-04-17 | 2002-01-10 | Ferber John B. | E-coupon channel and method for delivery of e-coupons to wireless devices |
US6343270B1 (en) | 1998-12-09 | 2002-01-29 | International Business Machines Corporation | Method for increasing dialect precision and usability in speech recognition and text-to-speech systems |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US20030033146A1 (en) | 2001-08-03 | 2003-02-13 | Morin Philippe R. | Method for efficient, safe and reliable data entry by voice under adverse conditions |
US20030046081A1 (en) | 2000-10-06 | 2003-03-06 | Myung-Wan Koo | Auto attendant system and its method and call forwarding method using speech recognition |
US20030115060A1 (en) | 2001-12-13 | 2003-06-19 | Junqua Jean-Claude | System and interactive form filling with fusion of data from multiple unreliable information sources |
US6594629B1 (en) | 1999-08-06 | 2003-07-15 | International Business Machines Corporation | Methods and apparatus for audio-visual speech detection and recognition |
US20030182131A1 (en) | 2002-03-25 | 2003-09-25 | Arnold James F. | Method and apparatus for providing speech-driven routing between spoken language applications |
US20030204498A1 (en) * | 2002-04-30 | 2003-10-30 | Lehnert Bernd R. | Customer interaction reporting |
US20030216960A1 (en) | 2002-05-16 | 2003-11-20 | Richard Postrel | System and method for offering geocentric-based incentives and executing a commercial transaction via a wireless device |
US20040059575A1 (en) | 2002-09-25 | 2004-03-25 | Brookes John R. | Multiple pass speech recognition method and system |
US6751589B1 (en) | 2000-09-18 | 2004-06-15 | Hewlett-Packard Development Company, L.P. | Voice-actuated generation of documents containing photographic identification |
US20050080632A1 (en) | 2002-09-25 | 2005-04-14 | Norikazu Endo | Method and system for speech recognition using grammar weighted based upon location information |
US20050135571A1 (en) | 2003-12-19 | 2005-06-23 | At&T Corp. | Method and apparatus for automatically building conversational systems |
US20050222905A1 (en) | 2003-09-11 | 2005-10-06 | Scott Wills | Method and system for generating intelligent electronic banners based on user information |
US20050234725A1 (en) * | 2004-04-20 | 2005-10-20 | International Business Machines Corporation | Method and system for flexible usage of a graphical call flow builder |
US20060009973A1 (en) | 2004-07-06 | 2006-01-12 | Voxify, Inc. A California Corporation | Multi-slot dialog systems and methods |
US20060041926A1 (en) | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
US20060064302A1 (en) | 2004-09-20 | 2006-03-23 | International Business Machines Corporation | Method and system for voice-enabled autofill |
US7024363B1 (en) | 1999-12-14 | 2006-04-04 | International Business Machines Corporation | Methods and apparatus for contingent transfer and execution of spoken language interfaces |
US7143042B1 (en) | 1999-10-04 | 2006-11-28 | Nuance Communications | Tool for graphically defining dialog flows and for establishing operational links between speech applications and hypermedia content in an interactive voice response environment |
US20070061243A1 (en) | 2005-09-14 | 2007-03-15 | Jorey Ramer | Mobile content spidering and compatibility determination |
US20070073690A1 (en) | 2005-09-26 | 2007-03-29 | Boal Steven R | System and method for augmenting content in electronic documents with links to contextually relevant information |
US7210098B2 (en) | 2002-02-18 | 2007-04-24 | Kirusa, Inc. | Technique for synchronizing visual and voice browsers to enable multi-modal browsing |
US7225125B2 (en) | 1999-11-12 | 2007-05-29 | Phoenix Solutions, Inc. | Speech recognition system trained with regional speech characteristics |
US20070136069A1 (en) | 2005-12-13 | 2007-06-14 | General Motors Corporation | Method and system for customizing speech recognition in a mobile vehicle communication system |
US20070156842A1 (en) | 2005-12-29 | 2007-07-05 | Vermeulen Allan H | Distributed storage system with web services client interface |
US20070157075A1 (en) | 2005-12-29 | 2007-07-05 | Ritter Gerd M | Key command functionality in an electronic document |
US20070233487A1 (en) | 2006-04-03 | 2007-10-04 | Cohen Michael H | Automatic language model update |
US7305129B2 (en) | 2003-01-29 | 2007-12-04 | Microsoft Corporation | Methods and apparatus for populating electronic forms from scanned documents |
US7343551B1 (en) | 2002-11-27 | 2008-03-11 | Adobe Systems Incorporated | Autocompleting form fields based on previously entered values |
US7496511B2 (en) | 2003-01-14 | 2009-02-24 | Oracle International Corporation | Method and apparatus for using locale-specific grammars for speech recognition |
US20090270170A1 (en) | 2008-04-29 | 2009-10-29 | Bally Gaming , Inc. | Biofeedback for a gaming device, such as an electronic gaming machine (egm) |
US7711570B2 (en) | 2001-10-21 | 2010-05-04 | Microsoft Corporation | Application abstraction with dialog purpose |
US7822603B1 (en) * | 2004-01-09 | 2010-10-26 | At&T Intellectual Property Ii, L.P. | System and method for mobile automatic speech recognition |
US7890324B2 (en) | 2002-12-19 | 2011-02-15 | At&T Intellectual Property Ii, L.P. | Context-sensitive interface widgets for multi-modal dialog systems |
US8054990B2 (en) | 2006-11-22 | 2011-11-08 | General Motors Llc | Method of recognizing speech from a plurality of speaking locations within a vehicle |
US8060371B1 (en) * | 2007-05-09 | 2011-11-15 | Nextel Communications Inc. | System and method for voice interaction with non-voice enabled web pages |
US8160883B2 (en) | 2004-01-10 | 2012-04-17 | Microsoft Corporation | Focus tracking in dialogs |
US8224650B2 (en) | 2001-10-21 | 2012-07-17 | Microsoft Corporation | Web server controls for web enabled recognition and/or audible prompting |
-
2008
- 2008-05-28 US US12/128,345 patent/US9177551B2/en active Active
-
2015
- 2015-10-30 US US14/928,193 patent/US9530415B2/en active Active
Patent Citations (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5865626A (en) | 1996-08-30 | 1999-02-02 | Gte Internetworking Incorporated | Multi-dialect speech recognition method and apparatus |
US6023676A (en) | 1996-12-12 | 2000-02-08 | Dspc Israel, Ltd. | Keyword recognition system and method |
US20010012997A1 (en) | 1996-12-12 | 2001-08-09 | Adoram Erell | Keyword recognition system and method |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US6343270B1 (en) | 1998-12-09 | 2002-01-29 | International Business Machines Corporation | Method for increasing dialect precision and usability in speech recognition and text-to-speech systems |
US6594629B1 (en) | 1999-08-06 | 2003-07-15 | International Business Machines Corporation | Methods and apparatus for audio-visual speech detection and recognition |
US7143042B1 (en) | 1999-10-04 | 2006-11-28 | Nuance Communications | Tool for graphically defining dialog flows and for establishing operational links between speech applications and hypermedia content in an interactive voice response environment |
US7225125B2 (en) | 1999-11-12 | 2007-05-29 | Phoenix Solutions, Inc. | Speech recognition system trained with regional speech characteristics |
US7024363B1 (en) | 1999-12-14 | 2006-04-04 | International Business Machines Corporation | Methods and apparatus for contingent transfer and execution of spoken language interfaces |
US20020004746A1 (en) | 2000-04-17 | 2002-01-10 | Ferber John B. | E-coupon channel and method for delivery of e-coupons to wireless devices |
US6751589B1 (en) | 2000-09-18 | 2004-06-15 | Hewlett-Packard Development Company, L.P. | Voice-actuated generation of documents containing photographic identification |
US20030046081A1 (en) | 2000-10-06 | 2003-03-06 | Myung-Wan Koo | Auto attendant system and its method and call forwarding method using speech recognition |
US20030033146A1 (en) | 2001-08-03 | 2003-02-13 | Morin Philippe R. | Method for efficient, safe and reliable data entry by voice under adverse conditions |
US7711570B2 (en) | 2001-10-21 | 2010-05-04 | Microsoft Corporation | Application abstraction with dialog purpose |
US8224650B2 (en) | 2001-10-21 | 2012-07-17 | Microsoft Corporation | Web server controls for web enabled recognition and/or audible prompting |
US20030115060A1 (en) | 2001-12-13 | 2003-06-19 | Junqua Jean-Claude | System and interactive form filling with fusion of data from multiple unreliable information sources |
US7124085B2 (en) * | 2001-12-13 | 2006-10-17 | Matsushita Electric Industrial Co., Ltd. | Constraint-based speech recognition system and method |
US7210098B2 (en) | 2002-02-18 | 2007-04-24 | Kirusa, Inc. | Technique for synchronizing visual and voice browsers to enable multi-modal browsing |
US20030182131A1 (en) | 2002-03-25 | 2003-09-25 | Arnold James F. | Method and apparatus for providing speech-driven routing between spoken language applications |
US20030204498A1 (en) * | 2002-04-30 | 2003-10-30 | Lehnert Bernd R. | Customer interaction reporting |
US20030216960A1 (en) | 2002-05-16 | 2003-11-20 | Richard Postrel | System and method for offering geocentric-based incentives and executing a commercial transaction via a wireless device |
US20050080632A1 (en) | 2002-09-25 | 2005-04-14 | Norikazu Endo | Method and system for speech recognition using grammar weighted based upon location information |
US20040059575A1 (en) | 2002-09-25 | 2004-03-25 | Brookes John R. | Multiple pass speech recognition method and system |
US7343551B1 (en) | 2002-11-27 | 2008-03-11 | Adobe Systems Incorporated | Autocompleting form fields based on previously entered values |
US7890324B2 (en) | 2002-12-19 | 2011-02-15 | At&T Intellectual Property Ii, L.P. | Context-sensitive interface widgets for multi-modal dialog systems |
US7496511B2 (en) | 2003-01-14 | 2009-02-24 | Oracle International Corporation | Method and apparatus for using locale-specific grammars for speech recognition |
US7305129B2 (en) | 2003-01-29 | 2007-12-04 | Microsoft Corporation | Methods and apparatus for populating electronic forms from scanned documents |
US20050222905A1 (en) | 2003-09-11 | 2005-10-06 | Scott Wills | Method and system for generating intelligent electronic banners based on user information |
US7660400B2 (en) * | 2003-12-19 | 2010-02-09 | At&T Intellectual Property Ii, L.P. | Method and apparatus for automatically building conversational systems |
US20050135571A1 (en) | 2003-12-19 | 2005-06-23 | At&T Corp. | Method and apparatus for automatically building conversational systems |
US7822603B1 (en) * | 2004-01-09 | 2010-10-26 | At&T Intellectual Property Ii, L.P. | System and method for mobile automatic speech recognition |
US8160883B2 (en) | 2004-01-10 | 2012-04-17 | Microsoft Corporation | Focus tracking in dialogs |
US20050234725A1 (en) * | 2004-04-20 | 2005-10-20 | International Business Machines Corporation | Method and system for flexible usage of a graphical call flow builder |
US20060041926A1 (en) | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
US20070255566A1 (en) | 2004-07-06 | 2007-11-01 | Voxify, Inc. | Multi-slot dialog systems and methods |
US7228278B2 (en) * | 2004-07-06 | 2007-06-05 | Voxify, Inc. | Multi-slot dialog systems and methods |
US20060009973A1 (en) | 2004-07-06 | 2006-01-12 | Voxify, Inc. A California Corporation | Multi-slot dialog systems and methods |
US20060074652A1 (en) * | 2004-09-20 | 2006-04-06 | International Business Machines Corporation | Method and system for voice-enabled autofill |
US20060064302A1 (en) | 2004-09-20 | 2006-03-23 | International Business Machines Corporation | Method and system for voice-enabled autofill |
US7739117B2 (en) * | 2004-09-20 | 2010-06-15 | International Business Machines Corporation | Method and system for voice-enabled autofill |
US20070061243A1 (en) | 2005-09-14 | 2007-03-15 | Jorey Ramer | Mobile content spidering and compatibility determination |
US20070073690A1 (en) | 2005-09-26 | 2007-03-29 | Boal Steven R | System and method for augmenting content in electronic documents with links to contextually relevant information |
US20070136069A1 (en) | 2005-12-13 | 2007-06-14 | General Motors Corporation | Method and system for customizing speech recognition in a mobile vehicle communication system |
US20070156842A1 (en) | 2005-12-29 | 2007-07-05 | Vermeulen Allan H | Distributed storage system with web services client interface |
US20070157075A1 (en) | 2005-12-29 | 2007-07-05 | Ritter Gerd M | Key command functionality in an electronic document |
US20070233487A1 (en) | 2006-04-03 | 2007-10-04 | Cohen Michael H | Automatic language model update |
US8054990B2 (en) | 2006-11-22 | 2011-11-08 | General Motors Llc | Method of recognizing speech from a plurality of speaking locations within a vehicle |
US8060371B1 (en) * | 2007-05-09 | 2011-11-15 | Nextel Communications Inc. | System and method for voice interaction with non-voice enabled web pages |
US20090270170A1 (en) | 2008-04-29 | 2009-10-29 | Bally Gaming , Inc. | Biofeedback for a gaming device, such as an electronic gaming machine (egm) |
Non-Patent Citations (2)
Title |
---|
Chao Huang, Eric Chang, and Tao Chen. "Accent Issues in Large Vocabulary Continuous Speech Recognition (LVCSR)-Microsoft Research." Microsoft Research, Aug. 2001. http://research.microsoft.com/apps/pubs/default.aspx?id=69899 Web, Feb. 27, 2013. |
V. Diakoloukas et al., "Development of dialect-specific speech recognizers using adaptation methods," Proc. IEEE ICASSP 97, vol. 2, pp. 1455-1458, Apr. 1997. |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6747061B2 (en) | 2000-03-21 | 2004-06-08 | Atherogenics, Inc. | N-substituted dithiocarbamates for the treatment of biological disorders |
Also Published As
Publication number | Publication date |
---|---|
US20090187410A1 (en) | 2009-07-23 |
US9177551B2 (en) | 2015-11-03 |
US20160049151A1 (en) | 2016-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9530415B2 (en) | System and method of providing speech processing in user interface | |
US20110067059A1 (en) | Media control | |
US7415537B1 (en) | Conversational portal for providing conversational browsing and multimedia broadcast on demand | |
US8838457B2 (en) | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility | |
US10056077B2 (en) | Using speech recognition results based on an unstructured language model with a music system | |
KR100459299B1 (en) | Conversational browser and conversational systems | |
US8886540B2 (en) | Using speech recognition results based on an unstructured language model in a mobile communication facility application | |
US8949130B2 (en) | Internal and external speech recognition use with a mobile communication facility | |
US7366979B2 (en) | Method and apparatus for annotating a document | |
CN102792294B (en) | The system and method for the hybrid processing in natural language speech service environment | |
US20080288252A1 (en) | Speech recognition of speech recorded by a mobile communication facility | |
RU2525440C2 (en) | Markup language-based selection and utilisation of recognisers for utterance processing | |
US20090030687A1 (en) | Adapting an unstructured language model speech recognition system based on usage | |
US20090030691A1 (en) | Using an unstructured language model associated with an application of a mobile communication facility | |
US20090030685A1 (en) | Using speech recognition results based on an unstructured language model with a navigation system | |
US20090030688A1 (en) | Tagging speech recognition results based on an unstructured language model for use in a mobile communication facility application | |
US20080312934A1 (en) | Using results of unstructured language model based speech recognition to perform an action on a mobile communications facility | |
JP6971292B2 (en) | Methods, devices, servers, computer-readable storage media and computer programs for aligning paragraphs and images | |
WO2008109835A2 (en) | Speech recognition of speech recorded by a mobile communication facility | |
CN101103612A (en) | Dynamic extensible lightweight access to web services for pervasive devices | |
US20080319759A1 (en) | Integrating a voice browser into a web 2.0 environment | |
Di Fabbrizio et al. | A speech mashup framework for multimodal mobile services | |
CN108881508B (en) | Voice Domain Name System (DNS) unit based on block chain | |
US9275034B1 (en) | Exceptions to action invocation from parsing rules | |
CN114064943A (en) | Conference management method, conference management device, storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: AT&T LABS, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WILPON, JAY;DI FABBRIZIO, GIUSEPPE;STERN, BENJAMIN J.;REEL/FRAME:036937/0194 Effective date: 20080523 |
|
AS | Assignment |
Owner name: AT&T INTELLECTUAL PROPERTY I, L.P., GEORGIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T LABS, INC.;REEL/FRAME:038107/0915 Effective date: 20160204 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: NUANCE COMMUNICATIONS, INC., MASSACHUSETTS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T INTELLECTUAL PROPERTY I, L.P.;REEL/FRAME:041498/0113 Effective date: 20161214 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:065552/0934 Effective date: 20230920 |