US9270828B2 - System and method for voicemail to text conversion - Google Patents
System and method for voicemail to text conversion Download PDFInfo
- Publication number
- US9270828B2 US9270828B2 US12/828,677 US82867710A US9270828B2 US 9270828 B2 US9270828 B2 US 9270828B2 US 82867710 A US82867710 A US 82867710A US 9270828 B2 US9270828 B2 US 9270828B2
- Authority
- US
- United States
- Prior art keywords
- user
- voicemail
- name
- email address
- message
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
- 238000006243 chemical reaction Methods 0.000 title claims abstract description 27
- 238000000034 method Methods 0.000 title claims description 14
- 230000015654 memory Effects 0.000 claims description 11
- 238000004590 computer program Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 6
- 238000004891 communication Methods 0.000 description 3
- 230000003068 static effect Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/50—Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
- H04M3/53—Centralised arrangements for recording incoming messages, i.e. mailbox systems
- H04M3/533—Voice mail systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2203/00—Aspects of automatic or semi-automatic exchanges
- H04M2203/45—Aspects of automatic or semi-automatic exchanges related to voicemail messaging
- H04M2203/4536—Voicemail combined with text-based messaging
Definitions
- the present disclosure relates to voicemail to text conversion. More particularly, the present disclosure relates to a manner of improving the accuracy of a voicemail to text conversion of a user's name.
- a voicemail to text conversion service is a service which converts a voicemail message to text.
- a user's name i.e., a called party's name
- a name is a difficult word for a speech engine to accurately convert to text.
- a user of a voicemail to text conversion service may find his or her name to be constantly misspelled in their converted text messages.
- FIG. 1 shows an example of a general computer system that includes a set of instructions for voicemail to text conversion described herein;
- FIG. 2 shows an example of a system for voicemail to text conversion, according to an aspect of the present disclosure
- FIG. 3 shows an example of an algorithm performed by a voicemail platform, according to an aspect of the present disclosure.
- FIG. 4 shows an example of an algorithm performed by a speech engine, according to an aspect of the present disclosure.
- FIG. 1 is an illustrative embodiment of a general computer system that includes a set of instructions for performing processes as described herein.
- the general computer system is shown and is designated 100 .
- the computer system 100 can include a set of instructions that can be executed to cause the computer system 100 to perform any one or more of the methods or computer based functions disclosed herein.
- the computer system 100 may operate as a standalone device or may be connected, for example, using a network 101 , to other computer systems or peripheral devices.
- the computer system 100 may include or be included within any one or more of the computers, servers, systems, or communication networks described herein.
- the computer system may operate in the capacity of a server or as a client user computer in a server-client user network environment, or as a peer computer system in a peer-to-peer (or distributed) network environment.
- the computer system 100 can also be implemented as or incorporated into various devices, such as a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile device, a palmtop computer, a laptop computer, a desktop computer, a communications device, a wireless telephone, a personal trusted device, a web appliance, or any other machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine.
- PC personal computer
- PDA personal digital assistant
- the computer system 100 can be implemented using electronic devices that provide voice, video or data communication.
- the term “system” shall also be taken to include any collection of systems or sub-systems that individually or jointly execute a set, or multiple sets, of instructions to perform one or more computer functions.
- the computer system 100 may include a processor 110 , for example, a central processing unit (CPU), a graphics processing unit (GPU), or both.
- the computer system 100 can include a main memory 120 and a static memory 130 that can communicate with each other via a bus 108 .
- the computer system 100 may further include a video display unit 150 , such as a liquid crystal display (LCD), an organic light emitting diode (OLED), a flat panel display, a solid state display, or a cathode ray tube (CRT).
- LCD liquid crystal display
- OLED organic light emitting diode
- CRT cathode ray tube
- the computer system 100 may include an alpha-numeric input device 160 , such as a keyboard, another input device (not shown), such as a remote control device having a wireless keypad, a keyboard, a microphone coupled to a speech recognition engine, a camera such as a video camera or still camera, and a cursor control device 170 , such as a mouse.
- the computer system 100 can also include a disk drive unit 180 , a signal generation device 190 , such as a speaker or remote control, and a network interface device 140 .
- the disk drive unit 180 may include a computer-readable medium 182 in which one or more sets of instructions 184 , e.g. software, can be embedded.
- a computer-readable medium 182 is a tangible, non-transitory article of manufacture, from which sets of instructions 184 can be read.
- the instructions 184 may embody one or more of the methods or logic as described herein.
- the instructions 184 may reside completely, or at least partially, within the main memory 120 , the static memory 130 , and/or within the processor 110 during execution by the computer system 100 .
- the main memory 120 and the processor 110 also may include computer-readable media.
- dedicated hardware implementations such as application specific integrated circuits, programmable logic arrays and other hardware devices, can be constructed to implement one or more of the methods described herein.
- Applications that may include the apparatus and systems of various embodiments can broadly include a variety of electronic and computer systems.
- One or more embodiments described herein may implement functions using two or more specific interconnected hardware modules or devices with related control and data signals that can be communicated between and through the modules, or as portions of an application-specific integrated circuit. Accordingly, the present system encompasses software, firmware, and hardware implementations, or combinations thereof.
- the methods described herein may be implemented by software programs executable by a computer system.
- implementations can include distributed processing, component/object distributed processing, and parallel processing.
- virtual computer system processing can be constructed to implement one or more of the methods or functionality as described herein.
- the present disclosure contemplates a computer-readable medium 182 that includes instructions 184 or receives and executes instructions 184 responsive to a propagated signal, so that a device connected to a network 101 can communicate voice, video or data over the network 101 . Further, the instructions 184 may be transmitted or received over the network 101 via the network interface device 140 .
- FIG. 2 illustrates an example of a system for voicemail to text conversion.
- the system may include a voicemail platform 201 and a speech engine 202 .
- the voicemail platform 201 and the speech engine 202 may be independently owned or operated.
- the voicemail platform 201 may be operated by a wireline or wireless telephone carrier, and the speech engine 202 may be run on a server operated by a third-party vendor.
- the voicemail platform 201 and the speech engine 202 may be integrated within one system and may both be operated by a carrier.
- the voicemail platform 201 receives a voicemail message from a calling party which is directed to a voicemail user (a called party).
- the voicemail platform 201 may be a centralized computer system which stores incoming voicemail messages in personal mailboxes associated with user phone numbers.
- the voicemail messages may be stored in a storage system which includes storage media such as, but not limited to, hard disk drives.
- the voicemail platform 201 may also store data pertinent to each voicemail user.
- the voicemail platform 201 may store an email address associated with each voicemail user.
- the converted text can be delivered to the user in the form of an email message addressed to the email address stored in the voicemail platform 201 .
- the voicemail platform 201 may also store other data pertinent to each voicemail user, such as a user's name.
- the voicemail platform 201 sends the voicemail, as well as username data of the voicemail user, to the speech engine 202 .
- the voicemail platform 201 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 3 .
- the voicemail platform 201 after receiving a voicemail for a user of a voicemail to text conversion service (S 301 ), the voicemail platform 201 sends the voicemail and username data of the user to the speech engine 202 (S 302 ).
- the speech engine 202 converts the voicemail to text, using the username data to correctly spell all instances of the user's name within the voicemail.
- the speech engine 202 determines the correct spelling of the user's name from the username data.
- the username data may be an email address of the user.
- a user's email address typically contains all or part of a user's name.
- a person named ‘Rick Jones’ who is an employee of XYZ corporation may have an email address of ‘rick.jones@xyz.com’.
- the speech engine 202 may be capable of parsing the email address and extracting the correct spelling of the user's name from the email address.
- the voicemail platform 201 may parse the user's email address and extract the correct spelling of the user's name from the email address, and then send the correct spelling of the user's name to the speech engine 202 as the username data.
- the speech engine 202 performs a voicemail to text conversion algorithm to convert the voicemail to text.
- the speech engine 202 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 4 .
- the speech engine 202 receives the voicemail message and username data from the voicemail platform 201 (S 401 ), and converts the voicemail message to text.
- the speech engine 202 recognizes every occurrence of the user's name within the voicemail message (S 402 ), and uses the username data to correctly spell each corresponding occurrence of the user's name within the converted text (S 403 ).
- the speech engine 202 may recognize the user's name, for example, by comparing phones within the voicemail message to a predetermined phone stored in the speech engine corresponding to the user's name. After the voicemail to text conversion algorithm is performed, the speech engine 202 sends the converted text to the voicemail platform 201 (S 404 ).
- the voicemail platform 201 then receives the converted text from the speech engine 202 (S 303 ), and delivers the converted text to a device 203 of the user, such as, but not limited to, a phone, a PDA, a tablet device or a PC (S 304 ).
- the converted text may be delivered to the voicemail user in a variety of formats, such as, but not limited to, an email or a Short Message Service (SMS) text message.
- SMS Short Message Service
- a voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system.
- the processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.
- the converted text may be delivered to the device of the user in the form of an email message.
- the username data may be an email address of the user.
- the email address may contain at least part of the user's name.
- the voicemail platform may determine the username data from an email address of the user.
- the device of the user may be a phone, a PDA, a tablet device or a PC.
- the voicemail platform may be operated by a telephone carrier, and the speech engine may be operated by a third-party vendor.
- a method for providing a voicemail to text conversion service to a user includes storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, receiving a voicemail message for the user at the voicemail platform, sending the voicemail message and the username data from the voicemail platform to a speech engine, receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sending the converted text from the voicemail platform to a device of the user.
- a non-transitory computer-readable medium storing a program for providing a voicemail to text conversion service to a user includes code for storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, code for receiving a voicemail message for the user at the voicemail platform, code for sending the voicemail message and the username data from the voicemail platform to a speech engine, code for receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and code for sending the converted text from the voicemail platform to a device of the user.
- While a computer-readable medium herein may be shown to be a single medium, the term “computer-readable medium” includes a single medium or multiple media, such as a centralized or distributed database, and/or associated caches and servers that store one or more sets of instructions.
- the term “computer-readable medium” shall also include any medium that is capable of storing, encoding or carrying a set of instructions for execution by a processor or that cause a computer system to perform any one or more of the methods or operations disclosed herein.
- the computer-readable medium can include a solid-state memory such as a memory card or other package that houses one or more non-volatile read-only memories.
- the computer-readable medium can be a random access memory or other volatile re-writable memory.
- the computer-readable medium can include a magneto-optical or optical medium, such as a disk or tapes or other storage device to capture carrier wave signals such as a signal communicated over a transmission medium. Accordingly, the disclosure is considered to include any computer-readable medium or other equivalents and successor media, in which data or instructions may be stored.
- inventions of the disclosure may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept.
- inventions merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept.
- specific embodiments have been illustrated and described herein, it should be appreciated that any subsequent arrangement designed to achieve the same or similar purpose may be substituted for the specific embodiments shown.
- This disclosure is intended to cover any and all subsequent adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the description.
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/828,677 US9270828B2 (en) | 2010-07-01 | 2010-07-01 | System and method for voicemail to text conversion |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/828,677 US9270828B2 (en) | 2010-07-01 | 2010-07-01 | System and method for voicemail to text conversion |
Publications (2)
Publication Number | Publication Date |
---|---|
US20120002794A1 US20120002794A1 (en) | 2012-01-05 |
US9270828B2 true US9270828B2 (en) | 2016-02-23 |
Family
ID=45399723
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/828,677 Active 2032-08-23 US9270828B2 (en) | 2010-07-01 | 2010-07-01 | System and method for voicemail to text conversion |
Country Status (1)
Country | Link |
---|---|
US (1) | US9270828B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10893013B1 (en) * | 2017-08-22 | 2021-01-12 | James Peter Morrissette | Recipient notification of electronic message generated by voice-to-text engine |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8249568B2 (en) | 2006-06-15 | 2012-08-21 | Timothy Salmon | System and method for processing a voice mail |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6775651B1 (en) * | 2000-05-26 | 2004-08-10 | International Business Machines Corporation | Method of transcribing text from computer voice mail |
US20050021339A1 (en) * | 2003-07-24 | 2005-01-27 | Siemens Information And Communication Networks, Inc. | Annotations addition to documents rendered via text-to-speech conversion over a voice connection |
US20070293272A1 (en) * | 2006-06-15 | 2007-12-20 | Timothy Salmon | System and method for processing a voice mail |
US20080167007A1 (en) * | 2007-01-07 | 2008-07-10 | Gregory Novick | Voicemail Systems and Methods |
US7542552B2 (en) * | 2005-06-24 | 2009-06-02 | Siemens Communications, Inc. | Method and apparatus for generating default personal greetings using text to speech conversion |
US7562552B2 (en) * | 2005-11-10 | 2009-07-21 | Tyco Electronics Corporation | Crimp height adjustment mechanism |
US7590550B2 (en) * | 2006-09-08 | 2009-09-15 | American Well Inc. | Connecting consumers with service providers |
US7693267B2 (en) * | 2005-12-30 | 2010-04-06 | Microsoft Corporation | Personalized user specific grammars |
US20110022386A1 (en) * | 2009-07-22 | 2011-01-27 | Cisco Technology, Inc. | Speech recognition tuning tool |
US8126718B2 (en) * | 2008-07-11 | 2012-02-28 | Research In Motion Limited | Facilitating text-to-speech conversion of a username or a network address containing a username |
US8204486B2 (en) * | 2008-12-19 | 2012-06-19 | Cox Communications, Inc. | Dynamic messaging routing and audio-to-text linking for visual voicemail |
-
2010
- 2010-07-01 US US12/828,677 patent/US9270828B2/en active Active
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6775651B1 (en) * | 2000-05-26 | 2004-08-10 | International Business Machines Corporation | Method of transcribing text from computer voice mail |
US20050021339A1 (en) * | 2003-07-24 | 2005-01-27 | Siemens Information And Communication Networks, Inc. | Annotations addition to documents rendered via text-to-speech conversion over a voice connection |
US7542552B2 (en) * | 2005-06-24 | 2009-06-02 | Siemens Communications, Inc. | Method and apparatus for generating default personal greetings using text to speech conversion |
US7562552B2 (en) * | 2005-11-10 | 2009-07-21 | Tyco Electronics Corporation | Crimp height adjustment mechanism |
US7693267B2 (en) * | 2005-12-30 | 2010-04-06 | Microsoft Corporation | Personalized user specific grammars |
US20070293272A1 (en) * | 2006-06-15 | 2007-12-20 | Timothy Salmon | System and method for processing a voice mail |
US8249568B2 (en) * | 2006-06-15 | 2012-08-21 | Timothy Salmon | System and method for processing a voice mail |
US7590550B2 (en) * | 2006-09-08 | 2009-09-15 | American Well Inc. | Connecting consumers with service providers |
US20080167007A1 (en) * | 2007-01-07 | 2008-07-10 | Gregory Novick | Voicemail Systems and Methods |
US8126718B2 (en) * | 2008-07-11 | 2012-02-28 | Research In Motion Limited | Facilitating text-to-speech conversion of a username or a network address containing a username |
US8204486B2 (en) * | 2008-12-19 | 2012-06-19 | Cox Communications, Inc. | Dynamic messaging routing and audio-to-text linking for visual voicemail |
US20110022386A1 (en) * | 2009-07-22 | 2011-01-27 | Cisco Technology, Inc. | Speech recognition tuning tool |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10893013B1 (en) * | 2017-08-22 | 2021-01-12 | James Peter Morrissette | Recipient notification of electronic message generated by voice-to-text engine |
Also Published As
Publication number | Publication date |
---|---|
US20120002794A1 (en) | 2012-01-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10714091B2 (en) | Systems and methods to present voice message information to a user of a computing device | |
US10455091B1 (en) | User input driven short message service (SMS) applications | |
US10818285B2 (en) | Electronic device and speech recognition method therefor | |
CN107580149A (en) | The recognition methods of outgoing call failure cause, device, electronic equipment, storage medium | |
US9148741B2 (en) | Action generation based on voice data | |
CN108877792B (en) | Method, apparatus, electronic device and computer readable storage medium for processing voice conversations | |
CN107430616A (en) | The interactive mode of speech polling re-forms | |
US11720612B2 (en) | System and method to avoid sending a message to the unintended recipient | |
US20120185240A1 (en) | System and method for generating and sending a simplified message using speech recognition | |
US20100082674A1 (en) | System for detecting user input error | |
CN111602133A (en) | Compression of word embedding for natural language processing systems | |
US11790916B2 (en) | Speech-to-text system | |
US9838486B2 (en) | Checking documents for spelling and/or grammatical errors and/or providing recommended words or phrases based on patterns of colloquialisms used among users in a social network | |
US10673853B2 (en) | Method and system for executing a transaction request using a communication channel | |
US9270828B2 (en) | System and method for voicemail to text conversion | |
US8775183B2 (en) | Application of user-specified transformations to automatic speech recognition results | |
US11532308B2 (en) | Speech-to-text system | |
US9736297B2 (en) | Phone number canonicalization and information discoverability | |
US10176151B2 (en) | Text message displaying method and system, and text message providing system | |
KR20150103855A (en) | Method and system of providing voice service using interoperation between application and server | |
US20220141161A1 (en) | Bot proxy for group communication service | |
US20190108290A1 (en) | Human assisted automated question and answer system using natural language processing of real-time requests assisted by humans for requests of low confidence | |
KR102347070B1 (en) | Method and apparatus for processing information of terminal | |
CN113709506A (en) | Multimedia playing method, device, medium and program product based on cloud mobile phone | |
US20140258398A1 (en) | System and Method for Automatic Context Detection, Sharing, and Storage in Real-Time Communication Systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: AT&T MOBILITY II LLC., GEORGIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:JACKSON, JACQUELINE;ZUBAS, MICHAEL;SIGNING DATES FROM 20100629 TO 20100701;REEL/FRAME:024625/0085 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: NUANCE COMMUNICATIONS, INC., MASSACHUSETTS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T MOBILITY II LLC;REEL/FRAME:041494/0590 Effective date: 20161214 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |
|
AS | Assignment |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:065578/0676 Effective date: 20230920 |