US9270828B2 - System and method for voicemail to text conversion - Google Patents

System and method for voicemail to text conversion Download PDF

Info

Publication number
US9270828B2
US9270828B2 US12/828,677 US82867710A US9270828B2 US 9270828 B2 US9270828 B2 US 9270828B2 US 82867710 A US82867710 A US 82867710A US 9270828 B2 US9270828 B2 US 9270828B2
Authority
US
United States
Prior art keywords
user
voicemail
name
email address
message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US12/828,677
Other versions
US20120002794A1 (en
Inventor
Jacqueline JACKSON
Michael Zubas
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
AT&T Mobility II LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AT&T Mobility II LLC filed Critical AT&T Mobility II LLC
Priority to US12/828,677 priority Critical patent/US9270828B2/en
Assigned to AT&T MOBILITY II LLC. reassignment AT&T MOBILITY II LLC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: JACKSON, JACQUELINE, ZUBAS, MICHAEL
Publication of US20120002794A1 publication Critical patent/US20120002794A1/en
Application granted granted Critical
Publication of US9270828B2 publication Critical patent/US9270828B2/en
Assigned to NUANCE COMMUNICATIONS, INC. reassignment NUANCE COMMUNICATIONS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AT&T MOBILITY II LLC
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLC reassignment MICROSOFT TECHNOLOGY LICENSING, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: NUANCE COMMUNICATIONS, INC.
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/53Centralised arrangements for recording incoming messages, i.e. mailbox systems
    • H04M3/533Voice mail systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/45Aspects of automatic or semi-automatic exchanges related to voicemail messaging
    • H04M2203/4536Voicemail combined with text-based messaging

Definitions

  • the present disclosure relates to voicemail to text conversion. More particularly, the present disclosure relates to a manner of improving the accuracy of a voicemail to text conversion of a user's name.
  • a voicemail to text conversion service is a service which converts a voicemail message to text.
  • a user's name i.e., a called party's name
  • a name is a difficult word for a speech engine to accurately convert to text.
  • a user of a voicemail to text conversion service may find his or her name to be constantly misspelled in their converted text messages.
  • FIG. 1 shows an example of a general computer system that includes a set of instructions for voicemail to text conversion described herein;
  • FIG. 2 shows an example of a system for voicemail to text conversion, according to an aspect of the present disclosure
  • FIG. 3 shows an example of an algorithm performed by a voicemail platform, according to an aspect of the present disclosure.
  • FIG. 4 shows an example of an algorithm performed by a speech engine, according to an aspect of the present disclosure.
  • FIG. 1 is an illustrative embodiment of a general computer system that includes a set of instructions for performing processes as described herein.
  • the general computer system is shown and is designated 100 .
  • the computer system 100 can include a set of instructions that can be executed to cause the computer system 100 to perform any one or more of the methods or computer based functions disclosed herein.
  • the computer system 100 may operate as a standalone device or may be connected, for example, using a network 101 , to other computer systems or peripheral devices.
  • the computer system 100 may include or be included within any one or more of the computers, servers, systems, or communication networks described herein.
  • the computer system may operate in the capacity of a server or as a client user computer in a server-client user network environment, or as a peer computer system in a peer-to-peer (or distributed) network environment.
  • the computer system 100 can also be implemented as or incorporated into various devices, such as a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile device, a palmtop computer, a laptop computer, a desktop computer, a communications device, a wireless telephone, a personal trusted device, a web appliance, or any other machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine.
  • PC personal computer
  • PDA personal digital assistant
  • the computer system 100 can be implemented using electronic devices that provide voice, video or data communication.
  • the term “system” shall also be taken to include any collection of systems or sub-systems that individually or jointly execute a set, or multiple sets, of instructions to perform one or more computer functions.
  • the computer system 100 may include a processor 110 , for example, a central processing unit (CPU), a graphics processing unit (GPU), or both.
  • the computer system 100 can include a main memory 120 and a static memory 130 that can communicate with each other via a bus 108 .
  • the computer system 100 may further include a video display unit 150 , such as a liquid crystal display (LCD), an organic light emitting diode (OLED), a flat panel display, a solid state display, or a cathode ray tube (CRT).
  • LCD liquid crystal display
  • OLED organic light emitting diode
  • CRT cathode ray tube
  • the computer system 100 may include an alpha-numeric input device 160 , such as a keyboard, another input device (not shown), such as a remote control device having a wireless keypad, a keyboard, a microphone coupled to a speech recognition engine, a camera such as a video camera or still camera, and a cursor control device 170 , such as a mouse.
  • the computer system 100 can also include a disk drive unit 180 , a signal generation device 190 , such as a speaker or remote control, and a network interface device 140 .
  • the disk drive unit 180 may include a computer-readable medium 182 in which one or more sets of instructions 184 , e.g. software, can be embedded.
  • a computer-readable medium 182 is a tangible, non-transitory article of manufacture, from which sets of instructions 184 can be read.
  • the instructions 184 may embody one or more of the methods or logic as described herein.
  • the instructions 184 may reside completely, or at least partially, within the main memory 120 , the static memory 130 , and/or within the processor 110 during execution by the computer system 100 .
  • the main memory 120 and the processor 110 also may include computer-readable media.
  • dedicated hardware implementations such as application specific integrated circuits, programmable logic arrays and other hardware devices, can be constructed to implement one or more of the methods described herein.
  • Applications that may include the apparatus and systems of various embodiments can broadly include a variety of electronic and computer systems.
  • One or more embodiments described herein may implement functions using two or more specific interconnected hardware modules or devices with related control and data signals that can be communicated between and through the modules, or as portions of an application-specific integrated circuit. Accordingly, the present system encompasses software, firmware, and hardware implementations, or combinations thereof.
  • the methods described herein may be implemented by software programs executable by a computer system.
  • implementations can include distributed processing, component/object distributed processing, and parallel processing.
  • virtual computer system processing can be constructed to implement one or more of the methods or functionality as described herein.
  • the present disclosure contemplates a computer-readable medium 182 that includes instructions 184 or receives and executes instructions 184 responsive to a propagated signal, so that a device connected to a network 101 can communicate voice, video or data over the network 101 . Further, the instructions 184 may be transmitted or received over the network 101 via the network interface device 140 .
  • FIG. 2 illustrates an example of a system for voicemail to text conversion.
  • the system may include a voicemail platform 201 and a speech engine 202 .
  • the voicemail platform 201 and the speech engine 202 may be independently owned or operated.
  • the voicemail platform 201 may be operated by a wireline or wireless telephone carrier, and the speech engine 202 may be run on a server operated by a third-party vendor.
  • the voicemail platform 201 and the speech engine 202 may be integrated within one system and may both be operated by a carrier.
  • the voicemail platform 201 receives a voicemail message from a calling party which is directed to a voicemail user (a called party).
  • the voicemail platform 201 may be a centralized computer system which stores incoming voicemail messages in personal mailboxes associated with user phone numbers.
  • the voicemail messages may be stored in a storage system which includes storage media such as, but not limited to, hard disk drives.
  • the voicemail platform 201 may also store data pertinent to each voicemail user.
  • the voicemail platform 201 may store an email address associated with each voicemail user.
  • the converted text can be delivered to the user in the form of an email message addressed to the email address stored in the voicemail platform 201 .
  • the voicemail platform 201 may also store other data pertinent to each voicemail user, such as a user's name.
  • the voicemail platform 201 sends the voicemail, as well as username data of the voicemail user, to the speech engine 202 .
  • the voicemail platform 201 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 3 .
  • the voicemail platform 201 after receiving a voicemail for a user of a voicemail to text conversion service (S 301 ), the voicemail platform 201 sends the voicemail and username data of the user to the speech engine 202 (S 302 ).
  • the speech engine 202 converts the voicemail to text, using the username data to correctly spell all instances of the user's name within the voicemail.
  • the speech engine 202 determines the correct spelling of the user's name from the username data.
  • the username data may be an email address of the user.
  • a user's email address typically contains all or part of a user's name.
  • a person named ‘Rick Jones’ who is an employee of XYZ corporation may have an email address of ‘rick.jones@xyz.com’.
  • the speech engine 202 may be capable of parsing the email address and extracting the correct spelling of the user's name from the email address.
  • the voicemail platform 201 may parse the user's email address and extract the correct spelling of the user's name from the email address, and then send the correct spelling of the user's name to the speech engine 202 as the username data.
  • the speech engine 202 performs a voicemail to text conversion algorithm to convert the voicemail to text.
  • the speech engine 202 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 4 .
  • the speech engine 202 receives the voicemail message and username data from the voicemail platform 201 (S 401 ), and converts the voicemail message to text.
  • the speech engine 202 recognizes every occurrence of the user's name within the voicemail message (S 402 ), and uses the username data to correctly spell each corresponding occurrence of the user's name within the converted text (S 403 ).
  • the speech engine 202 may recognize the user's name, for example, by comparing phones within the voicemail message to a predetermined phone stored in the speech engine corresponding to the user's name. After the voicemail to text conversion algorithm is performed, the speech engine 202 sends the converted text to the voicemail platform 201 (S 404 ).
  • the voicemail platform 201 then receives the converted text from the speech engine 202 (S 303 ), and delivers the converted text to a device 203 of the user, such as, but not limited to, a phone, a PDA, a tablet device or a PC (S 304 ).
  • the converted text may be delivered to the voicemail user in a variety of formats, such as, but not limited to, an email or a Short Message Service (SMS) text message.
  • SMS Short Message Service
  • a voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system.
  • the processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.
  • the converted text may be delivered to the device of the user in the form of an email message.
  • the username data may be an email address of the user.
  • the email address may contain at least part of the user's name.
  • the voicemail platform may determine the username data from an email address of the user.
  • the device of the user may be a phone, a PDA, a tablet device or a PC.
  • the voicemail platform may be operated by a telephone carrier, and the speech engine may be operated by a third-party vendor.
  • a method for providing a voicemail to text conversion service to a user includes storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, receiving a voicemail message for the user at the voicemail platform, sending the voicemail message and the username data from the voicemail platform to a speech engine, receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sending the converted text from the voicemail platform to a device of the user.
  • a non-transitory computer-readable medium storing a program for providing a voicemail to text conversion service to a user includes code for storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, code for receiving a voicemail message for the user at the voicemail platform, code for sending the voicemail message and the username data from the voicemail platform to a speech engine, code for receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and code for sending the converted text from the voicemail platform to a device of the user.
  • While a computer-readable medium herein may be shown to be a single medium, the term “computer-readable medium” includes a single medium or multiple media, such as a centralized or distributed database, and/or associated caches and servers that store one or more sets of instructions.
  • the term “computer-readable medium” shall also include any medium that is capable of storing, encoding or carrying a set of instructions for execution by a processor or that cause a computer system to perform any one or more of the methods or operations disclosed herein.
  • the computer-readable medium can include a solid-state memory such as a memory card or other package that houses one or more non-volatile read-only memories.
  • the computer-readable medium can be a random access memory or other volatile re-writable memory.
  • the computer-readable medium can include a magneto-optical or optical medium, such as a disk or tapes or other storage device to capture carrier wave signals such as a signal communicated over a transmission medium. Accordingly, the disclosure is considered to include any computer-readable medium or other equivalents and successor media, in which data or instructions may be stored.
  • inventions of the disclosure may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept.
  • inventions merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept.
  • specific embodiments have been illustrated and described herein, it should be appreciated that any subsequent arrangement designed to achieve the same or similar purpose may be substituted for the specific embodiments shown.
  • This disclosure is intended to cover any and all subsequent adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the description.

Abstract

A voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system. The processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.

Description

BACKGROUND
1. Field of the Disclosure
The present disclosure relates to voicemail to text conversion. More particularly, the present disclosure relates to a manner of improving the accuracy of a voicemail to text conversion of a user's name.
2. Background Information
A voicemail to text conversion service is a service which converts a voicemail message to text. Typically, a user's name (i.e., a called party's name) will be mentioned at least once in a voicemail message. However, a name is a difficult word for a speech engine to accurately convert to text. Thus, a user of a voicemail to text conversion service may find his or her name to be constantly misspelled in their converted text messages.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows an example of a general computer system that includes a set of instructions for voicemail to text conversion described herein;
FIG. 2 shows an example of a system for voicemail to text conversion, according to an aspect of the present disclosure;
FIG. 3 shows an example of an algorithm performed by a voicemail platform, according to an aspect of the present disclosure; and
FIG. 4 shows an example of an algorithm performed by a speech engine, according to an aspect of the present disclosure.
DETAILED DESCRIPTION
In view of the foregoing, the present disclosure, through one or more of its various aspects, embodiments and/or specific features or sub-components, is thus intended to bring out one or more of the advantages as specifically noted below.
FIG. 1 is an illustrative embodiment of a general computer system that includes a set of instructions for performing processes as described herein. The general computer system is shown and is designated 100. The computer system 100 can include a set of instructions that can be executed to cause the computer system 100 to perform any one or more of the methods or computer based functions disclosed herein. The computer system 100 may operate as a standalone device or may be connected, for example, using a network 101, to other computer systems or peripheral devices. For example, the computer system 100 may include or be included within any one or more of the computers, servers, systems, or communication networks described herein.
In a networked deployment, the computer system may operate in the capacity of a server or as a client user computer in a server-client user network environment, or as a peer computer system in a peer-to-peer (or distributed) network environment. The computer system 100, or portions thereof, can also be implemented as or incorporated into various devices, such as a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile device, a palmtop computer, a laptop computer, a desktop computer, a communications device, a wireless telephone, a personal trusted device, a web appliance, or any other machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine. In a particular embodiment, the computer system 100 can be implemented using electronic devices that provide voice, video or data communication. Further, while a single computer system 100 is illustrated, the term “system” shall also be taken to include any collection of systems or sub-systems that individually or jointly execute a set, or multiple sets, of instructions to perform one or more computer functions.
As illustrated in FIG. 1, the computer system 100 may include a processor 110, for example, a central processing unit (CPU), a graphics processing unit (GPU), or both. Moreover, the computer system 100 can include a main memory 120 and a static memory 130 that can communicate with each other via a bus 108. As shown, the computer system 100 may further include a video display unit 150, such as a liquid crystal display (LCD), an organic light emitting diode (OLED), a flat panel display, a solid state display, or a cathode ray tube (CRT). Additionally, the computer system 100 may include an alpha-numeric input device 160, such as a keyboard, another input device (not shown), such as a remote control device having a wireless keypad, a keyboard, a microphone coupled to a speech recognition engine, a camera such as a video camera or still camera, and a cursor control device 170, such as a mouse. The computer system 100 can also include a disk drive unit 180, a signal generation device 190, such as a speaker or remote control, and a network interface device 140.
In a particular embodiment, as depicted in FIG. 1, the disk drive unit 180 may include a computer-readable medium 182 in which one or more sets of instructions 184, e.g. software, can be embedded. A computer-readable medium 182 is a tangible, non-transitory article of manufacture, from which sets of instructions 184 can be read. Further, the instructions 184 may embody one or more of the methods or logic as described herein. In a particular embodiment, the instructions 184 may reside completely, or at least partially, within the main memory 120, the static memory 130, and/or within the processor 110 during execution by the computer system 100. The main memory 120 and the processor 110 also may include computer-readable media.
In an alternative embodiment, dedicated hardware implementations, such as application specific integrated circuits, programmable logic arrays and other hardware devices, can be constructed to implement one or more of the methods described herein. Applications that may include the apparatus and systems of various embodiments can broadly include a variety of electronic and computer systems. One or more embodiments described herein may implement functions using two or more specific interconnected hardware modules or devices with related control and data signals that can be communicated between and through the modules, or as portions of an application-specific integrated circuit. Accordingly, the present system encompasses software, firmware, and hardware implementations, or combinations thereof.
In accordance with various embodiments of the present disclosure, the methods described herein may be implemented by software programs executable by a computer system. Further, in an exemplary, non-limited embodiment, implementations can include distributed processing, component/object distributed processing, and parallel processing. Alternatively, virtual computer system processing can be constructed to implement one or more of the methods or functionality as described herein.
The present disclosure contemplates a computer-readable medium 182 that includes instructions 184 or receives and executes instructions 184 responsive to a propagated signal, so that a device connected to a network 101 can communicate voice, video or data over the network 101. Further, the instructions 184 may be transmitted or received over the network 101 via the network interface device 140.
FIG. 2 illustrates an example of a system for voicemail to text conversion. The system may include a voicemail platform 201 and a speech engine 202. The voicemail platform 201 and the speech engine 202 may be independently owned or operated. For example, the voicemail platform 201 may be operated by a wireline or wireless telephone carrier, and the speech engine 202 may be run on a server operated by a third-party vendor. Alternatively, the voicemail platform 201 and the speech engine 202 may be integrated within one system and may both be operated by a carrier.
The voicemail platform 201 receives a voicemail message from a calling party which is directed to a voicemail user (a called party). The voicemail platform 201 may be a centralized computer system which stores incoming voicemail messages in personal mailboxes associated with user phone numbers. The voicemail messages may be stored in a storage system which includes storage media such as, but not limited to, hard disk drives.
The voicemail platform 201 may also store data pertinent to each voicemail user. For example, the voicemail platform 201 may store an email address associated with each voicemail user. For a voicemail user who utilizes a voicemail to text conversion service, the converted text can be delivered to the user in the form of an email message addressed to the email address stored in the voicemail platform 201. The voicemail platform 201 may also store other data pertinent to each voicemail user, such as a user's name.
The voicemail platform 201 sends the voicemail, as well as username data of the voicemail user, to the speech engine 202. In this regard, the voicemail platform 201 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 3. According to the algorithm shown in FIG. 3, after receiving a voicemail for a user of a voicemail to text conversion service (S301), the voicemail platform 201 sends the voicemail and username data of the user to the speech engine 202 (S302). The speech engine 202 converts the voicemail to text, using the username data to correctly spell all instances of the user's name within the voicemail.
The speech engine 202 determines the correct spelling of the user's name from the username data. The username data may be an email address of the user. In this regard, a user's email address typically contains all or part of a user's name. For example, a person named ‘Rick Jones’ who is an employee of XYZ corporation may have an email address of ‘rick.jones@xyz.com’. The speech engine 202 may be capable of parsing the email address and extracting the correct spelling of the user's name from the email address.
Alternatively, the voicemail platform 201 may parse the user's email address and extract the correct spelling of the user's name from the email address, and then send the correct spelling of the user's name to the speech engine 202 as the username data.
The speech engine 202 performs a voicemail to text conversion algorithm to convert the voicemail to text. In this regard, the speech engine 202 may include a processing system including one or more processors programmed to perform the algorithm illustrated in FIG. 4. According to the algorithm shown in FIG. 4, the speech engine 202 receives the voicemail message and username data from the voicemail platform 201 (S401), and converts the voicemail message to text. During the conversion, the speech engine 202 recognizes every occurrence of the user's name within the voicemail message (S402), and uses the username data to correctly spell each corresponding occurrence of the user's name within the converted text (S403). The speech engine 202 may recognize the user's name, for example, by comparing phones within the voicemail message to a predetermined phone stored in the speech engine corresponding to the user's name. After the voicemail to text conversion algorithm is performed, the speech engine 202 sends the converted text to the voicemail platform 201 (S404).
The voicemail platform 201 then receives the converted text from the speech engine 202 (S303), and delivers the converted text to a device 203 of the user, such as, but not limited to, a phone, a PDA, a tablet device or a PC (S304). The converted text may be delivered to the voicemail user in a variety of formats, such as, but not limited to, an email or a Short Message Service (SMS) text message.
Although the invention has been described with reference to several exemplary embodiments, it is understood that the words that have been used are words of description and illustration, rather than words of limitation. Changes may be made within the purview of the appended claims, as presently stated and as amended, without departing from the scope and spirit of the invention in its aspects. Although the invention has been described with reference to particular means, materials and embodiments, the invention is not intended to be limited to the particulars disclosed; rather the invention extends to all functionally equivalent structures, methods, and uses such as are within the scope of the appended claims.
According to an aspect of the present disclosure, a voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system. The processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.
The converted text may be delivered to the device of the user in the form of an email message. The username data may be an email address of the user. The email address may contain at least part of the user's name.
The voicemail platform may determine the username data from an email address of the user. The device of the user may be a phone, a PDA, a tablet device or a PC. The voicemail platform may be operated by a telephone carrier, and the speech engine may be operated by a third-party vendor.
According to another aspect of the present disclosure, a method for providing a voicemail to text conversion service to a user includes storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, receiving a voicemail message for the user at the voicemail platform, sending the voicemail message and the username data from the voicemail platform to a speech engine, receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sending the converted text from the voicemail platform to a device of the user.
According to another aspect of the present disclosure, a non-transitory computer-readable medium storing a program for providing a voicemail to text conversion service to a user includes code for storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, code for receiving a voicemail message for the user at the voicemail platform, code for sending the voicemail message and the username data from the voicemail platform to a speech engine, code for receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and code for sending the converted text from the voicemail platform to a device of the user.
While a computer-readable medium herein may be shown to be a single medium, the term “computer-readable medium” includes a single medium or multiple media, such as a centralized or distributed database, and/or associated caches and servers that store one or more sets of instructions. The term “computer-readable medium” shall also include any medium that is capable of storing, encoding or carrying a set of instructions for execution by a processor or that cause a computer system to perform any one or more of the methods or operations disclosed herein.
In a particular non-limiting, exemplary embodiment, the computer-readable medium can include a solid-state memory such as a memory card or other package that houses one or more non-volatile read-only memories. Further, the computer-readable medium can be a random access memory or other volatile re-writable memory. Additionally, the computer-readable medium can include a magneto-optical or optical medium, such as a disk or tapes or other storage device to capture carrier wave signals such as a signal communicated over a transmission medium. Accordingly, the disclosure is considered to include any computer-readable medium or other equivalents and successor media, in which data or instructions may be stored.
Although the present specification describes components and functions that may be implemented in particular embodiments with reference to particular standards and protocols, the disclosure is not limited to such standards and protocols. For example, standards for power over ethernet represent an example of the state of the art. Such standards are periodically superseded by faster or more efficient equivalents having essentially the same functions. Accordingly, replacement standards and protocols having the same or similar functions are considered equivalents thereof.
The illustrations of the embodiments described herein are intended to provide a general understanding of the structure of the various embodiments. The illustrations are not intended to serve as a complete description of all of the elements and features of apparatus and systems that utilize the structures or methods described herein. Many other embodiments may be apparent to those of skill in the art upon reviewing the disclosure. Other embodiments may be utilized and derived from the disclosure, such that structural and logical substitutions and changes may be made without departing from the scope of the disclosure. Additionally, the illustrations are merely representational and may not be drawn to scale. Certain proportions within the illustrations may be exaggerated, while other proportions may be minimized. Accordingly, the disclosure and the figures are to be regarded as illustrative rather than restrictive.
One or more embodiments of the disclosure may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept. Moreover, although specific embodiments have been illustrated and described herein, it should be appreciated that any subsequent arrangement designed to achieve the same or similar purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all subsequent adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the description.
The Abstract of the Disclosure is provided to comply with 37 C.F.R. §1.72(b) and is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, various features may be grouped together or described in a single embodiment for the purpose of streamlining the disclosure. This disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter may be directed to less than all of the features of any of the disclosed embodiments. Thus, the following claims are incorporated into the Detailed Description, with each claim standing on its own as defining separately claimed subject matter.
The above disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments which fall within the true spirit and scope of the present disclosure. Thus, to the maximum extent allowed by law, the scope of the present disclosure is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description.

Claims (10)

What is claimed is:
1. A voicemail platform which provides a voicemail to text conversion service, comprising:
a memory which stores an email address and a name of a user for the user of a voicemail to text conversion service, the email address comprising the name of the user including at least a first and last name of the user; and
a processor which:
receives a voicemail message for the user,
sends the voicemail message and the email address together to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message,
receives the converted text from the speech engine which is converted from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message, and
sends the converted text to a device of the user.
2. The voicemail platform according to claim 1, wherein the converted text is delivered to the device of the user via an email message.
3. The voicemail platform according to claim 1, wherein the device of the user is one of a phone, a personal digital assistant, a tablet device and a personal computer.
4. A method for providing a voicemail to text conversion service, comprising:
storing an email address and a name of a user for the user of a voicemail to text conversion service in a memory of a voicemail platform, the email address comprising the name of the user including at least a first and last name of the user;
receiving a voicemail message for the user at the voicemail platform;
sending the voicemail message and the email address together from the voicemail platform to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message;
receiving the converted text at the voicemail platform from the speech engine which is converted, by a processor, from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message; and
sending the converted text from the voicemail platform to a device of the user.
5. The method according to claim 4, wherein the converted text is delivered to the device of the user via an email message.
6. The method according to claim 4, wherein the device of the user is one of a phone, a personal digital assistant, a tablet device and a personal computer.
7. A non-transitory computer-readable storage medium encoded with an executable computer program for providing a voicemail to text conversion service and that, when executed by a processor, causes the processor to perform operations comprising:
storing an email address and a name of a user for the user of a voicemail to text conversion service in a memory of a voicemail platform, the email address comprising the name of the user including at least a first and last name of the user;
receiving a voicemail message for the user at the voicemail platform;
sending the voicemail message and the email address together from the voicemail platform to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message;
receiving the converted text at the voicemail platform from the speech engine which is converted from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message; and
sending the converted text from the voicemail platform to a device of the user.
8. The non-transitory computer-readable storage medium according to claim 7, wherein the converted text is delivered to the device of the user via an email message.
9. The non-transitory computer-readable storage medium according to claim 7, wherein the device of the user is one of a phone, a personal digital assistant, a tablet device and a personal computer.
10. The voicemail platform according to claim 1, wherein the converted text is delivered to the device of the user via a short message service text message.
US12/828,677 2010-07-01 2010-07-01 System and method for voicemail to text conversion Active 2032-08-23 US9270828B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/828,677 US9270828B2 (en) 2010-07-01 2010-07-01 System and method for voicemail to text conversion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US12/828,677 US9270828B2 (en) 2010-07-01 2010-07-01 System and method for voicemail to text conversion

Publications (2)

Publication Number Publication Date
US20120002794A1 US20120002794A1 (en) 2012-01-05
US9270828B2 true US9270828B2 (en) 2016-02-23

Family

ID=45399723

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/828,677 Active 2032-08-23 US9270828B2 (en) 2010-07-01 2010-07-01 System and method for voicemail to text conversion

Country Status (1)

Country Link
US (1) US9270828B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10893013B1 (en) * 2017-08-22 2021-01-12 James Peter Morrissette Recipient notification of electronic message generated by voice-to-text engine

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8249568B2 (en) 2006-06-15 2012-08-21 Timothy Salmon System and method for processing a voice mail

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6775651B1 (en) * 2000-05-26 2004-08-10 International Business Machines Corporation Method of transcribing text from computer voice mail
US20050021339A1 (en) * 2003-07-24 2005-01-27 Siemens Information And Communication Networks, Inc. Annotations addition to documents rendered via text-to-speech conversion over a voice connection
US20070293272A1 (en) * 2006-06-15 2007-12-20 Timothy Salmon System and method for processing a voice mail
US20080167007A1 (en) * 2007-01-07 2008-07-10 Gregory Novick Voicemail Systems and Methods
US7542552B2 (en) * 2005-06-24 2009-06-02 Siemens Communications, Inc. Method and apparatus for generating default personal greetings using text to speech conversion
US7562552B2 (en) * 2005-11-10 2009-07-21 Tyco Electronics Corporation Crimp height adjustment mechanism
US7590550B2 (en) * 2006-09-08 2009-09-15 American Well Inc. Connecting consumers with service providers
US7693267B2 (en) * 2005-12-30 2010-04-06 Microsoft Corporation Personalized user specific grammars
US20110022386A1 (en) * 2009-07-22 2011-01-27 Cisco Technology, Inc. Speech recognition tuning tool
US8126718B2 (en) * 2008-07-11 2012-02-28 Research In Motion Limited Facilitating text-to-speech conversion of a username or a network address containing a username
US8204486B2 (en) * 2008-12-19 2012-06-19 Cox Communications, Inc. Dynamic messaging routing and audio-to-text linking for visual voicemail

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6775651B1 (en) * 2000-05-26 2004-08-10 International Business Machines Corporation Method of transcribing text from computer voice mail
US20050021339A1 (en) * 2003-07-24 2005-01-27 Siemens Information And Communication Networks, Inc. Annotations addition to documents rendered via text-to-speech conversion over a voice connection
US7542552B2 (en) * 2005-06-24 2009-06-02 Siemens Communications, Inc. Method and apparatus for generating default personal greetings using text to speech conversion
US7562552B2 (en) * 2005-11-10 2009-07-21 Tyco Electronics Corporation Crimp height adjustment mechanism
US7693267B2 (en) * 2005-12-30 2010-04-06 Microsoft Corporation Personalized user specific grammars
US20070293272A1 (en) * 2006-06-15 2007-12-20 Timothy Salmon System and method for processing a voice mail
US8249568B2 (en) * 2006-06-15 2012-08-21 Timothy Salmon System and method for processing a voice mail
US7590550B2 (en) * 2006-09-08 2009-09-15 American Well Inc. Connecting consumers with service providers
US20080167007A1 (en) * 2007-01-07 2008-07-10 Gregory Novick Voicemail Systems and Methods
US8126718B2 (en) * 2008-07-11 2012-02-28 Research In Motion Limited Facilitating text-to-speech conversion of a username or a network address containing a username
US8204486B2 (en) * 2008-12-19 2012-06-19 Cox Communications, Inc. Dynamic messaging routing and audio-to-text linking for visual voicemail
US20110022386A1 (en) * 2009-07-22 2011-01-27 Cisco Technology, Inc. Speech recognition tuning tool

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10893013B1 (en) * 2017-08-22 2021-01-12 James Peter Morrissette Recipient notification of electronic message generated by voice-to-text engine

Also Published As

Publication number Publication date
US20120002794A1 (en) 2012-01-05

Similar Documents

Publication Publication Date Title
US10714091B2 (en) Systems and methods to present voice message information to a user of a computing device
US10455091B1 (en) User input driven short message service (SMS) applications
US10818285B2 (en) Electronic device and speech recognition method therefor
CN107580149A (en) The recognition methods of outgoing call failure cause, device, electronic equipment, storage medium
US9148741B2 (en) Action generation based on voice data
CN108877792B (en) Method, apparatus, electronic device and computer readable storage medium for processing voice conversations
CN107430616A (en) The interactive mode of speech polling re-forms
US11720612B2 (en) System and method to avoid sending a message to the unintended recipient
US20120185240A1 (en) System and method for generating and sending a simplified message using speech recognition
US20100082674A1 (en) System for detecting user input error
CN111602133A (en) Compression of word embedding for natural language processing systems
US11790916B2 (en) Speech-to-text system
US9838486B2 (en) Checking documents for spelling and/or grammatical errors and/or providing recommended words or phrases based on patterns of colloquialisms used among users in a social network
US10673853B2 (en) Method and system for executing a transaction request using a communication channel
US9270828B2 (en) System and method for voicemail to text conversion
US8775183B2 (en) Application of user-specified transformations to automatic speech recognition results
US11532308B2 (en) Speech-to-text system
US9736297B2 (en) Phone number canonicalization and information discoverability
US10176151B2 (en) Text message displaying method and system, and text message providing system
KR20150103855A (en) Method and system of providing voice service using interoperation between application and server
US20220141161A1 (en) Bot proxy for group communication service
US20190108290A1 (en) Human assisted automated question and answer system using natural language processing of real-time requests assisted by humans for requests of low confidence
KR102347070B1 (en) Method and apparatus for processing information of terminal
CN113709506A (en) Multimedia playing method, device, medium and program product based on cloud mobile phone
US20140258398A1 (en) System and Method for Automatic Context Detection, Sharing, and Storage in Real-Time Communication Systems

Legal Events

Date Code Title Description
AS Assignment

Owner name: AT&T MOBILITY II LLC., GEORGIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:JACKSON, JACQUELINE;ZUBAS, MICHAEL;SIGNING DATES FROM 20100629 TO 20100701;REEL/FRAME:024625/0085

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: NUANCE COMMUNICATIONS, INC., MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AT&T MOBILITY II LLC;REEL/FRAME:041494/0590

Effective date: 20161214

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8

AS Assignment

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:065578/0676

Effective date: 20230920