WO2009158077A3 - Devices and methods used in the processing of converting audio messages to text messages

Info

Publication number: WO2009158077A3
Authority: WO, WIPO (PCT)
Prior art keywords: text, interface tool, devices, current invention, messages
Application number: PCT/US2009/044270
Other languages: French (fr)
Other versions: WO2009158077A2 (en)
Inventors: Daniel Michael Doulton, Robert Wheatley, Andrew Daborn
Original assignee: Spinvox Inc.
Priority date: 2008-05-15 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2009-05-15
Publication date: 2010-04-22
Application filed by Spinvox Inc.
Publication of WO2009158077A2: 2009-12-30
Publication of WO2009158077A3: 2010-04-22

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00: User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/06: Message adaptation to terminal or network requirements
    • H04L51/066: Format adaptation, e.g. format conversion or compression

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00: User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/07: User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail, characterised by the inclusion of specific contents
    • H04L51/10: Multimedia information

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M2201/00: Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40: Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M2201/00: Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60: Medium conversion

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04M: TELEPHONIC COMMUNICATION
    • H04M3/00: Automatic or semi-automatic exchanges
    • H04M3/42: Systems providing special services or facilities to subscribers
    • H04M3/50: Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers; Centralised arrangements for recording messages
    • H04M3/53: Centralised arrangements for recording incoming messages, i.e. mailbox systems
    • H04M3/533: Voice mail systems
    • H04M3/53333: Message receiving aspects

Abstract

In human-agent-assisted automated voice-to-text conversion processes, several devices and methods are used to improve the speed of conversion while maintaining quality and accuracy. The current invention is an interface tool used between the human agent and the devices used in the voice-to-text conversion process. In one aspect of the embodiment of the current invention, the interface tool is used to enter text into the text file during the conversion process. In another aspect, the interface tool is used to review, compare, and edit a previously converted or entered text file against its related audio file. The review interface tool not only edits and corrects the previously converted text file but also provides input data to increase the overall predictive capabilities of the system.
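The review step described in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the function name `extract_corrections`, the word-level comparison, and the use of the standard-library `difflib.SequenceMatcher` are all assumptions made for illustration. The sketch compares a previously converted draft with the text an agent produced after listening to the audio, and returns the changed phrases as pairs that a system could hypothetically feed back into its prediction component.

```python
from difflib import SequenceMatcher


def extract_corrections(draft: str, reviewed: str) -> list[tuple[str, str]]:
    """Return (draft_phrase, reviewed_phrase) pairs for every span the agent changed.

    Hypothetical helper: the patent record does not specify how corrections
    are captured. An empty string on either side marks an insertion or deletion.
    """
    draft_words = draft.split()
    reviewed_words = reviewed.split()
    # Compare word sequences rather than characters so pairs are whole phrases.
    matcher = SequenceMatcher(a=draft_words, b=reviewed_words, autojunk=False)
    corrections = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            corrections.append(
                (" ".join(draft_words[i1:i2]), " ".join(reviewed_words[j1:j2]))
            )
    return corrections


if __name__ == "__main__":
    draft = "please call me back at for thirty"
    reviewed = "please call me back at four thirty"
    print(extract_corrections(draft, reviewed))
```

Word-level alignment is chosen here only because it yields readable phrase pairs; a real system might align against audio timestamps instead, which this sketch does not attempt.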

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US5364008P 2008-05-15 2008-05-15
US61/053,640 2008-05-15

Publications (2)

Publication Number     Publication Date
WO2009158077A2 (en)    2009-12-30
WO2009158077A3 (en)    2010-04-22

Family

ID=41445162

Family Applications (1)

Application Number    Publication            Priority Date    Filing Date    Title
PCT/US2009/044270     WO2009158077A2 (en)    2008-05-15       2009-05-15     Devices and methods used in the processing of converting audio messages to text messages

Country Status (1)

Country Link
WO (1) WO2009158077A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9042867B2 (en) 2012-02-24 2015-05-26 Agnitio S.L. System and method for speaker recognition on mobile devices

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6282510B1 (en) * 1993-03-24 2001-08-28 Engate Incorporated Audio and video transcription system for manipulating real-time testimony
US6366882B1 (en) * 1997-03-27 2002-04-02 Speech Machines, Plc Apparatus for converting speech to text
US20050060159A1 (en) * 2003-09-17 2005-03-17 Richard Jackson Text transcription with correlated image integration

Also Published As

Publication number Publication date
WO2009158077A2 (en) 2009-12-30

Similar Documents

Publication Publication Date Title
EP2499582A4 (en) System and method for hybrid processing in a natural language voice services environment
HK1130935A1 (en) A method, a system and a device for converting speech
EP2157571A3 (en) Automatic answering device, automatic answering system, conversation scenario editing device, conversation server, and automatic answering method
DE60322985D1 (en) TEXT-TO-SPEECH SYSTEM AND METHOD, COMPUTER PROGRAM THEREFOR
WO2009111721A3 (en) Voice recognition grammar selection based on context
EP4239628A3 (en) Determining hotword suitability
WO2011044286A3 (en) Data analysis expressions
WO2010148141A3 (en) Apparatus and method for speech analysis
TW200739372A (en) Data combining method for a monitor-image device and a vehicle or a personal digital assistant and image/text data combining device
EP2306345A3 (en) Speech retrieval apparatus and speech retrieval method
BRPI0802614A2 (en) methods and apparatus for encoding and decoding object-based audio signals
WO2009126732A3 (en) Automated service-based order processing
WO2012018802A3 (en) Translating languages
EP1895512A3 (en) Multi-channel encoder
ATE515884T1 (en) SYSTEM AND METHOD FOR REALIZING A MULTILINGUAL CONFERENCE
WO2013162994A3 (en) Systems and methods for audio signal processing
WO2011069171A3 (en) Remote batch editing of formatted text via an html editor
WO2006091551A3 (en) Audio signal de-identification
WO2008038082A3 (en) Prosody conversion
WO2010013939A3 (en) An apparatus for processing an audio signal and method thereof
WO2004072846A8 (en) Automatic processing of templates with speech recognition
EP2963643A3 (en) Entity name recognition
GB201212435D0 (en) A transcription device and a method for transcribing speech
WO2006122106A3 (en) Processing information from selected sources via a single website
WO2011034561A8 (en) Method and system for processing an image received from a remote source

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application
    Ref document number: 09770612
    Country of ref document: EP
    Kind code of ref document: A2
NENP Non-entry into the national phase
    Ref country code: DE
122 EP: PCT application non-entry in European phase
    Ref document number: 09770612
    Country of ref document: EP
    Kind code of ref document: A2