CN1323435A - 提供网络协同会话服务的系统和方法 - Google Patents

提供网络协同会话服务的系统和方法 Download PDF

Info

Publication number
CN1323435A
CN1323435A CN99811616A CN99811616A CN1323435A CN 1323435 A CN1323435 A CN 1323435A CN 99811616 A CN99811616 A CN 99811616A CN 99811616 A CN99811616 A CN 99811616A CN 1323435 A CN1323435 A CN 1323435A
Authority
CN
China
Prior art keywords
session
network
equipment
local
network equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN99811616A
Other languages
English (en)
Other versions
CN1160700C (zh
Inventor
斯蒂芬·H·梅斯
波纳尼·格帕拉克里世南
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Inc
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN1323435A publication Critical patent/CN1323435A/zh
Application granted granted Critical
Publication of CN1160700C publication Critical patent/CN1160700C/zh
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • G06Q50/40
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/285Memory allocation or algorithm optimisation to reduce hardware requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4938Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals comprising a voice browser which renders and interprets, e.g. VoiceXML
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72445User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality for supporting Internet browser applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60Medium conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/35Aspects of automatic or semi-automatic exchanges related to information services provided via a voice call
    • H04M2203/355Interactive dialogue design tools, features or methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42204Arrangements at the exchange for service or number selection by voice
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/44Additional connecting arrangements for providing access to frequently-wanted subscribers, e.g. abbreviated dialling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4931Directory assistance systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres

Abstract

一个在网络连接的服务器和设备及它们相应的应用程序之间提供自动和协同共享会话资源,例如功能和主目,的系统和方法。一方面,一个提供自动和协同共享会话资源的系统包含:包含第一(100)和第二(106)网络设备的网络;此第一(100)和第二(106)网络设备每个包含一组会话资源(102、107)、一对话管理器(103、108)用来管理一会话和执行调用请求会话服务,以及一通信栈(111、115)用以使用会话协议在网络上传递消息,其中会话协议在第一和第二设备的对话管理器之间建立协同的网络通信来自动地共享第一和第二网络设备的一组会话资源,在需要时,执行它们各自所请求的会话服务。

Description

提供网络协同会话服务的系统和方法
本申请基于1998年10月2日递交的第60/102,957号美国临时申请和1999年1月27日递交的第60/117,595号美国临时中请。
本申请一般地涉及会话系统,并特别涉及一种用以在网络连接的设备、服务器和应用程序之间自动及协同共享会话功能/资源的系统和方法。
传统的会话系统(即,单纯具有语音I/O的系统或者具有语音I/O的多模式系统)一般地被限于个人计算机(PC)和具有合适结构和足够处理能力的本地机器。另一方面,对于电话技术应用,会话系统通常位于一服务器上(例如,IVR服务器)并可以经由传统电话和蜂窝电话而能访问。尽管这种会话系统变得更为流行,有代表性的是所有的会话处理或者在客户机端或者在服务器端执行(即,所有的配置或者是完全本地的或者是完全客户机/服务器式)。
随着普及计算的出现,期望数十亿计的低资源客户设备(例如,个人数据助理PDA,智能电话等)将被连网在一起。由于客户设备尺寸的减小和用户所期望这种设备所执行任务的复杂性的增加,传统的图形用户界面在这种小型的用户设备上不切实际,所以用户界面成为一个关键性的问题。因此,期望会话系统将作为用户界面的主要单元来提供纯语音/音频I/O或带有语音/音频I/O的多模式I/O。
从而,在便携式客户设备上,语音嵌入的会话应用正得到发展并进入成熟。不幸的是,由于资源有限,预期这种客户设备可能不能执行复杂的会话服务,例如语音识别(尤其是当词汇量非常大或特殊或者需要领域特定/应用特定的语言模型或语法时)、NLU(自然语言理解)、NLG(自然语言产生)、TTS(文本到语音合成)、音频捕捉和压缩/解压缩、重放、对话产生、对话管理、说话者识别、主题识别、音频/多媒体索引和查找等。例如一个设备的存储器和CPU(和其他资源)的限制可以限制这种设备能提供的会话能力。
此外,即使一已联网设备功能足够强大(在CPU和存储器方面)以便执行所有这些会话任务,但该设备可能没有合适的会话资源(例如,引擎)或者会话主目(argument,即被引擎用到的数据文件)(例如,语法、语言模型、词汇表文件、语法分析、标记、声纹和TTS规则等)来执行合适的任务。确实,一些会话功能可能太特殊而适于特定的服务,因此要求仅能从网络上的其它设备或者机器才可得到的后端信息。例如,因为用来产生对话需要的整组会话主目或者功能(例如语法分析器、标记器、翻译器等)或者需要很大数量的存储器来存储(在客户设备上不能得到)或者太广泛(依据通信带宽)以至于不能传递到客户机端,所以一个客户设备上的NLU和NLG服务典型地需要服务器端支持。这个问题在多语种应用时被进一步加重,当一个客户设备或者本地应用程序没有足够的存储器或者处理能力来存储和处理这些主目,这些主目是需要的,以便按多语种来处理语音和执行会话功能。代之,用户必须手动连接到远程的服务器上来执行这种任务。
同时,与客户机和服务器间的分布式结构和分布式处理有关的问题需要新的会话连网方法。这种方法包含了通信量和分布在网络上的资源的管理,以保证参与该网络上会话交互的每一个用户的合适的对话流。
因而,允许一个有有限资源的网络设备自动利用连网资源、以一种对用户来说自动的和透明的方式来执行复杂的会话任务的系统和方法是非常需要的。
本发明针对一种在网络连接的服务器和设备(和它们相应的应用程序)之间提供自动的和协同的共享会话资源的系统和方法。根据本发明的一个实施例,系统包含多个连网的服务器、设备和/或应用程序,通过使用会话网络协议(或者方法)传达消息,使得相互之间“明白会话”,协议自动地允许每一明白会话的网络设备自动地并以协同和同步的方式共享会话资源,以便通过某一网络设备之一的界面来提供一个无缝的会话界面。
根据本发明的一个方面,一个提供自动的和协同的共享会话资源的系统,包含:
一个网络,它包含至少第一和第二网络设备;
这第一和第二网络设备每个包含
一组会话资源;
一个对话管理器用以管理会话并执行调用来请求会话服务;以及
一使用会话协议在整个网络上传达消息的通信栈,在此通过会话协议传达的消息在第一和第二设备的对话管理器之间建立协同的网络通信,以自动地共享第一和第二网络设备成组的会话资源,当需要时,去执行它们各自请求的会话服务。
本发明允许一个低资源的客户设备透明地本地执行简单的任务,又以二进制或模拟方式与一具有更复杂会话能力的服务器(或者别的设备)相连接透明地执行复杂任务。服务器端的功能(例如语音识别)可以通过一个常规的IP网络或者LAN网络,也可以经由传统的电话线路或包交换网络数字传输,或者在无线网络上经由任何传统的无线数据协议而能被执行。
有利的是,本发明在任何有限CPU、存储器和处理能力(也含有限的会话资源)的设备(如盛行的嵌入式设备)上提供一完全成熟的会话用户界面,这使利用低资源的客户设备不需要下载,例如来自一网络服务器所必需的会话主目,便可提供复杂的会话服务。本地能力允许用户利用本地设备而不需请求连接,例如无线电话供应商的外部覆盖。同时,持续连接的花费也减小,并且当持续连接丢失时恢复的难度能够减轻。
在接下来的优选实施例的详细描述中,本发明的这些和其他方面、特征及优点将会被描述并变得明了。
图1是根据本发明的一个实施例在连网的设备之间通过自动的和协同的共享会话资源来提供会话服务的一个系统框图;
图2是根据本发明的一个方面在连网的设备之间通过自动的和协同的共享会话资源来提供会话服务的一个方法流程图;
图3是根据本发明的另一个方面在连网的设备之间通过自动的和协同的共享会话资源来提供会话服务的一个方法流程图;
图4是根据本发明采用会话浏览器的另一个实施例提供会话服务的分布式系统的一个框图;以及
图5是根据本发明采用会话浏览器的另一个实施例提供会话服务的分布式系统的一个框图。
将会了解到本发明可以被以不同形式的硬件、软件、固件、专用处理器或者它们的组合实现。更可取地,本发明作为一个包含程序指令的应用程序以软件实现,它们确实地嵌入在程序存储器设备上(例如软盘、RAM、CDROM、ROM和闪存器),并且可以被任何设备或者包含合适结构的机器执行,如一个或者多个中央处理器(CPU)、一随机存取存储器(RAM)和音频输入/输出(I/O)接口。
将会进一步了解,因为在附图中描述的组成系统的某些部件和方法步骤优选以软件实现,系统组件之间的实际联系(或者处理步骤)随本发明被编程的方式而可能不同。此间给出的说明,一个相关领域的一般技术人员将能设想到本发明的这些和相似的实现方法或结构配置。
现参看图1,框图阐明了根据本发明的一个具体范例在连网设备之间通过自动的和协同的共享会话资源和会话主目(数据文件)来提供会话服务的一个系统。该系统包含一本地客户设备100,客户设备100包含一个声学前端101来处理音频/语音输入和输出由客户设备100产生的音频/语音。例如,客户设备100可以是一个智能电话或任何能处理语音的PDA(个人数字助理)。客户设备100还包含一个或者多个本地会话引擎102用来处理声特征和/或由声学前端101产生和/或捕捉的波形并产生对话输出给用户。本地会话引擎102可以包含,例如,一个嵌入式语音识别,一个说话者识别引擎,一个TTS引擎,一个NLU和NLG引擎和一个音频捕捉和压缩/解压缩引擎及任何其它类型的会话引擎。
客户设备100还包含一个本地对话管理器103它执行任务管理并控制和协同一经由系统调用(API或者协议调用)请求的会话服务(或者本地的或者经由网络设备)的执行,同时也管理本地的和与连网设备的会话。更特别地,如下所详细描述的,对话管理器103确定是否一个给定的会话服务将在客户100上本地地或者是在远程网络连接的服务器(或者设备)上处理和执行。这个决定是基于如客户100的会话能力和其它网络连接的设备能力的比较等因素,也基于为处理这一请求的会话服务可能必需的可用资源和会话主目。其它因素包栝从连网的设备接收结果时网络通信量和预期的延迟。对话管理器103执行任务管理和资源管理任务,如装载管理和资源分配,同时也管理本地会话引擎102和能处理语音的本地应用程序104之间的对话。
如图1的示例,客户设备100经由网络105与服务器106连网,服务器106包含服务器应用程序109,也包含向客户设备100(或者任何其它网络设备或者应用程序)在需要时提供会话服务的服务器会话引擎107。和本地会话服务器引擎102一样,服务器引擎107可以包含如一个嵌入式语音识别、一个TTS引擎、一个NLU和NLG引擎和一个音频捕捉和压缩/解压缩引擎及任何其它类型的会话引擎。服务器106包含一个服务器对话管理器108,服务器对话管理器108操作方式和上面描述的本地对话管理器103相同。例如,服务器对话管理器108确定是否一个从本地对话管理器103来的一个会话服务请求被服务器106或者是远程网络连接的服务器或者设备处理和执行。此外,服务器对话管理器108管理服务器会话引擎107和能处理语音的服务器应用程序109之间的对话。
图1的系统进一步描述了客户设备100和远程服务器106被连网到一个有会话引擎和/或能被客户100和服务器106在需要时存取的会话主目的服务器110上。网络105可以是如因特网、LAN(局域网)、公司内部网、PSTN(公共交换电话网)、或者无线网(经由RF(射频)、或者IR(红外线)无线通信)。可以理解,尽管图1描述了一个客户机/服务器系统,其中术语可以被擅长该技术的人理解,图1的系统可以包含许多连网的服务器、设备和相互“明白会话”的应用程序来提供自动的和协同的共享会话功能、主目和资源。如下文更详细的说明,这种“明白会话”可以通过利用会话网络协议(或者方法)传输被各自的对话管理器处理的消息来达到,使连网的设备以一种自动的和同步的方式共享会话资源和功能。这种会话协同提供了一个无缝的会话界面,通过网络设备之一的界面来访问远程服务器、设备和应用程序。
特别是,在连网设备之间提供会话协同来共享它们的会话功能、资源和主目,每一连网设备利用会话协议(或者方法)传递消息来交换关于它们会话能力和需求的信息。例如,如图1所示,客户设备100包含一个利用会话协议112、会话发现、注册和协商协议113和语音传输协议114(或者会话编码协议)用以传送和接收消息的通信栈111。同样,服务器106包含一个服务器通信栈115,服务器通信栈115包含会话协议116、会话发现、注册和协商协议117和语音传输协议118。这些协议(方法)在同时递交的、名称为“Conversational ComputingVia Conversational Virtual Machine”的专利申请(IBM Docket NO.Y0999-111p)中相对于CVM(会话虚拟机器)得以详细讨论,此专利已一般转让,此处加以引用。
简单的说,会话协议112、116(或者说是在Y0999-111P中称为“分布式会话协议”)是允许连网设备(例如,客户机100和服务器106)或者应用程序与其它网络设备的对话管理器传输消息,以便注册它们的会话状态、主目和上下文的协议(或者方法)。会话协议112、116也允许设备交换其它信息如Java小应用程序、ActiveX构件和其他可执行的编码,这些编码允许设备或者相关的应用程序来协调按照如主/从或对等会话网络配置的这些设备之间的会话。分布式会话协议112、116允许信息交换来协同包含多设备或者应用程序,包括主/从会话网络、对等会话网络和匿名伙伴的会话。在连网设备之间利用分布式会话协议可以交换的信息包含指向数据文件(主目)的指针、数据文件和其他会话主目的传输(如果需要)、输入通知、输出事件和识别结果、会话引擎API调用和结果、状态通知和上下文改变及其他系统事件、注册更新:注册协商更新的握手信号:协商握手信号、及当一请求的资源丢失时的发现更新。
本(分布式)会话协议也包含对话管理器(DM)协议,此协议允许对话管理器去分配服务、行为和会话应用、I/O和引擎API如在IBMDocket NO.Y0999-111P中所描述的。例如,DM协议允下列信息被交换:(1)DM体系结构注册(例如,每一DM可以是本地DM的汇集);(2)与元信息关联的指针(用户、设备能力、应用需求等);(3)DM网络拓扑协商(例如,主/从、对等);(4)数据文件(会话主目)如果可用,即,如果引擎被利用则被一个主DM控制;(5)I/O事件通知,如用户输入、输出到多个用户以传输到引擎和/或加到上下文;(6)识别事件通知;(7)传输被处理的输入从引擎到一主DM;(8)传输主DM的职责到注册的DM;(9)DM处理结果事件;(10)DM异常;(11)可信度和模糊性结果的传输,建议的反馈和输出,建议的期望状态,建议的操作,建议的上下文改变,建议的新的对话状态;(12)通知决定,上下文更新,动作更新,状态更新等;(13)完成、失败或者被中断动作通知;(14)上下文改变通知;和/或者(15)由于动作引起的数据文件、上下文和状态的更新。
例如,在主一从网络配置中在任何时候仅仅连网设备之一驱动会话。特别是,主设备(即主设备的对话管理器)管理和协调网络设备间的会话并决定哪一设备来执行给定的会话服务或功能。这个决定可以基于每一设备或者应用程序提供的关于它们会话能力的信息。这个决定也可以基于主设备决定那一个从设备(具备所需的会话能力)能最佳地执行给定的会话功能。例如,主设备可以要求多个从设备来执行语音识别并提供结果给主设备。主设备然后选择最佳的结果。可以理解这里在语音识别水平上所描述的是分布式对话管理器之间在DM(对话管理器)协议水平上的机制(如在Y0999-111P中所描述)。确实,当在多个对话管理器之间发生对话时,主设备将获得每一对话管理器结果得分的度量,从而作出决定看好哪一个对话管理器继续进行输入,不但要根据语音识别的准确性,并且也要根据对话(含意)、上下文和历史,也考虑其他项目,如用户的优先选择、历史和应用的偏好。
在对等连接中,每一设备将试图确定自己能执行的功能并记录一个请求去执行之。可接受到任务的设备将执行这个任务然后对其执行性能评分。然后依据它们的评分设备协商哪一个设备将执行这个任务。
在一个实施例中,分布式会话协议112、116经由RMI(远程方法调用)和RPC(远程过程请求)系统调用,以完成应用程序和整个网络上不同会话引擎之间的调用来实现。行内人都知道,RPC是允许一个应用通过网络从另一个应用请求一个服务的协议。同样地,RMI是在分布式网络上对象可以交互的一种方法。RMI允许一个或者多个对象随同请求被传递。此外,信息也可以被存储在一个对象中,该对象经由CORBA或者DCOM被交换,或者以说明性的方式呈现(例如经由XML)。如在上面插入的专利申请IBM Docket NO.Y0999-111P中所讨论,会话协议(方法)(或者分布式协议)可以被用来经由会话API在会话应用和CVM命外壳之间或者经由会话引擎API在CVM和会话引擎之间获得由一个CVM(会话虚拟机器)外壳支持的会话功能的分布式实现。会话引擎API是核心引擎和应用程序之间的界面,通过这个界面和协议与核心引擎(本地的或者连网的)通信。会话API提供一个API层来挂钩或者发展明白会话的应用,包括建立会话用户界面的基础类和组件。
同样地,根据本发明一对话管理器可经由API与应用程序和引擎(本地的或者连网的)通信。通过这种方式,一对话管理器可以对来自所有远程过程的结果和回调起作用(对远程引擎和应用程序的过程调用),如同它是一个本地的应用,以至于,例如,在应用和资源(本地的或者连网的)之间仲裁、区分优先次序和确定激活的应用,以及确定那一个结果认为是有效的。
会话发现、注册和协商协议113、117是被用作“发现”本地的或者网络的明白会话的系统(即“讲”会话协议的应用程序或者设备)的网络协议(或者方法)。注册协议允许设备或者应用程序注册它们的会话功能、状态和主目。协商协议允许设备协商主-从、对等或者匿名伙伴网络。
在一个实施例中,发现协议实现一个“广播和收听”方法来触发一个从其它“广播和收听”设备来的反应。这能允许如网络动态地和自发地创建(例如下面讨论的蓝牙和Hopping网络)。在另一个实施例中,一个缺省的服务器(可能是主设备)装置可以被利用它注册不同网络设备的“地址”。在这个实施例中,发现总计网络上与服务器通信的各个设备来核对注册的设备列表,以便确定那些设备与这些设备连接。经由发现协议交换的信息包括如下:(1)握手信号的广播请求或者收听请求;(2)设备标识符交换;(3)第一次注册时的句柄/指针交换;和(4)第一次协商时的句柄交换。
在实现注册协议的一个实施例中依据连接,设备可以以一预定的协议(例如,TTS英文、任何文本、语音识别、500字+FSG语法、无说话者识别等)通过交换一组标志或一设备特性对象来交换关于它们会话能力的信息。同样,应用程序可以交换引擎需求列表。通过一个主/从网络配置,主对话管理器可以编辑所有列表并使功能和需要与会话能力相匹配。在没有主设备(对话管理器)的情况下,一个普通服务器可以被用来传输会话信息到网络上每一个机器或者设备。注册协议允许以下信息被交换:(1)能力和装载消息包括定义和更新事件;(2)引擎资源(是否一个给定设备包括NLU、DM、MLG、TTS、说话者识别、语音识别压缩、编码、存储等);(3)I/O能力;(4)CPU、存储器和装载能力;(5)数据文件类型(范围指定、字典、语言模型、语种等);(6)网络地址和特征;(7)关于用户的信息(定义和更新事件);用户对设备、应用或对话的偏好;(9)用户化;(10)用户经验;(11)帮助;(12)每一应用(和应用状态)(定义和更新事件)的能力需求;(13)CUI服务和行为的元信息(帮助文件、编目、会话优先权等)(定义和更新事件、一般通过指向表格的指针);(14)协议握手;和/或(15)拓扑协商。
注册可以使用传统的通信协议如TCP/IP、TCP/IP29、X-10或者CEBus和设备之间的套接口实现。设备利用一个分布式会话结构与它们相关的会话引擎和对话管理器及它们的会话主目(例如,激活的词汇表、语法和语言模型、语法分析和翻译/标记模型、声波纹、合成规则、基本格式(发音规则)和声型)通信。这个信息或者以文件或者以流的形式传递给对话管理器和会话引擎或者作为URL。此外,上下文信息可以通过指示通路或者指到设备的上下文栈/历史,或控制器可以访问和添加到其上下文栈的应用而能被传递。设备也传输关于它们多模式I/O和UI能力(屏幕/无屏幕、音频入和出的能力、键盘等)的信息。会话主目允许会话引擎基于当前状态和上下文估计NLU引擎有关的新询问。
至于协商协议,依据各个网络设备的注册需求和能力,在协商期间网络设备和应用程序可以对一给定的暂时性配置进行投票。如果一个应用利用此配置(即拓扑),则决定被自动地强加。否则它可以是请求成为一个主设备或者从设备或者对等机。根据请求计数,一个优选的配置被决定并与所有的设备和应用程序通信(保持在每个设备/应用程序的可用资源表格中)。无论什么时候一个系统改变它的状态和需求时,它将会与其它连网的对话管理器/资源通信来开始一个新的协商并沟通新的状态和上下文信息。
语音传输协议114、118允许设备传送和接收压缩的语音或者本地处理结果到/从网络上的其它设备和应用程序。会话引擎102、107更适宜包括压缩/解压缩引擎用以传输时压缩语音(或者结果)和为了本地处理对通过网络从另一设备或应用程序获得的压缩语音(或者结果)进行解压缩。语音传输协议被在设备中执行的语音传输的客户机利用来向/从其它连网设备、系统或者应用程序传送/接收被压缩的语音进行处理。设备的语音传输客户机与压缩、解压缩和重建引擎联合操作,利用合适的压缩硬件,处理通过网络传输的语音。语音编码器提供感知可接受的或可理解的被压缩语音的重建和优化的会话性能(如,字错率)。语音在各自的连网设备上利用声信号处理引擎(音频子系统)和合适的音频硬件被捕捉(和转换成特征)。此外,压缩的语音文件格式在处理语音的设备之间可以被传输和接收。特别是,语音传输协议允许设备向/从网络上的其它设备和应用程序传输和接收压缩的语音或本地的处理结果。在一个实施例中,当在一个传输设备和一个接收设备之间握手处理后,一数据流(基于信息包)被发送到接收器。包的头最好指出为编码此语音(或者结果)所利用的编码方案和编码主目(即,抽样频率、特征特性、维数、前端应用的变换、前端的性质等)。此外,纠错信息也可以被引入(例如,如果前一个包丢失或者延迟,前一个包的最后特征矢量纠正差动解码器)或者合适的消息来恢复(重发)这些丢失的包。
此外,对话管理器可以经由对话管理器协议或者DM协议通信。(如在上面引用的IBM Docket No.Y0999-111P中所讨论的)。DM协议可以被用来在多个对话管理器之间协商哪个对话管理器被激活,或者哪个对话管理器应该接收该输入。因为在本实施例中,服务器资源仅仅当被真正需要时被“轮询”,DM协议提供一种变化:本地对话管理器进行一预先测试以决定是否此功能应该被远程执行。在有错误产生或者存在疑惑的情况下,对话管理器可以等待本地引擎的推测并在仍存有疑惑时决定轮询一服务器来比较。
因此,根据上面的讨论,应该明白网络协议提供在连网设备之间的协同(或者一个协同界面)来共享会话服务和功能。术语协同界面意味着一单个会话可以在不同的参加者(设备/应用程序)之间被把握,如同它们都理解整个会话并合适地知道在任意给定的时间谁被访问。每一会话系统的行为或者应用可以被对话管理器(例如,主-从模式中的主设备)、应用(此应用可以确立谁是主、从或者对等)、系统对话管理器(如果有)、组织和协商(在对等模式中)控制/管理,在合适的系统上对用户透明地执行每一会话功能。对于一个客户设备上的用户来说,提供了一个无缝的会话界面(即,所有的会话交互似乎是经由一个单一的会话系统),尽管某些会话功能、系统和资源可能被提供几个连网的设备(例如,一蜂窝式电话、一呼机和一PDA)。
协同的会话系统的一个例子(上面描述的公开在前面引用的的IBMDocket No.Y0999-111P中)是称作UCA(通用会话设备)的远程控制。UCA发现明白会话的设备。每个会话连接的设备将发送它的会话主目(词汇表和语法)到UCA。UCA作为这种设备的主设备并且当一会话交互随同用户结果成为一命令到此设备时将更新适当的设备。相反地,依据命令的执行或者每次设备改变状态时,它将发送一个更新到远程控制。一个没有其他会话能力(除了相应主目)的会话设备是那个被称作“匿名伙伴”的。
在另一个实施例中,一个服务器或者基站,除音频捕捉压缩和由远程控制(或UCA)执行的传送外可以执行所有的会话功能。远程控制也可以提供一些UI给用户来通知他/她不同设备的状态。这可以经由语音、GUI或者这些形式(或者其它)的任何会话的组合完成。
尽管会话网络拓扑可以与图1的系统联系起来使用,一个最优的网络拓扑是提供一种自发的动态连网(即,在一定的通信范围内的设备之间自发的建立的一网络)。这种自发的连网可以通过应用当前新兴的例如描述在http://www.bluetooth.com上的“蓝牙”连网协议实现。简单地说,蓝牙是给网络协议的一个代号,蓝牙网络协议提供特别的无线网络的连通性。更特别的是,蓝牙是在一个特定的范围内的设备(例如,智能电话、蜂窝式电话、呼机、PDA、便携式电脑、移动设备等)之间在特定的范围内提供短程无线射频链路来动态地和自发地在这样的设备之间建立一个网络(或是大家知道的“piconet”(微网))。一个微网指的是在网络连接的剩余期间与在每个微网中扮演主设备的一个结点以特殊方式连接的蓝牙允许设备(结点)的集合。两个或更多的微网可以被网络连接以构成所谓的一个“scatternet”(分散网)。
看得出,依照本发明任何自发的动态连网协议可以被实现。例如,图1的网络拓扑可以依照在美国专利申请序列号No.09/198,378描述的“hopping”通信网络实现,该专利申请1998年11月24日递交、名称为“Automated Traffic Mapping”,此专利已一般转让,此处引用以做参考。
现参看图2,图2是描述根据本发明的一个方面在连网设备之间提供自动和协同共享会话功能的一种方法的流程图。特别是,图2进一步详细描述了图1系统操作的一种方法。开始,用户发出一个口头命令(或者否则发出一个询问)到本地客户设备,这个口头命令或询问通过数字化或抽取数字化语音信号的相关特征而被预处理过(步骤200)。作为选择,本地对话管理器可以接收从本地应用程序104传来的请求产生合成语音(TTS)以输出到该用户(步骤200)。
(经由本地对话管理器103)决定本地处理是否可用(步骤201),例如,是否语音识别或者语音合成可在本地执行。对于这个决定可以看出,本地对话管理器103可以明确地预先确定识别/合成必须在其上发生的一个远程服务器,(例如,一IP地址对于套接口连接的、一URL地址对于经由小服务程序(servlet)基于服务器登记,或者一电话号码对于直接连接或连接到一个IVR的)。此外,本地设备没有可以利用的资源或者主目来执行(或者有效地执行)一任务的确定,也可由对话管理器103依据本地应用程序104的执行根据由该应用程序在头文件中指示的资源需求而作出。另外,某些命令或者请求的功能可以引起对话管理器自动地连接到一个远程服务器。例如,安全性应用程序(如,说话者确认)可以自动地被切换到服务器端处理,所以声波纹不被分配到客户机。再者,本地汽车导航系统可以利用电话或者基于套接口的服务器能被自动地切换到远程服务器,以致于本地设备不必存储大量的导航信息。
如果确定本地处理是可行的(肯定的确定在步骤201),则处理将经由本地引擎102在本地执行(步骤202)。另一方面,如果确定本地处理是不可行的(否定的确定在步骤201),那么相关的特征/波形/信息被自动传输给一个远程网络连接的服务器(步骤204)(经由IP、LAN、蓝牙、IR、RF或者经由电话或者IP电话),在其中远程处理(例如,语音识别/合成)被执行(步骤205)(可能与某个用户/服务器交互)。
显然从本地客户机到远程网络连接的服务器的语音传输(反之亦然)可以利用不同的技术实现。例如,可以当作文件、流或者信息包流直接传送波形。另外,一个被压缩的波形可以利用传统的方法如ADPCM和APC传输。而且,一个特征的流可以依据G.Ramaswamy等人在“Compression Of Acoustic Features For Speech Recognition InNetwork Environments,”Vol.2,pp.977-980,Proc.ICASSP,1998公开的方法传输,此处引入以供参考。此方法允许在接收端识别(语音识别、说话者识别或者NLU)而不重建信号。此外,语音传输可以利用任何编码方法或者基于压缩特征和音调估计的方案,允许语音信号以足够可理解的质量和平滑舒缓的重放(用以重放、校正、进一步人性处理、或者归档)而被重建。这种编码方案应该提供低到4kbits/s和5kbits/s之间的数据率而不降低识别性能。其结果是以后端(服务器)资源支持的的交互式交换,甚至通过无线调制解调器或者无线数据链路,就可以实时执行。可以理解,利用相似的编码方案若要提供非常高质量的重放,其它的方案必须被采用。另外,任何允许cepstra特征和音调压缩、允许在服务器端识别(语音、说话者、NLU)且在接收端没有退化并重建信号的方法可与本发明结合使用。此重建对于以后从服务器的重放或者从客户机(若是本地存储)的重放和处理,为随后的校对抄本、错误校正或者人的处理监控是有用的。可以理解任何合适的压缩方案(编码协议)可以被利用。
可以理解,压缩或者编码方案(传送协议或编码协议)在不同的设备之间可以不同。例如,从音频捕捉系统(客户机的)传送输入语音到连网资源的编码可能不同于从连网资源(服务器)到音频输出(客户机)用于传送输出语音(例如提示、重放或者TTS)的编码协议。确实,在第一种情况下,编码应该被优化以在服务器端提供良好的识别性能,回放的重建固然重要但并不引人注目地重要。当然比特率(压缩率)是重要的。压缩率的折衷可在鲁棒性(错误率-特征失真)和知觉质量之间作出调整以达到和保持一个目标比特率。同样,某些方案可以被选择以给某些信道或者背景失真增加鲁棒性。另一方面,对于后面的任务(输出信号),应该为了可理解或者知觉质量和舒适性,或者保持声音或音频某些特殊的特征而优化编码。经过本地处理(步骤202)或远程处理(步骤205)后,一个关于该处理结果是否被接受的决定(步骤203和206)被作出(经由本地对话管理器103或者服务器对话管理器108)。如果确定处理的结果是不可接受的(否定决定在步骤203和206),则本地客户机或者远程服务器将自动地(经由IP、LAN、蓝牙、IR、RF或者经由电话或者IP电话连接)转发该特征或者波形给能执行这种处理的服务器(步骤204和步骤207)。例如,如果这种结果是未知或者没有被识别或者是模糊不清(或者基于与每个资源(本地的或服务器)的对话管理器相关的可信度度量),则对语音识别结果或NLU的拒绝可能发生。更可取的是,从本地的或者远程系统到一个服务器系统的自动连接可以基于声音或者被返回的LM(语言模型)得分的水平上,该得分由本地语音解码器应用如在美国专利申请5,937,383,Ittycheriah等人、名称为“Apparatus and Methods ForSpeech Recognition Including Individual or Speaker ClassDependent Decoding History Caches For Fast Word Acceptance orRejection”中所教导的技术,此专利已一般转让,此处引入以供参考(例如,当这些得分被判定低于一个给定的阈值时远程服务器被连接)。可以理解,估计可信度或识别说话和询问(在其期间或者对话以后)的任何合适的度量或者方法可以被用来决定一个被会话系统获得的结果是否可以接受(在这种情况下另一个系统被考虑)。
相似地对TTS,本地的和远程的对话管理器103、108可以检查一个文本的复杂性来决定是否此TTS将被本地或者远程地执行。例如,当一个字的发音规则不知道时或者当文本需要复杂的句法分析时TTS将被远程执行。另一个例子是如果此TTS必须被用一个不同的口音、方言或者一种不同的语种发音或者假定模仿某人特定的句子时。
当处理被远程执行后,结果被发送回本地客户机(步骤208)(经由电话、IP地址、MAC(媒体存取控制)地址等)。很明显,输出(即,给用户输出的语音)可以被本地地或者在服务器上合成。如果合成是在服务器上进行,被合成的语音可以以压缩格式传输(利用上面讨论的语音传输协议)给用户在本地解压缩。可以理解,编码方案可以和用于从客户机到服务器传输语音特征的方案相同或者不同。作为选择,例如,在另外的模拟PSTN线路上,用通过从客户机到服务器的电话呼叫(回叫)建立的联接,语音可以通过服务器被直接“广播”。
最近的努力开始发展识别语音的合适的可信性度量。例如,在“LVCSR Hub5 Workshop”,1996年4.29-5.1,MITAGS,MD,由NIST和DARPA组织,提出不同的方法来把一个可信度级别加于一个字一个可信度级别上。一个利用决策树的方法在字相关特征(训练说话的数量、最小和平均三音素(triphone)事件、语言模型训练事件、音素/lefemes数量、持续时间、声音得分(快速匹配或者精细匹配)、语音非语音),句子相关特征(信号噪音比、讲话速率估计:每秒钟的字数或lefemes数或者元音数、语言模型提供句子的似然、似然率、每帧规一化平均似然、语言模型中的三字母组事件),上下文中的字特征(语言模型中三字母组事件)及说话者外貌特征(口音、方言、性别、年龄、讲话速率、身份、音频质量、SNR等…)上进行训练。对于此树的每一个叶在训练数据上都计算了错误的概率。构造这样一个树的算法被Breiman等人在“Classification and RegressionTrees”,Chapman & Hal,1993中讨论过。识别方面,所有的或这些特征中的一些在识别过程中被度量,并且对于每一个字,决策树走到提供一个可信度级别的叶。此外,参考Neti等人的标题为“Word BasedConfidenee Measure As A Guide For Stack Search In SpeechRecognition”,ICASSP97,慕尼黑,德国,1997.4一文,描述了一完全依靠IBM栈解码器返回的得分的方法(使用对数似然一实际上用的是平均增量对数似然、精细匹配、快速匹配)。
在LVCSR处理中,使用通过线性回归的预测器估计可信度级别的另一种方法被执行。被利用的预测器是字的持续时间、语言模型得分、每一帧平均声音得分(最好得分)和NBEST列表中同顶部选择相同的字的一部分。显然根据本发明的一个实施例,两种方法(经由决策树度量的可信度级别和经由线性预测器度量的可信度级别)结合起来,在任何转换处理中系统地提取可信度级别,并不局限于语音识别。
基于过去的改进和该领域的飞速发展,现在我们可以说对于几种转换可以连系一个可信度值到正被转换的组件上,例如,从0到1,0意味着无转换被执行,1意味着转换无疑,此处组件可以是要转换的文本、短语、字和更一般地任何材料的逻辑块。如上描述的线性预测器和决策树的结合是本发明优选采用的一种方法。确实,作为例子,由说话者识别引擎返回的得分累计(快速匹配得分和精细匹配得分及背景模型和同伴的得分)可以被用来构建可信度级别的一决策树和/或一线性预测器,这样说话者真正被正确鉴别。实际上,在说话者识别的情况下,这个执行验证的总量与识别阶段获得的相等。
可以看出远程服务器可以发送信息例如TTS规则或者基本格式、语法等给本地客户机以存储在高速缓存中,这样本地设备随后可以应用这种信息本地处理类似的请求。因为本地设备由于缺乏所需要的资源而不能执行某些任务,由服务器对话管理器108发送这种处理信息给本地客户机的决定可以根据由本地设备在与远程服务器连接时关于它的会话能力向远程服务器的注册(经由上面所讨论的注册协议)而被作出。
显然本发明可以在这种情况下实现,此时由一设备(经由它的对话管理器)执行的会话功能量,设备是不能提供必须的资源用于这些功能的即时执行(例如,IVR有太多的被系统同时使用的端口)。因而,对话管理器可以实现提供会话系统管理和装载管理,借此,在一个特定的功能执行时对话管理器可以决定利用另一个会话系统来继续处理被请求的功能。特别是,参看图3,开始,用户发出一个口头命令给本地客户设备,它是被预处理过的,例如,被数字化和提取数字化信号的相关的特征(步骤300)。作为另一种选择,本地对话管理器可以从一本地应用程序104接收一请求来产生一个合成语音(TTS)输出给用户(步骤300)。对话管理器将决定是否本地处理应该被执行(步骤301)(例如,是否要语音识别、对话管理或者语音合成)。这个决定根据的不仅是本地会话能力、主目和资源(如上面所讨论的),而且也根据由于网络交通拥塞网络引起的延迟与利用可用的但被约束的本地资源(假定本地的和远程的设备可以执行相同的功能)执行此会话功能可能引起的延迟相比较的评价。因而,例如,当命令和控制功能在本地/通过网络受到威胁被延迟时可以远程地/本地地执行来减小延迟。确实,可以掌握更长延迟的查询(例如,由于和后端功能相联合可以适应延迟如因特网或视听查询),可以在一个优化其资源或花费的系统上执行。
此外,万一网络连接暂时不可用或者缺乏网络资源,则所有可以被本地执行的功能将被执行。其他功能可以被细分成可以以延期的模式执行的功能(在以后连接时可被重新建立)以及不共存的功能。典型的例子是更新地址薄、经由口述或者大体上的口述应答e-mail或者消息。再者,最好应用程序可以决定是否该命令是本地的或者是延期的。也可以考虑一种带有延期模式的对等,在此一延期模式管理器和一本地引擎决定是否该功能是本地的或者是延期的。
再参看图3,如果对话管理器确定本地处理是合适的(肯定决定在步骤301),则对话管理器将分配必需的会话引擎给端口(步骤302)。一旦此会话引擎被分配给该端口,如果会话引擎当前没有被最初分配的端口所使用,则对话管理器可以分配那个引擎给其它端口(步骤303)(例如,当前说话者没有说话而只是在听)。当本地引擎再次被最初被分配的端口所需要时,如果此本地端口不可得到,另一个可用的引擎(本地或者在一个远程设备上)可以被利用(步骤304)。这种动态分配过程与传统的装载管理大不相同,在传统的装载管理中对话管理器决定并在功能调用的整个持续时间,分配会话引擎到每个端口。
不难看出管理和决定传输语音例如到一个网络服务器或者设备不但可以根据系统管理/装载平衡的水平(通过客户机或者服务器上的对话管理器),而且还根据网络的通信量。例如,如果一个连接(特别是在因特网上基于TCP/IP的网络连接)被认为是超载(步骤305),则基于通信量一个新的服务器或者设备被选择(步骤306)。这个决定可以在传统协议如VoIP协议(因特网上的声音协议)顶端作出,象RSVP协议(资源预订协议)一样,借此,当需要一个信道时,连接可以随同相关质量服务的合适预订一起建立。否则,如上所述远程处理将被执行(步骤307)并返回结果。
显然这里所描述的系统和方法可以实现以用于各种允许的语音和会话应用程序。本发明对于满足在嵌入的和流行的计算世界的不断增长的要求及NLU/NLG会话系统方面特别有用。然而,可以理解,本发明可以为不同的应用而被发展,并不局限于嵌入式系统。下面作为范例的实施例将说明本发明的优点。
例如,在智能电话上应用的商业可用嵌入式的姓名拨号器(例如,一个具有PDA(个人数字助理)能力的无线电话)是一个典型的应用。例如,假定客户设备100是一个具有姓名拨号器本地应用的智能电话。用户将在智能电话的电子地址薄中本地存储一个所希望的姓名和地址的表。然后用户可以发出一个命令如“dial first name last name(拨姓和名)at…可能的限定符(家庭、办公室、蜂窝式电话)”,通过对命令的识别/理解(经由本地会话引擎102),智能电话将自动地拨地址薄中与此人相关的电话号码(经由本地应用程序104)。另一方面,当发出的姓名不在地址薄中时(因而不被识别/理解),但该姓名存在于一个更大的公司的(或者公用的)目录中(如包含在远程服务器106中),则这个请求可以被(以特征或者波形)保存和传输给一个远程服务器106来识别。然后拨号可以直接被远程服务器执行或者依据通过远程服务器接收的合适信息,由智能电话来执行。作为替代,在第一个情况下用户可以被连接到一个远程的服务器并建立一个对话,或重新请求要拨号的姓名或者要求进一步的信息(在白页或者黄页服务类型的情况下)。
本发明的另一种有用的应用涉及个人信息系统,例如商业可用的PointCast(参见http://www.pointcast.com),这个PointCast允许用户根据预定的用户优先选择得到例如,股票报价、关于某一主题特定的信息和关于此主题最近公布的信息。应用根据本发明构成的个人信息系统,如果用户希望获得关于一股票(例如,IBM)或者一主题(例如,预报在Kent的绿豆生产)的信息,则用户可以发出一个语音请求给客户设备100。如果“IBM”是在本地词汇表(用户概况)中,则它将立即被解码,用户得到最新报价,例如获得最近一次更新(PointCast)。另一方面,如果用户关于“绿豆”的请求没有被本地客户设备100理解,则该请求被作为特征流自动向前传递给(内容提供者的)远程服务器106,在此处服务器可以投入更多的资源来解码这种请求和检索相关信息(无论如何都得做),然后传输这种信息给本地系统。如果远程服务器遵循一“推销方法(push approach)”,则上面这些在下次更新时就被完成(例如,PointCast)。
客户设备100也可以是一个允许使用语音的PVA(个人交通工具助理),用以提供如会话的汽车导航。例如,如果用户不想在此系统应用CD-ROM(因为缺乏空间、能源要求、重量、成本、防震等),用户可以决定存储有限的信息,例如有关用户当前所处位置、用户最近所处位置、及用户想出游的区域/位置的词表和地图。在这个例子中,无论何时,当用户的请求与本地词表或者地图设置不匹配时,这个请求可以被自动发送给一个远程服务器106,并解码(甚至以提示返回给用户去缩小范围查找)来获得路线、地图(或者更新地图)下载到车上。再者,这种操作(即使花费高代价的下载)对用户来说是基本上透明的,仅开始需要本地道路。
此外,一个NLU/FSG系统可以根据本发明被设计,以便如果用户的请求需要FSG(限定状态语法),则该请求可以被本地执行,除非该请求更复杂并处在蒙昧状态,因而需要向前传递给一个远程服务器来识别。
公司的姓名-拨号器服务器提供另一种有意思的特点。一个公司为它的雇员们保留一个电话号码的有效数据库。这个数据库总是最新的。用户可以选择周期性地来使它本地的信息与公司数据库保持同步。这是一个传统的概念。然而,当用户利用姓名拨号器并且需要经由TCP/IP连接到服务器上时,此同步可以实现(在语音识别期间),这样本地拨号信息总被更新。同样,当用户请求导航到一个新的其信息没有包括在用户本地地图中时,系统可以下载位于本地PVA上的声信息及用户所希望出游地区的一组导航信息。
图1的系统也可以用会话浏览器系统实现,会话浏览器系统描述于IBM Docket No.Y0998-392P,当前同此发明一起被递交,名称为“Conversational Browser and Conversational System”,并已被一般转让,引入此处以供参考,其中从一个内容提供器(服务器)传输(并被会话浏览器处理)的CML(会话标记语言)页在概念上类似用于可视显示的HTML(超文本标记语言)页,用来描述要被呈现给用户的一个会话UI。在这个例子中,会话浏览器可以是客户设备100的本地应用程序104和/或远程(IVR)服务器106中的服务器应用程序109。可由内容提供者或应用开发者(或者代理/代码转换器)来决定用户应该提供的一个给定的项目(例如,NLU,或FSG输入表单或经口授填充一空表单)必须在服务器106上被识别,而不是提供所有的数据给客户设备100在本地识别(因为此任务对于本地资源来说太复杂或者因为通过网络必须发送太多的信息)。其实现是通过例如在一个CML文件中提供一个URL(统一资源定位器)和标记来指示服务器其中一个处理将发生,或者通过在CML页中装载一小应用程序、一Active X组件或者一个捕捉音频的插件(或者它的任何变化),实现可能的一些会话功能,并为其它功能将其传输给其它设备(这是由页面的制作者作出的典型的决定)。这个决定可以自动地被代码转换器和注册机制实现,如在IBM Docket No.Y0998-392P中所描述,由此,浏览器向它取出CML页的服务器明白地描述它的能力。当代码转换器另考虑浏览器的能力并使内容适合这种能力时(这种能力即是所提到的会话代理),现在,根据浏览器的能力,代码转换器可以增加服务器的URL(s)以重定向服务器。在这种情况下,被客户设备100收集到的语音可以被作为一个波形(被压缩或者没有)或者作为特征流发送给远程服务器106或者识别发生其上的网络服务器110(或者NLU/NLG)。然后识别的结果可以被发送回客户设备100或者CML提供者服务器(远程服务器106)来决定操作的下一个行动或者进一步处理。再者如上面所述的,这可以被能直接结合资源/引擎/服务器的URL的应用程序决定或者要被用来识别一个给定的输入、菜单表格或者对话的本地设备决定。此外,本发明在一个CML页必须重放/合成声音或者文本对于本地设备102的本地会话引擎102来说太复杂的情况下是有用的。太复杂的部分可以作为特征流或者被压缩波形从任一个特定的服务器(这个服务器可能是或者不是提供CML页的服务器)上得到。而且,对于多语言系统,如果一个CML页包含一种不同的语种,则没有合适能力的本地客户设备100可以请求一个远程服务器来执行按那种语言的会话功能。
显然,会话协同可以在有会话浏览器的分布式应用中使用。例如,参看图4,一个分布式系统有一个表示服务器400、一个引擎服务器401、和一具有会话浏览器403的客户机402(如在上面参考Y0998-392P中所讨论)。该浏览器403从表示服务器400接收CML的页并处理此CML页。CML页可包含允许浏览器403决定发送语音到何处进行处理的信息。一个语音服务器位于引擎服务器401上。假定CML页需要引擎服务器401处理语音,浏览器403可以经由HTTP(或者套接口或者RMI)与语音服务器联系(传输呼叫)来传输音频给语音服务器并发送合适的数据文件指令和引擎呼叫。客户浏览器403被假定有一些本地处理能力来执行语音处理(经由语音API和语音识别引擎405)。如上面提到,在本地语音处理和服务器端语音处理之间的转移由从表示服务器400接收到的CML页决定。这个决定可以被内容提供者编码或者适应此设备(客户402决定它能执行此任务并把该任务发送给一个已知的或者已被发现的服务器或者代码转换器)。
在图5,浏览器403位于被客户402存取的一个浏览器服务器404上(浏览器服务器404在客户机402和表示服务器400之间充当中介物)。再者,浏览器403决定是否执行本地或者服务器端的处理。如此处所描述,音频是可以利用会话编码被传输。
本发明允许一个低资源客户设备透明地本地执行简单的任务,又以二进制或模拟方式与一具有更复杂会话能力的服务器(或者别的设备)相连接透明地执行复杂任务。服务器端功能(例如语音识别)可以通过一个常规的IP网络或者LAN网络实现,也可以经由传统的电话线路或包交换网络数字传输,或者在无线网络上经由任何传统的无线数据协议而能被执行。模拟/数字连接实施例描述了至少两个方案。第一,它是等价于一个调制解调器实现的二进制连接,所有的功能是数据传递功能。此外,当一个服务器/分布式资源被包含时,系统可以调用一个电话服务器作为资源,且声音在网络上被发送(而不是波形数据或者它的变换如cepstra)。这一方案的一个例子是与本地语音识别功能(姓名拨号器和数字拨号器)的一无线连接,此本地语音识别功能经由一个常规的无线连接到一个具有IVR的电话服务器上来获取其它功能,象语音浏览因特网、获得股票/共有基金报价和通过语音执行金融交易。这种机制在今天可以利用现存的蜂窝式电话设备上装备一些语音识别能力。
另外,不同的机制可以被利用来管理流通量和在网络上分布的资源,以保证在网络上保持一合适的会话交互的对话流。这种机制包括:会话协议(如上面所讨论的),音频:RecoVC(识别兼容VoCoder)(允许回放重建带有音调的编码协议),应用程序和元信息:分布式应用协议、发现、注册、协商、维护对话流的服务器负载管理、保持对话流的流量平衡和路由选择、基于任务特征和能力需求及会话主目可用性(数据文件)的引擎服务器选择,会话主目分配:存储、流量/路由选择和高速缓存。
尽管图例的实施例在此参考附图已被描述,应该理解,本系统和方法并不局限于这些刻板的实施例,在不脱离本发明的范围和精神的情况下,技术上熟悉的人可作各种不同的改变和修改。所有这种改变和修该都应包括在所附加的权利要求书中定义的范围内。

Claims (24)

1.一种提供会话资源的自动和协同共享的系统,包括:
一个包含至少第一和第二网络设备的网络;
第一和第二网络设备每个包括:
一组会话资源;
一对话管理器,用于管理会话,并执行调用请求会话服务;及
一通信栈,使用会话协议在网络上传递消息,其中使用会话协议传递的消息,在第一和第二网络设备的对话管理器之间建立协同的网络通信,以自动地共享第一和第二网络设备的一组会话资源,当需要时,执行它们各自请求的会话服务。
2.按照权利要求1的系统,其中所说的第一和第二网络设备的一组会话资源集包括至少一语音识别引擎、一说话者识别引擎、一文本到语音合成TTS引擎、一自然语言理解NLU引擎、一自然语言产生NLG引擎、一个音频捕捉和压缩/解压缩引擎、一主题识别引擎、一音频/多媒体索引和查找引擎,及其组合。
3.按照权利要求1的系统,其中的会话协议包括协同协议,以允许第一和第二设备的对话管理器交换包括它们各自的会话状态、主目和上下文信息,并交换对话组件。
4.按照权利要求3的系统,其中的协同协议用于对第一和第二设备对话管理器之间的主/从和对等网络通信中的一个进行协同。
5.按照权利要求1的系统,其中的会话协议包括发现协议,以允许第一和第二设备发现明白会话的设备及在网络上的应用程序。
6.按照权利要求5的系统,其中的发现协议实现“广播和收听”方法。
7.按照权利要求6的系统,其中的发现协议被执行,以至少在第一和第二网络设备之间建立动态和自发的网络。
8.按照权利要求1的系统,其中的会话协议包括注册协议,用来交换关于会话资源、能力、及需求的信息。
9.按照权利要求8的系统,其中的会话协议包括协商协议,用来交换信息,以便根据它们各自会话资源、能力,在第一和第二网络设备之间建立一网络配置。
10.按照权利要求9的系统,其中的网络配置包括主/从网络和对等网络之一。主/从网络中第一和第二设备之一的对话管理器控制第一和第二设备两者的会话资源;对等网络中第一和第二设备的对话管理器协商以控制会话资源。
11.按照权利要求1的系统,其中的会话协议包括语音传输协议,用于在第一和第二设备之间传送压缩的语音波形,压缩的语音特征及压缩的结果之一。
12.一种提供会话资源的自动和协同共享的系统,包括:
一客户机,包含本地会话资源和一对话管理器,其中对话管理器用以管理本地会话资源、处理对会话服务请求,并判断是否会话服务请求能用本地会话资源实现;及
一包含服务器会话资源的服务器,其中如果使用本地会话资源不能执行请求的会话服务,客户机的对话管理器将自动地访问服务器,以便用服务器的会话资源处理。
13.一种在网络设备之间提供会话资源的自动和协同共享的方法,包括以下步骤:
由第一网络设备接收一会话服务请求;
由第一网络设备判断用哪一种方式:是本地地用第一网络设备的会话资源,还是远程地用至少一第二网络设备的会话资源,或是本地地和远程地用本地和远程两者的会话资源去处理请求的会话服务;及
如果判断出会话服务至少一部分要远程地使用至少第二网络设备的会话资源而被处理,则自动地同至少第二网络设备通信。
14.按照权利要求13的方法,还包括使用会话协议传输消息的步骤,以便在第一和至少第二网络设备之间建立协同的网络通信,以共享会话资源。
15.按照权利要求13的方法,其中的判断步骤包括判断是否本地的会话资源可用来处理请求的会话服务的步骤;并且该方法还包括步骤:
如果判断本地的会话资源可用来处理请求的会话服务,使用本地的会话资源执行此请求的会话服务;
判断本地处理的结果是否可接受;及
如果确定本地处理的结果不可接受,自动地访问至少第二网络设备,远程地处理此请求的会话服务。
16.按照权利要求13的方法,其中的判断步骤包括判断是否至少第二网络设备是由第一网络设备预先指派,以处理会话服务的步骤。
17.按照权利要求13的方法,其中的判断步骤是基于网络连接的可用性和第一网络设备和至少第二网络设备之间的网络通信量之一。
18.按照权利要求14的方法,其中的自动访问步骤包括以下步骤:
通过使用会话协议传输的消息,同至少第二网络设备自动建立网络连结;及
把压缩的语音特征和压缩的波形之一传送到至少第二网络设备。
19.一种通过机器可读的程序存储器设备,确实地嵌入机器可执行的程序指令,以实现在网络设备之间提供会话资源的自动和协同共享的方法步骤,这些方法步骤包括:
由第一网络设备接收一会话服务请求;
由第一网络设备判断用哪一种方式:是本地地用第一网络设备的会话资源,还是远程地用至少一第二网络设备的会话资源,或是本地地和远程地用本地和远程两者的会话资源去处理请求的会话服务;及
如果判断出会话服务至少一部分要远程地使用至少第二网络设备的会话资源而被处理,自动地同至少第二网络设备通信。
20.按照权利要求19的程序存储器设备,还包括按会话协议执行传输消息的步骤的指令,以便在第一和至少第二网络设备之间建立协同的网络通信,共享会话资源。
21.按照权利要求19的程序存储器设备,其中执行判断步骤的指令包括执行判断是否本地的会话资源可用来处理请求的会话服务的指令;并且其中程序存储器设备还包括执行下面步骤的指令:
如果判断本地的会话资源可用来处理请求的会话服务,使用本地的会话资源执行此请求的会话服务;
判断本地处理的结果是否可接受;及
如果确定本地处理的结果不可接受,自动地访问至少第二网络设备,远程地处理此请求的会话服务。
22.按照权利要求19的方法,其中执行判断步骤的指令包括执行判断是否至少第二网络设备是由第一网络设备预先指派,以处理此会话服务的步骤的指令。
23.按照权利要求19的程序存储器设备,其中的判断步骤是基于网络连接的可用性和第一网络设备和至少第二网络设备之间的网络通信量之一。
24.按照权利要求20的方法,其中执行自动访问步骤的指令包括执行下列步骤的指令:
使用会话协议,同至少第二网络设备自动建立网络连结;及
把压缩的语音特征和压缩的波形之一传送到至少第二网络设备。
CNB998116165A 1998-10-02 1999-10-01 提供网络协同会话服务的系统和方法 Expired - Lifetime CN1160700C (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US10295798P 1998-10-02 1998-10-02
US60/102,957 1998-10-02
US11759599P 1999-01-27 1999-01-27
US60/117,595 1999-01-27

Publications (2)

Publication Number Publication Date
CN1323435A true CN1323435A (zh) 2001-11-21
CN1160700C CN1160700C (zh) 2004-08-04

Family

ID=26799924

Family Applications (4)

Application Number Title Priority Date Filing Date
CNB998116181A Expired - Fee Related CN1171200C (zh) 1998-10-02 1999-10-01 通过会话虚拟机进行会话式计算的系统和方法
CNB998116076A Expired - Lifetime CN100472500C (zh) 1998-10-02 1999-10-01 会话浏览器和会话系统
CNB998116173A Expired - Lifetime CN1151488C (zh) 1998-10-02 1999-10-01 通过一般分层对象进行有效语音导航的结构框架
CNB998116165A Expired - Lifetime CN1160700C (zh) 1998-10-02 1999-10-01 提供网络协同会话服务的系统和方法

Family Applications Before (3)

Application Number Title Priority Date Filing Date
CNB998116181A Expired - Fee Related CN1171200C (zh) 1998-10-02 1999-10-01 通过会话虚拟机进行会话式计算的系统和方法
CNB998116076A Expired - Lifetime CN100472500C (zh) 1998-10-02 1999-10-01 会话浏览器和会话系统
CNB998116173A Expired - Lifetime CN1151488C (zh) 1998-10-02 1999-10-01 通过一般分层对象进行有效语音导航的结构框架

Country Status (10)

Country Link
US (4) US7137126B1 (zh)
EP (4) EP1125279B1 (zh)
JP (4) JP2002527800A (zh)
KR (4) KR100431972B1 (zh)
CN (4) CN1171200C (zh)
AT (1) ATE383640T1 (zh)
CA (4) CA2345665C (zh)
DE (1) DE69937962T2 (zh)
IL (5) IL140805A0 (zh)
WO (4) WO2000021232A2 (zh)

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103117058A (zh) * 2012-12-20 2013-05-22 四川长虹电器股份有限公司 基于智能电视平台的多语音引擎切换系统及方法
CN103295575A (zh) * 2012-02-27 2013-09-11 北京三星通信技术研究有限公司 一种语音识别方法和客户端
CN104598257A (zh) * 2013-10-30 2015-05-06 华为技术有限公司 远程应用程序运行的方法和装置
CN105551488A (zh) * 2015-12-15 2016-05-04 深圳Tcl数字技术有限公司 语音控制方法及系统
CN103457810B (zh) * 2006-07-07 2017-04-12 三星电子株式会社 在网络环境中提供服务菜单和服务的方法及服务提供设备
CN106663428A (zh) * 2014-07-16 2017-05-10 索尼公司 装置、方法、非暂态计算机可读介质及系统
CN107319857A (zh) * 2017-06-30 2017-11-07 广东工业大学 一种交互镜子及具有该交互镜子的智能家电系统
CN108475503A (zh) * 2015-10-15 2018-08-31 交互智能集团有限公司 用于多语言通信排序的系统和方法
CN108629041A (zh) * 2012-07-19 2018-10-09 格兰斯电讯网络有限公司 协同浏览与其他形式的信息共享的集成
CN110021291A (zh) * 2018-12-26 2019-07-16 阿里巴巴集团控股有限公司 一种语音合成文件的调用方法及装置
CN110073326A (zh) * 2016-10-19 2019-07-30 搜诺思公司 基于仲裁的语音识别
CN112217905A (zh) * 2014-09-19 2021-01-12 康维达无线有限责任公司 服务层会话迁移和共享
CN113035226A (zh) * 2019-12-24 2021-06-25 中兴通讯股份有限公司 语音通话方法、通信终端和计算机可读介质
US11343614B2 (en) 2018-01-31 2022-05-24 Sonos, Inc. Device designation of playback and network microphone device arrangements
US11405430B2 (en) 2016-02-22 2022-08-02 Sonos, Inc. Networked microphone device control
US11482978B2 (en) 2018-08-28 2022-10-25 Sonos, Inc. Audio notifications
US11482224B2 (en) 2020-05-20 2022-10-25 Sonos, Inc. Command keywords with input detection windowing
US11500611B2 (en) 2017-09-08 2022-11-15 Sonos, Inc. Dynamic computation of system response volume
US11501773B2 (en) 2019-06-12 2022-11-15 Sonos, Inc. Network microphone device with command keyword conditioning
US11514898B2 (en) 2016-02-22 2022-11-29 Sonos, Inc. Voice control of a media playback system
US11531520B2 (en) 2016-08-05 2022-12-20 Sonos, Inc. Playback device supporting concurrent voice assistants
US11538460B2 (en) 2018-12-13 2022-12-27 Sonos, Inc. Networked microphone devices, systems, and methods of localized arbitration
US11538451B2 (en) 2017-09-28 2022-12-27 Sonos, Inc. Multi-channel acoustic echo cancellation
US11540047B2 (en) 2018-12-20 2022-12-27 Sonos, Inc. Optimization of network microphone devices using noise classification
US11545169B2 (en) 2016-06-09 2023-01-03 Sonos, Inc. Dynamic player selection for audio signal processing
US11557294B2 (en) 2018-12-07 2023-01-17 Sonos, Inc. Systems and methods of operating media playback systems having multiple voice assistant services
US11562740B2 (en) 2020-01-07 2023-01-24 Sonos, Inc. Voice verification for media playback
US11563842B2 (en) 2018-08-28 2023-01-24 Sonos, Inc. Do not disturb feature for audio notifications
US11641559B2 (en) 2016-09-27 2023-05-02 Sonos, Inc. Audio playback settings for voice interaction
US11646023B2 (en) 2019-02-08 2023-05-09 Sonos, Inc. Devices, systems, and methods for distributed voice processing
US11646045B2 (en) 2017-09-27 2023-05-09 Sonos, Inc. Robust short-time fourier transform acoustic echo cancellation during audio playback
US11696074B2 (en) 2018-06-28 2023-07-04 Sonos, Inc. Systems and methods for associating playback devices with voice assistant services
US11698771B2 (en) 2020-08-25 2023-07-11 Sonos, Inc. Vocal guidance engines for playback devices
US11710487B2 (en) 2019-07-31 2023-07-25 Sonos, Inc. Locally distributed keyword detection
US11714600B2 (en) 2019-07-31 2023-08-01 Sonos, Inc. Noise classification for event detection
US11736860B2 (en) 2016-02-22 2023-08-22 Sonos, Inc. Voice control of a media playback system
US11741948B2 (en) 2018-11-15 2023-08-29 Sonos Vox France Sas Dilated convolutions and gating for efficient keyword spotting
US11769505B2 (en) 2017-09-28 2023-09-26 Sonos, Inc. Echo of tone interferance cancellation using two acoustic echo cancellers
US11778259B2 (en) 2018-09-14 2023-10-03 Sonos, Inc. Networked devices, systems and methods for associating playback devices based on sound codes
US11792590B2 (en) 2018-05-25 2023-10-17 Sonos, Inc. Determining and adapting to changes in microphone performance of playback devices
US11790911B2 (en) 2018-09-28 2023-10-17 Sonos, Inc. Systems and methods for selective wake word detection using neural network models
US11790937B2 (en) 2018-09-21 2023-10-17 Sonos, Inc. Voice detection optimization using sound metadata
US11798553B2 (en) 2019-05-03 2023-10-24 Sonos, Inc. Voice assistant persistence across multiple network microphone devices
US11797263B2 (en) 2018-05-10 2023-10-24 Sonos, Inc. Systems and methods for voice-assisted media content selection
US11854547B2 (en) 2019-06-12 2023-12-26 Sonos, Inc. Network microphone device with command keyword eventing
US11862161B2 (en) 2019-10-22 2024-01-02 Sonos, Inc. VAS toggle based on device orientation
US11869503B2 (en) 2019-12-20 2024-01-09 Sonos, Inc. Offline voice control
US11893308B2 (en) 2017-09-29 2024-02-06 Sonos, Inc. Media playback system with concurrent voice assistance
US11900937B2 (en) 2017-08-07 2024-02-13 Sonos, Inc. Wake-word detection suppression
US11899519B2 (en) 2018-10-23 2024-02-13 Sonos, Inc. Multiple stage network microphone device with reduced power consumption and processing load

Families Citing this family (635)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7003463B1 (en) 1998-10-02 2006-02-21 International Business Machines Corporation System and method for providing network coordinated conversational services
IL140805A0 (en) * 1998-10-02 2002-02-10 Ibm Structure skeletons for efficient voice navigation through generic hierarchical objects
US6163794A (en) * 1998-10-23 2000-12-19 General Magic Network system extensible by users
US6886047B2 (en) * 1998-11-13 2005-04-26 Jp Morgan Chase Bank System and method for managing information retrievals for integrated digital and analog archives on a global basis
FR2787902B1 (fr) * 1998-12-23 2004-07-30 France Telecom Modele et procede d'implementation d'un agent rationnel dialoguant, serveur et systeme multi-agent pour la mise en oeuvre
WO2001013255A2 (en) 1999-08-13 2001-02-22 Pixo, Inc. Displaying and traversing links in character array
US7330815B1 (en) * 1999-10-04 2008-02-12 Globalenglish Corporation Method and system for network-based speech recognition
US9076448B2 (en) 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US6665640B1 (en) 1999-11-12 2003-12-16 Phoenix Solutions, Inc. Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US6633846B1 (en) 1999-11-12 2003-10-14 Phoenix Solutions, Inc. Distributed realtime speech recognition system
EP1107522B1 (en) 1999-12-06 2010-06-16 Telefonaktiebolaget LM Ericsson (publ) Intelligent piconet forming
JP2001188784A (ja) * 1999-12-28 2001-07-10 Sony Corp 会話処理装置および方法、並びに記録媒体
US20010043234A1 (en) * 2000-01-03 2001-11-22 Mallik Kotamarti Incorporating non-native user interface mechanisms into a user interface
WO2001059999A1 (en) * 2000-02-11 2001-08-16 Convergent Networks, Inc. Service level executable environment for integrated pstn and ip networks and call processing language therefor
WO2003050799A1 (en) * 2001-12-12 2003-06-19 International Business Machines Corporation Method and system for non-intrusive speaker verification using behavior models
US8645137B2 (en) 2000-03-16 2014-02-04 Apple Inc. Fast, language-independent method for user authentication by voice
AU2001256902A1 (en) * 2000-05-05 2001-11-20 James Ewing A method and a system relating to protocol communications within a pre-existing www server framework
US6769010B1 (en) * 2000-05-11 2004-07-27 Howzone.Com Inc. Apparatus for distributing information over a network-based environment, method of distributing information to users, and method for associating content objects with a database wherein the content objects are accessible over a network communication medium by a user
FR2809204B1 (fr) * 2000-05-17 2003-09-19 Bull Sa Interface applicative multiprosseur, ne necessitant pas l'utilisation d'un systeme d'exploitation multiprocesseur
US7039951B1 (en) 2000-06-06 2006-05-02 International Business Machines Corporation System and method for confidence based incremental access authentication
US7047196B2 (en) 2000-06-08 2006-05-16 Agiletv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US6604050B2 (en) * 2000-06-16 2003-08-05 Bayer Corporation System, method and biosensor apparatus for data communications with a personal data assistant
US6654722B1 (en) * 2000-06-19 2003-11-25 International Business Machines Corporation Voice over IP protocol based speech system
US6788667B1 (en) * 2000-06-20 2004-09-07 Nokia Corporation Wireless access to wired network audio services using mobile voice call
WO2002005153A2 (en) * 2000-07-07 2002-01-17 2020Me Holdings Ltd. System, method and medium for facilitating transactions over a network
US7139709B2 (en) 2000-07-20 2006-11-21 Microsoft Corporation Middleware layer between speech related applications and engines
DE60125597T2 (de) * 2000-08-31 2007-05-03 Hitachi, Ltd. Vorrichtung für die Dienstleistungsvermittlung
US6785651B1 (en) * 2000-09-14 2004-08-31 Microsoft Corporation Method and apparatus for performing plan-based dialog
US20020059072A1 (en) * 2000-10-16 2002-05-16 Nasreen Quibria Method of and system for providing adaptive respondent training in a speech recognition application
US7072982B2 (en) 2000-11-22 2006-07-04 Microsoft Corporation Universal naming scheme for peer to peer resources
US7594030B2 (en) 2000-11-22 2009-09-22 Microsoft Corporation Locator and tracking service for peer to peer resources
EP1211588B1 (de) * 2000-12-04 2005-09-21 Siemens Aktiengesellschaft Verfahren zum Nutzen einer Datenverarbeitungsanlage abhängig von einer Berechtigung, zugehörige Datenverarbeitungsanlage und zugehöriges Programm
US7016847B1 (en) * 2000-12-08 2006-03-21 Ben Franklin Patent Holdings L.L.C. Open architecture for a voice user interface
GB0030958D0 (en) * 2000-12-19 2001-01-31 Smart Card Solutions Ltd Compartmentalized micro-controller operating system architecture
DE10064661A1 (de) * 2000-12-22 2002-07-11 Siemens Ag Kommunikationsanordnung und Verfahren für Kommunikationssysteme mit interaktiver Sprachfunktion
JP3322262B2 (ja) * 2000-12-22 2002-09-09 日本電気株式会社 無線携帯端末通信システム
FI20002860A (fi) * 2000-12-27 2002-06-28 Nokia Corp Laiteroolit ja pikoverkkoyhteydet
US6925645B2 (en) * 2000-12-29 2005-08-02 Webex Communications, Inc. Fault tolerant server architecture for collaborative computing
US6964023B2 (en) 2001-02-05 2005-11-08 International Business Machines Corporation System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input
US7461148B1 (en) * 2001-02-16 2008-12-02 Swsoft Holdings, Ltd. Virtual private server with isolation of system components
EP1233590A1 (en) 2001-02-19 2002-08-21 Sun Microsystems, Inc. Content provider for a computer system
US7606909B1 (en) * 2001-02-20 2009-10-20 Michael Ely Method and apparatus for a business contact center
GB2372864B (en) * 2001-02-28 2005-09-07 Vox Generation Ltd Spoken language interface
CN1291307C (zh) 2001-03-22 2006-12-20 佳能株式会社 信息处理装置和方法
US7174514B2 (en) * 2001-03-28 2007-02-06 Siebel Systems, Inc. Engine to present a user interface based on a logical structure, such as one for a customer relationship management system, across a web site
US6996531B2 (en) * 2001-03-30 2006-02-07 Comverse Ltd. Automated database assistance using a telephone for a speech based or text based multimedia communication mode
FR2822994B1 (fr) * 2001-03-30 2004-05-21 Bouygues Telecom Sa Assistance au conducteur d'un vehicule automobile
CA2344904A1 (en) * 2001-04-23 2002-10-23 Bruno Richard Preiss Interactive voice response system and method
US7698228B2 (en) 2001-04-27 2010-04-13 Accenture Llp Tracking purchases in a location-based services system
US7437295B2 (en) 2001-04-27 2008-10-14 Accenture Llp Natural language processing for a location-based services system
US7610547B2 (en) 2001-05-04 2009-10-27 Microsoft Corporation Markup language extensions for web enabled recognition
US7020841B2 (en) 2001-06-07 2006-03-28 International Business Machines Corporation System and method for generating and presenting multi-modal applications from intent-based markup scripts
US7366673B2 (en) * 2001-06-15 2008-04-29 International Business Machines Corporation Selective enablement of speech recognition grammars
US7647597B2 (en) * 2001-06-28 2010-01-12 Microsoft Corporation Transparent and sub-classable proxies
US7606712B1 (en) * 2001-06-28 2009-10-20 At&T Intellectual Property Ii, L.P. Speech recognition interface for voice actuation of legacy systems
US6839896B2 (en) 2001-06-29 2005-01-04 International Business Machines Corporation System and method for providing dialog management and arbitration in a multi-modal environment
JP3997459B2 (ja) * 2001-10-02 2007-10-24 株式会社日立製作所 音声入力システムおよび音声ポータルサーバおよび音声入力端末
US7441016B2 (en) 2001-10-03 2008-10-21 Accenture Global Services Gmbh Service authorizer
US7324947B2 (en) 2001-10-03 2008-01-29 Promptu Systems Corporation Global speech user interface
US7472091B2 (en) 2001-10-03 2008-12-30 Accenture Global Services Gmbh Virtual customer database
EP1708470B1 (en) * 2001-10-03 2012-05-16 Accenture Global Services Limited Multi-modal callback system
US7233655B2 (en) 2001-10-03 2007-06-19 Accenture Global Services Gmbh Multi-modal callback
EP2378436B1 (en) * 2001-10-03 2013-03-13 Accenture Global Services Limited Virtual customer database
ITFI20010199A1 (it) 2001-10-22 2003-04-22 Riccardo Vieri Sistema e metodo per trasformare in voce comunicazioni testuali ed inviarle con una connessione internet a qualsiasi apparato telefonico
US7222073B2 (en) 2001-10-24 2007-05-22 Agiletv Corporation System and method for speech activated navigation
US7133830B1 (en) 2001-11-13 2006-11-07 Sr2, Inc. System and method for supporting platform independent speech applications
US20030101054A1 (en) * 2001-11-27 2003-05-29 Ncc, Llc Integrated system and method for electronic speech recognition and transcription
US8498871B2 (en) * 2001-11-27 2013-07-30 Advanced Voice Recognition Systems, Inc. Dynamic speech recognition and transcription among users having heterogeneous protocols
GB2388209C (en) 2001-12-20 2005-08-23 Canon Kk Control apparatus
US7302394B1 (en) * 2001-12-20 2007-11-27 Ianywhere Solutions, Inc. Front-end device independence for natural interaction platform
WO2003063137A1 (en) * 2002-01-22 2003-07-31 V-Enable, Inc. Multi-modal information delivery system
US6990639B2 (en) 2002-02-07 2006-01-24 Microsoft Corporation System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration
US6807529B2 (en) * 2002-02-27 2004-10-19 Motorola, Inc. System and method for concurrent multimodal communication
US6704396B2 (en) 2002-02-27 2004-03-09 Sbc Technology Resources, Inc. Multi-modal communications method
US20030187944A1 (en) * 2002-02-27 2003-10-02 Greg Johnson System and method for concurrent multimodal communication using concurrent multimodal tags
GB0204768D0 (en) * 2002-02-28 2002-04-17 Mitel Knowledge Corp Bimodal feature access for web applications
FR2840484B1 (fr) * 2002-05-28 2004-09-03 France Telecom Protocole de communication entre un module d'application vocale et une plate-forme vocale dans un serveur vocal
US7398209B2 (en) 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7822609B2 (en) 2002-06-14 2010-10-26 Nuance Communications, Inc. Voice browser with integrated TCAP and ISUP interfaces
KR20040002031A (ko) * 2002-06-29 2004-01-07 주식회사 케이티 다수의 응용프로그램들간의 음성인식엔진 공유 방법
US7693720B2 (en) * 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
AU2002345308A1 (en) * 2002-07-17 2004-02-02 Nokia Corporation Mobile device having voice user interface, and a methode for testing the compatibility of an application with the mobile device
US20040054757A1 (en) * 2002-09-14 2004-03-18 Akinobu Ueda System for remote control of computer resources from embedded handheld devices
US8321427B2 (en) 2002-10-31 2012-11-27 Promptu Systems Corporation Method and apparatus for generation and augmentation of search terms from external and internal sources
US7519534B2 (en) 2002-10-31 2009-04-14 Agiletv Corporation Speech controlled access to content on a presentation medium
CN1720520B (zh) * 2002-12-02 2010-05-05 索尼株式会社 对话控制设备和方法,以及机器人设备
KR20040051285A (ko) * 2002-12-12 2004-06-18 삼성전자주식회사 애플릿이 링크된 마크업 문서의 디스플레이 방법 및 그시스템
US7822612B1 (en) * 2003-01-03 2010-10-26 Verizon Laboratories Inc. Methods of processing a voice command from a caller
US7673304B2 (en) * 2003-02-18 2010-03-02 Microsoft Corporation Multithreaded kernel for graphics processing unit
US7669134B1 (en) 2003-05-02 2010-02-23 Apple Inc. Method and apparatus for displaying information during an instant messaging session
US7249025B2 (en) * 2003-05-09 2007-07-24 Matsushita Electric Industrial Co., Ltd. Portable device for enhanced security and accessibility
US9202467B2 (en) * 2003-06-06 2015-12-01 The Trustees Of Columbia University In The City Of New York System and method for voice activating web pages
WO2005004334A2 (en) 2003-06-26 2005-01-13 Agile Tv Corporation Zero-search, zero-memory vector quantization
US7313528B1 (en) * 2003-07-31 2007-12-25 Sprint Communications Company L.P. Distributed network based message processing system for text-to-speech streaming data
US7383302B2 (en) * 2003-09-15 2008-06-03 International Business Machines Corporation Method and system for providing a common collaboration framework accessible from within multiple applications
US7428273B2 (en) 2003-09-18 2008-09-23 Promptu Systems Corporation Method and apparatus for efficient preamble detection in digital data receivers
US7249348B2 (en) * 2003-10-21 2007-07-24 Hewlett-Packard Development Company, L.P. Non-interfering status inquiry for user threads
US7199802B2 (en) * 2003-10-24 2007-04-03 Microsoft Corporation Multiple-mode window presentation system and process
US7856632B2 (en) * 2004-01-29 2010-12-21 Klingman Edwin E iMEM ASCII architecture for executing system operators and processing data operators
US20050229185A1 (en) * 2004-02-20 2005-10-13 Stoops Daniel S Method and system for navigating applications
US7904059B2 (en) * 2004-02-25 2011-03-08 Sk Telecom Co., Ltd. Method and mobile terminal for implementing vector animation interactive service on mobile phone browser
US7925510B2 (en) * 2004-04-28 2011-04-12 Nuance Communications, Inc. Componentized voice server with selectable internal and external speech detectors
US7596788B1 (en) * 2004-05-11 2009-09-29 Platform Computing Corporation Support of non-trivial scheduling policies along with topological properties
CN101031882B (zh) * 2004-06-08 2010-09-08 达尔特设备互操作有限公司 用于通用设备互操作性平台的设备组招募和内容再现的体系结构、装置和方法
US7814502B2 (en) * 2004-06-14 2010-10-12 Nokia Corporation Shared landmark user interfaces
KR100627228B1 (ko) * 2004-07-01 2006-09-25 에스케이 텔레콤주식회사 기업용 무선 어플리케이션 서비스를 위한 전용 브라우저시스템 및 그 운영방법
US7398471B1 (en) * 2004-07-29 2008-07-08 Emc Corporation System and method for the administration of resource groups
FI20041075A0 (fi) * 2004-08-12 2004-08-12 Nokia Corp Tiedon lähettäminen ryhmälle vastaanottolaitteita
US8744852B1 (en) * 2004-10-01 2014-06-03 Apple Inc. Spoken interfaces
US7693719B2 (en) * 2004-10-29 2010-04-06 Microsoft Corporation Providing personalized voice font for text-to-speech applications
US8706501B2 (en) * 2004-12-09 2014-04-22 Nuance Communications, Inc. Method and system for sharing speech processing resources over a communication network
US7640162B2 (en) * 2004-12-14 2009-12-29 Microsoft Corporation Semantic canvas
KR100654447B1 (ko) * 2004-12-15 2006-12-06 삼성전자주식회사 지역별로 존재하는 컨텐츠를 글로벌로 공유하고 거래하는방법 및 시스템
US7987244B1 (en) * 2004-12-30 2011-07-26 At&T Intellectual Property Ii, L.P. Network repository for voice fonts
KR100725719B1 (ko) * 2005-02-04 2007-06-08 주식회사 유진로봇 음성과 태스크 오브젝트 기반의 대화시스템
US7490042B2 (en) * 2005-03-29 2009-02-10 International Business Machines Corporation Methods and apparatus for adapting output speech in accordance with context of communication
US9111253B2 (en) * 2005-04-22 2015-08-18 Sap Se Groupware time tracking
US20060271520A1 (en) * 2005-05-27 2006-11-30 Ragan Gene Z Content-based implicit search query
US7689253B2 (en) 2005-06-13 2010-03-30 E-Lane Systems, Inc. Vehicle immersive communication system
US8090084B2 (en) * 2005-06-30 2012-01-03 At&T Intellectual Property Ii, L.P. Automated call router for business directory using the world wide web
JP4667138B2 (ja) * 2005-06-30 2011-04-06 キヤノン株式会社 音声認識方法及び音声認識装置
US7640160B2 (en) * 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7620549B2 (en) * 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US7634409B2 (en) 2005-08-31 2009-12-15 Voicebox Technologies, Inc. Dynamic speech sharpening
US8677377B2 (en) 2005-09-08 2014-03-18 Apple Inc. Method and apparatus for building an intelligent automated assistant
US8073700B2 (en) * 2005-09-12 2011-12-06 Nuance Communications, Inc. Retrieval and presentation of network service results for mobile device using a multimodal browser
US9009046B1 (en) * 2005-09-27 2015-04-14 At&T Intellectual Property Ii, L.P. System and method for disambiguating multiple intents in a natural language dialog system
US7633076B2 (en) 2005-09-30 2009-12-15 Apple Inc. Automated response to and sensing of user activity in portable devices
US7708196B2 (en) * 2005-10-05 2010-05-04 Dun And Bradstreet Corporation Modular web-based ASP application for multiple products
US8041800B2 (en) * 2005-11-08 2011-10-18 International Business Machines Corporation Automatic orchestration of dynamic multiple party, multiple media communications
US20070124142A1 (en) * 2005-11-25 2007-05-31 Mukherjee Santosh K Voice enabled knowledge system
JP4829600B2 (ja) * 2005-11-28 2011-12-07 キヤノン株式会社 通信装置及び通信装置の制御方法
US9215748B2 (en) 2005-11-28 2015-12-15 Canon Kabushiki Kaisha Communication apparatus, control method for communication apparatus, and communication system
JP2007164732A (ja) * 2005-12-16 2007-06-28 Crescent:Kk コンピュータ実行可能なプログラム、および情報処理装置
DE102005061365A1 (de) * 2005-12-21 2007-06-28 Siemens Ag Verfahren zur Ansteuerung zumindest einer ersten und zweiten Hintergrundapplikation über ein universelles Sprachdialogsystem
CN100382083C (zh) * 2005-12-23 2008-04-16 威盛电子股份有限公司 网络会话管理方法及系统
US7840964B2 (en) * 2005-12-30 2010-11-23 Intel Corporation Mechanism to transition control between components in a virtual machine environment
KR101204408B1 (ko) * 2006-01-19 2012-11-27 삼성전자주식회사 유니폼 객체 모델을 이용한 유비쿼터스 서비스/디바이스테스트 방법 및 장치
US20070180365A1 (en) * 2006-01-27 2007-08-02 Ashok Mitter Khosla Automated process and system for converting a flowchart into a speech mark-up language
US8078990B2 (en) 2006-02-01 2011-12-13 Research In Motion Limited Secure device sharing
US20070225983A1 (en) * 2006-03-23 2007-09-27 Theodore Maringo Worldwide time device
KR100807307B1 (ko) * 2006-07-10 2008-02-28 한국전자통신연구원 대화형 음성 인터페이스 시스템 및 그 응답 방법
US9976865B2 (en) 2006-07-28 2018-05-22 Ridetones, Inc. Vehicle communication system with navigation
US20100030557A1 (en) 2006-07-31 2010-02-04 Stephen Molloy Voice and text communication system, method and apparatus
US8589869B2 (en) 2006-09-07 2013-11-19 Wolfram Alpha Llc Methods and systems for determining a formula
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US7957976B2 (en) 2006-09-12 2011-06-07 Nuance Communications, Inc. Establishing a multimodal advertising personality for a sponsor of a multimodal application
KR100832534B1 (ko) * 2006-09-28 2008-05-27 한국전자통신연구원 음성 인식을 통한 컨텐츠 정보 서비스 제공 장치 및 그방법
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US10803474B2 (en) 2006-11-22 2020-10-13 Qualtrics, Llc System for creating and distributing interactive advertisements to mobile devices
US8700014B2 (en) 2006-11-22 2014-04-15 Bindu Rama Rao Audio guided system for providing guidance to user of mobile device on multi-step activities
US8478250B2 (en) 2007-07-30 2013-07-02 Bindu Rama Rao Interactive media management server
US11256386B2 (en) 2006-11-22 2022-02-22 Qualtrics, Llc Media management system supporting a plurality of mobile devices
US7827033B2 (en) * 2006-12-06 2010-11-02 Nuance Communications, Inc. Enabling grammars in web page frames
US20080140390A1 (en) * 2006-12-11 2008-06-12 Motorola, Inc. Solution for sharing speech processing resources in a multitasking environment
CA2572116A1 (en) * 2006-12-27 2008-06-27 Ibm Canada Limited - Ibm Canada Limitee System and method for processing multi-modal communication within a workgroup
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
US8843376B2 (en) 2007-03-13 2014-09-23 Nuance Communications, Inc. Speech-enabled web content searching using a multimodal browser
US8977255B2 (en) 2007-04-03 2015-03-10 Apple Inc. Method and system for operating a multi-function portable electronic device using voice-activation
US8788620B2 (en) * 2007-04-04 2014-07-22 International Business Machines Corporation Web service support for a multimodal client processing a multimodal application
AU2012244080B2 (en) * 2007-05-24 2014-04-17 Microsoft Technology Licensing, Llc Personality-based Device
US8131549B2 (en) 2007-05-24 2012-03-06 Microsoft Corporation Personality-based device
US9794348B2 (en) * 2007-06-04 2017-10-17 Todd R. Smith Using voice commands from a mobile device to remotely access and control a computer
US8612972B2 (en) * 2007-06-27 2013-12-17 Microsoft Corporation Running add-on components in virtual environments
US20090018818A1 (en) * 2007-07-10 2009-01-15 Aibelive Co., Ltd. Operating device for natural language input
US9264483B2 (en) 2007-07-18 2016-02-16 Hammond Development International, Inc. Method and system for enabling a communication device to remotely execute an application
ITFI20070177A1 (it) 2007-07-26 2009-01-27 Riccardo Vieri Sistema per la creazione e impostazione di una campagna pubblicitaria derivante dall'inserimento di messaggi pubblicitari all'interno di uno scambio di messaggi e metodo per il suo funzionamento.
US9053089B2 (en) 2007-10-02 2015-06-09 Apple Inc. Part-of-speech tagging using latent analogy
US8165886B1 (en) 2007-10-04 2012-04-24 Great Northern Research LLC Speech interface system and method for control and interaction with applications on a computing system
US8595642B1 (en) 2007-10-04 2013-11-26 Great Northern Research, LLC Multiple shell multi faceted graphical user interface
US8364694B2 (en) 2007-10-26 2013-01-29 Apple Inc. Search assistant for digital media assets
US8620662B2 (en) 2007-11-20 2013-12-31 Apple Inc. Context-aware unit selection
US8472934B2 (en) * 2007-11-28 2013-06-25 Nokia Corporation Portable apparatus, method and user interface
JP2010020273A (ja) * 2007-12-07 2010-01-28 Sony Corp 情報処理装置、および情報処理方法、並びにコンピュータ・プログラム
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
US10002189B2 (en) 2007-12-20 2018-06-19 Apple Inc. Method and apparatus for searching using an active ontology
US9703784B2 (en) * 2007-12-24 2017-07-11 Sandisk Il Ltd. Mobile device and method for concurrently processing content according to multiple playlists
US8219407B1 (en) 2007-12-27 2012-07-10 Great Northern Research, LLC Method for processing the output of a speech recognizer
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US8327272B2 (en) 2008-01-06 2012-12-04 Apple Inc. Portable multifunction device, method, and graphical user interface for viewing and managing electronic calendars
US8086455B2 (en) * 2008-01-09 2011-12-27 Microsoft Corporation Model development authoring, generation and execution based on data and processor dependencies
US20090198496A1 (en) * 2008-01-31 2009-08-06 Matthias Denecke Aspect oriented programmable dialogue manager and apparatus operated thereby
US8065143B2 (en) 2008-02-22 2011-11-22 Apple Inc. Providing text input using speech data and non-speech data
US8289283B2 (en) 2008-03-04 2012-10-16 Apple Inc. Language input interface on a device
CA2719301C (en) * 2008-03-25 2016-10-04 E-Lane Systems Inc. Multi-participant, mixed-initiative voice interaction system
US8996376B2 (en) 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
US10496753B2 (en) * 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
EP2279602B1 (en) * 2008-05-19 2013-07-31 Citrix Systems, Inc. Systems and methods for remoting multimedia plugin calls
US8589161B2 (en) 2008-05-27 2013-11-19 Voicebox Technologies, Inc. System and method for an integrated, multi-modal, multi-device natural language voice services environment
US9305548B2 (en) 2008-05-27 2016-04-05 Voicebox Technologies Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
EP2286356A4 (en) * 2008-06-03 2013-03-06 Whirlpool Co EQUIPMENT DEVELOPMENT TOOLKIT
US8464150B2 (en) 2008-06-07 2013-06-11 Apple Inc. Automatic language identification for dynamic text processing
US8838075B2 (en) 2008-06-19 2014-09-16 Intelligent Mechatronic Systems Inc. Communication system with voice mail access and call by spelling functionality
US20090328062A1 (en) * 2008-06-25 2009-12-31 Microsoft Corporation Scalable and extensible communication framework
CN101621480B (zh) * 2008-06-30 2012-03-28 国际商业机器公司 呈现服务器和在其中使用的xsl文件处理方法
US9652023B2 (en) 2008-07-24 2017-05-16 Intelligent Mechatronic Systems Inc. Power management system
US20100030549A1 (en) 2008-07-31 2010-02-04 Lee Michael M Mobile device having human language translation capability with positional feedback
US8463053B1 (en) 2008-08-08 2013-06-11 The Research Foundation Of State University Of New York Enhanced max margin learning on multimodal data mining in a multimedia database
US8959053B2 (en) * 2008-08-13 2015-02-17 Alcatel Lucent Configuration file framework to support high availability schema based upon asynchronous checkpointing
JP5829000B2 (ja) * 2008-08-20 2015-12-09 株式会社ユニバーサルエンターテインメント 会話シナリオ編集装置
US8768702B2 (en) 2008-09-05 2014-07-01 Apple Inc. Multi-tiered voice feedback in an electronic device
US8898568B2 (en) 2008-09-09 2014-11-25 Apple Inc. Audio user interface
US8712776B2 (en) 2008-09-29 2014-04-29 Apple Inc. Systems and methods for selective text to speech synthesis
US8583418B2 (en) 2008-09-29 2013-11-12 Apple Inc. Systems and methods of detecting language and natural language strings for text to speech synthesis
US8355919B2 (en) 2008-09-29 2013-01-15 Apple Inc. Systems and methods for text normalization for text to speech synthesis
US8396714B2 (en) 2008-09-29 2013-03-12 Apple Inc. Systems and methods for concatenation of words in text to speech synthesis
US8352272B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for text to speech synthesis
US8352268B2 (en) 2008-09-29 2013-01-08 Apple Inc. Systems and methods for selective rate of speech and speech preferences for text to speech synthesis
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US9959870B2 (en) 2008-12-11 2018-05-01 Apple Inc. Speech recognition involving a mobile device
US8121842B2 (en) 2008-12-12 2012-02-21 Microsoft Corporation Audio output of a document from mobile device
US8914417B2 (en) 2009-01-07 2014-12-16 International Business Machines Corporation Apparatus, system, and method for maintaining a context stack
TW201027378A (en) * 2009-01-08 2010-07-16 Realtek Semiconductor Corp Virtual platform and related simulation method
US8498867B2 (en) * 2009-01-15 2013-07-30 K-Nfb Reading Technology, Inc. Systems and methods for selection and use of multiple characters for document narration
WO2010084497A1 (en) * 2009-01-26 2010-07-29 Tomer London System and method for providing visual content and services to wireless terminal devices in response to a received voice call
US9529573B2 (en) * 2009-01-28 2016-12-27 Microsoft Technology Licensing, Llc Graphical user interface generation through use of a binary file
US8862252B2 (en) 2009-01-30 2014-10-14 Apple Inc. Audio user interface for displayless electronic device
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
US8380507B2 (en) 2009-03-09 2013-02-19 Apple Inc. Systems and methods for determining the language to use for speech generated by a text to speech engine
CN101533409B (zh) * 2009-04-23 2011-03-02 何海群 一种网络浏览器系统和控制方法
US8601015B1 (en) 2009-05-15 2013-12-03 Wolfram Alpha Llc Dynamic example generation for queries
US8788524B1 (en) 2009-05-15 2014-07-22 Wolfram Alpha Llc Method and system for responding to queries in an imprecise syntax
US8577543B2 (en) * 2009-05-28 2013-11-05 Intelligent Mechatronic Systems Inc. Communication system with personal information management and remote vehicle monitoring and control features
US10540976B2 (en) 2009-06-05 2020-01-21 Apple Inc. Contextual voice commands
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US20120311585A1 (en) 2011-06-03 2012-12-06 Apple Inc. Organizing task items that represent tasks to perform
US9002713B2 (en) * 2009-06-09 2015-04-07 At&T Intellectual Property I, L.P. System and method for speech personalization by need
US9055151B2 (en) * 2009-06-22 2015-06-09 Avaya Inc. Method to set the flag as replied or forwarded to all replied or forwarded voice messages
US8442189B2 (en) * 2009-06-22 2013-05-14 Avaya Inc. Unified communications appliance
WO2010148518A1 (en) 2009-06-27 2010-12-29 Intelligent Mechatronic Systems Vehicle internet radio interface
US9431006B2 (en) 2009-07-02 2016-08-30 Apple Inc. Methods and apparatuses for automatic speech recognition
US9171541B2 (en) * 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment
WO2011059997A1 (en) 2009-11-10 2011-05-19 Voicebox Technologies, Inc. System and method for providing a natural language content dedication service
US8682649B2 (en) 2009-11-12 2014-03-25 Apple Inc. Sentiment prediction from textual data
US20110110534A1 (en) * 2009-11-12 2011-05-12 Apple Inc. Adjustable voice output based on device status
US20110125733A1 (en) * 2009-11-25 2011-05-26 Fish Nathan J Quick access utility
US9978272B2 (en) 2009-11-25 2018-05-22 Ridetones, Inc Vehicle to vehicle chatting and communication system
US20110144976A1 (en) * 2009-12-10 2011-06-16 Arun Jain Application user interface system and method
US8868427B2 (en) * 2009-12-11 2014-10-21 General Motors Llc System and method for updating information in electronic calendars
GB0922608D0 (en) * 2009-12-23 2010-02-10 Vratskides Alexios Message optimization
EP4318463A3 (en) * 2009-12-23 2024-02-28 Google LLC Multi-modal input on an electronic device
JP2011139303A (ja) * 2009-12-28 2011-07-14 Nec Corp 通信システム、制御装置、通信制御方法、およびプログラム
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US8311838B2 (en) 2010-01-13 2012-11-13 Apple Inc. Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts
US9665620B2 (en) * 2010-01-15 2017-05-30 Ab Initio Technology Llc Managing data queries
US10553209B2 (en) 2010-01-18 2020-02-04 Apple Inc. Systems and methods for hands-free notification summaries
US10705794B2 (en) 2010-01-18 2020-07-07 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US10679605B2 (en) 2010-01-18 2020-06-09 Apple Inc. Hands-free list-reading by intelligent automated assistant
US8645914B2 (en) * 2010-01-22 2014-02-04 AgentSheets, Inc. Conversational programming
WO2011089450A2 (en) 2010-01-25 2011-07-28 Andrew Peter Nelson Jerram Apparatuses, methods and systems for a digital conversation management platform
US8682667B2 (en) 2010-02-25 2014-03-25 Apple Inc. User profiling for selecting user specific voice input processing information
US8433828B2 (en) 2010-02-26 2013-04-30 Apple Inc. Accessory protocol for touch screen device accessibility
US8554948B2 (en) * 2010-03-03 2013-10-08 At&T Intellectual Property I, L.P. Methods, systems and computer program products for identifying traffic on the internet using communities of interest
JP5560814B2 (ja) * 2010-03-24 2014-07-30 日本電気株式会社 負荷分散システム、負荷分散サーバおよび負荷分散方法
US8612932B2 (en) * 2010-04-18 2013-12-17 Voxeo Corporation Unified framework and method for call control and media control
US8776091B2 (en) * 2010-04-30 2014-07-08 Microsoft Corporation Reducing feedback latency
US8484015B1 (en) 2010-05-14 2013-07-09 Wolfram Alpha Llc Entity pages
US8639516B2 (en) 2010-06-04 2014-01-28 Apple Inc. User-specific noise suppression for voice quality improvements
US8352908B2 (en) 2010-06-28 2013-01-08 International Business Machines Corporation Multi-modal conversion tool for form-type applications
US8713021B2 (en) 2010-07-07 2014-04-29 Apple Inc. Unsupervised document clustering using latent semantic density analysis
US9246914B2 (en) * 2010-07-16 2016-01-26 Nokia Technologies Oy Method and apparatus for processing biometric information using distributed computation
US9104670B2 (en) 2010-07-21 2015-08-11 Apple Inc. Customized search or acquisition of digital media assets
US8812298B1 (en) 2010-07-28 2014-08-19 Wolfram Alpha Llc Macro replacement of natural language input
US8719006B2 (en) 2010-08-27 2014-05-06 Apple Inc. Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis
US8688435B2 (en) * 2010-09-22 2014-04-01 Voice On The Go Inc. Systems and methods for normalizing input media
US8719014B2 (en) 2010-09-27 2014-05-06 Apple Inc. Electronic device with text error correction based on voice recognition data
US10515147B2 (en) 2010-12-22 2019-12-24 Apple Inc. Using statistical language models for contextual lookup
US10762293B2 (en) 2010-12-22 2020-09-01 Apple Inc. Using parts-of-speech tagging and named entity recognition for spelling correction
US20120173242A1 (en) * 2010-12-30 2012-07-05 Samsung Electronics Co., Ltd. System and method for exchange of scribble data between gsm devices along with voice
US9953653B2 (en) 2011-01-07 2018-04-24 Nuance Communications, Inc. Configurable speech recognition system using multiple recognizers
US9183843B2 (en) 2011-01-07 2015-11-10 Nuance Communications, Inc. Configurable speech recognition system using multiple recognizers
US8781836B2 (en) 2011-02-22 2014-07-15 Apple Inc. Hearing assistance system for providing consistent human speech
US9542657B2 (en) * 2011-02-23 2017-01-10 Avaya Inc. Method and system for optimizing contact center performance
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US9202465B2 (en) * 2011-03-25 2015-12-01 General Motors Llc Speech recognition dependent on text message content
US8171137B1 (en) * 2011-05-09 2012-05-01 Google Inc. Transferring application state across devices
CN103534696B (zh) * 2011-05-13 2018-02-16 微软技术许可有限责任公司 针对口语语言理解中的域检测利用查询点击记录
US20120317492A1 (en) * 2011-05-27 2012-12-13 Telefon Projekt LLC Providing Interactive and Personalized Multimedia Content from Remote Servers
US10672399B2 (en) 2011-06-03 2020-06-02 Apple Inc. Switching between text data and audio data based on a mapping
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US8812294B2 (en) 2011-06-21 2014-08-19 Apple Inc. Translating phrases from one language into another using an order-based set of declarative rules
US9069814B2 (en) 2011-07-27 2015-06-30 Wolfram Alpha Llc Method and system for using natural language to generate widgets
US8706472B2 (en) 2011-08-11 2014-04-22 Apple Inc. Method for disambiguating multiple readings in language conversion
US8994660B2 (en) 2011-08-29 2015-03-31 Apple Inc. Text correction processing
US8645978B2 (en) * 2011-09-02 2014-02-04 Compuverde Ab Method for data maintenance
US9734252B2 (en) 2011-09-08 2017-08-15 Wolfram Alpha Llc Method and system for analyzing data using a query answering system
US8645532B2 (en) * 2011-09-13 2014-02-04 BlueStripe Software, Inc. Methods and computer program products for monitoring the contents of network traffic in a network device
US8762156B2 (en) 2011-09-28 2014-06-24 Apple Inc. Speech recognition repair using contextual information
US9851950B2 (en) 2011-11-15 2017-12-26 Wolfram Alpha Llc Programming in a precise syntax using natural language
US8595016B2 (en) * 2011-12-23 2013-11-26 Angle, Llc Accessing content using a source-specific content-adaptable dialogue
KR101962126B1 (ko) * 2012-02-24 2019-03-26 엘지전자 주식회사 음성 인식 결과에 따라 데이터베이스에 액세스하는 멀티미디어 디바이스 및 그 제어 방법
US10134385B2 (en) 2012-03-02 2018-11-20 Apple Inc. Systems and methods for name pronunciation
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
WO2013169759A2 (en) * 2012-05-07 2013-11-14 Citrix Systems, Inc. Speech recognition support for remote applications and desktops
US20130304536A1 (en) * 2012-05-10 2013-11-14 Ebay, Inc. Harvest Customer Tracking Information
US9280610B2 (en) 2012-05-14 2016-03-08 Apple Inc. Crowd sourcing information to fulfill user requests
US10417037B2 (en) 2012-05-15 2019-09-17 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US8775442B2 (en) 2012-05-15 2014-07-08 Apple Inc. Semantic search using a single-source semantic model
US10395270B2 (en) 2012-05-17 2019-08-27 Persado Intellectual Property Limited System and method for recommending a grammar for a message campaign used by a message optimization system
US10019994B2 (en) 2012-06-08 2018-07-10 Apple Inc. Systems and methods for recognizing textual identifiers within a plurality of words
US9721563B2 (en) 2012-06-08 2017-08-01 Apple Inc. Name recognition system
US9502029B1 (en) * 2012-06-25 2016-11-22 Amazon Technologies, Inc. Context-aware speech processing
US20140006318A1 (en) * 2012-06-29 2014-01-02 Poe XING Collecting, discovering, and/or sharing media objects
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US8554559B1 (en) * 2012-07-13 2013-10-08 Google Inc. Localized speech recognition with offload
US9710475B1 (en) 2012-07-16 2017-07-18 Tintri Inc. Synchronization of data
US9424233B2 (en) * 2012-07-20 2016-08-23 Veveo, Inc. Method of and system for inferring user intent in search input in a conversational interaction system
US9485330B2 (en) 2012-07-30 2016-11-01 Tencent Technology (Shenzhen) Company Limited Web browser operation method and system
CN103577444B (zh) * 2012-07-30 2017-04-05 腾讯科技(深圳)有限公司 一种操控浏览器的方法及系统
US9465833B2 (en) 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
US9378073B2 (en) 2012-08-14 2016-06-28 International Business Machines Corporation Remote procedure call for a distributed system
US9384736B2 (en) * 2012-08-21 2016-07-05 Nuance Communications, Inc. Method to provide incremental UI response based on multiple asynchronous evidence about user input
US9405424B2 (en) 2012-08-29 2016-08-02 Wolfram Alpha, Llc Method and system for distributing and displaying graphical items
KR102150289B1 (ko) * 2012-08-30 2020-09-01 삼성전자주식회사 사용자 단말에서 사용자 인터페이스 장치 및 이를 지원하는 방법
CN102831229A (zh) * 2012-08-30 2012-12-19 浙大网新科技股份有限公司 适合盲人使用的网页浏览方法
US9576574B2 (en) 2012-09-10 2017-02-21 Apple Inc. Context-sensitive handling of interruptions by intelligent digital assistant
US8700396B1 (en) * 2012-09-11 2014-04-15 Google Inc. Generating speech data collection prompts
US9547647B2 (en) 2012-09-19 2017-01-17 Apple Inc. Voice-based media searching
US8935167B2 (en) 2012-09-25 2015-01-13 Apple Inc. Exemplar-based latent perceptual modeling for automatic speech recognition
US10276157B2 (en) * 2012-10-01 2019-04-30 Nuance Communications, Inc. Systems and methods for providing a voice agent user interface
US9886944B2 (en) 2012-10-04 2018-02-06 Nuance Communications, Inc. Hybrid controller for ASR
US8983849B2 (en) 2012-10-17 2015-03-17 Nuance Communications, Inc. Multiple device intelligent language model synchronization
WO2014073206A1 (ja) * 2012-11-07 2014-05-15 日本電気株式会社 情報処理装置、及び、情報処理方法
KR101980834B1 (ko) * 2012-11-28 2019-05-21 한국전자통신연구원 컨텍스트 정보 기반 응용프로그램 관리방법 및 그 장치
US9003039B2 (en) 2012-11-29 2015-04-07 Thales Canada Inc. Method and apparatus of resource allocation or resource release
KR101940719B1 (ko) * 2012-12-14 2019-01-23 한국전자통신연구원 태스크 그래프 기반 대화 처리 시스템의 태스크 그래프 구축 장치 및 방법
CN103020278A (zh) * 2012-12-28 2013-04-03 深圳创维数字技术股份有限公司 一种网页的控制方法及浏览器
US9430420B2 (en) 2013-01-07 2016-08-30 Telenav, Inc. Computing system with multimodal interaction mechanism and method of operation thereof
US9922639B1 (en) 2013-01-11 2018-03-20 Amazon Technologies, Inc. User feedback for speech interactions
CN103971687B (zh) * 2013-02-01 2016-06-29 腾讯科技(深圳)有限公司 一种语音识别系统中的负载均衡实现方法和装置
KR20230137475A (ko) 2013-02-07 2023-10-04 애플 인크. 디지털 어시스턴트를 위한 음성 트리거
US9817835B2 (en) 2013-03-12 2017-11-14 Tintri Inc. Efficient data synchronization for storage containers
US10652394B2 (en) 2013-03-14 2020-05-12 Apple Inc. System and method for processing voicemail
US9977779B2 (en) 2013-03-14 2018-05-22 Apple Inc. Automatic supplementation of word correction dictionaries
US9218052B2 (en) 2013-03-14 2015-12-22 Samsung Electronics Co., Ltd. Framework for voice controlling applications
US10642574B2 (en) 2013-03-14 2020-05-05 Apple Inc. Device, method, and graphical user interface for outputting captions
US9733821B2 (en) 2013-03-14 2017-08-15 Apple Inc. Voice control to diagnose inadvertent activation of accessibility features
US10572476B2 (en) 2013-03-14 2020-02-25 Apple Inc. Refining a search based on schedule items
US9368114B2 (en) 2013-03-14 2016-06-14 Apple Inc. Context-sensitive handling of interruptions
US9348555B2 (en) * 2013-03-15 2016-05-24 Volkswagen Ag In-vehicle access of mobile device functions
US9125180B1 (en) 2013-03-15 2015-09-01 Google Inc. Techniques for automatically establishing a long-lasting connection across computing devices configured for short-range wireless communication
US9922642B2 (en) 2013-03-15 2018-03-20 Apple Inc. Training an at least partial voice command system
US11151899B2 (en) 2013-03-15 2021-10-19 Apple Inc. User training by intelligent digital assistant
WO2014168730A2 (en) 2013-03-15 2014-10-16 Apple Inc. Context-sensitive handling of interruptions
WO2014144579A1 (en) 2013-03-15 2014-09-18 Apple Inc. System and method for updating an adaptive speech recognition model
US10748529B1 (en) 2013-03-15 2020-08-18 Apple Inc. Voice activated device for use with a voice-based digital assistant
US9875494B2 (en) * 2013-04-16 2018-01-23 Sri International Using intents to analyze and personalize a user's dialog experience with a virtual personal assistant
WO2014183035A1 (en) 2013-05-10 2014-11-13 Veveo, Inc. Method and system for capturing and exploiting user intent in a conversational interaction based information retrieval system
US9431008B2 (en) 2013-05-29 2016-08-30 Nuance Communications, Inc. Multiple parallel dialogs in smart phone applications
WO2014197334A2 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
WO2014197336A1 (en) 2013-06-07 2014-12-11 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
WO2014196984A1 (en) * 2013-06-07 2014-12-11 Nuance Communications, Inc. Speech-based search using descriptive features of surrounding objects
WO2014197335A1 (en) 2013-06-08 2014-12-11 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
EP3937002A1 (en) 2013-06-09 2022-01-12 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
KR101809808B1 (ko) 2013-06-13 2017-12-15 애플 인크. 음성 명령에 의해 개시되는 긴급 전화를 걸기 위한 시스템 및 방법
US9311298B2 (en) * 2013-06-21 2016-04-12 Microsoft Technology Licensing, Llc Building conversational understanding systems using a toolset
KR102053820B1 (ko) 2013-07-02 2019-12-09 삼성전자주식회사 서버 및 그 제어방법과, 영상처리장치 및 그 제어방법
US10956433B2 (en) * 2013-07-15 2021-03-23 Microsoft Technology Licensing, Llc Performing an operation relative to tabular data based upon voice input
JP6163266B2 (ja) 2013-08-06 2017-07-12 アップル インコーポレイテッド リモート機器からの作動に基づくスマート応答の自動作動
US10109273B1 (en) 2013-08-29 2018-10-23 Amazon Technologies, Inc. Efficient generation of personalized spoken language understanding models
US9361289B1 (en) * 2013-08-30 2016-06-07 Amazon Technologies, Inc. Retrieval and management of spoken language understanding personalization data
US10628378B2 (en) 2013-09-03 2020-04-21 Tintri By Ddn, Inc. Replication of snapshots and clones
US9495965B2 (en) * 2013-09-20 2016-11-15 American Institutes For Research Synthesis and display of speech commands method and system
CN103474065A (zh) * 2013-09-24 2013-12-25 贵阳世纪恒通科技有限公司 基于自动分类技术的语音意图判定识别方法
US10223457B2 (en) * 2013-10-02 2019-03-05 Empire Technology Development Llc Identification of distributed user interface (DUI) elements
KR101740332B1 (ko) * 2013-11-05 2017-06-08 한국전자통신연구원 자동 번역 장치 및 방법
US9754591B1 (en) * 2013-11-18 2017-09-05 Amazon Technologies, Inc. Dialog management context sharing
US9690854B2 (en) * 2013-11-27 2017-06-27 Nuance Communications, Inc. Voice-enabled dialog interaction with web pages
US10296160B2 (en) 2013-12-06 2019-05-21 Apple Inc. Method for extracting salient dialog usage from live data
US9558176B2 (en) 2013-12-06 2017-01-31 Microsoft Technology Licensing, Llc Discriminating between natural language and keyword language items
KR101489826B1 (ko) * 2013-12-30 2015-02-04 유승우 더미 터미널 및 본체
JP6508063B2 (ja) * 2014-02-13 2019-05-08 日本電気株式会社 通信システム、通信装置、及び通信方法
WO2015148725A2 (en) * 2014-03-25 2015-10-01 Mark Stephen Meadows Systems and methods for executing cryptographically secure transactions using voice and natural language processing
US20150278852A1 (en) * 2014-04-01 2015-10-01 DoubleVerify, Inc. System And Method For Identifying Online Advertisement Laundering And Online Advertisement Injection
JP2015207819A (ja) * 2014-04-17 2015-11-19 株式会社リコー 情報処理装置、情報処理システム、通信制御方法およびプログラム
US9514376B2 (en) * 2014-04-29 2016-12-06 Google Inc. Techniques for distributed optical character recognition and distributed machine language translation
US9620105B2 (en) 2014-05-15 2017-04-11 Apple Inc. Analyzing audio input for efficient speech and music recognition
US9990433B2 (en) 2014-05-23 2018-06-05 Samsung Electronics Co., Ltd. Method for searching and device thereof
US10592095B2 (en) 2014-05-23 2020-03-17 Apple Inc. Instantaneous speaking of content on touch devices
US11314826B2 (en) 2014-05-23 2022-04-26 Samsung Electronics Co., Ltd. Method for searching and device thereof
US9502031B2 (en) 2014-05-27 2016-11-22 Apple Inc. Method for supporting dynamic grammars in WFST-based ASR
US9633004B2 (en) 2014-05-30 2017-04-25 Apple Inc. Better resolution when referencing to concepts
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
EP3480811A1 (en) 2014-05-30 2019-05-08 Apple Inc. Multi-command single utterance input method
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US10170123B2 (en) 2014-05-30 2019-01-01 Apple Inc. Intelligent assistant for home automation
US10289433B2 (en) 2014-05-30 2019-05-14 Apple Inc. Domain specific language for encoding assistant dialog
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US9430463B2 (en) 2014-05-30 2016-08-30 Apple Inc. Exemplar-based natural language processing
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9734193B2 (en) 2014-05-30 2017-08-15 Apple Inc. Determining domain salience ranking from ambiguous words in natural speech
US10659851B2 (en) 2014-06-30 2020-05-19 Apple Inc. Real-time digital assistant knowledge updates
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en) 2014-09-12 2020-09-29 Apple Inc. Dynamic thresholds for always listening speech trigger
US9275641B1 (en) * 2014-09-14 2016-03-01 Speaktoit, Inc. Platform for creating customizable dialog system engines
US9898459B2 (en) 2014-09-16 2018-02-20 Voicebox Technologies Corporation Integration of domain information into state transitions of a finite state transducer for natural language processing
CN107003996A (zh) 2014-09-16 2017-08-01 声钰科技 语音商务
US9606986B2 (en) 2014-09-29 2017-03-28 Apple Inc. Integrated word N-gram and class M-gram language models
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
WO2016061309A1 (en) 2014-10-15 2016-04-21 Voicebox Technologies Corporation System and method for providing follow-up responses to prior natural language inputs of a user
US9648055B2 (en) * 2014-10-15 2017-05-09 Ayla Networks, Inc. Applications in a flexible registration framework
US9813505B2 (en) 2014-10-15 2017-11-07 Ayla Networks, Inc. Devices in a flexible registration framework
EP3207452A4 (en) * 2014-10-15 2018-06-20 Ayla Networks, Inc. Registration framework for connected consumer devices
US9800619B2 (en) * 2014-10-15 2017-10-24 Ayla Networks, Inc. Registration framework for connected consumer devices
CN107003723A (zh) * 2014-10-21 2017-08-01 罗伯特·博世有限公司 用于会话系统中的响应选择和组成的自动化的方法和系统
US10431214B2 (en) 2014-11-26 2019-10-01 Voicebox Technologies Corporation System and method of determining a domain and/or an action related to a natural language input
US10614799B2 (en) 2014-11-26 2020-04-07 Voicebox Technologies Corporation System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance
US10552013B2 (en) 2014-12-02 2020-02-04 Apple Inc. Data detection
US9965464B2 (en) * 2014-12-05 2018-05-08 Microsoft Technology Licensing, Llc Automatic process guidance
US9711141B2 (en) 2014-12-09 2017-07-18 Apple Inc. Disambiguating heteronyms in speech synthesis
US10154041B2 (en) * 2015-01-13 2018-12-11 Microsoft Technology Licensing, Llc Website access control
US9854049B2 (en) 2015-01-30 2017-12-26 Rovi Guides, Inc. Systems and methods for resolving ambiguous terms in social chatter based on a user profile
US10180985B2 (en) * 2015-02-19 2019-01-15 At&T Intellectual Property I, L.P. Apparatus and method for automatically redirecting a search
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US10152299B2 (en) 2015-03-06 2018-12-11 Apple Inc. Reducing response latency of intelligent automated assistants
US10567477B2 (en) 2015-03-08 2020-02-18 Apple Inc. Virtual assistant continuity
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9672829B2 (en) * 2015-03-23 2017-06-06 International Business Machines Corporation Extracting and displaying key points of a video conference
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US20160321025A1 (en) * 2015-04-30 2016-11-03 Kabushiki Kaisha Toshiba Electronic apparatus and method
US10460227B2 (en) 2015-05-15 2019-10-29 Apple Inc. Virtual assistant in a communication session
CN106302089B (zh) * 2015-05-21 2019-06-11 阿里巴巴集团控股有限公司 会话处理方法和系统
US10200824B2 (en) 2015-05-27 2019-02-05 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device
EP3792842A1 (en) * 2015-05-27 2021-03-17 Orion Labs Intelligent agent features for wearable personal communication nodes
US9721569B2 (en) * 2015-05-27 2017-08-01 Intel Corporation Gaussian mixture model accelerator with direct memory access engines corresponding to individual data streams
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10324979B2 (en) 2015-06-01 2019-06-18 AffectLayer, Inc. Automatic generation of playlists from conversations
US10367940B2 (en) 2015-06-01 2019-07-30 AffectLayer, Inc. Analyzing conversations to automatically identify product feature requests
US10181326B2 (en) 2015-06-01 2019-01-15 AffectLayer, Inc. Analyzing conversations to automatically identify action items
US10133999B2 (en) 2015-06-01 2018-11-20 AffectLayer, Inc. Analyzing conversations to automatically identify deals at risk
US10360911B2 (en) 2015-06-01 2019-07-23 AffectLayer, Inc. Analyzing conversations to automatically identify product features that resonate with customers
US10110743B2 (en) 2015-06-01 2018-10-23 AffectLayer, Inc. Automatic pattern recognition in conversations
US10679013B2 (en) 2015-06-01 2020-06-09 AffectLayer, Inc. IoT-based call assistant device
US10387573B2 (en) 2015-06-01 2019-08-20 AffectLayer, Inc. Analyzing conversations to automatically identify customer pain points
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US9578173B2 (en) 2015-06-05 2017-02-21 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US11025565B2 (en) 2015-06-07 2021-06-01 Apple Inc. Personalized prediction of responses for instant messaging
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US20160378747A1 (en) 2015-06-29 2016-12-29 Apple Inc. Virtual assistant for media playback
US9407989B1 (en) 2015-06-30 2016-08-02 Arthur Woodrow Closed audio circuit
US10740384B2 (en) 2015-09-08 2020-08-11 Apple Inc. Intelligent automated assistant for media search and playback
US10671428B2 (en) 2015-09-08 2020-06-02 Apple Inc. Distributed personal assistant
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10331312B2 (en) 2015-09-08 2019-06-25 Apple Inc. Intelligent automated assistant in a media environment
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US11010550B2 (en) 2015-09-29 2021-05-18 Apple Inc. Unified language modeling framework for word prediction, auto-completion and auto-correction
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US10504137B1 (en) 2015-10-08 2019-12-10 Persado Intellectual Property Limited System, method, and computer program product for monitoring and responding to the performance of an ad
US9688225B2 (en) * 2015-10-09 2017-06-27 Livio, Inc. Methods and systems for a mobile device to emulate a vehicle human-machine interface
GB2557532A (en) * 2015-10-21 2018-06-20 Google Llc Parameter collection and automatic dialog generation in dialog systems
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
US10956666B2 (en) 2015-11-09 2021-03-23 Apple Inc. Unconventional virtual assistant interactions
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US11018888B2 (en) * 2015-12-03 2021-05-25 Whirlpool Corporation Methods of remote control of appliances
US10832283B1 (en) 2015-12-09 2020-11-10 Persado Intellectual Property Limited System, method, and computer program for providing an instance of a promotional message to a user based on a predicted emotional response corresponding to user characteristics
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10732783B2 (en) 2015-12-28 2020-08-04 Microsoft Technology Licensing, Llc Identifying image comments from similar images
US10417346B2 (en) * 2016-01-23 2019-09-17 Microsoft Technology Licensing, Llc Tool for facilitating the development of new language understanding scenarios
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US10224026B2 (en) * 2016-03-15 2019-03-05 Sony Corporation Electronic device, system, method and computer program
EP3241343B1 (en) * 2016-03-17 2018-11-28 Google LLC Multi-provider data provision with request batching
US9996531B1 (en) * 2016-03-29 2018-06-12 Facebook, Inc. Conversational understanding
US10319371B2 (en) * 2016-05-04 2019-06-11 GM Global Technology Operations LLC Disambiguation of vehicle speech commands
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
CN106095593B (zh) 2016-05-31 2019-04-16 Oppo广东移动通信有限公司 一种前、后景应用程序行为同步方法及装置
KR101995540B1 (ko) * 2016-06-03 2019-07-15 주식회사 허브케이 판독/입력 오류 단어 정정 장치 및 방법
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
DK179309B1 (en) 2016-06-09 2018-04-23 Apple Inc Intelligent automated assistant in a home environment
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10586535B2 (en) 2016-06-10 2020-03-10 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
DK179049B1 (en) 2016-06-11 2017-09-18 Apple Inc Data driven natural language event detection and classification
DK179343B1 (en) 2016-06-11 2018-05-14 Apple Inc Intelligent task discovery
DK179415B1 (en) 2016-06-11 2018-06-14 Apple Inc Intelligent device arbitration and control
DK201670540A1 (en) 2016-06-11 2018-01-08 Apple Inc Application integration with a digital assistant
US11200891B2 (en) * 2016-06-21 2021-12-14 Hewlett-Packard Development Company, L.P. Communications utilizing multiple virtual assistant services
WO2018023106A1 (en) 2016-07-29 2018-02-01 Erik SWART System and method of disambiguating natural language processing requests
KR101827773B1 (ko) 2016-08-02 2018-02-09 주식회사 하이퍼커넥트 통역 장치 및 방법
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
KR102481881B1 (ko) 2016-09-07 2022-12-27 삼성전자주식회사 외부 디바이스를 제어하는 서버 및 그 방법
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US11281993B2 (en) 2016-12-05 2022-03-22 Apple Inc. Model and ensemble compression for metric learning
CN106847272A (zh) * 2016-12-12 2017-06-13 北京光年无限科技有限公司 与运行应用关联的智能机器人的数据交互方法
CN106844453B (zh) * 2016-12-20 2020-09-08 江苏瀚远科技股份有限公司 一种电子文件格式转换方法
US10593346B2 (en) 2016-12-22 2020-03-17 Apple Inc. Rank-reduced token representation for automatic speech recognition
US11204787B2 (en) 2017-01-09 2021-12-21 Apple Inc. Application integration with a digital assistant
US10971157B2 (en) 2017-01-11 2021-04-06 Nuance Communications, Inc. Methods and apparatus for hybrid speech recognition processing
US20180247644A1 (en) * 2017-02-27 2018-08-30 Intel Corporation Queueing spoken dialogue output
EP3392884A1 (en) * 2017-04-21 2018-10-24 audEERING GmbH A method for automatic affective state inference and an automated affective state inference system
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
DK201770383A1 (en) 2017-05-09 2018-12-14 Apple Inc. USER INTERFACE FOR CORRECTING RECOGNITION ERRORS
DK201770439A1 (en) 2017-05-11 2018-12-13 Apple Inc. Offline personal assistant
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
DK201770428A1 (en) 2017-05-12 2019-02-18 Apple Inc. LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT
DK179745B1 (en) 2017-05-12 2019-05-01 Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
DK201770431A1 (en) 2017-05-15 2018-12-20 Apple Inc. Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10127227B1 (en) * 2017-05-15 2018-11-13 Google Llc Providing access to user-controlled resources by automated assistants
US11436417B2 (en) 2017-05-15 2022-09-06 Google Llc Providing access to user-controlled resources by automated assistants
DK201770432A1 (en) 2017-05-15 2018-12-21 Apple Inc. Hierarchical belief states for digital assistants
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US20180336892A1 (en) 2017-05-16 2018-11-22 Apple Inc. Detecting a trigger of a digital assistant
DK179560B1 (en) 2017-05-16 2019-02-18 Apple Inc. FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES
US11048995B2 (en) 2017-05-16 2021-06-29 Google Llc Delayed responses by computational assistant
US20180336275A1 (en) 2017-05-16 2018-11-22 Apple Inc. Intelligent automated assistant for media exploration
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10657328B2 (en) 2017-06-02 2020-05-19 Apple Inc. Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
JP6777887B2 (ja) * 2017-06-30 2020-10-28 京セラドキュメントソリューションズ株式会社 リモート通信制御システム、セッション管理システムおよびセッション管理プログラム
US20190018545A1 (en) * 2017-07-13 2019-01-17 International Business Machines Corporation System and method for rapid financial app prototyping
KR102419374B1 (ko) * 2017-07-21 2022-07-11 삼성전자주식회사 사용자 발화를 처리하는 전자 장치 및 그 전자 장치의 제어 방법
TWI646529B (zh) * 2017-07-25 2019-01-01 雲拓科技有限公司 主動聊天裝置
CN109426530B (zh) * 2017-08-17 2022-04-05 阿里巴巴集团控股有限公司 页面确定方法、装置、服务器和存储介质
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10957313B1 (en) 2017-09-22 2021-03-23 Amazon Technologies, Inc. System command processing
US10600419B1 (en) * 2017-09-22 2020-03-24 Amazon Technologies, Inc. System command processing
CN107479906A (zh) * 2017-09-28 2017-12-15 电子科技大学 基于Cordova的跨平台在线教育移动终端
US10755051B2 (en) 2017-09-29 2020-08-25 Apple Inc. Rule-based natural language processing
US10453454B2 (en) * 2017-10-26 2019-10-22 Hitachi, Ltd. Dialog system with self-learning natural language understanding
CN107767856B (zh) * 2017-11-07 2021-11-19 中国银行股份有限公司 一种语音处理方法、装置及服务器
US10636424B2 (en) 2017-11-30 2020-04-28 Apple Inc. Multi-turn canned dialog
JP6797916B2 (ja) * 2017-12-08 2020-12-09 グーグル エルエルシーGoogle LLC スタック形式のデータ構造のデジタルアシスタント処理
US20190180753A1 (en) * 2017-12-12 2019-06-13 Ca, Inc. Analysis of collaborative dialog data structures from speech processing computer system
CN108008728B (zh) * 2017-12-12 2020-01-17 深圳市银星智能科技股份有限公司 清洁机器人以及基于清洁机器人的最短路径规划方法
US11900928B2 (en) 2017-12-23 2024-02-13 Soundhound Ai Ip, Llc System and method for adapted interactive experiences
US11443646B2 (en) 2017-12-22 2022-09-13 Fathom Technologies, LLC E-Reader interface system with audio and highlighting synchronization for digital books
WO2019125486A1 (en) * 2017-12-22 2019-06-27 Soundhound, Inc. Natural language grammars adapted for interactive experiences
US10671251B2 (en) 2017-12-22 2020-06-02 Arbordale Publishing, LLC Interactive eReader interface generation based on synchronization of textual and audial descriptors
US10733982B2 (en) 2018-01-08 2020-08-04 Apple Inc. Multi-directional dialog
CN108256066B (zh) * 2018-01-16 2022-01-11 智言科技(深圳)有限公司 端到端层次解码任务型对话系统
CN108269205A (zh) * 2018-01-24 2018-07-10 成都安信思远信息技术有限公司 一种利用云平台的电子数据鉴定系统
US20190236137A1 (en) * 2018-01-31 2019-08-01 Microsoft Technology Licensing, Llc Generating conversational representations of web content
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
CN108520748B (zh) * 2018-02-01 2020-03-03 百度在线网络技术(北京)有限公司 一种智能设备功能引导方法及系统
US10891430B2 (en) * 2018-02-02 2021-01-12 Fuji Xerox Co., Ltd. Semi-automated methods for translating structured document content to chat-based interaction
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
KR102507249B1 (ko) * 2018-03-08 2023-03-07 삼성전자주식회사 성능 모드 제어 방법 및 이를 지원하는 전자 장치
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US11056107B2 (en) * 2018-03-30 2021-07-06 International Business Machines Corporation Conversational framework
EP3776246A1 (en) * 2018-04-19 2021-02-17 Google LLC Dependency graph conversation modeling for use in conducting human-to-computer dialog sessions with a computer-implemented automated assistant
CN108614773A (zh) * 2018-04-24 2018-10-02 广州链创信息科技有限公司 一种手机小程序用展示发布体验系统
US11094316B2 (en) * 2018-05-04 2021-08-17 Qualcomm Incorporated Audio analytics for natural language processing
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
EP3570189B1 (de) * 2018-05-14 2023-08-09 Siemens Aktiengesellschaft Computerimplementiertes verfahren zum bereitstellen eines adaptiven dialogsystems und ein adaptives dialogsystem
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
DK179822B1 (da) 2018-06-01 2019-07-12 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
DK180639B1 (en) 2018-06-01 2021-11-04 Apple Inc DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
DK201870355A1 (en) 2018-06-01 2019-12-16 Apple Inc. VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11076039B2 (en) 2018-06-03 2021-07-27 Apple Inc. Accelerated task performance
US10776582B2 (en) * 2018-06-06 2020-09-15 International Business Machines Corporation Supporting combinations of intents in a conversation
US10811009B2 (en) 2018-06-27 2020-10-20 International Business Machines Corporation Automatic skill routing in conversational computing frameworks
EP3937030A1 (en) 2018-08-07 2022-01-12 Google LLC Assembling and evaluating automated assistant responses for privacy concerns
US11436215B2 (en) 2018-08-20 2022-09-06 Samsung Electronics Co., Ltd. Server and control method thereof
CN112313742A (zh) * 2018-08-23 2021-02-02 谷歌有限责任公司 根据多助理环境的特性来调节助理响应性
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
US10762893B2 (en) * 2018-09-28 2020-09-01 Comcast Cable Communications, Llc Monitoring of one or more audio/video collection devices
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US10594837B1 (en) 2018-11-02 2020-03-17 International Business Machines Corporation Predictive service scaling for conversational computing
US10770072B2 (en) 2018-12-10 2020-09-08 International Business Machines Corporation Cognitive triggering of human interaction strategies to facilitate collaboration, productivity, and learning
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
EP3703342B1 (en) * 2019-03-01 2023-07-26 ABB Schweiz AG Dynamic load balancing in network centric process control systems
WO2020185880A1 (en) * 2019-03-12 2020-09-17 Beguided, Inc. Conversational artificial intelligence for automated self-service account management
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
CN111800537B (zh) * 2019-04-09 2021-07-20 Oppo广东移动通信有限公司 终端使用状态的评估方法、装置、存储介质及电子设备
CN111857219B (zh) * 2019-04-25 2023-06-30 益逻触控系统公司 具有三层系统架构的自助服务装置
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11217251B2 (en) 2019-05-06 2022-01-04 Apple Inc. Spoken notifications
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11640504B2 (en) 2019-05-17 2023-05-02 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
DK201970510A1 (en) 2019-05-31 2021-02-11 Apple Inc Voice identification in digital assistant systems
DK180129B1 (en) 2019-05-31 2020-06-02 Apple Inc. USER ACTIVITY SHORTCUT SUGGESTIONS
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11526780B2 (en) * 2019-08-05 2022-12-13 Microsoft Technology Licensing, Llc Converting nonnative skills for conversational computing interfaces
US10915227B1 (en) 2019-08-07 2021-02-09 Bank Of America Corporation System for adjustment of resource allocation based on multi-channel inputs
CN110647736A (zh) * 2019-08-13 2020-01-03 平安科技(深圳)有限公司 插件式坐席系统登录方法、装置、计算机设备及存储介质
CN110400564A (zh) * 2019-08-21 2019-11-01 科大国创软件股份有限公司 一种基于栈的聊天机器人对话管理方法
WO2021056255A1 (en) 2019-09-25 2021-04-01 Apple Inc. Text detection using global geometry estimators
CN110647305B (zh) * 2019-09-29 2023-10-31 阿波罗智联(北京)科技有限公司 应用程序的语音交互方法、装置、设备和介质
JP7029434B2 (ja) * 2019-10-23 2022-03-03 サウンドハウンド,インコーポレイテッド コンピュータによって実行される方法、サーバ装置、情報処理システム、プログラム、およびクライアント端末
WO2021096382A2 (ru) * 2019-11-15 2021-05-20 Федеральное Государственное Автономное Образовательное Учреждение Высшего Образования "Московский Физико-Технический Институт" (Национальный Исследовательский Университет) (Мфти) Способ и система управления диалоговым агентом в канале взаимодействия с пользователем
CN111105289B (zh) * 2019-11-22 2023-05-05 北京云杉信息技术有限公司 一种基于aop方式的客户端订单取消判断方法及装置
US11664044B2 (en) 2019-11-25 2023-05-30 Qualcomm Incorporated Sound event detection learning
EP3836043A1 (en) 2019-12-11 2021-06-16 Carrier Corporation A method and an equipment for configuring a service
WO2021117251A1 (ja) * 2019-12-13 2021-06-17 株式会社日立産機システム 通信モジュール、通信システム及び通信方法
CN111223485A (zh) * 2019-12-19 2020-06-02 深圳壹账通智能科技有限公司 智能交互方法、装置、电子设备及存储介质
US11209573B2 (en) 2020-01-07 2021-12-28 Northrop Grumman Systems Corporation Radio occultation aircraft navigation aid system
US11550552B2 (en) * 2020-03-30 2023-01-10 Nuance Communications, Inc. Development system and method for a conversational application
US11038934B1 (en) 2020-05-11 2021-06-15 Apple Inc. Digital assistant hardware abstraction
US11514799B2 (en) 2020-11-11 2022-11-29 Northrop Grumman Systems Corporation Systems and methods for maneuvering an aerial vehicle during adverse weather conditions
US11410677B2 (en) 2020-11-24 2022-08-09 Qualcomm Incorporated Adaptive sound event classification
CN112527435B (zh) * 2020-12-08 2024-02-13 建信金融科技有限责任公司 浏览器操作方法及系统
US11329933B1 (en) 2020-12-28 2022-05-10 Drift.com, Inc. Persisting an AI-supported conversation across multiple channels
US20220415311A1 (en) * 2021-06-24 2022-12-29 Amazon Technologies, Inc. Early invocation for contextual data processing
US20230197072A1 (en) * 2021-12-17 2023-06-22 Google Llc Warm word arbitration between automated assistant devices
CN115064168B (zh) * 2022-08-17 2022-12-13 广州小鹏汽车科技有限公司 语音交互方法、服务器和存储介质
CN116719649A (zh) * 2022-09-01 2023-09-08 荣耀终端有限公司 会话通知的提示方法、电子设备及可读存储介质
CN116450383A (zh) * 2023-06-19 2023-07-18 合肥景云软件工程有限责任公司 一种通过网页调用操作系统的方法

Family Cites Families (104)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4787031A (en) * 1985-01-04 1988-11-22 Digital Equipment Corporation Computer with virtual machine mode and multiple protection rings
JPH0673135B2 (ja) * 1988-03-01 1994-09-14 工業技術院長 自然言語対話処理装置
US5282265A (en) * 1988-10-04 1994-01-25 Canon Kabushiki Kaisha Knowledge information processing system
JPH0384652A (ja) * 1989-08-29 1991-04-10 Personal Joho Kankyo Kyokai ヒューマンインタフェースのアーキテクチャモデル
US5333237A (en) * 1989-10-10 1994-07-26 Hughes Aircraft Company Hypermedia structured knowledge base system
JP3235728B2 (ja) * 1990-02-09 2001-12-04 沖ソフトウェア株式会社 会話制御方式
JPH05304567A (ja) * 1990-04-05 1993-11-16 Texas Instr Inc <Ti> 遠隔通信システムならびにその遠隔通信サービスプレゼンテーションインターフェース及び遠隔通信ネットワーク
JPH0410153A (ja) * 1990-04-27 1992-01-14 Nippon Telegr & Teleph Corp <Ntt> 情報検索システム
US5133004A (en) * 1990-05-07 1992-07-21 Unisys Corporation Digital computer platform for supporting telephone network applications
US5255305A (en) 1990-11-01 1993-10-19 Voiceplex Corporation Integrated voice processing system
US5274695A (en) * 1991-01-11 1993-12-28 U.S. Sprint Communications Company Limited Partnership System for verifying the identity of a caller in a telecommunications network
JPH0792801B2 (ja) * 1991-08-02 1995-10-09 工業技術院長 非単調推論方法
US5283888A (en) 1991-08-27 1994-02-01 International Business Machines Corporation Voice processing interface unit employing virtual screen communications for accessing a plurality of primed applications
US6400996B1 (en) * 1999-02-01 2002-06-04 Steven M. Hoffberg Adaptive pattern recognition based control system and method
JP3213056B2 (ja) * 1992-06-19 2001-09-25 株式会社日立製作所 マルチメディア処理装置
US5384890A (en) * 1992-09-30 1995-01-24 Apple Computer, Inc. Method and apparatus for providing multiple clients simultaneous access to a sound data stream
FR2696574B1 (fr) * 1992-10-06 1994-11-18 Sextant Avionique Procédé et dispositif d'analyse d'un message fourni par des moyens d'interaction à un système de dialogue homme-machine.
JPH06131437A (ja) * 1992-10-20 1994-05-13 Hitachi Ltd 複合形態による操作指示方法
JPH06149534A (ja) * 1992-11-13 1994-05-27 Toshiba Corp 音声画面操作装置
US6029188A (en) * 1993-01-18 2000-02-22 Institute For Personalized Information Environment Information processing system for an architecture model capable of interfacing with humans and capable of being modified
US5544228A (en) * 1993-09-27 1996-08-06 The Walt Disney Company Method and apparatus for transmission of full frequency digital audio
US5509060A (en) * 1993-11-19 1996-04-16 At&T Corp. Network-accessible intelligent telephone service
US5748841A (en) * 1994-02-25 1998-05-05 Morin; Philippe Supervised contextual language acquisition system
US5644686A (en) * 1994-04-29 1997-07-01 International Business Machines Corporation Expert system and method employing hierarchical knowledge base, and interactive multimedia/hypermedia applications
JP3408332B2 (ja) * 1994-09-12 2003-05-19 富士通株式会社 ハイパーテキスト読上装置
US5682539A (en) * 1994-09-29 1997-10-28 Conrad; Donovan Anticipated meaning natural language interface
US5594789A (en) * 1994-10-13 1997-01-14 Bell Atlantic Network Services, Inc. Transaction implementation in video dial tone network
DE69511425T2 (de) * 1994-11-08 2000-03-09 Vermeer Tech Inc Herstellungshilfe für online-dienste mit gebührenfeststellung
JPH08137697A (ja) * 1994-11-15 1996-05-31 Kobe Steel Ltd 自律分散型システム
US6029195A (en) * 1994-11-29 2000-02-22 Herz; Frederick S. M. System for customized electronic identification of desirable objects
US5748974A (en) * 1994-12-13 1998-05-05 International Business Machines Corporation Multimodal natural language interface for cross-application tasks
JP2746218B2 (ja) * 1995-02-23 1998-05-06 富士ゼロックス株式会社 資源管理装置および資源管理方法
US5890123A (en) * 1995-06-05 1999-03-30 Lucent Technologies, Inc. System and method for voice controlled video screen display
US5742845A (en) * 1995-06-22 1998-04-21 Datascape, Inc. System for extending present open network communication protocols to communicate with non-standard I/O devices directly coupled to an open network
JP3729918B2 (ja) * 1995-07-19 2005-12-21 株式会社東芝 マルチモーダル対話装置及び対話方法
US5794205A (en) 1995-10-19 1998-08-11 Voice It Worldwide, Inc. Voice recognition interface apparatus and method for interacting with a programmable timekeeping device
JPH10124378A (ja) 1995-12-01 1998-05-15 Zakuson R & D:Kk インターネットおよびイントラネット上におけるホームページ上の文書録画支援装置および文書録画支援システム
US6240448B1 (en) * 1995-12-22 2001-05-29 Rutgers, The State University Of New Jersey Method and system for audio access to information in a wide area computer network
US6101528A (en) * 1996-03-27 2000-08-08 Intel Corporation Method and apparatus for discovering server applications by a client application in a network of computer systems
US5884262A (en) * 1996-03-28 1999-03-16 Bell Atlantic Network Services, Inc. Computer network audio access and conversion system
KR19990028327A (ko) * 1996-04-22 1999-04-15 제프리 엠. 웨이닉 오디오 인터페이스를 이용한 정보 검색 방법 및 장치
US5956507A (en) * 1996-05-14 1999-09-21 Shearer, Jr.; Bennie L. Dynamic alteration of operating system kernel resource tables
JPH09319760A (ja) * 1996-05-30 1997-12-12 Nippon Telegr & Teleph Corp <Ntt> 情報検索運行システム
DE69734650T2 (de) * 1996-06-03 2006-08-10 Worldvoice Licensing, Inc. Fernsprechsprachpostverfahren mit netzwerkadress-und sprach-authentifizierung
JPH1078952A (ja) * 1996-07-29 1998-03-24 Internatl Business Mach Corp <Ibm> 音声合成方法、音声合成装置、ハイパーテキストの制御方法及び制御装置
US5799063A (en) * 1996-08-15 1998-08-25 Talk Web Inc. Communication system and method of providing access to pre-recorded audio messages via the Internet
JPH10124293A (ja) * 1996-08-23 1998-05-15 Osaka Gas Co Ltd 音声指令可能なコンピュータとそれ用の媒体
US5850629A (en) * 1996-09-09 1998-12-15 Matsushita Electric Industrial Co., Ltd. User interface controller for text-to-speech synthesizer
EP0863466A4 (en) * 1996-09-26 2005-07-20 Mitsubishi Electric Corp INTERACTIVE PROCESSOR
US5721827A (en) * 1996-10-02 1998-02-24 James Logan System for electrically distributing personalized information
US5915001A (en) * 1996-11-14 1999-06-22 Vois Corporation System and method for providing and using universally accessible voice and speech data files
US5774857A (en) * 1996-11-15 1998-06-30 Motorola, Inc. Conversion of communicated speech to text for tranmission as RF modulated base band video
JP3826239B2 (ja) * 1996-11-22 2006-09-27 日本電信電話株式会社 ハイパーテキスト中継方法及び装置
US6282511B1 (en) * 1996-12-04 2001-08-28 At&T Voiced interface with hyperlinked information
US5960399A (en) 1996-12-24 1999-09-28 Gte Internetworking Incorporated Client/server speech processor/recognizer
US6456974B1 (en) * 1997-01-06 2002-09-24 Texas Instruments Incorporated System and method for adding speech recognition capabilities to java
JP2907174B2 (ja) * 1997-02-04 1999-06-21 日本電気株式会社 監視制御システムのユーザインタフェースシステム
JPH10222342A (ja) * 1997-02-06 1998-08-21 Nippon Telegr & Teleph Corp <Ntt> ハイパーテキスト音声制御方法及び装置
JPH10222525A (ja) 1997-02-07 1998-08-21 Hitachi Ltd ナビゲーション方法
JPH10243048A (ja) * 1997-02-27 1998-09-11 Nippon Telegr & Teleph Corp <Ntt> 情報伝送方法及びシステム
US5987102A (en) * 1997-03-14 1999-11-16 Efusion, Inc. Method and apparatus for bridging a voice call including selective provision of information in non-audio to the caller
US5884266A (en) 1997-04-02 1999-03-16 Motorola, Inc. Audio interface for document based information resource navigation and method therefor
JP3646471B2 (ja) * 1997-05-14 2005-05-11 日本電信電話株式会社 情報提供方法及び個人適応情報提供サーバ及び情報提供プログラムを格納した記憶媒体
US5897616A (en) * 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6044347A (en) * 1997-08-05 2000-03-28 Lucent Technologies Inc. Methods and apparatus object-oriented rule-based dialogue management
US6108711A (en) * 1998-09-11 2000-08-22 Genesys Telecommunications Laboratories, Inc. Operating system having external media layer, workflow layer, internal media layer, and knowledge base for routing media events between transactions
US6353824B1 (en) * 1997-11-18 2002-03-05 Apple Computer, Inc. Method for dynamic presentation of the contents topically rich capsule overviews corresponding to the plurality of documents, resolving co-referentiality in document segments
US6157705A (en) * 1997-12-05 2000-12-05 E*Trade Group, Inc. Voice control of a server
US6370508B2 (en) * 1998-09-11 2002-04-09 Genesys Telecommunications Laboratories, Inc. Interface engine for managing business processes within a multimedia communication-center
EP1073957B1 (en) * 1998-03-23 2003-05-21 Microsoft Corporation Application program interfaces in an operating system
US6018711A (en) * 1998-04-21 2000-01-25 Nortel Networks Corporation Communication system user interface with animated representation of time remaining for input to recognizer
US6012030A (en) * 1998-04-21 2000-01-04 Nortel Networks Corporation Management of speech and audio prompts in multimodal interfaces
US6859451B1 (en) * 1998-04-21 2005-02-22 Nortel Networks Limited Server for handling multimodal information
US6438523B1 (en) * 1998-05-20 2002-08-20 John A. Oberteuffer Processing handwritten and hand-drawn input and speech input
US6397259B1 (en) * 1998-05-29 2002-05-28 Palm, Inc. Method, system and apparatus for packet minimized communications
US6253326B1 (en) * 1998-05-29 2001-06-26 Palm, Inc. Method and system for secure communications
US6269366B1 (en) * 1998-06-24 2001-07-31 Eastman Kodak Company Method for randomly combining images with annotations
US6269336B1 (en) 1998-07-24 2001-07-31 Motorola, Inc. Voice browser for interactive services and methods thereof
US6119147A (en) * 1998-07-28 2000-09-12 Fuji Xerox Co., Ltd. Method and system for computer-mediated, multi-modal, asynchronous meetings in a virtual space
US6779060B1 (en) * 1998-08-05 2004-08-17 British Telecommunications Public Limited Company Multimodal user interface
US6195696B1 (en) * 1998-10-01 2001-02-27 International Business Machines Corporation Systems, methods and computer program products for assigning, generating and delivering content to intranet users
US6539359B1 (en) 1998-10-02 2003-03-25 Motorola, Inc. Markup language for interactive services and methods thereof
US7003463B1 (en) * 1998-10-02 2006-02-21 International Business Machines Corporation System and method for providing network coordinated conversational services
IL140805A0 (en) * 1998-10-02 2002-02-10 Ibm Structure skeletons for efficient voice navigation through generic hierarchical objects
US6615240B1 (en) * 1998-12-18 2003-09-02 Motive Communications, Inc. Technical support chain automation with guided self-help capability and option to escalate to live help
US6851115B1 (en) * 1999-01-05 2005-02-01 Sri International Software-based architecture for communication and cooperation among distributed electronic agents
US6523061B1 (en) * 1999-01-05 2003-02-18 Sri International, Inc. System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system
US7216351B1 (en) * 1999-04-07 2007-05-08 International Business Machines Corporation Systems and methods for synchronizing multi-modal interactions
US6519617B1 (en) * 1999-04-08 2003-02-11 International Business Machines Corporation Automated creation of an XML dialect and dynamic generation of a corresponding DTD
US6636831B1 (en) * 1999-04-09 2003-10-21 Inroad, Inc. System and process for voice-controlled information retrieval
US6529889B1 (en) * 1999-07-27 2003-03-04 Acappella Software, Inc. System and method of knowledge architecture
US6377913B1 (en) * 1999-08-13 2002-04-23 International Business Machines Corporation Method and system for multi-client access to a dialog system
US7069220B2 (en) * 1999-08-13 2006-06-27 International Business Machines Corporation Method for determining and maintaining dialog focus in a conversational speech system
US6799169B1 (en) * 1999-08-13 2004-09-28 International Business Machines Corporation Method and system for modeless operation of a multi-modal user interface through implementation of independent decision networks
US6738803B1 (en) * 1999-09-03 2004-05-18 Cisco Technology, Inc. Proxy browser providing voice enabled web application audio control for telephony devices
US6615172B1 (en) * 1999-11-12 2003-09-02 Phoenix Solutions, Inc. Intelligent query engine for processing voice based queries
US6829603B1 (en) * 2000-02-02 2004-12-07 International Business Machines Corp. System, method and program product for interactive natural dialog
US7415537B1 (en) * 2000-04-07 2008-08-19 International Business Machines Corporation Conversational portal for providing conversational browsing and multimedia broadcast on demand
US6934756B2 (en) * 2000-11-01 2005-08-23 International Business Machines Corporation Conversational networking via transport, coding and control conversational protocols
US6970935B1 (en) * 2000-11-01 2005-11-29 International Business Machines Corporation Conversational networking via transport, coding and control conversational protocols
US6801604B2 (en) * 2001-06-25 2004-10-05 International Business Machines Corporation Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources
US6839896B2 (en) * 2001-06-29 2005-01-04 International Business Machines Corporation System and method for providing dialog management and arbitration in a multi-modal environment
US7398209B2 (en) * 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7349845B2 (en) * 2003-09-03 2008-03-25 International Business Machines Corporation Method and apparatus for dynamic modification of command weights in a natural language understanding system

Cited By (64)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103457810B (zh) * 2006-07-07 2017-04-12 三星电子株式会社 在网络环境中提供服务菜单和服务的方法及服务提供设备
CN103295575A (zh) * 2012-02-27 2013-09-11 北京三星通信技术研究有限公司 一种语音识别方法和客户端
CN103295575B (zh) * 2012-02-27 2019-01-25 北京三星通信技术研究有限公司 一种语音识别方法和客户端
CN108629041A (zh) * 2012-07-19 2018-10-09 格兰斯电讯网络有限公司 协同浏览与其他形式的信息共享的集成
CN103117058B (zh) * 2012-12-20 2015-12-09 四川长虹电器股份有限公司 基于智能电视平台的多语音引擎切换系统及方法
CN103117058A (zh) * 2012-12-20 2013-05-22 四川长虹电器股份有限公司 基于智能电视平台的多语音引擎切换系统及方法
CN104598257B (zh) * 2013-10-30 2019-01-18 华为技术有限公司 远程应用程序运行的方法和装置
CN104598257A (zh) * 2013-10-30 2015-05-06 华为技术有限公司 远程应用程序运行的方法和装置
US10057364B2 (en) 2013-10-30 2018-08-21 Huawei Technologies Co., Ltd. Method and apparatus for remotely running application program
CN106663428A (zh) * 2014-07-16 2017-05-10 索尼公司 装置、方法、非暂态计算机可读介质及系统
CN106663428B (zh) * 2014-07-16 2021-02-09 索尼公司 装置、方法、非暂态计算机可读介质及系统
US11888942B2 (en) 2014-09-19 2024-01-30 Convida Wireless, Llc Systems and methods for service layer session migration and sharing
CN112217905A (zh) * 2014-09-19 2021-01-12 康维达无线有限责任公司 服务层会话迁移和共享
CN108475503B (zh) * 2015-10-15 2023-09-22 交互智能集团有限公司 用于多语言通信排序的系统和方法
CN108475503A (zh) * 2015-10-15 2018-08-31 交互智能集团有限公司 用于多语言通信排序的系统和方法
CN105551488A (zh) * 2015-12-15 2016-05-04 深圳Tcl数字技术有限公司 语音控制方法及系统
US11736860B2 (en) 2016-02-22 2023-08-22 Sonos, Inc. Voice control of a media playback system
US11514898B2 (en) 2016-02-22 2022-11-29 Sonos, Inc. Voice control of a media playback system
US11863593B2 (en) 2016-02-22 2024-01-02 Sonos, Inc. Networked microphone device control
US11750969B2 (en) 2016-02-22 2023-09-05 Sonos, Inc. Default playback device designation
US11832068B2 (en) 2016-02-22 2023-11-28 Sonos, Inc. Music service selection
US11405430B2 (en) 2016-02-22 2022-08-02 Sonos, Inc. Networked microphone device control
US11545169B2 (en) 2016-06-09 2023-01-03 Sonos, Inc. Dynamic player selection for audio signal processing
US11531520B2 (en) 2016-08-05 2022-12-20 Sonos, Inc. Playback device supporting concurrent voice assistants
US11641559B2 (en) 2016-09-27 2023-05-02 Sonos, Inc. Audio playback settings for voice interaction
US11308961B2 (en) 2016-10-19 2022-04-19 Sonos, Inc. Arbitration-based voice recognition
CN110073326A (zh) * 2016-10-19 2019-07-30 搜诺思公司 基于仲裁的语音识别
US11727933B2 (en) 2016-10-19 2023-08-15 Sonos, Inc. Arbitration-based voice recognition
CN107319857A (zh) * 2017-06-30 2017-11-07 广东工业大学 一种交互镜子及具有该交互镜子的智能家电系统
US11900937B2 (en) 2017-08-07 2024-02-13 Sonos, Inc. Wake-word detection suppression
US11500611B2 (en) 2017-09-08 2022-11-15 Sonos, Inc. Dynamic computation of system response volume
US11646045B2 (en) 2017-09-27 2023-05-09 Sonos, Inc. Robust short-time fourier transform acoustic echo cancellation during audio playback
US11769505B2 (en) 2017-09-28 2023-09-26 Sonos, Inc. Echo of tone interferance cancellation using two acoustic echo cancellers
US11538451B2 (en) 2017-09-28 2022-12-27 Sonos, Inc. Multi-channel acoustic echo cancellation
US11893308B2 (en) 2017-09-29 2024-02-06 Sonos, Inc. Media playback system with concurrent voice assistance
US11343614B2 (en) 2018-01-31 2022-05-24 Sonos, Inc. Device designation of playback and network microphone device arrangements
US11689858B2 (en) 2018-01-31 2023-06-27 Sonos, Inc. Device designation of playback and network microphone device arrangements
US11797263B2 (en) 2018-05-10 2023-10-24 Sonos, Inc. Systems and methods for voice-assisted media content selection
US11792590B2 (en) 2018-05-25 2023-10-17 Sonos, Inc. Determining and adapting to changes in microphone performance of playback devices
US11696074B2 (en) 2018-06-28 2023-07-04 Sonos, Inc. Systems and methods for associating playback devices with voice assistant services
US11563842B2 (en) 2018-08-28 2023-01-24 Sonos, Inc. Do not disturb feature for audio notifications
US11482978B2 (en) 2018-08-28 2022-10-25 Sonos, Inc. Audio notifications
US11778259B2 (en) 2018-09-14 2023-10-03 Sonos, Inc. Networked devices, systems and methods for associating playback devices based on sound codes
US11790937B2 (en) 2018-09-21 2023-10-17 Sonos, Inc. Voice detection optimization using sound metadata
US11790911B2 (en) 2018-09-28 2023-10-17 Sonos, Inc. Systems and methods for selective wake word detection using neural network models
US11899519B2 (en) 2018-10-23 2024-02-13 Sonos, Inc. Multiple stage network microphone device with reduced power consumption and processing load
US11741948B2 (en) 2018-11-15 2023-08-29 Sonos Vox France Sas Dilated convolutions and gating for efficient keyword spotting
US11557294B2 (en) 2018-12-07 2023-01-17 Sonos, Inc. Systems and methods of operating media playback systems having multiple voice assistant services
US11538460B2 (en) 2018-12-13 2022-12-27 Sonos, Inc. Networked microphone devices, systems, and methods of localized arbitration
US11540047B2 (en) 2018-12-20 2022-12-27 Sonos, Inc. Optimization of network microphone devices using noise classification
CN110021291A (zh) * 2018-12-26 2019-07-16 阿里巴巴集团控股有限公司 一种语音合成文件的调用方法及装置
CN110021291B (zh) * 2018-12-26 2021-01-29 创新先进技术有限公司 一种语音合成文件的调用方法及装置
US11646023B2 (en) 2019-02-08 2023-05-09 Sonos, Inc. Devices, systems, and methods for distributed voice processing
US11798553B2 (en) 2019-05-03 2023-10-24 Sonos, Inc. Voice assistant persistence across multiple network microphone devices
US11854547B2 (en) 2019-06-12 2023-12-26 Sonos, Inc. Network microphone device with command keyword eventing
US11501773B2 (en) 2019-06-12 2022-11-15 Sonos, Inc. Network microphone device with command keyword conditioning
US11710487B2 (en) 2019-07-31 2023-07-25 Sonos, Inc. Locally distributed keyword detection
US11714600B2 (en) 2019-07-31 2023-08-01 Sonos, Inc. Noise classification for event detection
US11862161B2 (en) 2019-10-22 2024-01-02 Sonos, Inc. VAS toggle based on device orientation
US11869503B2 (en) 2019-12-20 2024-01-09 Sonos, Inc. Offline voice control
CN113035226A (zh) * 2019-12-24 2021-06-25 中兴通讯股份有限公司 语音通话方法、通信终端和计算机可读介质
US11562740B2 (en) 2020-01-07 2023-01-24 Sonos, Inc. Voice verification for media playback
US11482224B2 (en) 2020-05-20 2022-10-25 Sonos, Inc. Command keywords with input detection windowing
US11698771B2 (en) 2020-08-25 2023-07-11 Sonos, Inc. Vocal guidance engines for playback devices

Also Published As

Publication number Publication date
DE69937962D1 (de) 2008-02-21
EP1125277A1 (en) 2001-08-22
WO2000021073A1 (en) 2000-04-13
WO2000021075A1 (en) 2000-04-13
EP1163576A2 (en) 2001-12-19
IL142366A0 (en) 2002-03-10
JP4171585B2 (ja) 2008-10-22
CN1151488C (zh) 2004-05-26
CA2345665C (en) 2011-02-08
IL142366A (en) 2008-11-26
WO2000020962A2 (en) 2000-04-13
JP2002527919A (ja) 2002-08-27
CA2345662A1 (en) 2000-04-13
EP1163576A4 (en) 2005-11-30
IL142363A0 (en) 2002-03-10
JP2003525477A (ja) 2003-08-26
WO2000021232A2 (en) 2000-04-13
WO2000020962A3 (en) 2000-07-13
US20070043574A1 (en) 2007-02-22
KR20010073105A (ko) 2001-07-31
CA2345665A1 (en) 2000-04-13
US7729916B2 (en) 2010-06-01
EP1125279B1 (en) 2008-01-09
US20090313026A1 (en) 2009-12-17
ATE383640T1 (de) 2008-01-15
CN1342290A (zh) 2002-03-27
CN100472500C (zh) 2009-03-25
KR20010075552A (ko) 2001-08-09
KR100431972B1 (ko) 2004-05-22
KR100459299B1 (ko) 2004-12-03
KR20010085878A (ko) 2001-09-07
DE69937962T2 (de) 2008-12-24
WO2000021075A9 (en) 2000-06-08
CN1321295A (zh) 2001-11-07
EP1133734A4 (en) 2005-12-14
EP1125279A1 (en) 2001-08-22
US7137126B1 (en) 2006-11-14
CN1171200C (zh) 2004-10-13
KR100430953B1 (ko) 2004-05-12
EP1125279A4 (en) 2005-02-23
IL142364A0 (en) 2002-03-10
KR20020004931A (ko) 2002-01-16
JP3703082B2 (ja) 2005-10-05
JP2002526855A (ja) 2002-08-20
WO2000021232A3 (en) 2000-11-02
CA2345660C (en) 2006-01-31
CA2345660A1 (en) 2000-04-13
CA2345661A1 (en) 2000-04-13
CN1321296A (zh) 2001-11-07
CN1160700C (zh) 2004-08-04
EP1133734A2 (en) 2001-09-19
US7418382B1 (en) 2008-08-26
EP1125277A4 (en) 2002-02-20
JP2002527800A (ja) 2002-08-27
US8082153B2 (en) 2011-12-20
IL140805A0 (en) 2002-02-10
KR100620826B1 (ko) 2006-09-13

Similar Documents

Publication Publication Date Title
CN1160700C (zh) 提供网络协同会话服务的系统和方法
US9761241B2 (en) System and method for providing network coordinated conversational services
US6970935B1 (en) Conversational networking via transport, coding and control conversational protocols
US20070043868A1 (en) System and method for searching for network-based content in a multi-modal system using spoken keywords
US7382770B2 (en) Multi-modal content and automatic speech recognition in wireless telecommunication systems
US8838457B2 (en) Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility
CN1333385C (zh) 用于通信系统的语音浏览器启用器
US7912725B2 (en) Method and system for providing news information by using three dimensional character for use in wireless communication network
US20090030687A1 (en) Adapting an unstructured language model speech recognition system based on usage
CN106409283A (zh) 基于音频的人机混合交互系统及方法
US7313522B2 (en) Voice synthesis system and method that performs voice synthesis of text data provided by a portable terminal
JP4725512B2 (ja) 音声処理システム、音声処理方法、音声処理サーバ装置、およびプログラム
CN108881507B (zh) 一种包括语音浏览器和区块链语音dns单元的系统
JPH10177469A (ja) 移動端末音声認識/データベース検索/リソースアクセス通信システム
JP2004515859A (ja) インターネット・アクセス用分散型音声認識
CN1427394A (zh) 语音浏览网关
KR20060096654A (ko) 멀티모달 플랫폼을 이용한 모바일 서비스 시스템 및 그 방법
KR20050039826A (ko) 유무선 기반의 음성 인터페이스를 이용한 멀티모달 시스템및 이를 수행하는 방법

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: NUANCE COMMUNICATIONS INC.

Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINES CORPORATION

Effective date: 20100705

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: NEW YORK, THE UNITED STATES TO: MASSACHUSETTS, THE UNITED STATES

TR01 Transfer of patent right

Effective date of registration: 20100705

Address after: Massachusetts, USA

Patentee after: Nuance Communications Inc

Address before: American New York

Patentee before: International Business Machines Corp.

CX01 Expiry of patent term

Granted publication date: 20040804

CX01 Expiry of patent term