WO2016019072A1 - System and method for addressing communication issues for contact center service quality - Google Patents

System and method for addressing communication issues for contact center service quality Download PDF

Info

Publication number
WO2016019072A1
WO2016019072A1 PCT/US2015/042755 US2015042755W WO2016019072A1 WO 2016019072 A1 WO2016019072 A1 WO 2016019072A1 US 2015042755 W US2015042755 W US 2015042755W WO 2016019072 A1 WO2016019072 A1 WO 2016019072A1
Authority
WO
WIPO (PCT)
Prior art keywords
caller
issues
language
agent
call
Prior art date
Application number
PCT/US2015/042755
Other languages
French (fr)
Inventor
Herbert Willi Artur Ristock
Yochai Konig
Vyacheslav Zhakov
Vyacheslav Sayko
Eric TAMBLYN
Original Assignee
Genesys Telecommunications Laboratories, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US14/445,629 external-priority patent/US20160036972A1/en
Priority claimed from US14/445,646 external-priority patent/US20160036980A1/en
Application filed by Genesys Telecommunications Laboratories, Inc. filed Critical Genesys Telecommunications Laboratories, Inc.
Publication of WO2016019072A1 publication Critical patent/WO2016019072A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5175Call or contact centers supervision arrangements

Definitions

  • Contact centers may be used by an organization to communicate in an efficient and systematic manner with outside parties. Such contact centers may for example have large numbers of agents staffing telephones and interacting with outside parties and with each other, The contact centers can include an interactive voice response (1 VR) system to handle calls, record messages and/or place calls with agents at the contact center.
  • 1 VR interactive voice response
  • Figure 1 is a schematic block diagram of an exemplary system supporting a contact center.
  • Figure 2 is a block diagram of an exemplary system associated with the recognition server for capturing and analyzing data.
  • Figure 3 is a table illustrating an exemplary relation, between MOS values, R- vaiues and user satisfaction.
  • Figure 4 is a block diagram of exemplary categorization of customer calls, e.g., based on determined phrases.
  • Figure 5 Is a flow chart of an exemplary logic of the system to determine, analyze and address hard-to-understand communications, e.g.. In the context of a contact center.
  • Figures 6 is a block d iagra m, of an exemplary computing device.
  • Figures 7 is a block diagram of an exemplary computing device.
  • Figures 8 is a block diagram of an exemplary computing device
  • Figures 9 is a block diagram of an exemplary computing device.
  • Figure 10 is a block diagram of an exemplary network environment including several computing devices.
  • Systems and methods can provide for determinin g and adjusting to hard-to- understand sessions, e.g., for improving service quality in the contact center setting, in one example, a phone conversation with bad transmission quality can create stress to a customer and/or agent since the brain tries to fil l the missing gaps, even if the customer and. agent do not realize It This can lead to mental exhaustion and a negative emotion.
  • the systems and methods can also identify a helpfulness of the agent to the customer such as an agent understands the customer's issue clearly and adjusts accordingly.
  • the systems and methods .address different scenarios in the context of hard-to-tmderstand, e.g., media related issues regarding voice and text quality, and content related scenarios.
  • the communication peers may consciously or unconsciously perceive that there are issues. For example, even when the callers can filter out distractive noise and fill in gaps without callers noticing, their exhaustive brain's work can gradually make them- unhappy and result in bad experience.
  • the systems and methods can determine and address explicitly perceived communication issues, e.g.. low experience scores and/or explicit negative phrases or words used during the conversation.
  • the systems and methods can also determine and address unconsciously perceived issues, e.g., by monitoring a quality of the communication lines, monitoring phrases used during a conversation, etc.
  • Figure 1 is a schematic block diagram of an exemplary system, e.g., a system supporting a contact center.
  • the system can be configured to distribute information and task, assignments related to interactions with end users (also referred to as customers), to employees of an enterprise, e.g.. customer care agents. These task assignments are also referred to herein, as work, items.
  • the contact center may be an in-house facility of the enterprise and may serve the enterprise in performing the functions of sales and service relative to the products and. services available through the enterprise. In another exemplary embodiment, the contact center may be a third-party service provider.
  • a quality of the communication lines between the customers and the agents can be poor, even if imperceptibly so, A risk of a low quality voice connection for borne agents may be even higher than for agents that work from the contact center. Additionally, knowledge workers who are experts in the enterprise but not full-time agents may answer customer calls with their mobile phone, which can exhibit poorer quality than landlines for example.
  • the contact center infrastructure- may be hosted in equipment dedicated to the enterprise or third-parry service provider, and/or hosted in a. remote computing environment such as, for example, a private or public cloud environment with infrastructure for supporting multiple contact centers for multiple enterprises.
  • the contact center can include resources (e.g. personnel, computers, and telecommunication equipment) to enable delivery of services via telephone or other communication mechanisms. Such services may vary depending on the type of contact center, and may range from customer service to help desk, emergency response, telemarketing, order taking, and the like. These are some exemplary contexts for the hard-to-understand sessions,
  • the end user devices 10 may be a communication device, for example, a telephone, wireless phone, smart phone, personal computer, electronic tablet, and/or the like.
  • the mechanisms of contact, and the corresponding user devices 10, need not be limited to real-time voice communications as in a traditional telephone call, but may be non-voice communications including text, video, and the like, and may include email or other non «real «time means of communication.
  • This generalized form of a contact between an end user and the contact center may include methods of communication other than, voice, and an endpoint other than a telephone, e.g. interactions.
  • Inbound and outbound interactions from and to the end user devices 10 may traverse a telephone, cellular, and/or data communication network 14 depending on the type of device that is being used.
  • the communications network 14 may include a private or public switched telephone network (PSTN), local area, network (LAN), private wide area network (WAN), and/or public wide area network such as, for example, the internet
  • PSTN public switched telephone network
  • LAN local area
  • WAN private wide area network
  • the communications network 14 may also include a wireless carrier network including a code division multiple access (CDMA) network, global system for mobile communications (GSM) network, and/or any 3G, 4G, LTE, etc. network.
  • CDMA code division multiple access
  • GSM global system for mobile communications
  • the contact center can also include an outbound contact server 54 to perform outbound functions, e.g., in which contact center agents make outbound calls to customers on behalf of a business or client. Calls made from the contact center can include telemarketing, sales or fund-raising calls, as well as calls for contact list updating, surveys or verification services.
  • the systems and methods described herein can be used for both, inbound and outbound communications, e.g., to determine if hard-to-understand. conditions of the communications exist,
  • the contact center includes a switch/media gateway 12 coupled to the communications network 14 for receiving and transmitting interactions and/or data between end users and the contact center.
  • the switch/media gateway 1 2 may include a. telephony switch configured to function as a central switch for agent level routing within the center, in this regard, the switch/media gateway 12 may include an automatic interaction distributor, a private branch exchange (PBX), an IP-based software switch, and/or any other switch configured to receive Iniernet-soureed interactions and/or telephone network-sourced interactions.
  • PBX private branch exchange
  • IP-based software switch an IP-based software switch
  • the switch can be coupled to a call server 18 which may, for example, serve as an adaptor or interface between the switch/media gateway 12 and the remainder of the routing, monitoring, and other interaction-handling systems of the contact center.
  • the call server 18 can connect to other elements, e.g., described herein, via a communication/message bits S 3.
  • the contact center may also include a multimedia/social media server 24 connected with the communication/message bus 13.
  • the multimedia/social media server 24 may also be referred to as an mteraction server, for engaging in media interactions other than voice interactions with the end user devices 10 and/or web servers 32.
  • the media interactions may be related, for example, to email, chat, text-messaging, web, social .media, and the like.
  • the web servers 32 may include, tor example, social interaction site hosts for a variety of known social interaction sites to which an end user mav subscribe, such as. for example, FACEBOOK. TM , TWITTERTM, and the like.
  • the web servers may also provide web pages for the enterprise that is being supported by the contact, center. End users may browse the web pages and get information about the enterprise's products and services.
  • the web pages may .also provide a mechanism for contacting the contact center, via, for example, web chat, voice call, email, web real time communication (WebRTC), or the like.
  • the switch can be coupled to an interactive voice response (IVR) server 34.
  • IVR server 34 is configured, for example, with an I VR script for querying customers oh their needs. For example, a contact center for a bank may tell callers, via the I VR script, to "press I " if they wish to get an account balance. If this is the case, through continued interaction with the IVR, customers may complete service without needing to speak with an agent.
  • the interaction is forwarded to the call server 18 which interacts with a routing server, referred to as a Universal Routing Server (URS) 20. for finding the most appropriate agent for processing the interaction. Additionally or alternatively, the URS 20 can handle routing, orchestration and conversation management among other things.
  • the call server 18 may be configured to process PSTN calls, VoIP calls, and the like.
  • the call server 18 may include a session initiation protocol (SIP) server for processing SIP calls.
  • the call server 18 may include a telephony server (T-server).
  • the call server may place the interaction in an mteraction queue.
  • the interaction queue may be implemented via any data structure, such as. for example, a linked list, array, and/or the like.
  • the data structure may be maintained, for example, in buffer memory provided by the call server 1 8.
  • the call is removed from the call queue and transferred to the corresponding agent device 38a-38b.
  • Collected information about the caller and/or the caller's historical information may also be provided to the agent device for aiding the agent in better servicing the call.
  • the information may also be provided to a stakeholder device 38c for monitoring and training purposes,
  • a stakeholder may be a contact center manager or a supervisor of one or more agents.
  • Stakeholders need not be contact center employees; a product manager employed by the same enterprise, or by another enterprise supported by the contact center, may for example be a stakeholder.
  • the agent/stakeholder device 38a-c may include a telephone adapted for regular telephone calls, VoIP calls, and the like.
  • the agent and stakeholder devices 38a-c may also include a computer for communicating with one or more servers of the contact center and performing data processing associated with contact center operations.
  • the selection of an appropriate agent for routing an inbound interaction may be based, for example . , on a routing strategy employed by the routing server 20, and further based on information about agent availability, skills, agent location, and other routing parameters provided, for example, by a statistics (stat) server 22.
  • the siat server 22 may accumulate data about places, agents, and place/agent, groups, convert the data, into .statistically useful information, and pass the calculations to other software applications.
  • the siat server 22 may provide information to the .routing server about agents' capabilities in terms of interactions they are handling, the media type of an interaction, and so on.
  • An exemplary routing strategy employed by the routing server 20 may be that if a particular agent agent group, or department is requested, the interaction is routed to the requested agent, agent group, or department as soon the requested entity becomes available, if a particular agent has not been requested, the interaction may be routed to agents with the requested skill as soon as those agents become available, if a particular agent group or department has not been requested, the interaction is removed from the routing server queue and routed to an agent group or department handling back-office work.
  • the interaction may be routed directly to agents tor immediate processing in some instances.
  • the interaction may be placed into a queue, or for deferred media, the interaction may be placed in a workbin 26a- c, etc. associated with a back-office agent group or department.
  • the workbin 26a-c can include various types of workbins, including a personal agent level workbin, an agent, group workbin, an administrative workbin, etc. in this regard, the routing server 20 may be enhanced with functionality for managing back-oftice/offline activities that are assigned to enterprise employees. Such activities may include, for example, responding to emails and letters, attending training seminars, or perforating any other activity (whether related to the contact center or not) that does not entail synchronous, real-time communication w ith end users, for example, a non-contact center activity that may be routed to a knowledge worker may be to fill out forms for the enterprise, process claims, and the like.
  • the work item may appear in the agent's workbin 26a-26b (collectively referenced as 26) as a work item to be completed by the agent or the work item may be immediately processed by the agent, e.g., similar to voice calls.
  • the agent's workbin may be implemented via any data structure, such as, for example, a linked list, array, and/or the like.
  • the workbin may be maintained, for example, in buffer memory of each agent's computer device and/or maintained on a server to allow for work, item reassignments to other agents.
  • a stakeholder device 38c may also have an associated workbin 26c storing work items for which the stakeholder is responsible.
  • Work items may be assigned to various targets, including, as described above, agents and stakeholders, including other persons associated with an enterprise, and including non-human targets sack as a servers or computing devices.
  • targets including, as described above, agents and stakeholders, including other persons associated with an enterprise, and including non-human targets sack as a servers or computing devices.
  • the assignment of a work item to a target may ha ve the effect, of activating a particular email, or a voice response announcing, "You are complaining about a slow internet connection. We are experiencing a problem in your area and are working to resolve it.”
  • the multimedia/social media server 24 may also be configured to provide, to an end user, a mobile application for downloading onto the end user device 10.
  • the mobile application may provide user configurable settings that indicate, for example, whether the user is available, not available, or availability is unknown, for purposes of being contac ted by a contact center agent.
  • the multimedia/social media server 24 may also monitor the status settings.
  • the contact center may also include a reporting server 28 configured to generate reports from data aggregated by the siat server 22.
  • Other sources for reporting include an interaction concentrator (ICON) collecting atomic events from various media servers and composing call detail record (CDR) type records. These data are read by an extract, transform and load Ell. tool 220 of the mining system 60, and into a consolidated data source for business analytics and data-mining, e.g., the Ge.nesys Info Mart (GIM) by Oenesys Telecommunications Laboratories, Inc., which serves a business intelligence (Bi) application.
  • Such reports may include near real-time reports or historical reports concerning the state of resources, such as, for example, average waiting time, abandonment rate, agent occupancy, and the like.
  • the reports may be generated automatically or in response to specific requests from a requestor, e.g. agent/stakeholder, contact center application, and/or the like.
  • An interaction analytics server 46 may be used to monitor the interactions in the contact center and analyze all or some of them to identify or quanti fy certain characteristics of the interaction. These characteristics may include topics, sentiment, satisfaction, or business outcome.
  • An intelligent workload distribution server (iWD server) may be used to create work items; the iWD server may employ a rules system (GRS) 44, which may be a separate entity, or which may be an element of the iWD server.
  • GRS rules system
  • a work item may be more effective than, e.g., an email request, in that the system may assign a due date, monitor progress, and escalate the work item to a supervisor if ft is not completed.
  • the i WD server may prioritize a work item and specify characteristics, such as particular skills, needed, to handle the work item.
  • the work item may then be sent to another server, such as a routing server 20, which, using information provided by a stat server 22, may identify a particular agent with the specified characteristics, e.g., qualified to handle the work item, and assign the work item to that agent.
  • a routing server 20 which, using information provided by a stat server 22, may identify a particular agent with the specified characteristics, e.g., qualified to handle the work item, and assign the work item to that agent.
  • the GRS 44 can also ' be used by other services, e.g., orchestration or ITS multi -media (reprioritization).
  • the interaction analytics server 46 can also use rules (GRS) directly rather than through the I WD server.
  • the interaction analytics server 46 can trigger actions, such as notifying agents, supervisors and customers, and can perform speech analytics and actionable sentiment analysis, e.g., for determining hard-to-understand communications. Findings can be stored in a universal contact server (UCS) 50 for follow-up analysis, e.g. correlation with survey.
  • the contact center can also include a mass storage device 30 for storing data related to contact center operations such as, for example, information related to agents, customers, customer interactions, and the like.
  • the mass storage device may take the form of a hard disk or disk, array.
  • the various servers in the contact center may he a process or thread, running on one or more processors, in one or more computing devices 600 (e.g., Figure 6. Figure 7), executing computer program instructions and interacting with other system components for performing the various functionalities described herein.
  • the computer program instructions are stored in a memory which may he implemented in a computing device using a standard memory device, such as, for example, a random access memory (RAM).
  • the computer program instructions may also be stored in other non-transitory computer readable media such as, for example, a CD-ROM, flash drive, or the like.
  • a computing device may be implemented via firmware (e.g. an application-specific integrated circuit), hardware, or a. combination of software, firmware, and hardware.
  • a server may be a software module, which may also simply be referred to as a module.
  • the set of modules in the contact center may include servers, and other modules.
  • Other contact center elements that can be used for determining, analyzing and addressing hard-lo-understand communication conditions, e.g., in the contact center or other environment, include a workforce management server 52, a quality of service monitor 56, survey feedback services 58, hard-to-understand assessment server 59 and a recognition server 60.
  • Figure 2 is a block diagram of an exemplary system associated with the recognition server 60, e.g., for capturing and analyzing audio and metadata, e.g., to determine hard-to-understand sessions.
  • the data raining system 60 can provide for real-time detection of hard-to-understand, e.g., for being able to trigger corrective actions, and/or for non-real time scenarios, e.g., checking whether negative survey responses correlate with hard-to- understand sessions.
  • the example is a customer interacting with a contact center via a telephone, but other implementations may use the systems and methods.
  • a recording system 210 can record interactions between the customer and the contact center, including live voice calls, voicemails, email texts, scanned copies of letters, etc.
  • the ETL tool 220 can extract varying types of call and .other interaction data from the recording system 210, prepare files and corresponding metadata for processing, and load the files for storage in an input folder. The results can be uniformly stored as an audio file and an xml file,
  • a feicher task 230 can move the audio files from the input, folder to a store folder 240 and write the metadata to a database server 250,
  • One or more recognition servers 60 includes a recognizer task to -read audio files from the store folder 240 and create a compressed version of the audio file in the store folder 240,
  • the recognition servers 60 can identify data thai indicates that customer is having or had a poor response to the interaction with the contact, center.
  • the contact center notices a customer's poor .response to the interaction, with the contact center, in one instance the poor response can imply thai the agent is doing a bad job or the IVR script is composed poorly.
  • another potential root cause of the poor response can be a bad connection between the contact center and. customer, the connection including high jitter, packet loss, latency, etc.
  • a network probe system including an interface to the hard-to- understand assessment server 59 can be used to detect and measure the bad. connections.
  • the bad connection may cause the customer to have a hard time communicating with the contact, center, and vice versa, thereby making the experience more stressful and less enjoyable. Communication issues can occur when the customer interacts with the IVR 34, a live agent, etc. Even poor music quality while the customer is on hold may affect the customer's experience with the contact center.
  • the customer experience data can be determined by the recognition servers 60,
  • the recognition server's recognizer task can write recognition results and a categorizer task, can write category results to the to the database server 250.
  • a computer 270 can make updates or changes to the ' recognition and category results.
  • An index task writer can write recognition results, category results and metadata to an index folder 280 on a network server 285, e.g., web server.
  • a computer 290 of the contact center agent can access search, reports* dashboards, etc., to view customer experience data.
  • a contact center agent can access the data via the computer 290. Changes to the contact center personal, equipment, networks, etc. can be made in response to the customer experience data, e.g., pre-connecfion with the agent, during the call with the agent and/or after the call.
  • the system cars suggest to the customer to change his location, for a poor mean opinion score (MOS) the system can suggest to customer to call again using different phone, the system can ask the customer if he prefers a call back at a specified time and/or suggest to the customer to use non-voice self-help option.
  • MOS mean opinion score
  • the system can suggest switching to a text chat communication, suggest scheduling another call and/or co-browse options since speech analytics performance can improve when visual information is added to the conversation.
  • Adding video e.g., to voice can also help address hard-to-understand scenarios which are not caused by poor network connection, but, e.g. pronunciation.
  • the video can also help in case of background noise.
  • the system can alert a supervisor to join the call if available and/or for a severe dissatisfaction level the system can suggest transferring the call to another agent.
  • the system can ask if the customer prefer to switch to an agent speaking a different language, and/or different education level of language, if available and/or via a pop-up message to the agent (Agent Assist), instruct the agent, to repeat, the important facts slowly and clearly and make sure that the customer understands them.
  • Agent Assist the actions can be initiated automatically by the system.
  • Post-call for agent issues, e.g., including technical QoS and content related issues, if an agent has several hard-to-understand call sessions above a certain threshold then the communication channels can be checked, a coaching/training session can be scheduled and/or the agent pulled off the calls. Agents and/or customers can be rated based on. the
  • the system can consider other measures of a quality of the communication, e.g., the hard-to-understand condition can be also checked and taken into account when triggering follow-up actions regarding net promoter scores (MPS), For example, after the call the customer can be asked how likely it is that they would recommend the company to a friend or colleague to determine the NFS. For severe agent communication issues, the system can follow-up with the customer via out bound message and suggest another call with supervisor or highly skilled agent, or the system can automatically make that call. Calls with low average MOS score indicating a poor telecommunication system performance during the call may not be utilized against the agent during quality management processes.
  • Another agent characteristic is the agent's ability to adapt to the customer questions.
  • One measure of the ability to adapt is the richness of language used by the agent.
  • One measure of the richness is perplexity which is based on established information theoretic principles and measures the difficult)' of the task.
  • the perplexity of the agent speech can correlate with less predictability and less scripted conversation. Therefore, the language skill levels of agents can be considered.
  • the customers' language skill levels can also be assessed because if a customer cannot fully understand the agent the same effect of hard-to-understand may occur.
  • Voice recognition can help to determine a customers' language skills, in one example, customers with poor language skills can be connected to agents with cleaner pronunciation.
  • the agent can be coached to be more flexible, f or customer communication issues, the system can follow with outbound message to memorialize the call details in writing.
  • the call interaction data can be used to detect customers' emotions and communication, issues and corrective actions can. be triggered by the contact center agent or automatically by the systems and method, or both. Both the customer .and the contact center agent can be exposed to the same communication conditions, e.g., a poor quality connections.
  • the computer 290 in one example agent devices 38a-3Sh or admin device 38c. can display the conditions to the agent.
  • the computer 290 can display the MOS value of the quality of the network because the agent may not consciously notice the noise on the network,
  • the connection can be terminated and redialed based on the MOS value and possible other factors, e.g., taking into account the applied coder/decoders
  • codecs codecs
  • the customer can determine to adjust their communication mode if they are made aware of the situation. For background, channel noise or language Issues, the call can be switched to chat or video, etc., as described.
  • the systems and methods can provide suggestions in contexts other than voice calls. For example, in the con text of chat and tex ts, the system can suggest a call or video call when a hard-to-understand session is detected.
  • figure 3 is a table 300 illustrating an exemplary relation between R-values (transmission rating factor) 302, MOS values 304.
  • OoB percentage good of better
  • PoW percentage poor or worse
  • user satisfaction 310 e.g., based on a G J 07 international telecommunication union (ITU) scale.
  • the hard-to-understand assessment server 59 can extract features and measurements from either a sell-help call, e.g., with IVR 34, or with a customer-agent call.
  • the MOS value 304 includes an overall noise estimation, e.g., a measurement of the overall noise level of the call.
  • a call with MOS value below 3.6 can be considered a hard-to-understand session, whether explicitly identified by the caller as such or not.
  • Multiple levels of severity of user dissatisfaction can exist, e.g., 3.1 that many users are dissatisfied.,, and 2,58 that nearly all users are dissatisfied.
  • the background noise can be estimated separately from the overall noise represented by the MGS value 304. For example, an application installed on a mobile phone can estimate the background noise during pauses in a conversation and broadcast the estimated noise back to the hard-to-understand assessment server 59 or other location.
  • the customer experience with the IVR 35 or Eve agent can be adversely affected, even if the customer if not conscious of the background noise.
  • Background noise can include traffic noise, street noise, airport noise, babies crying, dogs barking, and other noises in the environment During a call with the agent or even pre-call when the customer is interacting with the IVR 34, the system can prompt the customer to move away from the background noise. Additionally or alternatively, if a problem with background noise is detected during the call, the call can be switched to chat, video, etc. to help reduce the effects of hard»toiraderstand sessions due to the background noise,
  • Figure 4 is a block diagram of exemplary categorization of customer calls, e.g., based on determined phrases.
  • the customer may verbalize communication issues with the I VR 34 or agent Speech analytics can infer if the customer is complaining about communication issues, e.g., by looking for spoken phrases.
  • the phrases can be categorized into topics 410, e.g., by communication, language, repeat requests, etc.
  • the categories can be determined as union of mapped phrases 420. For example, if the caller states "I can't hear anything'* the call can be classified as a communication issue.
  • a call can be classified as a repeat requests if the customer utters phrases such as *Can you repeat it please?" or "I need you to say it again".
  • the agent can express his inability to understand the customer speech. Additionally or alternatively, the system, can determine a helpfulness or lack of helpfulness of the agent using speech analytics with regard to whether or not the agent understands the customer's issue clearly and/or has the experience level to be able to address the issue.
  • a speech analytics system can be used to perform phrase recognition to detect such phrases in a phone conversation.
  • An exemplary speech recognition system is described, in U.S. Patent No. 7,487,094 B l , "System and Method of Call Classification, with Context Modeling based on Composite Words", Konig et.aS.
  • ASR Automatic Speech Recognition
  • l.VCSR Large Vocabulary Continuous Speech Recognition
  • speech-to-text speech-to-text
  • the average con fidence can be served, as a measure of understands bility of the spoken words i n. the conversation.
  • the measure can be computed for the agent side and for customer side separately.
  • Figure 5 is a flow chart of an exemplary logic 500 of the system to determine, analyze and address hard-to-understand sessions, e.g.. in the context of a contact center, While audio communication which is hard-to-understand can negatively impact the customer experience during a contact center call and lead to dissatisfaction, e.g., customer's bad rating In a survey or frustration observed during call monitoring/recording, the service itself might have been actually good. A conclusion from customer's negative feedback need not indicate that the agent did not do a good job and needs training on the subject, needs transferring to a different job. needs his proficiency downgraded, etc.
  • the customer's dissatisfaction may have been mainiv caused bv hard-to-understand conditions which can be addressed differently.
  • the system can inform both the customer and the agent, because they might not be aware about it.
  • Other implementations include the system notifying only the agent, who might notify customer, the system notifying only the customer, e.g. during SYR call, etc. fetting the customer and/or agent know about the hard-to-understand conditions can help to improve the situation and trigger real time corrective actions.
  • the following hard-to-imderstand situations can be considered: poor audio transmission, quality, e.g., low MOS, .language barrier, and/or an. agent's ability to understand the customer's issue clearly.
  • a language barrier can include a customer's low language proficiency, gender preference, partial hearing disability, and/or a customer, agent or IVR's low language proficiency, ability to pronounce clearly, proficiency with foreign names, and dialect, e.g. using uncommon expressions.
  • a contact center agents ' language proficiency can be taken into consideration, for example through corresponding skill level assignment and incorporation in call routing strategies.
  • the audio transmission quality e.g., due to background noise and/or channel noise such as high jitter, packet loss, latency, etc.. can be detected (502). Poor audio transmission quality can create stress at the listening party, either consciously or
  • Language related aspects can be captured and rated, e.g., proficiency, level, dialect, etc., in customer's profile (504).
  • the customer is associated, with one or several, languages, and when receiving a call from the customer the appropriate language is selected for IVR self-service, and for assisted service the call is routed, to an. agent with required language skills. There may be still a language-related mismatch, which can have similar results as in case of poor audio transmission.
  • the MOS of the audio connection is determined (506).
  • Parameters for determining the MOS include codec -related
  • One or more MOS thresholds can be determined as:
  • TI is about 4,03
  • T2 is about 3.6
  • T3 is about 3. 1.
  • Other values can be used.
  • the thresholds can be iteratively adjusted based on actual experiences during contact center operation. Thresholds can be also determined based, on the company that provides the service or product, because some companies can tolerate low MOS more than others. For example, the service level transfer level, escalation level information can be considered, .for the particular company and/or as compared to benchmark data for a group of companies. If the company is performing better than peers it may want to have more tolerance, or if performing worse than peers the company may want to have less tolerance.
  • MOS value of a given is call can be recorded as part of call metadata, e.g., metadata described with figure 2, and can. be utilized during post processing.
  • a similar system logic can be applied when there are language related issues and/or agent helpfulness issues that prevent customers to interact conveniently with the contact center, both with S. VR and live agents.
  • real time call recording and analysis can be used to determine potential issues.
  • the customer may ask the agent frequently to repeat something, potentially also asking the agent to say it differently.
  • the agent may also have problems in understanding the customer.
  • the language matching level (L.ML) can be quantified and captured to be added to the call metadata (508).
  • the LML value can be based on information and -measures of language matching and proficiency.
  • the LML and/or MOS values can be used during ongoing live call, for example a warning displayed to agent for either adjustment or suggested/automatic transfer to better matching agent, and during post processing, e.g. when assessing survey results (510).
  • the system logic can be used to capture details on the customer's language skills and preferences.
  • the information can be used in routing of a customer's future calls, e.g. selecting an agent with customer's preferred dialect, or an agent with very
  • the LML information can be used also for contact center planning, e.g. training or hiring agents to better match customers' language specifics.
  • a technical implementation for MOS can include analyzing real-time transport protocol RTF streams for packet loss and latency, taking the codec into account and calculating the MOS.
  • LML the real time speech analysis can be integrated in order to measure requests to repeat something, e.g., by customer or agent, misunderstandings, i f either party continues conversation in a way that contradicts with what has been actual ly said, etc.
  • the LML value can be based on a determined scale and used to compose a customer's language profile, which can be taken into account for future call routing and IVR applications selecting.
  • the system can maintain different !VR scripts on the same subject for different customer language profiles, even for a same base language such as English.
  • Other examples include maintaining different IVR scripts with more or less sensitivity to poor voice connection, e.g., based on the actual content (words) and/or intonation (including
  • Additional interdependences captured as metadata can include calf duration, e.g., exhaustion and. stress are higher for long duration calls, and whether or not the customer and agent have been informed about detected hard-to-imderstand condition already during the call. If a customer accepts the invitation for answering to the NFS or other survey, the hard-to-understand condition for the customer's call can be factored, if the given call suffered from poor MOS, long duration, etc. then this can be displayed as additional information, to the customer.
  • intelligent quality of service (QoS) alerting can be distributed among the contact center systems.
  • the customers can be offered new channels if the dialog is detected as being poor.
  • Agent scripting can be controlled dynamically based on the detection of negative customer experience or if the system detects a compliance risk. The ease of dynamic scripting allows speech analytics to trigger new scripting for the agent as the system detects missing contest or negative customer sentiment.
  • a customer calls a contact center and when interacting with the IVR 34 the system detects a low MOS of the telecommunication connection.
  • the IVR 34 can prompt the customer to use another phone.
  • the system detects a language issue, e.g., detects the phrase "I don't understand your English" and suggest or automatically switches the customer to an agent in the U.S.
  • a native speaking agent is not qualified to helpfully address the customer's issue, so the customer is transferred to a supervisor.
  • the supervisor understands the customer's Issue and is able to help the customer resolve it.
  • the adjustments from one call to the next can occur automatically andtor by the system making suggestions to the customer.
  • Post call since the MOS can he correlated with survey results, e.g., NFS results, if the customer gave a poor service rating and there was low MOS then the system can consider the low MOS to be a cause of unfavorable NFS results (512).
  • a result list can be generated, correlated to hard-to-un derstand scenarios, and acted on based on the low MOS calls, e.g., signifying those calls may be less relevant for determining agent performance, addressing poor connections, calling the customer back to follow up with them, etc.
  • the MOS related issues may not be counted against the contact center agent for agent review purposes.
  • Figures 6-10 are non-limiting examples of elements that can be used to execute the above description.
  • Figure 6 and Figure ? depict block diagrams of an exemplary computing device 609 as may be deployed with the systems and methods described herein, in Figure 6 and Figure 7, the computing devices 600 can include a central processing unit 621 , and a main memory unit 622.
  • the computing devices 600 may include a storage device 628, a removable media interface 616, a network interface 618, an
  • I/O controller 623 input/output controller 623, one or more display devices 630c, a keyboard 630a and a. pointing device 630b, such as a mouse.
  • the storage device 628 may include, without limitation, storage for an operating system and software.
  • the computing devices 600 may also include additional optional elements, such as a memory-' pott 603, a bridge 670, one or more additional input/output devices 630d, 630e and a cache memory 640 in communication with the central processing unit 621 .
  • Input/output devices, e.g., 630a, 630b, 630d, and 630e may be referred to herein using reference numeral 630,
  • the central processing unit 621 is any logic circuitry that responds to and processes instructions fetched from the main memory unit 622. It may be implemented, for example, in an integrated circuit, in the form of a microprocessor, microcontroller, or graphics processing unit (GPU), or in a field-programmable gate array (PPGA) or application-specific integrated circuit (ASIC).
  • Main memory unit 622 may be one or more memory chips capable of storing data and allowing any storage location to be directly accessed by the central processing unit 621 , In the embodiment shown in Figure 6, the central processing unit 621 communicates with main memory 622 via a system bus 650.
  • Figure 7 depicts an embodiment of a computing device 600 in which the central processing unit 621 communicates directly with main memory 622 via a memory port 603, [0070]
  • Figure 7 depicts an embodiment in which the central processing unit 621 communicates directly with cache memory 640 via a secondary bus, sometimes referred to as a backside bus.
  • the central, processing unit 621 communicates with cache memory 640 using the system bus 650
  • Cache memory 640 typically has a faster response time than main memory 622.
  • the central processing unit 621 communicates with various I/O devices 630 via a local system bus 650.
  • Various buses may be used as a local system bus 650.
  • VESA Video Electronics Standards Association
  • VLB Video Electronics Standards Association
  • ISA industry Standard Architecture
  • EISA Extended industry Standard Architecture
  • MCA MicroChannel Architecture
  • PCI Peripheral Component Interconnect
  • PCi-X PCI Extended
  • PCI-Bxpress PCI-Bxpress bus
  • NuBus NuBus
  • the central processing unit 621 may communicate with the display device 630c through an Advanced Graphics Port (AGP).
  • AGP Advanced Graphics Port
  • Figure 7 depicts an embodiment of a computer 600 in which the central processing unit 621 communicates directly with I/O device 630e.
  • Figure 7 also depicts an embodiment in which local busses and direct communication are mixed: the central processing unit 621 communicates with I/O device 63 Od using a local system bus 630 while communicating with I/O device 630e directly.
  • I/O devices 630 may be present in the computing device 600, Input devices include one or more keyboards 630a, mice, trackpads, trackballs, microphones, and drawing tablets. Output devices include video display devices 630c, speakers, and printers.
  • An I/O controller 623 in Figure ( >, may control the I/O devices. The I/O controller may control one or more I/O devices such as a keyboard 630a and a pointing device 630b, e.g., a mouse or optical pen,
  • the computing device 600 may support one or more removable media interfaces 616, such as a floppy disk drive, a CD-ROM drive, a DVD-ROM drive, tape drives oS various formats, a USB port, a. Secure Digital or COMPACT FLASH " memory card port, or any other device suitable for reading data from read-only media, or for reading data from, or writing data to, read-write media.
  • An I/O device 630 may be a bridge between the system bus 650 and a removable media interface 616.
  • the removable media interface 616 may for example be used for installing software and programs.
  • the computing device 600 may further comprise a storage device 628, such as one or more hard disk drives or hard disk drive arrays, for storing an operating system, and other related software, and for storing application software programs.
  • a removable media interface 616 may also be used as the storage device.
  • the operating system and the software may be run f rom a bootable medium, for example, a bootable CO.
  • the computing device 600 may comprise or be connected to multiple display devices 630c, which each may be of the same or different type and/or form.
  • any of the I/O devices 630 and/or the I/O controller 623 may comprise any type and/or form of suitable hardware, software, or combination of hardware and software to support, enable or provide for the connection to, and use of, multiple display devices 630c by the computing device 600
  • the computing device 600 may include any type and/or form Of video adapter, video card, driver, and/or library to interface, communicate, connect or otherwise use the display devices 630c
  • a video adapter may comprise multiple connectors to interface to multiple display devices 630c
  • the computing device 600 may include multiple video adapters, with each video adapter connected to one or more of the display devices 630c .
  • any portion of the operating system of the computing de v ice 600 may be configured for using multiple display devices 630c.
  • one or more of the display devices 630c may be provided by one or more other computing devices, connected, for example, to the computing device 600 via a network.
  • These embodiments may include any type of software designed and constructed to use the display device of another computing .device as a second display device 630c for the computing device 600.
  • a computing device 600 may be configured to have multiple display devices 630c.
  • a computing device 600 of the sort depicted in Figure 6 and Figure 7 may operate under the control of an operating system, which controls scheduling of tasks and access to system resources,
  • the computing device 600 may be running any operating system, any embedded operating system, any real-time operating system, any open source operating system, any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of running on the computing device and performing the operations described herein.
  • the computing device 600 may be any workstation, desktop computer, laptop or notebook computer, server machine, handheld computer, mobile telephone or other portable telecommunication device, media playing device, gaming system, mobile computing device, or any other type and/or form of computing, telecommunications or media device that is capable of communication and that has sufficient processor power and memory capacity to perform the operations described herein, in some embodiments, the computing dev ice 600 may have different processors, operating systems, and input devices consistent with the device,
  • the computing device 600 is a mobile device, such as a .lava-enabled cellular telephone or personal digital assistant (PDA), a smart phone, a digital audio player, or a portable media player.
  • the computing device 600 comprises a combination of devices, such as a mobile phone combined with a digital audio player or portable media player,
  • the central processing unit 621 may comprise multiple processors P I,
  • the computing device 600 may comprise a parallel processor with one or more cores.
  • the computing device 600 is a shared memory parallel device, with multiple processors and/or multiple processor cores, accessing all available memory as a single global address space, in another of these embodiments, the computing device 600 is a distributed memory parallel device with multiple processors each accessing local memory only. In still another of these embodiments, the computing device 600 has both some memory which is shared and some memory which may only be accessed by particular processors or subsets of processors.
  • the central processing unit 62 S comprises a multicore microprocessor, which combines. two or more independent processors into a single package, e.g., into a single integrated circuit (iC).
  • the computing device 600 includes at least one central processing unit 62 ! and at least one graphics processing unit 621'.
  • a central processing unit 62.1 provides single instruction, multiple data (SMD) .functionality, e.g., execution of a single instruction simultaneously on multiple pieces of data.
  • SMD single instruction, multiple data
  • several processors in the central processing unit 621 may provide functionality for execution of multiple instructions simultaneously on multiple pieces of data (MSMD).
  • MSMD multiple pieces of data
  • the central processing unit 62 1 may use any combination of SlMD and MIMD cores in a single device.
  • A. computing device may be one of a plurality of machines connected by a network, or It may comprise a plurality of machines so connected.
  • Figure 10 shows an exemplary network environment.
  • the network environment comprises one or more local machines 602a, 602b (also generally referred to as local machine(s) 602. client(s) 602, client node(s) 602, client machine(s) 602, client computers) 602, client device(s) 602, endpoint(s) 602, or endpoint node(s) 602) in commtmication with one or more remote machines 606a, 606b, 606c (also generally referred to as server maehine(s) 606 or remote maehine(s) 606) via one or more networks 604.
  • a local machine 602 has the capacity to function as both a client node seeking access to resources provided by a server machine and as a server machine providing access to hosted resources for other clients 602a, 602b.
  • the network 604 may he a local-area network (LAN), e.g., a private network such as a company Intranet, a metropolitan area network (MAN), or a. wide area network (WAN), such as the Internet, or another public network, or a combination thereof,
  • LAN local-area network
  • MAN metropolitan area network
  • WAN wide area network
  • the computing device 600 may include a network interface 618 to interface to the network 604 through a variety of connections including, but not limited to, standard telephone lines, local-area network (LAN), or wide area network (WAN) links, broadband connections, wireless connections, or a combination of any or all of the above. Connections may be established using a variety of communicatlon protocols.
  • the computing device 600 communicates with other computing devices 600 via any type and/or form of gateway or tunneling protocol such as Secure Socket Layer (SSL) or Transport Layer Security ( TLS),
  • SSL Secure Socket Layer
  • TLS Transport Layer Security
  • the network interface 618 may comprise a built-in network adapter, such as a network interface card, suitable for interlacing the computing device 600 to any type of network capable of communication and performing the operations described herein.
  • An I/O device 630 may be a bridge between the system bus 650 and an external communication bus,
  • the systems and methods described above may be implemented in many different ways in many different combinations of hardware, software firmware, or any combination thereof.
  • the systems and methods can be implemented with a processor and a memory, where the memory stores instructions, which when executed by the processor, causes the processor to perform the systems and methods.
  • the processor may mean any type of circuit such as, but not limited to, a microprocessor, a microcontroller, a graphics processor, a digital signal processor, or another processor.
  • the processor may also be implemented with discrete 1 ogic or compon ents, or a combination of other types of analog or digital circuitry, combined on a single integrated circuit or distributed among multiple integrated circuits.
  • All or part of the logic described above may be implemented as instructions for execution by the processor, controller, or other processing device and may be stored in a tangible or non-transitory .machine-readable or computer-readable medium, such as flash memory, random access memory (RAM) or read only memory (ROM), erasable programmable read only memory (EPROM) or other machine-readable medium such as a compact disc read only memory (CDROM), or magnetic or optical disk.
  • a product such as a computer program product, may include a storage medium and computer readable instructions stored on the medium, which when executed in an endpoint, computer system, or other device, cause the device to perform operations according to any of the description above.
  • the memory can be implemen ted with one or more hard drives, and/or one or more drives thai handle removable media, such as diskettes, compact disks (CDs), digital video disks (DVDs), flash memory keys, and other removable media.
  • the processing capability of the system may be distributed among multiple system components, such as among multiple processors and memories, optionally including multiple distributed processing systems.
  • Parameters, databases, and other data structures may ⁇ be separately stored and managed, may be incorporated into a single memory or database, may be logically and physically organized in many different ways, and may implemented in many ways, including data structures such as linked lists, hash tables, or implicit storage mechanisms.
  • Programs may be parts (e.g., subroutines) of a single program, separate programs, distributed across several memories and processors, or implemented in many different ways, such as in a library, such as a shared library (eg,, a dynamic link library (DLL)).
  • the DL L for example, may store code that performs any of the system processing described above.

Abstract

A system and method include a processor and a memory, where the memory stores instructions, which when executed by the processor, causes the processor to determine whether a session is hard-to-understand. When the session is hard-to-understand the processor provides an adjustment for the session.

Description

SYSTEM AND METHOD FOR ADDRESSING
COMMUNICATION ISSUES FOE CONTACT
CENTER SERVICE QUALITY
BACKGROUND
[0001] Contact centers may be used by an organization to communicate in an efficient and systematic manner with outside parties. Such contact centers may for example have large numbers of agents staffing telephones and interacting with outside parties and with each other, The contact centers can include an interactive voice response (1 VR) system to handle calls, record messages and/or place calls with agents at the contact center.
BRIEF DESCRIPTION OF THE DRAWINGS
[0002] In association with the following detailed description, reference is made. to the accompanying drawings, where like numerals in different figures can refer to the same element.
[0003] Figure 1 is a schematic block diagram of an exemplary system supporting a contact center.
[0004] Figure 2 is a block diagram of an exemplary system associated with the recognition server for capturing and analyzing data.
[0005] Figure 3 is a table illustrating an exemplary relation, between MOS values, R- vaiues and user satisfaction.
[0006] Figure 4 is a block diagram of exemplary categorization of customer calls, e.g., based on determined phrases.
[0007] Figure 5 Is a flow chart of an exemplary logic of the system to determine, analyze and address hard-to-understand communications, e.g.. In the context of a contact center.
[0008] Figures 6 is a block d iagra m, of an exemplary computing device.
[0009] Figures 7 is a block diagram of an exemplary computing device.
[0010] Figures 8 is a block diagram of an exemplary computing device,
[0011] Figures 9 is a block diagram of an exemplary computing device.
[0012] Figure 10 is a block diagram of an exemplary network environment including several computing devices.
DETAILED DESCRIPTION
[0013] Systems and methods can provide for determinin g and adjusting to hard-to- understand sessions, e.g., for improving service quality in the contact center setting, in one example, a phone conversation with bad transmission quality can create stress to a customer and/or agent since the brain tries to fil l the missing gaps, even if the customer and. agent do not realize It This can lead to mental exhaustion and a negative emotion. There is increasing interference in the world due to wide-spread use of radio signals, which often decreases a quality of mobile telephony. Internet and landline connections can be affected as well. Additionally or alternatively, there can be language barriers when customer and agents communicate, e.g., in terms of vocabulary -and regional accents. Additionally or
alternatively, the systems and methods can also identify a helpfulness of the agent to the customer such as an agent understands the customer's issue clearly and adjusts accordingly. The systems and methods .address different scenarios in the context of hard-to-tmderstand, e.g., media related issues regarding voice and text quality, and content related scenarios.
[0014] For the different scenarios, the communication peers may consciously or unconsciously perceive that there are issues. For example, even when the callers can filter out distractive noise and fill in gaps without callers noticing, their exhaustive brain's work can gradually make them- unhappy and result in bad experience. The systems and methods can determine and address explicitly perceived communication issues, e.g.. low experience scores and/or explicit negative phrases or words used during the conversation. The systems and methods can also determine and address unconsciously perceived issues, e.g., by monitoring a quality of the communication lines, monitoring phrases used during a conversation, etc.
[0015] Figure 1 is a schematic block diagram of an exemplary system, e.g., a system supporting a contact center. The system can be configured to distribute information and task, assignments related to interactions with end users (also referred to as customers), to employees of an enterprise, e.g.. customer care agents. These task assignments are also referred to herein, as work, items. The contact center may be an in-house facility of the enterprise and may serve the enterprise in performing the functions of sales and service relative to the products and. services available through the enterprise. In another exemplary embodiment, the contact center may be a third-party service provider. Sometimes a quality of the communication lines between the customers and the agents can be poor, even if imperceptibly so, A risk of a low quality voice connection for borne agents may be even higher than for agents that work from the contact center. Additionally, knowledge workers who are experts in the enterprise but not full-time agents may answer customer calls with their mobile phone, which can exhibit poorer quality than landlines for example.
[0016] The contact center infrastructure- may be hosted in equipment dedicated to the enterprise or third-parry service provider, and/or hosted in a. remote computing environment such as, for example, a private or public cloud environment with infrastructure for supporting multiple contact centers for multiple enterprises. The contact center can include resources (e.g. personnel, computers, and telecommunication equipment) to enable delivery of services via telephone or other communication mechanisms. Such services may vary depending on the type of contact center, and may range from customer service to help desk, emergency response, telemarketing, order taking, and the like. These are some exemplary contexts for the hard-to-understand sessions,
[0017] Customers, potential customers, or other end users desiring to receive services from the contact center may initiate inbound calls to the contact center and/or recei ve outbound calls via their end user devices 10a-10e (collectively referenced as 10). The end user devices 10 may be a communication device, for example, a telephone, wireless phone, smart phone, personal computer, electronic tablet, and/or the like. The mechanisms of contact, and the corresponding user devices 10, need not be limited to real-time voice communications as in a traditional telephone call, but may be non-voice communications including text, video, and the like, and may include email or other non«real«time means of communication. This generalized form of a contact between an end user and the contact center may include methods of communication other than, voice, and an endpoint other than a telephone, e.g. interactions.
[0018] Inbound and outbound interactions from and to the end user devices 10 may traverse a telephone, cellular, and/or data communication network 14 depending on the type of device that is being used. For example, the communications network 14 may include a private or public switched telephone network (PSTN), local area, network (LAN), private wide area network (WAN), and/or public wide area network such as, for example, the internet The communications network 14 may also include a wireless carrier network including a code division multiple access (CDMA) network, global system for mobile communications (GSM) network, and/or any 3G, 4G, LTE, etc. network.
[0019] The contact center can also include an outbound contact server 54 to perform outbound functions, e.g., in which contact center agents make outbound calls to customers on behalf of a business or client. Calls made from the contact center can include telemarketing, sales or fund-raising calls, as well as calls for contact list updating, surveys or verification services. The systems and methods described herein can be used for both, inbound and outbound communications, e.g., to determine if hard-to-understand. conditions of the communications exist,
[0020] hi general, the contact center includes a switch/media gateway 12 coupled to the communications network 14 for receiving and transmitting interactions and/or data between end users and the contact center. The switch/media gateway 1 2 may include a. telephony switch configured to function as a central switch for agent level routing within the center, in this regard, the switch/media gateway 12 may include an automatic interaction distributor, a private branch exchange (PBX), an IP-based software switch, and/or any other switch configured to receive Iniernet-soureed interactions and/or telephone network-sourced interactions. The switch can be coupled to a call server 18 which may, for example, serve as an adaptor or interface between the switch/media gateway 12 and the remainder of the routing, monitoring, and other interaction-handling systems of the contact center. The call server 18 can connect to other elements, e.g., described herein, via a communication/message bits S 3.
[0021] The contact center may also include a multimedia/social media server 24 connected with the communication/message bus 13. The multimedia/social media server 24 may also be referred to as an mteraction server, for engaging in media interactions other than voice interactions with the end user devices 10 and/or web servers 32. The media interactions may be related, for example, to email, chat, text-messaging, web, social .media, and the like. The web servers 32 may include, tor example, social interaction site hosts for a variety of known social interaction sites to which an end user mav subscribe, such as. for example, FACEBOOK.TM, TWITTER™, and the like. The web servers may also provide web pages for the enterprise that is being supported by the contact, center. End users may browse the web pages and get information about the enterprise's products and services. The web pages may .also provide a mechanism for contacting the contact center, via, for example, web chat, voice call, email, web real time communication (WebRTC), or the like.
[0022] The switch can be coupled to an interactive voice response (IVR) server 34. The IVR server 34 is configured, for example, with an I VR script for querying customers oh their needs. For example, a contact center for a bank may tell callers, via the I VR script, to "press I " if they wish to get an account balance. If this is the case, through continued interaction with the IVR, customers may complete service without needing to speak with an agent.
[0023] If the interaction is to be routed to an agent, the interaction is forwarded to the call server 18 which interacts with a routing server, referred to as a Universal Routing Server (URS) 20. for finding the most appropriate agent for processing the interaction. Additionally or alternatively, the URS 20 can handle routing, orchestration and conversation management among other things. The call server 18 may be configured to process PSTN calls, VoIP calls, and the like. For example, the call server 18 may include a session initiation protocol (SIP) server for processing SIP calls. The call server 18 may include a telephony server (T-server).
[0024] in one example, while an agent is being located and until such agent becomes available, the call server may place the interaction in an mteraction queue. The interaction queue may be implemented via any data structure, such as. for example, a linked list, array, and/or the like. The data structure may be maintained, for example, in buffer memory provided by the call server 1 8.
[0025] Once an appropriate agent is located and available to handle a call, the call is removed from the call queue and transferred to the corresponding agent device 38a-38b.
Collected information about the caller and/or the caller's historical information may also be provided to the agent device for aiding the agent in better servicing the call. The information may also be provided to a stakeholder device 38c for monitoring and training purposes, A stakeholder may be a contact center manager or a supervisor of one or more agents.
Stakeholders need not be contact center employees; a product manager employed by the same enterprise, or by another enterprise supported by the contact center, may for example be a stakeholder. The agent/stakeholder device 38a-c may include a telephone adapted for regular telephone calls, VoIP calls, and the like. The agent and stakeholder devices 38a-c may also include a computer for communicating with one or more servers of the contact center and performing data processing associated with contact center operations.
[0026] The selection of an appropriate agent for routing an inbound interaction (e.g. a telephony call or other multimedia interaction) may be based, for example., on a routing strategy employed by the routing server 20, and further based on information about agent availability, skills, agent location, and other routing parameters provided, for example, by a statistics (stat) server 22. For example, the siat server 22 may accumulate data about places, agents, and place/agent, groups, convert the data, into .statistically useful information, and pass the calculations to other software applications. The siat server 22 may provide information to the .routing server about agents' capabilities in terms of interactions they are handling, the media type of an interaction, and so on.
[0027] An exemplary routing strategy employed by the routing server 20 may be that if a particular agent agent group, or department is requested, the interaction is routed to the requested agent, agent group, or department as soon the requested entity becomes available, if a particular agent has not been requested, the interaction may be routed to agents with the requested skill as soon as those agents become available, if a particular agent group or department has not been requested, the interaction is removed from the routing server queue and routed to an agent group or department handling back-office work. The interaction may be routed directly to agents tor immediate processing in some instances. The interaction may be placed into a queue, or for deferred media, the interaction may be placed in a workbin 26a- c, etc. associated with a back-office agent group or department. The workbin 26a-c can include various types of workbins, including a personal agent level workbin, an agent, group workbin, an administrative workbin, etc. in this regard, the routing server 20 may be enhanced with functionality for managing back-oftice/offline activities that are assigned to enterprise employees. Such activities may include, for example, responding to emails and letters, attending training seminars, or perforating any other activity (whether related to the contact center or not) that does not entail synchronous, real-time communication w ith end users, for example, a non-contact center activity that may be routed to a knowledge worker may be to fill out forms for the enterprise, process claims, and the like.
[0028] Once a work item is assigned to an agent, the work item may appear in the agent's workbin 26a-26b (collectively referenced as 26) as a work item to be completed by the agent or the work item may be immediately processed by the agent, e.g., similar to voice calls. The agent's workbin may be implemented via any data structure, such as, for example, a linked list, array, and/or the like. The workbin may be maintained, for example, in buffer memory of each agent's computer device and/or maintained on a server to allow for work, item reassignments to other agents. A stakeholder device 38c may also have an associated workbin 26c storing work items for which the stakeholder is responsible. Work items may be assigned to various targets, including, as described above, agents and stakeholders, including other persons associated with an enterprise, and including non-human targets sack as a servers or computing devices. For example, the assignment of a work item to a target may ha ve the effect, of activating a particular email, or a voice response announcing, "You are complaining about a slow internet connection. We are experiencing a problem in your area and are working to resolve it."
[0029] The multimedia/social media server 24 may also be configured to provide, to an end user, a mobile application for downloading onto the end user device 10. The mobile application may provide user configurable settings that indicate, for example, whether the user is available, not available, or availability is unknown, for purposes of being contac ted by a contact center agent. The multimedia/social media server 24 may also monitor the status settings.
[0030] The contact center may also include a reporting server 28 configured to generate reports from data aggregated by the siat server 22. Other sources for reporting include an interaction concentrator (ICON) collecting atomic events from various media servers and composing call detail record (CDR) type records. These data are read by an extract, transform and load Ell. tool 220 of the mining system 60, and into a consolidated data source for business analytics and data-mining, e.g., the Ge.nesys Info Mart (GIM) by Oenesys Telecommunications Laboratories, Inc., which serves a business intelligence (Bi) application. Such reports may include near real-time reports or historical reports concerning the state of resources, such as, for example, average waiting time, abandonment rate, agent occupancy, and the like. The reports may be generated automatically or in response to specific requests from a requestor, e.g. agent/stakeholder, contact center application, and/or the like.
[0031] An interaction analytics server 46 may be used to monitor the interactions in the contact center and analyze all or some of them to identify or quanti fy certain characteristics of the interaction. These characteristics may include topics, sentiment, satisfaction, or business outcome. An intelligent workload distribution server (iWD server) may be used to create work items; the iWD server may employ a rules system (GRS) 44, which may be a separate entity, or which may be an element of the iWD server. A work item may be more effective than, e.g., an email request, in that the system may assign a due date, monitor progress, and escalate the work item to a supervisor if ft is not completed. The i WD server may prioritize a work item and specify characteristics, such as particular skills, needed, to handle the work item. The work item may then be sent to another server, such as a routing server 20, which, using information provided by a stat server 22, may identify a particular agent with the specified characteristics, e.g., qualified to handle the work item, and assign the work item to that agent. The GRS 44 can also 'be used by other services, e.g., orchestration or ITS multi -media (reprioritization).
[0032] The interaction analytics server 46 can also use rules (GRS) directly rather than through the I WD server. The interaction analytics server 46 can trigger actions, such as notifying agents, supervisors and customers, and can perform speech analytics and actionable sentiment analysis, e.g., for determining hard-to-understand communications. Findings can be stored in a universal contact server (UCS) 50 for follow-up analysis, e.g. correlation with survey. The contact center can also include a mass storage device 30 for storing data related to contact center operations such as, for example, information related to agents, customers, customer interactions, and the like. The mass storage device may take the form of a hard disk or disk, array.
[0033] The various servers in the contact center may he a process or thread, running on one or more processors, in one or more computing devices 600 (e.g., Figure 6. Figure 7), executing computer program instructions and interacting with other system components for performing the various functionalities described herein. The computer program instructions are stored in a memory which may he implemented in a computing device using a standard memory device, such as, for example, a random access memory (RAM). The computer program instructions may also be stored in other non-transitory computer readable media such as, for example, a CD-ROM, flash drive, or the like. Also, a computing device may be implemented via firmware (e.g. an application-specific integrated circuit), hardware, or a. combination of software, firmware, and hardware. The functionality of various computing devices may be combined or integrated into a single computing device, or the functionality of a. particular computing device may be distributed across one or more other computing devices. A server may be a software module, which may also simply be referred to as a module. The set of modules in the contact center may include servers, and other modules.
[0034] Other contact center elements that can be used for determining, analyzing and addressing hard-lo-understand communication conditions, e.g., in the contact center or other environment, include a workforce management server 52, a quality of service monitor 56, survey feedback services 58, hard-to-understand assessment server 59 and a recognition server 60.
[0035] Figure 2 is a block diagram of an exemplary system associated with the recognition server 60, e.g., for capturing and analyzing audio and metadata, e.g., to determine hard-to-understand sessions. The data raining system 60 can provide for real-time detection of hard-to-understand, e.g., for being able to trigger corrective actions, and/or for non-real time scenarios, e.g., checking whether negative survey responses correlate with hard-to- understand sessions. [0036] For purposes of explanation, the example is a customer interacting with a contact center via a telephone, but other implementations may use the systems and methods. A recording system 210 can record interactions between the customer and the contact center, including live voice calls, voicemails, email texts, scanned copies of letters, etc. The ETL tool 220 can extract varying types of call and .other interaction data from the recording system 210, prepare files and corresponding metadata for processing, and load the files for storage in an input folder. The results can be uniformly stored as an audio file and an xml file, A feicher task 230 can move the audio files from the input, folder to a store folder 240 and write the metadata to a database server 250, One or more recognition servers 60 includes a recognizer task to -read audio files from the store folder 240 and create a compressed version of the audio file in the store folder 240,
[0037] The recognition servers 60 can identify data thai indicates that customer is having or had a poor response to the interaction with the contact, center. When the contact center notices a customer's poor .response to the interaction, with the contact center, in one instance the poor response can imply thai the agent is doing a bad job or the IVR script is composed poorly.
[0038] Referring also to Figure 1 , another potential root cause of the poor response can be a bad connection between the contact center and. customer, the connection including high jitter, packet loss, latency, etc. A network probe system including an interface to the hard-to- understand assessment server 59 can be used to detect and measure the bad. connections. The bad connection may cause the customer to have a hard time communicating with the contact, center, and vice versa, thereby making the experience more stressful and less enjoyable. Communication issues can occur when the customer interacts with the IVR 34, a live agent, etc. Even poor music quality while the customer is on hold may affect the customer's experience with the contact center. In one example, the customer experience data can be determined by the recognition servers 60,
[0039] The recognition server's recognizer task can write recognition results and a categorizer task, can write category results to the to the database server 250. A computer 270 can make updates or changes to the 'recognition and category results. An index task writer can write recognition results, category results and metadata to an index folder 280 on a network server 285, e.g., web server. A computer 290 of the contact center agent can access search, reports* dashboards, etc., to view customer experience data. For example, a contact center agent can access the data via the computer 290. Changes to the contact center personal, equipment, networks, etc. can be made in response to the customer experience data, e.g., pre-connecfion with the agent, during the call with the agent and/or after the call.
[0040] For example, during a customer self-help with the IVR 34, e.g., for high background noise the system cars, suggest to the customer to change his location, for a poor mean opinion score (MOS) the system can suggest to customer to call again using different phone, the system can ask the customer if he prefers a call back at a specified time and/or suggest to the customer to use non-voice self-help option. During a customer-agent call, for a poor MOS the system can suggest switching to a text chat communication, suggest scheduling another call and/or co-browse options since speech analytics performance can improve when visual information is added to the conversation. Adding video, e.g., to voice can also help address hard-to-understand scenarios which are not caused by poor network connection, but, e.g. pronunciation. The video can also help in case of background noise.
[0041] For agent communication issues, the system can alert a supervisor to join the call if available and/or for a severe dissatisfaction level the system can suggest transferring the call to another agent. For customer communication issues, the system can ask if the customer prefer to switch to an agent speaking a different language, and/or different education level of language, if available and/or via a pop-up message to the agent (Agent Assist), instruct the agent, to repeat, the important facts slowly and clearly and make sure that the customer understands them. As used herein, alternative to suggesting a video chat, browse option, transfer to a supervisor, etc. the actions can be initiated automatically by the system.
[0042] Post-call, for agent issues, e.g., including technical QoS and content related issues, if an agent has several hard-to-understand call sessions above a certain threshold then the communication channels can be checked, a coaching/training session can be scheduled and/or the agent pulled off the calls. Agents and/or customers can be rated based on. the
communications and the information stored with the metadata for using to more accurately connect customers to agents on future calls. In other examples, from the metadata it may be determined that the agent scores low for harder to understand for particular days of the week, determined topics, for customers initiating calls from identified parts of the world, etc. and therefore the agent is not worked on those days. The metadata collected and the actions taken can be implementation dependent. Poor connection issues may not be counted against the agent, for example, but a home agent with consistently poor technical QoS or MOS can be removed from service until the connection problem is fixed.
[0043] In. addition to MOS, the system can consider other measures of a quality of the communication, e.g., the hard-to-understand condition can be also checked and taken into account when triggering follow-up actions regarding net promoter scores (MPS), For example, after the call the customer can be asked how likely it is that they would recommend the company to a friend or colleague to determine the NFS. For severe agent communication issues, the system can follow-up with the customer via out bound message and suggest another call with supervisor or highly skilled agent, or the system can automatically make that call. Calls with low average MOS score indicating a poor telecommunication system performance during the call may not be utilized against the agent during quality management processes. [0044] Another agent characteristic is the agent's ability to adapt to the customer questions. One measure of the ability to adapt is the richness of language used by the agent. One measure of the richness is perplexity which is based on established information theoretic principles and measures the difficult)' of the task. The perplexity of the agent speech can correlate with less predictability and less scripted conversation. Therefore, the language skill levels of agents can be considered. The customers' language skill levels can also be assessed because if a customer cannot fully understand the agent the same effect of hard-to-understand may occur. Voice recognition can help to determine a customers' language skills, in one example, customers with poor language skills can be connected to agents with cleaner pronunciation. In agent low perplexity situations, the agent, can be coached to be more flexible, f or customer communication issues, the system can follow with outbound message to memorialize the call details in writing.
[0045] Therefore, the call interaction data can be used to detect customers' emotions and communication, issues and corrective actions can. be triggered by the contact center agent or automatically by the systems and method, or both. Both the customer .and the contact center agent can be exposed to the same communication conditions, e.g., a poor quality connections. The computer 290, in one example agent devices 38a-3Sh or admin device 38c. can display the conditions to the agent. For example, the computer 290 can display the MOS value of the quality of the network because the agent may not consciously notice the noise on the network, In one example, the connection can be terminated and redialed based on the MOS value and possible other factors, e.g., taking into account the applied coder/decoders
(codecs). In another example, the customer can determine to adjust their communication mode if they are made aware of the situation. For background, channel noise or language Issues, the call can be switched to chat or video, etc., as described.
[0046] As used herein, the systems and methods can provide suggestions in contexts other than voice calls. For example, in the con text of chat and tex ts, the system can suggest a call or video call when a hard-to-understand session is detected.
[0047] figure 3 is a table 300 illustrating an exemplary relation between R-values (transmission rating factor) 302, MOS values 304. OoB (percentage good of better) 306, PoW (percentage poor or worse) 308 and user satisfaction 310, e.g.,., based on a G J 07 international telecommunication union (ITU) scale. The hard-to-understand assessment server 59 can extract features and measurements from either a sell-help call, e.g., with IVR 34, or with a customer-agent call. The MOS value 304 includes an overall noise estimation, e.g., a measurement of the overall noise level of the call. In one implementation, a call with MOS value below 3.6 can be considered a hard-to-understand session, whether explicitly identified by the caller as such or not. Multiple levels of severity of user dissatisfaction can exist, e.g., 3.1 that many users are dissatisfied.,, and 2,58 that nearly all users are dissatisfied. [0048] In. some implementations, the background noise can be estimated separately from the overall noise represented by the MGS value 304. For example, an application installed on a mobile phone can estimate the background noise during pauses in a conversation and broadcast the estimated noise back to the hard-to-understand assessment server 59 or other location. If the estimated background noise is above a determined level, the customer experience with the IVR 35 or Eve agent can be adversely affected, even if the customer if not conscious of the background noise. Background noise can include traffic noise, street noise, airport noise, babies crying, dogs barking, and other noises in the environment During a call with the agent or even pre-call when the customer is interacting with the IVR 34, the system can prompt the customer to move away from the background noise. Additionally or alternatively, if a problem with background noise is detected during the call, the call can be switched to chat, video, etc. to help reduce the effects of hard»toiraderstand sessions due to the background noise,
[0049] Figure 4 is a block diagram of exemplary categorization of customer calls, e.g., based on determined phrases. The customer may verbalize communication issues with the I VR 34 or agent Speech analytics can infer if the customer is complaining about communication issues, e.g., by looking for spoken phrases. The phrases can be categorized into topics 410, e.g., by communication, language, repeat requests, etc. The categories can be determined as union of mapped phrases 420. For example, if the caller states "I can't hear anything'* the call can be classified as a communication issue. A call can be classified as a repeat requests if the customer utters phrases such as *Can you repeat it please?" or "I need you to say it again". Similarly, the agent can express his inability to understand the customer speech. Additionally or alternatively, the system, can determine a helpfulness or lack of helpfulness of the agent using speech analytics with regard to whether or not the agent understands the customer's issue clearly and/or has the experience level to be able to address the issue. A speech analytics system can be used to perform phrase recognition to detect such phrases in a phone conversation. An exemplary speech recognition system is described, in U.S. Patent No. 7,487,094 B l , "System and Method of Call Classification, with Context Modeling based on Composite Words", Konig et.aS.
[0050] Automatic Speech Recognition (ASR.) systems, and l.VCSR (Large Vocabulary Continuous Speech Recognition) transcription (speech-to-text) engines can output a sequence of recognized words and for each word an associated confidence measure. The average con fidence can be served, as a measure of understands bility of the spoken words i n. the conversation. The measure can be computed for the agent side and for customer side separately.
[0051] Figure 5 is a flow chart of an exemplary logic 500 of the system to determine, analyze and address hard-to-understand sessions, e.g.. in the context of a contact center, While audio communication which is hard-to-understand can negatively impact the customer experience during a contact center call and lead to dissatisfaction, e.g., customer's bad rating In a survey or frustration observed during call monitoring/recording, the service itself might have been actually good. A conclusion from customer's negative feedback need not indicate that the agent did not do a good job and needs training on the subject, needs transferring to a different job. needs his proficiency downgraded, etc. The customer's dissatisfaction may have been mainiv caused bv hard-to-understand conditions which can be addressed differently. When detecting the hard~to«undersiand condition during the ongoing conversation the system can inform both the customer and the agent, because they might not be aware about it. Other implementations include the system notifying only the agent, who might notify customer, the system notifying only the customer, e.g. during SYR call, etc. fetting the customer and/or agent know about the hard-to-understand conditions can help to improve the situation and trigger real time corrective actions.
[0052] For explanation purposes, the following hard-to-imderstand situations can be considered: poor audio transmission, quality, e.g., low MOS, .language barrier, and/or an. agent's ability to understand the customer's issue clearly. A language barrier can include a customer's low language proficiency, gender preference, partial hearing disability, and/or a customer, agent or IVR's low language proficiency, ability to pronounce clearly, proficiency with foreign names, and dialect, e.g. using uncommon expressions. A contact center agents' language proficiency can be taken into consideration, for example through corresponding skill level assignment and incorporation in call routing strategies.
[0053] The audio transmission quality, e.g., due to background noise and/or channel noise such as high jitter, packet loss, latency, etc.. can be detected (502). Poor audio transmission quality can create stress at the listening party, either consciously or
subconsciously, because the brain tries to fill the missing gaps, which can leads to mental exhaustion and negative emotion. The listening party may not even be aware of this because it is happening subconsciously at slight degradation of sound quality which may not be noticeable yet. Poor transmission quality is. increasing due to widespread use of radio signals for different purpose, which can cause interference. Similar effects can happen in ease of a language barrier.
[0054] Language related aspects can be captured and rated, e.g., proficiency, level, dialect, etc., in customer's profile (504). The customer is associated, with one or several, languages, and when receiving a call from the customer the appropriate language is selected for IVR self-service, and for assisted service the call is routed, to an. agent with required language skills. There may be still a language-related mismatch, which can have similar results as in case of poor audio transmission.
[0055] During a customer's call with the contact center the MOS of the audio connection is determined (506). Parameters for determining the MOS include codec -related
impairments, Impairments due to the packet loss and delay-related impairments. The parameters can be measured in real time. One or more MOS thresholds can be determined as:
[0056]
Figure imgf000015_0001
no action required;
[0057]
Figure imgf000015_0002
degraded but still acceptable, potentially causing stress and negative emotion;
[0058]
Figure imgf000015_0003
degraded but as exception acceptable, high probability of causing stress and negative emotion; and
[0059]
Figure imgf000015_0004
unacceptable low, immediate corrective action required;
[0060] where TI is about 4,03, T2 is about 3.6 and T3 is about 3. 1. Other values can be used. For example, the thresholds can be iteratively adjusted based on actual experiences during contact center operation. Thresholds can be also determined based, on the company that provides the service or product, because some companies can tolerate low MOS more than others. For example, the service level transfer level, escalation level information can be considered, .for the particular company and/or as compared to benchmark data for a group of companies. If the company is performing better than peers it may want to have more tolerance, or if performing worse than peers the company may want to have less tolerance.
[0061] In case of low MOS values, for example between T1 and 13, there is a risk of customers becoming stressed and dissatisfied with the call because of poor voice quality over the telecommunication system, e.g., regardless of how well the system, is communicating with the customer. To mitigate the effects of the poor connections, the customer can be informed about degraded voice quality of line, e.g. through IV R, text message (SMS) or a pop-up on the screen if customer is interacting through web she. In case of assisted service the message can a!so be shown also to the agent, both, for .informing about potentially expected customer's dissatisfaction, but also tor agent's own benefit who may experience the same
stress/dissatisfaction. During assisted service the system can let the agent inform the customer about the poor connection, in addition to or instead of sending a respective message to the customer. Additionally or alternatively, telecommunication lines with low MOS values can be disabled and/or calls dropped if MOS is too low. The MOS value of a given is call can be recorded as part of call metadata, e.g., metadata described with figure 2, and can. be utilized during post processing.
[0062] A similar system logic can be applied when there are language related issues and/or agent helpfulness issues that prevent customers to interact conveniently with the contact center, both with S. VR and live agents. In this case real time call recording and analysis can be used to determine potential issues. For example, the customer may ask the agent frequently to repeat something, potentially also asking the agent to say it differently. The agent may also have problems in understanding the customer. The language matching level (L.ML) can be quantified and captured to be added to the call metadata (508). The LML value can be based on information and -measures of language matching and proficiency. The LML and/or MOS values can be used during ongoing live call, for example a warning displayed to agent for either adjustment or suggested/automatic transfer to better matching agent, and during post processing, e.g. when assessing survey results (510).
[0063] The system logic can be used to capture details on the customer's language skills and preferences. The information can be used in routing of a customer's future calls, e.g. selecting an agent with customer's preferred dialect, or an agent with very
clean/correct/adjusted pronunciation, e.g., pronouncing geographical names in Spanish for customer of Mexican origin, even if the call is in English. The LML information can be used also for contact center planning, e.g. training or hiring agents to better match customers' language specifics.
[0064] A technical implementation for MOS can include analyzing real-time transport protocol RTF streams for packet loss and latency, taking the codec into account and calculating the MOS. For LML the real time speech analysis can be integrated in order to measure requests to repeat something, e.g., by customer or agent, misunderstandings, i f either party continues conversation in a way that contradicts with what has been actual ly said, etc. The LML value can be based on a determined scale and used to compose a customer's language profile, which can be taken into account for future call routing and IVR applications selecting. For example, the system can maintain different !VR scripts on the same subject for different customer language profiles, even for a same base language such as English. Other examples include maintaining different IVR scripts with more or less sensitivity to poor voice connection, e.g., based on the actual content (words) and/or intonation (including
male/female voice), eic. iiiierdependencies between MOS and LML can also be considered, e.g., low MOS can cause degraded LML, Additional interdependences captured as metadata can include calf duration, e.g., exhaustion and. stress are higher for long duration calls, and whether or not the customer and agent have been informed about detected hard-to-imderstand condition already during the call. If a customer accepts the invitation for answering to the NFS or other survey, the hard-to-understand condition for the customer's call can be factored, if the given call suffered from poor MOS, long duration, etc. then this can be displayed as additional information, to the customer.
[0065] Additionally or alternatively, intelligent quality of service (QoS) alerting can be distributed among the contact center systems. The customers can be offered new channels if the dialog is detected as being poor. Agent scripting can be controlled dynamically based on the detection of negative customer experience or if the system detects a compliance risk. The ease of dynamic scripting allows speech analytics to trigger new scripting for the agent as the system detects missing contest or negative customer sentiment.
[0066] Therefore, in one example a. customer calls a contact center and when interacting with the IVR 34 the system detects a low MOS of the telecommunication connection. The IVR 34 can prompt the customer to use another phone. When the customer calls back he is connected with a non-native speaking agent. The system detects a language issue, e.g., detects the phrase "I don't understand your English" and suggest or automatically switches the customer to an agent in the U.S. Then a native speaking agent is not qualified to helpfully address the customer's issue, so the customer is transferred to a supervisor. The supervisor understands the customer's Issue and is able to help the customer resolve it. The adjustments from one call to the next can occur automatically andtor by the system making suggestions to the customer.
[0067] Post call, since the MOS can he correlated with survey results, e.g., NFS results, if the customer gave a poor service rating and there was low MOS then the system can consider the low MOS to be a cause of unfavorable NFS results (512). A result list can be generated, correlated to hard-to-un derstand scenarios, and acted on based on the low MOS calls, e.g., signifying those calls may be less relevant for determining agent performance, addressing poor connections, calling the customer back to follow up with them, etc. The MOS related issues may not be counted against the contact center agent for agent review purposes.
[0068] Figures 6-10 are non-limiting examples of elements that can be used to execute the above description. Figure 6 and Figure ? depict block diagrams of an exemplary computing device 609 as may be deployed with the systems and methods described herein, in Figure 6 and Figure 7, the computing devices 600 can include a central processing unit 621 , and a main memory unit 622. In Figure 6, a. computing device 600 may include a storage device 628, a removable media interface 616, a network interface 618, an
input/output (I/O) controller 623, one or more display devices 630c, a keyboard 630a and a. pointing device 630b, such as a mouse. The storage device 628 may include, without limitation, storage for an operating system and software. In Figure 7, the computing devices 600 may also include additional optional elements, such as a memory-' pott 603, a bridge 670, one or more additional input/output devices 630d, 630e and a cache memory 640 in communication with the central processing unit 621 . Input/output devices, e.g., 630a, 630b, 630d, and 630e, may be referred to herein using reference numeral 630,
[0069] The central processing unit 621 is any logic circuitry that responds to and processes instructions fetched from the main memory unit 622. It may be implemented, for example, in an integrated circuit, in the form of a microprocessor, microcontroller, or graphics processing unit (GPU), or in a field-programmable gate array (PPGA) or application-specific integrated circuit (ASIC). Main memory unit 622 may be one or more memory chips capable of storing data and allowing any storage location to be directly accessed by the central processing unit 621 , In the embodiment shown in Figure 6, the central processing unit 621 communicates with main memory 622 via a system bus 650. Figure 7 depicts an embodiment of a computing device 600 in which the central processing unit 621 communicates directly with main memory 622 via a memory port 603, [0070] Figure 7 depicts an embodiment in which the central processing unit 621 communicates directly with cache memory 640 via a secondary bus, sometimes referred to as a backside bus. In other embodiments, the central, processing unit 621. communicates with cache memory 640 using the system bus 650, Cache memory 640 typically has a faster response time than main memory 622. in the embodiment shown in Figure 6, the central processing unit 621 communicates with various I/O devices 630 via a local system bus 650. Various buses may be used as a local system bus 650. including a Video Electronics Standards Association (VESA) Local bus (VLB), an industry Standard Architecture (ISA) bus, an Extended industry Standard Architecture (EISA) bus, a MicroChannel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI Extended (PCi-X) bus, a PCI-Bxpress bus, or a NuBus. For embodiments in which an I/O device is a display device 630c, the central processing unit 621 may communicate with the display device 630c through an Advanced Graphics Port (AGP). Figure 7 depicts an embodiment of a computer 600 in which the central processing unit 621 communicates directly with I/O device 630e. Figure 7 also depicts an embodiment in which local busses and direct communication are mixed: the central processing unit 621 communicates with I/O device 63 Od using a local system bus 630 while communicating with I/O device 630e directly.
[0071] A wide variety of I/O devices 630 may be present in the computing device 600, Input devices include one or more keyboards 630a, mice, trackpads, trackballs, microphones, and drawing tablets. Output devices include video display devices 630c, speakers, and printers. An I/O controller 623, in Figure (>, may control the I/O devices. The I/O controller may control one or more I/O devices such as a keyboard 630a and a pointing device 630b, e.g., a mouse or optical pen,
[0072] Referring again to Figure 6, the computing device 600 may support one or more removable media interfaces 616, such as a floppy disk drive, a CD-ROM drive, a DVD-ROM drive, tape drives oS various formats, a USB port, a. Secure Digital or COMPACT FLASH " memory card port, or any other device suitable for reading data from read-only media, or for reading data from, or writing data to, read-write media. An I/O device 630 may be a bridge between the system bus 650 and a removable media interface 616.
[0073] The removable media interface 616 may for example be used for installing software and programs. The computing device 600 may further comprise a storage device 628, such as one or more hard disk drives or hard disk drive arrays, for storing an operating system, and other related software, and for storing application software programs. Optionally, a removable media interface 616 may also be used as the storage device. For example, the operating system and the software may be run f rom a bootable medium, for example, a bootable CO.
[0074] in some embodiments, the computing device 600 may comprise or be connected to multiple display devices 630c, which each may be of the same or different type and/or form. As such, any of the I/O devices 630 and/or the I/O controller 623 may comprise any type and/or form of suitable hardware, software, or combination of hardware and software to support, enable or provide for the connection to, and use of, multiple display devices 630c by the computing device 600, For example, the computing device 600 may include any type and/or form Of video adapter, video card, driver, and/or library to interface, communicate, connect or otherwise use the display devices 630c, In one embodiment, a video adapter may comprise multiple connectors to interface to multiple display devices 630c, In other embodiments, the computing device 600 may include multiple video adapters, with each video adapter connected to one or more of the display devices 630c . In some embodiments, any portion of the operating system of the computing de v ice 600 may be configured for using multiple display devices 630c. In other embodiments, one or more of the display devices 630c may be provided by one or more other computing devices, connected, for example, to the computing device 600 via a network. These embodiments may include any type of software designed and constructed to use the display device of another computing .device as a second display device 630c for the computing device 600. A computing device 600 may be configured to have multiple display devices 630c.
[0075] A computing device 600 of the sort depicted in Figure 6 and Figure 7 may operate under the control of an operating system, which controls scheduling of tasks and access to system resources, The computing device 600 may be running any operating system, any embedded operating system, any real-time operating system, any open source operating system, any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of running on the computing device and performing the operations described herein.
[00 75] The computing device 600 may be any workstation, desktop computer, laptop or notebook computer, server machine, handheld computer, mobile telephone or other portable telecommunication device, media playing device, gaming system, mobile computing device, or any other type and/or form of computing, telecommunications or media device that is capable of communication and that has sufficient processor power and memory capacity to perform the operations described herein, in some embodiments, the computing dev ice 600 may have different processors, operating systems, and input devices consistent with the device,
[0076] In other embodiments the computing device 600 is a mobile device, such as a .lava-enabled cellular telephone or personal digital assistant (PDA), a smart phone, a digital audio player, or a portable media player. In some embodiments, the computing device 600 comprises a combination of devices, such as a mobile phone combined with a digital audio player or portable media player,
[0077] In Figure 8, the central processing unit 621 may comprise multiple processors P I,
Ρ2, P3, P4, and may provide functionality for simultaneous execution of instructions or for simultaneous execution of one instruction on more than one piece of data, in some embodiments, the computing device 600 may comprise a parallel processor with one or more cores. In one of these embodimenis, the computing device 600 is a shared memory parallel device, with multiple processors and/or multiple processor cores, accessing all available memory as a single global address space, in another of these embodiments, the computing device 600 is a distributed memory parallel device with multiple processors each accessing local memory only. In still another of these embodiments, the computing device 600 has both some memory which is shared and some memory which may only be accessed by particular processors or subsets of processors. In still even another of these embodiments, the central processing unit 62 S comprises a multicore microprocessor, which combines. two or more independent processors into a single package, e.g., into a single integrated circuit (iC). In one exemplary embodiment, depicted in Figure 9, the computing device 600 includes at least one central processing unit 62 ! and at least one graphics processing unit 621'.
[0079] In some embodiments, a central processing unit 62.1 provides single instruction, multiple data (SMD) .functionality, e.g., execution of a single instruction simultaneously on multiple pieces of data. In other embodiments, several processors in the central processing unit 621 may provide functionality for execution of multiple instructions simultaneously on multiple pieces of data (MSMD). In still other embodiments, the central processing unit 62 1 may use any combination of SlMD and MIMD cores in a single device.
[0080] A. computing device may be one of a plurality of machines connected by a network, or It may comprise a plurality of machines so connected. Figure 10 shows an exemplary network environment. The network environment, comprises one or more local machines 602a, 602b (also generally referred to as local machine(s) 602. client(s) 602, client node(s) 602, client machine(s) 602, client computers) 602, client device(s) 602, endpoint(s) 602, or endpoint node(s) 602) in commtmication with one or more remote machines 606a, 606b, 606c (also generally referred to as server maehine(s) 606 or remote maehine(s) 606) via one or more networks 604. In some embodiments, a local machine 602 has the capacity to function as both a client node seeking access to resources provided by a server machine and as a server machine providing access to hosted resources for other clients 602a, 602b.
Although only two clients 602 and three server machines 606 are illustrated in Figure 10, there may, in general, be an arbitrary number of each. The network 604 may he a local-area network (LAN), e.g., a private network such as a company Intranet, a metropolitan area network (MAN), or a. wide area network (WAN), such as the Internet, or another public network, or a combination thereof,
[0081] The computing device 600 may include a network interface 618 to interface to the network 604 through a variety of connections including, but not limited to, standard telephone lines, local-area network (LAN), or wide area network (WAN) links, broadband connections, wireless connections, or a combination of any or all of the above. Connections may be established using a variety of communicatlon protocols. In one embodiment, the computing device 600 communicates with other computing devices 600 via any type and/or form of gateway or tunneling protocol such as Secure Socket Layer (SSL) or Transport Layer Security ( TLS), The network interface 618 may comprise a built-in network adapter, such as a network interface card, suitable for interlacing the computing device 600 to any type of network capable of communication and performing the operations described herein. An I/O device 630 may be a bridge between the system bus 650 and an external communication bus,
[0082] The systems and methods described above may be implemented in many different ways in many different combinations of hardware, software firmware, or any combination thereof. In one example, the systems and methods can be implemented with a processor and a memory, where the memory stores instructions, which when executed by the processor, causes the processor to perform the systems and methods. The processor may mean any type of circuit such as, but not limited to, a microprocessor, a microcontroller, a graphics processor, a digital signal processor, or another processor. The processor may also be implemented with discrete 1 ogic or compon ents, or a combination of other types of analog or digital circuitry, combined on a single integrated circuit or distributed among multiple integrated circuits. All or part of the logic described above may be implemented as instructions for execution by the processor, controller, or other processing device and may be stored in a tangible or non-transitory .machine-readable or computer-readable medium, such as flash memory, random access memory (RAM) or read only memory (ROM), erasable programmable read only memory (EPROM) or other machine-readable medium such as a compact disc read only memory (CDROM), or magnetic or optical disk. A product, such as a computer program product, may include a storage medium and computer readable instructions stored on the medium, which when executed in an endpoint, computer system, or other device, cause the device to perform operations according to any of the description above. The memory can be implemen ted with one or more hard drives, and/or one or more drives thai handle removable media, such as diskettes, compact disks (CDs), digital video disks (DVDs), flash memory keys, and other removable media.
[0083] The processing capability of the system may be distributed among multiple system components, such as among multiple processors and memories, optionally including multiple distributed processing systems. Parameters, databases, and other data structures may¬ be separately stored and managed, may be incorporated into a single memory or database, may be logically and physically organized in many different ways, and may implemented in many ways, including data structures such as linked lists, hash tables, or implicit storage mechanisms. Programs may be parts (e.g., subroutines) of a single program, separate programs, distributed across several memories and processors, or implemented in many different ways, such as in a library, such as a shared library (eg,, a dynamic link library (DLL)). The DL L, for example, may store code that performs any of the system processing described above.
[0084] While various embodiments have been described, ii can be apparent that many more embodiments and implementations are possible. Accordingly, the embodiments are not to be restricted.

Claims

Claims;
1 . A system, comprising:
a processor and a memory, where the memory stores instructions, which when executed by the processor, causes the processor to extract, transform and load call data and detect communication issues, language Issues and repeat requests of a caller; and when the communication issues, language issues or .repeat requests are identified provide an adjustment.
2. The system of claim 1 , where the communication issues, language issues and repeat requests are determined pre-cal 1 to a. contact center agent.
3. The system of claim 2, where the adjustment pre-cal! comprises a message to the caller to change a phone or change a location, of the caller.
4. The system of claim 1, where the communication issues, language issues and repeat requests are determined during a call with a contact center agent.
5. The system of claim 4, where the adjustment during the call with the contact center agent comprises transferring the caller to another agent or adding video, text or chat to the call
6. The system of claim I , where the communication issues, language issues and repeat requests are determined post-call to a contact center agent
7. The system of claim 6, where the communication issues, language issues and repeat requests are not counted against the contact center agent
8. The system of claim where the communication issues comprise background noise or channel noise.
9. The system of claim 1 , where the language issues comprise caller language issues or customer language issues.
10. The system of claim 1 , where a caller profile is updated, based on the communication issues, language Issues and repeat requests.
11. The system of claim 1, further comprising to detect a contact center agent ability to understand a caller issue.
12. A method, comprising:
extracting call data for a session of a caller;
transforming the extracted call data for loading;
loading the transformed call data for processing;
detecting, by a processor, from: the loaded call data communication, issues, language issues and repeat requests of the caller; and
adjusting, by the processor, the session when the communication issues, language issues or repeat requests are identified.
13. The method of claim 12, where adjusting comprises sending a message to the caller to change a phone or change a location of the caller.
14. The method of claim 12, where adjusting comprises at least one of transferring the caller to another agent and adding video, text or chat to the call.
1 5. The method of claim 12, where the communication issues comprise background noise or channel noise.
16. The method of claim 12, where the language issues comprise caller language issues or customer language issues.
1 7. The method of claim 1 2, further comprising updating a caller profile based on the communication issues, language issues and repeat requests.
18. The method of claim 12, further comprising detecting a contact center agent ability to understand a caller issue and adjusting the session if the contact center agent does not understand the issue.
19. The method of claim 12, where adjusting comprises changing an interactive voice response script.
20. The method of claim 12, further comprising determining if the communication issues comprise background noise or channel noise and adjusting based on the determination.
21. A system, comprising: it processor and a memory, where the memory stores instructions, which when executed by the processor, causes the processor to determine whether a session with a caller is hard4o-undersUmd; and
when the session is hard-to-understand provides an adjustment for the session.
22. The system of claim 21, where hard4o-un.derst.and comprises at least one of a poor audio transmission quality and a language barrier.
23. The system of claim 22, where the poor audio transmission quality Is determined by a mean opinion score.
24. The system of claim 23, where a threshold for the mean opinion score is iterativeiy determined.
25. The. system of claim 22, where the language barrier comprises at least one of a customer language proficiency, an agent language proficiency and an interactive voice response script.
26. The system of claim 21 , where the adjustment comprises at least one of informing the call of a poor audio transmission quality, requesting the- caller to change a phone, and adding at least one of video, text and chat to the session.
27. The system of claim. 21, where the adjustment comprises of transferring the caller to another agent or changing an interactive voice response script based on a language barrier with the caller.
28. The system of claim 21, where hard-to-understand comprises a contact center agent not able to understand a caller issue due to an experience level.
29. The system of claim 28, where the adjustment comprises transferring the caller to an agent with a proficiency for the issue.
30. A method, comprising:
extracting call information for a session with a cal ler;
determining, by a processor, whether the session with the caller is hard-to- understand based on the extracted call information; and
adjusting the session when the session is hard-to-understand.
31. The method of claim 30, where adjusting comprises at least one of Informing the call of a poor audio transmission quality, requesting the caller to change a phone, and adding at least one of video, text and chat to the session.
32. The method of claim 30, where adjusting comprises of transferring the caller to another agent or changing an interactive voice response script based on a language barrier with the caller.
33. The method of claim 30, where hard-to-understand comprises a contact center agent not able to understand a caller issue.
34. The method of claim 33, where adjusting comprises transferring the caller to an agent with a proficiency for the issue.
35. The method of claim 3.0, where hard-to-understand comprises at least one of a poor audio transmission quality and a language barrier.
36. The method of claim 35, where the poor audio transmission quality is determined by a mean opinion score,
37. The method of claim 35, where the language barrier comprises at least one of a customer language proficiency, an agent language proficiency and art interactive voice response script.
38. The method of claim 30, further comprising informing the caller of the hard-to-understand condition.
39. The method of claim 30, further comprising sending an email to the caller after the call to memorialize details of the call based on the session being hard-to- understand.
40. The method of claim 30, where hard-to-understand comprises a long duration call.
41. A system substantially as hereinbefore described with reference to the accompanying drawings.
42. A method substantially as hereinbefore described with reference to the aecompanymg drawings.
PCT/US2015/042755 2014-07-29 2015-07-29 System and method for addressing communication issues for contact center service quality WO2016019072A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US14/445,629 US20160036972A1 (en) 2014-07-29 2014-07-29 System and Method for Addressing Communication Issues for Contact Center Service Quality
US14/445,646 US20160036980A1 (en) 2014-07-29 2014-07-29 System and Method for Addressing Hard-To-Understand for Contact Center Service Quality
US14/445,646 2014-07-29
US14/445,629 2014-07-29

Publications (1)

Publication Number Publication Date
WO2016019072A1 true WO2016019072A1 (en) 2016-02-04

Family

ID=55218291

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2015/042755 WO2016019072A1 (en) 2014-07-29 2015-07-29 System and method for addressing communication issues for contact center service quality

Country Status (1)

Country Link
WO (1) WO2016019072A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6847714B2 (en) * 2002-11-19 2005-01-25 Avaya Technology Corp. Accent-based matching of a communicant with a call-center agent
US20060140389A1 (en) * 1999-08-02 2006-06-29 Stuart Robert O System and method for providing a service to a customer via a communication link
US7734032B1 (en) * 2004-03-31 2010-06-08 Avaya Inc. Contact center and method for tracking and acting on one and done customer contacts
US8589172B2 (en) * 2004-09-09 2013-11-19 At&T Intellectual Property Ii, L.P. Automatic detection, summarization and reporting of business intelligence highlights from automated dialog systems
US20140161249A1 (en) * 2012-12-11 2014-06-12 Genesys Telecommunications Laboratories, Inc. Contact center session preservation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060140389A1 (en) * 1999-08-02 2006-06-29 Stuart Robert O System and method for providing a service to a customer via a communication link
US6847714B2 (en) * 2002-11-19 2005-01-25 Avaya Technology Corp. Accent-based matching of a communicant with a call-center agent
US7734032B1 (en) * 2004-03-31 2010-06-08 Avaya Inc. Contact center and method for tracking and acting on one and done customer contacts
US8589172B2 (en) * 2004-09-09 2013-11-19 At&T Intellectual Property Ii, L.P. Automatic detection, summarization and reporting of business intelligence highlights from automated dialog systems
US20140161249A1 (en) * 2012-12-11 2014-06-12 Genesys Telecommunications Laboratories, Inc. Contact center session preservation

Similar Documents

Publication Publication Date Title
US20160036980A1 (en) System and Method for Addressing Hard-To-Understand for Contact Center Service Quality
US10277744B2 (en) System and method for performance-based routing of interactions in a contact center
US20160036972A1 (en) System and Method for Addressing Communication Issues for Contact Center Service Quality
US10375241B2 (en) System and method for automatic quality management in a contact center environment
US10902737B2 (en) System and method for automatic quality evaluation of interactions
US10038783B2 (en) System and method for handling interactions with individuals with physical impairments
US10290301B2 (en) Fast out-of-vocabulary search in automatic speech recognition systems
US10896395B2 (en) System and method for automatic quality management and coaching
US20140337072A1 (en) Actionable workflow based on interaction analytics analysis
AU2018216823B2 (en) System and method for speech-based interaction resolution
US20210174288A1 (en) System and method for predicting performance for a contact center via machine learning
US11528362B1 (en) Agent performance measurement framework for modern-day customer contact centers
US11367029B2 (en) System and method for adaptive skill level assignments
WO2016019072A1 (en) System and method for addressing communication issues for contact center service quality
WO2018064199A2 (en) System and method for automatic quality management and coaching

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15826989

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15826989

Country of ref document: EP

Kind code of ref document: A1