A computer program listing appendix stored on compact disc, submitted herewith in duplicate, is provided. Each disc contains the files Confirmation.doc, size 182,784 bytes and created on May 7, 2007; XML Sample.doc, size 224,256 bytes and created on May 7, 2007; VoiceXML Example.doc, size 434,176 bytes and created on May 7, 2007; GlobalXSLVars.doc, size 198,144 bytes and created on May 7, 2007; and XML Structure.doc, size 52,736 bytes and created on May 25, 2007. The contents of each disc are hereby incorporated by reference in their entirety.
VoiceXML is an industry-standard markup language developed by a consortium for building distributed Internet-based voice applications. The language enables Web authors and designers to create applications similar to HTML but with audio functionality. VoiceXML is designed to create audio dialogs with the goal of bringing the advantages of Web-based development and content delivery to interactive voice response applications.
Traditional voice/audio-based applications employ VoiceXML code, which is embedded within a Java Server Pages (JSP) application, to invoke dynamic audio functionality. The resulting JSP file, however, is cluttered with Java code and VoiceXML code and is difficult to debug. Thus, maintenance and testing are problematic. Further, development of a VoiceXML document is a lengthy and time-consuming process, with code being duplicated from document to document to implement various functions. Accordingly, what is needed is an efficient tool that helps developers rapidly build VoiceXML applications from reliable, time-tested components in a streamlined manner, so that even non-technical personnel can develop voice/audio-based applications.
The accompanying figures depict certain illustrative embodiments and may aid in understanding the following detailed description. The embodiments depicted are to be understood as exemplary and in no way limiting of the overall scope of the invention. The detailed description will make reference to each of the figures, in which:
Throughout the drawings, like reference numbers refer to like elements, features, and structures.
The matters exemplified in this description are provided to assist in a comprehensive understanding of various exemplary embodiments disclosed with reference to the accompanying figures. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the exemplary embodiments described herein can be made without departing from the spirit and scope of the claimed invention. Descriptions of well-known functions and constructions are omitted for clarity and conciseness.
VoiceXML documents operating under voice portal applications include a series of dialogs that facilitate operational flow through the application by invoking specific caller-centric events. The dialogs are formed of modules that include event classes such as, for example, play prompt modules, branch-on-condition modules, database look-up modules, user input collection modules, and confirmation modules. To simplify the process of forming VoiceXML documents from dialogs, the dialogs are divided into two components: XML and XSL dialog modules. This division yields a clean separation between data (the XML component) and behavior (the XSL component). Because core behavioral aspects generally do not change from application to application, a library of behavior templates can be developed and made available for reuse in various applications. Each behavior template encapsulates the core behavior for which it was developed. An example of a behavior suitable for a template is getting an address from a caller. As used herein, a “user” can be a human operator or an authorized application, that is, a subscribing program operating automatically with or without human intervention. Usage of the program can be real-time or otherwise.
Behavior templates, which are formed of a set of prompts and conditions, can be characterized as representing an “information providing” state and an “information gathering” state. In an information providing state, appropriate prompts are executed and information is provided based on run-time values and conditions. In an information gathering state, prompts and conditions are employed to request that a user speak, to recognize the spoken input, and then to return a confirmation of what was recognized.
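The information gathering state described above can be pictured as a small prompt, recognize, and confirm loop. The following Python sketch is illustrative only and is not taken from the appendix. The function name `gather`, its parameters, the retry limit, and the simulated caller are all assumptions introduced here; in an actual voice portal these steps would be carried out by the VoiceXML interpreter.

```python
from typing import Callable, Optional


def gather(prompt: str,
           recognize: Callable[[str], Optional[str]],
           confirm: Callable[[str], bool],
           max_attempts: int = 3) -> Optional[str]:
    """Prompt, recognize, and confirm until the caller accepts a value.

    All names here are hypothetical stand-ins for interpreter behavior.
    """
    for _ in range(max_attempts):
        heard = recognize(prompt)   # request that the caller speak
        if heard is None:           # nothing recognized: prompt again
            continue
        if confirm(heard):          # read the value back for confirmation
            return heard
    return None                     # give up after max_attempts


# Simulated caller: the first utterance is not recognized, the second is.
_utterances = iter([None, "10001"])
result = gather(
    "Please say your ZIP code.",
    recognize=lambda prompt: next(_utterances),
    confirm=lambda value: value == "10001",
)
```

Passing the recognizer and the confirmation step in as callables keeps the loop itself free of application data, which mirrors the way a behavior template is kept separate from the prompts it plays.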
Behavior templates are formed of XSL templates that can be coupled with an XML skeletal framework or structure to enable subsequent data, that is, the XML component, to operate with the template. By way of example, code for an XML structure is provided in the accompanying computer program listing appendix in the file XML Structure.doc. The XSL template coupled with the XML structure forms an XSL dialog module (XDM). XDMs are reusable and customizable and include XSL template files containing logic inherent to voice applications. XDMs specify an XML structure for an XML file that is used along with an XSL template when generating a VoiceXML file. The XSL template, together with an XML file conforming to the specified XML structure, generates a VoiceXML file for interpretation by a VoiceXML interpreter. Because XDMs address routine core behavior, once developed they can be retained to form a library of reusable software components that is time-tested and reliable. Additionally, because application behavior is localized to the templates, changes and enhancements to library XDMs will be implemented globally.
XSL templates are, generally, oriented toward a particular task and once created rarely require modification. Thus, development of voice portal applications is reduced to the writing of XML files with appropriate prompts, conditions, range of valid inputs (grammar), and other static resources. And as long as the XML data component is correct, the XDM need not be tested on each use. With such a simplification, even non-technical users can write voice portal applications. Such a development tool lends itself ideally to a web-based user interface.
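Because correctness of the XML data component is what matters on each use, a simple structural check of an XML instance can be useful before a template is applied. The following Python sketch is one possible way to do this and is not part of the appendix. The element names `prompts`, `conditions`, and `grammar`, the `dialog` root, and the file name `digits.grxml` are hypothetical; the actual structure is defined in the appendix file XML Structure.doc and is not reproduced here.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical element names chosen for illustration only.
REQUIRED_ELEMENTS = ("prompts", "conditions", "grammar")


def missing_elements(xml_text: str) -> List[str]:
    """Return the required child elements absent from an XML instance."""
    root = ET.fromstring(xml_text)
    # Compare against None: an element with no children is falsy in ElementTree.
    return [name for name in REQUIRED_ELEMENTS if root.find(name) is None]


complete = """
<dialog type="getDigits">
  <prompts><prompt>Please enter your account number.</prompt></prompts>
  <conditions/>
  <grammar src="digits.grxml"/>
</dialog>
"""
incomplete = "<dialog type='getDigits'><prompts/></dialog>"

complete_gaps = missing_elements(complete)
incomplete_gaps = missing_elements(incomplete)
```

A check of this kind could sit behind a web-based authoring interface so that a non-technical author is told which parts of the XML file are still missing.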
The XDM logic can include logic inherent to voice portal applications that allows applications to interact with callers. Such logic anticipates possible user interaction scenarios, and each XDM can be tailored to encompass user interaction where a specific type of input is expected from a caller. Common interaction scenarios include input collection (for example, capturing general text), getting confirmation from a caller (for example, yes/no response), repeats, hang-ups, capture of a 10-digit telephone number, getting an amount paid by the caller (that is, currency), getting a postal address, getting apartment/suite information, and simply executing prompts for the caller. These processing routines and run-time event handling routines should be compatible with a voice portal application and implementable using different XDMs.
XDMs can operate in tandem with XML instances for a given dialog, and each XDM can be applied to a variety of XML instances of a particular type without necessitating change to the XDM, which facilitates reusability of the XDMs. Thus, since XDMs are reusable, the XML instance is the only component customized for each dialog, thereby resulting in a savings in development time. Additionally, this development paradigm allows developers to focus their test cases on situations pertaining to a specific XML instance; the developer need not worry about having to test the common voice portal behaviors in the XDM, which, again, facilitates quick development cycles.
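The reuse described above, one module applied unchanged to several XML instances, can be illustrated with a short Python sketch. This is a stand-in only: the function `play_prompt_template` plays the role of a play-prompt XDM but is ordinary Python, not XSLT, and the `dialog` and `prompt` element names of the input are hypothetical. The output uses the standard VoiceXML elements `vxml`, `form`, `block`, and `prompt`, with the namespace declaration omitted for brevity.

```python
import xml.etree.ElementTree as ET


def play_prompt_template(xml_text: str) -> str:
    """Stand-in for a play-prompt XDM: fixed behavior, variable data."""
    instance = ET.fromstring(xml_text)
    # A real VoiceXML document would also declare the VoiceXML namespace.
    vxml = ET.Element("vxml", version="2.0")
    form = ET.SubElement(vxml, "form", id=instance.get("id", "main"))
    block = ET.SubElement(form, "block")
    for prompt in instance.iter("prompt"):   # the data comes from the instance
        ET.SubElement(block, "prompt").text = prompt.text
    return ET.tostring(vxml, encoding="unicode")


# Two XML instances of the same hypothetical type; only the data differs.
welcome = "<dialog id='welcome'><prompt>Welcome.</prompt></dialog>"
goodbye = ("<dialog id='goodbye'><prompt>Thank you for calling.</prompt>"
           "<prompt>Goodbye.</prompt></dialog>")

welcome_vxml = play_prompt_template(welcome)
goodbye_vxml = play_prompt_template(goodbye)
```

The template function is never edited between the two calls, so any test effort can concentrate on the two instances rather than on the shared behavior.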
A representative collection of XDMs that serve different caller interactions includes, but is not limited to, the following:
getAddress.xsl (which is used to call individual address collection modules as a subdialog);
getAddressStreet.xsl (which is used to collect a street address);
getApartment.xsl (which is used to collect an apartment/suite number);
getAddressConfirmation.xsl (which is used to confirm the address collected);
getCurrency.xsl (which is used to collect currency input from a user);
getDate.xsl (which is used to collect a date);
getDigits.xsl (which is used to collect digits);
getGeneric.xsl (which is used to collect general input based on grammar);
getMultiDigits.xsl (which is used to collect multiple digits);
getNBestGeneric.xsl (which is used for N-Best collection);
confirmation.xsl (which is used for confirmation of input);
playPrompt.xsl (which is used for play prompt UI modules); and
playHours.xsl (which is used for playing hours).
By way of example, code for confirmation.xsl is provided in the accompanying computer program listing appendix in the file Confirmation.doc. In support of this XDM, code for GlobalXSLVars.xsl is also provided in the appendix in the file GlobalXSLVars.doc. GlobalXSLVars.xsl is a file included by reference in the program code for confirmation.xsl. So-called “include” files are modules containing logic and functionality common to a number of XDMs. By locating such common code in a separate file that can be shared among XDMs, changes to the code can be made in a single file yet take effect across all XDMs that incorporate the code via the include reference.
In order to form a VoiceXML document for each User Interface module, an XSL transformation is applied to the XML instance using an XDM appropriate for the module type. This results in a VoiceXML document that can then be passed to the VoiceXML interpreter to be interpreted and played out to a user. By way of example, code for a VoiceXML document is provided in the accompanying computer program listing appendix in the file VoiceXML Example.doc. This example VoiceXML document is formed from the XML instance provided in the accompanying computer program listing appendix in the file XML Sample.doc, together with confirmation.xsl.
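The selection of an XDM by module type, followed by generation of a VoiceXML document, can be sketched as a lookup table of template functions. The Python below is a simplified illustration under stated assumptions and does not reproduce the appendix code: the functions stand in for confirmation.xsl and getDigits.xsl but are not XSLT, the `type` attribute and `prompt` element of the input are hypothetical, and the VoiceXML namespace is omitted. The `boolean` and `digits` field types are built-in grammar types in VoiceXML 2.0.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict


def confirmation(instance: ET.Element) -> ET.Element:
    """Stand-in for confirmation.xsl: ask a yes/no question."""
    form = ET.Element("form", id="confirm")
    field = ET.SubElement(form, "field", name="answer", type="boolean")
    ET.SubElement(field, "prompt").text = instance.findtext("prompt", "")
    return form


def get_digits(instance: ET.Element) -> ET.Element:
    """Stand-in for getDigits.xsl: collect a string of digits."""
    form = ET.Element("form", id="digits")
    field = ET.SubElement(form, "field", name="value", type="digits")
    ET.SubElement(field, "prompt").text = instance.findtext("prompt", "")
    return form


# Hypothetical registry keyed by the module type named in the XML instance.
XDM_REGISTRY: Dict[str, Callable[[ET.Element], ET.Element]] = {
    "confirmation": confirmation,
    "getDigits": get_digits,
}


def to_voicexml(xml_text: str) -> str:
    """Pick the template for the instance's type and build the document."""
    instance = ET.fromstring(xml_text)
    template = XDM_REGISTRY[instance.get("type")]  # KeyError if type is unknown
    vxml = ET.Element("vxml", version="2.0")
    vxml.append(template(instance))
    return ET.tostring(vxml, encoding="unicode")


document = to_voicexml(
    "<dialog type='confirmation'><prompt>Did you say 555 1234?</prompt></dialog>"
)
```

The resulting string is what would be handed to the VoiceXML interpreter. Supporting a new module type in this sketch means adding one function and one registry entry, leaving existing entries untouched.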
In an exemplary embodiment, one or more servers 220 . . . 220n house a computer program product for causing creation of XDMs in a network 200. Although shown as a discrete unit in
In another exemplary embodiment, the network system of
As discussed, certain exemplary embodiments can be written as computer-readable code/instructions/programs and can be implemented in digital computers that execute the code/instructions/programs using a computer readable medium. Examples of a computer readable medium include magnetic storage media (for example, ROM, floppy disks, hard disks, among others), random-access memory (RAM), optical recording media (for example, CD-ROMs or DVDs), and storage media such as carrier waves (for example, transmission through the Internet). The computer readable medium can also be distributed over network-coupled computer systems so that the computer readable code/instructions/programs can be stored and executed in a distributed fashion. Further, functional programs, code, and code segments for implementing this disclosure can be easily construed by programmers of ordinary skill in the art to which this disclosure pertains.
While the present application has been particularly shown and described with reference to certain exemplary embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and detail may be made therein without departing from its spirit and scope as defined by the appended claims and equivalents thereof.