This invention relates to file storage and retrieval systems in general and more particularly to such systems and methods for adding metadata to files.
It is now commonplace to retrieve data files from storage locations. In the situation, files are stored on the same system handling the request for the file. Situations arise where it is desired to have several systems access a common set of files. This can be accomplished by setting up a file storage arrangement that is shared across a network. In such file storage arrangements, whether the arrangement serves one system or a plurality of systems, the desired file is retrieved by specifying the name of the file, as well as the full directory path to the file, when required. Thus, files may be moved about over the network arbitrarily.
Such an arrangement has problems when metadata is associated with the different files. Typically, metadata associated with a file is placed in a separate database. Thus, the database must be moved each time a file is moved.
One example of a complex file structure is the audio file structure used in interactive voice response (IVR) systems. In order for a script to play a specific audio file, the <audio> tag in the script must provide a fully resolved address or URL pointing to the web address where the desired audio file resides. The fully resolved address is a complete address to the file through the existing hierarchical structure. The server then directs the request to the desired address and the desired audio file is retrieved from the specified address.
While a hierarchical directory structure with full path-name access is effective, it has some drawbacks. Many applications, particularly media applications which use audio files, have thousands of such audio files. An application may have the same audio files recorded in multiple languages, multiple speakers, or different emotional tones (stern, happy). Deciding on a hierarchical folder and file-naming scheme can be a challenge. Should one file the files according to language? To speaker? To content? If one files according to language, how does one find all the files by a specific speaker? Thus, using a hierarchical storage structure and fully resolved path names to retrieve a particular file is cumbersome and often limiting. Storing files in a nonhierarchical structure and requesting the files by attribute, provides a more flexible approach to file access than hierarchical structures.
In one embodiment, metadata attributes of a desired file are sent along with a submit tag to a file server, and the specialized file server associates the metadata with the file before or during file storage. The metadata may be embedded within the file during the storage. The attributes may be used for subsequent retrieval of the file. Attributes, such as the text or title of the file, the language of the file, the creator of the file, etc, can all be associated with the file in the form of metadata. Files that are recorded without metadata can have metadata associated therewith and may be retrieved using the metadata.
In this patent's embodiment, metadata is NOT embedded into a file before the file is stored into the file server. For example, utterances recorded during the execution of an application, and streamed “whole call” recordings, both do not typically have metadata embedded when they are created. To facilitate metadata storages, the embodiment initially carries the metadata to the server, and the server embeds the metadata in the file as part of the storage process. The metadata may also be used to index the file.
In a further embodiment, the metadata and files are used in an IVR system. This embodiment may embed metadata in recorded audio files at the time the files are being stored on the file server. The files may then be located and retrieved using the metadata attributes instead of pathnames.
In a still further embodiment, aspects of the embodiment are intended for audio files that are created during the course of an application, when there are no processes that can be embed the metadata before sending the file to the audio file server. Note that prompts are typically created and their associated metadata is known well before their storage. However, it requires a detailed knowledge of the specific file structure to safely embed metadata in a file without damaging the original file data. Many applications do not have any processes that have the knowledge of file structures, so the embedding process is relegated to the specialized file server. However, aspects of the embodiment may be used with prompts.
For a more complete understanding of the present invention, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:
Embodiments of the invention define how a file can be placed on a server while embedding metadata attributes into the file. The metadata may have been created before, simultaneously, or after creation of the file. In some cases, the metadata may already have been embedded into or associated with its respective file before it is be stored on a file server. In such a case, a standard HTTP “PUT” may be used to store the metadata-embedded file on the file server in a generic folder. When the already-metadata-embedded file arrives at the server, it may be indexed and stored by the server. Placing files with pre-embedded metadata on the file server is described in U.S. patent application Ser. No. ______ [Attorney Docket No. 47524-P137US-10501428] entitled “SYSTEM AND METHOD FOR ORGANIZING FILES ON A FILE SERVER USING METADATA INDEXING AND A SEARCH ENGINE,” and U.S. patent application Ser. No. ______ [Attorney Docket No. 47524-P138US-10501429] entitled “SYSTEM AND METHOD FOR RETRIEVING FILES FROM A FILE SERVER USING FILE ATTRIBUTES,” the disclosures of which are hereby incorporated by reference, and is not covered in this patent.
Embodiments of the invention are directed to situations where a file to be stored does not have embedded or associated metadata, and the process that will create or send the file does not embed or associate metadata in the file. One specific problem with embedding metadata in files in general is that the embedding process needs understand the file structure and format of the file that is being modified to embed the metadata. If the embedding process does not understand the target file format, then the target file may be corrupted or the data in the file may be destroyed.
One embodiment of the invention is directed to metadata and files are used in an IVR system. When a VXML system records a message from a user, the file is stored on the VXML browser until the recording is completed. Once the recording has finished the file is moved to the file server using a VXML PUT command. However, without embodiments of the invention, this action would place the file on the file server with no embedded metadata. Embodiments of the invention put the file in the server, and embed metadata in the file. Other embodiments put the file on the file server and embed the metadata at the same time.
In system 20, an application server 21, creates a document using the VXML scripting protocol, and communicates the document with browser 22, using, for example, the standard HTTP protocol to the file server 23.
A script can use VXML <record> tag to do the recording and save the filename in “recordMessage” variable. The “recordMessage” will be passed as last parameter in the namelist in the storeMedia servlet call.
After the recording is done, the script can use <submit> in the <subdialog> to submit the recorded message to the file server. The first parameter in the namelist will be the staging directory that file server is configured to use. A result for this submit will be returned to the application “submitFile.result” and the recorded file name in file server will also be returned to the application “submitFile.uuid. Between the first and last parameters in the namelist will be a list of metadata goes into the message. In the following example, <appName, BankA>, <accountName, 12345678>, and <question, What is my checking account balance”> are the metadata <key, value> for this message. In this example, the hardcoded string is put into expr and application can get the information (metadata) from the caller and passed into the expr as variable.
Below is the VXML script sent back from file server so that browser can get the result and UUID. Application can get the result from the <submit> in the <subdialog> call. UUID can be used for future media retrieval from the file server.
A result for this submit will be returned to the application “submitFile.result” and the recorded file name in the file server will also be returned to the application “submitFile.uuid.”
The metadata attributes (and their respective values) with this example includes account number (12345678) and question (what is my checking account balance). Note that the record file name is also passed in the call. When HTTP servlet 26 of file server 23, namely Media Hub server, receives the servlet call, it will store the media file in its staging area, add the metadata to the media file via concatenation engine 25, and then move the media file to “production area” with UUID filename provided by UUID generator 24
Below is an example VXML script sent back from the file server to the browser so that browser can have the result and UUID for future use.
HTTP servlet 26 in the prompt server (process 210) fetches the file from the temp storage on the browser. Process 212 generates a UUID for the file. This generates a unique name for the file, and avoids duplicate names in the file storage system.
Process 212 inserts the metadata into the .wav file and the .wav file is moved, via process 213, to the production area of the prompt server. Process 214 sends a wake-up call to the file monitor, so that the new metadata may be indexed immediately instead of waiting for indexing on a timed basis.
Process 215 creates VXML page with the result and the UUID and process 216 sends the VXML page to the VXML browser, if desired. The purpose of sending this page is to give information to the browser for subsequent retrieval of the file.
Process 203, in the browser, determines when a new browser page has been received. When it has, process 204 processes the return such that the browser obtains the code and UUID from the created page and the UUID is saved (if desired) for future file retrieval purposes.
Note that embodiments of the invention are useful for the types of files that do not get metadata inserted in at file creation time, such as audio messages recorded on a voice browser during the course of an application, and audio streams of telephone conversations coming from the network. Files recorded on the VXML browser are usually recordings of user utterances such as voicemail messages, personal notes and comments, messages for transcription, etc. that are created during the process of running the application (unlike prompts, which are created before the application is run). These file typically need to be stored on the audio file server for later retrieval. The VXML “record” command can create these types of files on a VXML browser, but the VXML record command does not comprehend metadata in the recorded file. When the recording is finished on the browser, the complete recorded file is transmitted to the audio file server.
Full recordings of telephone conversations are typically streamed directly into the audio file server, and will not pass through the browser. Whatever process instigates the audio streaming should also inform the file server of the metadata to be added to the file after the recording is stopped. The final file with its inserted metadata is then placed into storage.
Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure of the present invention, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the present invention. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.
The present application is related to copending and commonly assigned U.S. patent application Ser. No. ______ [Attorney Docket No. 47524-P137US-10501428] entitled “SYSTEM AND METHOD FOR ORGANIZING FILES ON A FILE SERVER USING METADATA INDEXING AND A SEARCH ENGINE,” patent application Ser. No. ______ [Attorney Docket No. 47524-P138US-10501429] entitled “SYSTEM AND METHOD FOR RETRIEVING FILES FROM A FILE SERVER USING FILE ATTRIBUTES,” and patent application Ser. No. ______ [Attorney Docket No. 47524-P140US-10506201] entitled “SYSTEM AND METHOD FOR DEFINING, SYNTHESIZING AND RETRIEVING VARIABLE FIELD UTTERANCES FROM A FILE SERVER,” filed concurrently herewith, the disclosures of which are hereby incorporated by reference.