The present invention relates to a method and a system for transferring data contained in an electronic message, in particular in a MIME attachment, to a relational database.
Relational databases such as MYSQL (cf. http://www.myqsl.com/) or DB2 (http://www-306.ibm.com/software/data/db2/) are used commonly to store large amounts of data. Many organizations worldwide store critical data in relational databases. Relational database data is retrieved or updated using Structured Query Language (SQL, cf. “A Relational Model of Data for Large Shared Data Banks” E.F. Codd) commands. Relational databases consist of one or more tables where each table has one or more columns. Database table columns are most commonly simple data types such as character strings or numbers.
Relational databases have evolved to be able to contain more complex, non-structured data such as sound files, digital images or word processing documents. These new types of data are very large. Depending on the vendor, these columns can be up to 4 GB in size, or more. These are known as large objects (LOB). There are two types of large objects: Character Large Object (CLOB) and Binary Large Object (BLOB). Database columns containing LOB data are accessed like any other table column using SQL commands. Various databases use different terms for character and binary large object types. In Microsoft SQL Server (cf. http://msdn2.microsoft.com/en-us/library/ms187993.aspx), for example, TEXT columns are used to store variable length character data and IMAGE columns are used to store variable length binary data.
Businesses and organizations make relational databases available on the internet or intranet to support initiatives such as electronic commerce. It is common for input to or output from internet or intranet accessible databases to be exchanged in messages such as e-mail messages or files that are delivered using the file transfer protocol (FTP, cf. http://www.faqs.org/rfcs/rfc959.html) or hypertext transfer protocol (HTTP, cf. http://www.faqs.org/rfcs/rfc2616.html).
Messages can have a complex structure consisting of a message body and one or more body parts known as attachments. Messages and attachments are for example known as Multipurpose Internet Mail Extensions (RFC 2045, cf. http://www.ietf.org/rfc/rfc2045.txt). While RFC 2045 originally described the structure of e-mail messages it has also been applied to other types of messages such as Simple Object Access Protocol (SOAP, cf http://www.w3.org/TR/soap/) messages or Java Message Service (JMS, cf. http://iava.sun.com/products/jms/index.jsp) messages.
Message attachments can contain not only simple data types such as strings or numbers but also complex data such as JPEG or MP3 files or very large text files. Increasingly, organizations wish to store data such as JPEG files, MP3 files or large text files in relational databases. Conversely, organizations also wish to create messages with attachments containing such files stored in the database.
Prior systems for exchanging data between messages and relational databases do not provide a flexible and simple mechanism for transferring data between message attachments and relational database tables with CLOB or BLOB columns. Exchanging data between message attachments and relational databases requires coding of complex computer programs in the Java, C or other programming languages. Such programs are difficult to write and to change as databases and business requirements change.
The present invention is therefore based on the problem to provide a method and a system allowing a more flexible control of the data transfer between messages, in particular messages with MIME attachments, and relational databases.
According to one aspect of the invention, this problem is solved by a method for the transfer of data contained in an electronic message, in particular in a MIME attachment, to a relational database comprising the steps of
Accordingly, rather than using the fixed instructions of a custom code for processing the message so that the desired database access is performed, one embodiment of the present invention uses two different elements: The essentially static sequencer, which is an executable component and the sequence, which is a XML file, the entries of which determine how a message is processed by the sequencer to transfer data contained in the message to a relational database. Changing the entries of the XML sequence therefore allows modifying the processing performed by the sequencer and thereby the way the data contained in the message is transferred to or from the relational database. Since the overall process does not need to be fully coded and compiled again, such an adaptation can be easily performed in a short amount of time.
Preferably, the method further comprises loading of at least one SQL template, which is processed by the sequencer to provide the executable SQL statement. Using an SQL template is, however, not compulsory for the present invention. For example, the sequence may already contain one or more fully qualified SQL statements, which can be executed to access the data base. However, the use of templates, which are loaded by the sequencer in response to corresponding XML statements in the sequence, provides more structured and shorter sequences, which are easier to design and to understand. SQL templates may be used if the SQL statement has parameters.
In a preferred embodiment, the SQL template comprises at least one parameter, the value of which is determined by the processing of the message. Accordingly, the sequencer uses preferably the data derived from the message as parameter values for the SQL statement loaded from the template. Parameter data can be extracted from the message body or from an attachment. For example, a SQL template might have two parameters, wherein the value of a first parameter is determined by data in a message body and wherein the value of a second parameter is determined by data in a message attachment.
The execution of the at least one SQL statement provides in some embodiments output data which are stored as data in an output message, wherein the output data comprise preferably also a first and a second parameter, wherein the value of the first and the second parameter are stored in the message body and the attachment of the output message, respectively. Accordingly, this more advanced embodiment of the method of the invention allows to use a message, such as an email message with a MIME attachment, not only for adding a new data element to the relational database but also to retrieve one or more data elements, which can be n sent to a recipient as a message having possibly one or more attachments
According to a further aspect, one embodiment of the present invention is directed to a system for transferring data from a message, in particular a MIME attachment, to a relational database, the system comprising a XML sequence, which determines, how the data is to be processed, an executable sequencer, adapted to process the message based on the XML sequence, wherein the sequencer is further adapted to execute at least one SQL statement for accessing the database, while processing the message. This system allows to perform any of the above described methods.
Further dependent claims relate to preferred embodiments of the invention.
In the following detailed description presently preferred embodiments of the invention are described with reference to the drawings which show:
a-b: An example of a message and a corresponding table in a relational database;
In the following, presently preferred embodiments of the method and the system according to the invention are described with reference to the processing of an email with a MIME attachment. However, it is to be understood that the present invention can in principle also be used for other types of messages such as SOAP or Java Message Service messages and other types of attachments. Further, the described method and system also allow to process messages without any attachments at all.
a schematically presents an email 10 with data to be inserted into a relational database. The email 10 comprises a message body 11 and a MIME attachment 12. In the example, the message body 11 contains a brief XML statement with data 13 and the attachment 12 contains data 14 that needs to be inserted into the relational database (not shown in
b present an example of the intended result of the described method and system, namely a table 20 of a relational database, wherein a new row 21 has been added comprising the data 13 of the message body in the first column 22 and the data 14 of the MIME attachment in the second column 23. However, there are many other ways how the data 13 and/or the data 14 of the email 10 can be inserted into the relational database.
The sequencer 30 processes the message 110, which leads to an access of the database 40. In addition to the message 110, the sequencer 30 receives a sequence 50 as an input. The sequence 50, an example of which is shown in
The double headed arrows in
In a next step 204 a decision is made whether to load a SQL template 60 or not. As mentioned above, using an SQL template is required if the SQL statement has parameters. The group of XML statements 52 shown in
An exemplary SQL template is shown in
Once the SQL template 60 has been loaded in step 205 (cf.
The steps 207-209 show that the described method can also be used for extracting parameters from the message body or the attachment of the message, if desired.
Once the parameter values have been inserted, the fully defined SQL statement is executed (step 210), which leads to the result shown in
The fully defined SQL statement is then executed by the sequencer using suitable JDBC (Java Database Connectivity) calls.
The further steps 211-215 below the dashed line in the flowchart of
If return data are found in step 211, the steps 212-214 are performed to identify, whether the return data are to be included in the messages body and/or the attachment of an outgoing message. ContentID parameters are preferably included in the attachment (cf. steps 212 and 213), whereas other parameters are preferably included in the message body (cf. step 214). The outgoing message is prepared in step 215.
Step 217 shows that the sequencer 30 finally disconnects from the database. Corresponding XML statements can again be found in the sequence 50 of
It is apparent from the above description that the processing flow shown in
Number | Name | Date | Kind |
---|---|---|---|
6678705 | Berchtold et al. | Jan 2004 | B1 |
6816885 | Raghunandan | Nov 2004 | B1 |
6882996 | Preisig et al. | Apr 2005 | B2 |
7152090 | Amirisetty et al. | Dec 2006 | B2 |
20020046248 | Drexler | Apr 2002 | A1 |
20020065892 | Malik | May 2002 | A1 |
20030018832 | Amirisetty et al. | Jan 2003 | A1 |
20040107250 | Marciano | Jun 2004 | A1 |
20040162827 | Nakano | Aug 2004 | A1 |
20050228824 | Gattuso et al. | Oct 2005 | A1 |
20050234843 | Beckius et al. | Oct 2005 | A1 |
20070022078 | Gupta et al. | Jan 2007 | A1 |
20070118601 | Pacheco | May 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20070271251 A1 | Nov 2007 | US |