A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The invention relates generally to data integration methods and specifically to a system architecture for enabling application data integration.
The trend in enterprise or distributed computing, and in information processing in general, is toward sharing or integration of data sources between a variety of otherwise disparate applications. The traditional data processing architecture utilizes large-scale database systems, rarely if ever linked together by any intelligent means, that are used to serve up data to independent, focused, and often heavily customized applications. As a result of the historical legacy in computer architecture and the prevailing use of traditional methods of data processing, much useful data is locked in application “stove pipes”, creating islands of information whose full potential is not fully realizable.
Integrating these islands of information is a complex and often costly exercise. Typically such projects require the skills of experienced data experts, and the need for expensive one-off data interfaces for each individual project. The result is a complex information technology (IT) environment, with continuously increasing maintenance needs and costs.
To address these issues, some system providers have turned to data warehousing techniques to better share and facilitate data exchange between an enterprise suite of applications. However, data warehousing alone cannot provide the answer—in most cases the data becomes stale too rapidly for meaningful or reliable integration. Data warehouses are of most use in decision support systems that rely on the ability to quickly scan a database or data repository and base a decision-making process on, and are of much less when the data changes rapidly. Enterprise Application Integration (EAI) systems have attempted to bring data warehousing benefits to the application level, but they typically demand procedural, synchronous programming that is highly optimized for tightly coupled applications. However, no mechanism currently exists for reliably and tightly coupling the wide variety of applications with the underlying data in a fully integrated manner.
The invention includes a software architecture that provides applications with uniform access to data that is available from heterogeneous data sources. Applications access such data, hereafter referred to as “Liquid Data”, through a set of unified XML views that have been derived from the schemas of the underlying data sources. Liquid Data provides a concept of data independence in the sense that application developers can write application components that do not need to be changed if an underlying data source changes. Benefits of the invention include the following:
Other benefits and features provided by the invention include a flexibility in adapting to changes in data sources and views, ease of application and data extension (across the Enterprise, or the Internet, etc.); data independence from changes in data sources; ease of maintenance/updates; reduced complexity in building applications; unified access to heterogeneous data; ability to contract development over less skilled resources; ability to focus development teams not on application development itself but on conversations, and assembly of network applications.
The following terms are used throughout this document, and take the following meaning and their equivalents:
To take full advantage of the invention, and due to the fact that extensive use is made of XML, very efficient XML parsers and compiled XSLT are used. XSLT is sufficient for transforming the structure of an XML document, but Java/XSL plug-ins can be used for actual data transformations or complex operations such as converting a birth date to an age.
Because of the associated overhead and complexity, Liquid Data is most appropriate for multiple distributed data sources, rather than those with simple data requirements. The actual of retrieving the data from the Liquid can be implemented by using finders like an entity bean, instead of the use of general queries.
To maximize performance, caching is used for the input argument transformations, and especially for key arguments. The information architect responsible for the initial configuration of the system should ideally be well versed in the various data resources available, due to the complexity of integrating multiple data schemas. The effort for the application developer in developing applications is however greatly reduced.
Liquid Data Framework
Within the framework itself, a binder mechanism is used to bind requests from the client applications to the framework itself. A merger 138, that includes a query processor, takes these requests and processes them in a way that makes sense to the liquid data framework. The actual mechanism and rules used to process requests can be specified by a data analyst 192 during a setup procedure, and the processing rules themselves stored in a meta schema database 148. A liquefier passes the now processed requests to the adaptors for relay to the data resources. The actual format in which the requests are phrased can be determined by a plurality of XML files 190 that define the liquefier's data handling properties.
When the data resource responds with actual data, the liquid data framework is used to translate the data into a format usable by the client or portal application. As responses or results are passed back to the merger, a view manager 140 molds or shapes the results into a format understandable by the calling application. At this point, a custom view may be used to filter the data to meet the specific needs of the portal application. When properly formed, the response is bound, again using the binder, and returned to the calling application 130.
Unified Data Description Mechanism
A feature of the invention is its ability to provide a unified data description mechanism, that can then be used to provide customized and yet tightly coupled access for a client application to one or more data resources. A large number of data resources can be accessed simultaneously, providing the illusion of a totally integrated, enterprise-spanning, and uniformly integrated data source.
Sample Implementation
The following section briefly illustrates some advantages of the invention when applied to a specific sample implementation, such as that of an airline customer service and ticketing system.
Customer Service Application
In this scenario three data sources are used—Main customer data source 416; Frequent flyer data source 418; and Marketing data source 414. One unified customer view 408 provides the main customer view. One application data view provides the customer service view 412. This scenario illustrates how to use Liquid Data for an application, such as a Customer Service Rep application in this example.
Customer Self Admin Application
Here, we are using the same back-end data sources (414, 416, 418) and the same unified customer view 408. A self-admin application view 410 can be used to access a different view of the data via the common unified customer view 408. This scenario shows how an existing data infrastructure can be re-used.
Data Source Modification
This scenario shows that a back end data source change requires only a change to an XSL transform. This can be taken care of with a graphical tool.
Best Effort Programming
If a data source is unavailable, the Liquid Data framework will provide as much data as possible.
The invention is particularly useful in providing flexibility in adapting application access to changes in data sources and views. The techniques provided herein can be extended with ease to other data distribution models (Enterprise, Internet, etc). Data independence can be maintained throughout changes in data sources. The invention is also useful for allowing ease of maintenance/updates, and reduced complexity in building applications. A feature of the liquid data framework provided by the invention is its unified access to heterogeneous data. This allows an open, flexible, standards based foundation for a new programming model or methodology. Software development can be accomplished by less skilled resources, and development teams can be focused on conversations and assembly of network applications. As such, ISV's can easily adapt to the changes in the underlying framework.
The present invention may be conveniently implemented using a conventional general purpose or a specialized digital computer or microprocessor programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
In some embodiments, the present invention includes a computer program product which is a storage medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art. Particularly, it will be evident that while the examples described herein illustrate how the invention may be used in a WebLogic environment, other application servers and systems may use and benefit from the invention.
The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalence.
This application claims the benefit of U.S. Provisional Application No. 60/381,026, filed May 16, 2002, and is incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
6065002 | Knotts et al. | May 2000 | A |
6167405 | Rosensteel, Jr. et al. | Dec 2000 | A |
6330569 | Baisley et al. | Dec 2001 | B1 |
6704724 | Ellmann et al. | Mar 2004 | B1 |
6725212 | Couch et al. | Apr 2004 | B2 |
6785673 | Fernandez et al. | Aug 2004 | B1 |
6915507 | Kaler et al. | Jul 2005 | B1 |
6917939 | Baer et al. | Jul 2005 | B1 |
7035842 | Kauffman et al. | Apr 2006 | B2 |
7058698 | Chatterjee et al. | Jun 2006 | B2 |
7146399 | Fox et al. | Dec 2006 | B2 |
7152062 | Draper et al. | Dec 2006 | B1 |
7197491 | Chou et al. | Mar 2007 | B1 |
20020046209 | De Bellis | Apr 2002 | A1 |
20020069081 | Ingram et al. | Jun 2002 | A1 |
20020147810 | Traversat et al. | Oct 2002 | A1 |
20030105884 | Upton | Jun 2003 | A1 |
20040030685 | Helles et al. | Feb 2004 | A1 |
20040128276 | Scanlon et al. | Jul 2004 | A1 |
20060020586 | Prompt et al. | Jan 2006 | A1 |
20070038610 | Omoigui | Feb 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20040088370 A1 | May 2004 | US |
Number | Date | Country | |
---|---|---|---|
60381026 | May 2002 | US |