The present invention relates to the field of Internet interactivity and, more particularly, to a system for accessing and invoking automation objects over the Internet.
In the early days of desktop computing, all applications were monolithic, i.e., they were self-contained, standalone programs. As good as these programs were, a problem still existed with these monolithic applications. Development of traditional software applications required the application executables to be compiled and linked with their dependencies. Thus, every time developers wanted to update the processing logic or implement new capabilities, they would have to modify and recompile the entire primary application in order to do so. In essence, in order to make any changes to any portion of the program, the entire application had to be rewritten. This made it impractical to upgrade the application as minor improvements were made.
This problem was addressed by the introduction of a component software paradigm. A basic principle of component software is that applications can be built from a series of prebuilt and easily developed, understood, and changed software modules called components, each providing a particular function. Thus, applications could be delivered, enhanced, or extended much more quickly and at a lower cost simply by updating or adding new components.
Unfortunately, the component software paradigm suffers a problem similar to that of the monolithic application. Each time the components are enhanced and upgraded, as with applications, the components must be recompiled by the component developers. Either the application developer or the end user would have to monitor for and obtain updated components. The distributed component paradigm has provided a solution to this problem.
Distributed components exist at specific locations. Developers of applications or other components that require a distributed component need only find the component and then use it. The developer does not need to compile or recompile the component. This is done by the creators of the component. Thus, the latest and greatest version of each component is always available to developers and other users.
The widespread use of the Internet, an open environment, presents many new opportunities for distributed component software, and some associated shortcomings as well. The availability of a vast number of vendors, each creating a number of components, increases the ease with which applications may be built and increases the flexibility to tailor an application to suit a user's needs. Unfortunately, the open environment of the Internet means that no one can implicitly “trust” everyone else, as is the case with a traditional client-server system. Thus, all but some dedicated server machines are hidden behind firewalls to protect against unwanted intrusions. Firewalls are barriers that filter packets based on certain criteria, such as a type of packet, and/or based on an Internet address. Firewalls shield servers by controlling traffic between the Internet and the server and controlling which packets may pass through them.
Since only certain types of packets may pass through to the server when firewalls are in place, the ability to access remote components over the Internet is severely limited.
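The packet filtering described above can be illustrated with a minimal sketch. It assumes a hypothetical policy in which only traffic to the HTTP port is admitted and one address range is blocked; the `Packet` class, the port set, and the blocked network are illustrative assumptions, not part of the described system.

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network


@dataclass(frozen=True)
class Packet:
    source: str     # source Internet address of the packet
    dest_port: int  # destination port, used here as a proxy for "type of packet"


# Hypothetical policy: admit only HTTP (port 80) and block one example network.
ALLOWED_PORTS = {80}
BLOCKED_NETWORKS = [ip_network("203.0.113.0/24")]


def permits(packet: Packet) -> bool:
    """Return True if the packet may pass through the firewall."""
    src = ip_address(packet.source)
    # Filter on Internet address first, then on packet type.
    if any(src in net for net in BLOCKED_NETWORKS):
        return False
    return packet.dest_port in ALLOWED_PORTS
```

Under this assumed policy a request to port 80 passes, while a request to any other port is dropped. This is the kind of restriction that motivates layering a protocol on top of HTTP.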
A second limitation of today's Internet is less a problem than a shortcoming. Much of the Internet's use is conducted through the World Wide Web, hereinafter referred to as “WWW,” or simply the “Web,” in which linked pages of static content, composed of a variety of media, such as text, images, audio, and video, are described using Hypertext Markup Language (HTML). While the WWW revolution put a wealth of information at the fingertips of ordinary people, and while HTML is a very good way of describing static documents, it provides no means to interact with the Web pages. In this static model, a Web browser uses the Hypertext Transfer Protocol (HTTP) to request an HTML file from a Web server. HTTP is an Internet protocol designed for rapid and efficient delivery of HTML documents. HTTP is a stateless protocol, meaning that each request to the Web server is treated independently, with the server retaining no “memory” of any previous connections. The Web server receives the request and sends the HTML page to the Web browser, which formats and displays the page. Although this model provides a client with ready access to nicely formatted pages of information, it provides only limited interaction between the client and the Web server. Furthermore, HTML pages must be manually edited in order to change what the Web server sends to a client, such as a Web browser. Thus, much of the potential richness of the World Wide Web is not fully realized.
One of the biggest challenges to any Web site is to offer dynamic content, i.e., content that changes in real time. This requires applications to be run from the Web servers. Changing from a static Web content model to a dynamic Web content model would allow WWW content providers to provide interactive business applications rather than merely publishing pages of static information. For example, a travel agency could enable customers to check available flights, compare fares, and reserve seats on flights, rather than merely looking at flight schedules.
HTTP is not well-suited for implementing dynamic Web pages because interacting with Web pages potentially involves a large number of requests. In a typical scenario, a client, such as a web browser, is used to initiate a query, which is sent to an HTTP server operating on a host computer somewhere on the Internet. The query might represent a request for documents containing certain data, or may represent the address, or Uniform Resource Locator (URL), of a particular Web page. The server locates the documents and sends their contents back to the client. In loading the documents for viewing, the client often encounters additional files such as embedded images or sounds, that need to be loaded. The client continues making requests to the server until all of the additional files are received and loaded.
Since HTTP is a stateless protocol, as mentioned above, existing HTTP servers create a separate process for each request received. The greater the number of concurrent requests, the greater the number of concurrent processes created by the server. Unfortunately, creating a process for every request is time-consuming and requires large amounts of server resources such as memory and processor cycles. In addition, creating a process for every request can restrict the server resources available for sharing, slowing down performance, and increasing wait times.
In summary, since most servers are protected by firewalls, only certain types of packets, such as HTTP packets, may pass through to the server, and since HTTP is not suited for interactivity, the goal of providing dynamic content over the Internet is severely limited. Thus, in order to fully realize the potential of distributed component software and of dynamic content on the World Wide Web, there exists a need for software having the ability to access and invoke Automation objects through firewalls.
In accordance with the present invention, a method and software program that provides end users and developers with all the advantages of distributed component software, and capitalizes on the resources available on a computer network such as the Internet, to provide a richer, more interactive content is provided. The invention achieves this result by defining a protocol capable of accessing and invoking methods in Automation objects across the Internet and through firewalls. The protocol, called a Simple Object Access Protocol (SOAP), is an application layer protocol that is layered on top of HTTP and allows Microsoft Component Object Model (COM) Automation objects to be accessed and methods to be invoked over the Internet through Web servers protected by firewalls. “Application layer” refers to the highest layer in the seven-layer Reference Model for Open Systems Interconnection (OSI Reference Model), an international standard for networking by the International Standards Organization (ISO). The application layer is concerned with the semantics of the information exchanged; it ensures that two application processes performing an information processing task on either side of a network understand each other. The OSI Reference Model, as described in “Open Systems Interconnection (OSI)—New International Standards Architecture and Protocols for Distributed Information Systems,” special issue, Proc. IEEE, vol. 71, no. 12, December 1983, is hereby incorporated by reference.
The inventive protocol includes a data structure which encodes, as a SOAP request, the name of the Automation object of interest, a method to invoke in that object, and any valid Automation [in] and [out] parameters to be exchanged with the object, and creates a client-side SOAP proxy for the Automation object. The range of valid parameter types is defined by the COM Automation Variant type. In addition to Variant data types, the protocol also supports passing ActiveX Data Objects (ADO) Recordset objects. Variant and Automation “object” classes such as the ADO Recordset may be used as [in], [out], or [in, out] parameters. The SOAP proxy packages the SOAP request into a multipart MIME type.
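The packaging step can be sketched with Python's standard `email` package. This is only an illustration under stated assumptions: the header names `X-SOAP-Progid` and `X-SOAP-Method`, the function `package_request`, and the use of one MIME part per [in] parameter are hypothetical choices, since the text does not specify the wire layout at this level of detail.

```python
from email.mime.application import MIMEApplication
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText


def package_request(progid: str, method: str, in_params: dict) -> MIMEMultipart:
    """Package an object name, method name, and [in] parameters as multipart MIME."""
    message = MIMEMultipart("mixed")
    # Hypothetical header names identifying the Automation object and method.
    message["X-SOAP-Progid"] = progid
    message["X-SOAP-Method"] = method
    for name, value in in_params.items():
        if isinstance(value, bytes):
            # Binary values travel as application/octet-stream parts.
            part = MIMEApplication(value)
        else:
            # Everything else is sent as text in this simplified sketch.
            part = MIMEText(str(value), "plain", "utf-8")
        part.add_header("Content-Disposition", "form-data", name=name)
        message.attach(part)
    return message
```

Calling `package_request("Demo.Flights", "GetFares", {"origin": "SEA"}).as_string()` would produce the text of such a multipart message. The object and method names here are invented for the example.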
MIME, which stands for Multipurpose Internet Mail Extensions, is an extension to the traditional Internet Mail protocol to allow for multimedia electronic mail. MIME was developed to accommodate electronic mail messages containing many parts of various types such as text, images, video, and audio. MIME is defined in Document RFC 1521 of the Network Working Group, September 1993, which is hereby incorporated by reference.
The SOAP proxy marshals and transfers the multipart MIME-encoded SOAP request to an Applications Programming Interface (API) which acts as a server-side SOAP stub for processing SOAP messages. Marshaling is the process of packaging up the data so that when it is sent from one process to another, the receiving process can decipher the data. The SOAP stub, which is running on the Web server, unpacks and parses the SOAP request, instantiates the COM Automation object, and invokes the method with the marshaled [in] parameters. The SOAP stub also returns any [out], [in, out], or return parameters from the COM Automation object instance to the SOAP proxy, and the Automation object instance is reclaimed. Thus, SOAP is a stateless protocol, i.e., one in which an object's lifetime extends only to a single method call and the object is recreated for each call.
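The stateless lifetime described above can be illustrated with a short sketch. It uses a plain Python class as a stand-in for a COM Automation object and a dictionary as a stand-in for progid lookup. The names `FareCalculator`, `REGISTRY`, and `handle_request` are hypothetical and serve only to show the instantiate, invoke, and reclaim sequence.

```python
class FareCalculator:
    """Hypothetical stand-in for a COM Automation object."""

    def __init__(self):
        self.calls = 0  # per-instance state, lost when the instance is reclaimed

    def quote(self, miles, rate):
        self.calls += 1
        return {"fare": miles * rate, "calls": self.calls}


# Hypothetical mapping from a progid string to the class to instantiate.
REGISTRY = {"Demo.FareCalculator": FareCalculator}


def handle_request(progid, method, in_params):
    """Instantiate the object, invoke one method, and reclaim the instance."""
    obj = REGISTRY[progid]()  # a fresh instance for every request
    try:
        return getattr(obj, method)(**in_params)
    finally:
        del obj  # the instance does not outlive the call
```

Because a new instance is created for each request, two consecutive calls both report `calls == 1`. No state carries over between method invocations.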
The foregoing aspects and many of the attendant advantages of this invention will become more readily appreciated as the same becomes better understood by reference to the following detailed description, when taken in conjunction with the accompanying drawings, wherein:
As will be better understood from the following description, with reference to
The remote servers 30 may take the form of a host computer 30a, a minicomputer 30b, a mainframe computer 30c, or any other configuration of computer. A typical client computer 10a for implementing the invention is a general purpose computing device such as a conventional personal computer, which comprises such well-known items as a central processing unit 12, system memory 14, a modem and/or network card 16 for connecting the local computer to the Internet 20, a display 18, and other components not specifically shown in
In a present embodiment, Automation objects are implemented as COM Automation objects. COM, the Component Object Model by Microsoft Corporation of Redmond, Wash., is an implementation of component software technology, i.e., the idea of breaking large, complex software applications into a series of pre-built and easily developed, understood, and changed software modules called components. COM is described in Dale Rogerson, Inside COM, Microsoft Press, 1997, which is hereby incorporated by reference.
As shown in
SOAP is a data transmission paradigm. The data transmission paradigm includes a three-section data structure that comprises a header, body, and trailer. The data structure is used to package information referring to a request to invoke a method of an Automation object. In operation, when the client process 110 requires certain data from an Automation object, the process issues a method call, which causes an Advanced DataSpace 120 to be created. The Advanced DataSpace 120, in turn, creates a SOAP proxy 140 for the Automation object 180. The SOAP proxy 140 packages the data structure as an HTTP POST message in multipart MIME packets, and sends the message as a binary data stream through the network, i.e., the Internet 20, to the server computer 30 where the Automation object 180 is located.
When the server computer 30 receives the HTTP POST message, the server process 160, i.e., the Web server 160, invokes a SOAP stub 170 for the SOAP proxy 140. The SOAP stub that is invoked is chosen based on an identifier contained in the header of the data structure. The SOAP stub 170 unpackages the multipart MIME packets and instantiates the Automation object 180 identified in the header of the data structure. A method name field also identified in the header of the data structure indicates the method of the Automation object 180 to invoke. The method is invoked by the SOAP stub 170 using [in] parameters contained in the body of the data structure.
After the method has finished executing, return, or [out], parameters are returned to the SOAP stub 170, which packages the [out] parameters as multipart MIME packets and transmits a resulting HTTP Response message as a binary data stream across the Internet 20 to the SOAP proxy 140. The SOAP proxy 140 unpackages the multipart MIME packets and returns the [out] and [in, out] parameters to the client process 110. The instance of the Automation object 180 is reclaimed after the [out] parameters are returned to the SOAP stub 170. The [out] parameters, like the [in] parameters, are contained in the body of the data structure.
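From the client's point of view, the round trip above looks like an ordinary method call on a local object. A minimal sketch of such a client-side proxy follows. The class `SoapProxy`, its `transport` callable, and the `loopback` function are hypothetical: the transport stands in for the HTTP POST and Response exchange, which is not implemented here.

```python
class SoapProxy:
    """Hypothetical client-side proxy that forwards method calls to a transport."""

    def __init__(self, progid, transport):
        self._progid = progid
        # transport(progid, method, in_params) is assumed to perform the
        # request/response exchange and return the [out] parameters.
        self._transport = transport

    def __getattr__(self, method):
        # Called only for attributes not found normally, i.e. remote method names.
        def call(**in_params):
            return self._transport(self._progid, method, in_params)
        return call


def loopback(progid, method, in_params):
    """Stand-in transport that echoes the request instead of using the network."""
    return {"progid": progid, "method": method, "echo": in_params}
```

With this sketch, `SoapProxy("Demo.FareCalculator", loopback).quote(miles=100)` returns the echoed request. Replacing `loopback` with a function that posts to a Web server would leave the calling code unchanged.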
As noted above and illustrated in
Referring to
After the SOAP proxy 140 is created, a method call from the client process 110 is made on the SOAP proxy 140, which converts the method call into an HTTP POST message. See block 527. The HTTP POST message is shown in
In an exemplary embodiment of the present invention, the HTTP version information is used by the client computer 10 to indicate to the server computer 30 the highest permissible version of HTTP that can be used to format response messages produced by the server computer 30. While HTTP versions 1.0 and 1.1 are presently contemplated for use in actual embodiments of the invention, those skilled in the art will readily appreciate that any other versions of HTTP, as well as various versions of other Internet protocols may be used without departing from the spirit and scope of the invention.
The header 310 of the HTTP POST message also includes fields for holding data representing the PROGRAM ID (progid) 322 of the COM Automation object to be instantiated, a method name 324 that identifies the object method to be invoked following the progid 322, and any [in] parameters 326 that are needed by the method. The progid is an alphanumeric representation of the unique GUID used to identify the Automation class to instantiate on the target machine. The progid is used to obviate the need for application developers to encode long (128-bit) numeric sequences to identify Automation objects. This concept of progids is a part of the COM Automation model defined by Microsoft Corporation. The body of the HTTP POST message includes a MIME-encoding of the [in] parameters to be passed to the instantiated Automation object. Trailer 330 of the HTTP POST message preferably includes a field for holding data representing a checksum 332 for error-checking and correction purposes.
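The header, body, and trailer fields can be sketched as a small data structure. The text does not name a checksum algorithm, so the use of CRC-32 below is an assumption, as are the class name `SoapMessage`, the `verify` helper, and the byte layout used when computing the checksum. Note that CRC-32 only detects errors and does not correct them.

```python
import zlib
from dataclasses import dataclass


@dataclass(frozen=True)
class SoapMessage:
    progid: str  # header field: object to instantiate (progid 322)
    method: str  # header field: method to invoke (method name 324)
    body: bytes  # body: encoded [in] parameters

    def checksum(self) -> int:
        """Trailer value (checksum 332); CRC-32 is an assumed algorithm."""
        header = f"{self.progid}\n{self.method}\n".encode("utf-8")
        return zlib.crc32(header + self.body)


def verify(message: SoapMessage, received_checksum: int) -> bool:
    """Return True if the received trailer matches the message contents."""
    return message.checksum() == received_checksum
```

A receiver would recompute the checksum over the header and body it received and compare it with the trailer value, rejecting the message on a mismatch.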
Returning to
An example of a suitable Web server 160 is the Internet Information Server (IIS), from Microsoft Corporation, of Redmond, Wash. A suitable API is the Internet Services Applications Programming Interface (ISAPI) Dynamic Link Library (DLL), an example of which is the Advanced Data ISAPI (ADISAPI) component, from Microsoft Corporation, of Redmond, Wash. The API forms the server side stub for the SOAP proxy 140, i.e., the API acts as the SOAP stub 170. Thus, the SOAP stub 170 is a server-side Applications Programming Interface (API) that interacts with the SOAP proxy 140. Those skilled in the art will recognize that the Web server 160 and SOAP stub 170 may be implemented using programs other than IIS and ADISAPI, respectively, without departing from the spirit and scope of the invention.
Returning to
At block 550, the Automation object 180 is instantiated by the SOAP stub 170, and the method call is made on the instantiated Automation object 180. For simplicity, any discussion of “Automation object” refers to the instance of the COM Automation object executing on the Web server 160, rather than the program code merely stored on a disk or in memory.
Next, at block 555, the Automation object 180 invokes the called method using the [in] parameters provided by the SOAP proxy 140, and returns the results, or [out] parameters, to the SOAP stub 170. See block 560. The SOAP stub 170 repackages the data as an HTTP response message in multipart MIME packets. See block 562.
Those skilled in the art will appreciate that the Automation object 180 and associated method may be used for performing a variety of functions. For instance, the method could be used to access and retrieve data from a database 190 connected to the server computer, or the method could be used to insert data into the database 190, or both, by first retrieving data records from the database 190, updating the data records and then replacing the old database records with updated data records.
The HTTP Response message is shown in
Returning to
As will be readily appreciated by those skilled in the art, the present invention solves problems associated with passing distributed component software through a firewall. More specifically, the invention provides a way of allowing richer, interactive Web content to pass through firewalls. This is accomplished by an application layer protocol that allows remote Automation objects to be accessed using existing protocols which can pass through firewalls.
While the preferred embodiment of the invention has been illustrated and described, it will be appreciated that within the scope of appended claims various changes can be made therein without departing from the spirit of the invention.
This application is a continuation of prior application Ser. No. 09/741,586, filed Dec. 15, 2000, now U.S. Pat. No. 6,782,542, which is a divisional of prior application Ser. No. 08/967,082, filed Nov. 10, 1997, now U.S. Pat. No. 6,457,066 B1, priority from the filing dates of which is hereby claimed under 35 U.S.C. § 120.
Number | Name | Date | Kind |
---|---|---|---|
5491800 | Goldsmith et al. | Feb 1996 | A |
5548723 | Pettus | Aug 1996 | A |
5931917 | Nguyen et al. | Aug 1999 | A |
5938733 | Heimsoth et al. | Aug 1999 | A |
5956483 | Grate et al. | Sep 1999 | A |
5956509 | Kevner | Sep 1999 | A |
5987504 | Toga | Nov 1999 | A |
5991802 | Allard et al. | Nov 1999 | A |
6052710 | Saliba et al. | Apr 2000 | A |
6125352 | Frankline et al. | Sep 2000 | A |
6169992 | Beall et al. | Jan 2001 | B1 |
6370561 | Allard et al. | Apr 2002 | B1 |
6433795 | MacNaughton et al. | Aug 2002 | B1 |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 08967082 | Nov 1997 | US |
Child | 09741586 | | US |

Relation | Number | Date | Country |
---|---|---|---|
Parent | 09741586 | Dec 2000 | US |
Child | 10349536 | | US |