The present invention generally relates to services available over an information network and, more particularly, to techniques for providing dynamic service discovery from web service representation chains.
As an enabling technology, World Wide Web (or web, for short) services have been adopted to represent services accessible over the Internet and to communicate with other such services in a standard way. The emergence of web services expedites the next evolution of dynamic and on-demand electronic business (e-business). Web services are reusable web components with standard interfaces described in Web Services Description Language (WSDL) and can be accessed by universal clients such as wireless devices, web clients and other application clients over the Internet.
Web services can be published to a Universal Description, Discovery and Integration (UDDI) registry, public or private, or service description documents such as Extensible Markup Language (XML) based WS-Inspection (or WSIL, for short) documents.
The design of a UDDI registry enables publishing as well as a search of trading partners' businesses and their web services to specified categories. A UDDI registry is a central place to store such information and locations about web services. There are two types of UDDI registries, i.e., private and public registries. For an application developer, he or she can publish the web services to public UDDI registries operated, for example, by International Business Machines (IBM) Corporation, Microsoft, Hewlett Packard, or SAP. However, if the web services are private or confidential in nature, the best way is to publish them to private UDDI registries.
On the other hand, for testing purposes or small-scale integration, publishing web services to service description documents such as WS-Inspection documents would be the easiest way since WSIL enables web services discovery, deployment and invocation without the need for a UDDI registry.
It is known that a WS-Inspection document provides a mechanism for aggregating references to pre-existing service description documents which have been authored in any number of formats. These inspection documents are then made available at the point-of-offering for the service as well as through references which may be placed within a content medium such as HyperText Markup Language (HTML). For example, a uniform resource locator (URL) convention to locate WSILs may be as follows: http://www.myorg-wsd.com/inspection.wsil. Furthermore, the UDDI registries and the WSILs are tightly associated by a WS-Inspection data tag “wsiluddi.” In a WSIL, a reference pointer is used to connect to a business or service published in a UDDI registry.
WSIL or future equivalent service specification mechanisms are attractive to an extending business user community, as they do not require the rigor and complexity of setting up and maintaining a fully operational business registry such as a UDDI. Hence, more users are experimenting using WSIL as a convenient registry for their web services. What is simply required is the access from the users web sites to published web services and gathering the web service links into one service description document at a default location, for example, http://www.xmethods.net/inspection.wsil.
Therefore, exploring appropriate business applications published as web services in the service description documents is a critical issue. As mentioned above, service description documents are collections of pointers to other documents that list web services available on a web site. Service description documents can point to other service description documents, a UDDI business or service entry, and WSDL documents. Once you have found the service you want at a site, you can import the WSDL document to generate a web services invocation client proxy for consuming those web services.
One typical WSIL based web services discovery application scenario is a design collaboration. Service description documents provide an easy and convenient way to allow design partners and supply chain companies to publish their services on the Web. However, design collaboration requires an effective and efficient service discovery mechanism for design team building and design service outsourcing to work together to create innovative, profitable products that meet narrow market windows.
Thus, a search or discovery mechanism for such applications should be effective in terms of time and uniform in terms of interfaces.
Currently, there are manual, iterative search processes. An example of such a process is as follows:
To obtain all the web services referenced by the links, i.e., to find by service name or business name, one needs to repeat step (3), step (4), and gather services by name manually.
Some of the major shortcomings of the manual search process can be summarized as follows:
Therefore, a need exists for improved service description document discovery techniques.
The present invention provides techniques for automatically discovering services available in accordance with an information network. For example, in a first aspect of the invention, such a technique comprises the following steps/operations. A request is obtained from a client to perform a search for one or more services. A set (e.g., chain) of one or more service description documents is searched, based on the client request. The searching step/operation further comprises detecting that one or more changes have occurred in the set of one or more service description documents. Then, a result of the search is made available to the client.
The detecting step/operation may further comprise comparing at least a portion of a current instance of the set of one or more service description documents to at least a portion of a previous instance of the set of one or more service description documents. The portion of the previous instance of the set of one or more service description documents may be stored in a cache. The portion of the current instance of the set of one or more service description documents may be used to update a cache.
The services discovery technique may further comprise the step/operation of aggregating sub-results obtained during the searching step/operation into an aggregated result, and then making the aggregated result available to the client. The sub-results may comprise services obtained during the searching step/operation.
Still further, the services discovery technique may comprise the step/operation of obtaining information used to control performance of the searching step/operation. Also, the searching step/operation may be performed in association with multiple data sources. The searching step/operation may also be configurable to be performed at different levels of granularity. The one or more services being discovered may comprise one or more web services.
In a second aspect of the invention, a technique for querying one or more service description documents comprises the following steps/operations. At least one query (e.g., client request) is received. A target data source is identified for the query. Then, the target data source with at least one service description document is explored. The query may comprise at least one location of a service description document. The query can be expressed in terms of one or more of: (i) an extensible markup language; (ii) a HyperText Transport Protocol request string; (iii) one or more input parameters in an application programming interface; and (iv) a form comprising at least one of a location of a service description document and a search criterion. The target data source may comprise at least one of: (i) at least one service description document with zero or more traverse links to other service description documents; and (ii) a service container.
In a third aspect of the invention, a system for automatic exploration of one or more multi-level service description document chains comprises the following components: (i) a service container containing at least one cached web service; (ii) a chain change detection module to detect changes in service description document chains; and (iii) a service description document exploration engine coupled to the service container and the chain change detection module for performing automatic exploration of multi-level service description document chains. The system may further comprise one or more control parameters to control operation of the service description document exploration engine.
The service container may comprise at least one of: (i) cached information about one or more web services referenced in at least one service description document chain; (ii) service description document chain information including at least the location of at least one service description document; and (iii) utilities to update and maintain the cached content. The cached information for each web service found in the service description document chain further may comprise at least one of: (i) a service description document name/URL; (ii) a service name; (iii) an abstract; (iv) a WSDL location; and (iv) one or more category descriptions. The cached information for each service description document chain may comprise at least one of: (i) a creation time; (ii) a document size; (iii) a URL or service description document location; and (iv) other signatures of a service description document.
Further, the chain change detection module may perform one of a time-initiated checking operation and an on-demand checking operation. The one or more control parameters may comprise one or more parameters for controlling: (i) a maximum depth to traverse; (ii) turning caching on or off; (iii) target data source display data; (iv) a stopping condition for performing chain change detection; and (v) a determination of which data source is to be a target data source. The system may also serve as a web services search agent.
In a fourth aspect of the invention, a technique for creating a service description document chain comprises the following steps/operations. A set of published service description documents is collected by using a manual search and/or an automated search engine application programming interface. Related service description documents are linked to form a service description document chain. A chain change detection process is invoked to collect changes to the service description documents in the chain. Then, a service description document exploration process is invoked to explore the chain. Results of the processes are stored in a cache.
In a fifth aspect of the invention, a technique for providing a service, in accordance with a service provider, to allow discovery of services available in accordance with an information network, comprises the step of deploying a service discovery system operative to: (i) obtain a request from a client to perform a search for one or more services; (ii) search a set of one or more service description documents, based on the client request, wherein the searching step further comprises detecting that one or more changes have occurred in the set of one or more service description documents; and (iii) make a result of the search available to the client.
Thus, advantageously, the invention may provide methods and apparatus for dynamic service discovery from web service representation chains (i.e., service description document chains) with one or more of automatic change detection of the chain, result aggregation and caching capability. The invention solves the above-mentioned business problems and enables businesses to easily retrieve up-to-date web services linked and nested multi-level deep in the service description documents.
For example, in an illustrative embodiment of the invention, a service discovery technique includes steps to automatically search linked and nested service description documents for web services, aggregate the services found in each document, and return them to the requester program. Thus, the invention automates the discovery process and provides real-time feedback. This is important because the web service descriptions in those documents can change frequently as new web services get published and old ones get removed. The ability to dynamically re-explore the linked and nested service description documents for an updated list of web services is extremely valuable to businesses requiring access to web services. The invention therefore provides a solution to automate and manage the repetitive elements of the task while rendering the service exploration aspect efficient through pre-fetched link calculation and reference caching and aggregation.
In another illustrative embodiment of the invention, a method is provided for aggregating the services found in each document, grouping them per document, and returning all web services found to the requester. In addition, the invention may also aggregate search results from multiple data sources, e.g., UDDI registries and service description documents such as WSIL documents.
These and other objects, features and advantages of the present invention will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.
The following description will illustrate the invention using an exemplary WSIL environment to specify and describe a service and related access specification process, including references or aggregation of references thereof. It should be understood, however, that the invention is not limited to use with any particular environment. Rather, the invention is more generally applicable to any environment in which it is desirable to provide efficient and effective service discovery techniques.
It is to be appreciated that a “service description document chain,” as illustratively referred to herein, pertains to WSIL documents that are hosted on the Web and linked together via a web link or uniform resource identifier (URI). Thus, these linked documents can be traversed and contents be retrieved by following the web links, which can be nested multi-level deep.
The remainder of the detailed description is organized as follows. First, there is a description of main components of a service discovery system, followed by a description of mechanisms used to enhance the WSIL search capability. Next, there is a description of an exemplary service description document chain exploration methodology usable by a service discovery system.
Referring initially to
The following will explain the functionality of the main components of system 100 that process service description documents containing web services descriptions. Again, WSIL is used as an example of such a service description document.
Service description document exploration engine 102 is the component that provides the mechanism for automatic, deep exploration of service description documents linked together. Details of engine operation will be provided below in the context of
Services container 104 stores service cached information of each service description document chain and web services in the chain to be used by chain change detection module 106. At a minimum, the service name and the sources (e.g., the WSIL signature) will be captured for each service in the appropriate service container. Further details on the services container will be given below.
Chain change detection module 106 implements a unique caching methodology that categorizes the service description document chains for related web services, grouped/linked via a root service description document. More specifically, module 106 automatically detects changes in the service description documents, using attributes such as creation time, size, and other signatures of a service description document, by checking the service description document chains on a time-initiated basis against the contents cached in services container 104. Further details on the chain change detection module will be given below.
Control parameters 108 include initial setup data for service description document exploration engine 102. By way of example, control parameter data 108 includes: (i) data specifying maximum depth or level to traverse with configurable granularity, i.e., service description document exploration engine 102 explores in a depth-first fashion for a given link before the peer links are explored; (ii) data specifying whether caching mechanism is turned on or off; (iii) data specifying chain change detection with configurable granularity; (iv) target data source information including default data to be cached in the service container and/or the known service description document chains; and (v) information related to a display data level, i.e., short or long description.
The invention provides techniques/mechanisms to enhance WSIL search capability, by way of example:
With reference still to
The implementation flow corresponding to system 100 shown in
As shown, methodology 200 begins, at step 202, with the input of a service description document location or filename. This is received from client 120 or 122. In step 204, service description document exploration engine 102 invokes the chain change detection process (106). If any changes (block 206), the engine explores such changes in step 208. The exploration process is described below in the context of
Referring now to
In step 302, a service description document location (e.g., uniform resource locator) or filename is input. In step 304, the WSIL content is read. In step 306, service information and chain information is obtained, and the cache (114) in service container 104 is updated.
In step 308, it is determined whether the maximun depth (which may be a configurable parameter) of the service description document link level has been reached. If not, it is determined in step 310 whether there are any links in the document. If yes, the correct service description document location in the first link in the document is resolved in step 312. If no links in the document, it is determined in step 314 whether there are any peer links, which are referred to as Web links or URIs in the same WSIL document. If yes, in step 316, the peer link is processed, and the correct document location of the peer link is resolved. Step 316 is also performed when the maximum link traversal depth is reached in step 308. After step 312 and/or step 316 is performed, the methodology returns to step 304. The methodology ends at block 318.
The detailed description now provides further detail of functional components of
I. Services Container
As shown in
For each service description document chain, the following information is maintained in the services container:
Category name—The name that describes the category that all services within this service description document chain belong to in human-friendly terms, i.e., travel, finance, car, food, clothing, etc.
For each service description document linked in the chain, the following metadata may be stored in the service container: (i) creation time; (ii) document size; (iii) URL or WSIL location; and (iv) other signatures of a service description document, etc.
For each service found in the service description document chain, the following information may be cached in the services container: (i) service description document name/URL; (ii) service name; (iii) abstract; and (iv) WSDL location.
Descriptions for the categories may be maintained in the services container that serve as an index. The index may be based on the service abstract and service name and include additional metadata and keywords to facilitate a search.
II. Chain Change Detection
Chain change detection (module 106) enables configuration of different granularity for different conditions. For example, a condition may be to stop checking for changes and immediately return at the first change, or search entire service description document chain after detecting a change. Another condition may be associated with activating chain change detection. In time-initiated checking, there may be a configurable interval of time that the chain change detection process will be turned on or off. This can be configured as the default behavior as well. In accordance with an on-demand basis, chain change detection may also accept a request from the service description document exploration engine to detect changes of a specific service description document chain or chains at runtime before exploring the chain or chains.
In an intialization stage, methodology 500 fetches configuration data for a stopping condition (step 502). Then, methodology 500 is applied to each chain in the known service description document chain (blcok 504).
In step 506, for each input service description document location (URL) in the chain, a document attribute is checked against information in the service container. If not changed (step 508), methodology 500 returns to step 506. If changed (step 508), the service description document URL is saved in step 510. If the results are to be returned immediately (step 512), then this is done and the process for that chain is completed (block 516). However, results for all the chains may be aggregated (step 514) and returned together.
III. Exploration of Service Description Document Chains
An example of a service description document chain 600 is shown in
Listing 1. Inspection.wsil:
Listing 2. Moreservices.wsil:
Listing 3. Anotherservices.wsil:
Listing 4. Shipping.wsil:
(a) WSIL Exploration for Design Collaboration Scenario
As mentioned above, design collaboration solutions realized by the invention enable companies to bring a product development team up in a matter of hours and could be used to establish an effective team-working environment within the enterprise as well as across the enterprise.
An example of a service description document chain 700 for design collaboration is shown in
(b) WSIL Explorer APIs
Java APIs are provided for the client to obtain a list of web services from multiple service description document chains. Sample APIs are listed below in Listing 5. They may take different input, i.e., WSIL URL, WSILDocument, and wsilFileName:
Listing 5:
In Listing 6 below, the initial service description document URL is “http://tempuri.wsil.com/wsil/inspection.wsil”, and it shows an example output after traversing a service description document chain of four service description documents and finding a total of four services, displayed in three separate fields, i.e., abstract, name and location.
Listing 6:
In Listing 7 below, an example is shown of an output after traversing the same service description document chain of four service description documents with a total of four services found, displayed in service description document format for the same three separate fields, i.e., abstract, name and location, for each service.
Listing 7:
(c) Example Embodiment: WSIL Exploration Portal
In this example embodiment, a sample portal (e.g., portal 110) is provided to search services in a given service description document chain or WSIL chain using the service description document exploration engine.
In this example, a service description document chain with the same set of four service description documents mentioned above in section (a) is used in the portal embodiment.
In
In the text area, the service name starting with ‘Address’ is entered.
(d) Search Result Aggregation from Multiple Service Description Document Chains
As mentioned above, the WSIL exploration mechanism described herein is capable of aggregate results from multiple service description document chains as illustrated by the ‘Explore’ function of the WSIL exploration portal. That is, the service description document exploration engine can iteratively explore multiple service description document chains and then aggregate the results together for display all at once.
In Listing 8 below, an XML script is shown that describes two queries for two different service description document chains with one root service description document being inspection1.wsil and the other being inspection2.wsil.
Listing 8:
(e) Service Description Document Chain Creation
Methodology 1200 begins at step 1202 where published WSILs are collected (either manually or via search APIs). In step 1204, the methodology categorizes and creates a WSIL chain linking related WSILs as one chain. In step 1206, a services container is initialized. In step 1208, the methodology invokes chain change detection for newly created WSIL chains. If there are any changes (step 1210), these changes are explored in step 1212 as explained above. If no changes (step 1210), the creation methodology ends in block 1214.
(f) WSIL Explorer Based Web Services Integration Framework
An advanced web services discovery framework (WSDF) according to the invention deals with the above-mentioned problems and limitations in a UDDI search head-on. The framework provides an easy to use mechanism, XML-based script, that shields the application developers from complex Java programming using either WSIL4J (Web Services Description Language for Java Toolkit) or UDDI4J (UDDI for Java, a class library that provides an API to interact with a UDDI (Universal Description, Discovery and Integration) registry), as well as provides capabilities for union and intersection of multiple search queries from one or more UDDI registries.
The framework is designed to meet the following objectives:
A script based search agent can play an important role in simplifying the application developer's job in developing web browser-based clients or e-business applications for web services discovery.
As shown in architecture 1300 of
The web services search agent implements a sophisticated result aggregation mechanism and communicates with multiple UDDI registries and WS-Inspection documents. When a service requester looks for a web service, the search agent can respond with one or all of the three basic data types, businessEntity, businessService, and ServiceType (i.e., Technical Model, or t-Model) retrieved from UDDI registries. The example aggregation includes, but is not limited to, operations of intersection, union and script-based logic to operate on the responses from multiple sources. The final response to the search requester maybe a new XML format or an existing XML format such as WSIL, which can emerge as a player in representing the aggregated search results.
In the mean time, the web services search agent automatically invokes a set of web services to obtain the actual results or just to explore the web services capabilities. The search agent could be deployed on a separate machine from UDDI registries or service description documents or deployed on the same machine that the UDDI registries or service description documents reside.
Listing 9 below shows an example search script that searches for one UDDI registry and one service description document chain. The aggregated search result is shown in
Listing 9:
Referring finally to
It is to be understood that such individual components/methodologies may be implemented on one such computer system, or on more than one such computer system. In the case of an implementation in a distributed computing system, the individual computer systems and/or devices may be connected via a suitable network, e.g., the Internet or World Wide Web. However, the system may be realized via private or local networks. The invention is not limited to any particular network.
As shown, computer system 1500 may be implemented in accordance with a processor 1502, a memory 1504, I/O devices 1506, and a network interface 1508, coupled via a computer bus 1510 or alternate connection arrangement.
It is to be appreciated that the term “processor” as used herein is intended to include any processing device, such as, for example, one that includes a CPU (central processing unit) and/or other processing circuitry. It is also to be understood that the term “processor” may refer to more than one processing device and that various elements associated with a processing device may be shared by other processing devices.
The term “memory” as used herein is intended to include memory associated with a processor or CPU, such as, for example, RAM, ROM, a fixed memory device (e.g., hard drive), a removable memory device (e.g., diskette), flash memory, etc.
In addition, the phrase “input/output devices” or “I/O devices” as used herein is intended to include, for example, one or more input devices (e.g., keyboard, mouse, etc.) for entering data to the processing unit, and/or one or more output devices (e.g., speaker, display, etc.) for presenting results associated with the processing unit. Such output devices may also be used to present graphical user interfaces such as those shown in
Still further, the phrase “network interface” as used herein is intended to include, for example, one or more transceivers to permit the computer system to communicate with another computer system via an appropriate communications protocol.
Accordingly, software components including instructions or code for performing the methodologies described herein may be stored in one or more of the associated memory devices (e.g., ROM, fixed or removable memory) and, when ready to be utilized, loaded in part or in whole (e.g., into RAM) and executed by a CPU.
Advantageously, the invention provides the above and other features but does not need to attach any property to the document itself nor need to use an attached property to control the behavior, manipulate or reconstruct the document contents, or change system configurations. Documents being searched need not be self-contained, and may be unaltered with any properties attached. Instead, the invention builds a centralized control structure around the documents it searches. Also, the invention provides a search engine to: (i) track and detect the changes of the documents by periodically updating and looking up the centralized control structure; and (ii) deep search multi-layered documents. The invention also provides caching of the document and control structure to speed up the lookup as well as provides a mechanism, i.e., control parameters, to manipulate and adjust the granularity of the caching. The invention may also pre-categorize the documents of interest prior to search.
Although illustrative embodiments of the present invention have been described herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be made by one skilled in the art without departing from the scope or spirit of the invention.