The invention generally relates to the field of data processing and more particularly to a method and a system for dynamically integrating a process in a system.
Contemporary applications increasingly require combinations of different types of data sources and automated processes based on different types of algorithms. Examples of such applications include internet applications such as Smart cities, Smart homes, Large scale crisis management, advanced solutions to surveillance in defense and civilian domains and security applications. These kinds of domains are characterized through large quantities of very heterogeneous data stemming from different and generally spatially dispersed devices. Such rich body of data can be exploited by using algorithms resulting in improved assessment and prediction of the states of the world process control. In this way, better informed decisions can be made that are likely to lead to a better use of resources involving for example a reduced energy consumption, a better protection of citizens and countries, etc. Further, such decisions can mitigate negative effects more efficiently (e.g. less pollution, less damage, etc.).
Such contemporary applications generally involve the use of specific data sources depending on the role of the applications (devices, databases, etc.), as well as the use of processes or algorithms which uses the devices' outputs to perform a particular processing in view of the applications' role. For example, a “smart home” application, which aim is to reduce energy, can involve the use of devices such as motion detectors and individual air conditioning units/subsystems in different rooms (living room, bedroom, kitchen), whereas automated processes implemented for such application may include arbitrary algorithms for controlling the intensity of the air conditioning operation in each room, where the algorithm inputs are data from the motion sensors and preferences by the users. The algorithms can be of different types, ranging from algebraic functions to Artificial Intelligence (Al), such as rule based functions, Neural networks, probabilistic networks, etc., depending on the nature of the application.
There is accordingly a need for adequate computing solutions which support analysis of the data and provide automated processes (e.g. controllers) with reliable assessments of the situation. However, the targeted applications involve large numbers of very heterogeneous data sources (such as sensors, databases, etc.) and analysis services. Further, in such settings, new analysis capabilities must be added frequently whereas it is generally complex to introduce new data sources and new algorithms into an existing working environment because of the following access and interoperability issues:
There exist conventional approaches to interoperability based on Service-Oriented Architecture (SOA) technology, such as the solution disclosed in Lucas Zamboulis, Alexandra Poulovassilis, George Roussos: Flexible data integration and ontology-based data access to medical records. BIBE 2008: 1-6, or the solution disclosed in Eduard Hovy: Using an ontology to simplify data access, Communications of the ACM, Volume 46, Issue 1, January 2013. In such approaches, the creation of heterogeneous service descriptions and the discovery of the relevant data types depend on elaborate domain ontologies describing relations between different concepts. Ontologies are typically developed and maintained in a central way. However, this requires a substantial domain knowledge, which is unlikely to be present at a single organization that has the technical background to support formalization of concepts (i.e. ontology development and maintenance). Further, if the experts providing the algorithms generally possess the domain knowledge, they do not have generally the knowledge of the underlying concept formalization. In addition, these experts usually do not have the knowledge needed to get access to the relevant data.
In addition, while interoperability standards may be available for some simpler data types corresponding to the outputs of common devices (e.g. simple sensors), it is likely that such standards will not exist for the new types of advanced services at the point of their introduction. For example, international interoperability standards will not exist at the point of the introduction of a new complex service, such as a specific diagnostic analysis of a device performance/status, system control, etc. Instead, standards will evolve within a specific environment (e.g. an Internet of Thing/smart homes network offered by a specific provider). Moreover, device providers need to make their devices compatible with the overall existing system.
There are currently no efficient solutions that allow environment operators to easily specify the standards and help the developers of devices and services to make their solutions compatible with the overall environment.
In order to address these and other problems, there is provided a process integration method as defined in the appended independent claim 1 and a process integration system as defined in appended claim 12. Preferred embodiments are defined in the dependent claims.
The various embodiments of the invention thus allow rapid integration of new algorithms and devices into open-ended processing environments.
In particular, the integration method and system according to the various embodiments of the invention solves interoperability problems in an important class of contemporary processing environments by combining (i) fast definition of services associated with the outputs of algorithm, (ii) fast association of the internal variables with the existing types of data-objects used in the processing environment and (iii) software components which allow runtime integration of algorithms into workflows.
Further advantages of the present invention will become clear to the skilled person upon examination of the drawings and detailed description. It is intended that any additional advantages be incorporated herein.
Embodiments of the present invention will now be described by way of example with reference to the accompanying drawings in which references denote similar elements, and in which:
It is noted that the drawings of the invention are not necessarily to scale. The drawings are merely schematic representations, not intended to portray specific parameters of the invention. The drawings are intended to depict only typical embodiments of the invention, and therefore should not be considered as limiting the scope of the invention.
Referring to
The target systems 111 may be based on the process proxies. In such target systems, each process (i.e. algorithm) comprises a dedicated process proxy. A process proxy generally implements semantic interoperability, supports service discovery and creates and maintains information flows between compatible processes. More specifically, process proxies of different processes implement a processing environment in which compatible processes can interoperate. The following description will be made with reference to class of target systems 111 based on such process proxies.
Each algorithm in the target system 111 may be associated with input and output data objects defined in the target system 111. The process proxy 4 manages the communication, i.e. exchange of relevant data-objects, with other processes in the target system 111. The data-objects may be of arbitrary complexity, ranging from primitive data-types to complex data-structures bundling heterogeneous data-types. The target system 111 may maintain a description of:
While the invention is not limited to a specific proxy implementation, it is assumed thereinafter that a proxy comprises two distinctive functional parts, namely a communication and a processing component, which may communicate synchronously or asynchronously.
The following description will be made with reference to the process proxies depicted in
In one embodiment of the invention, the process proxy can be a Dynamic process Integration Framework (DPIF) proxy. DPIF proxies, as described in Gregor Pavlin, Michiel Kamermans, Mihnea Scafes, “Dynamic Process Integration Framework: Toward Efficient Information Processing in Complex Distributed Systems”, Informatica 34(4): 477-490 (2010), can be used to create fast advanced hybrid analysis systems.
A system of process proxies 4 may be created in a targeted system 111 and configured with the help of the integration system 110 to manage the access to data and interoperability between heterogeneous processes based on different types of algorithms. Examples of such applications may include for example Internet of things, Smart homes, or large scale crisis management applications.
The integration system 110 may operate in an arbitrary environment. The integration system 110 allows integration of a given algorithm 1 (also referred to as an “input algorithm”) into the target system 111. The input algorithm can be any type of code implementing a process related to a specific application. Input algorithms can roughly be classified into the following types:
It should be noted that physical devices may be integrated via drivers, dedicated algorithms. Moreover, the input algorithm may be previously coded in an arbitrary programming language such as JAVA, C or C++, and compiled into software libraries.
Turning back to
The proxy constructor 2 is configured to generate components which are required by the proxy injector 3 to create the dedicated process proxy 4 for the input algorithm 1 that is to be integrated in the target system 111. Proxy constructor 2 may produce linked software libraries comprised of the coded input algorithm 1, and software libraries implementing the generic functionality of the communication and processing components. Moreover, the proxy constructor may also support creation configuration files by using information contained in a data-object type repository 112 which describes the data object types in the target system 111 as detailed thereinafter. Upon injection, the linked software libraries may be used to create a new process proxy while the configuration files may be used by the newly created proxy to (i) publish its output data types and (ii) subscribe to the relevant types of data objects. In addition, the configuration file may also comprise the data used for the negotiation and other information over security and access aspects. This makes the wrapped data source or algorithm instantaneously visible and accessible. The configured process proxies enable sharing of relevant information between different processes. Accordingly, the suitable information can be delivered to the suitable process at the suitable moment in time.
Each process proxy 4 associated with an algorithm can be further configured to continuously listen to the information relevant for the algorithm and/or disseminate information (e.g. intelligence products) to the target recipients.
The target system 111 created by the process proxies 4 thus represents various data sources and algorithms.
The proxy constructor 2 may use a configuration tool 22 to allow users (e.g. providers of devices and algorithms) to integrate their algorithms into the target system 111, without the need of any knowledge about specific programming techniques, languages or formal data models. From the description specified by the users through the user interface, the configuration tool 22 may generate the descriptions of the data-object types produced by the input algorithm 1 encoded in any arbitrary language and format (example is an ontology language such as OWL.
Moreover, by using the configuration tool 22, each user can describe relations between the output data-objects types and the required data object types. Such information can be used by process proxies 4 to integrate the expert/process into a workflow via service discovery. The configuration tool 22 may be for example based on the OntoWizard tool disclosed in the Ate Penders, Gregor Pavlin, Michiel Kamermans: “A Collaborative Approach to Construction of Large Scale Distributed Reasoning Systems”, International Journal on Artificial Intelligence Tools 20(6): 1083-1106 (2011). Such OntoWizard tool enables the participants to align the semantics and the syntax of the data-objects types used by the algorithms in the target system 111.
Each output data-type may be previously described by the algorithm provider via the configuration tool 22. Such output data-type descriptions may be stored in the data-object type repository 112. The data-object type repository 112 is thus provided to collect descriptions of all types of data-objects that can be produced by the algorithms in the target system 111. This allows inspection of the available types of data-objects in the target system and specification of relations between the local input algorithm variables and data-object types in the target system 111.
The integration system 10 according to certain embodiments of the invention may be based on a representation of the relations between specific variables in the input algorithm 1 and the data object types supported by the target system 111. These relations may be encoded in a Variable-Linker file 9, which may be a text file in any format such as for example XML. The Variable-Linker file 9 may be used by the process proxy to connect the variables in the input algorithm 1 with arbitrary data sources or consumers in the target system 111.
By combining the integration system 110 with process proxies, the outputs of processes or the data sources can become visible in the overall target system 111, have a well-defined semantics, and be in the suitable format so that other components/processes can use it. Further, the process inputs connected to the suitable sources can provide data in the suitable format and with the correct meaning.
According to one embodiment of the invention, the processing component 123 of each process proxy 4 may implement specific mechanisms which are based on a combination of certain types of internal data structures, generic as well as special functions. All these elements may be coded, i.e. implemented by using an arbitrary programming language and compiled into binary code.
Accordingly, the mapping between semantically rich data representations can be achieved dynamically without the need for hardcoding of various data-dependent aspects, which would require capturing the relations between algorithm variables and data-objects from target system 111 directly in the program code.
As shown, the processing component 123 of each process proxy may comprise the following main components:
In a preferred embodiment of the invention, the input and output functions 42 and 47 may be generic (i.e. identical code is used in all proxies). The two input/output functions 42 and 47 may be read by the VLF file 9 which provides the mapping in order to translate between internal variables used by the input algorithm and the data-objects 41 and 48.
The algorithm interfacing function 46 implements algorithm specific calls. The function interfacing 46 may be derived from a template (e.g. a sub-class in object oriented programming environments) and only a few custom lines of code may be added. The adapted function may then be compiled into a specific library 16 (as shown in
Such architecture of the processing component 123 of each process proxy 4 makes it possible to support implementation of interoperability standards resulting from the configuration process supported by Proxy Constructor 2.
Turning back to
The input algorithm 1 is initially coded in an arbitrary programming language (such as JAVA or C, C++, C#) and compiled, which provides software libraries associated with the algorithm 1. The linker component 23 is configured to link the compiled algorithm code (1) with:
the generic proxy software libraries 13 implementing the communication component 120 and the generic parts of the processing component 123 (such as 42, 43, 45 and 46), and the libraries 16 comprising the specific algorithm interfacing function 54.
The set of linked libraries 1, 13 and 16 is used by the proxy injector 3 to create a running proxy in the target system 111.
The running proxy may then be configured such that it can subscribe to the suitable data-object types corresponding to the input internal variables and publish the outputs in terms of the suitable data-object types in the target system 111.
A proxy configuration method may be implemented by the proxy configuration tool 22 to prepare the process proxy configuration files.
Such Proxy configuration method may comprise the creation of a Variable Declaration File 8 (also referred to thereinafter as VDF), based on the coded algorithm 1, to describe the types and names of variables representing inputs and outputs of the algorithm 1. This can be done automatically or manually, by using a suitable format.
Independently of the configuration files generation method, each entry in the VDF 8 may comprise the following parameters:
The proxy configuration tool 22 facilitates the generation of the following configuration files associated with the algorithm 1, which may include:
The generation of the Variable Linker File 9 comprises linking local variables with data objects. More specifically, the VLF 9 may be created from the Variable Description File 8 (VDF).
To create the VLF 9, the configuration tool 22 may implement a special reader of VDF 8 and a graphical user interface (GUI) which displays the internal variables of algorithm 1 (i.e. variables declared in VDF 8) in the interface 51 of the configuration tool.
The interface 51 may comprise:
A first graphical element 52 may be used to represent the variables of the algorithm declared in VDF 8. The graphical element 52 may be any arbitrary graphical element, such as a drop down list, scroll list, etc. Moreover, a second graphical element 53, such as a scroll list, may be used to represent the list of data-object types from the data-object type repository 112. For each displayed internal variable, the user may either
For each selected internal variable, the user may use the browser tool 53 to scroll to the adequate description of a corresponding data-object type and uses any suitable control to select the data-object type, such as for example a select button or double click in the browser. Variables may be linked by selecting a variable in 52 and the corresponding data-object type in 63 and by activating a suitable control element, such as for example a button 64 or double click on the right element in 53.
After all internal variables used by algorithm 1 are associated with the corresponding data-object types in the target system 111, the configuration tool 22 is configured to automatically generate a VLF 9 describing the relations. The VLF 9 may be described in any arbitrary format supported by the process proxy components, typically the functions 42 and 47.
Turning back to
Each time a new process based on the same algorithm 1 is created, the same software libraries 15, the VLF 9 and Metadata OWL file 14 can be reused to spawn a new process proxy in the target system 111.
Accordingly, the system 10 according to the various embodiments of the invention further allows rapid integration of arbitrarily complex analysis processes, devices (e.g. sensors, smart appliances) and databases.
The integration system may be used as a basis for the development stacks which will be a significant enabler of improved exploitation of the data sources and advanced algorithms in open ended systems, where new data sources are being added at a rapid pace and increasingly complex processes have to be controlled.
The integration system according to the embodiments of the invention makes it possible to easily populate open ended environments with very heterogeneous devices and processes and to quickly couple devices and processes, thereby harnessing the full potential of the advanced hardware and algorithms.
In step 600, the algorithm coded in a selected language previously prepared by the designer of a given process is received. To prepare the algorithm, the designer of the process may code the algorithm using a suitable language, preferably an object programming language such as Java or an equivalent programming language, but is not limited to this. The designer may then compile the code into software libraries. As will be readily understood by the skilled person, the algorithms thus prepared form the basis for the integration of data sources and analysis capabilities. In particular, the input algorithm 1 may be provided, for example, to read outputs from the sensor device driver software, query databases or support arbitrarily complex system control, state estimation, learning, etc.
In step 602, the Variable Declaration File (VDF) 8 is created. The Variable Declaration File 8 represents a file having a predefined format that describes the types of arguments and the outputs of the compiled algorithm from step 600. The file 8 may be encoded in any suitable format such as in XML. According to one aspect of the invention, the VDF file 8 may be configured to capture the variable names and the variable types. The VDF file 8 may be used to generate a display of a special view on the Graphical User Interface (GUI) 51 in which the user (e.g. the developer) can associate the variables from the algorithm with existing types of data/information in the overall processing environment based on algorithm proxies, thereby creating an entry for each input and output variable in the algorithm (as represented in
Moreover, in step 602 the libraries implementing the communication component 120 and processing component 123 are linked. In particular, the generic proxy libraries are linked with the algorithm library and a specific proxy library which implements calls to the algorithm library.
In step 604, the Variable Linker File (VLF) 9 is then created. In this step, the algorithm inputs and outputs are associated with information types (i.e. data-object types) used in the overall processing environment (i.e. target system 111). The Variable Declaration File 8 resulting from step 602 may be used to generate a view of the variables declared in VDF 8 in an appropriate Graphical User Interface of the configuration tool 22.
The user (for example Software designer) may then be prompted by the configuration tool 22 to associate the variables listed in the configuration file 8 with the data-object types that exist in the target information processing system 111. The data-object types can be associated with arbitrarily complex representations and rich semantics. For example, a data-object type can be represented by a complex structure whose elements have different semantics.
To allow the user to connect the variables listed in VDF 8 to the data-object types maintained in the target system 111, a special user interface in the configuration tool 22 may be used. This results in:
In step 606, the process is injected in the runtime environment 111. In this step, a new interface view may be generated for the finalization of the integration of the algorithm. More specifically, in response to the selection of a finalization button by the user, the following steps may be performed:
Accordingly, the mapping between the algorithm and the data types in the existing processing system 111 can be achieved by combining:
The described embodiments of the invention thus provide a process integration method for systems where each device or algorithm is represented by a process proxy. The process proxies 4 support service discovery and creation of workflows between data producers and consumers at runtime. The workflow formation is based on local domain knowledge/models of each process proxy; i.e. no centralized service composition is required, but not excluded. The process integration method solves interoperability challenges in environments consisting of very heterogeneous types of services. In particular, it supports efficient description of new services and discovery/inspection of existing compatible services. This allows definition of new services and their standardization in open ended environments. The approach is based on principles that explicitly take into account the properties of organizations.
The process integration system according to the various embodiments of the invention can be applied in a wide class of domains, such as Internet of things, Smart cities, Smart homes, Large scale crisis management, advanced solutions to surveillance in defense and civilian domains, security, etc.
Such domains are characterized through large quantities of very heterogeneous data stemming from different, usually spatially dispersed devices. Such rich body of data can be exploited for improved assessment and prediction of the states of the world/relevant processes in the domain of interest. In this way better informed decisions can be likely leading to a better use of resources (e.g. reduced energy consumption, better protection of citizens and countries, etc.) and more effective mitigation of negative effects (e.g. less pollution, less damage, etc.).
Even if not limited to such environments, the invention has particular advantages in open ended environments, where the integration system can allow a dynamic integration of new types of services at a fast pace, such as for example sensors of different types, smart appliances providing complex diagnosis services, energy consumption monitoring services based on arbitrarily complex algorithms, etc.
The integration system according to the various embodiments of the invention additionally allows for efficient creation of device interfaces that are compatible with a specific, quickly evolving environment.
The integration system 10 according to the described embodiments of the invention may be used for example for integrating smart home control solutions in an existing processing system. The control solutions can be based on various algorithmic approaches, such as for example:
In such example, the process integration method according to the embodiments of the invention can be used for:
In scenario (ii), the integration method according to the embodiments of the invention can be used to create libraries and metadata that allows instantaneous integration of a specific type of device, without any coding at the time of the installation.
Another exemplary application of the invention may be the fast integration of a process or an algorithm in an existing processing system, for instance for the installation of a new smart algorithm by a developer to:
The inputs to the new control algorithm may be, in this example, data from the motion sensors and preferences by the users. To be integrated to the overall Internet Of Things environment, the input and output parameters of such motion sensors are to be connected with adequate data sources and consumers.
To install a new control system, the developer may need to plug the controller into the smart home environment on the fly, for example by:
The process integration system 10 according to the embodiments of the invention may be used as follows:
It should be noted that this installation could alternatively take place remotely.
During the life cycle of the smart home system, improved smarter control solutions can be offered to the end-user, such as a solution which learns the typical behavior of the inhabitants to further reduce the energy consumption. With the process integration system, the new control algorithms can be plugged into the overall smart home environment fastly (e.g. in 5 minutes) either remotely or via a service person sent to the home.
Yet another exemplary application of the process integration system according to the embodiments of the invention can be the fast integration of a device. For example, the operator of an Internet of Things environment can provide a device provider with a development tool based on the process integration system to allow fast design of interfaces that make the devices compatible with the Internet of Things (IoT) environment. The development tool can access to a specific repository defining the existing data types in the IoT environment (i.e. Data-object type Repository 112). The development tool can be used to make outputs of a certain device type, such as a sensor, compatible with a specific environment by executing the following steps:
It should be noted that a similar approach can be used by developers of algorithms which are likely to be used in large numbers of processes (i.e. many processes using this algorithm will be created) to prepare such metadata and specific libraries prior to the integration. Each time a new process based on the same algorithm is created, the same software libraries 13, VLF 9 and Metadata OWL 14 files can be used to spawn a new proxy, such as a process proxy.
The process proxy 4 in combination with the Metadata file 14 and the variable linker file 9 thus represent a sort of a “high-level device driver”, which makes the device services available in a specific operational environment with specific description/definition of data sources and processing capabilities.
The skilled person will readily understood that the invention is not limited to Internet Of Thing applications and can apply to a wide range of fields, for example in defense and security fields where a large numbers of heterogeneous sensors and processing capabilities are to be integrated into coherent decision support systems or in large scale crisis management fields.
The process integration system according to the embodiments of the invention can thus allow developers of devices and algorithms to make their products compatible with the overall integration environment.
The described embodiments of invention more generally enable rapid integration of arbitrary processes into rich data environments. In particular, the invention allows providers of arbitrary algorithms and data sources (e.g. devices and databases) to dynamically embed arbitrary processing and data acquisition capabilities into information flows. Further, the integration process system according to the described embodiments of the invention allow for a fast and efficient description of the outputs of algorithms, devices and databases, such that they can easily be combined with other components in the overall distributed processing framework. Moreover, the integration process system allows an easy creation of information flows that feed the algorithms with suitable data, at low costs. It is adapted to a wide range of applications, such as Internet applications (Smart cities, Smart homes, Large scale crisis management, advanced solutions to surveillance in defense and civilian domains and security, etc.).
Although the embodiments of the present invention have been described in detail, it should be understood that various changes and substitutions can be made therein without departing from spirit and scope of the inventions as defined by the appended claims. Variations described for the present invention can be realized in any combination desirable for each particular application. Thus particular limitations, and/or embodiment enhancements described herein, which may have particular advantages to a particular application need not be used for all applications. Also, not all limitations need be implemented in methods, systems and/or apparatus including one or more concepts of the present invention.
Embodiments of the present invention can take the form of an embodiment containing both software and/or hardware elements.
In one embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, etc.
Furthermore, the invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system. For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
Number | Date | Country | Kind |
---|---|---|---|
13199900.5 | Dec 2013 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2014/077622 | 12/12/2014 | WO | 00 |