Industrial asset control systems that operate physical systems (e.g., associated with power turbines, jet engines, locomotives, autonomous vehicles, etc.) are increasingly connected to the Internet. As a result, these control systems may be vulnerable to threats, such as cyber-attacks (e.g., associated with a computer virus, malicious software, etc.), that could disrupt electric power generation and distribution, damage engines, inflict vehicle malfunctions, etc. Current methods primarily consider threat detection in Information Technology (“IT,” such as, computers that store, retrieve, transmit, manipulate data) and Operation Technology (“OT,” such as direct monitoring devices and communication bus interfaces). Cyber-threats can still penetrate through these protection layers and reach the physical “domain” as seen in 2010 with the Stuxnet attack. Such attacks can diminish the performance of an industrial asset and may cause a total shut down or even catastrophic damage to a plant. Currently, Fault Detection Isolation and Accommodation (“FDIA”) approaches only analyze sensor data, but a threat might occur even in other types of threat monitoring nodes such as actuators, control logical(s), etc. Also note that FDIA is limited only to naturally occurring faults in one sensor at a time. FDIA systems do not address multiple simultaneously occurring faults as they are normally due to malicious intent. Note that quickly detecting an attack may be important when responding to threats in an industrial asset (e.g., to reduce damage, to prevent the attack from spreading to other assets, etc.). Making such a detection quickly (e.g., at substantially sample speed), however, can be a difficult task. Cyber-physical systems often have an overwhelmingly large number of physical measurements, which makes attack detection directly based on the physical measurements challenging. It would therefore be desirable to abstract underlying characteristics of a cyber-physical system in an automatic, rapid, and accurate manner.
According to some embodiments, a data source may provide a plurality of time-series measurements that represent normal operation of a cyber-physical system (e.g., in substantially real-time during online operation of the cyber-physical system). A stateful, nonlinear embedding computer may receive the plurality of time-series measurements and execute stateful, nonlinear embedding to project the plurality of time-series measurements to a lower-dimensional latent variable space. In this way, redundant and irrelevant information may be reduced, and temporal and spatial dependence among the measurements may be captured. The output of the stateful, nonlinear embedding may be utilized to automatically identify underlying system characteristics of the cyber-physical system. In some embodiments, a stateful generative adversarial network may be used to achieve stateful embedding. According to some embodiments, an off-line model training platform may train a stateful, nonlinear embedding model prior to a current on-line operation of the cyber-physical asset.
Some embodiments comprise: means for receiving, at a stateful, nonlinear embedding computer from a data source, a plurality of time-series measurements that represent normal operation of a cyber-physical system; means for executing stateful, nonlinear embedding to project the plurality of time-series measurements to a lower-dimensional latent variable space such that redundant and irrelevant information are reduced and temporal and spatial dependence among the measurements are captured; and means for utilizing output of the stateful, nonlinear embedding to automatically identify underlying system characteristics of the cyber-physical system.
Some technical advantages of some embodiments disclosed herein are improved systems and methods to abstract underlying characteristics of a cyber-physical system in an automatic, rapid, and accurate manner.
In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of embodiments. However, it will be understood by those of ordinary skill in the art that the embodiments may be practiced without these specific details. In other instances, well-known methods, procedures, components and circuits have not been described in detail so as not to obscure the embodiments.
With the advent of Internet of things (“IoT”), cyber-physical systems have become increasingly complex. The complexity of cyber-physical systems and the heterogeneity of cyber-physical system components pose significant challenges to system health and security. Accurately characterizing a cyber-physical system under normal operation conditions may be a substantial step towards successful development of system health and security. Abstracting characteristics, or simply characterization, of a cyber-physical system is a process of distilling salient information based on a large number of heterogeneous system measurements from the sensors, the actuators, and the control of the cyber-physical system. Since these systems can be highly dynamic (e.g., non-stationary) in nature, the raw system measurements may contain not only strong temporal dependence but also strong spatial dependence among many measurements. Traditional characterization methods, which are generally linear (e.g., Principal Component Analysis (“PCA”)) might not be able to adequately capture the underlying system characteristics.
Some embodiments described herein provide a system and method for abstracting underlying system characteristics of cyber-physical systems. Specifically, a system and method may perform abstraction by stateful, nonlinear embedding of real-time system measurements, such as sensors, actuators and control, of the cyber-physical system. Given the fact that such systems have a complex and dynamic nature and system measurements may be noisy, some embodiments may effectively eliminate redundant and irrelevant information from the noisy measurements, while still capturing complex temporal dependence among the measurements (thus preserving the underlying system characteristics). The abstracted characteristics can be further utilized as features or signatures for building effective predictive models, e.g., attack or fault detection, attack localization, and attack neutralization.
Some embodiments introduce a deep leaning-based stateful, nonlinear embedding scheme that enables powerful and effective characterization of cyber-physical systems. Such an approach may leverage information from diverse measurements and be highly scalable to a wide range of cyber-physical systems.
Some embodiments described herein may characterize a cyber-physical system based on the real-time system measurements from sensors, actuators, and control of the physical system. The measurements may be in the form of multivariate time series. Since the system may be highly dynamic (non-stationary) in nature, system measurements used for system characterization may exhibit strong temporal dependence (i.e., the current behavior of the system is dependent on a history of past behavior). Additionally, there exists high dependence among the system measurements (i.e., spatial dependency). Note that effectively and efficiently handling both temporal and spatial dependences during system characterization may be important.
Information from the data source may be provided to a stateful, nonlinear embedding computer 150 that generates an output 160 (e.g. associated with underlying system characteristics of the cyber-physical system). In this way, redundant and irrelevant information may be reduced. Moreover, temporal and spatial dependence among the measurements may be captured.
As used herein, devices, including those associated with the system 100 and any other device described herein, may exchange information via any communication network which may be one or more of a Local Area Network (“LAN”), a Metropolitan Area Network (“MAN”), a Wide Area Network (“WAN”), a proprietary network, a Public Switched Telephone Network (“PSTN”), a Wireless Application Protocol (“WAP”) network, a Bluetooth network, a wireless LAN network, and/or an Internet Protocol (“IP”) network such as the Internet, an intranet, or an extranet. Note that any devices described herein may communicate via one or more such communication networks.
The stateful, nonlinear embedding computer 150 may store information into and/or retrieve information from various data sources, such as the data source 110. The various data sources may be locally stored or reside remote from the stateful, nonlinear embedding computer 150 (which might also be associated with, for example, offline or online learning). Although a single stateful, nonlinear embedding computer 150 is shown in
A user may access the system 100 via one of the monitoring devices (e.g., a Personal Computer (“PC”), tablet, smartphone, or remotely through a remote gateway connection) to view information about and/or manage information in accordance with any of the embodiments described herein. In some cases, an interactive graphical display interface may let a user define and/or adjust certain parameters (e.g., time-series measurement properties or data about the cyber-physical system) and/or provide or receive automatically generated recommendations or results from the stateful, nonlinear embedding computer 150 (as well as other devices).
Note that nonlinear embedding may project a high-dimensional input to a lower-dimensional latent variable space (or hidden states) such that the system characteristics are maintained in low dimensional latent space (which inherently takes care of spatial dependency).
For example,
At S210, a “stateful,” nonlinear embedding computer may receive, from a data source, a plurality of time-series measurements that represent normal operation of the cyber-physical system. As used herein, the term “stateful” may refer to a program or process designed to remember preceding events. According to some embodiments, the plurality of time-series measurement may be received in substantially real time during on-line operation of the cyber-physical system. At least one of the time-series measurements might be associated with, for example, a sensor monitoring node (e.g., measuring an attribute associated with the cyber-physical system), an actuator monitoring node, and/or a control monitoring node.
At S220, stateful, nonlinear embedding may be executed to project the plurality of time-series measurements to a lower-dimensional latent variable space such that redundant and irrelevant information are reduced and temporal and spatial dependence among the measurements are captured. The stateful, nonlinear embedding might be associated with a deep neural network, an autoencoder, a variational autoencoder, a generative adversarial network, etc.
For example, the stateful, nonlinear embedding might augment a stateless, nonlinear embedding process by using a window of consecutive samples of the time-series measurements as a matrix input to the stateless, nonlinear embedding process. As another example, the stateful, nonlinear embedding might augment a stateless embedding process by using a first independent sample of the time-series measurements as a first vector input to the stateless embedding to receive a first output. The system may then use a second independent sample of the time-series measurements as a second vector input to the stateless embedding to receive a second output. Statistics of the first and second outputs may then be calculated with post-processing to obtain lower-dimensional latent variable space.
According to some embodiments, the stateful, nonlinear embedding is associated with a recurrent “autoencoder.” As used herein, the term “autoencoder” might refer to, for example, an artificial neural network used for unsupervised learning of efficient codes to learn a representation (encoding) for a set of data (e.g., to achieve dimensionality reduction). For example, the recurrent autoencoder might be implemented using a stateful “generative adversarial network.” As used herein, the phrase “generative adversarial network may refer to, for example, a class of artificial intelligence algorithms used in unsupervised machine learning implemented by a system of two neural networks contesting with each other in a zero-sum game framework. The stateful generative adversarial network could include, for example, a generator (e.g., having a recurrent neural network encoder and a recurrent neural network decoder) and a discriminator with a deep network. According to some embodiments, the generator is further associated with long short-term memory. Many of these techniques, in their original form, were not designed for dynamic system applications (that is, they are “stateless”).
At S230, the output of the stateful, nonlinear embedding may be utilized to automatically identify underlying system characteristics of the cyber-physical system. The identified underlying system characteristics might then be used, for example, to create a decision boundary for cyber-attack detection, fault detection, abnormality localization, abnormality neutralization, etc.
The stateful, nonlinear embedding computer 350 might implement, for example, an autoencoder to generate the latent representation 360.
min E(W,b,d′)=minW,b,d′ΣΣj=1p∥xj−gθ(fθ(xj))∥2
where xj corresponds to samples of data and P is equal to the number of samples.
Note that an autoencoder implementation may use the cross entropy error function instead of mean squared error. Moreover, an expected value may be required when using cross entropy:
minE(W,b,d′)=minW,b,d′E[L(x,z)]
where L(x,z) is the cross-entropy loss L(x,z) shown above.
Broadly speaking, there may be two categories of strategies to achieve stateful embedding. The first one is to augment existing stateless embedding to make it stateful. For example, instead of taking an independent sample (an input vector) as the input to the stateless embedding, a system might take a window of consecutive samples (a matrix) as the input to the embedding, enabling the resultant embedding to be temporal dependent. For example,
At S520, a stateless, nonlinear embedding process may be augmented by using a window of consecutive samples of the time-series measurements as a matrix input. For example,
This is the simplest strategy to make stateless embedding to be stateful. Note that a Denoising Auto-Encoder (“DAE”), a variant of autoencoding, may be used as the stateless, nonlinear embedding algorithm to extract salient features for anomaly detection applications. However, this strategy has several drawbacks. One of them is that it increases the number of input elements to the embedding algorithm, which results in a more complex network structure and makes the network learning more difficult. For example, given an original input comprising a 20-element vector and a window size of 50 samples, the required input would have 1000 (20*50) elements.
To address issue of increased input size, instead of taking a window of samples as the input to the stateless embedding algorithm, a system may take an independent sample as the input, which keeps the network structure unchanged. The system may then post-process the outputs of the stateless embedding to multiple consecutive inputs. Specifically, the system may obtain the latent representation or hidden states by calculating statistics of a window of stateless embedding outputs. For example,
At S720, stateful, nonlinear embedding may be achieved by using a first independent sample of the time-series measurements as a first vector input to receive a first output. At S730, the system may use a second independent sample of the time-series measurements as a second vector input to receive a second output. Statistics of the first and second outputs may be calculated with post-processing at S740 to obtain lower-dimensional latent variable space. For example,
Another category of strategies to achieve stateful embedding is to directly construct a stateful embedding algorithm or network, such as a recurrent autoencoder. According to some embodiments, a stateful generative adversarial network may be provided. Note that generative adversarial networks are a relatively new type of generative models that have been recently developed. As illustrated in
In such a stateless GAN 950, the generator 960 and discriminator 720 networks are feed-forward neural networks. To make a GAN stateful, embodiments may utilize recurrent types of networks, e.g. recurrent neural networks and/or long short-term memory for a generator network.
At S1010, a plurality of time-series measurements that represent normal operation of the cyber-physical system may be received. At S1020, stateful, nonlinear embedding may be achieved with a recurrent autoencoder, such as a stateful generative adversarial network. For example,
Appropriate stateful, nonlinear embedding for a particular cyber-physical system may be achieved in a number of different ways. For example,
Note that embodiments described herein may provide advantages such as: a novel method for abstracting cyber-physical system characteristics; handling both spatial and temporal dependences of the underlying system; and a generic method with a wide range of applications. For example, some embodiments might be associated with cyber-physical security techniques.
The embodiments described herein may be implemented using any number of different hardware configurations. For example,
The processor 1510 also communicates with a storage device 1530. The storage device 1530 may comprise any appropriate information storage device, including combinations of magnetic storage devices (e.g., a hard disk drive), optical storage devices, mobile telephones, and/or semiconductor memory devices. The storage device 1530 stores a program 1512 and/or a stateful, non-linear embedding engine 1514 for controlling the processor 1510. The processor 1510 performs instructions of the programs 1512, 1514, and thereby operates in accordance with any of the embodiments described herein. For example, the processor 1510 may receive a plurality of time-series measurements that represent normal operation of a cyber-physical system (e.g., in substantially real-time during online operation of the cyber-physical system). The processor 1510 may then execute stateful, nonlinear embedding to project the plurality of time-series measurements to a lower-dimensional latent variable space. In this way, redundant and irrelevant information may be reduced, and temporal and spatial dependence among the measurements may be captured. The output of the stateful, nonlinear embedding may be utilized, for example, by the processor 1510 to automatically identify underlying system characteristics of the cyber-physical system.
The programs 1512, 1514 may be stored in a compressed, uncompiled and/or encrypted format. The programs 1512, 1514 may furthermore include other program elements, such as an operating system, clipboard application, a database management system, and/or device drivers used by the processor 1510 to interface with peripheral devices.
As used herein, information may be “received” by or “transmitted” to, for example: (i) the stateful, nonlinear embedding platform 1500 from another device; or (ii) a software application or module within the stateful, nonlinear embedding platform 1500 from another software application, module, or any other source.
In some embodiments (such as the one shown in
Referring to
The measurement identifier 1602 may be, for example, a unique alphanumeric code identifying a node to be monitored (e.g., associated with a sensor). The time series of values 1604 may represent, for example, normal and/or abnormal data from a sensor or other monitoring node. The abstraction technique might indicate, for example, how characteristics of the system are being identified, such as by stateful, nonlinear estimation, post processing, a stateful generative adversarial network, etc. The initial latent representation 1608 may be used, in some approaches, to calculate the final representation 1610.
The following illustrates various additional embodiments of the invention. These do not constitute a definition of all possible embodiments, and those skilled in the art will understand that the present invention is applicable to many other embodiments. Further, although the following embodiments are briefly described for clarity, those skilled in the art will understand how to make any changes, if necessary, to the above-described apparatus and methods to accommodate these and other embodiments and applications.
Although specific hardware and data configurations have been described herein, note that any number of other configurations may be provided in accordance with embodiments of the present invention (e.g., some of the information associated with the databases described herein may be combined or stored in external systems). For example, although some embodiments are focused on industrial assets such as gas turbine generators, any of the embodiments described herein could be applied to other types of cyber-physical systems, such as dams, the power grid, military devices, etc. Moreover, note that some embodiments may be associated with a display of cyber-physical system data to an operator. For example,
The present invention has been described in terms of several embodiments solely for the purpose of illustration. Persons skilled in the art will recognize from this description that the invention is not limited to the embodiments described, but may be practiced with modifications and alterations limited only by the spirit and scope of the appended claims.
The present application claims the benefit of U.S. Provisional Patent Application No. 62/619,345 entitled “SYSTEM AND METHOD FOR ABSTRACTING CHARACTERISTICS OF CYBER-PHYSICAL SYSTEMS” and filed Jan. 19, 2018. The entire content of that application is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
62619345 | Jan 2018 | US |