Various embodiments are described herein that generally relate to systems, devices, and methods for decentralized federated learning-based security threat detection and reaction.
The following paragraphs are provided by way of background to the present disclosure. They are not, however, an admission that anything discussed therein is prior art or part of the knowledge of persons skilled in the art.
Various security systems exist for protecting individuals' personal security and digital privacy. Some advanced smart security systems can perform facial recognition using data from home security cameras or smart doorbells. In conventional smart security systems, however, data is typically sent to remote servers for analysis, creating data privacy concerns. In federated learning systems, system nodes are trained with local samples and exchange machine learning parameters with other nodes in the system or with a central server, reducing or eliminating the need for local data to be sent externally. However, current systems can be unreliable and ill-equipped to respond to security threats. Blockchain technology is sometimes integrated within federated learning systems to improve reliability and trustworthiness; for example, federated learning with blockchain has been used in vehicular communication networking. However, current systems can be slow and have high memory requirements.
There is a need for systems, devices and methods for security threat detection and reaction that address the challenges and/or shortcomings described above.
Various embodiments of a system, device and method for decentralized federated learning-based security threat detection and reaction, and computer products for use therewith, are provided according to the teachings herein.
According to one aspect of the present disclosure, there is provided a device of a plurality of devices in a decentralized federated learning security system. The device comprises one or more local AI models each configured to receive inputs from one or more sensors and to be trained to make a prediction relating to events of an event type being sensed by the one or more sensors. The device also comprises one or more associated global AI models each configured to receive inputs from the one or more sensors and to make a prediction relating to events of an event type being sensed by the one or more sensors, wherein each of the one or more global AI models relating to a given event type is comprised of an aggregation of local AI models from the plurality of devices relating to the given event type. The device also comprises one or more processors. The one or more processors are configured to train a local AI model relating to an associated global AI model using new inputs received from the one or more sensors when inputting the new inputs into the associated global AI model fails to result in a prediction having threshold characteristics, thereby creating a newly trained local AI model, and to send the newly trained local AI model to other devices of the plurality of devices. The device also comprises a memory containing newly trained local AI models of the plurality of devices.
In some examples, the one or more processors are further configured to receive a newly trained local AI model associated with a particular event type from another device of the plurality of devices. The one or more processors are also further configured to validate the received newly trained local AI model by: selecting a plurality of the most recent local AI models associated with the particular event type from the memory, aggregating the selected local AI models and the received newly trained AI model into an aggregated AI model, detecting anomalies in the aggregated AI model, and sending a validation signal associated with the newly trained AI model to a set of devices of the plurality of devices if no anomaly is detected.
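By way of illustration only, the validation flow described above may be sketched as follows, where models are represented as flat lists of parameters and the norm-deviation test is an assumed placeholder for a real anomaly detector (not part of the disclosure):

```python
# Illustrative sketch only: a candidate model is checked against the
# most recently stored models; the anomaly test compares the candidate's
# parameter norm with the norms of the recent models (assumed heuristic).
import math

def param_norm(model):
    return math.sqrt(sum(p * p for p in model))

def detect_anomaly(candidate, recent_models, tolerance=3.0):
    """Flag the candidate model if its parameter norm deviates strongly
    from the norms of recently stored models for the same event type."""
    norms = [param_norm(m) for m in recent_models]
    mean = sum(norms) / len(norms)
    std = math.sqrt(sum((x - mean) ** 2 for x in norms) / len(norms))
    # floor the spread so near-identical recent models do not over-trigger
    return abs(param_norm(candidate) - mean) > tolerance * max(std, 0.1 * mean)

def validate(candidate, recent_models):
    """Return True (i.e., emit a validation signal) if no anomaly is found."""
    return not detect_anomaly(candidate, recent_models)
```

A device would run `validate` on each received model and broadcast a validation signal only on success.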
In some examples, the one or more processors are further configured to, upon receipt of a validation signal from a device of the plurality of devices: store a newly trained model associated with the validation signal to the memory, select a plurality of the most recent local AI models associated with the particular event type from the memory, and aggregate the selected local AI models and the received newly trained AI model into a new global AI model.
In some examples, the step of aggregating the selected local AI models includes summing the local AI models.
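By way of illustration only, summing-based aggregation may be sketched as a weighted sum over corresponding parameters, which reduces to federated averaging when each weight is 1/n (representing a model as a flat parameter list is an assumption made for this sketch):

```python
# Illustrative sketch only: each local model is a list of parameters of
# the same shape; aggregation is a weighted sum over corresponding
# parameters across models.
def aggregate_models(models, weights=None):
    """Combine local models into one aggregated model.

    With the default weights of 1/n this is federated averaging; with
    unit weights it is a plain sum of the local models."""
    if weights is None:
        weights = [1.0 / len(models)] * len(models)
    return [sum(w * p for w, p in zip(weights, params))
            for params in zip(*models)]
```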
In some examples, validation of the newly trained model is further performed using a consensus mechanism.
In some examples, the consensus mechanism is a proof-of-stake consensus mechanism.
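By way of illustration only, one common ingredient of a proof-of-stake scheme, stake-weighted selection of a validating node, may be sketched as follows (the shared `seed`, e.g., derived from a previous block, and the stake table are illustrative assumptions):

```python
# Illustrative sketch only: choose a validator with probability
# proportional to its stake, using shared randomness so that every
# node arrives at the same choice.
import random

def select_validator(stakes, seed):
    """stakes: mapping of node id -> stake; seed: shared randomness."""
    rng = random.Random(seed)
    nodes = sorted(stakes)  # stable ordering across nodes
    return rng.choices(nodes, weights=[stakes[n] for n in nodes], k=1)[0]
```

Because the choice is deterministic for a given seed, all honest nodes agree on which device validates a proposed model.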
In some examples, the device further comprises a local interpretation module configured to interpret predictions made by the one or more global AI models using local information relevant to the user of the edge device in order to produce a threat assessment.
In some examples, the threat assessment comprises a determination of one of three or more threat levels.
In some examples, the determination of the one of three or more threat levels is based at least in part on the threshold characteristics.
In some examples, the threat assessment is used to perform an action by the system.
In some examples, the action is one of: notifying a user and/or owner of the system, notifying the police, doing nothing, and sounding an alarm.
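By way of illustration only, a mapping from a prediction's confidence to one of three threat levels and a corresponding action may be sketched as follows (the thresholds, level names, and actions are illustrative assumptions, not part of the disclosure):

```python
# Illustrative sketch only: thresholds and the level-to-action mapping
# are assumptions; any number of levels and actions could be configured.
def assess_threat(confidence, recognized):
    """Map a prediction's confidence and recognition outcome to one of
    three threat levels and a corresponding action."""
    if recognized and confidence >= 0.9:
        return "low", "do nothing"
    if confidence >= 0.6:
        return "medium", "notify owner"
    return "high", "sound alarm and notify police"
```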
In some examples, the device comprises one or more of the one or more sensors.
In some examples, the threshold characteristics include a confidence level related to the prediction.
In some examples, the one or more sensors includes a video camera, and the event type is associated with the detection of an optical or auditory characteristic of the video feed.
In some examples, the detection of an optical or auditory characteristic includes facial recognition.
In some examples, the one or more sensors includes a packet analyzer, and the event type is associated with packet features.
In some examples, the packet features include one or more of packet source address, packet destination address, type of service, total length, protocol, checksum, and data/payload.
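By way of illustration only, several of the listed packet features can be extracted from the fixed 20-byte IPv4 header with standard-library parsing (the sample header bytes used below are fabricated for illustration):

```python
# Illustrative sketch only: parse the fixed 20-byte IPv4 header into
# the feature fields listed above (options and payload handling omitted).
import socket
import struct

def packet_features(header: bytes) -> dict:
    (ver_ihl, tos, total_len, _ident, _frag,
     _ttl, proto, checksum, src, dst) = struct.unpack("!BBHHHBBH4s4s",
                                                      header[:20])
    return {
        "source": socket.inet_ntoa(src),
        "destination": socket.inet_ntoa(dst),
        "type_of_service": tos,
        "total_length": total_len,
        "protocol": proto,       # e.g., 6 = TCP, 17 = UDP
        "checksum": checksum,
    }
```

A packet-analyzer sensor would feed such feature dictionaries into the relevant local and global AI models.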
In some examples, the one or more sensors include an Internet of Things (IoT) sensor, and the event type is associated with signals received from the IoT sensor.
In some examples, the memory comprises a blockchain containing newly trained local AI models of the plurality of devices.
In some examples, each block in the blockchain comprising a newly trained local machine learning model of a given device contains a pointer to the immediately preceding version of the newly trained machine learning model of the given device.
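By way of illustration only, such a block may be sketched as a record carrying both the usual link to the preceding block and a pointer (here, a hash) to the same device's immediately preceding model version (the field names are illustrative assumptions):

```python
# Illustrative sketch only: a block holds a device's newly trained
# model plus two links -- one to the previous block in the chain and
# one to that device's previous model version.
import hashlib
import json

def block_hash(block: dict) -> str:
    return hashlib.sha256(
        json.dumps(block, sort_keys=True).encode()).hexdigest()

def make_block(device_id, model_params, prev_block_hash, prev_model_hash):
    return {
        "device_id": device_id,
        "model": model_params,
        "prev_block": prev_block_hash,          # chain link
        "prev_model_version": prev_model_hash,  # per-device model lineage
    }
```

The per-device lineage pointer lets any node walk back through the versions of a single device's model independently of chain order.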
According to another aspect of the present disclosure, there is provided a method of operating a device of a plurality of devices in a decentralized federated learning security system. Each device comprises one or more local AI models each configured to receive inputs from one or more sensors and to be trained to make a prediction relating to events of an event type being sensed by the one or more sensors, one or more associated global AI models each configured to receive inputs from the one or more sensors and to make a prediction relating to events of an event type being sensed by the one or more sensors, wherein each of the one or more global AI models relating to a given event type is comprised of an aggregation of local AI models from the plurality of devices relating to the given event type, and a memory containing newly trained local AI models of the plurality of devices. The method comprises training a local AI model relating to an associated global AI model using new inputs received from the one or more sensors when inputting the new inputs into the associated global AI model fails to result in a prediction having threshold characteristics, thereby creating a newly trained local AI model. The method also comprises sending the newly trained local AI model to other devices of the plurality of devices.
In some examples, the method further comprises receiving a newly trained local AI model associated with a particular event type from another device of the plurality of devices. The method also comprises validating the received newly trained local AI model by: selecting a plurality of the most recent local AI models associated with the particular event type from the memory, aggregating the selected local AI models and the received newly trained AI model into an aggregated AI model, detecting anomalies in the aggregated AI model, and sending a validation signal associated with the newly trained AI model to a set of devices of the plurality of devices if no anomaly is detected.
In some examples, the method further comprises, upon receipt of a validation signal from a device of the plurality of devices: storing a newly trained model associated with the validation signal on the memory, selecting a plurality of the most recent local AI models associated with the particular event type from the memory, and aggregating the selected local AI models and the received newly trained AI model into a new global AI model.
In some examples, aggregating the selected local AI models includes summing the local AI models.
In some examples, validation of the newly trained model is further performed using a consensus mechanism.
In some examples, the consensus mechanism is a proof-of-stake consensus mechanism.
In some examples, the method further comprises interpreting predictions made by the one or more global AI models using local information relevant to the user of the edge device in order to produce a threat assessment.
In some examples, the threat assessment comprises a determination of one of three or more threat levels.
In some examples, the determination of the one of three or more threat levels is based at least in part on the threshold characteristics.
In some examples, the threat assessment is used to perform an action by the system.
In some examples, the action is one of: notifying a user and/or owner of the system, notifying the police, doing nothing, and sounding an alarm.
In some examples, the threshold characteristics include a confidence level related to the prediction.
In some examples, the one or more sensors includes a video camera, and the event type is associated with the detection of an optical or auditory characteristic of the video feed.
In some examples, the detection of an optical or auditory characteristic includes facial recognition.
In some examples, the one or more sensors includes a packet analyzer, and the event type is associated with packet features.
In some examples, the packet features include one or more of packet source address, packet destination address, type of service, total length, protocol, checksum, and data/payload.
In some examples, the one or more sensors include an Internet of Things (IoT) sensor, and the event type is associated with signals received from the IoT sensor.
In some examples, the memory comprises a blockchain containing newly trained local AI models of the plurality of devices.
In some examples, each block in the blockchain comprising a newly trained local machine learning model of a given device contains a pointer to the immediately preceding version of the newly trained machine learning model of the given device.
According to yet another aspect of the present disclosure, there is provided a decentralized federated learning security system comprising a plurality of devices as described above.
According to yet another aspect of the present disclosure, there is provided a decentralized federated learning security system comprising a plurality of devices configured to perform a method as described above.
Other features and advantages of the present application will become apparent from the following detailed description taken together with the accompanying drawings. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the application, are given by way of illustration only, since various changes and modifications within the spirit and scope of the application will become apparent to those skilled in the art from this detailed description.
For a better understanding of the various embodiments described herein, and to show more clearly how these various embodiments may be carried into effect, reference will be made, by way of example, to the accompanying drawings which show at least one example embodiment, and which are now described. The drawings are not intended to limit the scope of the teachings described herein. In the drawings:
Further aspects and features of the example embodiments described herein will appear from the following description taken together with the accompanying drawings.
Various embodiments in accordance with the teachings herein will be described below to provide an example of at least one embodiment of the claimed subject matter. No embodiment described herein limits any claimed subject matter. The claimed subject matter is not limited to devices, systems, or methods having all of the features of any one of the devices, systems, or methods described below or to features common to multiple or all of the devices, systems, or methods described herein. It is possible that there may be a device, system, or method described herein that is not an embodiment of any claimed subject matter. Any subject matter that is described herein that is not claimed in this document may be the subject matter of another protective instrument, for example, a continuing patent application, and the applicants, inventors, or owners do not intend to abandon, disclaim, or dedicate to the public any such subject matter by its disclosure in this document.
It will be appreciated that for simplicity and clarity of illustration, where considered appropriate, reference numerals may be repeated among the figures to indicate corresponding or analogous elements. In addition, numerous specific details are set forth in order to provide a thorough understanding of the embodiments described herein. However, it will be understood by those of ordinary skill in the art that the embodiments described herein may be practiced without these specific details. In other instances, well-known methods, procedures, and components have not been described in detail so as not to obscure the embodiments described herein. Also, the description is not to be considered as limiting the scope of the embodiments described herein. For example, while several of the embodiments described herein include the use of blockchain technology, it will be readily understood by those skilled in the art that the systems, devices and methods described herein could be implemented without using blockchain technology. Blockchain is an example of one technology that can be used to increase the security of peer-to-peer systems and communications, as described herein. As such, the systems described herein may distribute and store local machine learning models and/or other information via known peer-to-peer networking systems, architectures and protocols, as described in more detail elsewhere herein.
It should also be noted that the terms “coupled” or “coupling” as used herein can have several different meanings depending on the context in which these terms are used. For example, the terms coupled or coupling can have a mechanical or electrical connotation. For example, as used herein, the terms coupled or coupling can indicate that two elements or devices can be directly connected to one another or connected to one another through one or more intermediate elements or devices via an electrical signal, electrical connection, or a mechanical element depending on the particular context.
It should also be noted that, as used herein, the wording “and/or” is intended to represent an inclusive-or. That is, “X and/or Y” is intended to mean X or Y or both, for example. As a further example, “X, Y, and/or Z” is intended to mean X or Y or Z or any combination thereof.
It should be noted that terms of degree such as “substantially”, “about” and “approximately” as used herein mean a reasonable amount of deviation of the modified term such that the end result is not significantly changed. These terms of degree may also be construed as including a deviation of the modified term, such as by 1%, 2%, 5%, or 10%, for example, if this deviation does not negate the meaning of the term it modifies.
Furthermore, the recitation of numerical ranges by endpoints herein includes all numbers and fractions subsumed within that range (e.g., 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.90, 4, and 5). It is also to be understood that all numbers and fractions thereof are presumed to be modified by the term “about” which means a variation of up to a certain amount of the number to which reference is being made if the end result is not significantly changed, such as 1%, 2%, 5%, or 10%, for example.
It should also be noted that the use of the term “window” in conjunction with describing the operation of any system or method described herein is meant to be understood as describing a user interface, such as a graphical user interface (GUI), for performing initialization, configuration, or other user operations.
The example embodiments of the devices, systems, or methods described in accordance with the teachings herein are generally implemented as a combination of hardware and software. For example, the embodiments described herein may be implemented, at least in part, by using one or more computer programs, executing on one or more programmable devices comprising at least one processing element and at least one storage element (i.e., at least one volatile memory element and at least one non-volatile memory element). The hardware may comprise input devices including at least one of a touch screen, a keyboard, a mouse, buttons, keys, sliders, and the like, as well as one or more of a display, a printer, one or more sensors, and the like depending on the implementation of the hardware.
It should also be noted that some elements that are used to implement at least part of the embodiments described herein may be implemented via software that is written in a high-level procedural or object-oriented programming language. The program code may be written in C++, C#, JavaScript, Python, or any other suitable programming language and may comprise modules or classes, as is known to those skilled in object-oriented programming. Alternatively, or in addition thereto, some of these elements implemented via software may be written in assembly language, machine language, or firmware as needed. In either case, the language may be a compiled or interpreted language.
At least some of these software programs may be stored on a computer readable medium such as, but not limited to, a ROM, a magnetic disk, an optical disc, a USB key, and the like that is readable by a device having a processor, an operating system, and the associated hardware and software that is necessary to implement the functionality of at least one of the embodiments described herein. The software program code, when read by the device, configures the device to operate in a new, specific, and predefined manner (e.g., as a specific-purpose computer) in order to perform at least one of the methods described herein.
At least some of the programs associated with the devices, systems, and methods of the embodiments described herein may be capable of being distributed in a computer program product comprising a computer readable medium that bears computer usable instructions, such as program code, for one or more processing units. The medium may be provided in various forms, including non-transitory forms such as, but not limited to, one or more diskettes, compact disks, tapes, chips, and magnetic and electronic storage. In alternative embodiments, the medium may be transitory in nature such as, but not limited to, wire-line transmissions, satellite transmissions, internet transmissions (e.g., downloads), media, digital and analog signals, and the like. The computer useable instructions may also be in various formats, including compiled and non-compiled code.
While several of the specific embodiments described herein relate to the use of decentralized federated learning threat detection and reaction systems, devices and methods in one or more private residences, the skilled reader will readily understand that the systems, devices and methods described herein can also or instead be used in commercial locations (e.g., shopping malls, restaurants, gyms, banks, etc.), industrial locations (e.g., construction sites, mines, etc.), government locations (e.g., federal, provincial, state and municipal buildings and facilities, etc.), military locations (e.g., naval, army or air force bases and storage facilities, etc.), and corporate locations (e.g., office buildings, parking garages, R&D facilities, etc.).
The term “edge device” is used herein to describe a device that provides an entry point to a federated learning system such as those described herein. Some edge devices may also be nodes, as used herein.
The term “node” is used herein to describe a device that provides processing capability to a federated learning system such as those described herein. Some nodes may also be edge devices, as used herein.
The term “sensor” is used herein to describe any component that can sense, measure, record, capture or otherwise detect and/or characterize a phenomenon in order to produce a signal, value, code, or any other form of information as an input into a federated learning system such as those described herein. Non-limiting examples of a sensor include a magnetic switch, a thermometer, a clock, a pressure sensor, a humidity sensor, a camera, a microphone, a network analyzer, and a wireless analyzer.
The term “real-world event” is used herein to describe an event that happens in the physical world and that can be sensed, measured, recorded, captured or otherwise detected and/or characterized by a sensor. Non-limiting examples of real-world events include a person walking past a security camera, a noise, a door opening, and a packet being routed through a wireless or wired network.
The term “sensor event” is used herein to describe the generation of a signal, value, code, or any other form of information by a sensor, as a result of that sensor sensing, measuring, recording, capturing or otherwise detecting and/or characterizing a real-world event.
The term “system event” is used herein to describe a result of one or more sensor events being processed by a federated learning system such as those described herein. Non-limiting examples of system events include "green events", "yellow events" and "red events", as described in more detail elsewhere herein.

Federated learning is an Artificial Intelligence (AI) technique in which local nodes are trained with local samples and exchange information, such as trained local models, between themselves to generate a global model shared by all nodes in the network. Federated learning techniques may be categorized as centralized or decentralized. In a centralized federated learning setting, a central server maintains the global model and transmits an initial global model to training nodes selected by the central server. The nodes then train the received model locally using local data and send the trained models back to the central server, which receives and aggregates the model updates to generate an updated global model. The central server can generate the updated global model without accessing data from the local nodes, as the local nodes train the global model locally and can transmit the model trained on local data without transmitting the local data. The central server then sends the updated global model back to the nodes.
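By way of illustration only, one round of the centralized setting described above may be sketched with a toy one-parameter model, where only parameters, never local data, travel between the nodes and the server (the model, learning rate, and gradient step are illustrative assumptions):

```python
# Illustrative sketch only: each node fits a one-parameter model
# y = w * x to its local data with one gradient step, and the server
# averages the returned parameters into a new global model.
def local_update(global_model, local_data, lr=0.1):
    """One gradient step on the squared error of y = w * x."""
    w = global_model
    grad = sum(2 * x * (w * x - y) for x, y in local_data) / len(local_data)
    return w - lr * grad

def federated_round(global_model, node_datasets):
    """Server side of one round: collect local updates, then average."""
    updates = [local_update(global_model, data) for data in node_datasets]
    return sum(updates) / len(updates)
```

Repeating `federated_round` drives the shared parameter toward the value that fits all nodes' data, even though the server never sees that data.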
In a decentralized federated learning setting, the nodes communicate with each other to obtain the global model, without a central server. In federated learning, local models typically share the same global model architecture. Datasets on which the local nodes are trained may be heterogenous. For example, a network which uses a federated learning technique may include heterogenous clients which generate and/or transmit different types of data.
In accordance with the teachings herein, there are provided various embodiments for devices, systems and methods for security threat detection and reaction using a decentralized federated learning approach, and computer products for use therewith. In accordance with the teachings herein, there are also provided various embodiments for devices, systems, and methods for security threat detection and reaction using a blockchain-based decentralized federated learning approach, and computer products for use therewith. Additionally, at least some of the embodiments described herein may be implemented using a multi-layer decentralized federated learning approach. In at least one embodiment, a local interpreting module may constitute a layer of the multi-layer decentralized federated learning system.
Federated learning can increase data privacy when compared to conventional security threat detection, which often requires data to be transmitted to a remote server for analysis: only AI parameters or models need to be exchanged, and no local data needs to be transmitted externally.
The various embodiments described herein may be used for various types of security systems, including, but not limited to, facial recognition systems, biometric recognition systems, gesture recognition systems, gait recognition systems, voice recognition systems, network traffic pattern monitoring systems on a home network, security systems using Internet of Things (IoT) sensors, and home automation security systems combining two or more of the systems listed (e.g., combining a facial recognition system and a voice recognition system).
In at least one embodiment described herein, an edge device for use in a decentralized federated learning system includes one or more sensors, one or more local AI models, one or more associated global AI models, and one or more processors configured to train a local AI model related to an associated global AI model. The one or more local AI models may be configured to receive inputs from the one or more sensors and may be trained to make a prediction relating to sensor events. The sensor events may be of a sensor event type being sensed by the one or more sensors. The associated global AI models may receive inputs from the one or more sensors and may be configured to make a prediction relating to sensor events.
In at least one embodiment, the global AI models comprise an aggregation of local AI models. Each global AI model may be associated with a given sensor event type. The one or more local AI models may be trained in response to the global model failing to return a prediction that meets predetermined criteria established by a limiting function, as is described in more detail elsewhere herein. Training a local AI model may involve using inputs received from the one or more sensors. The trained local AI model may be sent to other edge devices.
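By way of illustration only, the training trigger described above may be sketched as follows, assuming the limiting function reduces to a simple confidence cutoff (the `global_predict` and `local_train` callables are hypothetical stand-ins for the device's global-model inference and local training routines):

```python
# Illustrative sketch only: train a local model on the new inputs
# exactly when the global model's prediction fails the threshold
# criterion; the trained model is then shared with peer devices.
def maybe_train_local(global_predict, local_train, new_inputs,
                      threshold=0.8):
    _label, confidence = global_predict(new_inputs)
    if confidence >= threshold:
        return None  # global model suffices; nothing to share
    return local_train(new_inputs)
```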
In at least one embodiment, a blockchain containing newly trained local models is used to update the decentralized federated learning global model. Further, a consensus approach may be used to update the blockchain, which can increase reliability and minimize inaccuracy. In particular, proposed new blocks may be validated through anomaly detection. As will be appreciated by the skilled reader, the distributed or decentralized nature of the systems, devices and methods described herein is at least in part achieved by way of providing a plurality of independent devices communicating via peer-to-peer communication systems and protocols in order to implement federated learning systems for security threat detection and reaction. While blockchain technologies are proposed as an exemplary technology for safe data storage and transmission, the systems described herein are not limited to the use of blockchain. Thus, other methods of storing and communicating data can additionally or alternatively be used.
Reference is first made to
Each local node 110-1, 110-2, 110-3, 110-n may correspond to a device that provides the processing capability to process data sensed by sensors and/or process the local models and global model(s). In some cases, a local node may be an edge device capable of generating and/or receiving signals, via, for example, one or more sensors, and of communicating signals including sensor data. For example, the edge device may be a door sensor, a motion sensor, a security camera, a doorbell camera, a smart lock, a desktop computer, a laptop computer, a smartphone, a tablet, a smartwatch, a smoke detector, or any other IoT device. Local nodes 110-1, 110-2, 110-3, 110-n may be devices of a similar type or may be devices of a different type. For example, local node 110-1 may be a doorbell camera while local node 110-2 may be a smart lock. The edge device may include one or more processors for processing the data generated and/or received by the sensors of the edge device. A sensor may be any type of device that can detect a change in its environment, for example, an optical sensor, a proximity sensor, a pressure sensor, a light sensor, a smoke sensor, a camera, or a packet analyzer. Local nodes may be grouped based on common properties. For example, each group of nodes may correspond to a collection of devices associated with a particular user of the system. The collection of devices may be devices of the same type, for example, security cameras, or may be of different types. Nodes within a group may communicate with each other via network 140 and/or via a local network and, in some cases, may share one or more common local models. For example, a home security camera and a doorbell camera may share one or more common local models.
In some cases, the edge device of a node may be in communication with an external device that includes one or more processors, for example, if the edge device has limited processing resources, and the processor or processors of the external device may process data generated and/or received by the node device. In some other cases, one or more of the edge devices may have sufficient computing resources to process the data generated and/or received by the edge device. In some cases, the external device may be a computing system dedicated to interacting and managing data received from the edge device. In other cases, the external device may be a computing system that can interact with and manage data received from multiple edge devices and may be a general-purpose computing device configured to perform processes unrelated to the node device. Alternatively, in some cases, the external device may be a calculation-performing node that is part of the network of nodes. For example, the system may include one or more calculation-performing nodes configured to process data received from two or more nodes belonging to the same group of nodes. It should be noted that the terms “edge device”, “node”, and “local node” may refer to the combination of the edge device and the external device, unless otherwise specified.
Reference is now made to
The processor unit 224 controls the operation of the device 220 and may include one processor that can provide sufficient processing power depending on the configuration and operational requirements of the device 220. For example, the processor unit 224 may include a high-performance processor or a GPU, in some cases. Alternatively, there may be a plurality of processors that are used by the processor unit 224, and these processors may function in parallel and perform certain functions.
The display 226 may be, but is not limited to, a computer monitor or an LCD display such as that for a tablet device or a desktop computer.
The processor unit 224 can also execute a graphical user interface (GUI) engine 254 that is used to generate various GUIs. The GUI engine 254 provides data according to a certain layout for each user interface and also receives data input or control inputs from a user. The GUI engine 254 then uses the inputs from the user to change the data that is shown on the current user interface or to change the operation of the device 220, which may include showing a different user interface.
The interface unit 230 can be any interface that allows the processor unit 224 to communicate with other devices within the system 100. In some embodiments, the interface unit 230 may include at least one of a serial bus or a parallel bus, and a corresponding port such as a parallel port, a serial port, a USB port, and/or a network port. For example, the network port can be used so that the processor unit 224 can communicate via the Internet, a Local Area Network (LAN), a Wide Area Network (WAN), a Metropolitan Area Network (MAN), a Wireless Local Area Network (WLAN), a Virtual Private Network (VPN), or a peer-to-peer network, either directly or through a modem, router, switch, hub, or other routing or translation device.
The I/O hardware 232 can include, but is not limited to, at least one of a microphone, a speaker, a keyboard, a mouse, a touch pad, a display device, or a printer, for example.
The power unit 236 can include one or more power supplies (not shown) connected to various components of the device 220 for providing power thereto as is commonly known to those skilled in the art.
The communication unit 234 includes various communication hardware for allowing the processor unit 224 to communicate with other devices. For example, the communication unit 234 includes at least one of a network adapter, such as an Ethernet or 802.11x adapter, a Bluetooth radio or other short-range communication device, or a wireless transceiver for wireless communication, for example, according to a cellular protocol such as CDMA, GSM, or GPRS, or a wireless networking standard such as IEEE 802.11a, 802.11b, 802.11g, or 802.11n.
The memory unit 238 stores program instructions for an operating system 240 and programs 242, and includes an input module 244, an output module 248, and a database 250. When any of the program instructions are executed by at least one processor of the processor unit 224 or a processor of another computing device, the at least one processor is configured for performing certain functions in accordance with the teachings herein. The operating system 240 is able to select which physical processor is used to execute certain modules and other programs. For example, the operating system 240 is able to switch processes around to run on different parts of the physical hardware that is used, e.g., using different cores within a processor, or different processors on a multi-processor server.
Reference is now made to
System 300 includes a plurality of nodes 310-1, 310-2, 310-3, of which three are shown for ease of illustration. The nodes may communicate with each other via a network 340. System 300 can include any number of nodes, each node including or corresponding to a device or to a group of devices, as described above with reference to
It will be recognized by those skilled in the art that while some of the embodiments disclosed herein relate to “face detection”, the systems, devices and methods described herein can alternatively or additionally detect other features. Such alternative or additional features include, but are not limited to, one or more of detecting the presence of a human or animal body and/or the geographic location of a human or animal body, detecting a clothing print and/or clothing colors, detecting human or animal body characteristics such as body height, body part shape, pattern of movement (e.g., gait), and/or voice. The detection of other optical or acoustic characteristics of a video feed would also be understood by the skilled reader to be within the scope of the present disclosure.
Each node or group of nodes may run one or more local models 334. For ease of illustration, a single local model is illustrated. However, it will be appreciated that each node may include more than one local model and local model 334 only constitutes an example local model. Each type of node device may be associated with one or more different types of sensor event, and each sensor event type may be associated with a different local model. A sensor event may be the capturing and analysis by a device of any type of real-world occurrence. For example, a face being detected by a camera may be a facial recognition event, a website being accessed and/or the type of website being determined may be an example of a cybersecurity event, and a motion sensor being triggered may be an example of an IoT home security event. For example, a home security camera may include a local model for facial recognition events. The local model 334 may be an AI model and may be configured to receive data 330 from the device, for example, from the one or more sensors on the device or the one or more sensors on a device associated with the device if the node is a calculation-performing device. The local model 334 may be trained to make a prediction. The type of prediction may depend on the input data received by the local model 334. For example, the local model 334 may return a prediction relating to a given event type. The event type may correspond to the event type associated with the sensor data received. For example, the local model associated with a home security camera may return a list of possible individuals captured in an image. As another example, an internet traffic monitoring model may predict whether a website accessed is “good” or “bad”, and a local model associated with a combination of IoT sensors may determine that a real-world event corresponds to an unknown system event. 
In some cases, the local model 334 may include features and parameters allowing identification of real-world events associated with sensor events. Each local model 334 may be associated with a local repository 331 that corresponds to a repository of captured events encountered by the node and/or that includes data received by the node.
The repository 331 may be stored on the device 332 associated with the node or may be external to the device 332 associated with each node but accessible by the node and each node in the group of nodes. For example, the repository may be stored on any type of data storage that can be remotely accessed by the node or the group of nodes, for example, a network attached storage (NAS). The repository may contain snapshots of sensor events containing information about a sensor event encountered by the node. For example, as will be described in further detail below with reference to
As described herein, the system 300 may be configured to detect real-world events and to categorize and/or process security threats associated with the real-world events, labelling these sensor events as green, red, or yellow system events. These labels should be interpreted as being non-limiting unless stated otherwise. A green event represents an event that poses a relatively low threat or no threat. A red event represents an event that poses a relatively high threat. A yellow event represents an unknown threat. The system 300 may categorize, represent, encode, or store green, red, and yellow events in a manner that allows them to be communicated within the system and recognized by other parts of the system, or by devices external to the system, as having their corresponding properties. The system 300 may use more or fewer labels as required.
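By way of a non-limiting illustration, the three-label scheme described above may be sketched as follows; the label names, the confidence threshold value, and the function signature are illustrative assumptions rather than a prescribed implementation:

```python
from enum import Enum

class SystemEvent(Enum):
    """Illustrative three-label scheme; a deployment may use more or fewer labels."""
    GREEN = "green"    # relatively low threat or no threat
    RED = "red"        # relatively high threat
    YELLOW = "yellow"  # unknown threat

def categorize(confidence: float, is_threat: bool, threshold: float = 0.9) -> SystemEvent:
    """Assign a system-event label from a (hypothetical) prediction confidence.

    If the global model cannot identify the event with at least `threshold`
    confidence, the event is unknown and labelled YELLOW; otherwise local
    interpretation decides between GREEN and RED.
    """
    if confidence < threshold:
        return SystemEvent.YELLOW
    return SystemEvent.RED if is_threat else SystemEvent.GREEN
```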
When a yellow system event is encountered, as will be described in further detail below with reference to the global model, the local repository 331 may be used to train the local model 334. For example, when a sensor event is determined to be a yellow system event, the local model 334 may be trained to recognize the sensor event such that, if the sensor event is encountered again, the system 300 may determine that the sensor event has been previously encountered, corresponding to a green or red system event. The local repository 331 may contain parameters that allow a prediction to be made, which may contribute to the classification of the sensor events. Training the local model 334 may involve extracting features from the sensor event such that when the event is subsequently encountered, the event is recognized. The features learned in training, which allow future recognition of the sensor event, may be used to update the local model and, eventually, the global model, through processes described in more detail elsewhere herein.
A global model 336 is an AI model distributed across all nodes in the network. As with the local model 334, a single global model 336 is shown for ease of illustration. However, each device may include one or more global AI models, depending on the type of device. Accordingly, nodes N2 310-2 and N3 310-3 run the same global model 336 as node N1 310-1. In some cases, each type of node device may be associated with one or more different types of events, and each event type may be associated with a different global model. In other cases, each global model may be associated with multiple event types.
In some embodiments, the one or more global models 336 may be initialized using publicly available datasets before being trained and updated by the nodes in the network. When a node joins an existing network, the node may download the current local models of other nodes in order to establish its own initial local model. Additionally, or alternatively, the publicly available initialization dataset relating to the node device type may be transmitted to the node for use in establishing its own initial local model. In some embodiments, the node may download a blockchain containing the local models of the nodes in the network, construct its own local model from the initialization dataset, and then submit a new block to the blockchain containing the node's newly trained local model. As the node encounters new sensor events, the local model of the node is updated, as described previously. The global model 336 may be stored by the node device 310-1. The node device can use the global model 336 to make a prediction relating to the sensor event based on data 330 received from the node device. The data 330 received from the node device may be preprocessed before being inputted into the global model. For example, the data may be processed to remove excess data, to produce data of a format suitable for the global model 336, to augment the data set to create additional training data, to reorder the data, or to window the data.
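One possible sketch of the node-joining process described above, with the blockchain represented as a simple list of blocks; all field names and data structures are illustrative assumptions:

```python
def join_network(blockchain, public_init_weights):
    """Sketch of a new node joining an existing network.

    The node downloads the chain of peer local models, constructs its own
    initial local model from a publicly available initialization set, and
    submits a new block containing that model to the chain.
    """
    # Current local models of the other nodes, read from the downloaded chain.
    peer_models = [block["model"] for block in blockchain]
    # Build the node's initial local model from the public initialization data.
    my_model = {"weights": list(public_init_weights)}
    # Submit a new block holding the newly constructed local model.
    new_block = {"node_id": len(blockchain), "model": my_model,
                 "prev_block_for_node": None}  # no earlier version of this node's model
    blockchain.append(new_block)
    return my_model, peer_models
```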
Data 330 from the node device 332 may be inputted into the global model 336, and the global model 336 may return a result 338. For example, the global model 336 may be configured to return a prediction. The type of prediction is dependent on the input data 330 received by the global model. For example, each global model 336 may return a prediction relating to a given event type, based on the event type associated with the sensor data 330 received from the node device. In some cases, the prediction may correspond to an identification of the sensor event or a real-world event associated with the sensor event. For example, in response to receiving an image of a face, the global model 336 associated with facial recognition type events may identify the person shown in the image.
In at least one embodiment, the result 338 may be interpreted by the local interpretation module 342, as is described in more detail elsewhere. By configuring each node with the global model 336, sensor events can be processed locally by each node, limiting the transfer of private data away from the node.
Each global model 336 may correspond to a sum or an aggregation of the local models 334-1, 334-2, 334-3 of each node of an event type. Accordingly, the global model 336 may be stored by the node as a collection of local models 334-1, 334-2, 334-3. In some cases, the global model 336 may include the current local models of the local nodes and previous versions of the local models of the local nodes. Previous versions of the local models may be retained, for example, in the event that a more current version of the local model is corrupted or otherwise damaged. The system 300 may be configured to retain a predefined number of previous versions.
In some cases, the sum may be a weighted sum and the weight allocated to each node may be based on a measure of the trustworthiness of the node. For example, nodes which have processed more system events, or which have processed more system events within a defined time period may be assigned a higher weight. As another example, nodes may be ranked by age and older nodes may be assigned a higher weight. As another example, nodes may be assigned a trustworthiness score by an evaluator, and nodes with a higher trustworthiness score may be assigned a higher weight. By aggregating local models, the global model 336 can leverage knowledge from nodes across the network, allowing each node to make a prediction relating to a sensor event that may not have been previously encountered by the node.
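A minimal sketch of such a weighted aggregation, assuming each local model has been reduced to a flat parameter vector and that the per-node trust weights (e.g., events processed, node age, or an evaluator-assigned score) have already been computed:

```python
def aggregate_global(local_models, trust_weights):
    """Weighted aggregation of per-node local model parameters into a global
    model, as one possible realization of the weighted sum described above.

    local_models: list of equal-length parameter vectors, one per node.
    trust_weights: hypothetical per-node trustworthiness measures.
    """
    total = sum(trust_weights)
    n_params = len(local_models[0])
    # Each global parameter is the trust-weighted mean of the nodes' parameters.
    return [
        sum(w * model[i] for model, w in zip(local_models, trust_weights)) / total
        for i in range(n_params)
    ]
```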
The global model 336 may be updated when the global model 336 fails to return a prediction with sufficient confidence. For example, the global model 336 may fail to return a prediction with sufficient confidence when a new sensor event, which has not been previously encountered by the nodes in the network, is encountered by a node in the network. In other words, the global model 336 may be updated when a yellow event, which will be described in further detail with reference to
In at least one embodiment, each node may additionally include a local interpretation module 342. The local interpretation module 342 can be configured to receive a result 338 from the global model and interpret the result 338 using locally relevant parameters. For example, the local interpretation module 342 may be a matrix that associates results with specific categories, actions, and/or responses. Table 1 shows a simplified example of a local interpretation matrix for a system of security cameras associated with a user.
As shown in Table 1, each system event (Red, Green, and Yellow) may be associated with a different action (Do Nothing, Unlock Door, Notify Owner, Sound Alarm, Notify Police) depending on the location being monitored by the edge device (Street, Yard, Door). As such, the local interpretation layer provides flexibility and personalization of system responses to system events determined by the global AI models.
The interpretation of the result may be based on parameters or preferences defined by the user. These parameters or preferences may be predefined by the user or may be learned by the local interpretation module 342 based at least in part on the user's predefined preferences and/or on the user's previous responses to sensor events and/or system events. In some cases, the local interpretation module 342 may assign a security category to the event, based on the result of the global model 336. For example, the local interpretation module 342 may assign system events into green or red categories, as will be described in further detail below with reference to
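A simplified local interpretation matrix in the spirit of Table 1 might be sketched as follows; the specific (event, location) → action entries are illustrative assumptions, since Table 1's contents are not reproduced here:

```python
# Hypothetical local interpretation matrix: (system event, location) -> action.
INTERPRETATION_MATRIX = {
    ("green", "door"):    "Unlock Door",
    ("green", "yard"):    "Do Nothing",
    ("green", "street"):  "Do Nothing",
    ("red", "door"):      "Sound Alarm",
    ("red", "yard"):      "Notify Police",
    ("red", "street"):    "Notify Owner",
    ("yellow", "door"):   "Notify Owner",
    ("yellow", "yard"):   "Notify Owner",
    ("yellow", "street"): "Do Nothing",
}

def interpret(system_event: str, location: str) -> str:
    """Map a labelled system event to a user-specific action; combinations
    not covered by the matrix default to notifying the owner."""
    return INTERPRETATION_MATRIX.get((system_event, location), "Notify Owner")
```

In practice, these entries would be populated from user-defined preferences or learned from the user's previous responses, as described above.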
In some embodiments, the local interpretation module 342 may also recommend an action, for example, based on actions taken by other nodes in the system.
As will be appreciated by the skilled reader, while the present example describes green, red and yellow events, other schemes for categorizing events and levels of security threat may readily be used within the scope of the systems, methods and devices disclosed herein.
As described above, the local interpretation module 342 may be configured to assign a category to the result 338 that is output by the global model 336. Green events correspond to events that are known and identifiable by the global model 336 and that are associated with a positive outcome or a low security threat, based, for example, on user-defined parameters. Red events correspond to events that are known and identifiable by the global model 336 and that are associated with a negative outcome or a high security threat. Both red and green events are associated with events that have been previously encountered by any node in the system 300. Because red and green events are events which are known by the global model 336, red and green events typically do not involve updates to the global model 336.
Green system events, for example, may correspond to sensor events that have been identified by the local interpretation module 342 as not posing a security threat. For example, green events may correspond to events that have been cleared by the user associated with the node or group of nodes. Using the example of facial recognition, a green event may correspond to a family member being detected by a security camera belonging to the user. Red events may correspond to events that pose or may potentially pose a safety threat. Red system events may correspond to events that have been specified by the user associated with the node or group of nodes as dangerous or causing disturbance. For example, using the example of facial recognition, a red event may correspond to the detection of a person that has been identified by the user as disruptive. As another example, in the case of cybersecurity, a red event may correspond to the detection, analysis and categorization of an attempt to access a fraudulent or nefarious website.
Yellow system events correspond to events for which the global model 336 is unable to return a prediction with sufficient certainty. A yellow event, for example, may correspond to a sensor event that has not been previously encountered by any node of the system 300, and with which, accordingly, no action is associated, or to an event that cannot be identified by the global model 336 with sufficient certainty to determine whether the event has been previously encountered. When a yellow event is encountered, a new record representative of the event may be created by the node 310-1. The local model 334 may be trained using the data that resulted in a yellow event being identified to determine parameters or features that allow future recognition of the event. When the event is subsequently re-encountered, the system 300 may associate the new event with the existing record.
In some cases, when a sensor event is determined to be a yellow system event, the event may be forwarded to the user device 312, and the user device 312 may request an input from a user. Alternatively, or in addition thereto, the user preferences defined by the user may indicate a set of actions to be taken when a yellow event is encountered. For example, upon detection of a yellow event, the system 300 may transmit a notification to the user device 312.
In at least one embodiment, the determination of a green or red event as opposed to a yellow event may be based on the global model 336, while determining whether a given event is a red or green event may be dependent on the local interpretation module 342 of the local node.
In at least one embodiment, the local models constituting the global model 336 may be stored in a blockchain 344, each block corresponding to a local model. In other embodiments, only the differences between a newly trained local model and its previous version are stored in each new block. The entire blockchain 344 may be stored on the local node device 332, and the local models 334 may be retrieved by the processor of the device and aggregated or summed to generate the global model 336 when sensor data is received. Upon detection of a yellow event, a training process may be performed to update the local model 334, and the global model 336 may be updated, as will be described in further detail below, with reference to
As shown in
Accordingly, in at least one embodiment, the size of the blockchain may be periodically reduced/pruned. In such embodiments, outdated versions of local models may be discarded, for example, when a new version of a local model is appended. In some cases, only the most recent local model of each node may be kept.
In conventional blockchain systems, the entire blockchain is traversed to find the most up-to-date models. Accordingly, when an update to a local model is sent to the blockchain, to reduce the size of the blockchain, the entire blockchain is traversed to find the previous iteration of the local model. By contrast, in some embodiments described herein, the entire blockchain does not need to be traversed because each block used to store a newly trained local model also includes a pointer to the previous version of that local model.
In particular, each block may include a pointer to the last block that relates to the same node. Accordingly, when a local model is updated in response to a yellow event and the model is transmitted to the blockchain and accepted by mining nodes, the block includes a pointer to the last version of the local model. Thus, when the size of the blockchain is reduced, for example, to reduce memory requirements and storage space, the system may traverse the blocks starting from the last block of the blockchain and retrieve previous versions of local models, which can be discarded. This process additionally reduces the time needed to reduce the size of the blockchain.
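The back-pointer scheme described above may be sketched as follows, with blocks modelled as dictionaries and a `prev` field holding the index of the prior block for the same node; all structures and field names are illustrative:

```python
def append_model(chain, node_id, model):
    """Append a newly trained local model; the new block records a pointer
    to the index of the previous block for the same node (None if first)."""
    prev = None
    for i in range(len(chain) - 1, -1, -1):  # scan backwards for this node's last block
        if chain[i]["node_id"] == node_id:
            prev = i
            break
    chain.append({"node_id": node_id, "model": model, "prev": prev})

def prune(chain):
    """Discard superseded model versions by walking back-pointers from the
    most recent block of each node, without traversing the whole chain
    node by node."""
    latest = {}  # node_id -> index of that node's most recent block
    for i in range(len(chain) - 1, -1, -1):
        latest.setdefault(chain[i]["node_id"], i)
    stale = set()
    for i in latest.values():
        j = chain[i]["prev"]
        while j is not None:  # every pointed-to block is an outdated version
            stale.add(j)
            j = chain[j]["prev"]
    return [b for i, b in enumerate(chain) if i not in stale]
```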
Reference is now made to
The global model 436 can return a result 438, which may be a prediction. For example, the global model 436 may identify the event detected. In the example shown, the global model 436 may identify that the image 430 corresponds to an image of “Person 156”. The identifier “Person 156” may correspond to an identifier given to a person that is recognized by the global model 436. In determining that the person pictured in the image 430 is associated with identifier “Person 156”, the global model 436 may return a list of all persons known by the system 400 and an associated confidence score. Each event known by the global model 436 may be associated with a separate identifier. Generally, the global model 436 may return a list of all events known by the global model 436, and a confidence score that the signal/information 430 received or generated by device 432 is associated with an identifier corresponding to an event.
The local interpretation module 442 may receive and interpret the output of the global model 436. In at least one embodiment, the local interpretation module 442 may label the output received from the global model 436. In the example shown, the identifier “Person 156” corresponds to a person known by node N1 410-1, with the label “grandma”. The label associated with each identifier of the global model 436 may vary, depending on the local interpretation module. Accordingly, “Person 156” may be labelled “grandma” by the local interpretation module 442 of node N1 410-1 but may be associated with a different label by the local interpretation module of node N2 410-2. Each local interpretation module may associate a subset of identifiers contained in the global model 436 with labels. For example, each local interpretation module may include a matrix associating global model identifiers with local interpretation module labels. The local interpretation module may also determine the appropriate action to be taken. In the example shown, “grandma” is associated with no action. In other cases, “grandma” may be associated with a notification transmitted to the user device 412, and the system 400 may transmit a notification that grandma has been seen by the camera 432.
In the example shown, as the event is determined to be a green event, no updates are made to the global model. Accordingly, no blocks are added to the blockchain containing blocks 444-3.1, 444-2.1, 444-2.2, 444-3.2, 444-1.1, 444-2.3, 444-1.
In at least one embodiment, the local interpretation module 442 includes a matrix associating labels with actions.
In at least one embodiment, the local interpretation module 442 interprets the result 438 output by the global model 436 directly and the global model identifier may be associated with actions.
Reference is now made to
If the global model 536 is unable to make a prediction with sufficient confidence, a yellow event is recorded, as described above with reference to
In at least one embodiment, when a yellow event is recorded, the local model 534-1 of the node may be trained taking into account the local signal/information 530 that led to the yellow event being recorded, as shown by box 2. In such embodiments, the local model 534-1 is incrementally trained: when a yellow event is encountered, the local model 534-1 is updated to include information relating to the yellow event. Alternatively, when a yellow event is recorded, the local model 534-1 of the node may be trained using all of the data associated with the node that encountered the yellow event. For example, in some cases, in between yellow events, that is, in between system events that cause the local model 534-1 to be trained, the node may receive new information about green or red events. When the local model 534-1 is subsequently retrained, the local model 534-1 may be trained using the local signal/information 530 that led to the yellow event being recorded and using any additional information that may have been received since the local model 534-1 was last trained. Training the local model 534-1 of the node 510-1 which encountered the yellow event can allow the local model 534-1 and the global model 536 to derive data about the event such that, if the event is subsequently re-encountered, a prediction about the event may be made by the global model 536. The local model may be trained as a multiclass classifier using backpropagation on a feed-forward network with stochastic gradient descent. Training may be performed over a number of epochs until testing accuracy reaches an acceptable error rate.
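As a toy illustration of such training, a minimal single-layer multiclass classifier trained with stochastic gradient descent is sketched below; a practical system would use a deeper feed-forward network and a proper training framework, and all names and hyperparameters here are assumptions:

```python
import math

def train_local(samples, labels, n_classes, n_features, epochs=200, lr=0.5):
    """Toy multiclass (softmax) classifier trained by SGD over several epochs.

    `samples` stand in for feature vectors extracted from yellow-event
    sensor data; `labels` are the event classes to learn.
    """
    W = [[0.0] * n_features for _ in range(n_classes)]
    b = [0.0] * n_classes
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            # Forward pass: class scores, then numerically stable softmax.
            logits = [sum(w * xi for w, xi in zip(W[c], x)) + b[c]
                      for c in range(n_classes)]
            m = max(logits)
            exps = [math.exp(z - m) for z in logits]
            s = sum(exps)
            probs = [e / s for e in exps]
            # Backward pass: cross-entropy gradient, one SGD step per sample.
            for c in range(n_classes):
                grad = probs[c] - (1.0 if c == y else 0.0)
                for i in range(n_features):
                    W[c][i] -= lr * grad * x[i]
                b[c] -= lr * grad
    return W, b

def predict(W, b, x):
    """Return the class with the highest score for feature vector x."""
    logits = [sum(w * xi for w, xi in zip(Wc, x)) + bc for Wc, bc in zip(W, b)]
    return max(range(len(logits)), key=lambda c: logits[c])
```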
Once the local node has trained the local model 534, the global model 536 may be updated. As described previously with reference to
The block 542-1.3 may be submitted to mining nodes for approval. In some cases, anomaly detection may be performed on the block 542-1.3. A block may be anomalous if it may be detrimental to the effectiveness of the global model 536. In some cases, to perform anomaly detection, mining nodes may compute the error rate of the new global model that would be generated if the block 542-1.3 were appended to the blockchain. Mining nodes may be local nodes that have elected to act as miners. For example, mining nodes may be local nodes with large (or sufficient) computational resources that may be capable of performing anomaly detection faster and/or more accurately than the local node which encountered the event. By using a blockchain with mining nodes, updates to the global model may be approved before they are accepted, potentially increasing the accuracy and reliability of the system 500. Further, the use of mining nodes can allow anomaly detection to be performed by a select number of nodes, rather than by all nodes in the system 500, some of which may have limited computational resources, thereby decreasing computational time and resource utilization.
For example, the mining nodes may precompute the new global model, determine its error rate using local data from the mining node or local data associated with a network of devices to which the mining node belongs, and determine the current error rate using the current model. In some other examples, the mining nodes may also use data from public sources, for example, data from the initialization data set. In some embodiments, the calculated error rates may then be compared. If the difference in error rate is within a predefined acceptable threshold, the mining node may transmit a proof-of-stake (PoS) message indicating that the new block is acceptable. The mining node may also transmit metadata relating to the node, such as the number of events previously encountered by the node, the number of yellow events previously encountered by the node, the age of the node, or any other metric that may serve as a measure of trustworthiness of the node, including a trustworthiness score assigned to the node by an evaluator.
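The error-rate comparison performed by a mining node may be sketched as follows, under the assumption that the candidate and current error rates have already been measured on the miner's local data; the tolerance value is illustrative:

```python
def block_is_acceptable(candidate_error, current_error, tolerance=0.02):
    """A mining node's anomaly check: accept the candidate block if the
    error rate of the would-be global model does not degrade beyond a
    predefined tolerance relative to the current global model."""
    return (candidate_error - current_error) <= tolerance
```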
In at least one embodiment, all PoS responses submitted within a predefined time window are considered, and the block 542-1.3 is accepted or rejected based on the responses received. For example, a response may be chosen at random, with the outcome weighted by the number of “accept” and “do not accept” responses. Alternatively, each mining node may be assigned a weight, based on a measure of trustworthiness of the node, and a weighted average may be computed to determine whether the block 542-1.3 should be accepted or rejected.
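The weighted-average voting alternative may be sketched as follows, with the weights standing in for per-miner trustworthiness measures and the one-half acceptance threshold assumed for illustration:

```python
def decide_block(responses, weights):
    """Weighted acceptance decision over PoS responses received within the
    voting window. `responses` are booleans (accept / do not accept) and
    `weights` reflect each mining node's trustworthiness; the block is
    accepted when the weighted accept fraction exceeds one half."""
    total = sum(weights)
    accept_mass = sum(w for r, w in zip(responses, weights) if r)
    return accept_mass / total > 0.5
```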
In some cases, one or more mining nodes may be rewarded using a cryptocurrency (or other form of reward) for performing anomaly detection. For example, the first mining node to report a response may be rewarded. Alternatively, a randomly selected mining node which reported a response within the predefined time window may be rewarded.
Referring now to
At 610 and 612, the camera performs object detection until a face is detected. In the case of a doorbell home security camera, for example, face detection may occur when a person arrives at the user's door. When a face is detected by the camera, a clip of the face may be isolated. For example, an image of the face may be captured, and the method proceeds to 614.
At 614, the image may be preprocessed. The image may be preprocessed by a processor on the camera 602. Alternatively, the image may be transmitted to an external processor for processing, for example, if the camera 602 does not include image processing capabilities. Preprocessing functions can include, but are not limited to, grey scaling, image resizing, removal of lighting effects through application of illumination correction, face alignment, and face frontalization. Any combination of these preprocessing functions and additional preprocessing functions may be performed on the image.
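Two of the preprocessing steps mentioned above, grey scaling and resizing, may be sketched in simplified form as follows; real systems would typically use an image-processing library, and the pixel representation here is an illustrative assumption:

```python
def preprocess(pixels, out_w, out_h):
    """Toy grey scaling (luminance-weighted RGB average) followed by
    nearest-neighbour resizing. `pixels` is a row-major grid of
    (r, g, b) tuples; returns an out_h x out_w grid of grey values."""
    grey = [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in pixels]
    in_h, in_w = len(grey), len(grey[0])
    # Nearest-neighbour sampling of the grey image at the output resolution.
    return [[grey[y * in_h // out_h][x * in_w // out_w]
             for x in range(out_w)]
            for y in range(out_h)]
```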
At 616, the image or preprocessed image is run through the global model. As described above with reference to
At 618, the global model determines if the person pictured in the image is known or unknown. A known person is a person that is recognized by the system, such as a person who has previously interacted with the system. If the global model recognizes the face, the method proceeds to 620. If the model does not recognize the face or does not recognize the face with sufficient confidence, the method proceeds to 634 and the event is categorized as a yellow event.
The global model may return a list of all persons known by the model and an associated confidence level that the facial image fed into the global model belongs to a particular person. Each row in the list may include an identifier given to an image of a person at the end of the first event associated with the person, and a confidence score that the facial image run through the global model belongs to the person associated with the identifier, as shown in box 621. For example, for a given row N and a unique person K first detected at node NY, where NY is any node in the network other than the current node which captured the image, a label P(N, NY, K) may be assigned to this person. Alternatively, each row may include an identifier given to an image of a person by a node, a node identifier, and a confidence score that the facial image run through the global model belongs to the person associated with the identifier. In such cases, there may be more than one row associated with the same person. For example, P(1, 1, 123) and P(2, 3, 234), corresponding to “Row 1, Person 123” at node 1 and “Row 2, Person 234” at node 3, respectively, may correspond to the same person.
At 620, the local node identifies the top matches. To identify the individual captured in the image, a threshold limiter function may be used. A limiter function may be defined as the limited selection of rows based on certain criteria inherent in each row produced by the global model's multi-class sensor event classification (SEC) prediction (confidence level). For example, if a global model produces a list of known SECs paired with a confidence level per SEC, the limiting function may select only the rows in the list with a confidence level above a predetermined threshold, for example 95%. In some embodiments, the limiting function may then select only the first N rows, for example 10 rows, after the list is sorted in descending order based on the confidence level of each SEC. In some embodiments, the threshold limiter function may select the row in the list associated with the highest confidence, or select the row with the highest confidence out of all the rows associated with a percentage confidence higher than a predetermined threshold, for example 90%. All of the rows may correspond to the same person, sensed (i.e., encountered) at different nodes.
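One possible form of the threshold limiter function described above, with the row representation and default threshold assumed for illustration:

```python
def threshold_limiter(rows, min_confidence=0.95, top_n=None):
    """Keep only rows whose confidence exceeds `min_confidence`, ordered
    best-first, optionally truncated to the top N rows. `rows` are
    (identifier, confidence) pairs as produced by the global model's
    multi-class SEC prediction."""
    kept = [r for r in rows if r[1] > min_confidence]
    kept.sort(key=lambda r: r[1], reverse=True)  # highest confidence first
    return kept[:top_n] if top_n is not None else kept
```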
At 622, the event may be recorded in an event log. The record associated with the event can include information including, but not limited to, a specific node ID, a person identifier, a time of day, a gait detection result following analysis and detection of a person's gait (i.e., manner of walking/moving), or an action taken or requested due to the event.
At 624, the local node determines if the match is contained locally. If the match is contained locally, the method optionally proceeds to 628 or proceeds to 630. The match may be contained locally if the person identified using the global model (or one of the possible persons identified by the global model) has previously interacted with the local node NLC and has accordingly been used to train the associated local model of the node. For each match or for the top match, depending on the threshold limiter function used, the node may determine if the match is contained locally.
If the match is not contained locally, the method proceeds to 626. At 626, the node may request event information about the individual that was identified at 620 from the nodes that have previously encountered the individual identified. For example, in some embodiments, the system may identify the node that has the highest level of confidence that the person in the image was a given person Px. In some embodiments, the system may compile information (e.g., location information) relating to each instance of person Px being identified by one or more nodes with a confidence level above a threshold. Such compiled information relating to an individual may be referred to herein as a “heatmap”. The method then proceeds to 628.
At 628, in at least one embodiment, the local node may aggregate information about the person identified from all nodes which have previously encountered person PX. The aggregated information may take the form of a list or a heatmap containing information including, but not limited to, a node identifier NY, a person identifier, a frequency with which person PX has been seen by node NY limited to some previous time frame, and approximate location information (e.g., a zip code, an address). For example, a list can be compiled based on each row reporting the frequency of views of person PX per day, per node NY, over a predetermined time window, for example 60 days, and/or in a predetermined area. In some cases, aggregating information about the person identified can help determine the appropriate response to a sensor event.
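The aggregation into a heatmap might be sketched as below. The report tuple format, the day-offset time field, and the use of a zip code for location are illustrative assumptions; the 60-day window mirrors the example above.

```python
# Hedged sketch: aggregating per-node sighting reports for a person into a
# "heatmap" list, limited to a previous time frame.
from collections import defaultdict

def build_heatmap(reports, window_days=60):
    """Each report: (node_id, person_id, days_ago, zip_code).
    Returns rows of (node_id, person_id, frequency, zip_code), counting only
    sightings within the last `window_days` days."""
    counts = defaultdict(int)
    locations = {}
    for node_id, person_id, days_ago, zip_code in reports:
        if days_ago <= window_days:
            counts[(node_id, person_id)] += 1
            locations[(node_id, person_id)] = zip_code
    return [(n, p, c, locations[(n, p)]) for (n, p), c in counts.items()]
```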
At 630, the local node aggregates all other relevant data. For example, if a network of devices associated with a particular user includes multiple edge devices, the local node may aggregate data received from the other sensors in the network. Data from a porch camera may accordingly be aggregated with data from a backyard camera.
At 632, the local node determines the appropriate action to be taken, based on the result obtained at 630. The local node may apply user defined settings to the collection of data to determine an appropriate action. For example, the local interpretation module of the node may interpret the result of the global model to determine whether the event should be labelled a green or red event. For example, the local interpretation module may include a matrix that associates specific people with green or red events or with specific actions as described previously with reference to
If, at 618, the face is not recognized by the global model, for example, the threshold limiter function returns no results, the method 600 proceeds to 634, corresponding to a yellow event.
The yellow event may be recorded in the event log at 622. The record associated with the event can include information including, but not limited to, a specific node ID, a person identifier, a time of day, a gait detection result, or a placeholder for an action taken or requested.
At 636, the local model of the node is trained. The local node may add the unrecognized face to its local repository of faces. For example, the local node may store the image captured in a local directory. In some cases, for example, the local node may maintain a directory of previously accepted people organized in folders, and store the image captured in a new folder. Subsequent images associated with this individual may be stored in the same folder. The directory may be stored on the camera or may be stored on an external storage device accessible to the camera, for example a network attached storage (NAS). In cases where multiple cameras are associated with one user, it may be advantageous to store the images on an NAS.
In some embodiments, the node may train the local model using multi-class classification with standard back-propagation on feed-forward networks, implementing stochastic gradient descent over a number of epochs until testing accuracy reaches an acceptable error rate.
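As an illustrative sketch of this training step, the following trains a minimal single-layer feed-forward (softmax) classifier with per-sample stochastic gradient descent. The data shapes, learning rate, stopping criterion, and the use of training accuracy as a proxy for testing accuracy are all assumptions for illustration.

```python
# Minimal sketch of the local training step: multi-class classification
# trained by stochastic gradient descent with back-propagation.
import numpy as np

def train_local_model(X, y, n_classes, lr=0.1, max_epochs=200, target_acc=0.95):
    rng = np.random.default_rng(0)
    W = np.zeros((X.shape[1], n_classes))
    for _ in range(max_epochs):
        for i in rng.permutation(len(X)):           # stochastic: one sample at a time
            logits = X[i] @ W
            p = np.exp(logits - logits.max())
            p /= p.sum()                            # softmax probabilities
            p[y[i]] -= 1.0                          # gradient of cross-entropy loss
            W -= lr * np.outer(X[i], p)             # back-propagation update
        acc = ((X @ W).argmax(axis=1) == y).mean()  # accuracy as stopping proxy
        if acc >= target_acc:                       # acceptable error rate reached
            break
    return W
```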
At 638, when the local node has trained the local model, the trained local model is placed into a blockchain block and transmitted to all participating mining nodes NYM.
At 640, when the mining nodes NYM receive the block submitted by the local node, each of the mining nodes performs anomaly detection to verify that the block does not contain a model that is detrimental to the effectiveness of the global model. For example, the mining nodes may precompute the new global model that would be generated if the local node block is appended to the blockchain and compare the error rate associated with the people in the node's own directory using the new global model with the error rate associated with the current global model. If the difference in error rate is within a predetermined acceptable threshold, the mining node may indicate that the new block is acceptable. If the mining node determines that the block may contain a model that is detrimental to the effectiveness of the global model, the mining node may indicate that the model is not acceptable. The mining nodes may transmit a PoS response that includes an “accept” or a “do not accept” message, and metadata associated with the mining node. For example, a number of unique persons in the directory associated with the mining node may be included in the PoS response. The number of unique persons in the directory associated with the mining node may be an indication of the trustworthiness of the node.
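The mining node's vote might be sketched as follows. The model evaluation itself is elided: the error rates of the current and precomputed new global models on the node's own directory are assumed to have been computed already, and the function name and 2% tolerance are illustrative assumptions.

```python
# Sketch of a mining node's PoS response: accept the block only if the
# precomputed new global model does not degrade accuracy beyond a tolerance.

def pos_vote(current_err, new_err, tolerance=0.02, n_unique_persons=0):
    """Compare error rates on the mining node's own directory and return a
    PoS response with mining-node metadata."""
    accept = (new_err - current_err) <= tolerance
    return {
        "vote": "accept" if accept else "do not accept",
        # Metadata: directory size as a proxy for node trustworthiness.
        "unique_persons": n_unique_persons,
    }
```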
At 642, the mining nodes determine if the block is to be appended. Responses that are submitted by mining nodes within an acceptable amount of time, for example, a predetermined amount of time, are aggregated by the mining nodes. A response may be pseudo-randomly chosen by way of a weighted vote based on the number of accepted/not accepted responses. For example, all responses received before a cut-off time may be summed, and a chance of acceptance may be calculated based on the number of nodes that accepted the new block and the total number of responses received. A random number may then be generated, such as between 0 and 1 inclusively, and if the random number is smaller than or equal to the acceptance rate, the block may be accepted. For example, for a 75% acceptance rate, a random number smaller than or equal to 0.75 would result in the new block being appended. As will be appreciated by the skilled reader, other methods for determining whether a block will be appended are also possible.
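The weighted-vote acceptance described above can be sketched as follows; the function signature and the injectable random source are illustrative assumptions.

```python
# Sketch of the pseudo-random block acceptance: the block is appended with
# probability equal to the fraction of accepting responses.
import random

def block_accepted(responses, rng=None):
    """`responses` is the list of 'accept' / 'do not accept' strings received
    before the cut-off time."""
    rng = rng or random.Random()
    acceptance_rate = responses.count("accept") / len(responses)
    # E.g., with a 75% acceptance rate, draws <= 0.75 append the block.
    return rng.random() <= acceptance_rate
```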
If the block is accepted, the method 600 proceeds to 644 and the block is appended to the blockchain. If the block is not accepted, the method 600 proceeds to 646. If the block is accepted, the local node proceeds to 630 (described above), and all other nodes proceed to 648. At 644, the mining nodes append the block to the blockchain and notify all other nodes in the network of the change. At 648, all nodes in the network other than the node responsible for the system event receive the new block.
By appending the new block, the global model is updated. The new global model may be expressed as a weighted sum of models using the following equation:
MG = ΣΦ(MX)
where Φ(MX) = α × ML, α is a fraction representative of the trustworthiness of the node, and ML is the local model. The measure of trustworthiness may be based on the number of unique persons in repository 331 associated with the node or may be based on the number of times a person from the node is identified when compared to other nodes.
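The weighted sum MG = ΣΦ(MX) might be sketched as below. Representing each local model as a flat list of parameters, and deriving each node's α by normalizing its unique-person count across all nodes, are illustrative assumptions.

```python
# Sketch of the model aggregation function: the global model as a
# trustworthiness-weighted sum of local models.

def aggregate_global_model(local_models, unique_counts):
    """local_models: one flat parameter list per node.
    unique_counts: per-node unique-person counts used to derive each alpha."""
    total = sum(unique_counts)
    alphas = [c / total for c in unique_counts]   # trustworthiness fractions
    n_params = len(local_models[0])
    # MG = sum over nodes of alpha * ML, parameter-wise.
    return [sum(a * m[j] for a, m in zip(alphas, local_models))
            for j in range(n_params)]
```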
At 650, each of the nodes in the network may replace the previous model associated with the local node in memory with the new model.
At 652, each node then runs a model aggregation function to update the global model.
If the block is not accepted, the method proceeds to 654. At 654, the local node NLC receives a message that the block was rejected by the miners.
At 656, data associated with the yellow event is discarded. Discarding information relating to an event that will lead to an anomalous result and that is detrimental to the effectiveness of the global model may save resources, as only information that is useful is retained in the global model.
Referring now to
At 710 and 712, the networking device 702 performs traffic/packet detection and/or inspection (or traffic monitoring) while packet transmission is occurring. In the case of a router managing the transmission of Internet data, traffic detection may occur when, for example, a download begins. When a download is detected by the networking device, a packet containing information about the download may be isolated. As used herein, a “packet” may refer to a single data packet or a collection of data pertaining to a particular function (e.g., an HTTP request) or data structure (e.g., a web page, a download, a song, a video). For example, a source web page may be captured and the method proceeds to 714.
At 714, the packet may be processed to detect a packet type. The packet may be processed by the router software of the networking device 702. Alternatively, the packet may be transmitted to an external processor for processing, for example, if the networking device 702 does not include router software. Processing functions can include, but are not limited to, extracting packet features, such as website data, metadata, and multicast identifiers. Any combination of these processing functions and additional preprocessing functions may be performed on the packet.
At 716, the packet or processed packet is run through the global model. As described above with reference to
At 718, the global model determines whether one or more features of the packet (e.g., source and destination addresses, video content, encrypted content) are known or unknown. In some embodiments, traffic patterns of multiple packets may also, or instead, be used. A packet feature may be any information relating to the structure or content of a network packet which can be extracted and analyzed. Examples of a packet feature include, but are not limited to, source address, destination addresses, type of service, total length, protocol, checksum, data/payload, or any combinations thereof.
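The packet feature examples listed above might be extracted as sketched below. The dict-based packet representation and field names are assumptions for illustration; a real implementation would parse raw headers.

```python
# Hedged sketch of packet feature extraction, mirroring the feature examples
# above (source/destination address, type of service, length, protocol,
# checksum, payload).

def extract_packet_features(packet):
    """Pull the analyzable fields out of a parsed packet dict."""
    return {
        "src": packet.get("src"),            # source address
        "dst": packet.get("dst"),            # destination address
        "tos": packet.get("tos"),            # type of service
        "length": packet.get("length"),      # total length
        "protocol": packet.get("protocol"),
        "checksum": packet.get("checksum"),
        "payload_size": len(packet.get("payload", b"")),
    }
```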
A known packet feature is a packet feature that is recognized by the system, such as a packet feature that has previously interacted with the system. If the global model recognizes the packet feature(s), the method proceeds to 720. If the model does not recognize the packet feature(s) or does not recognize the packet feature(s) with sufficient confidence, the method proceeds to 734 and the event is categorized as a yellow event.
The global model may return a list of all packet feature types known by the model and an associated confidence level that the packet features fed into the global model belong to a particular packet feature type(s). Each row in the list may include an identifier given to a packet feature type at the end of the first event associated with the packet feature(s) and a confidence score that the packet feature(s) run through the global model belongs to the packet feature type(s) associated with the identifier as shown in box 721. For example, for a given row N, a unique packet feature K first detected at node NY, where NY is any node in the network other than the current node which captured the packet, a label PN,NY,K may be assigned to this packet feature. Alternatively, each row may include an identifier given to a packet feature type, a node identifier, and a confidence score that the packet feature type run through the global model belongs to the packet feature type associated with the identifier. In such cases, there may be more than one row associated with the same packet feature type.
At 720, the local node identifies the top matches. To identify the packet feature type of the packet feature, a threshold limiter function may be used. The threshold limiter function may, for example, select the row in the list associated with the highest confidence, select the row with the highest confidence out of all the rows associated with a percentage confidence higher than a predetermined threshold, for example 90%, or choose all rows associated with confidence levels above a predetermined threshold, for example 95%. All of the rows may correspond to the same packet feature type, encountered at different nodes.
At 722, the event may be recorded in an event log. The record associated with the event can include information including, but not limited to, a specific node ID, a packet identifier, a time of day, or an action taken or requested due to the event.
At 724, the local node determines if the match is contained locally. If the match is contained locally, the method optionally proceeds to 728 or proceeds to 730. The match may be contained locally if the packet feature identified using the global model or the possible packet features identified by the global model have previously been used to train the local model of local node NLC and are accordingly included in the repository of the local node. For each match or for the top match, depending on the threshold limiter function used, the node may determine if the match is contained locally.
If the match is not contained locally, the method proceeds to 726. At 726, the node may request event information about each of the packet features identified at 720 from the nodes that have previously encountered the packet features identified. For example, the system may identify the node that has the highest level of confidence that the packet feature was a given packet feature type Px and identify each instance of the packet feature being identified. In at least one implementation, for each packet feature that makes it through step 720, the system retrieves that packet feature's information either locally or by requesting the information from the other nodes. The method then proceeds to 728.
At 728, in at least one embodiment, the local node may aggregate information about the packet feature identified from all nodes which have previously encountered packet feature type Px. For example, the system may gather event logs associated with the packet feature type identified from participating nodes in the network. The event logs may be aggregated and/or summarized and used at 732 to determine an action to be taken.
At 730, the local node aggregates all relevant data. For example, if a network of devices associated with a particular user includes multiple edge devices, the local node may aggregate data received from the other sensors in the network.
At 732, the local node determines the appropriate action to be taken, based on the result obtained at 730. The local node may apply user defined settings to the collection of data to determine an appropriate action. For example, the local interpretation module of the node may interpret the result of the global model to determine whether the event should be labelled a green or red event. For example, the local interpretation module may include a matrix that associates specific packet feature types with green or red events or with specific actions as described previously with reference to
If, at 718, the packet feature is not recognized by the global model, for example, the threshold limiter function returns no results, the method 700 proceeds to 734, corresponding to a yellow event.
The yellow event may be recorded in the event log at 722. The record associated with the event can include information including, but not limited to, a specific node ID, a packet feature identifier, a time of day, or a placeholder for an action taken or requested.
At 736, the local model of the node is trained. The local node may add the unrecognized packet feature to its local repository of packet feature types. For example, the local node may store the packet feature captured in a local directory. In some cases, for example, the local node may maintain a directory of previously accepted packet feature types organized in folders and store the packet feature captured in a new folder. Subsequent packet features associated with this packet feature type may be stored in the same folder. The directory may be stored on the network device or may be stored on an external storage device accessible to the network device, for example a network attached storage (NAS). In cases where multiple network devices are associated with one user, it may be advantageous to store the packets on an NAS. In at least one embodiment, data that contains no identifiable information may also be stored in a repository accessible by all nodes in the system.
The node may train the local model using multi-class classification with standard back-propagation on feed-forward networks, implementing stochastic gradient descent over a number of epochs until testing accuracy reaches an acceptable error rate.
At 738, when the local node has trained the local model, the trained local model is placed into a blockchain block and transmitted to all participating mining nodes NYM.
At 740, when the mining nodes NYM receive the block submitted by the local node, each of the mining nodes performs anomaly detection to verify that the block does not contain a model that is detrimental to the effectiveness of the global model. For example, the mining nodes may precompute the new global model that would be generated if the local node block is appended to the blockchain and compare the error rate associated with the packet feature types in the node's own directory using the new global model with the error rate associated with the current global model. If the difference in error rate is within a predetermined acceptable threshold, the mining node may indicate that the new block is acceptable. If the mining node determines that the block may contain a model that is detrimental to the effectiveness of the global model, the mining node may indicate that the model is not acceptable. The mining nodes may transmit a PoS response that includes an “accept” or a “do not accept” message, and metadata associated with the mining node. For example, a number of unique packet feature types in the directory associated with the mining node may be included in the PoS response. The number of unique packet feature types in the directory associated with the mining node may be an indication of the trustworthiness of the node.
At 742, the mining nodes determine if the block is to be appended. Responses that are submitted by mining nodes within an acceptable amount of time, for example, a predetermined amount of time, are aggregated by the mining nodes. A response may be pseudo-randomly chosen by way of a weighted vote based on the number of accepted/not accepted responses. For example, all responses received before a cut-off time may be summed, and a chance of acceptance may be calculated based on the number of nodes that accepted the new block and the total number of responses received. A random number may then be generated, such as between 0 and 1 inclusively, and if the random number is smaller than or equal to the acceptance rate, the block may be accepted.
If the block is accepted, the method 700 proceeds to 744 and the block is appended to the blockchain. If the block is not accepted, the method 700 proceeds to 746. If the block is accepted, the local node proceeds to 730 described above, and all other nodes proceed to 748. At 744, the mining nodes append the block to the blockchain and notify all other nodes in the network of the change. At 748, all nodes in the network other than the node responsible for the system event receive the new block.
By appending this new block, the global model is updated. The new global model may be expressed as a weighted sum of models using the following equation:
MG = ΣΦ(MX)
where Φ(MX) = α × ML, α is a fraction representative of the trustworthiness of the node, and ML is the local model. The measure of trustworthiness may be based on the number of unique packet feature types in the repository associated with the node or may be based on the number of times a packet feature type from the node is identified when compared to other nodes.
At 750, each of the nodes in the network may replace the previous model associated with the local node in memory with the new model.
At 752, each node then runs a model aggregation function to update the global model.
If the block is not accepted, the method proceeds to 754. At 754, the local node NLC receives a message that the block was rejected by the miners.
At 756, data associated with the yellow event is discarded. Discarding information relating to an event that will lead to an anomalous result and that is detrimental to the effectiveness of the global model may save resources, as only information that is useful is retained in the global model.
Referring now to
At 810 and 812, the sensors independently perform anomaly detection, according to each sensor's specifications until an anomaly is detected. For example, in the case of the motion sensor 802, an anomaly may be detected when movement is detected in the vicinity of the sensor. In the case of the magnetic sensor 806, an anomaly may be detected when the magnet is separated from the sensor, corresponding to the window or door on which the magnetic sensor 806 is attached being opened. In the case of the smart speaker (microphone) 804, an anomaly may be detected when a loud noise is recorded, when a voice is detected, or when an unusual sound pattern is detected. When an anomaly is detected, the portion of the sensor feed that includes the anomaly may be isolated. For example, the sound clip recorded by the smart speaker (microphone) 804 may be isolated.
In at least one embodiment, the sensors may perform anomaly detection until an anomaly is detected. In some cases, the sensor may perform anomaly detection until an anomaly is detected by at least two sensors, for example, until at least two of the motion sensor 802, the smart speaker (microphone) 804, and the magnetic sensor 806 detect an anomaly. The number of sensors detecting an anomaly required to trigger security threat detection and reaction may vary depending on the type of sensors used and the location of the sensors. For example, the detection of an anomaly by two sensors in close proximity may trigger a security threat detection and reaction sequence, while the detection of an anomaly by two sensors located at a distance may not trigger security threat detection and reaction. The detection of an anomaly by at least two sensors can reduce the detection of events that do not pose a security threat. For example, movement in the vicinity of a motion sensor 802 placed on the front door of a house may not be recorded as an anomaly by the system if no anomaly is detected by the smart speaker (microphone) 804 and the magnetic sensor 806, as it may correspond to an innocuous event, for example, a mail carrier delivering mail or a small animal passing by the motion sensor 802. As described above, in some cases, a single sensor detecting an anomaly may be sufficient for the sensor event to be analyzed, for example, a single window sensor on a skylight window may be sufficient to detect a breach of security. Alternatively, an anomaly may be recorded only if a specific pattern is detected by one or more sensors. For example, the motion sensor 802 may be capable of detecting the presence of a human as opposed to an animal or meteorological events and may detect an anomaly when a human is detected in the vicinity of the motion sensor 802.
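The proximity-gated, two-sensor trigger rule described above might be sketched as follows. The planar sensor coordinates and the 10-unit distance threshold are illustrative assumptions.

```python
# Sketch of the multi-sensor trigger rule: react only when at least
# `min_sensors` sensors report an anomaly and some pair of reporting
# sensors is in close proximity.
import math

def should_trigger(anomalies, min_sensors=2, max_distance=10.0):
    """anomalies: list of (sensor_id, x, y) for sensors reporting an anomaly."""
    if len(anomalies) < min_sensors:
        return False
    for i, (_, x1, y1) in enumerate(anomalies):
        for _, x2, y2 in anomalies[i + 1:]:
            # Two reporting sensors close together suggest a real event.
            if math.dist((x1, y1), (x2, y2)) <= max_distance:
                return True
    return False
```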
In at least one embodiment, an anomaly may be recorded each time a sensor detects a change in its environment and the determination of whether the anomaly corresponds to a real-world anomaly is determined by the global model. For example, the smart speaker (microphone) 804 can record an anomaly every time sound is detected and the global model can process the sound clip to determine whether the sound clip corresponds to a real-world anomalous event.
At 814, the anomalous feed may be preprocessed. The feed of each sensor may be processed by a processor on the sensor. Alternatively, the anomalous feeds may be transmitted to an external processor for processing, for example, to combine the feeds from various sensors. Preprocessing functions can include normalizing the data from each sensor such that the processed data is of a format or type that is compatible with the global model and combining data.
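The normalize-and-combine preprocessing might be sketched as below. Scaling each feed to [0, 1] from a known per-sensor range, and concatenating feeds in sensor-id order, are illustrative assumptions about the format the global model expects.

```python
# Sketch of the preprocessing step: normalize each sensor's feed to a common
# range, then combine the feeds into one vector for the global model.

def normalize_feed(samples, lo, hi):
    """Scale raw sensor samples from [lo, hi] into [0, 1]."""
    return [(s - lo) / (hi - lo) for s in samples]

def combine_feeds(feeds):
    """feeds: dict of sensor_id -> (samples, lo, hi). Returns one flat,
    normalized vector ordered by sensor id."""
    combined = []
    for sensor_id in sorted(feeds):
        samples, lo, hi = feeds[sensor_id]
        combined.extend(normalize_feed(samples, lo, hi))
    return combined
```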
At 816, the anomaly feed or the preprocessed anomaly feed is run through the global model. As described above with reference to
At 818, the global model determines if the threat level of the anomaly feed is known or unknown. A known threat level is a threat level that can be identified by the system, such as the threat level associated with a known event. If the global model can determine the threat level, the method proceeds to 820. If the model does not recognize the threat, the method proceeds to 834 and the event is categorized as a yellow event. Alternatively, the global model determines if the pattern in the anomalous feed is known or unknown, corresponding to a known or unknown IoT event. A known IoT event or anomaly pattern is an event or pattern that can be identified by the system, such as an event or a pattern that has been previously encountered by the system.
The global model may return a list of all IoT events known by the model and an associated confidence level that the anomaly feed fed into the global model corresponds to a particular event. Each row in the list may include an identifier given to an anomalous event at the end of the first event associated with the anomalous event and a confidence score that the anomaly feed fed through the global model is associated with the event associated with the identifier as shown in box 821. For example, for a given row N, a unique event K first detected at node NY, where NY is any node in the network other than the current node which captured the anomalous feed, a label EN,NY,K may be assigned to this event. Alternatively, each row may include an identifier given to an IoT event by a node, a node identifier, and a confidence score that the IoT event run through the global model corresponds to the event associated with the identifier. In such cases, there may be more than one row associated with the event. For example, the motion sensor 802, the smart speaker (microphone) 804, and the magnetic sensor 806 detecting a specific anomaly pattern may correspond to a break-in, having a specific threat level. Alternatively, the detection of an anomaly by the motion sensor 802, the smart speaker (microphone) 804, and the magnetic sensor 806 may be associated with a specific threat level without being associated with a particular event. For example, the specific combination of a particular group of sensors detecting an anomaly may be associated with a threat level.
At 820, the local node identifies the top matches. To identify the threat level, a threshold limiter function may be used. The threshold limiter function may, for example, select the row in the list associated with the highest confidence, select the row with the highest confidence out of all the rows associated with a percentage confidence higher than a predetermined threshold, for example 90%, or choose all rows associated with confidence levels above a predetermined threshold, for example 95%.
At 822, the event may be recorded in an event log. The record associated with the event can include information including, but not limited to, a specific node ID, an event identifier, a time of day, the sensors which detected the anomaly, or an action taken or requested due to the event.
At 824, the local node determines if the match is contained locally. If the match is contained locally, the method optionally proceeds to 828 and otherwise proceeds to 830. The match may be contained locally if the threat level or the event identified using the global model has previously occurred at node NLC and is accordingly included in the local model of the node. For each match or for the top match, depending on the threshold limiter function used, the node may determine if the match is contained locally.
If the match is not contained locally, the method proceeds to 826. At 826, the node may request event information about each of the threat levels or events identified at 820 from the nodes that have previously encountered the event or threat level. For example, the system may identify the node that has the highest level of confidence that the event detected in the anomaly feed was a given event Ex and identify each instance of the event being identified.
At 828, the local node may aggregate information about the event identified from all nodes which have previously encountered event Ex. For example, the system may gather event logs associated with the event identified from participating nodes in the network. The event logs may be aggregated and/or summarized and used at 832 to determine an action to be taken.
At 830, the local node aggregates all relevant data. For example, in cases where the data from each IoT sensor is processed by a different global model, the local node may aggregate data received from other sensors.
At 832, the local node determines the appropriate action to be taken, based on the result obtained at 830. The local node may apply user defined settings to the collection of data to determine an appropriate action. For example, the local interpretation module of the node may interpret the result of the global model to determine whether the event should be labelled a green or red event. For example, the local interpretation module may include a matrix that associates specific threat levels or specific events with green or red events or with specific actions as described previously with reference to
If at 818, the event or the threat level is not identified by the global model, for example, the threshold limit function returns no results, the method 800 proceeds to 834, corresponding to a yellow event.
The yellow event may be recorded in the event log at 822. The record associated with the event can include information including, but not limited to, a specific node ID, an event identifier, a time of day, the sensors which detected the anomaly, or a placeholder for an action taken or requested due to the event.
At 836, the local model of the node is trained. The local node may add the unrecognized event or threat level to a local repository. For example, the local node may store the anomaly feed in a local directory. In some cases, for example, the local node may maintain a directory of previously accepted events organized in folders and store the anomaly feed captured in a new folder. In other cases, the directory may contain data specific to each type of sensor. For example, the smart speaker (microphone) 804 may be associated with a repository of audio clips. Subsequent anomalous events associated with this event may be stored in the same folder. The directory may be stored on each of the sensors or may be stored on an external storage device accessible to the sensors, for example a network attached storage (NAS). In cases where multiple sensors are associated with one user, it may be advantageous to store the anomaly feeds on an NAS. In at least one embodiment, data that contains no identifiable information may also be stored in a repository accessible by all nodes in the system.
The node may train the local model for multi-class classification using standard backpropagation on a feed-forward network, implementing stochastic gradient descent over a number of epochs until the testing error falls to an acceptable rate.
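The training loop above can be sketched as follows. This is an illustrative, minimal implementation on a one-hidden-layer network; the architecture and hyperparameters (hidden size, learning rate, error target) are assumptions, not values taken from this disclosure.

```python
import numpy as np

rng = np.random.default_rng(0)

def train_local_model(X, y, n_classes, hidden=16, lr=0.1,
                      max_epochs=500, target_error=0.05):
    """Multi-class classification via standard backpropagation on a
    feed-forward network, using stochastic gradient descent over epochs
    until the error reaches target_error (or max_epochs is exhausted)."""
    n, d = X.shape
    W1 = rng.normal(0, 0.5, (d, hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0, 0.5, (hidden, n_classes)); b2 = np.zeros(n_classes)
    Y = np.eye(n_classes)[y]                       # one-hot targets
    for _ in range(max_epochs):
        for i in rng.permutation(n):               # stochastic gradient descent
            h = np.tanh(X[i] @ W1 + b1)            # hidden activations
            z = h @ W2 + b2
            p = np.exp(z - z.max()); p /= p.sum()  # softmax probabilities
            dz = p - Y[i]                          # output-layer gradient
            dh = (1 - h ** 2) * (W2 @ dz)          # backpropagated gradient
            W2 -= lr * np.outer(h, dz); b2 -= lr * dz
            W1 -= lr * np.outer(X[i], dh); b1 -= lr * dh
        pred = np.argmax(np.tanh(X @ W1 + b1) @ W2 + b2, axis=1)
        if np.mean(pred != y) <= target_error:     # acceptable error reached
            break
    return W1, b1, W2, b2

# Tiny synthetic dataset standing in for locally stored anomaly feeds.
X = rng.normal(size=(40, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
W1, b1, W2, b2 = train_local_model(X, y, n_classes=2)
```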
At 838, when the local node has trained the local model, the trained local model is placed into a blockchain block and transmitted to all participating mining nodes NYM.
At 840, when the mining nodes NYM receive the block submitted by the local node, each of the mining nodes performs anomaly detection to verify that the block does not contain a model that is detrimental to the effectiveness of the global model. For example, the mining nodes may precompute the new global model that would be generated if the local node block were appended to the blockchain, compute the error rate for the events in the node's own directory using the new global model, and compare it with the error rate associated with the current global model. If the difference in error rate is within a predetermined acceptable threshold, the mining node may indicate that the new block is acceptable. If the mining node determines that the block may contain a model that is detrimental to the effectiveness of the global model, the mining node may indicate that the model is not acceptable. The mining nodes may transmit a PoS response that includes an “accept” or a “do not accept” message, and metadata associated with the mining node. For example, a number of unique events in the directory associated with the mining node may be included in the PoS response. The number of unique events in the directory associated with the mining node may be an indication of the trustworthiness of the node.
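The mining node's check above can be sketched as follows, with the error rates taken as already computed (one from the precomputed new global model, one from the current global model). The function name, threshold value, and response fields are illustrative assumptions.

```python
def pos_response(new_error: float, current_error: float,
                 n_unique_events: int, threshold: float = 0.02) -> dict:
    """Build a mining node's PoS response: 'accept' if the error-rate
    increase from the precomputed new global model stays within the
    predetermined threshold, otherwise 'do not accept'. The unique-event
    count is metadata indicating the node's trustworthiness."""
    verdict = ("accept" if (new_error - current_error) <= threshold
               else "do not accept")
    return {"verdict": verdict, "unique_events": n_unique_events}
```

For instance, a candidate model that raises the mining node's local error rate from 9% to 20% would be rejected, while one that leaves it essentially unchanged would be accepted.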
At 842, the mining nodes determine if the block is to be appended. Responses that are submitted by mining nodes within an acceptable amount of time, for example, a predetermined amount of time, are aggregated by the mining nodes. The outcome may be decided by a weighted random vote based on the number of accept and do-not-accept responses. For example, all responses received before a cut-off time may be summed, and a chance of acceptance may be calculated based on the number of nodes that accepted the new block and the total number of responses received. A random number may then be generated, between 0 and 1 inclusively, and if the random number is smaller than or equal to the acceptance rate, the block may be accepted.
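The weighted random vote above can be sketched as follows; the function name and the treatment of an empty response set are assumptions for illustration.

```python
import random

def decide_block(responses, seed=None):
    """Aggregate PoS responses received before the cut-off and accept the
    block with probability equal to the fraction of 'accept' votes.
    `responses` is a list of 'accept' / 'do not accept' strings."""
    if not responses:
        return False                      # no timely responses: do not append
    acceptance_rate = sum(r == "accept" for r in responses) / len(responses)
    rng = random.Random(seed)
    # Accept when the random draw falls at or below the acceptance rate.
    return rng.random() <= acceptance_rate
```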
If the block is accepted, the method 800 proceeds to 844 and the block is appended to the blockchain. If the block is not accepted, the method 800 proceeds to 846. If the block is accepted, the local node proceeds to 830 described above, and all other nodes proceed to 848. At 844, the mining nodes append the block to the blockchain and notify all other nodes in the network of the change. At 848, all nodes in the network other than the node responsible for the system event receive the new block.
By appending this new block, the global model is updated. The new global model may be expressed as a weighted sum of models using the following equation:
M_G = Σ Φ(M_X)

where Φ(M_X) = α × M_L, where α is a fraction representative of the trustworthiness of the node and M_L is the local model. The measure of trustworthiness may be based on the number of unique IoT events in the repository associated with the node or may be based on the number of times an IoT event from the node is identified when compared to other nodes.
At 850, each of the nodes in the network may replace the previous model associated with the local node in memory with the new model.
At 852, each node then runs a model aggregation function to update the global model.
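The model aggregation function can be sketched as the weighted sum M_G = Σ Φ(M_X) described above, with each model represented as a flat weight vector. The normalization of the trustworthiness fractions so they sum to 1 is an assumption for illustration.

```python
import numpy as np

def aggregate_global_model(local_models, trust_weights):
    """Compute M_G = sum(alpha * M_L) over all local models, where each
    alpha is a fraction representing the trustworthiness of its node
    (normalized here so the fractions sum to 1)."""
    alphas = np.asarray(trust_weights, dtype=float)
    alphas = alphas / alphas.sum()            # trustworthiness fractions
    return sum(a * np.asarray(m, dtype=float)
               for a, m in zip(alphas, local_models))
```

For example, two equally trusted local models [1, 1] and [3, 3] aggregate to the global model [2, 2], while weighting the second model three times as heavily shifts the result toward it.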
If the block is not accepted, the method proceeds to 854. At 854, the local node NLC receives a message that the block was rejected by the miners. At 856, data associated with the yellow event is discarded. Discarding information relating to an event that would lead to an anomalous result and that is detrimental to the effectiveness of the global model may save resources, as only useful information is retained in the global model.
In at least one embodiment, the system as described in
As will readily be understood by the skilled reader, various elements of the embodiments of
One technical advantage realized in at least one of the embodiments described herein is increased speed and decreased lag time, relative to centralized federated learning systems. Centralized federated learning systems may suffer from bottleneck issues, as a single central server is used to coordinate all participating nodes in the network and all participating nodes must send updates to that single central server whenever data is to be sent.
Another significant technical advantage realized in at least one of the embodiments described herein relates to avoiding the need to centrally collect and process confidential information in order to provide users with personalized threat detection and response capabilities. By providing a federated learning threat detection system, it is possible for all similar nodes in the system to use the same global model to arrive at anonymized results. By combining this system with a local interpretation layer however, it is possible for each local node in a system to interpret the anonymized results into highly personalized results, which can then be used to trigger highly personalized actions. Thus, by providing a multi-layered federated learning threat detection and response system, it is possible to optimize for both enhanced privacy and customization.
Another technical advantage realized in at least one of the embodiments described herein is a decrease in computational time and resource utilization. The use of mining nodes can allow anomaly detection to be performed by a select number of nodes, rather than all nodes in the system, which may, in some cases, have limited computational resources, decreasing computational time and resource utilization.
Another technical advantage realized in at least one of the embodiments described herein is a reduction in memory requirements by way of using the blockchain pointers described herein. A dynamic reduction in the size of the blockchain as models are appended to the blockchain allows the size of the blockchain to be constrained.
Another technical advantage realized in at least one of the embodiments described herein is an increase in computational speed. By storing pointers within each block, pointing to the last version of the block, the entire blockchain does not need to be traversed. The blockchain can be read from the end of the blockchain, a block associated with a local model containing a pointer to the previous version of the local model may be read, and the previous version may be accessed and, in some cases, discarded.
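The pointer-based lookup above can be sketched as follows. The block layout (an `id` and a `prev_version` pointer per block) is an illustrative assumption.

```python
def prune_previous_version(chain, blocks_by_id):
    """Read the blockchain from the end: the newest block for a local model
    carries a pointer to the previous version of that model, so the stale
    block can be located and discarded without traversing the whole chain."""
    tail = chain[-1]                         # read from the end of the chain
    prev_id = tail.get("prev_version")
    if prev_id is not None and prev_id in blocks_by_id:
        return blocks_by_id.pop(prev_id)     # discard the superseded model
    return None

# Two versions of one node's local model; b2 points back at b1.
blocks_by_id = {
    "b1": {"id": "b1", "node": "N1", "model": "v1", "prev_version": None},
    "b2": {"id": "b2", "node": "N1", "model": "v2", "prev_version": "b1"},
}
chain = [blocks_by_id["b1"], blocks_by_id["b2"]]
stale = prune_previous_version(chain, blocks_by_id)
```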
While the applicant's teachings described herein are in conjunction with various embodiments for illustrative purposes, it is not intended that the applicant's teachings be limited to such embodiments as the embodiments described herein are intended to be examples. On the contrary, the applicant's teachings described and illustrated herein encompass various alternatives, modifications, and equivalents, without departing from the embodiments described herein, the general scope of which is defined in the appended claims.
This application claims priority from U.S. provisional patent application No. 63/339,724 filed on May 9, 2022, which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind
---|---|---|---
PCT/CA2023/050623 | 5/8/2023 | WO |

Number | Date | Country
---|---|---
63339724 | May 2022 | US