DYNAMIC RESOURCE ALLOCATION METHOD FOR SENSOR-BASED NEURAL NETWORKS USING SHARED CONFIDENCE INTERVALS

Information

  • Patent Application
  • Publication Number
    20230419099
  • Date Filed
    June 28, 2022
  • Date Published
    December 28, 2023
Abstract
A method, computer program, and computer system are provided for resource allocation for sensor-based neural networks. One or more nodes associated with an edge computing environment are identified. Data corresponding to a classification dataset is received from the identified nodes. The dataset includes a reference classification and confidence value data. A node is selected from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data. The selected node is assigned to process the classification dataset.
Description
FIELD

This disclosure relates generally to the field of machine learning, and more particularly to sensor-based neural networks.


BACKGROUND

An Artificial Neural Network (ANN), also referred to simply as a neural network, is a computing system made up of a number of simple, highly interconnected processing elements (nodes), which process information by their dynamic state response to external inputs. ANNs are processing devices (algorithms and/or hardware) that are loosely modeled after the neuronal structure of the mammalian cerebral cortex but on much smaller scales. A large ANN might have hundreds or thousands of processor units, whereas a mammalian brain has billions of neurons with a corresponding increase in magnitude of their overall interaction and emergent behavior.


A Deep Learning Neural Network, referred to herein as a Deep Neural Network (DNN) is an artificial neural network (ANN) with multiple hidden layers of units between the input and output layers. Similar to shallow ANNs, DNNs can model complex non-linear relationships. DNN architectures, e.g., for object detection and parsing, generate compositional models where the object is expressed as a layered composition of image primitives. The extra layers enable composition of features from lower layers, giving the potential of modeling complex data with fewer units than a similarly performing shallow network. DNNs are typically designed as feedforward networks.


SUMMARY

Embodiments relate to a method, system, and computer readable medium for resource allocation for sensor-based neural networks. According to one aspect, a method for resource allocation for sensor-based neural networks is provided. The method may include identifying one or more nodes associated with an edge computing environment. Data corresponding to a classification dataset is received from the identified nodes. The dataset includes a reference classification and confidence value data. A node is selected from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data. The selected node is assigned to process the classification dataset.


According to another aspect, a computer system for resource allocation for sensor-based neural networks is provided. The computer system may include one or more processors, one or more computer-readable memories, one or more computer-readable tangible storage devices, and program instructions stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, whereby the computer system is capable of performing a method. The method may include identifying one or more nodes associated with an edge computing environment. Data corresponding to a classification dataset is received from the identified nodes. The dataset includes a reference classification and confidence value data. A node is selected from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data. The selected node is assigned to process the classification dataset.


According to yet another aspect, a computer readable medium for resource allocation for sensor-based neural networks is provided. The computer readable medium may include one or more computer-readable storage devices and program instructions stored on at least one of the one or more tangible storage devices, the program instructions executable by a processor. The program instructions are executable by a processor for performing a method that may accordingly include identifying one or more nodes associated with an edge computing environment. Data corresponding to a classification dataset is received from the identified nodes. The dataset includes a reference classification and confidence value data. A node is selected from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data. The selected node is assigned to process the classification dataset.


According to one or more aspects, the node is selected based on historical confidence interval data for the node associated with the reference classification, historical identification data associated with the node, and current confidence value data for the node.


According to one or more aspects, the node is selected through a load-balancing queue and manager based on a determination that the node is better at servicing a given operation than other nodes from among the identified nodes.


According to one or more aspects, the node is selected based on a physical location associated with the node.


According to one or more aspects, the physical location corresponds to a relative location of the node in relation to the other nodes from among the identified nodes.


According to one or more aspects, the node is selected based on a current use of processing resources associated with the other nodes from among the identified nodes.


According to one or more aspects, the method may also include assigning additional nodes from among the identified nodes to process the classification dataset.





BRIEF DESCRIPTION OF THE DRAWINGS

These and other objects, features and advantages will become apparent from the following detailed description of illustrative embodiments, which is to be read in connection with the accompanying drawings. The various features of the drawings are not to scale as the illustrations are for clarity in facilitating the understanding of one skilled in the art in conjunction with the detailed description. In the drawings:



FIG. 1 depicts a cloud computing environment according to one or more embodiments;



FIG. 2 depicts abstraction model layers according to one or more embodiments;



FIG. 3 depicts a block diagram of an example edge computing environment in accordance with one or more embodiments;



FIG. 4 depicts a block diagram of an example edge computing environment that shows high level block diagrams of exemplary edge devices and an edge server in accordance with one or more embodiments;



FIG. 5 depicts a block diagram of an example edge network in accordance with one or more embodiments;



FIG. 6 depicts a block diagram of an example edge network in accordance with one or more embodiments;



FIG. 7 depicts a block diagram of an example CI data packet in accordance with one or more embodiments;



FIG. 8 depicts a block diagram of an example edge computing environment that shows a high level block diagram of an exemplary CIS module in accordance with one or more embodiments; and



FIG. 9 depicts a flowchart of an example process for resource allocation for sensor-based neural networks in accordance with one or more embodiments.





DETAILED DESCRIPTION

Detailed embodiments of the claimed structures and methods are disclosed herein; however, it can be understood that the disclosed embodiments are merely illustrative of the claimed structures and methods that may be embodied in various forms. Those structures and methods may, however, be embodied in many different forms and should not be construed as limited to the exemplary embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete and will fully convey the scope to those skilled in the art. In the description, details of well-known features and techniques may be omitted to avoid unnecessarily obscuring the presented embodiments.


Aspects are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer readable media according to the various embodiments. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.


As previously described, an Artificial Neural Network (ANN), also referred to simply as a neural network, is a computing system made up of a number of simple, highly interconnected processing elements (nodes), which process information by their dynamic state response to external inputs. ANNs are processing devices (algorithms and/or hardware) that are loosely modeled after the neuronal structure of the mammalian cerebral cortex but on much smaller scales. A large ANN might have hundreds or thousands of processor units, whereas a mammalian brain has billions of neurons with a corresponding increase in magnitude of their overall interaction and emergent behavior. A Deep Learning Neural Network, referred to herein as a Deep Neural Network (DNN) is an artificial neural network (ANN) with multiple hidden layers of units between the input and output layers. Similar to shallow ANNs, DNNs can model complex non-linear relationships. DNN architectures, e.g., for object detection and parsing, generate compositional models where the object is expressed as a layered composition of image primitives. The extra layers enable composition of features from lower layers, giving the potential of modeling complex data with fewer units than a similarly performing shallow network. DNNs are typically designed as feedforward networks.


Recent years have seen a rapid increase in the amount of sensor data being generated by Internet of Things (IoT) technologies and other edge computing devices. This has led to a strong interest in adapting Artificial Intelligence (AI) technologies to perform closer to the network edge where the data is generated. Local AI analytics has the potential to provide data inferences closer to real-time as it allows for more immediate processing of the data as it is generated. This is due, in part, to the elimination of network latency that otherwise occurs when the data must be transferred to a cloud server or datacenter for cognitive analysis.


However, in order to benefit from the elimination of network latency, research has been conducted to find ways to adapt cognitive processes to be as fast on edge devices as they are when performed by cloud or datacenter servers having more hardware resources. For example, a growing area of research involves various techniques known as model compression. The goal of model compression is to simplify a large, complex model to produce a lightweight counterpart model that is suitable for deployment in edge devices.


Examples of compression techniques include pruning, quantization, low-rank approximation and sparsity, knowledge distillation, and neural architecture search (NAS). Ideally, the simplified model will achieve the same level of accuracy as the original model. Many of these techniques have shown significant improvement in performance, making it feasible for deployment on edge devices. However, the performance improvements typically involve a tradeoff that results in a reduction in accuracy. Such accuracy reductions are commonly in a range of 2% to 5%, but may in some cases be more than a 20% reduction in accuracy.
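

For illustration only, the following is a minimal Python sketch of one such technique, magnitude-based weight pruning. The array shape, sparsity level, and function name are arbitrary assumptions for explanatory purposes and are not part of this disclosure.

```python
# Minimal sketch of magnitude-based weight pruning, one of the model
# compression techniques listed above. Shapes and the sparsity level
# are arbitrary assumptions, for explanation only.
import numpy as np

def prune_by_magnitude(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction of the weights."""
    k = int(weights.size * sparsity)
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value serves as the pruning threshold
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

layer = np.random.randn(256, 128)            # hypothetical dense layer
compressed = prune_by_magnitude(layer, 0.9)  # keep roughly 10% of weights
print(f"non-zero weights: {np.count_nonzero(compressed)} / {layer.size}")
```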


The illustrative embodiments recognize that presently available solutions do not address or provide adequate solutions for this reduction in accuracy. The illustrative embodiments described herein generally address and solve the above-described problems and other problems related to the accuracy of compressed machine-learning models.


While neural networks are becoming more ubiquitous, each operates independently, making its own decisions based on its gathered sensor data. However, the illustrative embodiments recognize that two well-trained neural networks, whether from the same or different vendors, should be expected to come to the same conclusion or confidence interval (CI). These confidence intervals are used to make decisions for the systems connected to the neural network. For example, in one or more embodiments, two neural network nodes are observing the same relative area, making their own decisions from onboard sensors, and generating their own CIs about what they are detecting. These two neural networks share their data and CIs with each other for a range of benefits.


Embodiments relate generally to the field of machine learning, and more particularly to sensor-based neural networks. The following described exemplary embodiments provide a system, method, and computer program for, among other things, dynamic resource allocation in a neural network (NN) focused on object identification, based on confidence intervals (CIs) shared and processed throughout edge-of-network sensors. Information is shared between nodes to determine which tasks to focus processing resources on and which node(s) in the network can best service the task in progress, based on a history of correct decisions with high CIs.


The goal of dynamic resource allocation is to distribute workload as evenly as possible across a network. In a neural network focused on object identification, this can be accomplished using a resource manager that relies on a history of shared confidence intervals to make resource allocation decisions.


Thus, disclosed embodiments allow for machine-learning nodes on a sensor-based edge network to share data and confidence intervals with other machine-learning nodes on the same or different sensor-based edge networks. This ability enhances the machine-learning node's decisions based on its detected data by having another machine-learning node, possibly with different sensors or training algorithms, reinforce its conclusions or provide additional information.


In one or more embodiments, an edge network includes one or more edge servers and a plurality of edge devices that each include a confidence interval sharing (CIS) module. In one or more embodiments, one or more of the edge devices communicates directly with the edge servers or through other network devices, such as a switch or router. Alternatively, the edge devices communicate directly with other edge devices.


In one or more embodiments, the edge devices are machine-learning nodes. The edge devices each host a respective local AI analytics engine that uses a trained machine-learning model, for example for object identification or classification. In one or more embodiments, an edge device generates datasets based on sensor data from an onboard sensor, e.g., where the sensor data may be image data from an image sensor, temperature data from a temperature sensor, audio data from a microphone, etc. The edge device then processes the sensor data to reach a conclusion as directed by an edge application. For example, the edge device determines a classification for datasets generated from the sensor data, and also generates confidence values associated with each of the classifications.
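

For explanatory purposes only, the following Python sketch shows one way such a classification dataset might be represented on an edge device. The class layout, field names, and model interface are illustrative assumptions, not the disclosed implementation.

```python
# Illustrative sketch of an edge node producing a classification dataset
# with per-classification confidence values. The dataclass layout and
# the model interface are assumptions for explanation only.
from dataclasses import dataclass, field
from typing import Callable, Dict, Tuple

@dataclass
class ClassificationDataset:
    node_id: str
    confidences: Dict[str, float] = field(default_factory=dict)  # label -> [0, 1]

    def top(self) -> Tuple[str, float]:
        """Return the classification with the highest confidence value."""
        label = max(self.confidences, key=self.confidences.get)
        return label, self.confidences[label]

def classify(node_id: str, sensor_frame, model: Callable) -> ClassificationDataset:
    # model(...) is assumed to return a {label: confidence} mapping,
    # e.g. softmax outputs of an object-identification network
    return ClassificationDataset(node_id=node_id, confidences=model(sensor_frame))
```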


In one or more embodiments, an edge device is configured to share these classifications and confidence values with other edge devices. In some such embodiments, the edge device shares only classifications that are associated with confidence values that are within a prescribed confidence interval. For example, in one or more embodiments, the edge device may be configured by user settings or default settings set by a manufacturer. When the edge device processes a dataset, the output will typically include a collection of classifications and associated confidence values.


For example, the edge device may be configured for identifying types of animals or objects in captured images. In such embodiments, the processing of a dataset for a captured image will output a list of possible classifications (e.g., a list of animals or objects) and confidence values associated with each possible classification. In one or more embodiments, the edge device will share only the classification associated with the highest confidence value rather than the entire list. Embodiments in which the edge device shares only the classification data for the classification associated with the highest confidence value avoid excessive network traffic since other edge devices may have no need for classifications associated with low confidence values.


In one or more embodiments, the edge device may predict that an image includes a particular animal or object with a low degree of confidence. In one or more embodiments, the edge device is configured to only share classification data when the confidence value is within a specified confidence interval, for example greater than a specified confidence interval (CI) threshold value. In various embodiments, the CI threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the CI threshold value is adjustable, for example by user settings. In some such embodiments, if the CI threshold value is greater than the highest confidence value, then the edge device will not share any classification data associated with these results. On the other hand, if the CI threshold value is less than the highest confidence value, then the edge device will share the classification associated with the highest confidence value.
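

A minimal Python sketch of this sharing decision follows, for illustration only. The threshold value is an arbitrary assumption; as noted above, it would be highly implementation-specific and possibly user-adjustable.

```python
# Sketch of the sharing decision: share only the classification with the
# highest confidence value, and only when that value exceeds the CI
# threshold. The threshold is an arbitrary, implementation-specific default.
CI_THRESHOLD = 0.80  # adjustable, e.g., via user settings

def classification_to_share(confidences: dict):
    """confidences: {label: value in [0, 1]}. Returns (label, value) or None."""
    label = max(confidences, key=confidences.get)
    if confidences[label] > CI_THRESHOLD:
        return label, confidences[label]
    return None

# e.g. classification_to_share({"DOG": 0.96, "CAT": 0.02}) -> ("DOG", 0.96),
# while a best value of 0.60 yields None under a 0.80 threshold
```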


In one or more embodiments, the edge device may request a classification and associated confidence value from one or more of the other edge devices. An edge device may request classification data from other edge devices for various reasons, for example to use as a reference point of comparison for one or more results of its own classification processing. In one or more embodiments, the edge device compares the highest confidence value to a confidence threshold value, and if the confidence value is less than the confidence threshold value, then the edge device will request classification data from one or more of the other edge devices that may have captured sensor data for the same object. In various embodiments, the confidence threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the confidence threshold value is adjustable, for example by user settings.


In one or more embodiments, if the confidence threshold value is greater than the highest confidence value, then the edge device will generate a request for classification data from one or more of the other edge devices. In one or more embodiments, the edge device may seek to use the confidence value generated by one or more other edge devices as a reference confidence value that serves as a point of comparison for the classification data generated by the edge device. In one or more embodiments, the edge device transmits the request for the reference confidence value to one or more other edge devices directly. In one or more embodiments, the edge device transmits the request for the reference confidence value to one or more of the edge servers. In some such embodiments, the request transmitted to one or more of the edge servers includes an instruction executable by the edge server(s) to cause the edge server(s) to identify and provide the reference confidence values from one or more of the other edge devices.
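

The following Python sketch illustrates one possible form of this request path. The message fields and the `send` transport function are placeholders assumed for illustration; the disclosure does not prescribe a wire format.

```python
# Sketch of the reference-request path: when the node's own highest
# confidence value falls below the confidence threshold, request a
# reference classification from peers, either directly or via an edge
# server that fans the request out. `send` is a placeholder transport.
CONFIDENCE_THRESHOLD = 0.75  # illustrative; implementation-specific

def maybe_request_reference(node_id: str, label: str, confidence: float,
                            peers: list, send) -> bool:
    """Return True if a reference request was issued."""
    if confidence >= CONFIDENCE_THRESHOLD:
        return False  # confident enough on its own
    request = {
        "type": "reference_request",
        "requester": node_id,
        "classification": label,  # the tentative local conclusion
    }
    for peer in peers:  # peers may be edge devices or an edge server
        send(peer, request)
    return True
```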


In one or more embodiments, the edge device determines whether to use classification data generated by other edge devices based on metadata received with the classification data. In one or more embodiments, the metadata may include identifying information about the source edge device that generated the classification data and/or about software, firmware, and hardware on the source edge device. For example, the metadata may include information about the source edge device such as a node identifier, a network address, a manufacturer or vendor name or other identifier, a software identifier and/or version identifier, a firmware identifier and/or version identifier, and/or a hardware identifier and/or version identifier. In some such embodiments, the edge device parses the metadata received with the classification data and extracts information from the metadata. In some such embodiments the edge device compares one or more values extracted from the metadata with stored acceptance values. For example, the edge device may have stored acceptance values that include a list of one or more vendors and/or software versions. In this example, the edge device will compare the vendor and software version extracted from the metadata to its stored acceptance values in order to determine whether to use the classification data.
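

A minimal sketch of such a metadata screen follows. The field names mirror the examples given above (node identifier, network address, vendor, software version), but the packet layout and the acceptance values are assumptions for explanatory purposes.

```python
# Sketch of the metadata screen: compare values parsed from the received
# metadata against stored acceptance values. The dataclass layout and
# acceptance sets are illustrative assumptions only.
from dataclasses import dataclass

@dataclass
class SourceMetadata:
    node_id: str
    network_address: str
    vendor: str
    software_version: str

ACCEPTED_VENDORS = {"vendor-a", "vendor-b"}   # stored acceptance values
ACCEPTED_VERSIONS = {"2.1", "2.2"}            # illustrative only

def accept_classification_data(meta: SourceMetadata) -> bool:
    """Decide whether classification data from this source may be used."""
    return (meta.vendor in ACCEPTED_VENDORS
            and meta.software_version in ACCEPTED_VERSIONS)
```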


In one or more embodiments, if the edge device receives classification data that is acceptable, the edge device calculates a confidence difference between the confidence value generated by the edge device and the confidence value used as a reference confidence value that was generated by, and received from, another one of the edge devices. The confidence difference may be used by the edge device as an indication of how much more or less certain another edge device was about its classification conclusion. In one or more embodiments, the edge device then compares the confidence difference to a difference threshold value. In some such embodiments the edge device may use this comparison as a health check. In some such embodiments, if the confidence difference exceeds the difference threshold value, this may indicate that the edge device is malfunctioning or needs to be updated. In some such embodiments, the edge device generates, as an output replacement for the first classification dataset, a replacement dataset comprising the reference classification and an indication that the first confidence value is less than the reference confidence value. Thus, the difference threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the difference threshold value is adjustable, for example by user settings.
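

For illustration only, a Python sketch of this confidence-difference check follows. The difference threshold and the output dictionary keys are assumptions; the disclosure leaves both implementation-specific.

```python
# Sketch of the confidence-difference health check: compare the local
# confidence value against a peer's reference confidence value and, when
# the gap exceeds the difference threshold, emit a replacement dataset
# comprising the reference classification plus a flag noting that the
# local confidence was lower.
DIFFERENCE_THRESHOLD = 0.30  # illustrative; adjustable via user settings

def reconcile(local_label: str, local_conf: float,
              ref_label: str, ref_conf: float) -> dict:
    difference = ref_conf - local_conf
    if difference <= DIFFERENCE_THRESHOLD:
        # Within tolerance: keep the locally generated classification.
        return {"classification": local_label, "confidence": local_conf}
    # Large gap: possible malfunction or stale model on this node.
    return {
        "classification": ref_label,          # output replacement
        "confidence": ref_conf,
        "local_confidence_was_lower": True,   # safeguard indication
    }
```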


In one or more embodiments, the edge device may conversely receive a request for classification data, e.g., for the classification and associated confidence value for the highest confidence value. In one or more embodiments, the edge device may determine whether to provide this classification based on whether the highest confidence value is within a specified confidence interval as described above. In one or more embodiments, the edge device responds to the request by transmitting the requested classification data to the requesting edge device either directly or via an edge server.


In one or more embodiments, the edge device shares classification data if the highest confidence value is within the specified confidence interval, without the need to receive a request. In some such embodiments, as the edge device generates classification data, the edge device evaluates each confidence value to determine if the confidence value is within the specified confidence interval. For classification data that meets this criterion, the edge device broadcasts the classification data towards other edge devices on the edge network. On the other hand, in one or more embodiments, the edge device does not broadcast classification data that includes confidence values that are not within the specified confidence interval.


It is to be understood that although this disclosure includes a detailed description on cloud computing, implementation of the teachings recited herein are not limited to a cloud computing environment. Rather, embodiments disclosed herein are capable of being implemented in conjunction with any other type of computing environment now known or later developed.


Cloud computing is a model of service delivery for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services) that can be rapidly provisioned and released with minimal management effort or interaction with a provider of the service. This cloud model may include at least five characteristics, at least three service models, and at least four deployment models.


Characteristics are as follows:

    • On-demand self-service: a cloud consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with the service's provider.
    • Broad network access: capabilities are available over a network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, laptops, and PDAs).
    • Resource pooling: the provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to demand. There is a sense of location independence in that the consumer generally has no control or knowledge over the exact location of the provided resources but may be able to specify location at a higher level of abstraction (e.g., country, state, or datacenter).
    • Rapid elasticity: capabilities can be rapidly and elastically provisioned, in some cases automatically, to quickly scale out and rapidly released to quickly scale in. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be purchased in any quantity at any time.
    • Measured service: cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service.


Service Models are as follows:

    • Software as a Service (SaaS): the capability provided to the consumer is to use the provider's applications running on a cloud infrastructure. The applications are accessible from various client devices through a thin client interface such as a web browser (e.g., web-based e-mail). The consumer does not manage or control the underlying cloud infrastructure including network, servers, operating systems, storage, or even individual application capabilities, with the possible exception of limited user-specific application configuration settings.
    • Platform as a Service (PaaS): the capability provided to the consumer is to deploy onto the cloud infrastructure consumer-created or acquired applications created using programming languages and tools supported by the provider. The consumer does not manage or control the underlying cloud infrastructure including networks, servers, operating systems, or storage, but has control over the deployed applications and possibly application hosting environment configurations.
    • Infrastructure as a Service (IaaS): the capability provided to the consumer is to provision processing, storage, networks, and other fundamental computing resources where the consumer is able to deploy and run arbitrary software, which can include operating systems and applications. The consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, deployed applications, and possibly limited control of select networking components (e.g., host firewalls).


Deployment Models are as follows:

    • Private cloud: the cloud infrastructure is operated solely for an organization. It may be managed by the organization or a third party and may exist on-premises or off-premises.
    • Community cloud: the cloud infrastructure is shared by several organizations and supports a specific community that has shared concerns (e.g., mission, security requirements, policy, and compliance considerations). It may be managed by the organizations or a third party and may exist on-premises or off-premises.
    • Public cloud: the cloud infrastructure is made available to the general public or a large industry group and is owned by an organization selling cloud services.
    • Hybrid cloud: the cloud infrastructure is a composition of two or more clouds (private, community, or public) that remain unique entities but are bound together by standardized or proprietary technology that enables data and application portability (e.g., cloud bursting for load-balancing between clouds).


A cloud computing environment is service oriented with a focus on statelessness, low coupling, modularity, and semantic interoperability. At the heart of cloud computing is an infrastructure that includes a network of interconnected nodes.


With reference to FIG. 1, this figure illustrates cloud computing environment 100. As shown, cloud computing environment 100 includes one or more cloud computing nodes 10 with which local computing devices used by cloud consumers, such as, for example, personal digital assistant (PDA) or cellular telephone 54A, desktop computer 54B, laptop computer 54C, and/or automobile computer system 54N may communicate. Nodes 10 may communicate with one another. They may be grouped (not shown) physically or virtually, in one or more networks, such as Private, Community, Public, or Hybrid clouds as described hereinabove, or a combination thereof. This allows cloud computing environment 100 to offer infrastructure, platforms and/or software as services for which a cloud consumer does not need to maintain resources on a local computing device. It is understood that the types of computing devices 54A-N shown in FIG. 1 are intended to be illustrative only and that computing nodes 10 and cloud computing environment 100 can communicate with any type of computerized device over any type of network and/or network addressable connection (e.g., using a web browser).


With reference to FIG. 2, this figure depicts a set of functional abstraction layers 200 provided by cloud computing environment 100 (FIG. 1). It should be understood in advance that the components, layers, and functions shown in FIG. 2 are intended to be illustrative only and embodiments are not limited thereto. As depicted, the following layers and corresponding functions are provided:


Hardware and software layer 60 includes hardware and software components. Examples of hardware components include: mainframes 61; RISC (Reduced Instruction Set Computer) architecture based servers 62; servers 63; blade servers 64; storage devices 65; and networks and networking components 66. In one or more embodiments, software components include network application server software 67 and database software 68.


Virtualization layer 70 provides an abstraction layer from which the following examples of virtual entities may be provided: virtual servers 71; virtual storage 72; virtual networks 73, including virtual private networks; virtual applications and operating systems 74; and virtual clients 75.


In one example, management layer 80 may provide the functions described below. Resource provisioning 81 provides dynamic procurement of computing resources and other resources that are utilized to perform tasks within the cloud computing environment. Metering and Pricing 82 provide cost tracking as resources are utilized within the cloud computing environment, and billing or invoicing for consumption of these resources. In one example, these resources may include application software licenses. Security provides identity verification for cloud consumers and tasks, as well as protection for data and other resources. User portal 83 provides access to the cloud computing environment for consumers and system administrators. Service level management 84 provides cloud computing resource allocation and management such that required service levels are met. Service Level Agreement (SLA) planning and fulfillment 85 provide pre-arrangement for, and procurement of, cloud computing resources for which a future requirement is anticipated in accordance with an SLA.


Workloads layer 90 provides examples of functionality for which the cloud computing environment may be utilized. Examples of workloads and functions which may be provided from this layer include: mapping and navigation 91; software development and lifecycle management 92; virtual classroom education delivery 93; data analytics processing 94; transaction processing 95; and edge network management processing 96.


With reference to FIG. 3, this figure depicts a block diagram of an example edge computing environment 300 in accordance with one or more embodiments. In one or more embodiments, the edge computing environment 300 includes a cloud server 302 in communication with an edge network 304. In one or more embodiments, the cloud server 302 includes management modules 308, 312 that are deployed in workloads layer 90 of FIG. 2, providing edge network management processing 96.


In one or more embodiments, the cloud server 302 includes a device registry 306, a device management module 308, a model repository 310, and a model management module 312. The device registry 306 stores information about devices that the edge system can read, communicate with, control, provision, or manage. The device management module 308 manages edge nodes and the service software lifecycle on edge nodes autonomously. The model repository 310 stores AI models for reference and further analysis. The model management module 312 supports storage, delivery, and security of models and other metadata packages.


In one or more embodiments, the edge network 304 includes one or more edge servers 314, a router 316, and a plurality of edge devices 318-325 that each include a confidence interval sharing (CIS) module 326. The edge servers 314 facilitate communications between the edge devices 318-325 and the cloud server 302. In one or more embodiments, the edge servers 314 also facilitate communications between the edge devices 318-325. In one or more embodiments, one or more of the edge devices 318-325 communicates directly with the edge servers 314 or through other network devices, such as the router 316. For example, in one or more embodiments, the edge devices 318-321 communicate directly with the edge servers 314, and edge devices 322-325 communicate with the edge servers 314 via the router 316.


In one or more embodiments, the edge devices 318-325 serve as non-limiting examples of machine-learning nodes. In one or more embodiments, the edge devices include a box camera 318, temperature sensor 319, smoke detector 320, smart phone 321, smoke alarm 322, dome camera 323, motion sensor 324, and box camera 325. Actual implementations may include additional or fewer edge devices.


In one or more embodiments, edge devices 318-325 each host a respective local AI analytics engine (e.g., AI analytics engine 412 of FIG. 4) that has been trained, for example by model management module 312 in the cloud server 302. The local AI analytics engines may be manually deployed to the edge devices 318-325, for example, when the edge devices 318-325 are deployed into an application. In some examples, however, the edge devices 318-325 may request that an analytics engine be downloaded from the cloud server 302. In one or more embodiments, the AI analytics engines vary amongst the edge devices 318-325. For example, in one or more embodiments, some of the edge devices 318-325 and their respective AI analytics engines are identical, while others of the edge devices 318-325 have respective AI analytics engines that differ from each other, for example by including software, firmware, and/or hardware from different vendors, and/or different software, firmware, and/or hardware versions.


Illustrative embodiments will be described using the edge device 318 as an example for the sake of simplicity with the understanding that the description applies equally to each of the edge devices 318-325. In one or more embodiments, the edge device 318 generates datasets based on sensor data from its sensor, e.g., where the sensor data may be image data from an image sensor of the edge device 318. The edge device 318 then determines a classification for the datasets and confidence values associated with each of the classifications.


In one or more embodiments, the edge device 318 is configured to share these classifications and confidence values with other edge devices 319-325. In some such embodiments, the edge device 318 shares only classifications that are associated with confidence values that are within a prescribed confidence interval. For example, in one or more embodiments, the edge device 318 may be configured by user settings or default settings set by a manufacturer. When the edge device 318 processes a dataset, the output will typically include a collection of classifications and associated confidence values. For example, the edge device 318 may be configured for identifying types of animals in images and may capture an image of a dog; the processing of the dataset for this image will output a list of possible animals and confidence values associated with each possible animal. If the edge device 318 is able to identify that the image is an image of a dog with a high degree of confidence, the output may be a list of animals and associated confidence values in which the confidence value for dog is very high, and the confidence values for other animals are very low. A very simplified example of such an output is provided for explanatory purposes in Table 1 below:


TABLE 1

Classification    Confidence Value
DOG               0.96
CAT               0.02
COW               0.00
PIG               0.01
HORSE             0.00
RABBIT            0.01
BIRD              0.00


In one or more embodiments, the edge device 318 will share only the classification DOG and its associated confidence value 0.96 rather than the entire list. Embodiments in which the edge device 318 shares only the classification data for the classification associated with the highest confidence value avoid excessive network traffic since other edge devices may have no need for classifications associated with low confidence values.


It is also possible that the edge device 318 may conclude that the image is an image of a dog, but arrive at this conclusion with a relatively low degree of confidence. A very simplified example of such an output is provided for explanatory purposes in Table 2 below:


TABLE 2

Classification    Confidence Value
DOG               0.60
CAT               0.26
COW               0.00
PIG               0.12
HORSE             0.00
RABBIT            0.02
BIRD              0.00


In one or more embodiments, the edge device 318 is configured to only share classification data when the confidence value is within a specified confidence interval, for example greater than a specified confidence interval (CI) threshold value. In various embodiments, the CI threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the CI threshold value is adjustable, for example by user settings. In the example results shown in Table 2, if the CI threshold value is greater than 0.60, then the edge device 318 will not share any classification data associated with these results. On the other hand, if the CI threshold value is less than 0.60, then the edge device 318 will share the classification associated with the highest confidence value (DOG, 0.60).


In one or more embodiments, the edge device 318 may request a classification and associated confidence value from one or more of the other edge devices 319-325. An edge device may request classification data from other edge devices for various reasons, for example to use as a reference point of comparison for one or more results of its own classification processing. Using the results shown in Table 2 above as an example, the edge device 318 compares the confidence value (0.60) to a confidence threshold value, and if the confidence value is less than the confidence threshold value, then the edge device 318 will request classification data from one or more of the other edge devices 319-325 that may have captured sensor data for the same animal. In various embodiments, the confidence threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the confidence threshold value is adjustable, for example by user settings.


In the example shown in Table 2, if the confidence threshold value is greater than 0.60, then the edge device 318 will generate a request for classification data from one or more of the other edge devices 319-325 that may have captured sensor data of the same animal. In one or more embodiments, the edge device 318 may seek to use the confidence value generated by one or more other edge devices as a reference confidence value that serves as a point of comparison for the classification data generated by edge device 318. In one or more embodiments, the edge device 318 transmits the request for the reference confidence value to one or more other edge devices 319-325 directly. In one or more embodiments, the edge device 318 transmits the request for the reference confidence value to one or more of the edge servers 314. In some such embodiments, the request transmitted to one or more of the edge servers 314 includes an instruction executable by the edge server(s) 314 to cause the edge server(s) 314 to identify and provide the reference confidence values from one or more of the other edge devices 319-325.


In one or more embodiments, the edge device 318 receives a classification dataset that includes a classification and an associated confidence value generated by another of the edge devices 319-325 and used by the edge device 318 as a reference classification and an associated reference confidence value. The edge device 318 determines whether to use the reference classification and reference confidence value based on metadata received with the classification dataset. In one or more embodiments, the metadata may include identifying information about the source edge device that generated the classification dataset and/or about software, firmware, and hardware on the source edge device. For example, the metadata may include information about the source edge device such as a node identifier, a network address, a manufacturer or vendor name or other identifier, a software identifier and/or version identifier, a firmware identifier and/or version identifier, and/or a hardware identifier and/or version identifier. In some such embodiments, the edge device 318 parses the metadata received with the classification dataset and extracts information from the metadata. In some such embodiments the edge device 318 compares one or more values extracted from the metadata with stored acceptance values. For example, the edge device 318 may have stored acceptance values that include a list of one or more vendors and/or software versions. In this example, the edge device 318 will compare the vendor and software version extracted from the metadata to its stored acceptance values in order to determine whether to use the classification dataset.


In one or more embodiments, if the edge device 318 determines that the classification dataset is acceptable, the edge device 318 calculates a confidence difference between the confidence value generated by the edge device 318 and the reference confidence value. The confidence difference may be used by the edge device 318 as an indication of how much more or less certain another edge device was about its classification conclusion. In one or more embodiments, the edge device 318 then compares the confidence difference to a difference threshold value. In some such embodiments, if the confidence difference exceeds the difference threshold value, this may indicate that another of the edge devices 319-325, which generated the reference classification and the reference confidence value, had a better perspective of the subject of the classification processing. Thus, if the reference classification is different than the classification generated by the edge device 318, and the reference confidence value is much higher than the confidence value generated by the edge device 318, it will likely be more accurate for the edge device 318 to output the reference classification in place of the classification generated by the edge device 318. Thus, in one or more embodiments, the edge device 318 generates a replacement dataset as an output replacement for the classification dataset generated by the edge device 318, in which the output replacement comprises the reference classification along with, or in place of, the classification generated by the edge device 318. Also, in one or more embodiments, as a safeguard, the replacement output also includes an indication that the confidence value generated by the edge device 318 is less than the reference confidence value.


In some such embodiments the edge device 318 may use this comparison as a health check. In some such embodiments, if the confidence difference exceeds the difference threshold value, this may indicate that the edge device 318 is malfunctioning or needs to be updated. In some such embodiments, the edge device 318 generates, as an output replacement for the first classification dataset, a replacement dataset comprising the reference classification and an indication that the first confidence value is less than the reference confidence value. Thus, the difference threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the difference threshold value is adjustable, for example by user settings.


In one or more embodiments, the edge device 318 may conversely receive a request for classification data, e.g., for the classification and associated confidence value for the highest confidence value (e.g., DOG and 0.96 in the example shown in Table 1). In one or more embodiments, the edge device 318 may determine whether to provide this classification based on whether the highest confidence value is within a specified confidence interval as described above. In one or more embodiments, the edge device 318 responds to the request by transmitting the requested classification data to the requesting edge device either directly or via an edge server 314.


In one or more embodiments, the edge device 318 shares classification data if the highest confidence value is within the specified confidence interval without the need to receive a request. In some such embodiments, as the edge device 318 generates classification data, the edge device 318 evaluates each confidence value to determine if the confidence value is within the specified confidence interval. For classification data that meets these criteria, the edge device 318 broadcasts the classification data towards other edge devices 319-325 on the edge network 304. On the other hand, in one or more embodiments, the edge device 318 does not broadcast classification data that includes confidence values that are not within the specified confidence interval.


The edge device 318 may take an output from a node in the neural network and determine whether to continue processing it. The edge device 318 may determine where on the network to perform the processing, such as passing processing on to different/better-trained nodes such as edge devices 319-325. The edge device 318 may perform resource allocation through a load balancer or load manager.


Allocation of the resources may include metadata, such as node identification details, a physical location of the node, a relative location, and a timestamp. The edge device 318 may also determine available resources through a load-balancing queue and manager. The edge device 318 may determine which nodes are better at servicing certain types of operations by comparing the history of CIs on shared common events. The edge device 318 may account for scope and/or locality so that localities or partitions may be created to handle allocation. The edge device 318 may also factor in the current use of processing resources, the number of tasks already in progress, and whether the node is already working on the task under inquiry.


The edge devices 318-325 may be considered to be nodes in the edge network 304. The nodes can share processing power, both available and consumed, as well as a sub-network of under-utilized nodes. Some nodes may have to process more information than others by the nature of the network. Nodes will have different latencies when communicating back to the central system. Thus, the edge computing environment 300 may have the ability to decide which node(s) can best service the task in progress. For this decision, a neural network node begins by making a preliminary identification on an object (e.g., a person, animal, or tree). It then passes off data (including CIs), as well as the rest of the processing, to one or more different nodes that are better trained (or to central processing, which has more processing power to do further work). In the case of passing to a “better trained” node, such a node may be identified based on the node's historically high confidence interval for the object in question, the node's historically accurate identifications of the object, and the node's currently high confidence interval for the object in question, as sketched below.
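

The following Python sketch illustrates one possible scoring scheme for this "better trained" node selection, combining the three factors just described. The history structure and the scoring weights are arbitrary assumptions for explanatory purposes.

```python
# Sketch of "better trained node" selection: score each candidate by
# (i) its historically high CIs for the object class, (ii) its history
# of accurate identifications, and (iii) its current CI for the object
# in question. The weights are arbitrary assumptions.
def score_node(history: dict, node_id: str, object_class: str,
               current_ci: float) -> float:
    past = history.get((node_id, object_class), [])  # [(ci, was_correct), ...]
    if not past:
        return current_ci  # no track record; rely on the current CI alone
    mean_ci = sum(ci for ci, _ in past) / len(past)
    accuracy = sum(1 for _, correct in past if correct) / len(past)
    return 0.4 * mean_ci + 0.4 * accuracy + 0.2 * current_ci

def pick_best_node(history: dict, candidates: dict, object_class: str) -> str:
    """candidates: {node_id: current CI for the object in question}."""
    return max(candidates,
               key=lambda n: score_node(history, n, object_class, candidates[n]))
```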


With reference to FIG. 4, this figure depicts a block diagram of an example edge computing environment 400 that shows high level block diagrams of exemplary edge devices 404A and 404B and edge server 418 in accordance with one or more embodiments. In one or more embodiments, edge devices 404A and 404B are examples of edge devices 318-321 of FIG. 3 and edge server 418 is an example of edge server 314 of FIG. 3.


In one or more embodiments, the edge server 418 on an edge network 402 is in communication with a plurality of edge devices, including edge devices 404A and 404B. The edge server 418 is also in communication with a cloud server 426 and a user device 428. Each of the edge devices 404A and 404B includes an edge application 406, a sensor 408, an AI model 410, an AI analytics engine 412, a CIS module 414, and a Network Interface Controller (NIC) 416. Also, the edge server 418 includes an edge application 420, a user interface 422, and a NIC 424. In one or more embodiments, the functionality described herein is distributed among a plurality of systems, which can include combinations of software and/or hardware based systems, for example Application-Specific Integrated Circuits (ASICs), computer programs, or smart phone applications.


In one or more embodiments, the edge applications 406 and 420 are applications, or components thereof, that utilize classification data generated by the edge devices 404A and 404B. The edge devices 404A and 404B each host a respective local AI analytics engine 412 that uses a trained AI model 410.


Illustrative embodiments will be described using the edge device 404A as an example for the sake of simplicity with the understanding that the description applies equally to each of the edge devices 404A and 404B. In one or more embodiments, the AI analytics engine 412 generates datasets based on sensor data from its sensor 408. The AI analytics engine 412 then determines a classification for the datasets and confidence values associated with each of the classifications.


In one or more embodiments, the CIS module 414 is an example of CIS module 326 of FIG. 3. The CIS module 414 is configured to share the classifications and confidence values from the AI analytics engine 412 with edge device 404B. In some such embodiments, the CIS module 414 shares only classifications that are associated with confidence values that are within a prescribed confidence interval. For example, in one or more embodiments, the CIS module 414 may be configured by user settings using the user interface 422 or default settings set by a manufacturer. When the AI analytics engine 412 processes a dataset, the output will typically include a collection of classifications and associated confidence values, such as those shown in Tables 1 and 2. With reference to the example shown in Table 1, the CIS module 414 will share only the classification DOG and its associated confidence value 0.96 rather than the entire list. Embodiments in which the CIS module 414 shares only the classification data for the classification associated with the highest confidence value avoid excessive network traffic since other edge devices may have no need for classifications associated with low confidence values.


In one or more embodiments, the CIS module 414 is configured to only share classification data when the confidence value is within a specified confidence interval, for example greater than a specified confidence interval (CI) threshold value. In various embodiments, the CI threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the CI threshold value is adjustable, for example by user settings set by a user via the user interface 422. In the example results shown in Table 2, if the CI threshold value is greater than 0.60, then the CIS module 414 will not share any classification data associated with these results. On the other hand, if the CI threshold value is less than 0.60, then the CIS module 414 will share the classification associated with the highest confidence value (DOG, 0.60).


In one or more embodiments, the CIS module 414 may request a classification and associated confidence value from the edge device 404B. An edge device may request classification data from other edge devices for various reasons, for example to use as a reference point of comparison for one or more results of its own classification processing. Using the results shown in Table 2 above as an example, the CIS module 414 compares the confidence value (0.60) to a confidence threshold value, and if the confidence value is less than the confidence threshold value, then the CIS module 414 will request classification data from edge device 404B. In various embodiments, the confidence threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the confidence threshold value is adjustable, for example by user settings set via the user interface 422.


In one or more embodiments, the CIS module 414 transmits the request for the reference confidence value via the NIC 416 to edge device 404B directly. In one or more embodiments, the edge device 404A transmits the request for the reference confidence value via the NIC 416 to edge server 418. In some such embodiments, the request transmitted to the edge server 418 is received by NIC 424. The request includes an instruction executable by the edge server 418 to cause the edge server 418 to identify and provide the reference confidence values from edge device 404B.


In one or more embodiments, the CIS module 414 determines whether to use classification data generated by the other edge device 404B based on metadata received with the classification data. In one or more embodiments, the metadata may include identifying information about the source edge device that generated the classification data and/or about software, firmware, and hardware on the source edge device. For example, the metadata may include information about the source edge device such as a node identifier, a network address, a manufacturer or vendor name or other identifier, a software identifier and/or version identifier, a firmware identifier and/or version identifier, and/or a hardware identifier and/or version identifier. In some such embodiments, the CIS module 414 parses the metadata received with the classification data and extracts information from the metadata. In some such embodiments, the CIS module 414 compares one or more values extracted from the metadata with stored acceptance values. For example, the CIS module 414 may have stored acceptance values that include a list of one or more vendors and/or software versions. In this example, the CIS module 414 will compare the vendor and software version extracted from the metadata to its stored acceptance values in order to determine whether to use the classification data.


In one or more embodiments, if the CIS module 414 receives classification data that is acceptable, the CIS module 414 calculates a confidence difference between the confidence value generated by the AI analytics engine 412 and the confidence value used as a reference confidence value that was generated by, and received from, the edge device 404B. The confidence difference may be used by the CIS module 414 as an indication of how much more or less certain another edge device was about its classification conclusion. In one or more embodiments, the CIS module 414 then compares the confidence difference to a difference threshold value. In some such embodiments, the edge device 404A may use this comparison as a health check. In some such embodiments, if the confidence difference exceeds the difference threshold value, this may indicate that the edge device 404A is malfunctioning or needs to be updated. In some such embodiments, the edge device 404A generates, as an output replacement for the first classification dataset, a replacement dataset comprising the reference classification and an indication that the first confidence value is less than the reference confidence value. In various embodiments, the difference threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the difference threshold value is adjustable, for example by user settings.
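The following sketch illustrates the confidence-difference health check and the replacement dataset it can produce; the threshold value and the field names in the replacement dataset are assumptions.

```python
# Hypothetical confidence-difference health check producing a replacement
# dataset when the reference is markedly more confident than the local result.
DIFFERENCE_THRESHOLD = 0.30  # implementation-specific and adjustable

def health_check(local_confidence, reference_confidence, reference_label):
    difference = reference_confidence - local_confidence
    if difference > DIFFERENCE_THRESHOLD:
        # Possible malfunction or stale model: emit the replacement dataset.
        return {"classification": reference_label,
                "first_confidence_less_than_reference": True}
    return None  # within tolerance; keep the first classification dataset

print(health_check(0.60, 0.96, "DOG"))
```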


In one or more embodiments, the CIS module 414 may conversely receive a request for classification data, e.g., for the classification and associated confidence value for the highest confidence value (e.g., DOG and 0.96 in the example shown in Table 1). In one or more embodiments, the CIS module 414 may determine whether to provide this classification based on whether the highest confidence value is within a specified confidence interval as described above. In one or more embodiments, the CIS module 414 responds to the request by transmitting the requested classification data to the requesting edge device either directly or via edge server 418.


In one or more embodiments, the CIS module 414 shares classification data if the highest confidence value is within the specified confidence interval, without the need to receive a request. In some such embodiments, as the AI analytics engine 412 generates classification data, the CIS module 414 evaluates each confidence value to determine if the confidence value is within the specified confidence interval. For classification data that meets this criterion, the CIS module 414 broadcasts the classification data to other edge devices, including edge device 404B on the edge network 402. On the other hand, in one or more embodiments, the CIS module 414 does not broadcast classification data that includes confidence values that are not within the specified confidence interval.
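An unsolicited broadcast of this kind might look like the following sketch, in which broadcast() is a hypothetical stand-in for the NIC and edge-network transport.

```python
# Hypothetical unsolicited sharing: broadcast only results whose confidence
# values fall within the specified confidence interval.
def share_unsolicited(classifications, ci_threshold, broadcast):
    for label, confidence in classifications.items():
        if confidence > ci_threshold:
            broadcast({"item": label, "ci": confidence})
        # results below the threshold are deliberately not broadcast

share_unsolicited({"DOG": 0.96, "CAT": 0.03}, 0.90, broadcast=print)
```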


With reference to FIG. 5, this figure depicts a block diagram of an example edge network 500 in accordance with one or more embodiments. In one or more embodiments, the edge network includes edge devices 502-504 in communication with an edge server 510. In one or more embodiments, edge devices 502-504 are examples of edge devices 318-321 of FIG. 3 and edge server 510 is an example of edge server 314 of FIG. 3.


One or more embodiments are shown and described in which the edge devices 502-504 are dome cameras having image-capturing sensors. However, alternative embodiments include other types of edge devices having other types of sensors. The edge devices 502-504 each host a respective local AI analytics engine (e.g., AI analytics engine 412 of FIG. 4). In one or more embodiments, the edge devices 502-504 are configured for identifying various objects, such as suitcase 506, oversized luggage 507, backpack 508, and luggage cart 509.


The edge devices 502-504 identify these and other types of objects in images captured by their respective image sensors. The edge devices 502-504 may be physically located close enough to each other to capture images of a same object at the same time, but at different angles and at different distances. As a result, the image quality will vary among the edge devices 502-504, as will the confidence values associated with the images. However, in one or more embodiments, the edge devices 502-504 are configured to share confidence values via the edge server 510. Thus, when one of the edge devices 502-504 is unable to get a clear image of an object due to distance, angle, or other factors, it may use shared confidence values from the other edge devices 502-504 to help confirm a classification having a low confidence value.


For example, edge device 504 may classify object 506 as a suitcase with a low confidence value, while edge device 502 is able to classify object 506 as a suitcase with a high confidence value due to edge device 502 being much closer to object 506 than edge device 504. In one or more embodiments, the edge device 504 detects the low confidence value and, responsive to detecting the low confidence value, requests classification data via the edge server 510. The edge server 510 receives classification data from the edge device 502 having a high confidence value and provides this classification data to the edge device 504. The edge device 504 is then able to use this classification data from edge device 502 to confirm its classification result.


With reference to FIG. 6, this figure depicts a block diagram of an example edge network 600 in accordance with one or more embodiments. In one or more embodiments, the edge network includes edge devices 602-604 in communication with each other via the edge network 600. In one or more embodiments, edge devices 602-604 are examples of edge devices 318-321 of FIG. 3.


One or more embodiments are shown and described in which the edge devices 602-604 are dome cameras having image-capturing sensors. However, alternative embodiments include other types of edge devices having other types of sensors. The edge devices 602-604 each host a respective local AI analytics engine (e.g., AI analytics engine 412 of FIG. 4). In one or more embodiments, the edge devices 602-604 are configured for identifying various objects, such as suitcase 606, oversized luggage 607, backpack 608, and luggage cart 609.


The edge devices 602-604 identify these and other types of objects in images captured by their respective image sensors. The edge devices 602-604 may be physically located close enough to each other to capture images of the same object at the same time, but at different angles and at different distances. As a result, the image quality will vary among the edge devices 602-604, as will the confidence values associated with the images. However, in one or more embodiments, the edge devices 602-604 are configured to share confidence values directly via the edge network 600. Thus, when one of the edge devices 602-604 is unable to get a clear image of an object due to distance, angle, or other factors, it may use shared confidence values from the other edge devices 602-604 to help confirm a classification having a low confidence value.


For example, edge device 604 may classify object 606 as a suitcase with a low confidence value, while edge device 602 is able to classify object 606 as a suitcase with a high confidence value due to edge device 602 being much closer to object 606 than edge device 604. In one or more embodiments, the edge device 604 detects the low confidence value and, responsive to detecting the low confidence value, requests classification data via the edge network 600. The edge device 602 receives the request and responds by transmitting the classification data having a high confidence value to the edge device 604. The edge device 604 is then able to use this classification data from edge device 602 to confirm its classification result.


With reference to FIG. 7, this figure depicts a block diagram of an example CI data packet 700. In one or more embodiments, the CI data packet 700 is an example of classification data shared between edge devices, such as edge devices 602-604 of FIG. 6.


One or more embodiments are shown and described in which the CI data packet 700 is generated by an edge device configured for identifying various objects, such as the suitcase 606, oversized luggage 607, backpack 608, and luggage cart 609 of FIG. 6. In one or more embodiments, the CI data packet 700 includes a data header 702 and a data body 706. The CI data packet 700 may be formatted according to any desired data model (e.g., comma-separated values (CSV), Extensible Markup Language (XML), JavaScript Object Notation (JSON), YAML Ain't Markup Language (YAML), etc.).


The data header 702 includes metadata 704. The metadata 704 may include various types of data. The types of data included in the metadata 704 will be highly implementation-specific, and therefore may be any desired types of data. In one or more embodiments, the metadata 704 includes one or more timestamps indicating times and dates for when the data was captured, location data indicating a location of the edge device transmitting the CI data packet 700 (e.g., GPS coordinates, floor of a building, room or corridor identifier, etc.), identifier data indicating an identity of the edge device transmitting the CI data packet 700 (e.g., node identifier, IP address, serial or inventory number, etc.), and/or software version data (e.g., software name, vendor name, software version, etc.).


The data body 706 includes one or more item names 708 and associated confidence interval (CI) data 710. The item names 708 include the classification results arrived at by the AI analytics engine of the transmitting edge device (e.g., suitcase, oversized luggage, backpack, luggage cart).


The CI data 710 includes the confidence values associated with each of the items 708 (e.g., 0.85, 0.89, 0.92, 0.90). The CI data 710 in one or more embodiments includes values that are low precision floating point values ranging from 0 to 1, with 0 being very uncertain and 1 being very certain. In alternative embodiments, the CI data 710 includes values that are higher or lower precision and/or are based on alternative scales and/or alternative formats (e.g., 0 to 100, 0% to 100%, etc.). In one or more embodiments, the CI data 710 may be in a format other than a numerical format, such as alphanumeric characters or words indicating certainty (e.g., unknown, uncertain, fairly certain, extremely certain, etc.).
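By way of illustration, a CI data packet of this kind, serialized as JSON, might be constructed as follows. Every field name here is an assumption; the item names and confidence values echo the FIG. 7 examples above.

```python
import json
import time

# Hypothetical CI data packet per FIG. 7, serialized as JSON; field names
# are assumptions, while the items/CI values echo the examples above.
packet = {
    "header": {  # data header 702 carrying metadata 704
        "timestamp": time.time(),
        "location": {"floor": 2, "corridor": "B"},
        "node_id": "dome-camera-17",
        "software": {"vendor": "VendorA", "version": "2.1"},
    },
    "body": {  # data body 706 with item names 708 and CI data 710
        "items": ["suitcase", "oversized luggage", "backpack", "luggage cart"],
        "ci": [0.85, 0.89, 0.92, 0.90],
    },
}
print(json.dumps(packet, indent=2))
```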


With reference to FIG. 8, this figure depicts a block diagram of an example edge computing environment 800 that shows a high level block diagram of an exemplary CIS module 802 in accordance with one or more embodiments. In one or more embodiments, CIS module 802 is an example of CIS module 414 of FIG. 4.


In one or more embodiments, the CIS module 802 includes confidence system (CS) request response module 804, CS receiving module 806, reference CS requesting module 808, reference CS receiving module 810, CS comparison module 812, and alert generating module 814. In one or more embodiments, the functionality described herein is distributed among a plurality of systems, which can include combinations of software and/or hardware based systems, for example Application-Specific Integrated Circuits (ASICs), computer programs, or smart phone applications. In the view shown in FIG. 8, elements having the same element number as elements of FIG. 4 generally identify the same or similar elements and/or components as those described with reference to FIG. 4 (i.e., AI analytics engine 412, CIS module 414, edge device 404B, edge server 418, and user interface 422).
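Structurally, this decomposition could be organized as in the following skeleton. The class and method names are illustrative only, and the method bodies are intentionally elided.

```python
# Illustrative skeleton of the CIS module 802 decomposition; all names
# are assumptions, and the method bodies are intentionally elided.
class CISModule:
    def __init__(self, analytics_engine, transport):
        self.analytics_engine = analytics_engine  # e.g., AI analytics engine 412
        self.transport = transport                # NIC / edge-network abstraction

    def respond_to_request(self, request): ...    # CS request response module 804
    def receive_local(self, classifications): ... # CS receiving module 806
    def request_reference(self, peer_id): ...     # reference CS requesting module 808
    def receive_reference(self, packet): ...      # reference CS receiving module 810
    def compare(self, local_cv, reference_cv): ...# CS comparison module 812
    def generate_alert(self, replacement): ...    # alert generating module 814
```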


In one or more embodiments, the CIS module 802 is configured to share the classifications and confidence values from the AI analytics engine 412 with edge device 404B. The CS receiving module 806 receives the classifications and confidence values from the AI analytics engine 412 and may provide functions such as buffering and filtering to prepare the data for other modules of the CIS module 802.


In one or more embodiments, the CIS module 802 shares only classifications that are associated with confidence values that are within a prescribed confidence interval. For example, in one or more embodiments, the CIS module 802 may be configured by user settings using the user interface 422 or default settings set by a manufacturer. When the AI analytics engine 412 processes a dataset, the output will typically include a collection of classifications and associated confidence values, such as those shown in Tables 1 and 2. With reference to the example shown in Table 1, the CIS module 802 will share only the classification DOG and its associated confidence value 0.96 rather than the entire list. Embodiments in which the CIS module 802 shares only the classification data for the classification associated with the highest confidence value avoid excessive network traffic since other edge devices may have no need for classifications associated with low confidence values.


In one or more embodiments, the CIS module 802 is configured to only share classification data when the confidence value is within a specified confidence interval. The CS comparison module 812 provides comparison functionality for determining if a confidence value is within the specified confidence interval, for example greater than a specified confidence interval (CI) threshold value. In various embodiments, the CI threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the CI threshold value is adjustable, for example by user settings set by a user via the user interface 422. In the example results shown in Table 2, if the CI threshold value is greater than 0.60, then the CS comparison module 812 will detect that none of the confidence values are greater than the specified CI threshold value, and will prevent the CIS module 802 from sharing any classification data associated with these results. On the other hand, if the CI threshold value is less than 0.60, then the CS comparison module 812 will allow the CIS module 802 to share the classification associated with the highest confidence value (DOG, 0.60).


In one or more embodiments, the reference CS requesting module 808 may request a classification and associated confidence value from the edge device 404B. An edge device may request classification data from other edge devices for various reasons, for example to use as a reference point of comparison for one or more results of its own classification processing. Using the results shown in Table 2 above as an example, the CS comparison module 812 compares the confidence value (0.60) to a confidence threshold value, and if the confidence value is less than the confidence threshold value, then the CS comparison module 812 signals the reference CS requesting module 808 to request classification data from edge device 404B. In various embodiments, the confidence threshold value will be highly implementation-specific, and therefore may be any desired value. Also, in one or more embodiments, the confidence threshold value is adjustable, for example by user settings set via the user interface 422.


In one or more embodiments, the reference CS requesting module 808 transmits the request for the reference confidence value to edge device 404B directly. In one or more embodiments, the reference CS requesting module 808 transmits the request for the reference confidence value to an edge server (e.g., edge server 418 of FIG. 4). In some such embodiments, the reference CS requesting module 808 prepares a request that includes an instruction executable by the edge server to cause the edge server to identify and provide the reference confidence values from edge device 404B.


In one or more embodiments, the reference CS receiving module 810 determines whether to use classification data generated by other edge device 404B based on metadata received with the classification data. In one or more embodiments, the metadata may include identifying information about the source edge device that generated the classification data and/or about software, firmware, and hardware on the source edge device. For example, the metadata may include information about the source edge device such as a node identifier, a network address, a manufacturer or vendor name or other identifier, a software identifier and/or version identifier, a firmware identifier and/or version identifier, and/or a hardware identifier and/or version identifier. In some such embodiments, the reference CS receiving module 810 parses the metadata received with the classification data and extracts information from the metadata. In some such embodiments, the reference CS receiving module 810 compares one or more values extracted from the metadata with stored acceptance values. For example, the reference CS receiving module 810 may have stored acceptance values that include a list of one or more vendors and/or software versions. In this example, the reference CS receiving module 810 will compare the vendor and software version extracted from the metadata to its stored acceptance values in order to determine whether to use the classification data.


In one or more embodiments, if the reference CS receiving module 810 receives classification data that is acceptable, the reference CS receiving module 810 provides the received data to the CS comparison module 812. The CS comparison module 812 calculates a confidence difference between the confidence value generated by the AI analytics engine 412 and the confidence value used as a reference confidence value that was generated by, and received from, the edge device 404B. The confidence difference may be used by the CIS module 802 as an indication of how much more or less certain another edge device was about its classification conclusion. In one or more embodiments, the reference CS receiving module 810 compares the confidence difference to a difference threshold value. In some such embodiments, the CIS module 802, or the edge device hosting the CIS module 802, may use this comparison as a health check. In some such embodiments, if the confidence difference exceeds the difference threshold value, this may indicate a malfunction or other issue requiring attention. In some such embodiments, the reference CS receiving module 810 notifies the alert generating module 814 of the issue. In response, the alert generating module 814 generates, as an output replacement for the first classification dataset, a replacement dataset comprising the reference classification and an indication that the first confidence value is less than the reference confidence value. For example, in one or more embodiments, the alert generating module 814 provides the replacement dataset to the user interface 422.


In one or more embodiments, the request response module 804 may receive a request for classification data, e.g., for the classification and associated confidence value for the highest confidence value (e.g., DOG and 0.96 in the example shown in Table 1). In one or more embodiments, the reference CS receiving module 810 may determine whether to provide this classification based on whether the highest confidence value is within a specified confidence interval as described above. In one or more embodiments, the request response module 804 responds to the request by transmitting the requested classification data to the requesting edge device either directly or via an edge server.


In one or more embodiments, the CIS module 802 shares classification data if the highest confidence value is within the specified confidence interval, without the need to receive a request. In some such embodiments, as the AI analytics engine 412 generates classification data, the reference CS receiving module 810 evaluates each confidence value to determine if the confidence value is within the specified confidence interval. For classification data that meets this criterion, the CIS module 802 broadcasts the classification data to other edge devices, including edge device 404B. On the other hand, in one or more embodiments, the CIS module 802 does not broadcast classification data that includes confidence values that are not within the specified confidence interval.


Referring now to FIG. 9, an operational flowchart illustrating the steps of a method 900 carried out by a program for resource allocation in sensor-based neural networks is depicted. The method 900 will be described with the aid of the exemplary embodiments depicted in FIGS. 1-8.


At 902, the method 900 may include identifying nodes associated with an edge computing environment. The nodes may be a plurality of edge devices that each include a confidence interval sharing (CIS) module. The edge devices may include, among other things, a box camera, temperature sensor, smoke detector, smart phone, smoke alarm, dome camera, and motion sensor. In operation, the cloud server 302 (FIG. 3) may identify edge devices 318-325 (FIG. 3) on the edge network 304 (FIG. 3).


At 904, the method 900 may include receiving data corresponding to a classification dataset from the identified nodes, wherein the dataset includes a reference classification and confidence value data. The data may correspond to an object to be classified by the edge devices. In operation, the cloud server 302 (FIG. 3) may receive classification data from the edge devices 318-325 (FIG. 3) and confidence value data from the confidence interval sharing module within each of the edge devices 318-325.


At 906, the method 900 may include selecting a node from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data. The node may be selected based on historical confidence interval data for the node associated with the reference classification, historical identification data associated with the node, and current confidence value data for the node. The node may be selected through a load-balancing queue and manager based on a determination that a node is better at servicing a given operation than other nodes from among the identified nodes. The node may be selected based on a physical location associated with the node, which may correspond to a relative location of the node in relation to the other nodes from among the identified nodes. The node may also be selected based on a current use of processing resources associated with the other nodes from among the identified nodes. In operation, the cloud server 302 (FIG. 3) may select edge device 318 (FIG. 3) from among the edge devices 318-325 (FIG. 3) based on a determination that edge device 318 is best trained to perform object recognition and has a highest confidence interval associated with the object recognition.
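One way these selection criteria could be combined is sketched below; the node record fields and their ordering are assumptions, expressed as a lexicographic preference rather than a mandated scoring scheme.

```python
# Hypothetical node selection: prefer the greatest confidence interval for
# the reference classification, then history, proximity, and spare capacity.
def select_node(nodes, reference_classification):
    def score(node):
        return (
            node["confidence"].get(reference_classification, 0.0),
            node.get("historical_confidence", 0.0),
            -node.get("distance_to_object", 0.0),   # closer is better
            -node.get("cpu_utilization", 0.0),      # idler is better
        )
    return max(nodes, key=score)

nodes = [
    {"id": 318, "confidence": {"suitcase": 0.92}, "cpu_utilization": 0.4},
    {"id": 319, "confidence": {"suitcase": 0.71}, "cpu_utilization": 0.1},
]
print(select_node(nodes, "suitcase")["id"])  # 318
```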


At 908, the method 900 may include assigning the selected node to process the classification dataset. Additional nodes may also be assigned from among the identified nodes to process the classification dataset. In operation, the cloud server 302 (FIG. 3) may assign the edge device 318 (FIG. 3) to perform processing of data for object recognition. The cloud server 302 may also assign other devices from among the edge devices 319-325 for processing of the data.
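Taken together, steps 902-908 might be driven as in the following end-to-end sketch, where discover_nodes(), fetch_dataset(), and assign() are hypothetical stand-ins for the cloud server's discovery, ingest, and dispatch operations.

```python
# Hypothetical end-to-end driver for method 900; the helper callables are
# assumed stand-ins, not functions defined by the disclosure.
def method_900(discover_nodes, fetch_dataset, assign):
    nodes = discover_nodes()                        # step 902: identify nodes
    dataset = fetch_dataset(nodes)                  # step 904: receive dataset
    reference = dataset["reference_classification"]
    best = max(nodes,                               # step 906: select node
               key=lambda n: n["confidence"].get(reference, 0.0))
    assign(best, dataset)                           # step 908: assign node
    return best
```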


One or more embodiments may relate to a system, a method, and/or a computer readable medium at any possible technical detail level of integration. The computer readable medium may include a computer-readable non-transitory storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out operations.


The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.


Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.


Computer readable program code/instructions for carrying out operations may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In one or more embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects or operations.


These computer readable program instructions may be provided to a processor of a general-purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.


The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.


The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer readable media according to various embodiments. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). The method, computer system, and computer readable medium may include additional blocks, fewer blocks, different blocks, or differently arranged blocks than those depicted in the Figures. In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed concurrently or substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.


It will be apparent that systems and/or methods, described herein, may be implemented in different forms of hardware, firmware, or a combination of hardware and software. The actual specialized control hardware or software code used to implement these systems and/or methods is not limiting of the implementations. Thus, the operation and behavior of the systems and/or methods were described herein without reference to specific software code—it being understood that software and hardware may be designed to implement the systems and/or methods based on the description herein.


No element, act, or instruction used herein should be construed as critical or essential unless explicitly described as such. Also, as used herein, the articles “a” and “an” are intended to include one or more items and may be used interchangeably with “one or more.” Furthermore, as used herein, the term “set” is intended to include one or more items (e.g., related items, unrelated items, a combination of related and unrelated items, etc.), and may be used interchangeably with “one or more.” Where only one item is intended, the term “one” or similar language is used. Also, as used herein, the terms “has,” “have,” “having,” or the like are intended to be open-ended terms. Further, the phrase “based on” is intended to mean “based, at least in part, on” unless explicitly stated otherwise.


The descriptions of the various aspects and embodiments have been presented for purposes of illustration but are not intended to be exhaustive or limited to the embodiments disclosed. Even though combinations of features are recited in the claims and/or disclosed in the specification, these combinations are not intended to limit the disclosure of possible implementations. In fact, many of these features may be combined in ways not specifically recited in the claims and/or disclosed in the specification. Although each dependent claim listed below may directly depend on only one claim, the disclosure of possible implementations includes each dependent claim in combination with every other claim in the claim set. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims
  • 1. A method of resource allocation for sensor-based neural networks, executable by a processor, comprising:
    identifying nodes associated with an edge computing environment;
    receiving data corresponding to a classification dataset from the identified nodes, wherein the dataset includes a reference classification and confidence value data;
    selecting a node from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data; and
    assigning the selected node to process the classification dataset.
  • 2. The method of claim 1, wherein the node is selected based on historical confidence interval data for the node associated with the reference classification, historical identification data associated with the node, and current confidence value data for the node.
  • 3. The method of claim 1, wherein the node is selected through a load-balancing queue and manager based on a determination that a node is better at servicing a given operation than other nodes from among the identified nodes.
  • 4. The method of claim 1, wherein the node is selected based on a physical location associated with the node.
  • 5. The method of claim 4, wherein the physical location corresponds to a relative location of the node in relation to the other nodes from among the identified nodes.
  • 6. The method of claim 1, wherein the node is selected based on a current use of processing resources associated with the other nodes from among the identified nodes.
  • 7. The method of claim 1, further comprising assigning additional nodes from among the identified nodes to process the classification dataset.
  • 8. A computer system for resource allocation for sensor-based neural networks, the computer system comprising:
    one or more computer-readable non-transitory storage media configured to store computer program code; and
    one or more computer processors configured to access said computer program code and operate as instructed by said computer program code, said computer program code including:
    identifying code configured to cause the one or more computer processors to identify nodes associated with an edge computing environment;
    receiving code configured to cause the one or more computer processors to receive data corresponding to a classification dataset from the identified nodes, wherein the dataset includes a reference classification and confidence value data;
    selecting code configured to cause the one or more computer processors to select a node from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data; and
    assigning code configured to cause the one or more computer processors to assign the selected node to process the classification dataset.
  • 9. The computer system of claim 8, wherein the node is selected based on historical confidence interval data for the node associated with the reference classification, historical identification data associated with the node, and current confidence value data for the node.
  • 10. The computer system of claim 8, wherein the node is selected through a load-balancing queue and manager based on a determination that a node is better at servicing a given operation than other nodes from among the identified nodes.
  • 11. The computer system of claim 8, wherein the node is selected based on a physical location associated with the node.
  • 12. The computer system of claim 11, wherein the physical location corresponds to a relative location of the node in relation to the other nodes from among the identified nodes.
  • 13. The computer system of claim 8, wherein the node is selected based on a current use of processing resources associated with the other nodes from among the identified nodes.
  • 14. The computer system of claim 8, further comprising assigning code configured to cause the one or more computer processors to assign additional nodes from among the identified nodes to process the classification dataset.
  • 15. A non-transitory computer readable medium having stored thereon a computer program for resource allocation for sensor-based neural networks, the computer program configured to cause one or more computer processors to:
    identify nodes associated with an edge computing environment;
    receive data corresponding to a classification dataset from the identified nodes, wherein the dataset includes a reference classification and confidence value data;
    select a node from among the identified nodes based on the selected node having a greatest confidence interval associated with the reference classification within the confidence value data; and
    assign the selected node to process the classification dataset.
  • 16. The computer readable medium of claim 15, wherein the node is selected based on historical confidence interval data for the node associated with the reference classification, historical identification data associated with the node, and current confidence value data for the node.
  • 17. The computer readable medium of claim 15, wherein the node is selected through a load-balancing queue and manager based on a determination that a node is better at servicing a given operation than other nodes from among the identified nodes.
  • 18. The computer readable medium of claim 15, wherein the node is selected based on a physical location associated with the node.
  • 19. The computer readable medium of claim 18, wherein the physical location corresponds to a relative location of the node in relation to the other nodes from among the identified nodes.
  • 20. The computer readable medium of claim 15, wherein the node is selected based on a current use of processing resources associated with the other nodes from among the identified nodes.