The decision processes of acting under uncertainty and reasoning about the possibilities of future states is a widely cited challenge that has been researched for many years. When applied to autonomous systems, a prominent class of problems that can be addressed with this decision process can be summed up as whether to act now based on current evidence or to wait for more evidence that may potentially improve the action selection, at the cost of delay.
By way of a practical example, physically situated systems such as robots or embodied conversational agents typically rely on continual sensing to make inferences about the state of their sensed world and to guide their decisions. To identify ideal actions over time, these systems need to evaluate whether to act immediately using current sensory data or wait for more data that may possibly improve state estimates before acting. Consider a conversational agent embodied as a program that operates a display monitor, speakers, microphone and camera mounted outside a person's office. The agent may use a combination of face detection and tracking components to track the trajectory of people in its vicinity based on an analysis of pixels in the video stream. In addition, a face recognition component may be used to identify actors in the scene. At a higher level, the spatial trajectory and identity percepts can be fused to make inferences about the person's goals, and ultimately drive interaction decisions, such as when to initiate or break conversational engagement with people nearby.
The traditional approach to deliberating about the value of collecting additional information in advance of action is to compute the expected value of information (VOI), which is a measure of the difference of the expected value of the best decision before and after information is collected, considering the cost of acquiring the information. This includes the loss in value associated with the delay of action to await for the new information. However, with an autonomous system such as a conversational agent, the nature of the sensory evidence is streaming and high-dimensional (e.g., thousands of pixels regularly received in captured frames). There are challenges with computing VOI in settings with streaming, high-dimensional sensory evidence that make the traditional approaches unsuitable.
This Summary is provided to introduce a selection of representative concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used in any way that would limit the scope of the claimed subject matter.
Briefly, various aspects of the subject matter described herein are directed towards constructing one or more belief projection models from existing evidence, including streaming evidence, to predict a future belief over a state at a future time. The prediction of the future belief is used to determine whether to act or wait for additional evidence.
In one aspect, processing logic is coupled to a sensor set comprising one or more sensors, and is coupled to an output mechanism set comprising one or more output mechanisms. The processing logic is configured to process evidence received via the sensor set, including streaming evidence, into one or more belief projection models, and to construct one or more probability distributions based upon the belief projection models to predict possible future beliefs over a state at a future time using the received evidence. The processing logic uses the predicted future beliefs to determine whether to act via the output mechanism set or wait for additional evidence to be received.
In one aspect there is described receiving sensory evidence, including high-dimensional streaming evidence, and processing the sensory evidence to project future beliefs over states. The predicted future belief is used to make a decision, e.g., to wait for additional evidence to be received, or to select which action to take, without waiting for additional evidence.
Other advantages may become apparent from the following detailed description when taken in conjunction with the drawings.
The present invention is illustrated by way of example and not limited in the accompanying figures in which like reference numerals indicate similar elements and in which:
Various aspects of the technology described herein are generally directed towards using discriminatively trained conditional models to predict future belief states from existing evidence, along with using these models to weigh the tradeoffs between acting immediately, waiting for more sensory evidence to accumulate, and/or orchestrating which sensors are to be activated at a given time. The models may be learned automatically from data via self-supervision, and may be included into hierarchical inferential architectures.
In general, instead of using a generative model that predicts the probability of future evidence, the model described herein predicts what the belief over the state at a future time will be, based upon the existing evidence. In other words, using the current evidence, the prediction is based upon a projected (computed) belief of what the future belief likely will be. Predictions about future beliefs may be used to compute the expected cost of taking an optimal action at that time.
It should be understood that any of the examples herein are non-limiting. As such, the present invention is not limited to any particular embodiments, aspects, concepts, structures, functionalities or examples described herein. Rather, any of the embodiments, aspects, concepts, structures, functionalities or examples described herein are non-limiting, and the present invention may be used various ways that provide benefits and advantages in computing and computerized decision making in general.
The exemplified assistant 102 uses multiple sensors 116, such as a wide-angle camera, microphone array (e.g., based upon Kinect™ technology), and RFID badge reader, to make real-time inferences about people in its proximity, including their identities, activities and goals. This is represented by the image data 118 (e.g., frames of captured video), audio data 119 and other data 120.
The assistant 102 may be a domain expert in the presence and availability of its owner. For example, the assistant 102 may have access to the computer activity of its owner, Wi-Fi fingerprints of devices on the network being used by the owner, and calendar data, as represented via activity data 123 and calendar data 124. The underlying system may continuously (or frequently/occasionally/on demand) makes probabilistic forecasts about arrival times and likely availabilities of the owner.
By constructing one or more conditional models (collectively labeled 126 in
In general, engagement is a process by which participants in a conversation coordinate their actions to initiate, maintain and terminate their interactions. For autonomous systems, a conservative engagement policy is to wait for users to initiate engagement with the system by entering in a user-initiated “f-formation” with the system, in which the user approached and stood closely in front of a camera sensor. This policy was designed to minimize false positives, i.e., cases where the system would initiate engagement with someone who was just walking by or who was standing nearby but talking to someone in another office, and also was straightforward to detect.
However, this prior engagement policy did not account for those people who do not initiate engagement, including people who are waiting for the owner to return to his or her office or to become available (that is, when the owner is already in the office, but busy). Indeed, in actual situations, instead of seeking engagement, many times a person tends to bypass the autonomous system and sit in a nearby chair, or talk to others while waiting. In these situations, the conservative user-initiated engagement policy missed important opportunities to engage a waiting person in dialog on behalf of the owner. The cost of these missed opportunities can be high. As one example, the system may know that the owner is running late for a scheduled meeting, but because the visitor does not initiate engagement, the system using the conservative engagement policy does not let the visitor know before he or she leaves in frustration.
As described below, rather than relying exclusively on user-initiated engagement, a system described herein (implemented as the assistant 102) is configured to proactively initiate engagement with someone in the scene, even at a distance, if the system knows (e.g., to a threshold confidence) that the person is looking for the owner and that the system can provide helpful information. The mixed-initiative engagement policy hinges on inferences about the engagement action and the goal of the person. As described herein, the proactive engagement policy balances the costs of engaging people that are not looking for the owner, with the costs of missed opportunities to engage visitors before they leave.
In one aspect, the quality of sensory evidence collected and the inferences made from this evidence often may be improved at the cost of additional time delay for sensing and computation. In general, accumulating additional sensory evidence over time can lead to more accurate inferences (e.g., for face identification, intention recognition, and so forth). Also, more powerful sensors can be turned on, and/or more sophisticated algorithms for audiovisual scene analysis may be run, e.g., at sub-real-time speeds. In addition, systems may be able to solicit and obtain external assistance in quasi-real time. For instance, recognizing faces far from the camera may be difficult, but a system may be able to query people drawn from a knowledgeable crowd for assistance with identifying the person. In this case, the crowd acts in effect as a time-delayed sensor; however the additional evidence may arrive with various delays. When sensors and inferences are characterized by different levels of accuracy and by stochastic time-delays, tradeoffs arise between acting immediately and waiting for more information to accumulate prior to acting.
Thus, time plays a role in the evidence collecting and decision making processes, including that some perceptual inferences may become more accurate over time, as the system gathers additional sensory evidence. In the above example, face identification can become more accurate over time and as the person moves closer to the camera. In addition, the assistant has the ability to seek external assistance in real time, e.g., the system can take and send a snapshot of the scene to human volunteers and/or employees (such as receptionists) in real-time and ask them to help identify the person in the scene, with responses to such information-gathering actions arriving with a stochastic delay. Note that the crowd need not be completely knowledgeable with respect to a person's identity, e.g., a crowd may be asked a series of questions, such as whether the person appears to be a male or female, whether the person appears to be above a certain height, an approximate age of the person and so on to help the system narrow in on the correct identity of the person/user.
Furthermore, the unknown person might leave after a while, before the system has had a chance to reliably identify him or her. The methods described herein enable the assistant 102 to reason about the current evidence and the value of additional sensory evidence that will likely be accumulated in the near future, and to resolve tradeoffs between different courses of action, in this case engaging, not engaging, or waiting for additional evidence to be accumulated (possibly including seeking expert assistance).
Thus, one aspect is directed towards the tradeoff between acting immediately based on existing evidence versus collecting additional evidence prior to acting. In a decision theoretic setting, this tradeoff may be resolved by computing the value of information (VOI). Let p(s|E) be a model that infers the world state s based on existing evidence E, and C(s, a) be a cost function defined on world states and the actions aεA that the system may take. The value of information computation determines the difference between the expected value of taking an information gathering action ainfo which reveals additional evidence e and selecting the best domain action a and terminating the decision process.
and the expected value of acting immediately, based on the existing evidence E:
This approach can be extended in a straightforward manner to reason about sequences of information gathering actions. However, VOI can be intractable to compute for problems with large state spaces or high-dimensional sensory evidence (as in
Notwithstanding, using technology described herein, the value of information (VOI) approach for guiding the decision of whether to wait versus act is applicable in settings with high-dimensional, streaming sensory evidence. The waiting action can be viewed as an information gathering action, that is, additional sensory evidence e is collected while the system is waiting. Let ψt denote the sensory evidence observed by the system up to the current time-point t, i.e., E=ψt. The new evidence e that will be revealed by waiting until some future time t+k is E=ψt+k, where in one implementation ψt+k comprises a sequence of high-dimensional sensory evidence vectors that are collected from time 1 to time t+k, ψt+k={ψi}i=1:t+k. If for generality it is assumed that state s changes over time, and st+k denotes the state at time t+k, the expected value of waiting, computed based upon equation (1) becomes:
A direct computation of the expected value of information (or waiting), requires a model for p(ψt+k|ψt), that is, p(ψ1, ψ2, . . . , ψt+k|ψt). Building this type of generative model for future sensory evidence is in most cases intractable due to the streaming and high-dimensional nature of the sensory evidence ψi. Alternative formulations often used in Bayes Nets that rely on a factorization of p(ψt+k|ψt) based on p(ψt+k|st)·p(st|ψt) encounter similar tractability challenges.
A model for generating p(ψt+k|ψt) is described herein to estimate the future state st+k, with the sensory evidence ψt+k needed to estimate st+k, via p(st+k|ψt+k). Because learning a generative model for future sensory evidence p(ψt+k|ψt) is intractable, described herein is a reformulation of the expected value of information computation that (unlike the traditional approach) relies on a direct prediction of what the results of the sensory inference bt+k(st+k)=p(st+k|ψt+k) will be at future times t+k, conditioned on the current evidence at time t:
p(bt+k|ψt)=p(p(st+k|ψt+k)|ψt)
Note that p(bt+k|ψt) is referred to herein as a belief projection model. This model may be used in the expected value of waiting computation as follows:
Thus, instead of using a generative model p(ψt+k|ψt) that predicts the probability of future evidence, a model is used that directly predicts what the belief over the state st+k at time t+k will be, bt+k(st+k)=p(st+k|ψt+k) conditioned on the existing evidence ψt. This predicted future belief may be used to compute the expected cost of taking the optimal action at that time, e.g., maxaΣs
The belief projection model p(bt+k|ψt) can be trained in a supervised fashion based on a corpus of labeled data. For each training data point (ψt,bt+k) the features ψt describe the sensory evidence collected up to time t. The corresponding label bt+k comprises the output of the state inference models at some future time t+k, i.e., p(st+k|ψt+k); the training label is a belief over the state st+k. Training data can be collected by running a system with a given inference model p(sl|ψl); and recording the input features and the belief bl over sl produced by this model at each time point 1.
A belief projection model may be learned automatically from data via parametric machine learning approaches, e.g., like fitted mixtures of Beta or Dirichlet distributions. A belief projection model may be learned automatically from data via non-parametric machine learning approaches, e.g., like decision trees. A belief projection model may be manually constructed via a set of heuristic rules, e.g., by a domain expert.
The belief projection model computes a belief over the belief of the state st+k, given the current evidence. The training labels therefore comprise bt+k beliefs over the state st+k. For instance, if the state is binary, i.e., st+kε{0,1}, the belief over st+k is defined over the unit simplex, i.e., bt+kεΔ1, which is the [0, 1] real interval. In this case the belief projection model constructs a probability distribution over this simplex, or over the [0, 1] interval. An approach to the belief projection model is to employ a mixture of Beta distributions and learn the model parameters in a maximum likelihood manner. An alternative is to discretize the [0, 1] interval into several bins, treat the problem as multinomial classification, and build a model via discriminative learning techniques such as maximum entropy models or decision trees. The complexity of the learning problem increases as the size of the original state space increases. For instance, if instead of binary, the state is a multinomial variable with m possible values, i.e., st+kε{0, 1, . . . m−1}, the belief of st+k is defined over the unit m−1 simplex, i.e., bt+kεΔm-1. The belief projection model may be constructed in this case as a mixture of Dirichlet distributions, and model parameters may be learned in a maximum likelihood manner. Approaches based on discretizing Δm-1 into bins, e.g., based on memory-based learning and sampling, also may be employed
Note that the described approach sums over all possible beliefs bt+k(st+k). In practice, a tractable solution for computing this sum (integral) may be used. One approach that works well when the underlying state space is small is to discretize the belief space (the simplex) into a number of bins, and sum over the corresponding probabilities. Another alternative is to construct belief projection models with parametric forms that allow for analytic integration. Sampling methods may be used to sample the beliefs and approximate the integral according to the belief projection model p(bt+k|ψt).
In practice, many physically situated systems are comprised via a coupling of multiple, modular inference components into more complex architectures. A hierarchical structure is often harnessed for state inference. For instance, lower level inference components such as speech recognition, face tracking, and face identification may abstract the high-dimensional streaming sensory evidence such as raw audio and video data into fewer lower-dimensional percepts, such as words spoken, the location and identity of a person, and so forth. The outputs of these perceptual models are then used as inputs for making higher-level inferences about goals, activities, and other relevant state variables, which ultimately drive interaction decisions. In engineering such integrative systems, the lower-level, perceptual models may be off-the-shelf components that are trained and optimized individually, prior to integration in a given application. These models tend to be more domain independent than the higher-level state inference models, which are often trained for a specific domain.
One approach described herein for computing VOI can be extended to such modular inference architectures. Let R denote a set of lower-level perceptual inference models and {right arrow over (σ)}t=σtr denote the n-tuple of percepts from each inference model rεR (
where p({right arrow over (σ)}t|ψt)=Πrp(σtr|ψt). The higher level state inference model conditioned on percepts p(st|σtr) is assumed known. In this hierarchical structure, the expected value of waiting may be computed as follows:
where the perceptual inference models are bt+k({right arrow over (σt+k)})=p({right arrow over (σt+k)}|ψt+k). Note that this formulation predicts future beliefs over the lower-level percepts {right arrow over (σt+k)}, i.e., perceptual inference projection models are constructed conditioned on the current evidence p(p(σt+kn|ψt+k)|ψt). These perceptual inference projection models can be trained independently from each other by recording the outputs of the perceptual inference models p(σt+kn|ψt+k) over time under the assumption that the action await has no effect on the environment and evidence.
Returning to the example of
State inference may be based on a hierarchical graphical model such as represented in
As described above, the belief projection models make predictions about future beliefs at the perceptual level. The three perceptual inference models described above construct beliefs (i.e., probability distributions) over the corresponding binary percepts. The domain for the output of each perceptual model is the 1-dimensional simplex, i.e., the interval [0, 1] in this example. The belief projection models in turn model a belief (or probability distribution) over this domain. The belief projection models may be constructed in this case heuristically based on mixtures of Beta distributions, and/or they may be learned from data.
The action space for the mixed-initiative engagement policy includes two task actions: Engage, in which the Assistant engages the user immediately, and DontEngage, in which the Assistant decides to not engage the user at the current time-step. Utilities for the various combinations of state and action may be obtained from the assistant's owner; examples are shown in the Table below:
The cost for taking a wait action may be elicited or estimated based on the current state (e.g., 0.05 in this example).
In addition, actions may be included to collect additional information: Wait(t) to collect additional sensory information and AskAndWait(t) to ask an external source and also collect sensory information while waiting for the response, where t ranges from 1 to 100 seconds, for example.
With the Wait(t) action the assistant 120 waits for a duration t, then takes the optimal action between Engage or DontEngage. The expected utility computation in this case takes into account (via the perceptual belief projection models) the likely impact of the sensory evidence to be accumulated by time t. In addition, the computation also takes into account the likelihood that the person might leave the scene. This probability is modeled based on the time since the actor was detected, e.g., via a mixture of two linear hazard rate distributions: the first component has a mean of around five seconds and models people that simply pass through the corridor, and the second component has a mean of around three-hundred seconds and models people that stay for a while in an area near the assistant 102.
With the AskAndWait(t) action, the assistant 102 launches an external query about the user's identity, waits for a duration t, then takes the optimal action between Engage and DontEngage based on the accumulated information. As with Wait(t), the computation takes into account the impact of future sensory evidence and the fact that the actor might leave by time t. In addition, in this case, the expected utility computation takes into account the probability that the response will arrive at some future time. The latter is modeled via a log-normal distribution with a mean time of some number of (e.g., forty) seconds.
At every time step, the assistant re-runs the decision algorithm and chooses the action with maximum expected utility, under the current uncertainty from sensor data. By taking this re-planning approach, the assistant 102 may choose a particular action such as Wait(10) at a certain time, and at the next time step the action selected may change (e.g., to something like Engage or Wait(50)) based on the accumulated evidence. Additionally, note that the actions are myopic with a short time horizon, and that the ability to re-plan with additional information is likely to improve the action decisions.
Consider an example when a person (a possible visitor) approaches the office where the assistant 102 is stationed, passes by the assistant 102 and sits down in a nearby chair. The width of the detected face, which correlates with the distance between the person and the assistant 102, is determined, as represented in
At this point the utility of launching an external information-gathering action may exceed the utility of waiting (
The computations performed at different time steps include when the visitor is detected at and the assistant 102 starts using the decision theoretic engagement computations at time t1, once the face identification algorithm provides a first estimate for the identity of the visitor. Between times t1 and t2, as the visitor is getting closer, the probability of f-formation and approaching are increasing; the assistant 102 is uncertain about whether this visitor is on-calendar (
As also shown in
The projected beliefs for the On-calendar percept (
The projected beliefs for the Activity percept computed at time are shown in
As
Next, in this example, the visitor passes by the assistant 102 and sits in a nearby chair. In
A few seconds later in this example, at time t4, the answer arrives, namely that the visitor is indeed the person the owner is expecting, whereby the corresponding probability for on-calendar increases to 1.0 (see
Based upon the future belief, step 808 represents a determination as to whether to act now or wait for more evidence. If the decision is to wait, step 808 branches to step 810.
Step 810 represents determining whether to use/activate one or more sensors; (one or more other sensors may be turned off, e.g., if their information is no longer needed or relevant, or cannot change over time). If so, step 812 represents activating the one or more sensors. Note that as used herein, asking a crowd/expert for assistance is considered activating another sensor at step 812.
Step 814 represents taking the action, which may be to do something active, or end the process. For example, in the assistant scenario described above, the action may be a decision to engage, in which audio and/or visible data (and possibly other data such as haptic feedback) is output to the user. Conversely, the decision may be to not engage, in which event the process may end until triggered again, or, for example adjust to give attention to a different user who is approaching.
As can be seen, described herein is a technology that addresses various challenges of computing the value of information in systems that operate with high-dimensional streaming sensory evidence. The technology is based upon developing belief projection models, comprising direct conditional models that can be trained from data to predict future beliefs from existing evidence. The technology may leverage such models to resolve tradeoffs between acting immediately versus waiting for more sensory evidence to accumulate. The technology is conducive for computing value of information in systems that use modular, hierarchical architectures for making state inferences.
The technology may be implemented in a deployed physically situated interactive agent with a mixed-initiative engagement policy. The system is able to resolve tradeoffs between waiting for more information to accumulate from the face identification sensor, soliciting help in real time from a local group of experts to identify a person, and/or acting immediately (proactively engaging the person) based on the existing face identification data.
As mentioned, advantageously, the techniques described herein can be applied to any device. It can be understood, therefore, that handheld, portable and other computing devices and computing objects of all kinds are contemplated for use in connection with the various embodiments. Accordingly, the below general purpose remote computer described below in
Embodiments can partly be implemented via an operating system, for use by a developer of services for a device or object, and/or included within application software that operates to perform one or more functional aspects of the various embodiments described herein. Software may be described in the general context of computer executable instructions, such as program modules, being executed by one or more computers, such as client workstations, servers or other devices. Those skilled in the art will appreciate that computer systems have a variety of configurations and protocols that can be used to communicate data, and thus, no particular configuration or protocol is considered limiting.
With reference to
Computer 510 typically includes a variety of computer readable media and can be any available media that can be accessed by computer 510. The system memory 530 may include computer storage media in the form of volatile and/or nonvolatile memory such as read only memory (ROM) and/or random access memory (RAM). By way of example, and not limitation, system memory 530 may also include an operating system, application programs, other program modules, and program data.
A user can enter commands and information into the computer 510 through input devices 540. A monitor or other type of display device is also connected to the system bus 522 via an interface, such as output interface 550. In addition to a monitor, computers can also include other peripheral output devices such as speakers and a printer, which may be connected through output interface 550.
The computer 510 may operate in a networked or distributed environment using logical connections to one or more other remote computers, such as remote computer 570. The remote computer 570 may be a personal computer, a server, a router, a network PC, a peer device or other common network node, or any other remote media consumption or transmission device, and may include any or all of the elements described above relative to the computer 510. The logical connections depicted in
As mentioned above, while example embodiments have been described in connection with various computing devices and network architectures, the underlying concepts may be applied to any network system and any computing device or system in which it is desirable to improve efficiency of resource usage.
Also, there are multiple ways to implement the same or similar functionality, e.g., an appropriate API, tool kit, driver code, operating system, control, standalone or downloadable software object, etc. which enables applications and services to take advantage of the techniques provided herein. Thus, embodiments herein are contemplated from the standpoint of an API (or other software object), as well as from a software or hardware object that implements one or more embodiments as described herein. Thus, various embodiments described herein can have aspects that are wholly in hardware, partly in hardware and partly in software, as well as in software.
The word “exemplary” is used herein to mean serving as an example, instance, or illustration. For the avoidance of doubt, the subject matter disclosed herein is not limited by such examples. In addition, any aspect or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects or designs, nor is it meant to preclude equivalent exemplary structures and techniques known to those of ordinary skill in the art. Furthermore, to the extent that the terms “includes,” “has,” “contains,” and other similar words are used, for the avoidance of doubt, such terms are intended to be inclusive in a manner similar to the term “comprising” as an open transition word without precluding any additional or other elements when employed in a claim.
As mentioned, the various techniques described herein may be implemented in connection with hardware or software or, where appropriate, with a combination of both. As used herein, the terms “component,” “module,” “system” and the like are likewise intended to refer to a computer-related entity, either hardware, a combination of hardware and software, software, or software in execution. For example, a component may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on computer and the computer can be a component. One or more components may reside within a process and/or thread of execution and a component may be localized on one computer and/or distributed between two or more computers.
The aforementioned systems have been described with respect to interaction between several components. It can be appreciated that such systems and components can include those components or specified sub-components, some of the specified components or sub-components, and/or additional components, and according to various permutations and combinations of the foregoing. Sub-components can also be implemented as components communicatively coupled to other components rather than included within parent components (hierarchical). Additionally, it can be noted that one or more components may be combined into a single component providing aggregate functionality or divided into several separate sub-components, and that any one or more middle layers, such as a management layer, may be provided to communicatively couple to such sub-components in order to provide integrated functionality. Any components described herein may also interact with one or more other components not specifically described herein but generally known by those of skill in the art.
In view of the example systems described herein, methodologies that may be implemented in accordance with the described subject matter can also be appreciated with reference to the flowcharts of the various figures. While for purposes of simplicity of explanation, the methodologies are shown and described as a series of blocks, it is to be understood and appreciated that the various embodiments are not limited by the order of the blocks, as some blocks may occur in different orders and/or concurrently with other blocks from what is depicted and described herein. Where non-sequential, or branched, flow is illustrated via flowchart, it can be appreciated that various other branches, flow paths, and orders of the blocks, may be implemented which achieve the same or a similar result. Moreover, some illustrated blocks are optional in implementing the methodologies described hereinafter.
While the invention is susceptible to various modifications and alternative constructions, certain illustrated embodiments thereof are shown in the drawings and have been described above in detail. It should be understood, however, that there is no intention to limit the invention to the specific forms disclosed, but on the contrary, the intention is to cover all modifications, alternative constructions, and equivalents falling within the spirit and scope of the invention.
In addition to the various embodiments described herein, it is to be understood that other similar embodiments can be used or modifications and additions can be made to the described embodiment(s) for performing the same or equivalent function of the corresponding embodiment(s) without deviating therefrom. Still further, multiple processing chips or multiple devices can share the performance of one or more functions described herein, and similarly, storage can be effected across a plurality of devices. Accordingly, the invention is not to be limited to any single embodiment, but rather is to be construed in breadth, spirit and scope in accordance with the appended claims.