The present disclosure relates generally to information handling systems, and more particularly to determining and reporting the health of components in information handling systems.
As the value and use of information continues to increase, individuals and businesses seek additional ways to process and store information. One option available to users is information handling systems. An information handling system generally processes, compiles, stores, and/or communicates information or data for business, personal, or other purposes thereby allowing users to take advantage of the value of the information. Because technology and information handling needs and requirements vary between different users or applications, information handling systems may also vary regarding what information is handled, how the information is handled, how much information is processed, stored, or communicated, and how quickly and efficiently the information may be processed, stored, or communicated. The variations in information handling systems allow for information handling systems to be general or configured for a specific user or specific use such as financial transaction processing, airline reservations, enterprise data storage, or global communications. In addition, information handling systems may include a variety of hardware and software components that may be configured to process, store, and communicate information and may include one or more computer systems, data storage systems, and networking systems.
Information handling systems such as, for example, server devices, desktop computing devices, laptop/notebook computing devices, tablet computing devices, mobile phones, and/or other computing devices known in the art, often have their health monitored and reported in order to ensure issues associated with the computing device are addressed and unavailability of the computing device is minimized. For example, conventional computing device health determination and reporting systems utilize a Central Processing Unit (CPU) in the computing device to collect telemetry data from computing device components in the computing device, and then provide that telemetry data via a network to a health report generation system that uses that telemetry data to generate a health report for the computing device. The health report generation system may then transmit that health report via the network to the computing device for display to a user. However, such conventional computing device health determination and reporting systems suffer from several issues, particularly when utilized in computing devices provided in “edge” environments.
For example, the network-connected and centralized health report generation described above requires relatively large amounts of telemetry data to be collected and transmitted via the network between the computing device and the health report generation system, stored, and analyzed. When the analysis of the telemetry data is performed using models trained on individual datapoints provided by the telemetry data, those models can be susceptible to anomalies in the telemetry data (e.g., anomalies caused by electromagnetic noise, transducer malfunctions, etc.) that can introduce missing datapoints that require imputation before the telemetry data may be provided to the model, that may result in the model producing false health report alerts, and/or that may suffer from other telemetry-data-anomaly-related issues known in the art. Furthermore, commercially available toolkits that enable the models discussed above are often optimized for particular processing systems and do not scale easily or efficiently across different computing device manufacturers, and are often designed to provide models across a wide variety of platforms via their inclusion of reverse mapping capabilities and supporting infrastructure that ensures wide-scale usability, scalability, and flexibility at the expense of storage space (which may be limited in the computing devices provided in the “edge” environments discussed above).
As will be appreciated by one of skill in the art in possession of the present disclosure, the desire to provide health determination and reporting for computing devices continues to grow, and as the centralized health report generation discussed above is performed for an increasing number of computing devices, the health report generation infrastructure must be expanded, which increases costs associated with health determination and reporting. Furthermore, the telemetry data collection and transmission discussed above can consume a relatively large amount of the processing resources in the computing device, and can introduce latency into the health determination and reporting process that can be exacerbated when network congestion, network outages, and/or other network issues introduce delays that can render the health determination and reporting operations ineffective. Solutions to such issues include reducing the telemetry data provisioning frequency for any particular computing devices, and/or reducing the amount of telemetry data provided via the network to the health report generation system by transmitting subsets of the telemetry data (e.g., a snapshot of the telemetry data) based on predetermine rules.
However, solutions like the reduced telemetry data provisioning frequency or telemetry data snapshots discussed above may provide the health report generation system with a telemetry data subset that represents a limited or momentary state of the computing device, as such limited subsets of the telemetry data may not capture or represent a holistic view of the operational state of the computing device. As will be appreciated by one of skill in the art in possession of the present disclosure, relatively robust computing device health determinations require comprehensive information about different computing device operational states and computing device operational state transitions, and reduced telemetry data provisioning frequency techniques and/or telemetry data subset provisioning techniques like the snapshots described above often result in a pseudorandomized data collection process that can lead to relatively significant telemetry data loss and inaccurate health reports for computing devices.
Accordingly, it would be desirable to provide a health determination and reporting system that addresses the issues discussed above.
According to one embodiment, an Information Handling System (IHS) includes a component chassis; a processing system that is included in the component chassis; and a memory system that is included in the component chassis, that is coupled to the processing system, and that includes instructions that, when executed by the processing system, cause the processing system to provide a health report generation engine that is configured to: receive telemetry data from each of at least one component subsystem included in the component chassis; perform data compaction operations on the telemetry data to generate compacted telemetry data; perform inference operations on the compacted telemetry data to generate a health report for the IHS; and provide the health report to a computing device.
For purposes of this disclosure, an information handling system may include any instrumentality or aggregate of instrumentalities operable to compute, calculate, determine, classify, process, transmit, receive, retrieve, originate, switch, store, display, communicate, manifest, detect, record, reproduce, handle, or utilize any form of information, intelligence, or data for business, scientific, control, or other purposes. For example, an information handling system may be a personal computer (e.g., desktop or laptop), tablet computer, mobile device (e.g., personal digital assistant (PDA) or smart phone), server (e.g., blade server or rack server), a network storage device, or any other suitable device and may vary in size, shape, performance, functionality, and price. The information handling system may include random access memory (RAM), one or more processing resources such as a central processing unit (CPU) or hardware or software control logic, ROM, and/or other types of nonvolatile memory. Additional components of the information handling system may include one or more disk drives, one or more network ports for communicating with external devices as well as various input and output (I/O) devices, such as a keyboard, a mouse, touchscreen and/or a video display. The information handling system may also include one or more buses operable to transmit communications between the various hardware components.
In one embodiment, IHS 100,
Referring now to
In the illustrated embodiment, the computing devices 202a-202c are coupled to a network 204 that may be provided by a Local Area Network (LAN), the Internet, combinations thereof, and/or any other network that would be apparent to one of skill in the art in possession of the present disclosure. As illustrated, a support system 206 may be coupled to the computing devices 202a-202c via the network 204. In an embodiment, the support system 206 may be provided by the IHS 100 discussed above with reference to
Referring now to
In the illustrated embodiment, the computing device 300 includes a chassis 302 that houses the components of the computing device 300, only some of which are illustrated and discussed below. For example, the chassis 302 may house a processing system (not illustrated, but which may include the processor 102 discussed above with reference to
In the specific example provided in
The chassis 302 may also house a plurality of computing device components 306a, 306b, and up to 306c, each of which may be coupled to the computing device engine 304 (e.g., via a coupling between the computing device components 306a-306c and the processing system), and each of which may be provided by any of a variety of computing device components that would be apparent to one of skill in the art in possession of the present disclosure. For example, many of the embodiments discussed below describe the computing device components 306a-306c as storage devices such as Non-Volatile Memory express (NVMe) Solid State Drive (SSD) storage devices that have been provided with a storage device compute subsystem that is configured to perform storage device compute operations that are separate from the host compute operations that may be performed by the computing device engine 304.
However, while described as storage devices in many of the specific examples provided below, one of skill in the art in possession of the present disclosure will appreciate how the computing device components 306a-306c may be provided by any other computing device component (e.g., memory subsystems, networking subsystems, fan subsystems, power subsystems, Graphics Processing Unit (GPU) subsystems, etc.) that has been configured to perform component compute operations that are separate from the host compute operations and that provide for the component health determination and reporting operations described below. Furthermore, while the computing device components that perform the component health determination and reporting operations described below are illustrated and discussed as being included in a computing device, one of skill in the art in possession of the present disclosure will appreciate that the computing device components of the present disclosure may be provided outside a computing device (e.g., as standalone components such as IoT devices, external storage devices, and/or other components known in the art) while remaining within the scope of the present disclosure as well.
The chassis 302 may also house a communication system 308 that is coupled to the computing device engine 304 (e.g., via a coupling between the communication system 308 and the processing system) and that may be provided by a Network Interface Controller (NIC), wireless communication systems (e.g., BLUETOOTH®, Near Field Communication (NFC) components, WiFi components, etc.), and/or any other communication components that would be apparent to one of skill in the art in possession of the present disclosure. However, while a specific computing device 300 has been illustrated and described, one of skill in the art in possession of the present disclosure will recognize that computing devices (or other devices operating according to the teachings of the present disclosure in a manner similar to that described below for the computing device 300) may include a variety of components and/or component configurations for providing conventional computing device functionality, as well as the functionality discussed below, while remaining within the scope of the present disclosure as well.
Referring now to
In the illustrated embodiment, the computing device component 400 includes a chassis 402 that houses the sub-components of the computing device component 400, only some of which are illustrated and discussed below. For example, the chassis 402 may house a component processing system (not illustrated, but which may be similar to the processor 102 discussed above with reference to
Continuing with the specific example provided above, the computing device component 400 may be provided by a storage device such as an NVMe SSD storage device that is provided with a storage device processing system and a storage device memory system that is coupled to the storage device processing system and that includes instructions that, when executed by the storage device processing system, cause the storage device processing system to perform storage device compute operations that are separate from host device compute operations provided by a computing device in which it is located, and that are utilized to provide the health report generation engine 404 described below. However, as also discussed below, the computing device component 400 may instead be provided by other computing device components that have been configured to perform component compute operations that are separate from host device compute operations provided by a computing device in which it is located, and that are utilized to provide the health report generation engine 404 described below, while remaining within the scope of the present disclosure as well. Furthermore, as discussed above, non-illustrated embodiments of the present disclosure may provide the computing device component 400 and its functionality described below without the computing device described herein.
The chassis 402 may also house a plurality of computing device component subsystems 406a, 406b, and up to 406c, each of which may be coupled to the health report generation engine 404 (e.g., via a coupling between the computing device component subsystems 406a-406c and the component processing system). In an embodiment, each of the computing device component subsystems 406a-406c may be provided by any of a variety of computing device hardware and/or software that one of skill in the art in possession of the present disclosure will appreciate may be included in a computing device component. Continuing with the examples discussed above that describe the computing device component 400 as a storage device such as an NVMe SSD storage device, the computing device component subsystems 406a-406c may be provided by storage device subsystems that may include a NAND storage subsystem (e.g., an array of NAND storage elements), storage device sensors (e.g., temperature sensors, bandwidth sensors, and/or other sensors that would be apparent to one of skill in the art in possession of the present disclosure), storage device firmware (e.g., an SSD controller), storage device applications (e.g., garbage collection applications, wear leveling applications, error identification and/or correction applications, and/or other storage device applications that would be apparent to one of skill in the art in possession of the present disclosure), and/or any other storage device component subsystems that one of skill in the art in possession of the present disclosure would recognize as capable of generating and providing the telemetry data discussed below. However, while a storage device with particular storage device subsystems has been described, one of skill in the art in possession of the present disclosure will appreciate how different computing device components may include different computing device component subsystems that will fall within the scope of the present disclosure as well.
The chassis 402 may also house a communication system 408 that is coupled to the health report generation engine 404 (e.g., via a coupling between the communication system 408 and the component processing system) and that may be provided by any of a variety of communication sub-components that would be apparent to one of skill in the art in possession of the present disclosure. However, while a specific computing device component 400 has been illustrated and described, one of skill in the art in possession of the present disclosure will recognize that computing device components (or other components operating according to the teachings of the present disclosure in a manner similar to that described below for the computing device component 400) may include a variety of sub-component and/or sub-component configurations for providing conventional computing device component functionality, as well as the functionality discussed below, while remaining within the scope of the present disclosure as well.
Referring now to
As also described above, the health report generation engine 500 may be provided by a component memory system that is included in the computing device component 400, and that component memory system may be provided as part of the IC discussed above and is illustrated in
Furthermore, as also illustrated in
As also illustrated in
Referring now to
Furthermore, the data compaction module 602 may be coupled to the computing device component subsystems 406a-406c via a communications interface 608 that may be provided by, for example, an Application Programming Interface (API) integration (e.g., with the API of a telemetry bus coupled to the computing device component subsystems 406a-406c) and/or other communications interface components that would be apparent to one of skill in the art in possession of the present disclosure. The data compaction module 602 may also be coupled to the inference module 604 and the communications module 606 via communications interfaces 610 and 612, respectively, that may be provided by, for example, respective API integrations and/or other communications interface components that would be apparent to one of skill in the art in possession of the present disclosure.
The inference module 604 may also be coupled to the communications module 606 via a communications interface 614 that may be provided by, for example, an API integration and/or other communications interface components that would be apparent to one of skill in the art in possession of the present disclosure. The communications module 606 may be coupled to the application memory space 506b via a communications interface 616 that may be provided by, for example, an API integration and/or other communications interface components that would be apparent to one of skill in the art in possession of the present disclosure. The communications module 606 may also be coupled to the operating system 504a via a communications interface 618 that may be provided by, for example, an API integration and/or other communications interface components that would be apparent to one of skill in the art in possession of the present disclosure. In a specific example, the data compaction module 602, the inference module 604, the communication module 606, and the communications interfaces 608, 610, 612, 614, 616, and 618 may be provided in the compressed, read-only ROM/RAM file system image discussed above, and may only be updatable via an operating system update for the IC that provides the health report generation engine 404/500 in order to, for example, ensure relatively robust and fast boot/initialization operation times. However, while a specific application has been illustrated and described, one of skill in the art in possession of the present disclosure will appreciate how the application of the present disclosure and its associated functionality described herein may be provided in other manners while remaining within the scope of the present disclosure as well.
Referring now to
In some embodiments of the present disclosure, the support system 206 discussed above with reference to
As will be appreciated by one of skill in the art in possession of the present disclosure, inference operation parameters may be generated by the support system 206 and provided to the computing device components for use in generating the health reports via the inference operations discussed above, the compacted telemetry data and health reports generated by those computing device components may then be provided to the support system 206 for use in the deep learning training operations on the inference model, and this process may iterate. Thus, as the computing device components operate over time and experience health issues, the deep learning training operations on the inference model discussed above may identify relationship between compacted telemetry data, health reports, and health issues actually experienced by computing device components, with updated inference operation parameters determined based on those identified relationships and provided to the computing device components for use in subsequent inference operations. One of skill in the art in possession of the present disclosure will appreciate how this repeating of this process will increase accuracy of the health reports generated by the inference module 604 over time. As such, one of skill in the art in possession of the present disclosure will appreciate that the method 700 described below may be performed any time during the deep leaning training operations on the inference model described above.
The method 700 begins at block 702 where a health report generation subsystem in a component receives telemetry data from one or more component subsystems in the component. With reference to
Continuing with the embodiment discussed above in which the computing device component 400 is a storage device, the telemetry data received at block 702 may include telemetry data generated and transmitted by a NAND storage subsystem (e.g., an array of NAND storage elements), storage device sensors (e.g., temperature sensors, bandwidth sensors, and/or other sensors that would be apparent to one of skill in the art in possession of the present disclosure), storage device firmware (e.g., an SSD controller), storage device applications (e.g., garbage collection applications, wear leveling applications, error identification and/or correction applications, and/or other storage device applications that would be apparent to one of skill in the art in possession of the present disclosure), and/or any other storage device component subsystems that one of skill in the art in possession of the present disclosure would recognize as generating and providing telemetry data. However, while storage-device-specific telemetry data is described in detail herein, one of skill in the art in possession of the present disclosure will appreciate how other computing devices may provide other telemetry data while remaining within the scope of the present disclosure as well.
The method 700 then proceeds to block 704 where the health report generation subsystem in the component performs data compaction operations on the telemetry data to generate compacted telemetry data. In an embodiment, at block 704, the health report generation engine 304 in the computing device component 400 may perform data compaction operations on the telemetry data received at block 702 in order to generate compacted telemetry data. For example, at block 704, the data compaction module 602 in the application 504b/600 provided by the heat generation engine 404/500 may perform the data compaction operations on the telemetry data received at block 702 in order to generate the compacted telemetry data. As will be appreciated by one of skill in the art in possession of the present disclosure, data compaction operations may be configured to reduce the number of data elements, bandwidth, cost, and/or time associated with the generation, transmission, and/or storage of data without an associated loss of information of interest in that telemetry data via the elimination of redundant data, the removal of irrelevant data, and/or the use of particular coding techniques. Some specific examples of data compaction operations utilize fixed-tolerance bands, variable-tolerance bands, slope-key points, sample changes, curve patterns, curve fitting, variable-precision coding, frequency analysis, probability analysis, and data embeddings, although one of skill in the art in possession of the present disclosure will appreciate that data compaction operations utilizing other data compaction techniques will fall within the scope of the present disclosure as well.
To provide a specific, simplified example of data compaction operations, assume a time series of telemetry data received from n computing device component subsystems over t time steps is provided by the following telemetry data matrix:
Using data compaction operations similar to those discussed above, the information in the telemetry data matrix above may be represented using the following compacted telemetry data matrix of dimensions (m, n), where m is the number of parameters required for the particular data compaction technique being utilized:
To provide a greatly simplified example, the element φ1 in the compacted telemetry data matrix may provide a mean of the time-series data X11, X21, . . . Xt1 in the telemetry data matrix, the element ω1 in the compacted telemetry data matrix may provide a standard deviation of the time-series data X11, X21, . . . Xt1 in the telemetry data matrix, the element α1 in the compacted telemetry data matrix may identify anomalies in the time-series data X11, X21, . . . Xt1 in the telemetry data matrix; the element φ2 in the compacted telemetry data matrix may provide a mean of the time-series data X12, X22, . . . Xt2 in the telemetry data matrix, the element ω2 in the compacted telemetry data matrix may provide a standard deviation of the time-series data X12, X22, . . . Xt2 in the telemetry data matrix, the element α2 in the compacted telemetry data matrix may identify anomalies in the time-series data X12, X22, . . . Xt2 in the telemetry data matrix; and so on. As such, one of skill in the art in possession of the present disclosure will appreciate how data compaction techniques like those described above may generate compacted telemetry data that eliminates noise, “fluke” events, and/or other anomalies that are present in the telemetry data, thus preventing such anomalies from effecting the inference operations discussed below.
As will be appreciated by one of skill in the art in possession of the present disclosure, the compacted telemetry data matrix may be considerably smaller than the telemetry data matrix, as m may be much smaller than t, and the smaller size of the compacted telemetry data matrix requires less storage space and results in lower latencies when transferred via the network, while also preserving information of interest in the telemetry data matrix. Mathematically, these data compaction operations may be represented by the following equation:
X′
(m,n)=δ(X(t,n)),m<<t
Where X is the telemetry data matrix and X′ is the corresponding compacted telemetry data matrix, and δ is a mathematical representation of the data compaction method. As will be appreciated by one of skill in the art in possession of the present disclosure, the data compaction method utilized in the present disclosure may be considered a hyperparameter and may be selected based on the performance of different data compaction methods in particular use cases.
As discussed above, in some embodiments the health report generation engine 304 in the computing device component 400 may receive the telemetry data from the computing device component subsystems 406a-406c in telemetry data streams and, in such embodiments, the data compaction module 602 in the application 504b/600 provided by the health report generation engine 404/500 may perform the data compaction operations discussed above on that telemetry data stream, with the resulting compacted telemetry data having a size that may be transferred with relative low amounts of network bandwidth, stored with relatively low amounts of storage, and/or providing other benefits that would be apparent to one of skill in the art in possession of the present disclosure. As such, in some examples, the telemetry data may have the data compaction operations performed on it as it is continuously streamed to the data compaction module 602 to generate the compacted telemetry data, which one of skill in the art in possession of the present disclosure will recognize allows for relatively high frequency telemetry data collection without the associated network transfer latencies and required storage resources introduced in conventional computing device health determination and reporting systems.
As will be appreciated by one of skill in the art in possession of the present disclosure, the data compaction operations on the telemetry data streams discussed above may allow the telemetry data to be discarded by the application 504b/600 provided by the health report generation engine 404/500 as the corresponding compacted telemetry data is generated (e.g., the compacted telemetry data may be continuously modified/updated using the telemetry data provided in the telemetry data stream, with that telemetry data discarded after use in modifying/updating the compacted telemetry data stream via the data compaction operations), which allows the information of interest in the telemetry data to be captured without the need to store that telemetry data that would otherwise require relatively large files/storage for the health report generation engine 404/500. For example, such telemetry data stream compaction operations may be represented by the following equation:
X′
t=δ(Xt,X′t-1)
Wherein X′t and Xt are compacted telemetry data and a telemetry data stream at time t, respectively, X′t-1 is the compacted telemetry data at time t−1, and δ is mathematical representation of the data compaction method.
The method 700 then proceeds to block 706 where the health report generation subsystem in the component performs inference operations on the compacted telemetry data to generate a health report. In an embodiment, following the data compaction operations at block 704, the health report generation engine 404 in the computing device component 400 may perform inference operations on the compacted telemetry data to generate a health report. For example, with reference to
With reference to
As discussed above, the inference module 604 in the application 504b/600 provided by the heat generation engine 404/500 may utilize inference operation parameters generated via deep learning training on an inference model. As will be appreciated by one of skill in the art in possession of the present disclosure, such deep learning models may include a sequence of layers that may perform linear operations or element-wise transformations, and may provide a neural network model that can be expressed via the following composite function:
y′=ηw(X)
where y′ is an estimate of a target (y), η is a composite function with weight/parameters set W, and X is an input feature matrix.
One of skill in the art in possession of the present disclosure will recognize how such a neural network model may be initialized with random weights/parameters and, during the iterative training process discussed above, the optimal weights/parameters will then be learned through back propagation. In the component health determination and reporting system of the present disclosure, the majority of the computational load associated with the neural network model will be spent by the support system 206 during that iterative training process using large databases of compacted telemetry data provided by the computing device components 306a-306c, while the performance of the inference operations by the inference module 604 in the application 504b/600 provided by the heat generation engine 404/500 will be relatively light on the computational resources of the computing device components 400 (i.e., the inference/“forward pass” operations are a sequence of linear operations (O(n3)) for one data point and have a significantly smaller computational overhead relative to the deep learning training operations that generate the inference operation parameters used in those operations).
As discussed above, in some embodiments such as when the computing device 202a-202c/300 and computing device component 306a-306c/400 are provided in an “edge” environment that constrains their processing resources to particular tasks, the inference module 604 may be provided via inference module code that is custom written (e.g., using a relatively low-level programming language such as C++) and relatively highly optimized for the computing device component 400 to perform the “forward pass” inference operations discussed above. As discussed above, commercially available toolkits that are conventionally used to enable the models discussed above may be impractical for edge environments that require limited use of processing resources and storage resources in that edge environment. For example, the volatile memory subsystem 504 in the computing device component 400 may include a storage capacity in the tens of megabytes, and may be shared by multiple microservices and/or micro-applications, and thus a forward-pass-only inference module representing the deep learning/neural network model 11w described above may perform the inference operations at block 706. As discussed above, that deep learning/neural network model 11w may be a composite of multiple functions and may be written as follows:
ηW(X′)=ηkWk(ηk-1Wk-1( . . . (η1W1(X))),Wk,Wk-1, . . . ,W1∈W
where ηk, ηk-1, . . . , η1 are the functions representing the k layers that compose the model raw, and Wk, Wk-1, . . . , W1 are the weights/parameters associated with the k layers, jointly constituting the superset W.
As such, one of skill in the art in possession of the present disclosure will recognize how the inference module 604 provided by the inference module code discussed above may utilize the inference operations parameters described above (e.g., from their respective binary files stored in the non-volatile memory subsystem 506) along with the compacted telemetry data to generate a health report from the sequence of k functions/layers at block 706. In an embodiment, the health report generated at block 706 may include a current health status of any of the computing device component subsystems 406a-406c (e.g., expressed as a health “percentage” for the computing device component subsystem(s), the computing device component(s), and/or the computing device between 1.0/100% (i.e., “new”) and 0.0/0% (i.e., “failure”)), health alerts for any of the computing device component subsystems 406a-406c (e.g., based on health information for those computing device component subsystems 406a-406c being outside a range or exceeding a threshold), and/or a variety of other health report information that would be apparent to one of skill in the art in possession of the present disclosure. However, while particular processes and techniques for generating the health report at block 706 have been described, one of skill in the art in possession of the present disclosure will appreciate how other processes and/or techniques for generating the health report of the present disclosure will fall within the scope of the present disclosure as well. As will be appreciated by one of skill in the art in possession of the present disclosure, the use of the compute resources in the computing device components 306a-306c/400 alleviates the need to utilize compute resources (e.g., the CPU) of the computing device 300.
With reference to
With reference to
The method 700 then proceeds to block 708 where the health report generation subsystem in the component provides the health report to a computing device. In an embodiment, at block 706, the health report generation engine 404 in the computing device component 400 provides the health report generated at block 706 to the computing device 300. With reference to
With reference to
The method 700 then proceeds to block 710 where the health report generation subsystem in the component transmits the compacted telemetry data and the health report to a support system. With reference to
The method 700 then proceeds to block 712 where the health report generation subsystem in the component receives updated inference operation parameters from the support system. As discussed above, the support system 206 may utilize compacted telemetry data and health reports like those received at block 710 to perform deep learning training operations on an inference model to identify relationships between the compacted telemetry data and health reports (as well as health issues that are actually experienced by computing device components and that may be provided to the support system 206 using a variety of techniques that would be apparent to one of skill in the art in possession of the present disclosure), and determining updated inference operation parameters based on those identified relationships.
As such, with reference to
Thus, systems and methods have been described that provide components that collect telemetry data from their component subsystems, compact that telemetry data to generate compacted telemetry data, and perform inference operations on that compacted telemetry data to generate their own health reports, with those health reports provided to a computing device and possibly resulting in the provisioning of health alerts for the computing device. For example, the component health determination and reporting system of the present disclosure may include a computing device having a computing device component. The computing device component includes computing device component subsystem(s) coupled to a health report generation subsystem. The health report generation subsystem receives telemetry data from each of the computing device component subsystem(s), performs data compaction operations on the telemetry data to generate compacted telemetry data, performs inference operations on the compacted telemetry data to generate a health report for the computing device component, and provides health alert(s) for the computing device based on the health report. As such, the issues with conventional computing device health determination and reporting systems discussed above are remedied.
Although illustrative embodiments have been shown and described, a wide range of modification, change and substitution is contemplated in the foregoing disclosure and in some instances, some features of the embodiments may be employed without a corresponding use of other features. Accordingly, it is appropriate that the appended claims be construed broadly and in a manner consistent with the scope of the embodiments disclosed herein.
Number | Date | Country | |
---|---|---|---|
20240135208 A1 | Apr 2024 | US |