Various example embodiments relate to machine learning model renewal. More specifically, various example embodiments exemplarily relate to measures (including methods, apparatuses and computer program products) for realizing machine learning model renewal.
The present specification generally relates to utilization and management of machine learning (ML) models.
There are user equipment (UE) based ML use cases that realize and/or enhance control-plane (CP) network functions for radio access network (RAN)/core network (CN).
An example of a handover (HO) optimization use case shows how the network can benefit by running an artificial intelligence (AI)/ML model at the UE. As detailed in 3rd Generation Partnership Project (3GPP) technical report (TR) 28.809, the handover optimization is currently based on radio conditions for selecting the target gNodeB (gNB) upon handover. The handover process may result into wastage of UE and network resources and the handover process may introduce service disruption due to increased latency and radio link failures. To improve handover performances, the 5G system (5GS) may train an AI/ML inference model using UE-related data such as reference signal received power (RSRP), UE location, and UE trajectory from multiple UEs. Then, the 5GS distributes the trained AI/ML inference model to each UE for proactively triggering a handover based on the output of the AI/ML inference model.
Another example of a use case of AI/ML models is load prediction to prevent cell congestion as described in use case #2 in clause 5.1.2 in TR 23.700-91. In this use case, the 5GS, to prevent cell congestion, may train an AI/ML inference model to forecast a UE's load pattern. The 5GS forwards the trained AI/ML inference model to the UEs that can notify the 5GS about their load pattern forecast by using the input data such as UE-specific location data, channel and signal conditions, etc. Based on the load pattern forecast returned by the UEs, the 5GS is able to optimize the network, to thereby avoid cell congestion.
These UE-based ML use cases utilize their ML models and the data from the UEs' measurements of the network to make predictions and take proper actions. These ML models can be provided by the network or self-trained by the UEs with their measurements of a certain network. These ML models can work correctly only for the network (or its alike) where the profile/natural of its measurement data is same as or very similar to that of the data used to train these models.
There are also ML cases in which the model is shared among network functions (NF). In TR 23.700-91, KI #19 studies trained data model sharing between multiple network data analytics function (NWDAF) instances. According thereto, the possibility for a NWDAF consumer to request a new AI model in case the performance does not meet the requirements is foreseen.
When a UE with the ML model received from the network (trained e.g. by an NWDAF or a management data analytics function (MDAF)) moves away from the network where its measurements data have been used to train the model, or the network context changes, the UE may get into a different network with clearly different measurement data. In such case, the performance of the ML model and its corresponding action would likely degrade and become erroneous, as the network context has changed.
Hence, the problem arises that a UE may apply an AI/ML (inference) model which is not suitable (anymore) for the network situation experienced by the UE.
Hence, there is a need to provide for machine learning model renewal.
Various example embodiments aim at addressing at least part of the above issues and/or problems and drawbacks.
Various aspects of example embodiments are set out in the appended claims.
According to an exemplary aspect, there is provided a method comprising receiving a first machine learning model message including a first machine learning inference model, obtaining network related input data, feeding said first machine learning inference model with said network related input data, receiving, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning model message including a second machine learning inference model, and replacing said first machine learning inference model with said second machine learning inference model.
According to an exemplary aspect, there is provided a method comprising training a first machine learning inference model based on a first training data set, transmitting a first machine learning model message including said first machine learning inference model, retraining, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning inference model based on a second training data set, and transmitting a second machine learning model message including said second machine learning inference model.
According to an exemplary aspect, there is provided a method comprising receiving a data collection request, collecting data, and transmitting a data collection response including said collected data.
According to an exemplary aspect, there is provided an apparatus comprising receiving circuitry configured to receive a first machine learning model message including a first machine learning inference model, obtaining circuitry configured to obtain network related input data, feeding circuitry configured to feed said first machine learning inference model with said network related input data, receiving circuitry configured to receive, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning model message including a second machine learning inference model, and replacing circuitry configured to replace said first machine learning inference model with said second machine learning inference model.
According to an exemplary aspect, there is provided an apparatus comprising training circuitry configured to train a first machine learning inference model based on a first training data set, transmitting circuitry configured to transmit a first machine learning model message including said first machine learning inference model, retraining circuitry configured to retrain, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning inference model based on a second training data set, and transmitting circuitry configured to transmit a second machine learning model message including said second machine learning inference model.
According to an exemplary aspect, there is provided an apparatus comprising receiving circuitry configured to receive a data collection request, collecting circuitry configured to collect data, and transmitting circuitry configured to transmit a data collection response including said collected data.
According to an exemplary aspect, there is provided an apparatus comprising at least one processor, at least one memory including computer program code, and at least one interface configured for communication with at least another apparatus, the at least one processor, with the at least one memory and the computer program code, being configured to cause the apparatus to perform receiving a first machine learning model message including a first machine learning inference model, obtaining network related input data, feeding said first machine learning inference model with said network related input data, receiving, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning model message including a second machine learning inference model, and replacing said first machine learning inference model with said second machine learning inference model.
According to an exemplary aspect, there is provided an apparatus comprising at least one processor, at least one memory including computer program code, and at least one interface configured for communication with at least another apparatus, the at least one processor, with the at least one memory and the computer program code, being configured to cause the apparatus to perform training a first machine learning inference model based on a first training data set, transmitting a first machine learning model message including said first machine learning inference model, retraining, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning inference model based on a second training data set, and transmitting a second machine learning model message including said second machine learning inference model.
According to an exemplary aspect, there is provided an apparatus comprising at least one processor, at least one memory including computer program code, and at least one interface configured for communication with at least another apparatus, the at least one processor, with the at least one memory and the computer program code, being configured to cause the apparatus to perform receiving a data collection request, collecting data, and transmitting a data collection response including said collected data.
According to an exemplary aspect, there is provided a computer program product comprising computer-executable computer program code which, when the program is run on a computer (e.g. a computer of an apparatus according to any one of the aforementioned apparatus-related exemplary aspects of the present disclosure), is configured to cause the computer to carry out the method according to any one of the aforementioned method-related exemplary aspects of the present disclosure.
Such computer program product may comprise (or be embodied) a (tangible) computer-readable (storage) medium or the like on which the computer-executable computer program code is stored, and/or the program may be directly loadable into an internal memory of the computer or a processor thereof.
Any one of the above aspects enables efficient ways to detect a mismatch with respect to the AI/ML (inference) model and the network (context), to indicate the mismatch, and to get a new (suitable) model from network to thereby solve at least part of the problems and drawbacks identified in relation to the prior art.
By way of example embodiments, there is provided machine learning model renewal. More specifically, by way of example embodiments, there are provided measures and mechanisms for realizing machine learning model renewal.
Thus, improvement is achieved by methods, apparatuses and computer program products enabling/realizing machine learning model renewal.
In the following, the present disclosure will be described in greater detail by way of non-limiting examples with reference to the accompanying drawings, in which
The present disclosure is described herein with reference to particular non-limiting examples and to what are presently considered to be conceivable embodiments. A person skilled in the art will appreciate that the disclosure is by no means limited to these examples, and may be more broadly applied.
It is to be noted that the following description of the present disclosure and its embodiments mainly refers to specifications being used as non-limiting examples for certain exemplary network configurations and deployments. Namely, the present disclosure and its embodiments are mainly described in relation to 3GPP specifications being used as non-limiting examples for certain exemplary network configurations and deployments. As such, the description of example embodiments given herein specifically refers to terminology which is directly related thereto. Such terminology is only used in the context of the presented non-limiting examples, and does naturally not limit the disclosure in any way. Rather, any other communication or communication related system deployment, etc. may also be utilized as long as compliant with the features described herein.
Hereinafter, various embodiments and implementations of the present disclosure and its aspects or embodiments are described using several variants and/or alternatives. It is generally noted that, according to certain needs and constraints, all of the described variants and/or alternatives may be provided alone or in any conceivable combination (also including combinations of individual features of the various variants and/or alternatives).
According to example embodiments, in general terms, there are provided measures and mechanisms for (enabling/realizing) machine learning model renewal.
As discussed above, the problem arises that a UE may apply AI/ML (inference) models which are not suitable (anymore) for the network situation experienced by the UE.
The same problem exists also when the ML model is shared among NFs. Even though in TR 23.700-91 in clause 6.5.2 (Sol. #5 to KI #19) the option to request a ML model update is included, no mechanisms to detect the need of ML model renewal are proposed.
Example embodiments thus address the necessity to detect a mismatch between the UE's pre-trained model and the current network context. When the mismatch is detected, the UE's model can be either renewed with the matching model provided by the current network, or the current model can be retrained on data from the current network context.
Example embodiments are hereinafter discussed in general terms.
According to example embodiments, a solution to detect the mismatch between the pre-trained model sent by the 5GS to the AI/ML model consumer (i.e., UE or NF or MnF) and the network context is provided.
According to example embodiments, the 5GS is responsible to train the AI/ML model that is being sent to the AI/ML model consumer for inference (AI/ML inference model, also mentioned as machine learning inference model). According to further example embodiments, if the ground truth data (i.e., real outputs of respective inputs) is not available to evaluate the performance of the inference model, the 5GS is also responsible to train another AI/ML model (AI/ML monitoring model, also mentioned as machine learning monitoring model) to profile the dataset used to train the AI/ML inference model.
According to example embodiments, function entities of an AI/ML model consumer, the 5GS, and a data producer (entity), as well as their interworking to detect the mismatch between the pretrained (pre-trained) model and the current network context are provided.
Based on the operational conditions, one of the following detection mechanisms is selected as the suitable detection mechanism to check whether the AI/ML inference model is working properly in terms of performance:
An AI/ML monitoring model could be an unsupervised ML (monitoring) model able to identify whether there is a clear profile difference between the dataset utilized to train the inference model and the current data used as input to the inference model in runtime. When the AI/ML monitoring model detects the existence of such a clear profile difference, the AI/ML monitoring model determines that the current network context is clearly different from the network context where the AI/ML inference model has been trained with its training data. The 5GS is then requested to retrain the AI/ML inference model and, if relevant, the AI/ML monitoring model.
Example embodiments are applicable to any entity responsible for providing the ML model to the AI/ML model consumer and to any sharing mechanism adopted.
Example embodiments may impact open radio access network (O-RAN) specification documents, e.g. the AI/ML Workflow Technical Report, and the A1 specifications (for A1-ML-ML Model Management Service). Example embodiments may be reflected as well in the Non-Real Time RAN intelligent controller (RIC) Technical Report. Example embodiments may also impact R1 technical specification.
In addition, example embodiments impact TR 28.809, TS 23.288 and TR 23.700-91 by providing the solution to the proposed use cases in the technical reports, and since MDAF and NWDAF are the management function and 5G core network (5GC) NF responsible to provide analytics, represent the best candidates to train, and share AI/ML models, according to example embodiments, these (MDAF and NWDAF) are enhanced to support the functionalities introduced herein in relation to the 5GS.
In addition, example embodiments impact any interface utilized by a sharing mechanism to provide the ML models to the AI/ML model consumer. An example of interfaces that may be impacted are N1, N2, Namf, and Nnwdaf in case the AI/ML models are sent to the AI/ML model consumer by the 5GS core network (CN), or Uu in the case the AI/ML models are shared by the RAN. According to example embodiments, these interfaces are enhanced to enable the exchange of messages, AI/ML inference and monitoring models, AI/ML model-related information such as performance data (including non-3GPP data) needed to perform the inferences and training of the models. In this case, example embodiments also impact TS 23.501, TS 23.502.
According to example embodiments, a method according to the above-discussed principle is also applicable for a network/OAM entity to monitor the operability of an inference model.
Subsequently, example embodiments as discussed above are explained in other words.
As shown in
In an embodiment at least some of the functionalities of the apparatus shown in
In particular a case is considered where ground truth is not available/accessible at the network entity (UE/NF/MnF), and where e.g. the collects the input data (network related input data).
According to a variation of the procedure shown in
In particular a case is considered where the ground truth is not available/accessible at the network entity (UE/NF/MnF), and where e.g. the network entity (UE/NF/MnF) collects the input data.
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to further example embodiments, said second machine learning model message includes a second machine learning monitoring model.
In particular a case is considered where the ground truth available at the network entity (UE/NF/MnF).
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to further example embodiments, said second machine learning model message includes information on a second reference performance of said second machine learning inference model.
As shown in
In an embodiment at least some of the functionalities of the apparatus shown in
According to a variation of the procedure shown in
In particular a case is considered where the ground truth is not available/accessible at e.g. an UE/NF/MnF, and where the network entity (5GS/NWDAF) or e.g. the UE/NF/MnF collects the input data (network related input data).
According to a variation of the procedure shown in
In particular a case is considered where the ground truth is not available/accessible at e.g. an UE/NF/MnF, and where the network entity (5GS/NWDAF) collects the input data.
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to a variation of the procedure shown in
In particular a case is considered where the ground truth is not available/accessible at e.g. an UE/NF/MnF, and where e.g. the UE/NF/MnF collects the input data.
According to further example embodiments, said first machine learning model message includes said first machine learning monitoring model.
According to a variation of the procedure shown in
According to a variation of the procedure shown in
In particular a case is considered where the ground truth is available at e.g. an UE/NF/MnF.
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to a variation of the procedure shown in
According to a variation of the procedure shown in
As shown in
In an embodiment at least some of the functionalities of the apparatus shown in
Subsequently, example embodiments as discussed above are explained in more specific terms with reference to
In particular, example embodiments explained below illustrate a specific mechanism to detect the profile difference (i.e., the mismatch), the specific mismatch detection procedures where the mismatch detection procedure to use is selected based on the conditions, and enhancement to the impacted interfaces with the UE taking the role of AI/ML model consumer. According to example embodiments, the same mechanisms are alternatively applied to NFs, e.g. with an NWDAF taking the role of AI/ML model consumer and the AI/ML models being shared by another NWDAF.
As illustrated in
According to example embodiments, the profile of the data utilized in such case would consist of the following (1) to (3):
According to example embodiments, this data profile is utilized for mismatch detection with the AI/ML monitoring model.
Namely, if data observations collected outside the above operational scope during the inference time amount to more than the threshold (item (3) above), there is a mismatch between the pretrained inference model and the current network context. Hence, retraining of both the inference model and the monitoring model is needed, and a respective decision is made according to example embodiments.
It is noted that there are different outlier/inlier/novelty detection solutions applicable. In the example of
In detail, according to example embodiments, one of the mismatch detection procedures is selected according to the conditions described above (availability/accessibility of ground truth, responsibility of input data (network related input data) collection).
That is, according to example embodiments, ML model consumer (e.g. UE) and ML model provider (e.g. 5GS, NWDAF) are not only enabled to apply any of the disclosed mismatch detection procedures but may also be configured to select the suitable mismatch detection procedure based on the conditions described above (availability/accessibility of ground truth, responsibility of input data (network related input data) collection).
Namely, as is illustrated in
Further, as is illustrated in
Here, it is noted that an access and mobility management function (AMF) may be interposed between the NWDAF (being the ML model provider) and the UE (being the ML model consumer), where the AMF relays the communication between the ML model provider and the ML model consumer. Further, an OAM is exemplarily deployed as the data collector.
As already discussed in relation to
Namely, as is illustrated in
Further, as is illustrated in
As is further illustrated in
Example embodiments impact interfaces utilized by any sharing mechanism, requiring an enhancement in order to convey the AI/ML inference and monitoring models along with their performance information. Thus, example embodiments would impact all the specifications related to the affected interfaces.
As an example (see
For O-RAN, example embodiments might impact the following O-RAN specification documents: the AI/ML Workflow Technical Report, and the A1 specifications (the A1-ML-ML Model Management Service). Example embodiments may be reflected as well in the Non-Real Time RIC Technical Report.
The above-described procedures and functions may be implemented by respective functional elements, processors, or the like, as described below.
In the foregoing exemplary description of the network entity, only the units that are relevant for understanding the principles of the disclosure have been described using functional blocks. The network entity may comprise further units that are necessary for its respective operation. However, a description of these units is omitted in this specification. The arrangement of the functional blocks of the devices is not construed to limit the disclosure, and the functions may be performed by one block or further split into sub-blocks.
When in the foregoing description it is stated that the apparatus, i.e. network entity (or some other means) is configured to perform some function, this is to be construed to be equivalent to a description stating that a (i.e. at least one) processor or corresponding circuitry, potentially in cooperation with computer program code stored in the memory of the respective apparatus, is configured to cause the apparatus to perform at least the thus mentioned function. Also, such function is to be construed to be equivalently implementable by specifically configured circuitry or means for performing the respective function (i.e. the expression “unit configured to” is construed to be equivalent to an expression such as “means for”).
In
The processor 1211/1231/1251 and/or the interface 1213/1233/1253 may also include a modem or the like to facilitate communication over a (hardwire or wireless) link, respectively. The interface 1213/1233/1253 may include a suitable transceiver coupled to one or more antennas or communication means for (hardwire or wireless) communications with the linked or connected device(s), respectively. The interface 1213/1233/1253 is generally configured to communicate with at least one other apparatus, i.e. the interface thereof.
The memory 1212/1232/1252 may store respective programs assumed to include program instructions or computer program code that, when executed by the respective processor, enables the respective electronic device or apparatus to operate in accordance with the example embodiments.
In general terms, the respective devices/apparatuses (and/or parts thereof) may represent means for performing respective operations and/or exhibiting respective functionalities, and/or the respective devices (and/or parts thereof) may have functions for performing respective operations and/or exhibiting respective functionalities.
When in the subsequent description it is stated that the processor (or some other means) is configured to perform some function, this is to be construed to be equivalent to a description stating that at least one processor, potentially in cooperation with computer program code stored in the memory of the respective apparatus, is configured to cause the apparatus to perform at least the thus mentioned function. Also, such function is to be construed to be equivalently implementable by specifically configured means for performing the respective function (i.e. the expression “processor configured to [cause the apparatus to] perform xxx-ing” is construed to be equivalent to an expression such as “means for xxx-ing”).
According to example embodiments, an apparatus representing the network entity 10 comprises at least one processor 1211, at least one memory 1212 including computer program code, and at least one interface 1213 configured for communication with at least another apparatus. The processor (i.e. the at least one processor 1211, with the at least one memory 1212 and the computer program code) is configured to perform receiving a first machine learning model message including a first machine learning inference model (thus the apparatus comprising corresponding means for receiving), to perform obtaining network related input data (thus the apparatus comprising corresponding means for obtaining), to perform feeding said first machine learning inference model with said network related input data (thus the apparatus comprising corresponding means for feeding), to perform receiving, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning model message including a second machine learning inference model, and to perform replacing said first machine learning inference model with said second machine learning inference model (thus the apparatus comprising corresponding means for replacing).
According to example embodiments, an apparatus representing the network entity 30 comprises at least one processor 1231, at least one memory 1232 including computer program code, and at least one interface 1233 configured for communication with at least another apparatus. The processor (i.e. the at least one processor 1231, with the at least one memory 1232 and the computer program code) is configured to perform training a first machine learning inference model based on a first training data set (thus the apparatus comprising corresponding means for training), to perform transmitting a first machine learning model message including said first machine learning inference model (thus the apparatus comprising corresponding means for transmitting), to perform retraining, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning inference model based on a second training data set (thus the apparatus comprising corresponding means for retraining), and to perform transmitting a second machine learning model message including said second machine learning inference model.
According to example embodiments, an apparatus representing the network entity 50 comprises at least one processor 1251, at least one memory 1252 including computer program code, and at least one interface 1253 configured for communication with at least another apparatus. The processor (i.e. the at least one processor 1251, with the at least one memory 1252 and the computer program code) is configured to perform receiving a data collection request (thus the apparatus comprising corresponding means for receiving), to perform collecting data (thus the apparatus comprising corresponding means for collecting), and to perform transmitting a data collection response including said collected data (thus the apparatus comprising corresponding means for transmitting).
For further details regarding the operability/functionality of the individual apparatuses, reference is made to the above description in connection with any one of
For the purpose of the present disclosure as described herein above, it should be noted that
In general, it is to be noted that respective functional blocks or elements according to above-described aspects can be implemented by any known means, either in hardware and/or software, respectively, if it is only adapted to perform the described functions of the respective parts. The mentioned method steps can be realized in individual functional blocks or by individual devices, or one or more of the method steps can be realized in a single functional block or by a single device.
Generally, any method step is suitable to be implemented as software or by hardware without changing the idea of the present disclosure. Devices and means can be implemented as individual devices, but this does not exclude that they are implemented in a distributed fashion throughout the system, as long as the functionality of the device is preserved. Such and similar principles are to be considered as known to a skilled person.
Software in the sense of the present description comprises software code as such comprising code means or portions or a computer program or a computer program product for performing the respective functions, as well as software (or a computer program or a computer program product) embodied on a tangible medium such as a computer-readable (storage) medium having stored thereon a respective data structure or code means/portions or embodied in a signal or in a chip, potentially during processing thereof.
The present disclosure also covers any conceivable combination of method steps and operations described above, and any conceivable combination of nodes, apparatuses, modules or elements described above, as long as the above-described concepts of methodology and structural arrangement are applicable.
In view of the above, there are provided measures for machine learning model renewal. Such measures exemplarily comprise receiving a first machine learning model message including a first machine learning inference model, obtaining network related input data, feeding said first machine learning inference model with said network related input data, receiving, upon unsuitability of said first machine learning inference model for an experienced network condition, a second machine learning model message including a second machine learning inference model, and replacing said first machine learning inference model with said second machine learning inference model.
Even though the disclosure is described above with reference to the examples according to the accompanying drawings, it is to be understood that the disclosure is not restricted thereto. Rather, it is apparent to those skilled in the art that the present disclosure can be modified in many ways without departing from the scope of the inventive idea as disclosed herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/052092 | 1/29/2021 | WO |