This application relates to the field of communication technologies, and specifically, to a model training processing method and apparatus, a terminal, and a network side device.
With the development of communication technologies, a communication scenario based on artificial intelligence (Artificial Intelligence, AI) has been introduced in wireless communication. Currently, in many scenarios of wireless communication based on AI, it is difficult to obtain a large volume of labeled data. Without a large volume of labeled data, it is impossible to train a suitable model through supervised learning, resulting in low reliability in communication.
According to a first aspect, a model training processing method is provided, including: obtaining, by a first device, first information, the first information including first data; and processing, by the first device, the first data by using a first model to obtain second data, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
According to a second aspect, a model training processing method is provided, including:
According to a third aspect, a model training processing apparatus is provided, including:
According to a fourth aspect, a model training processing apparatus is provided, including:
According to a fifth aspect, a model training processing apparatus is provided, including:
According to a sixth aspect, a terminal is provided, including a processor and a memory, where the memory stores a program or an instruction runnable on the processor, and the program or the instruction, when executed by the processor, implements the steps of the method according to the first aspect.
According to a seventh aspect, a terminal is provided, including a processor and a communication interface.
When the terminal is a first device, the processor is configured to obtain first information, the first information including first data; and process the first data by using a first model to obtain second data, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the communication interface is configured to send second information to the first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the communication interface is configured to send first information to the first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
According to an eighth aspect, a network side device is provided, including a processor and a memory, where the memory stores a program or an instruction runnable on the processor, and the program or the instruction, when executed by the processor, implements the steps of the method according to the second aspect.
According to a ninth aspect, a network side device is provided, including a processor and a communication interface.
When the network side device is a first device, the processor is configured to obtain first information, the first information including first data; and process the first data by using a first model to obtain second data, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the network side device is a second device, the communication interface is configured to send second information to the first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the network side device is a second device, the communication interface is configured to send first information to the first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model, where both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
According to a tenth aspect, a readable storage medium is provided, storing a program or an instruction, where the program or the instruction, when executed by a processor, implements the steps of the method according to the first aspect, or implements the steps of the method according to the second aspect.
According to an eleventh aspect, a chip is provided, including a processor and a communication interface, where the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the steps of the method according to the first aspect, or implement the steps of the method according to the second aspect.
According to a twelfth aspect, a computer program/program product is provided, stored in a storage medium and executed by at least one processor to implement the steps of the method according to the first aspect, or implement the steps of the method according to the second aspect.
The technical solutions in embodiments of this application are clearly described in the following with reference to the accompanying drawings in the embodiments of this application. Apparently, the described embodiments are merely some rather than all of the embodiments of this application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of this application fall within the protection scope of this application.
The terms “first”, “second”, and so on in this specification and claims of this application are intended to distinguish between similar objects rather than describe a specific order or sequence. It should be understood that the terms used in such a way are interchangeable in proper circumstances, so that the embodiments of this application can be implemented in sequences other than those illustrated or described herein. In addition, the objects distinguished by “first” and “second” are usually of one type, and the quantities of the objects are not limited. For example, there may be one or more first objects. In addition, “and/or” in this specification and the claims indicates at least one of the connected objects, and the character “/” usually indicates an “or” relationship between the associated objects.
It should be noted that, the technologies described in the embodiments of this application are not limited to a Long Term Evolution (Long Term Evolution, LTE)/LTE-Advanced (LTE-Advanced, LTE-A) system, and can be further used in other wireless communication systems, such as Code Division Multiple Access (Code Division Multiple Access, CDMA), Time Division Multiple Access (Time Division Multiple Access, TDMA), Frequency Division Multiple Access (Frequency Division Multiple Access, FDMA), Orthogonal Frequency Division Multiple Access (Orthogonal Frequency Division Multiple Access, OFDMA), Single-Carrier Frequency Division Multiple Access (Single-carrier Frequency-Division Multiple Access, SC-FDMA), and other systems. The terms “system” and “network” in the embodiments of this application are often used interchangeably, and the described technologies can be used not only for the above-mentioned systems and radio technologies, but also for other systems and radio technologies. The following description describes a New Radio (New Radio, NR) system for exemplary purposes, and uses NR terms in most of the following descriptions, but these technologies are also applicable to systems other than the NR system, such as a 6th generation (6th Generation, 6G) communication system.
For ease of understanding, some contents involved in the embodiments of this application are described below.
At present, artificial intelligence is widely applied in various fields. There are a plurality of implementations of AI modules, for example, a neural network, a decision tree, a support vector machine, and a Bayes classifier. In this application, a neural network is used as an example for description, but a specific type of the AI module is not limited.
The neural network includes neurons, and a schematic diagram of the neurons is shown in
Parameters of the neural network are optimized by using an optimization algorithm. An optimization algorithm is an algorithm that helps minimize or maximize an objective function (sometimes referred to as a loss function). The objective function is usually a mathematical combination of the model parameters and the data. For example, given data x and a label Y corresponding to x, a neural network model f(.) is constructed. With this model, a predicted output f(x) may be obtained based on the input x, and the gap (f(x)−Y) between the predicted value and the real value may be calculated. This is a loss function. The objective of the optimization is to find proper values of the parameters W and b that minimize the value of the loss function. A smaller loss value indicates that the model is closer to the real situation.
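To make this concrete, the following is a minimal sketch of evaluating such a loss for a toy linear model; the model form f(x) = Wx + b, the data values, and the squared-error choice are illustrative assumptions, not something defined in this application.

```python
import numpy as np

# Toy model f(x) = W*x + b; W, b, and the data are illustrative values only.
x = np.array([1.0, 2.0, 3.0])        # input data x
Y = np.array([2.1, 3.9, 6.2])        # labels Y corresponding to x

W, b = 1.5, 0.1                      # current parameters to be optimized
f_x = W * x + b                      # predicted output f(x)

loss = np.mean((f_x - Y) ** 2)       # gap between predicted and real values
print(f"loss = {loss:.4f}")          # a smaller value means a closer fit
```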
Currently, common optimization algorithms are basically based on an error back propagation (error Back Propagation, BP) algorithm. The basic idea of the BP algorithm is that a learning process includes two processes: signal forward propagation and error back propagation. During forward propagation, an input sample is transmitted from the input layer, processed layer by layer by each hidden layer, and then transmitted to the output layer. If the actual output of the output layer does not match the expected output, an error back propagation stage is performed. In error back propagation, the output error is propagated back, layer by layer, from the output layer to the input layer through the hidden layers, and the error is distributed to all units at each layer, to obtain an error signal for the units at each layer. This error signal is used as a basis for correcting the weight of each unit. This cycle of signal forward propagation, error back propagation, and layer-by-layer weight adjustment is performed repeatedly. The process of continuously adjusting the weights is the learning and training process of the network. This process continues until the error outputted by the network is reduced to an acceptable degree or a preset quantity of learning iterations is performed.
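The following is a minimal sketch of this forward-propagation/back-propagation cycle for a one-hidden-layer network; the layer sizes, tanh activation, squared-error loss, and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(16, 4))                           # input samples
Y = rng.normal(size=(16, 1))                           # expected outputs

W1, b1 = rng.normal(size=(4, 8)) * 0.5, np.zeros(8)    # hidden layer
W2, b2 = rng.normal(size=(8, 1)) * 0.5, np.zeros(1)    # output layer
lr = 0.05                                              # learning rate

for step in range(200):                                # preset quantity of learning passes
    # Signal forward propagation: input layer -> hidden layer -> output layer.
    h = np.tanh(x @ W1 + b1)
    out = h @ W2 + b2

    # Error back propagation: propagate the output error back layer by layer
    # to obtain an error signal for the units at each layer.
    g_out = 2 * (out - Y) / len(x)                     # gradient of mean squared error
    g_W2, g_b2 = h.T @ g_out, g_out.sum(axis=0)
    g_h = (g_out @ W2.T) * (1 - h ** 2)                # error signal at the hidden layer
    g_W1, g_b1 = x.T @ g_h, g_h.sum(axis=0)

    # Correct the weight of each unit based on its error signal.
    W1 -= lr * g_W1; b1 -= lr * g_b1
    W2 -= lr * g_W2; b2 -= lr * g_b2
```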
The AI algorithm selected and the model used vary with the type of solution. Currently, a main method for improving the performance of a 5G network through AI is to enhance or replace an existing algorithm or processing module with an algorithm or model based on a neural network. In specific scenarios, algorithms and models based on neural networks can achieve better performance than deterministic algorithms. Commonly used neural networks include a deep neural network, a convolutional neural network, a recurrent neural network, and the like. With the help of existing AI tools, neural networks can be built, trained, and verified.
It should be understood that training of an AI model requires the support of a large volume of data. If the data volume is insufficient, the training process of the model may not converge, or the trained model may be overfitted. However, in many scenarios in wireless communication, labeled data cannot be obtained, or the volume of labeled data is small (due to collection overheads, transmission overheads, and the like). Therefore, the problem of training a model with insufficient labeled data needs to be resolved in wireless communication, and the model training processing method of this application is provided to this end.
A model training processing method provided in the embodiments of this application is described in detail below through some embodiments and application scenarios thereof with reference to the accompanying drawings.
As shown in
Step 301: A first device obtains first information, the first information including first data.
Step 302: The first device processes the first data by using a first model to obtain second data.
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
In this embodiment of this application, the first device may be a network side device or a terminal, and the first data may be at least part of the data for training the second model. The first data may be labeled data or unlabeled data. The first model may be understood as a model that enhances the training data of the second model. For example, when the first data is labeled data, the first model is used for expanding the first data to obtain the second data with a larger data volume; when the first data is unlabeled data, the first model is used for labeling the first data. In either case, after the first data is processed through the first model, more labeled training data is available, thereby ensuring that there is enough labeled training data to train the second model, so that the second model can effectively converge during the training process, and the performance of the second model is improved.
For example, in some embodiments, the first data is N pieces of labeled data. In this case, after the N pieces of labeled data are inputted into the first model, M pieces of labeled data (that is, the second data is M pieces of labeled data) may be outputted. In this case, M is greater than N, and usually, M is much greater than N.
For another example, in some embodiments, the first data is M pieces of unlabeled data. In this case, after the M pieces of unlabeled data are inputted into the first model, M pieces of labeled data (that is, the second data is M pieces of labeled data) may be outputted.
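A minimal sketch of these two cases follows. This application does not specify how the first model is implemented internally, so noise-based replication stands in for the labeled-data expansion and a generic callable model stands in for the labeling step; both are illustrative assumptions.

```python
import numpy as np

def expand_labeled(first_data, labels, factor=10, noise=0.01, seed=0):
    """Labeled case: N labeled samples in, M = N * factor labeled samples out."""
    rng = np.random.default_rng(seed)
    xs = np.repeat(first_data, factor, axis=0)
    xs = xs + rng.normal(scale=noise, size=xs.shape)   # perturb each copy slightly
    ys = np.repeat(labels, factor, axis=0)
    return xs, ys                                      # second data, with M > N

def label_unlabeled(first_data, first_model):
    """Unlabeled case: M unlabeled samples in, M labeled samples out."""
    return first_data, first_model(first_data)         # the first model supplies labels

# Usage with toy shapes; a real first model would replace the callable.
xs, ys = expand_labeled(np.ones((3, 2)), np.array([0, 1, 0]))   # 30 labeled samples from 3
```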
Optionally, the first information may be stored in the first device or the second device. In addition, the first model may be stored in the first device, or may be stored in the second device and sent by the second device to the first device. When the first device is a core network device, the second device may be a base station. When the first device is a base station (for example, a base station A), the second device may be another base station (for example, a base station B) or a terminal. When the first device is a terminal (for example, a terminal A), the second device may be a base station or another terminal (for example, a terminal B). It should be understood that, in this embodiment of this application, the first information and the first model may be stored in different devices.
In this embodiment of this application, a first device obtains first information, the first information including first data; and the first device processes the first data by using a first model to obtain second data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data. In this way, more labeled training data can be obtained by using the first model, so that the second model can effectively converge in a training process, thereby improving performance of the second model. Therefore, reliability of AI-based wireless communication can be improved in this embodiment of this application.
Optionally, in some implementations, the first model is stored in the second device. In this case, before the processing, by the first device, the first data by using a first model, the method further includes: receiving, by the first device, second information from the second device, the second information including the first model.
In this embodiment of this application, that the second information includes the first model may be understood as follows: The second information includes a parameter of the first model or address information of the first model, so that the first device can obtain the first model.
Further, in some embodiments, the second information further includes at least one of configuration information and first assistance information. The configuration information is used for indicating a usage manner of the first model. The first assistance information includes statistical information and environment information required for running the first model. The statistical information is used for representing a distribution feature of an input of the first model.
Optionally, the configuration information indicates the usage manner of the first model, and may include, for example, the input data dimension or input data format, the output data dimension or output data format, the input data volume, and the output data volume of the first model. The environment information may be understood as environment information related to the data augmentation algorithm of the first model, and may include the software environment, the hardware environment, and the like required for running the model, for example, the software architecture, the hardware architecture, the power demand, the storage demand, and the computing power demand that need to be used. The statistical information may include distribution feature information such as a mean value and a variance of the inputs of the model.
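As a concrete illustration only, the configuration information and first assistance information could be represented as structured fields such as the following; the field names are hypothetical, since this application does not define a concrete format.

```python
from dataclasses import dataclass

@dataclass
class ConfigurationInfo:        # usage manner of the first model
    input_format: str           # input data dimension/format
    output_format: str          # output data dimension/format
    input_data_volume: int      # e.g. N samples in
    output_data_volume: int     # e.g. M samples out

@dataclass
class FirstAssistanceInfo:      # statistical plus environment information
    input_mean: float           # distribution features of the model input
    input_variance: float
    software_env: str           # software architecture required to run the model
    hardware_env: str           # hardware architecture
    compute_demand: str         # power/storage/computing power demands
```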
Optionally, in some embodiments, before the receiving, by the first device, second information from a second device, the method further includes: sending, by the first device, a first request message to the second device, the first request message being used for requesting the second information.
In this embodiment of this application, when the first device needs to expand the first data, the second information may be obtained in a requested manner, so that the second information is obtained in a more targeted way. Certainly, in other embodiments, the second device may alternatively proactively send the second information to the first device. For example, when the second device establishes a connection to the first device, the second device sends the second information to the first device. Alternatively, the second device may broadcast the second information, and the first device directly obtains the second information from the broadcast information when needed.
Optionally, in some embodiments, after the processing, by the first device, the first data by using a first model to obtain second data, the method further includes: training, by the first device, the second model based on the second data to obtain a third model.
In this embodiment of this application, the first device may train the second model to obtain the third model. The third model may be used on the first device or the second device.
It should be understood that the second model may be sent by the second device to the first device, or may be preconfigured in the first device in a protocol. This is not further limited herein.
Optionally, in some embodiments, if the third model is to be used on the second device, after the training, by the first device, the second model based on the second data to obtain a third model, the method further includes: sending, by the first device, the third model to the second device.
In this embodiment of this application, the sending the third model may be understood as sending a parameter of the third model or sending address information of the third model. This is not further limited herein. In this way, when inference of a corresponding service is performed by using the trained third model, accuracy of the inference can be improved, thereby ensuring communication reliability.
It should be noted that, initial training data of the second model may include the first data, and may further include labeled third data. If the first data is labeled data, the training data used may include the first data, the second data, and the third data during the training of the second model. If the first data is unlabeled data, the training data used may include the second data and the third data during the training of the second model.
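Expressed as a minimal sketch, with list-based datasets and an explicit labeled/unlabeled flag as illustrative assumptions:

```python
def build_training_set(first_data, second_data, third_data, first_is_labeled):
    """Assemble training data for the second model per the rule above.

    first_data:  initial data (labeled or unlabeled)
    second_data: data obtained from the first data through the first model
    third_data:  additional labeled data
    """
    if first_is_labeled:
        # Labeled first data is itself usable alongside the expanded data.
        return first_data + second_data + third_data
    # Unlabeled first data is excluded; only its labeled derivative is used.
    return second_data + third_data
```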
Optionally, based on different storage locations of the first information, corresponding manners of obtaining the first information are different. For example, in some embodiments, the obtaining, by a first device, first information includes either of the following: receiving, by the first device, the first information from a second device; or obtaining, by the first device, the first information stored in the first device.
In this embodiment of this application, when the first information is stored in the second device, the first device may receive the first information from the second device. When the first information is stored in the first device, the first device may obtain the first information locally.
Optionally, before the receiving, by the first device, the first information from a second device, the method further includes: sending, by the first device, instruction information to the second device, the instruction information being used for instructing the second device to send the first information.
In this embodiment of this application, the first information is stored in the second device, and the first device needs to instruct the second device to send the first information. For example, the first device may schedule the second device to send the first information.
Optionally, in some embodiments, before the receiving, by the first device, the first information from a second device, the method further includes: receiving, by the first device, a second request message from the second device, the second request message being used by the second device to request to send the first information.
In this embodiment of this application, before the second device sends the first information, the second device may request the first device to send the first information. Subsequently, the second device may send the first information on a preconfigured resource, or the first device may dynamically schedule the second device to send the first information. For example, the second device is instructed, through the instruction information, to send the first information.
Optionally, in a case that the first device receives the first information from the second device, after the processing, by the first device, the first data by using a first model to obtain second data, the method further includes: sending, by the first device, third information to the second device, the third information including the second data.
In this embodiment of this application, the process of training the second model is performed by the second device, and the first device needs to send the second data to the second device, for the second device to train the second model to obtain the third model. The training of the second model by the second device is similar to the training of the second model by the first device. For the definition of the training data, refer to the foregoing example. Details are not described herein again.
Further, in some embodiments, the third information further includes identification information, and the identification information is used for indicating that the second data is obtained based on the first model.
Optionally, in some embodiments, the first information further includes second assistance information, and the second assistance information is used for representing a distribution feature of the first data.
In this embodiment of this application, the second assistance information may include information representing distribution features such as a mean value and a variance of the first data.
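One plausible use of such distribution features, offered here as an assumption rather than something this application specifies, is to standardize data before it is fed to a model:

```python
import numpy as np

def standardize(data, mean, variance, eps=1e-8):
    """Shift and scale data using reported distribution features (mean/variance)."""
    return (data - mean) / np.sqrt(variance + eps)

# Toy usage: the mean and variance play the role of second assistance information.
x = np.array([4.0, 6.0, 8.0])
x_std = standardize(x, mean=6.0, variance=np.var([4.0, 6.0, 8.0]))
```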
For better understanding of this application, description is provided below through some examples.
In some embodiments, a device A sends a first model to a device B, the device B performs data enhancement by using the received first model and first data of the device B to obtain second data, and the device B then trains a second model based on the second data. As shown in
Step 401: A device B sends a first message to a device A, the first message being used for requesting a first model, a configuration parameter, and first assistance information.
Step 402: The device A sends the first model, the configuration parameter, and the first assistance information to the device B.
Step 403: The device B enhances first data based on the first model, the configuration parameter, and the first assistance information to obtain second data.
Step 404: The device B trains a second model based on the second data to obtain a third model.
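For orientation only, the following sketch mirrors steps 401 to 404 with in-process objects; the class and method names, and the callable model and trainer, are placeholders for the actual signaling and training procedures rather than anything defined in this application.

```python
class DeviceA:
    """Holds the first model plus its configuration parameter and assistance info."""
    def __init__(self, first_model, config, assistance):
        self.first_model, self.config, self.assistance = first_model, config, assistance

    def on_first_message(self):
        # Step 402: deliver the first model, configuration parameter, and assistance info.
        return self.first_model, self.config, self.assistance

class DeviceB:
    """Holds the first data and trains the second model locally."""
    def __init__(self, first_data):
        self.first_data = first_data

    def run(self, device_a, train_second_model):
        # Step 401: request the first model and its parameters from device A.
        first_model, config, assistance = device_a.on_first_message()
        # Step 403: enhance the first data to obtain the second data.
        second_data = first_model(self.first_data, config, assistance)
        # Step 404: train the second model on the second data to obtain the third model.
        return train_second_model(second_data)
```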
In some embodiments, the device B sends the first data to the device A, the device A performs data enhancement by using the received first data and a first model trained by the device A, to obtain second data, and then the device A trains a second model based on the second data to obtain a third model. Finally, the third model is sent to the device B. As shown in
Step 501: A device B sends a second message to a device A, the second message being used for requesting to send first data.
Step 502: The device A sends a third message to the device B, the third message being used for instructing to send the first data.
Step 503: The device B sends the first data to the device A.
Step 504: The device A enhances the first data based on a first model of the device A, a configuration parameter, and first assistance information to obtain second data.
Step 505: The device A trains a second model based on the second data to obtain a third model.
Step 506: The device A sends the third model to the device B.
In some embodiments, the device B sends the first data to the device A, the device A performs data enhancement by using the received first data and a first model trained by the device A, to obtain second data, the device A then sends the second data to the device B, and the device B trains a second model by using the received second data. As shown in
Step 601: A device B sends a second message to a device A, the second message being used for requesting to send first data.
Step 602: The device A sends a third message to the device B, the third message being used for instructing to send the first data.
Step 603: The device B sends the first data to the device A.
Step 604: The device A enhances the first data based on a first model of the device A, a configuration parameter, and first assistance information to obtain second data.
Step 605: The device A sends the second data to the device B.
Step 606: The device B trains a second model based on the second data to obtain a third model.
Referring to
Step 701: A second device sends second information to a first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data.
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Optionally, the second information further includes at least one of configuration information and first assistance information. The configuration information is used for indicating a usage manner of the first model. The first assistance information includes statistical information and environment information required for running the first model. The statistical information is used for representing a distribution feature of an input of the first model.
Optionally, before the sending, by the second device, second information to a first device, the method further includes: receiving, by the second device, a first request message from the first device, the first request message being used for requesting the second information.
Referring to
Step 801: A second device sends first information to a first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model.
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Optionally, after the sending, by a second device, first information to a first device, the method further includes: receiving, by the second device, a third model from the first device, the third model being obtained by the first device by training the second model based on the second data.
Optionally, after the sending, by a second device, first information to a first device, the method further includes: receiving, by the second device, third information from the first device, the third information including the second data.
Optionally, the third information further includes identification information, and the identification information is used for indicating that the second data is obtained based on the first model.
Optionally, after the receiving, by the second device, third information from the first device, the method further includes: training, by the second device, the second model based on the second data to obtain a third model.
Optionally, before the sending, by a second device, first information to a first device, the method further includes: receiving, by the second device, instruction information from the first device, the instruction information being used for instructing the second device to send the first information.
Optionally, before the sending, by a second device, first information to a first device, the method further includes: sending, by the second device, a second request message to the first device, the second request message being used by the second device to request to send the first information.
Optionally, the first information further includes second assistance information, and the second assistance information is used for representing a distribution feature of the first data.
The model training processing method provided in the embodiments of this application may be performed by a model training processing apparatus. In the embodiments of this application, an example in which the model training processing apparatus performs the model training processing method is used to describe the model training processing apparatus provided in the embodiments of this application.
Referring to
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Optionally, the model training processing apparatus 900 further includes:
Optionally, the second information further includes at least one of configuration information and first assistance information. The configuration information is used for indicating a usage manner of the first model. The first assistance information includes statistical information and environment information required for running the first model. The statistical information is used for representing a distribution feature of an input of the first model.
Optionally, the model training processing apparatus 900 further includes:
Optionally, the model training processing apparatus 900 further includes:
Optionally, the model training processing apparatus 900 further includes:
Optionally, the obtaining module 901 includes either of the following: a receiving unit, configured to receive the first information from a second device; or an obtaining unit, configured to obtain the first information stored in the first device.
Optionally, the model training processing apparatus 900 further includes:
Optionally, the receiving unit is further configured to: receive a second request message from the second device, the second request message being used by the second device to request to send the first information.
Optionally, the model training processing apparatus 900 further includes:
Optionally, the third information further includes identification information, and the identification information is used for indicating that the second data is obtained based on the first model.
Optionally, the first information further includes second assistance information, and the second assistance information is used for representing a distribution feature of the first data.
Referring to
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Optionally, the second information further includes at least one of configuration information and first assistance information. The configuration information is used for indicating a usage manner of the first model. The first assistance information includes statistical information and environment information required for running the first model. The statistical information is used for representing a distribution feature of an input of the first model.
Optionally, the model training processing apparatus 1000 further includes:
Referring to
Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Optionally, the model training processing apparatus 1100 further includes:
Optionally, the model training processing apparatus 1100 further includes:
Optionally, the third information further includes identification information, and the identification information is used for indicating that the second data is obtained based on the first model.
Optionally, the model training processing apparatus 1100 further includes:
Optionally, the model training processing apparatus 1100 further includes:
Optionally, the third sending module 1101 is further configured to send a second request message to the first device, the second request message being used by the second device to request to send the first information.
Optionally, the first information further includes second assistance information, and the second assistance information is used for representing a distribution feature of the first data.
The model training processing apparatus in this embodiment of this application may be an electronic device, for example, an electronic device with an operating system, or may be a component in an electronic device, for example, an integrated circuit or a chip. The electronic device may be a terminal or a device other than a terminal. For example, the terminal may include, but is not limited to, the types of the terminal 11 listed above. The device other than a terminal may be a server, a network attached storage (Network Attached Storage, NAS), or the like. This is not specifically limited in this embodiment of this application.
The model training processing apparatus provided in this embodiment of this application can implement the processes implemented in the method embodiments of
Optionally, as shown in
An embodiment of this application further provides a terminal, including a processor and a communication interface. When the terminal is a first device, the processor is configured to obtain first information, the first information including first data; and process the first data by using a first model to obtain second data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the communication interface is configured to send second information to the first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the communication interface is configured to send first information to the first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
This embodiment of the terminal corresponds to the foregoing method embodiment of the terminal side. Implementation processes and implementations of the foregoing method embodiment are all applicable to this embodiment of the terminal, and the same technical effects can be achieved. Specifically,
The terminal 1300 includes, but is not limited to, at least some of the following components: a radio frequency unit 1301, a network module 1302, an audio output unit 1303, an input unit 1304, a sensor 1305, a display unit 1306, a user input unit 1307, an interface unit 1308, a memory 1309, and a processor 1310.
A person skilled in the art may understand that, the terminal 1300 may further include a power supply (such as a battery) for supplying power to each component. The power supply may be logically connected to the processor 1310 by using a power management system, thereby implementing functions, such as charging, discharging, and power consumption management, by using the power management system. The terminal structure shown in
It should be understood that in this embodiment of this application, the input unit 1304 may include a graphics processing unit (Graphics Processing Unit, GPU) 13041 and a microphone 13042. The graphics processing unit 13041 processes image data of still pictures or videos captured by an image capture apparatus (such as a camera) in a video capture mode or an image capture mode. The display unit 1306 may include a display panel 13061. The display panel 13061 may be configured in the form of a liquid crystal display, an organic light-emitting diode, or the like. The user input unit 1307 includes at least one of a touch panel 13071 and another input device 13072. The touch panel 13071 is also referred to as a touchscreen. The touch panel 13071 may include two parts: a touch detection apparatus and a touch controller. The other input device 13072 may include, but is not limited to, a physical keyboard, a functional button (such as a sound volume control button or a power button), a trackball, a mouse, or a joystick. Details are not described herein.
In this embodiment of this application, after receiving downlink data from a network side device, the radio frequency unit 1301 may transmit the downlink data to the processor 1310 for processing. In addition, the radio frequency unit 1301 may send uplink data to the network side device. Usually, the radio frequency unit 1301 includes, but is not limited to, an antenna, an amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like.
The memory 1309 may be configured to store a software program or instruction and various data. The memory 1309 may mainly include a first storage area storing a program or an instruction and a second storage area storing data. The first storage area may store an operating system, an application program or an instruction required by at least one function (for example, a sound playing function or an image playing function), and the like. In addition, the memory 1309 may include a volatile memory or a non-volatile memory, or the memory 1309 may include both a volatile memory and a non-volatile memory. The non-volatile memory may be a read-only memory (Read-Only Memory, ROM), a programmable read-only memory (Programmable ROM, PROM), an erasable programmable read-only memory (Erasable PROM, EPROM), an electrically erasable programmable read-only memory (Electrically EPROM, EEPROM), or a flash memory. The volatile memory may be a random access memory (Random Access Memory, RAM), a static random access memory (Static RAM, SRAM), a dynamic random access memory (Dynamic RAM, DRAM), a synchronous dynamic random access memory (Synchronous DRAM, SDRAM), a double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDRSDRAM), an enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), a synchronous link dynamic random access memory (Synch link DRAM, SLDRAM), and a direct rambus random access memory (Direct Rambus RAM, DRRAM). The memory 1309 in this embodiment of this application includes, but is not limited to, these memories and any other suitable types of memories.
The processor 1310 may include one or more processing units. Optionally, the processor 1310 integrates an application processor and a modem processor. The application processor mainly processes operations related to the operating system, a user interface, the application program, and the like. The modem processor mainly processes a wireless communication signal, and is, for example, a baseband processor. It may be understood that, the modem processor may alternatively not be integrated in the processor 1310.
When the terminal is a first device, the processor 1310 is configured to obtain first information, the first information including first data; and process the first data by using a first model to obtain second data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the radio frequency unit 1301 is configured to send second information to the first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the terminal is a second device, the radio frequency unit 1301 is configured to send first information to the first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
An embodiment of this application further provides a network side device, including a processor and a communication interface. When the network side device is a first device, the processor is configured to obtain first information, the first information including first data; and process the first data by using a first model to obtain second data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the network side device is a second device, the communication interface is configured to send second information to the first device, the second information including a first model, and the first model being used by the first device to obtain second data based on first data. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
Alternatively, when the network side device is a second device, the communication interface is configured to send first information to the first device, the first information including first data, and the first data being used by the first device to obtain second data based on a first model. Both the first data and the second data are usable for training a second model, the second model is a service model, and the second data meets at least one of the following: in a case that the first data is unlabeled data, the second data is labeled data; and a data volume of the second data is greater than a data volume of the first data in a case that the first data is labeled data.
This network side device embodiment corresponds to the foregoing network side device method embodiment. Implementation processes and implementations of the foregoing method embodiment may all be applied to this network side device embodiment, and the same technical effects can be achieved.
Specifically, an embodiment of this application further provides a network side device. As shown in
The method performed by the network side device in the foregoing embodiment may be implemented in the baseband apparatus 1403. The baseband apparatus 1403 includes a baseband processor.
The baseband apparatus 1403 may include, for example, at least one baseband board. A plurality of chips are disposed on the baseband board. As shown in
The network side device may further include a network interface 1406. The interface is, for example, a common public radio interface (Common Public Radio Interface, CPRI).
Specifically, the network side device 1400 of this embodiment of this application further includes: an instruction or a program stored in the memory 1405 and runnable on the processor 1404. The processor 1404 calls the instruction or the program in the memory 1405 to perform the method performed by each module shown in
An embodiment of this application further provides a readable storage medium, storing a program or an instruction. The program or the instruction, when executed by a processor, implements the processes of the foregoing model training processing method embodiment, and the same technical effects can be achieved. To avoid repetition, details are not described herein again.
The processor is the processor in the terminal described in the foregoing embodiment. The readable storage medium includes a computer-readable storage medium, such as a computer read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
An embodiment of this application further provides a chip, including a processor and a communication interface. The communication interface is coupled to the processor. The processor is configured to run a program or an instruction to implement the processes of the foregoing model training processing method embodiment, and the same technical effects can be achieved. To avoid repetition, details are not described herein again.
It should be understood that, the chip mentioned in this embodiment of this application may also be referred to as a system on a chip, a system chip, a chip system, a system-on-chip, or the like.
An embodiment of this application further provides a computer program/program product, stored in a storage medium and executed by at least one processor to implement the processes of the model training processing method embodiment, and the same technical effects can be achieved. To avoid repetition, details are not described herein again.
It should be noted that, in this specification, the term “include”, “comprise”, or any other variant thereof is intended to cover a non-exclusive inclusion, so that a process, method, article, or apparatus including a series of elements includes not only those elements, but also other elements not expressly listed, or elements inherent to such a process, method, article, or apparatus. Without more restrictions, the elements defined by the sentence “including a . . . ” do not exclude the existence of other identical elements in the process, method, article, or apparatus including the elements. In addition, it should be noted that, the scope of the methods and apparatuses in the implementations of this application is not limited to performing the functions in the order shown or discussed, but may further include performing the functions in a substantially simultaneous manner or in a reverse order depending on the functions involved. For example, the described methods may be performed in an order different from that described, and various steps may be added, omitted, or combined. In addition, features described with reference to some examples may be combined in other examples.
According to the descriptions in the foregoing implementations, a person skilled in the art may clearly learn that the method according to the foregoing embodiment may be implemented by means of software plus a necessary universal hardware platform, or certainly, by using hardware. However, in many cases, the former is a preferred implementation. Based on such an understanding, the technical solutions of this application essentially, or a part contributing to the related art, may be implemented in a form of a computer software product. The computer software product is stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disc), and includes several instructions for instructing a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to perform the methods according to the embodiments of this application.
The embodiments of this application have been described above with reference to the accompanying drawings, but this application is not limited to the foregoing specific implementations. The foregoing specific implementations are only illustrative rather than restrictive. Under the inspiration of this application, without departing from the purpose of this application and the scope of protection of the claims, a person of ordinary skill in the art can still make many variations, which all fall within the protection scope of this application.
| Number | Date | Country | Kind |
|---|---|---|---|
| 202210489247.9 | May 2022 | CN | national |
This application is a continuation of International Patent Application No. PCT/CN2023/092028, filed on May 4, 2023, which claims priority to Chinese Patent Application No. 202210489247.9 filed in China on May 6, 2022, both of which are hereby incorporated by reference in their entireties.
| Number | Date | Country | |
|---|---|---|---|
| Parent | PCT/CN2023/092028 | May 2023 | WO |
| Child | 18935694 | US |