MODEL TEST METHOD AND APPARATUS

BACKGROUND

In a wireless communication network, for example, in a mobile communication network, increasingly diversified services are supported by the network. Therefore, increasingly diversified usage is to be met. For example, the network is to be capable of supporting an ultra-high rate, ultra-low latency, and/or a massive connection. These features make network planning, network configuration, and/or resource scheduling increasingly complex. In addition, because a function of the network is increasingly powerful, for example, a supported spectrum is increasingly high, and new technologies such as a high-order multiple-input multiple-output (multiple input multiple output, MIMO) technology, beamforming, and/or beam management are supported, network energy saving becomes a hot research topic. These new usage, scenarios, and features bring an unprecedented challenge to network planning, operation and maintenance, and efficient operation. To meet this challenge, an artificial intelligence technology is introduced into the wireless communication network, to implement network intelligence. Based on this, how to effectively implement artificial intelligence in the network is a problem worth studying.

SUMMARY

Embodiments described herein provide a model test method and an apparatus. A network device tests an AI model deployed in a terminal, so that the network device evaluates, manages, and controls the AI model deployed in the terminal.

According to a first aspect, a model test method is provided. The method is performed by a terminal, or is performed by a component (a processor, a chip, or another component) disposed in a terminal, a software module, or the like. The method includes: receiving first information from a network device, where the first information is used to determine at least one test set: testing an AI model based on the at least one test set, to obtain a test result; and sending second information to the network device, where the second information indicates the test result of the AI model.

According to the foregoing method, the terminal determines the at least one test set based on the first information sent by the network device, and tests the AI model by using the at least one test set, to obtain the test result. In addition, the terminal feeds back the test result of the AI model to the network device, so that the network device tests an AI model deployed in the terminal. Therefore, the network device evaluates, manages, and controls the AI model used by the terminal.

In a design, the first information includes indication information of the at least one test set. Alternatively, the first information includes a reference signal, and the method includes: determining the at least one test set based on the reference signal.

According to the foregoing method, the terminal determines the at least one test set based on the indication information of the at least one test set, the reference signal, or the like. Particularly, in response to the terminal device determining the at least one test set based on the reference signal, the network device does not separately send a test set to the terminal device. The terminal determines the test set, and the like by using the reference signal. Therefore, signaling overheads are reduced.

In a design, the second information indicates at least one of the following: AI models participating in the test, a test result corresponding to each AI model, or a test set corresponding to each test result.

In a design, whether the AI model meets a performance goal is evaluated by the network device. For example, the network device determines, based on the test result reported by the terminal, whether the AI models participating in the test meets the goal. Subsequently, the network device sends first indication information to the terminal based on whether the AI models participating in the test meets the goal, and the like.

Optionally, in this design, the test result includes a first test result. The first test result indicates an output that is of the AI model and that is obtained based on the at least one test set: or the first test result indicates a performance indicator of the AI model obtained by testing the AI model based on the at least one test set.

Optionally, in this design, the first indication information indicates that each of the AI models participating in the test meets or does not meet the performance goal, indicates an AI model that meets the performance goal and that is in the AI models participating in the test, indicates an AI model that does not meet the performance goal and that is in the AI models participating in the test, indicates an AI model to be subsequently used by the terminal, or indicates the terminal to perform a corresponding operation in a non-AI manner.

According to the foregoing method, the terminal reports the test result to the network device. The network device determines, based on the test result, whether the AI models participating in the performance test meets the performance goal, and sends the first indication information based on performance information of the AI model. In this design, in an aspect, the network device evaluates performance of the AI model, so that the network device conveniently manages and schedules the AI model in the terminal in a unified manner. In another aspect, the terminal directly reports the test result to the network device without another evaluation operation. This reduces power consumption of the terminal.

In another design, specifically, the terminal evaluates the AI model deployed by the terminal. In this design, the network device is to indicate a performance goal for the AI model to the terminal. For example, the terminal receives second indication information from the network device. The second indication information indicates the performance goal for the AI model. Alternatively, the performance goal for the AI model is specified in a protocol. The terminal tests the AI model, and determines a test result based on the performance goal for the AI model. The test result is referred to as a second test result. The second test result indicates that each of the AI models participating in the test meets or does not meet the performance goal, indicates an AI model that meets the performance goal and that is in the AI models participating in the test, indicates an AI model that does not meet the performance goal and that is in the AI models participating in the test, indicates an AI model to be subsequently used by the terminal, or indicates the terminal to perform a subsequent operation in a non-AI manner.

According to the foregoing method, the terminal directly evaluates performance of the AI model, and reports an evaluation result, namely, the second test result, to the network device. Therefore, an operation process of the network device is reduced.

In a design, the method further includes: sending request information to the network device. The request information is used to request the first information, or is used to request to test the AI model. Optionally, the request information includes indication information of an input format of the AI models participating in the test.

According to the foregoing method, the terminal notifies, by using the request information, the network device of the input format of the AI models participating in the test. The network device configures, for the terminal based on the input format of the AI model, test data that matches the format. Subsequently, the terminal tests the model by using the configured test data, and does not perform format conversion on training data. Therefore, the power consumption of the terminal is reduced.

According to a second aspect, a model test method is provided. The method is performed by the network device corresponding to the first aspect. For beneficial effects, refer to the first aspect. Details are not described again. The method is performed by the network device, or is performed by a component (a processor, a chip, or another component) disposed in the network device, a software module, or the like. The method includes: sending first information to a terminal, where the first information is used to determine at least one test set; and receiving second information from the terminal, where the second information indicates a test result of an AI model, and the test result corresponds to the at least one test set.

For descriptions of the first information, the second information, and the test result, refer to the first aspect. Details are not described again.

In a design, the method further includes: receiving request information from the terminal. The request information is used to request the first information, or is used to request to test the AI model. Optionally, the request information includes indication information of an input format of an AI models participating in a test.

According to a third aspect, an apparatus is provided. For beneficial effects, refer to the descriptions of the first aspect. The apparatus is a terminal, an apparatus disposed in a terminal, or an apparatus that is used together with a terminal. In a design, the apparatus includes units in a one-to-one correspondence with the method/operation/step/actions described in the first aspect. The units are implemented by a hardware circuit, software, or a combination of a hardware circuit and software.

For example, the apparatus includes a processing unit and a communication unit. The processing unit and the communication unit perform corresponding functions in any design example of the first aspect.

Specifically, the communication unit is configured to receive first information from a network device. The first information indicates at least one test set.

The processing unit is configured to test an AI model based on the at least one test set, to obtain a test result.

The communication unit is further configured to send second information to the network device. The second information indicates the test result of the AI model. For specific execution processes of the processing unit and the communication unit, refer to the first aspect. Details are not described herein again.

For example, the apparatus includes a processor, configured to implement the method described in the first aspect. The apparatus further includes a memory, configured to store instructions and/or data. The memory is coupled to the processor. In response to executing program instructions stored in the memory, the processor implements the method described in the first aspect. The apparatus further includes a communication interface. The communication interface is used by the apparatus to communicate with another device. For example, the communication interface is a transceiver, a circuit, a bus, a module, a pin, or another type of communication interface, and the another device is a network device or the like. In at least one embodiment, the apparatus includes:

- a memory, configured to store program instructions:
- a communication interface, configured to: receive first information from a network device, where the first information indicates at least one test set; and send second information to the network device, where the second information indicates a test result of an AI model; and
- a processor, configured to test the artificial intelligence AI model based on the at least one test set, to obtain the test result.

For specific execution processes of the communication interface and the processor, refer to the descriptions in the first aspect. Details are not described again.

According to a fourth aspect, an apparatus is provided. For beneficial effects, refer to the descriptions in the second aspect. The apparatus is a network device, an apparatus disposed in a network device, or an apparatus that is used together with the network device. In a design, the apparatus includes units in a one-to-one correspondence with the method/operation/step/actions described in the second aspect. The unit is a hardware circuit, software, or a combination of a hardware circuit and software.

Specifically, the processing unit is configured to determine at least one test set.

The communication unit is configured to: send first information to a terminal, where the first information is used to determine the at least one test set; and receive second information from the terminal, where the second information indicates a test result of an AI model, and the test result corresponds to the at least one test set.

For specific execution processes of the processing unit and the communication unit, refer to the second aspect. Details are not described herein again.

For example, the apparatus includes a processor, configured to implement the method described in the second aspect. The apparatus further includes a memory, configured to store instructions and/or data. The memory is coupled to the processor. In response to executing program instructions stored in the memory, the processor implements the method described in the second aspect. The apparatus further includes a communication interface. The communication interface is used by the apparatus to communicate with another device. For example, the communication interface is a transceiver, a circuit, a bus, a module, a pin, or another type of communication interface, and the another device is a terminal or the like. In at least one embodiment, the apparatus includes:

- a memory, configured to store program instructions:
- a processor, configured to determine at least one test set; and
- a communication interface, configured to: send first information to a terminal, where the first information is used to determine at least one test set; and receive second information from the terminal, where the second information indicates a test result of an AI model, and the test result corresponds to the at least one test set.

For specific execution processes of the communication interface and the processor, refer to the descriptions in the second aspect. Details are not described again.

According to a fifth aspect, at least one embodiment further provides a computer-readable storage medium, including instructions. In response to the instructions being run on a computer, the computer is enabled to perform the method according to the first aspect or the second aspect.

According to a sixth aspect, at least one embodiment further provides a chip system. The chip system includes a processor, and further includes a memory, configured to implement the method according to the first aspect or the second aspect. The chip system includes a chip, or includes a chip and another discrete component.

According to a seventh aspect, at least one embodiment further provides a computer program product, including instructions. In response to the instructions being run on a computer, the computer is enabled to perform the method according to the first aspect or the second aspect.

According to an eighth aspect, at least one embodiment further provides a system. The system includes the apparatus according to the third aspect and the apparatus according to the fourth aspect.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a schematic diagram of a communication system according to at least one embodiment:

FIG. 2a is a schematic diagram of a neuron according to at least one embodiment:

FIG. 2b is a schematic diagram of a neural network according to at least one embodiment:

FIG. 2c is a schematic diagram of an AI model according to at least one embodiment:

FIG. 3a and FIG. 3b each are a schematic diagram of a network architecture according to at least one embodiment:

FIG. 4 is a schematic diagram of machine learning according to at least one embodiment:

FIG. 5 to FIG. 8 each are a flowchart of a model test according to at least one embodiment; and

FIG. 9 and FIG. 10 each are a schematic diagram of a communication apparatus according to at least one embodiment.

DESCRIPTION OF EMBODIMENTS

FIG. 1 is a schematic diagram of an architecture of a communication system 1000 to which at least one embodiment is applicable. As shown in FIG. 1, the communication system includes a radio access network 100 and a core network 200. Optionally, the communication system 1000 further includes an internet 300. The radio access network 100 includes at least one access network device (for example, 110a and 110b in FIG. 1), and further includes at least one terminal (for example, 120a to 120j in FIG. 1). The terminal is connected to an access network device in a wireless manner, and the access network device is connected to the core network in a wireless or wired manner. A core network device and the access network device are independent and different physical devices, a function of a core network device and a logical function of the access network device is integrated into a same physical device, or some functions of a core network device and some functions of the access network device is integrated into one physical device. A wired or wireless manner is used for a connection between terminals and a connection between access network devices. FIG. 1 is only a schematic diagram. The communication system further includes another network device, for example, further includes a wireless relay device, a wireless backhaul device, and the like. This is not shown in FIG. 1.

The access network device is a base station (base station), an evolved NodeB (evolved NodeB, eNodeB), a transmission reception point (transmission reception point, TRP), a next generation NodeB (next generation NodeB, gNB) in a 5th generation (5th generation, 5G) mobile communication system, an access network device in an open radio access network (open radio access network, O-RAN), a next generation base station in a 6th generation (6th generation, 6G) mobile communication system, a base station in a future mobile communication system, an access node in a wireless fidelity (wireless fidelity, Wi-Fi) system, or the like. Alternatively, the access network device is a module or unit that completes some functions of a base station, for example, is a central unit (central unit, CU), a distributed unit (distributed unit, DU), a central unit control plane (CU control plane, CU-CP) module, or a central unit user plane (CU user plane, CU-UP) module. The access network device is a macro base station (for example, 110a in FIG. 1), is a micro base station or an indoor base station (for example, 110b in FIG. 1), or is a relay node or a donor node. A specific technology and a specific device form used by the access network device are not limited in at least one embodiment.

In at least one embodiment, an apparatus configured to implement a function of the access network device is an access network device, or is an apparatus that supports the access network device in implementing the function, for example, a chip system, a hardware circuit, a software module, or a combination of a hardware circuit and a software module. The apparatus is installed in the access network device or is used in a manner of matching the access network device. In at least one embodiment, the chip system includes a chip, or includes a chip and another discrete component. For ease of description, the technical solutions provided in at least one embodiment are described below by using an example in which the apparatus configured to implement the function of the access network device is the access network device and the access network device is a base station.

(1) Protocol Layer Structure

Communication between an access network device and a terminal complies with a specific protocol layer structure. The protocol layer structure includes a control plane protocol layer structure and a user plane protocol layer structure. For example, the control plane protocol layer structure includes functions of protocol layers such as a radio resource control (radio resource control, RRC) layer, a packet data convergence protocol (packet data convergence protocol, PDCP) layer, a radio link control (radio link control, RLC) layer, a media access control (media access control, MAC) layer, and a physical layer. For example, the user plane protocol layer structure includes functions of protocol layers such as a PDCP layer, an RLC layer, a MAC layer, and a physical layer. In at least one embodiment, a service data adaptation protocol (service data adaptation protocol, SDAP) layer is further included above the PDCP layer.

Optionally, the protocol layer structure between the access network device and the terminal further includes an artificial intelligence (artificial intelligence, AI) layer, where the artificial intelligence layer is used for transmission of data related to an AI function.

(2) Central Unit (Central Unit, CU) and Distributed Unit (Distributed Unit, DU)

An access network device includes a CU and a DU. A plurality of DUs is controlled by one CU in a centralized manner. For example, an interface between the CU and the DU is referred to as an F1 interface. A control plane (control plane, CP) interface is F1-C, and a user plane (user plane, UP) interface is F1-U. A specific name of each interface is not limited in at least one embodiment. The CU and the DU are classified based on a protocol layer of a wireless network. For example, functions of a PDCP layer and a protocol layer above the PDCP layer are configured in the CU, and functions of protocol layers below the PDCP layer (for example, an RLC layer, a MAC layer, and the like) are configured in the DU. For another example, a function of a protocol layer above the PDCP layer is configured in the CU, and functions of the PDCP layer and a protocol layer below the PDCP layer are configured in the DU. This is not limited.

The classification of processing functions of the CU and the DU based on the protocol layer is merely an example, and there is other classification. For example, the CU or the DU has functions of more protocol layers through classification. For another example, the CU or the DU has some processing functions of the protocol layer through classification. In a design, some functions of the RLC layer and a function of a protocol layer above the RLC layer are configured in the CU, and a remaining function of the RLC layer and a function of a protocol layer below the RLC layer are configured in the DU. In another design, classification of functions of the CU or the DU is alternatively performed based on a service type or another system goal. For example, classification is performed based on a latency. A function whose processing time is to satisfy a latency goal is configured in the DU, and a function whose processing time does not satisfy the latency goal is configured in the CU. In another design, the CU alternatively has one or more functions of a core network. For example, the CU is disposed on a network side to facilitate centralized management. In another design, a radio unit (radio unit, RU) of the DU is disposed remotely. Optionally, the RU has a radio frequency function.

Optionally, the DU and the RU are classified at a physical layer (physical layer, PHY). For example, the DU implements higher-layer functions of the PHY layer, and the RU implements lower-layer functions of the PHY layer. For sending, functions of the PHY layer includes at least one of the following functions: cyclic redundancy check (cyclic redundancy check, CRC) code adding, channel coding, rate matching, scrambling, modulation, layer mapping, precoding, resource mapping, physical antenna mapping, and/or radio frequency sending. For receiving, functions of the PHY layer includes at least one of the following functions: CRC check, channel decoding, rate de-matching, descrambling, demodulation, layer de-mapping, channel detection, resource de-mapping, physical antenna de-mapping, and/or radio frequency receiving. The higher-layer functions of the PHY layer include some functions of the PHY layer. For example, the some functions are closer to the MAC layer. The lower-layer functions of the PHY layer include some other functions of the PHY layer. For example, the some other functions are closer to the radio frequency function. For example, the higher-layer functions of the PHY layer includes CRC code adding, channel coding, rate matching, scrambling, modulation, and layer mapping, and the lower-layer function of the PHY layer includes precoding, resource mapping, physical antenna mapping, and radio frequency sending functions. Alternatively, the higher-layer functions of the PHY layer includes CRC code adding, channel coding, rate matching, scrambling, modulation, layer mapping, and precoding, and the lower-layer functions of the PHY layer includes resource mapping, physical antenna mapping, and radio frequency sending functions. For example, the higher-layer functions of the PHY layer includes CRC check, channel decoding, rate de-matching, decoding, demodulation, and layer de-mapping, and the lower-layer functions of the PHY layer includes channel detection, resource de-mapping, physical antenna de-mapping, and radio frequency receiving functions. Alternatively, the higher-layer functions of the PHY layer includes CRC check, channel decoding, rate de-matching, decoding, demodulation, layer de-mapping, and channel detection, and the lower-layer functions of the PHY layer includes resource de-mapping, physical antenna de-mapping, and radio frequency receiving functions.

For example, a function of the CU is implemented by one entity, or is implemented by different entities. For example, the function of the CU is further divided. In other words, a control plane and a user plane are separated and implemented by using different entities: a control plane CU entity (namely, a CU-CP entity) and a user plane CU entity (namely, a CU-UP entity). The CU-CP entity and the CU-UP entity are coupled to the DU, to jointly complete a function of an access network device.

Optionally, any one of the DU, the CU, the CU-CP, the CU-UP, and the RU is a software module, a hardware structure, or a combination of a software module and a hardware structure. This is not limited. Different entities exist in different forms. This is not limited. For example, the DU, the CU, the CU-CP, and the CU-UP are software modules, and the RU is a hardware structure. These modules and methods performed by the modules also fall within the protection scope of this disclosure.

In at least one embodiment, the access network device includes the CU-CP, the CU-UP, the DU, and the RU. For example, at least one embodiment is executed by the DU, is executed by the DU and the RU, is executed by the CU-CP, the DU, and the RU, or is executed by the CU-UP, the DU, and the RU. This is not limited. Methods performed by the modules also fall within the protection scope of at least one embodiment.

The terminal is also referred to as a terminal device, user equipment (user equipment, UE), a mobile station, a mobile terminal, or the like. The terminal is widely used in communication in various scenarios, for example, including but not limited to at least one of the following scenarios: a device-to-device (device-to-device, D2D) scenario, a vehicle-to-everything (vehicle-to-everything, V2X) scenario, a machine-type communication (machine-type communication, MTC) scenario, an internet of things (internet of things, IOT) scenario, a virtual reality scenario, an augmented reality scenario, an industrial control scenario, an automatic driving scenario, a telemedicine scenario, a smart grid scenario, a smart furniture scenario, a smart office scenario, a smart wearable scenario, a smart transportation scenario, a smart city scenario, or the like. The terminal is a mobile phone, a tablet computer, a computer having a wireless transceiver function, a wearable device, a vehicle, an unmanned aerial vehicle, a helicopter, an airplane, a ship, a robot, a robot arm, a smart home device, or the like. A specific technology and a specific device form that are used by the terminal are not limited in at least one embodiment.

In at least one embodiment, an apparatus configured to implement a function of the terminal is a terminal, or is an apparatus that supports the terminal in implementing the function, for example, a chip system, a hardware circuit, a software module, or a combination of a hardware circuit and a software module. The apparatus is installed in the terminal or is used in a manner of matching the terminal. For ease of description, the technical solutions provided in at least one embodiment are described below by using an example in which the apparatus configured to implement the function of the terminal is the terminal.

The base station and the terminal are fixed or movable. The base station and/or the terminal is deployed on the land, including an indoor device, an outdoor device, a handheld device, or a vehicle-mounted device, is deployed on the water, or is deployed on an airplane, a balloon, and an artificial satellite in the air. Application scenarios of the base station and the terminal are not limited in at least one embodiment. The base station and the terminal is deployed in a same scenario or different scenarios. For example, the base station and the terminal are both deployed on the land. Alternatively, the base station is deployed on the land, and the terminal is deployed on the water. Examples are not described one by one.

Roles of the base station and the terminal are relative. For example, a helicopter or an unmanned aerial vehicle 120i in FIG. 1 is configured as a mobile base station. For a terminal 120j that accesses the radio access network 100 via 120i, a terminal 120i is a base station. However, for a base station 110a, 120i is a terminal. In other words, 110a and 120i communicate with each other via a radio air interface protocol. 110a and 120i alternatively communicate with each other via an interface protocol between base stations. In this case, 120i is also a base station relative to 110a. Therefore, both the base station and the terminal is collectively referred to as communication apparatuses, 110a and 110b in FIG. 1 each is referred to as a communication apparatus having a base station function, and 120a to 120j in FIG. 1 each is referred to as a communication apparatus having a terminal function.

Communication between the base station and the terminal, between base stations, or between terminals is performed by using a licensed spectrum, an unlicensed spectrum, both a licensed spectrum and an unlicensed spectrum, a spectrum below 6 gigahertz (gigahertz, GHz), a spectrum above 6 GHz, or both a spectrum below 6 GHz and a spectrum above 6 GHz. A spectrum resource used for wireless communication is not limited in at least one embodiment.

In at least one embodiment, the base station sends a downlink signal or downlink information to the terminal, where the downlink information is carried on a downlink channel; and the terminal sends an uplink signal or uplink information to the base station, where the uplink information is carried on an uplink channel. To communicate with the base station, the terminal establishes a wireless connection to a cell controlled by the base station. The cell that establishes the wireless connection to the terminal is referred to as a serving cell of the terminal. In response to communicating with the serving cell, the terminal is further interfered by a signal from a neighboring cell.

In at least one embodiment, an independent network element (for example, referred to as an AI network element or an AI node) is introduced into the communication system shown in FIG. 1, to implement an AI-related operation. The AI network element is directly connected to the access network device in the communication system, or is indirectly connected to the access network device via a third-party network element. The third-party network element is a core network element such as an authentication management function (authentication management function, AMF) network element or a user plane function (user plane function, UPF) network element. Alternatively, an AI function, an AI module, or an AI entity is configured in another network element in the communication system, to implement an AI-related operation. For example, the another network element is an access network device (such as a gNB), a core network device, a network management device (operation, administration and maintenance, OAM), or the like. In this case, a network element that performs the AI-related operation is a network element equipped with a built-in AI function. In at least one embodiment, an example in which another network element is equipped with a built-in AI function is used for description.

In at least one embodiment, the OAM is configured to operate, manage, and/or maintain the core network device, and/or is configured to operate, manage, and/or maintain the access network device.

In at least one embodiment, an AI model is a specific method for implementing the AI function, and the AI model represents a mapping relationship between an input and an output of the model. The AI model is a neural network or another machine learning model. The AI model is referred to as a model for short. The AI-related operation includes at least one of the following: data collection, model training, model information releasing, model inference (model inference), inference result releasing, or the like.

A neural network is used as an example. The neural network is a specific implementation form of a machine learning technology. According to the universal approximation theorem, the neural network approximates any continuous function in theory, so that the neural network has a capability of learning any mapping. In a conventional communication system, a communication module is to be designed with rich expert knowledge. However, a neural network-based deep learning communication system automatically discovers an implicit pattern structure from a large quantity of data sets, establish a mapping relationship between data, and obtain performance better than that of a conventional modeling method.

The idea of the neural network comes from a neuron structure of brain tissue. Each neuron performs a weighted summation operation on an input value of the neuron, and outputs a result of the weighted summation through an activation function. FIG. 2a is a schematic diagram of a neuron structure. An input of a neuron is x=[x₀, x₁, . . . , x_n], a weight corresponding to each input is w=[w, w₁, . . . , w_n], and an offset of weighted summation is b. An activation function has diversified forms. An activation function of a neuron is y=f(z)=max (0, z). In this case, an output of the neuron is y=f(Σ_i=0ⁱ⁼ⁿw_i*x_i+b)=max (0, Σ_i=0ⁱ⁼ⁿw_i*x_i+b). For another example, in response to an activation function of a neuron being y=f(z)=z, an output of the neuron is y=f(Σ_i=0ⁱ⁼ⁿw_i*x_i+b)=Σ_i=0ⁱ⁼ⁿw_i*x_i+b. b is any value such as a decimal, an integer (including 0, a positive integer, a negative integer, or the like), or a complex number. Activation functions of different neurons in the neural network are the same or different.

The neural network generally includes a multi-layer structure, and each layer includes one or more neurons. Increasing a depth and/or a width of the neural network improves an expression capability of the neural network, and provides more powerful information extraction and abstract modeling capabilities for a complex system. The depth of the neural network indicates a quantity of layers included in the neural network, and a quantity of neurons included in each layer is referred to as the width of the layer. FIG. 2b is a schematic diagram of a layer relationship of a neural network. In an implementation, the neural network includes an input layer and an output layer. The input layer of the neural network performs neuron processing on a received input, and then transfers a result to the output layer, and the output layer obtains an output result of the neural network. In another implementation, the neural network includes an input layer, a hidden layer, and an output layer. The input layer of the neural network performs neuron processing on a received input, and transfers a result to an intermediate hidden layer. The hidden layer then transfers a calculation result to the output layer or an adjacent hidden layer. Finally, the output layer obtains an output result of the neural network. One neural network includes one or more hidden layers that are sequentially connected. This is not limited. In a training process of the neural network, a loss function is defined. The loss function describes a gap or a difference between an output value of the neural network and an ideal target value. A specific form of the loss function is not limited in at least one embodiment. The training process of the neural network is a process of adjusting a neural network parameter, such as a quantity of layers and a width of the neural network, a weight of a neuron, a parameter in an activation function of the neuron, and/or the like, so that a value of the loss function is less than a threshold or meets a target goal.

FIG. 2c is a schematic diagram of an application framework of AI. A data source (data source) is configured to store training data and inference data. A model training node (model training host) analyzes or trains training data (training data) provided by the data source to obtain an AI model, and deploys the AI model in a model inference node (model inference host). Optionally, the model training node further updates the AI model deployed in the model inference node. The model inference node further feeds back related information about the deployed model to the model training node, so that the model training node performs optimization, updating, or the like on the deployed AI model.

The AI model represents a mapping relationship between an input and an output of the model. Obtaining an AI model through learning by the model training node is equivalent to obtaining the mapping relationship between the input and the output of the model through learning by the model training node by using the training data. The model inference node uses the AI model to perform inference based on inference data provided by the data source, to obtain an inference result. The method is also described as that the model inference node inputs the inference data to the AI model, and obtains an output by using the AI model. The output is the inference result. The inference result indicates a configuration parameter used (executed) by an execution object, and/or an operation performed by the execution object. The inference result is uniformly planned by an execution (actor) entity, and sent to one or more execution objects (for example, network entities) for execution.

FIG. 3a and FIG. 3b each are a schematic diagram of a network architecture according to at least one embodiment. An AI model is deployed in at least one of a core network device, an access network device, a terminal, OAM, or the like, and a corresponding function is implemented by using the AI model. In at least one embodiment, AI models deployed in different nodes are the same or different, and that the models are different includes at least one of the following differences. Structural parameters of the models are different. For example, quantities of layers and/or weights of the models are different: input parameters of the models are different: or output parameters of the models are different. That the input parameters of the models and/or the output parameters of the models are different is further described as that functions of the models are different. Different from FIG. 3a, in FIG. 3b, functions of the access network device are split into a CU and a DU. Optionally, one or more AI models is deployed in the CU, and/or one or more AI models is deployed in the DU. Optionally, the CU in FIG. 3b is further split into a CU-CP and a CU-UP. Optionally, one or more AI models are deployed in the CU-CP, and/or one or more AI models are deployed in the CU-UP. Optionally, in FIG. 3a or FIG. 3b, OAM of the access network device and OAM of the core network device is separately deployed.

In a conventional machine learning field, model training and model testing are usually completed by one device, and another device does not perform management and control. However, in a wireless network, there are a plurality of devices, and usually, one device (for example, a base station, a core network device, or OAM) manages and controls another device (for example, a base station or a terminal). For example, in response to the terminal using an AI model to process an operation related to a network-side device, the network-side device evaluates performance of the AI model used by the terminal, and determine whether the terminal is able to use the AI model to communicate with the network-side device. Particularly, in response to the AI model used by the terminal being used together with some operations or an AI model of the network side device, in response to the network-side device not evaluating, managing, and controlling the AI model used by the terminal at all, normal communication between the two parties fails, and even communication of another terminal is affected. However, in the current wireless network, there is no test method for the AI model, to evaluate, manage, and control the used AI model.

In a model test method provided in at least one embodiment, a first device tests an AI model deployed in a second device. The first device and the second device are not limited in at least one embodiment. For example, the first device is an access network device, the second device is a terminal, and the access network device tests an AI model in the terminal. Alternatively, the first device is OAM, the second device is a terminal, and the OAM tests an AI model in the terminal. Alternatively, the first device is a core network device, the second device is a terminal, and the core network device tests an AI model in the terminal. Alternatively, the first device is a core network device, the second device is an access network device, and the core network device tests an AI model deployed in the access network device. In subsequent specific descriptions of the model test method, an example in which the first device is an access network device, the second device is a terminal, and a base station tests an AI model deployed in the terminal is used for description. In response to the first device and the second device not directly communicating with each other, a third device assists the first device and the second device in communication between the first device and the second device. For example, the first device is a core network device or OAM, and the second device is a terminal. In this case, the first device sends a signal or information to the second device through forwarding by the third device (for example, an access network device), and the second device sends a signal or information to the first device through forwarding by the third device. The forwarding is transparent transmission, or processing (for example, adding a header, segmenting, or cascading) the to-be-forwarded signal or information and then forwarding processed signal or information.

In a design, as shown in FIG. 4, a machine learning process includes the following steps.

1. Perform division on a sample set, to obtain a training set, a validation set, and a test set.

For example, a division method includes a hold-out method, a cross-validation method, a bootstrapping method, or the like. The hold-out method is to divide the sample set into a plurality of mutually exclusive sets, and the plurality of mutually exclusive sets are referred to as a training set, a validation set, and a test set. The cross-validation method is to try to use different training sets, validation sets, and test sets that are obtained through division to perform a plurality of groups of different training, validation, testing, and the like on an AI model, to deal with problems such as a single test result being excessively one-sided and different training data. The bootstrapping method is to perform sampling with replacement on the sample set for n times to obtain n samples, and then use the n samples as a training set, where n is a positive integer. In an implementation, a part of the training set is obtained through random division as the validation set, and the remaining part is used as the training set. Because the sampling is sampling with replacement, there is a high probability that a part of the n samples is duplicate. In addition, a part of samples in the entire sample set are not sampled, and the part of samples that are not sampled are used as the test set. The training set, the validation set, and the test set is from one sample set, different sample sets, or the like. For example, the training set and the validation set are from one sample set, and the test set is from another sample set.

There is no intersection between any two of the training set, the validation set, and the test set. However, a case that there is an intersection between any two of the training set, the validation set, and the test set is not excluded in at least one embodiment. The training set is a set of training samples, and each training sample is one input in a model training process. The validation set is a set of validation samples. Each validation sample is one input in a model validation process. The test set is a set of test samples. Each test sample is one input in a model test process.

2. Perform model training by using the training set, to obtain an AI model, where this process is referred to as model training.

Model training is an important part of machine learning. The essence of machine learning is to learn some features of the training sample from the training sample, so that a difference between an output of the AI model and an ideal target value is minimized through training of the training set. Usually, even in response to a same network structure being used, weights and/or outputs of AI models trained by using different training sets are different. Therefore, to some extent, performance of the AI model is determined by composition and selection of the training set.

3. Validate the AI model by using the validation set, where this process is referred to as model validation.

Model validation is usually performed in the model training process. For example, each time one or more times of iterative (EPOCH) training is performed on the model, the validation set is used to validate the current model, to monitor a model training status, for example, to validate whether the model is under-fitted, overfitted, converged, or the like, and determine whether to end the training. Optionally, in the model validation process, a hyperparameter of the model is further adjusted. The hyperparameter is at least one of the following parameters of the model: a quantity of layers of a neural network, a quantity of neurons, an activation function, a loss function, or the like.

4. Test the AI model by using the test set, where this process is referred to as model test.

After the model training is completed, the test set is used to test a trained AI model, for example, evaluate a generalization capability of the AI model, determine whether performance of the AI model meets a goal, whether the AI model is available, or the like.

In the current wireless network, there is no available AI model test procedure, to achieve an objective that a first device (for example, a base station) evaluates, manages, and controls an AI model used by a second device (for example, a terminal). In at least one embodiment, a model test method is provided. In the method, a base station tests an AI model used by a terminal, so that the base station evaluates, manages, and controls the AI model used by the terminal. As shown in FIG. 5, a procedure of a model test method is provided, including at least the following steps.

Step 500: A terminal sends request information to a base station, where the request information is used to request to test an AI model, or is used to request first information in step 501.

Before step 500, the terminal obtains at least one AI models participating in the test. The AI models participating in the test are obtained by the terminal through training, or are sent by the base station to the terminal, or is downloaded by the terminal from another node, or the like. This is not limited.

Specifically, the terminal sends the request information in step 500 to the base station after obtaining the at least one AI models participating in the test, or after the terminal is ready for the test and tests the model.

In a design, the request information includes indication information of an input format of the AI models participating in the test. The base station determines format of a test sample in a test set based on the input format of the AI models participating in the test. For example, the base station converts the format of the test sample in the test set based on the input format of the AI model. A converted format of the test sample meets a goal for the input format of the AI model. Alternatively, the base station selects, from a plurality of test sets based on the input format of the AI models participating in the test, a test set that meets a goal, or the like. For example, there are a plurality of test sets, and formats of test samples in different test sets are different. The base station selects, from the plurality of test sets, a test set that includes a test sample whose format meets the goal for the input format of the AI model.

Alternatively, whether the request information includes indication information of an input format of the AI models participating in the test is not limited. In other words, the request information includes the indication information of the input format of the AI models participating in the test, or does not include the indication information of the input format of the AI models participating in the test. The base station sends, to the terminal, indication information of a test set including a test sample in any format, and the terminal converts the test sample in the test set into a test sample that meets the input format of the AI models participating in the test. Alternatively, a format of the test sample is predefined, and the AI models participating in the test are designed, trained, or the like based on the predefined format in response to being generated. The format of the test sample included in the test set sent by the base station to the terminal meets the input format of the AI models participating in the test.

Optionally, the input format of the AI model includes a type of input data of the AI model, a dimension of the input data, and/or the like. The type of the input data is content described in the input data. For example, the type of the input data is a radio channel, a channel feature, a received signal, received signal power, or the like. The dimension of the input data is a specific expression form of the input data. For example, the dimension of the input data is one or more of a time domain dimension, a frequency domain dimension, a space domain dimension, a beam domain dimension, a latency domain dimension, and the like. The dimension of the input data further includes a size of the input data in each dimension. For example, in response to the size of the input data in the time domain dimension being 2, and the size of the input data in the frequency domain dimension being 72, the input data is a matrix of 2*72. The dimension of the input data further includes a unit of the input data in each dimension. For example, the unit of the input data in the time domain dimension is a slot, an orthogonal frequency division multiplexing (orthogonal frequency division multiplexing, OFDM) symbol, or the like, and the unit of the input data in the frequency domain dimension is a subcarrier, a resource block (resource block, RB), or the like.

Step 501: The base station sends the first information to the terminal, where the first information is used to determine at least one test set.

For example, the base station sends the first information to the terminal in T1 time units after receiving the request information sent by the terminal.

In a design, the first information includes indication information of the at least one test set. For example, the base station sends the at least one test set to the terminal. The test set is generated by the base station, or is from another device. For example, the test set is from a core network device, OAM, remote intelligent communication, a wireless intelligent controller, an AI node, another device, or the like. Alternatively, the base station sends the indication information of the at least one test set to the terminal. The indication information indicates the terminal to select at least one test set from a plurality of predefined test sets to perform a test. The plurality of predefined test sets is stored in the terminal (for example, as specified in a protocol), or stored in a third-party node, where the third-party node is referred to as a test data management node. Optionally, the first information further includes network address information of the third-party node or test data. Optionally, for security consideration, the third-party node is not randomly accessed. The first information further includes authorization information for accessing the third-party node. The authorization information of the third-party node is referred to as test set access authorization information.

In another design, the first information includes a reference signal. The terminal determines the at least one test set based on the reference signal. Alternatively, the first information includes configuration information of a reference signal. The terminal receives the reference signal based on the configuration information of the reference signal, and determines the at least one test set based on the received reference signal. For example, the base station configures a reference signal pattern for the terminal, and notifies the terminal that a reference signal of the pattern is used by the terminal to obtain a test set related to a radio channel online. In response to detecting the reference signal of the pattern, the terminal performs channel estimation on the reference signal, to obtain the test set related to the radio channel, and the like. In this design, because the obtained test set is the test set related to the radio channel, an AI model related to the radio channel is tested by using the test set. Alternatively, the first information includes a downlink data signal sent on a predefined downlink channel. For example, the base station configures a downlink channel for the terminal, where the downlink channel is predefined or known to the terminal. The base station notifies the terminal that the downlink channel is used to obtain a test set online. In this case, the terminal receives the downlink data signal on the downlink channel, and determines at least one test set based on the downlink data signal.

Step 502: The terminal tests the AI model based on the at least one test set, to obtain a test result.

In at least one embodiment, the AI model is a model that implements a function in an AI manner, for example, predicts a movement track of the terminal, network load, or the like. A name of the AI model is not limited. For example, the AI model is referred to as a model for short, or is referred to as a first model, a neural network model, or the like. The AI model represents a mapping relationship between an input and an output of the model. As shown in FIG. 2c, obtaining an AI model through learning by a model training node is equivalent to obtaining a mapping relationship between an input and an output of the model through learning by the model training node by using training data. A model inference node uses the AI model to perform inference based on inference data provided by a data source, to obtain an inference result. In at least one embodiment, each model including a parameter that is trained is referred to as an AI model, and a deep neural network is a typical AI model. Specifically, the deep neural network includes a multi-layer perceptron (multi-layer perceptron, MLP), a convolutional neural network (convolutional neural network, CNN), a recurrent neural network (recurrent neural network, RNN), and the like.

In at least one embodiment, there are one or more AI models participating in the test. After obtaining the at least one test set, the terminal tests one or more AI models by using the test set. In an implementation, each AI model corresponds to a same test set. For example, for each AI models participating in the test, the terminal tests the AI model once by using each of the at least one test set indicated by the base station. For example, there are N AI models participating in the test, the at least one test set includes M test sets, and both N and M are integers. Each of the N AI models corresponds to the M test sets. In other words, each AI model is tested by using the M test sets separately. One test result is obtained by inputting any one of the M test sets into any one of the N AI models, and M*N test results are obtained by separately inputting the M test sets into the N AI models. Alternatively, in another implementation, AI models correspond to different tests. For example, three AI models are respectively referred to as a first AI model, a second AI model, and a third AI model. Three test sets are referred to as a test set 1 to a test set 3. The first AI model is tested by using the test set 1, the second AI model is tested by using the test set 2, and the third AI model is tested by using the test set 3. In this case, the base station additionally notifies the terminal of a to-be-tested AI model corresponding to each test set. For example, a correspondence between the test set and the AI model is notified to the terminal in the first information in step 501. In this implementation, the terminal notifies the base station of the AI models participating in the test in advance. For example, the request information in step 500 carries indication information of the AI models participating in the test, so that the base station allocates a same test set or different test sets to the AI models participating in the test. Alternatively, in still another implementation, some AI models in AI models participating in the test correspond to a same test set, and remaining AI models correspond to different test sets. For example, there are three AI models participating in the test: a first AI model, a second AI model, and a third AI model. The at least one test set includes five test sets: a test set 1 to a test set 5. The first AI model and the second AI model correspond to same test sets. The first AI model or the second AI model corresponds to the test set 1 to the test set 5. Test sets corresponding to the third AI model are a test set 4 and the test set 5. The test sets corresponding to the third AI model and the test sets corresponding to the first AI model or the second AI model are different. In this implementation, the base station also notifies the terminal of a correspondence between the AI model and the test set. Alternatively, in yet another implementation, test sets corresponding to different AI models in AI models participating in the test are partially the same (there is an intersection), but are not completely the same. The base station also notifies the terminal of a correspondence between the AI model and the test set. For example, test sets corresponding to a first AI model are 1 and 2, and test sets corresponding to a second AI model are 2 and 3. The test sets corresponding to the first AI model and the test sets corresponding to the second AI model are partially the same, and the test sets corresponding to the first AI model and the test sets corresponding to the second AI model include the same test set 2.

Step 503: The terminal sends second information to the base station, where the second information indicates the test result of the AI model.

Optionally, after receiving the first information in step 501, the terminal feeds back the second information to the base station in T2 time units. An uplink resource used in response to the terminal feeding back the second information being indicated or configured in response to the base station sending the first information. In other words, the first information in step 501 further includes indication or configuration information of the uplink resource corresponding to the second information. Alternatively, an uplink resource used for feeding back the second information is indicated or configured to the terminal in another manner, or is predefined or preconfigured, or the like. This is not limited.

In a design, the second information indicates at least one of the following.

1. AI models participating in the test: There are one or more AI models participating in the test. For example, the second information includes a number, an identifier, or the like of the AI models participating in the test. In at least one embodiment, the second information alternatively does not indicate the AI models participating in the test. For example, in response to the terminal sending the request information to the base station, in other words, indicating the AI models participating in the test to the base station, and the terminal completing the test of the AI model and reporting a test result to the base station, even in response to the second information indicating the test result does not carry the indication information of the AI models participating in the test, the base station obtains that the test result currently reported by the terminal is the test result of the AI model that is previously requested to be tested.

2. Test result corresponding to each AI model: For the AI models participating in the test, each AI model corresponds to one or more test results. For example, in response to the base station indicating a plurality of test sets, one test result is obtained by testing the AI model by using each test set. The terminal notifies the base station of a plurality of test results corresponding to the plurality of test sets. Alternatively, the terminal selects some test results from a plurality of test results, and notifies the base station of the test results. Alternatively, the terminal performs comprehensive processing on a plurality of test results to obtain a comprehensive test result, and notify the base station of the comprehensive test result.

3. Test set corresponding to each test result: As described in step 502, test sets corresponding to different AI models participating in the test are the same or different. In response to the terminal reporting the test result to the base station, the terminal notifies the base station of a test set corresponding to the test result, to be specific, a specific test set based on which the test result is generated.

The test procedure shown in FIG. 5 is initiated by the terminal, for example, is initiated in response to the terminal preparing at least one AI model, or being initiated in response to the terminal being ready to use the AI model. For example, the terminal wants to predict the movement track of the terminal by using a first AI model. The first AI model is trained by the terminal, downloaded from another node, or the like. This is not limited. Before using the first AI model, the terminal initiates the test procedure shown in FIG. 5, and sends the request information in step 501 to the base station, to request the base station to test the first AI model. After the test performed by the base station on the first AI model succeeds, the terminal predicts the movement track by using the first AI model. In response to the terminal initiating the test procedure shown in FIG. 5, step 500 is performed. Alternatively, the test procedure shown in FIG. 5 is initiated by the base station. For example, the base station initiates the test procedure in response to the base station wanting to know current performance of an AI model of the terminal, or the base station initiating the test procedure in response to the base station being ready to enable the terminal to use an AI model. Alternatively, for an AI model that has been tested before, after a period of time, for example, an environment changes, and the base station wants to know current performance of the AI model. In this case, the base station indicates the terminal to test the AI model again. In this case, the base station sends indication information of a to-be-tested AI model to the terminal. For example, the indication information of the to-be-tested AI model and the test set is carried in same information. In other words, the first information in step 501 further indicates the to-be-tested AI model of the terminal. Alternatively, the base station separately sends one piece of information to the terminal. The information separately indicates the terminal to test an internally deployed AI model, or the like. Alternatively, the base station tests an AI model based on a time domain pattern. The time domain pattern indicates a time domain resource for testing the model. The time domain resource indicated by the time domain pattern is periodic or aperiodic. This is not limited. The time domain pattern is specified in a protocol or preconfigured by the base station for the terminal. In response to the base station initiating the test procedure shown in FIG. 5, step 500 is not performed, and the base station directly sends the first information in step 501 to the terminal.

According to the foregoing method, the base station sends, to the terminal, the first information used to determine the at least one test set. The terminal tests the AI model by using the test set indicated by the base station, to determine the test result, and notifies the base station of the test result, so that the base station knows and manages the AI model used by the terminal.

In the procedure shown in FIG. 5, an example in which the base station tests the AI model is used for description. In an implementation, the AI model is tested by another device other than the base station. The another device includes a core network device, OAM, remote intelligent communication, a wireless intelligent controller, an AI node, or the like. The base station specifically forwards information. For example, a core network device tests an AI model deployed in a terminal. As shown in FIG. 6, a procedure of a model test method is provided, including at least the following steps.

Step 600: A terminal sends a request information to a core network device, where the request information is used to request the core network device to test an AI model, or request first information, or the like.

Specifically, the base station first receives the request information, and then forwards the request information to the core network device. Similar to the procedure shown in FIG. 5, step 500 is optional.

Step 601: The core network device sends the first information to the terminal, where the first information is used to determine at least one test set.

Specifically, the base station first receives the first information sent by the core network device, and then forwards the first information to the terminal.

Step 602: The terminal tests the AI model based on the at least one test set, to obtain a first test result.

Step 603: The terminal sends second information to the core network device, where the second information indicates the test result of the AI model.

Specifically, the base station first receives the second information, and then forwards the second information to the core network device.

In at least one embodiment, for how to determine whether each AI models participating in a test meets a performance goal, at least one embodiment provides the following two solutions. In a first solution, a terminal reports a first test result to a base station. The first test result indicates an output of the AI model, a performance indicator determined based on an output of the AI model, or the like. The base station determines, based on the first test result, whether the AI models participating in the test meets the performance goal, and the like. For details, refer to a procedure shown in FIG. 7. In a second solution, a base station notifies a terminal of a performance goal for an AI model in advance. In response to testing the AI model, the terminal obtains a performance indicator in the foregoing first test result. The terminal determines, based on a performance indicator of each AI model and the performance goal notified by the base station, whether each AI model meets the performance goal, and notifies the base station that each AI model meets or does not meet the performance goal. Information that is notified by the terminal to the base station and that indicates whether each AI model meets the performance goal, or the like is referred to as a second test result. For details, refer to a procedure shown in FIG. 8.

In at least one embodiment, in response to the terminal testing the AI model to obtain the first test result, and notifying the base station of the first test result, the base station determines whether the AI model meets the goal, and the base station notifies the terminal of a test conclusion of the AI model. As shown in FIG. 7, a procedure of a model test method is provided, including at least the following steps.

Step 700: A terminal sends request information to a base station, where the request information is used to request to test an AI model, or is used to request first information in step 701.

Similar to step 500 shown in FIG. 5, step 700 is not necessarily performed, and is selectively performed.

Step 701: The base station sends the first information to the terminal, where the first information is used to determine at least one test set.

Step 702: The terminal tests the AI model based on the at least one test set, to obtain a first test result.

For example, the terminal uses an output that is of the AI models participating in the test and that is obtained based on each test set as the first test result and feed back the first test result to the base station. In this case, the first test result indicates an output that is of the AI model and that is obtained based on the at least one test set. Alternatively, the terminal determines a performance indicator of the AI model based on an output that is of the AI model and that is obtained based on the test set, and use the performance indicator of the AI model as the first test result of the AI model and feed back the first test result of the AI model to the base station. In this case, the first test result indicates a performance indicator of the AI model obtained by testing the AI model based on the at least one test set. A specific manner in which the terminal determines the performance indicator of the AI model based on the output that is of the AI model and that is obtained based on the test set is predefined, configured by the base station, or the like. This is not limited. For example, the performance indicator of the AI model includes a minimum mean square error (mean square error, MSE), a normalized minimum mean square error (normalized mean square error, NMSE), a block error rate (block error rate, BLER), or the like. For example, the terminal calculates an MSE, an NMSE, or the like based on the output that is of the AI model and that is obtained based on the test set and a label corresponding to the test set. The label is considered as a correct output corresponding to the test set. Optionally, in this design, the first information in step 701 further indicates a label corresponding to each test set.

Step 703: The terminal sends second information to the base station, where the second information indicates the first test result of the AI model.

In response to receiving the second information sent by the terminal, the base station determines, based on the first test result indicated by the second information, whether each AI models participating in the test meets a performance goal. In a design, in response to the first test result fed back by the terminal indicating the output that is of the AI model and that is obtained based on the test set, in a case of supervised learning, the base station compares the output of the AI model with the corresponding label (namely, an accurate output) to determine an error between the output of the AI model and the corresponding label, and determines, based on the error between the output of the AI model and the corresponding label, whether the AI models participating in the test meets the performance goal. For example, in response to the error between the output of the AI model and the corresponding label being less than a specific threshold, the AI models participating in the test meets the performance goal. In response to the error between the output of the AI model and the corresponding label not being less than a specific threshold, the AI models participating in the test do not meet the performance goal. In a case of unsupervised learning or reinforcement learning, there is no label corresponding to the output of the AI model, the base station determines the performance indicator of the AI model based on the output of the AI model, and determines, based on the performance indicator of the AI model, whether the AI model meets the performance goal, and the like. For example, in response to the output of the AI model being a beamforming vector or a precoding matrix, the base station calculates a corresponding signal to interference plus noise ratio (signal to interference and noise ratio, SINR) or throughput in response to the beamforming vector or the precoding matrix being used. In response to the SINR or throughput being greater than a threshold, the AI models participating in the test meets the performance goal. In response to the SINR or throughput is not greater than a threshold, the AI models participating in the test does not meet the performance goal. For example, the output of the AI model is a precoding matrix, and an SINR corresponding to the precoding matrix is calculated. The first test result fed back by the base station is a precoding matrix W. After a signal that is sent by using power P and the precoding matrix W passes through a channel H, the base station calculates receive power of the signal at a receiving end. An SINR is calculated based on the receive power at the receiving end and noise power.

In another design, in response to the first test result fed back by the terminal indicating the performance indicator that is of the AI model obtained by testing the AI model based on the at least one test set, the base station determines, based on the performance indicator of the AI model, whether the AI model meets the performance goal. For example, the performance indicator fed back by the terminal is an NMSE. In this case, in response to the NMSE being less than a threshold, the base station considers that the AI models participating in the test meets the performance goal. In response to the NMSE not being less than a threshold, the base station considers that the AI models participating in the test does not meet the performance goal.

In a case of a plurality of test sets, for each AI models participating in the test, the base station comprehensively considers first test results that are of the AI model and that are obtained based on the plurality of test sets, to determine whether the AI model meets the performance goal. For example, in response to an AI model meeting a performance goal of each of the plurality of test sets, the AI model meets the performance goal. Alternatively, in response to an AI model meeting performance goals of more than a half of the test sets, the AI model meets the performance goal. Alternatively, in response to average or weighted average performance that is of an AI model and that is obtained based on the plurality of test sets meeting the performance goal, the AI model meets the performance goal.

Step 704: The base station sends first indication information to the terminal. There are one or more pieces of first indication information. This is not limited. For example, in response to the first indication information indicating that the AI models participating in the test meets or does not meet the performance goal, whether each of the AI models participating in the test meets the performance goal is indicated by using one piece of indication information, for example, by using the following bitmap. Alternatively, for each AI models participating in the test, the base station sends one piece of indication information, to indicate whether a corresponding AI model meets the performance goal, and the like.

Optionally, in response to receiving the second information in step 703, the base station sends the first indication information to the terminal in T3 time units.

In a design, the first indication information is used for a test conclusion of the AI model. For example, the first indication information indicates that each of the AI models participating in the test meets or does not meet the performance goal. For example, five AI models in the terminal participate in the test, and the base station finds, based on first test results of the five AI models reported by the terminal, that three AI models meet the performance goal, and two AI models do not meet the performance goal. In this case, the base station notifies the terminal that the three AI models meet the goal and the two AI models do not meet the goal. For example, a 5-bit bitmap (bitmap) is used, and each of five bits corresponds to one AI model. In response to an AI model meeting the performance goal, a bit corresponding to the AI model is represented by 1 in the bitmap. In response to an AI model not meeting the performance goal, a bit corresponding to the AI model is represented by 0. Meanings of 1 and 0 are interchanged. This is not limited.

Alternatively, the first indication information indicates an AI model that meets the performance goal and that is in the AI models participating in the test. In this case, that the base station feeds back, to the terminal, the AI model that meets the performance goal is pre-agreed on by the base station and the terminal, specified in a protocol, or the like. In this design, the base station notifies the terminal of the three AI models that meet the performance goal and that are in the five AI models participating in the test. For example, numbers or identifiers of the three AI models are notified to the terminal. In response to obtaining indication information of the three AI models, the terminal considers by default that the three AI models meet the performance goal, and the remaining two AI models participating in the test do not meet the performance goal.

Alternatively, the first indication information indicates an AI model that does not meet the performance goal and that is in the AI models participating in the test. In this case, that the base station feeds back, to the terminal, the AI model that does not meet the performance goal is pre-agreed on by the base station and the terminal or specified in a protocol. In this design, the base station notifies the terminal of the two AI models that do not meet the performance goal and that are in the five AI models participating in the test. In response to obtaining indication information of the two AI models, the terminal considers by default that the two AI models do not meet the performance goal, and the remaining three AI models participating in the test meet the performance goal.

Optionally, the terminal subsequently performs a specific operation by using the three AI models that meet the performance goal, for example, predict a movement track of the terminal, network load, or the like. The two AI models that do not meet the performance goal are not used subsequently. The three AI models that meet the performance goal respectively implement different functions. For example, the three AI models are respectively used to predict the track of the terminal, the network load, movement speed of the terminal, and the like. Alternatively, the three AI models implement a same function. For example, all the three AI models are used to predict the movement track of the terminal. In this case, the terminal selects one AI model from the three AI models, to perform a specific movement track prediction operation. The terminal selects, based on a principle, a specific AI model from the three AI models having the same function to perform a specific operation. The principle is not limited thereto.

In another design, the first indication information indicates an AI model to be subsequently used by the terminal or an available AI model in the AI models participating in the test, or indicates the terminal to perform a subsequent operation in a non-AI manner. For example, the base station determines, based on the first test result reported by the terminal, that there is a plurality of AI models that meet the performance goal. Functions implemented by the plurality of AI models are the same. In this case, the base station selects one AI model from the plurality of AI models, and indicates the AI model to the terminal. The terminal directly uses the AI model to perform the subsequent operation. For example, the terminal tests five AI models. The five AI models are all used to predict the movement track of the terminal. In this case, the terminal notifies the base station of first test results of the five AI models. The base station determines, based on the first test results of the five AI models, whether the five AI models meet the performance goal. One AI model is selected from AI models that meet the performance goal, and the AI model is notified to the terminal, to indicate the terminal to perform subsequent movement track prediction by using the AI model. Alternatively, the base station finds that all the tested AI models do not meet the performance goal. In this case, the base station indicates the terminal to perform a corresponding operation in the non-AI manner.

In at least one embodiment, in response to receiving the first indication information, the terminal performs the subsequent operation based on the first indication information. For example, in response to the first indication information indicating that the AI models participating in the test does not meet the performance goal, or indicating the terminal to perform the corresponding operation in the non-AI manner, the terminal performs the corresponding operation in the non-AI manner. For example, in a channel estimation scenario, the terminal performs channel estimation in a non-AI manner such as a least square (least square, LS) method or a linear minimum mean square error (linear minimum mean square error, LMMSE) method. In this design, the base station indicates the terminal to perform the corresponding operation in the non-AI manner. A specific non-AI manner used by the terminal is not limited. In another design, in response to the AI models participating in the test not meeting the performance goal, the terminal indicates, with reference to a specific application scenario, the terminal to use a specific non-AI manner to perform the subsequent operation. In this case, the first indication information indicates a specific non-AI manner. For example, in a channel state information (channel state information, CSI) feedback scenario, the base station indicates the terminal to use an R16 type 2 codebook. In this case, the terminal uses the R16 type 2 codebook to perform CSI feedback. The R16 type 2 codebook is one of codebooks used for CSI feedback in the 3rd generation partnership project (3rd generation partnership project, 3GPP) protocol. The terminal uses the R16 type 2 codebook to represent a downlink channel or an eigenvector of a downlink channel as a precoding matrix index (precoding matrix index, PMI), and feed back the PMI to the base station. The base station restores the PMI to the downlink channel or the eigenvector of the downlink channel based on the same codebook.

Alternatively, in response to the first indication information indicating an AI model that should be subsequently used by the terminal, the terminal uses the AI model to perform a corresponding operation. For example, the base station indicates an AI model index to the terminal. In this case, the terminal performs a corresponding operation by using an AI model corresponding to the index. Alternatively, in response to the first indication information indicating whether each of the AI models participating in the test meets the performance goal, indicating a specific AI model that meets the performance goal, or indicating the AI model that does not meet the performance goal, in response to there being at least one AI model that meets the performance goal and that is in the AI models participating in the test, the terminal uses any AI model that meets the performance goal to perform a corresponding operation.

According to the foregoing method, the base station sends, to the terminal, the first information used to determine the at least one test set. The terminal tests the AI model based on the test set, to obtain the first test result. In addition, the terminal sends, to the base station, the second information indicating the first test result. The base station determines, based on the first test result, whether the AI models participating in the test meets the performance goal, and notifies the terminal that the AI models participating in the test meets or does not meet the performance goal. Therefore, the base station evaluates, manages, and controls the AI model used by the terminal.

In at least one embodiment, the base station sends the performance goal for the AI model to the terminal. The terminal determines, based on the performance goal that is for the AI model and that is notified by the base station and the performance indicator that is of the AI model and that is obtained through the test, whether the AI models participating in the test meets the performance goal. As shown in FIG. 8, a procedure of a model test method is provided, including at least the following steps.

Step 800: A terminal sends a request information to a base station, where the request information is used to request to test an AI model, or is used to request first information in step 801.

Similar to step 500 shown in FIG. 5, step 800 is not necessarily performed, and is selectively performed.

Step 801: The base station sends the first information to the terminal, where the first information is used to determine at least one test set.

Step 802: The base station sends second indication information to the terminal, where the second indication information indicates a performance goal for the AI model.

In a design, the performance goal is related to a test set. For example, the performance goal is a performance goal corresponding to the test set. The performance goal is also referred to as a performance indicator, a performance threshold, or the like, and is used to determine whether the AI models participating in the test meets the performance goal. For example, the performance goal is an MSE threshold, an NMSE threshold, a BLER threshold, an SINR threshold, or the like.

For a plurality of test sets, one performance goal is configured for each test set. Each test set corresponds to a same performance goal or a different performance goal. In this case, the second indication information further indicates how the terminal determines, based on the plurality of test sets, whether the AI model meets the performance goal. For example, for one AI models participating in the test, in response to a performance indicator of the AI model meeting a performance goal of each of the plurality of test sets, the AI model meets the performance goal. Alternatively, in response to the AI model meeting performance goals of more than a half of the test sets, the AI model meets the performance goal. Alternatively, for the plurality of test sets, a comprehensive performance goal is configured. In response to comprehensive performance of the AI models participating in the test meeting a goal, the AI model meets the performance goal. In this case, the second indication information specifically indicates a comprehensive performance goal for the AI model. The comprehensive performance goal is average performance, weighted average performance, or the like that is of the AI model and that is obtained based on the plurality of test sets. In a case of weighted averaging, in response to the second indication information indicating the comprehensive performance goal, weights of different test sets, and the like is further included.

In step 801 and step 802, sending is performed by using a same message, or is performed by using different messages. In response to the sending being performed by using different messages, a sequence of step 801 and step 802 is not limited.

Step 802 is optional, and is represented by a dashed line in FIG. 8. For example, in response to the performance goal being specified in a protocol, or is determined by the terminal, step 802 is not performed.

Step 803: The terminal determines a second test result based on the at least one test set and the performance goal for the AI model.

For example, the terminal tests the AI model based on the at least one test set, to obtain a performance indicator of the AI model. For a specific process, refer to the descriptions in FIG. 4. The second test result is determined based on the performance indicator of the AI model and the performance goal for the AI model. For example, in response to the performance indicator of the AI model exceeding the performance goal, the AI model meets the performance goal. In response to the performance indicator of the AI model not exceeding the performance goal, the AI model does not meet the performance goal. The second test result specifically indicates that each of the AI models participating in the test meets or does not meet the performance goal, indicates an AI model that meets the performance goal and that is in the AI models participating in the test, indicates an AI model that does not meet the performance goal and that is in the AI models participating in the test, indicates an AI model to be subsequently used by the terminal, indicates the terminal to perform a subsequent operation in a non-AI manner, or the like. For a specific process of generating the second test result, refer to the first indication information in the procedure shown in FIG. 4. A difference from the procedure shown in FIG. 4 is as follows. In the procedure shown in FIG. 4, the terminal reports the first test result to the base station, and the base station determines the first indication information based on the reported first test result. However, in the procedure shown in FIG. 8, the terminal directly obtains the second test result of the AI model based on the performance goal and the performance indicator obtained by testing the AI model.

For example, the terminal determines three test sets by using the first information in step 801. The three test sets are respectively referred to as a test set 1, a test set 2, and a test set 3. The terminal determines three performance goals by using the second indication information in step 802. The three performance goals are respectively referred to as a performance goal 1 corresponding to the test set 1, a performance goal 2 corresponding to the test set 2, and a performance goal 3 corresponding to the test set 3. For an AI models participating in the test, the terminal tests the AI model based on the test set 1 to obtain a first performance indicator of the AI model, and compares the performance goal 1 with the first performance indicator of the AI model, to determine whether the AI model meets the performance goal. For example, in response to the first performance indicator of the AI model exceeding a goal of the performance goal 1, the AI model meets the performance goal. In response to the first performance indicator of the AI model not exceeding a goal of the performance goal 1, the AI model does not meet the performance goal. Similarly, a result indicating whether the AI model meets the performance goal is also obtained by using the test set 2 and the performance indicator 2, and a result indicating whether the AI model meets the performance goal is also obtained by using the test set 3 and the performance indicator 3. The terminal determines, by combining results that are of the AI model and that are obtained based on the three test sets, whether the AI model meets the performance goal. For example, as described in step 802, the second indication information in step 802 further indicates how the terminal determines, based on a plurality of test sets, whether the AI model meets the performance goal, and the like. For example, in response to the AI model meeting a goal of each test set, the AI model meets the performance goal. Alternatively, in response to the AI model meeting performance goals of a half of the test sets, the AI model meets the performance goal.

Step 804: The terminal sends second information to the base station, where the second information indicates the second test result of the AI model.

According to the foregoing method, the base station sends, to the terminal, the first information used to determine the at least one test set and performance goal information. The terminal tests the AI model based on the at least one test set, to obtain the test result of the AI model, and determines, based on the performance indicator of the AI model and the performance goal, information such as whether the AI model meets the goal, and notifies the base station of the information, so that the base station manages, controls, and evaluates the AI model used by the terminal.

The request information, the first information, the second information, first request information, second request information, or the like in at least one embodiment indicates at least one piece of information. The first information, the second information, the first request information, the second request information, or the like explicitly indicates corresponding information. For example, the first information explicitly indicates the at least one test set, and an explicit indication manner is directly carrying corresponding information. For example, the first information directly carries the at least one test set. Alternatively, an identifier of corresponding information is carried. For example, the first information carries identification information of the at least one test set. Alternatively, the request information, the first information, the second information, the first request information, or the second request information implicitly indicates corresponding information. For example, the first information implicitly indicates the at least one test set, and an implicit indication is implicitly indicating by using one or more of manners such as scrambling, a reference signal sequence, a resource location, or the like. For example, the base station configures a plurality of test sets for the terminal. By scrambling, in a specific manner, at least one test set used for model training, the base station indicates the terminal to perform model training and the like on the at least one test set scrambled in the specific manner.

To implement functions in the foregoing methods, the base station and the terminal include corresponding hardware structures and/or software modules for performing the functions. A person skilled in the art should be easily aware that, with reference to units and method steps in the examples described in at least one embodiment, at least one embodiment is implemented by hardware or a combination of hardware and computer software. Whether a function is performed by hardware or hardware driven by computer software depends on particular application scenarios and design constraints of the technical solutions.

FIG. 9 and FIG. 10 each are a schematic diagram of a structure of a communication apparatus according to at least one embodiment. The communication apparatus is configured to implement a function of the terminal or the base station in the foregoing methods, and therefore also implements beneficial effects of the foregoing methods. In at least one embodiment, the communication apparatus is one of the terminals 120a to 120j shown in FIG. 1, or is the base station 110a or 110b shown in FIG. 1, or is a module (for example, a chip) used in a terminal or a base station.

As shown in FIG. 9, a communication apparatus 900 includes a processing unit 910 and a transceiver unit 920. The communication apparatus 900 is configured to implement a function of the terminal or the base station in the method shown in FIG. 5, FIG. 7, or FIG. 8.

In response to the communication apparatus 900 being configured to implement the function of the terminal in the method shown in FIG. 5, FIG. 7, or FIG. 8, the transceiver unit 920 is configured to receive first information from a base station; and the processing unit 910 is configured to test an AI model based on at least one test set, to obtain a test result.

In response to the communication apparatus 900 being configured to implement the function of the base station in the method shown in FIG. 5, FIG. 7, or FIG. 8, the processing unit 910 is configured to determine first information; and the transceiver unit 920 is configured to send the first information to a terminal, and receive second information from the terminal.

For more detailed descriptions of the processing unit 910 and the transceiver unit 920, directly refer to related descriptions in the method shown in FIG. 5, FIG. 7, or FIG. 8. Details are not described herein again.

As shown in FIG. 10, a communication apparatus 1000 includes a processor 1010 and an interface circuit 1020. The processor 1010 and the interface circuit 1020 are coupled to each other. The interface circuit 1020 is a transceiver or an input/output interface. Optionally, the communication apparatus 1000 further includes a memory 1030, configured to store instructions executed by the processor 1010, store input data used by the processor 1010 to run instructions, or store data generated after the processor 1010 runs instructions.

In response to the communication apparatus 1000 being configured to implement the foregoing methods, the processor 1010 is configured to implement a function of the processing unit 910, and the interface circuit 1020 is configured to implement a function of the transceiver unit 920.

In response to the communication apparatus being a chip used in a terminal, the chip in the terminal implements a function of the terminal in the foregoing methods. The chip in the terminal receives information from another module (for example, a radio frequency module or an antenna) in the terminal, where the information is sent by a base station to the terminal: or the chip in the terminal sends information to another module (for example, a radio frequency module or an antenna) in the terminal, where the information is sent by the terminal to a base station.

In response to the communication apparatus being a module used in a base station, the module in the base station implements a function of the base station in the foregoing methods. The module in the base station receives information from another module (for example, a radio frequency module or an antenna) in the base station, where the information is sent by a terminal to the base station: or the module in the base station sends information to another module (for example, a radio frequency module or an antenna) in the base station, where the information is sent by the base station to a terminal. The module in the base station herein is a baseband chip in the base station, or is a DU or another module. The DU herein is a DU in an open radio access network (open radio access network, O-RAN) architecture.

The processor in at least one embodiment is a central processing unit (central processing unit, CPU), another general-purpose processor, a digital signal processor (digital signal processor, DSP), an application-specific integrated circuit (application-specific integrated circuit, ASIC), a field programmable gate array (field programmable gate array, FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof. The general-purpose processor is a microprocessor or any conventional processor.

The memory in at least one embodiment is a random access memory, a flash memory, a read-only memory, a programmable read-only memory, an erasable programmable read-only memory, an electrically erasable programmable read-only memory, a register, a hard disk, a removable hard disk, a CD-ROM, or a storage medium of any other form well-known in the art.

For example, a storage medium is coupled to a processor, so that the processor reads information from the storage medium and writes information into the storage medium. The storage medium is alternatively a component of the processor. The processor and the storage medium are disposed in an ASIC. In addition, the ASIC is disposed in a base station or a terminal. Certainly, the processor and the storage medium alternatively exist in a base station or a terminal as discrete components.

All or some of the methods in at least one embodiment are implemented by software, hardware, firmware, or any combination thereof. In response to software being used to implement the foregoing embodiments, all or some of the foregoing embodiments are implemented in a form of a computer program product. The computer program product includes one or more computer programs or instructions. In response to the computer programs or instructions being loaded and executed on a computer, the procedures or functions according to at least one embodiment are completely or partially executed. The computer is a general-purpose computer, a dedicated computer, a computer network, a network device, user equipment, a core network device, OAM, or another programmable apparatus. The computer programs or instructions are stored in a computer-readable storage medium, or are transmitted from a computer-readable storage medium to another computer-readable storage medium. For example, the computer programs or instructions are transmitted from a website, computer, server, or data center to another website, computer, server, or data center in a wired or wireless manner. The computer-readable storage medium is any usable medium accessible by the computer, or is a data storage device, such as a server or a data center, integrating one or more usable media. The usable medium is a magnetic medium, for example, a floppy disk, a hard disk, or a magnetic tape: is an optical medium, for example, a digital video disc: or is a semiconductor medium, for example, a solid-state drive. The computer-readable storage medium is a volatile or non-volatile storage medium, or includes two types of storage media: a volatile storage medium and a non-volatile storage medium.

In at least one embodiment, unless otherwise stated or there is a logic conflict, terms and/or descriptions in different embodiments are consistent and is mutually referenced, and technical features in different embodiments is combined based on an internal logical relationship thereof, to form a new embodiment.

In at least one embodiment, “at least one” means one or more, and “a plurality of” means two or more. The term “and/or” describes an association relationship between associated objects and indicates that three relationships exist. For example, A and/or B indicates the following three cases: Only A exists, both A and B exist, and only B exists. A and B each is singular or plural. In the text descriptions of at least one embodiment, the character “/” generally indicates an “or” relationship between associated objects. In a formula in at least one embodiment, the character “/” indicates a “division” relationship between associated objects. “Including at least one of A, B, or C” indicates: including A; including B; including C; including A and B; including A and C; including B and C; and including A, B, and C.

Various numerals used in at least one embodiment are merely differentiated for ease of description, but are not used to limit the scope of at least one embodiment. The sequence numbers of the foregoing processes do not mean execution sequences, and the execution sequences of the processes should be determined based on functions and internal logic of the processes.

	Number	Date	Country
Parent	PCT/CN2022/117999	Sep 2022	WO
Child	18598612		US

MODEL TEST METHOD AND APPARATUS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

CROSS-REFERENCE TO RELATED APPLICATIONS

Continuations (1)