This application is based on and claims priority to Chinese Patent Application No. 202410177101.X, filed on Feb. 8, 2024, in the China National Intellectual Property Administration, the disclosure of which is incorporated by reference herein in its entirety.
The present disclosure relates to the technical field of computing power networks (CPNs), and in particular, to an artificial intelligence-based data processing method, an electronic device and a computer-readable storage medium.
The computing power network is a key part of an intelligent, comprehensive, new type of information infrastructure that enables the transition from network-centered information exchange to computing-power-centered information data processing. At present, because network construction and computing power construction are separated, the supply of computing power suffers from low efficiency and high cost, which makes it difficult to meet the development requirements of the digital economy.
Therefore, the construction of a unified digital infrastructure integrating computing power and networking has emerged. In the prior art, this is achieved by adding independent computing power cards or dedicated computing servers to network equipment. However, these technologies require the additional purchase of computing power cards or computing servers, which makes the construction cost of the computing power network relatively high.
Note that the concept and embodiments of the present disclosure will be described with respect to a computing power network (CPN), which is a type of network that realizes optimized resource allocation by distributing computing, storage, network and other resource information of service nodes through a network control plane (such as a centralized controller, a distributed routing protocol, etc.). Such a computing power network can be any known or to-be-known network having equivalent functionalities, regardless of the expression or abbreviation used for the network, for example: Computing Aware Networking (CAN) in ITU-T D835, which is in the scope of cloud computing and is an enhancement of the network of cloud computing to support the integration of cloud and network resources; Computing First Network (CFN) in the IETF, which leverages both computing and networking status to help determine the optimal edge among multiple edge sites with different geographic locations to serve a specific edge computing request; Computing Force Network (CFN) in ITU-T C1407, which aims to achieve joint computing and network optimization based on the awareness, control and management of computing resources in the context of IMT-2020 and beyond, and is required to enable the use of AI/ML related capabilities; Computing Network Convergence (CNC) in ITU-T D953, which aims to achieve joint optimization of computing and network resources based on the awareness, control and management of network and computing resources; and Coordination of Computing and Networking (CCN), among others. The scope of the present application will not be affected by the expression or abbreviation of the network in which the solution of the present disclosure is implemented.
Embodiments of the present disclosure provide an artificial intelligence-based data processing method, an electronic device and a computer-readable storage medium, which can solve the problem of high construction cost of a computing power network in the prior art.
The technical solutions are as follows:
According to one aspect of the embodiments of the present disclosure, an artificial intelligence-based data processing method is provided. The method includes:
in response to at least one task request for a target application, acquiring at least one task respectively corresponding to the at least one task request;
determining a second cluster respectively corresponding to each task from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task, wherein the second cluster is a cluster matched with the task information of the task; and
for each task, allocating the task to the second cluster corresponding to the task so that the second cluster performs the task based on a calculation computing power resource,
wherein the first clusters are determined based on the following operations:
acquiring predicted idle computing power respectively corresponding to at least one baseband unit (BBU) during task execution time, and determining the calculation computing power resource corresponding to the predicted idle computing power from candidate computing power resources;
clustering each BBU based on predicted idle computing power respectively corresponding to each BBU to obtain at least one cluster;
acquiring at least one application to be processed, wherein the at least one application includes the target application; and
determining at least one first cluster respectively corresponding to each application from the at least one cluster, based on a computing power requirement respectively corresponding to each application, wherein the first clusters are clusters matched with the computing power requirement of the application.
Optionally, the determining at least one first cluster respectively corresponding to each application from the at least one cluster, based on a computing power requirement respectively corresponding to each application includes:
determining the computing power requirement respectively corresponding to each application and a cluster feature respectively corresponding to each cluster, wherein the cluster feature is used to represent the computing power resource supply level of the cluster;
determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster; and
determining at least one first cluster respectively corresponding to each application based on the first mapping relationship.
Optionally, the determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster includes:
inputting the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster to an orchestration model to obtain a plurality of candidate orchestration policies output by the orchestration model;
determining a target orchestration policy from the plurality of candidate orchestration policies based on a first policy evaluation index; and
determining the first mapping relationship based on the target orchestration policy.
Optionally, the determining a second cluster respectively corresponding to each task from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task includes:
inputting the task information respectively corresponding to each task and each of the first clusters to a scheduling model to obtain a plurality of candidate scheduling policies output by the scheduling model;
determining a target scheduling policy from the plurality of candidate scheduling policies based on a second policy evaluation index; and
determining the second cluster respectively corresponding to each task based on the target scheduling policy.
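The scheduling steps above can be sketched as follows. This is a minimal illustrative sketch only: the structure of a candidate scheduling policy (a task-to-cluster mapping) and the second policy evaluation index (load balance across clusters) are assumptions for illustration, not the disclosed scheduling model.

```python
# Illustrative sketch: select a target scheduling policy from candidates
# produced by a scheduling model, using a policy evaluation index.
# Policy structure and index are assumptions, not the disclosed model.

def evaluate_policy(policy, tasks, clusters):
    """Assumed second policy evaluation index: negative load imbalance."""
    load = {c: 0.0 for c in clusters}
    for task, cluster in policy.items():
        load[cluster] += tasks[task]          # add task demand to its cluster
    mean = sum(load.values()) / len(load)
    return -sum((v - mean) ** 2 for v in load.values())

def select_target_policy(candidate_policies, tasks, clusters):
    """Return the candidate policy maximizing the evaluation index."""
    return max(candidate_policies,
               key=lambda p: evaluate_policy(p, tasks, clusters))

tasks = {"t1": 4.0, "t2": 2.0, "t3": 3.0}     # task -> computing power demand
clusters = ["c1", "c2"]
candidates = [
    {"t1": "c1", "t2": "c1", "t3": "c2"},     # cluster loads: 6 / 3
    {"t1": "c1", "t2": "c2", "t3": "c2"},     # cluster loads: 4 / 5 (balanced)
]
best = select_target_policy(candidates, tasks, clusters)
```

The second cluster for each task is then read directly from the selected mapping.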
Optionally, the acquiring predicted idle computing power respectively corresponding to at least one baseband unit (BBU) during task execution time includes:
for each BBU, acquiring a predicted use amount of computing power resource for communication services of the BBU during the task execution time; and
determining the predicted idle computing power of the BBU during the task execution time, based on the predicted use amount of computing power resource for communication services of the BBU, a resource constraint corresponding to the BBU and a scaling-out threshold.
Optionally, the scaling-out threshold is determined based on the following operations:
determining an initial scaling-out threshold; and
performing at least one optimizing operation on the initial scaling-out threshold until a preset ending condition is met, and taking the initial scaling-out threshold meeting the preset ending condition as the scaling-out threshold,
wherein the optimizing operations include:
for each BBU, acquiring the current status of computing power resource for communication services and the historical status of computing power resource for communication services of the BBU;
determining a first predicted use amount of computing power resource for communication services in a preset time domain, based on the current status of computing power resource for communication services and the historical status of computing power resource for communication services;
for any preset time in the preset time domain, determining a second predicted use amount of computing power resource for communication services at the preset time from the first predicted use amount of computing power resource for communication services;
obtaining predicted idle computing power of the BBU at the preset time, based on the second predicted use amount of computing power resource for communication services, the resource constraint corresponding to the BBU and the initial scaling-out threshold;
determining a predicted error, based on a difference between predicted idle computing power at each preset time and actual idle computing power at each preset time in the preset time domain; and
in a case that the predicted error does not meet the preset ending condition, modifying the initial scaling-out threshold and taking the modified initial scaling-out threshold as an initial scaling-out threshold for a next optimization.
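The optimizing operations above can be sketched as the following iterative loop. The concrete form of the idle-power computation (constraint minus predicted use minus a reserved scaling-out margin), the error metric (mean error), and the threshold update rule are assumptions for illustration; the disclosure only fixes the iterate-until-the-ending-condition-is-met structure.

```python
# Illustrative sketch of the scaling-out threshold optimization loop.
# The idle-power formula, error metric and update rule are assumptions.

def predicted_idle(predicted_use, resource_constraint, threshold):
    # Assumed form: idle power = constrained resource total, minus predicted
    # communication use, minus a scaling-out margin reserved for bursts.
    return max(0.0, resource_constraint - predicted_use - threshold)

def optimize_threshold(predicted_use_series, actual_idle_series,
                       resource_constraint, init_threshold,
                       tol=0.01, step=0.05, max_iters=1000):
    threshold = init_threshold
    for _ in range(max_iters):
        errors = [predicted_idle(u, resource_constraint, threshold) - a
                  for u, a in zip(predicted_use_series, actual_idle_series)]
        mean_err = sum(errors) / len(errors)
        if abs(mean_err) <= tol:          # preset ending condition met
            return threshold
        threshold += step * mean_err      # modify threshold for next round
    return threshold

threshold = optimize_threshold(
    predicted_use_series=[6.0, 7.0],      # predicted communication use per slot
    actual_idle_series=[3.5, 2.5],        # measured idle power per slot
    resource_constraint=10.0,
    init_threshold=0.0)
```

With this assumed form the loop converges to the margin that makes predicted idle power match the observed idle power on average.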
Optionally, the target application includes a Federated Learning application; and a corresponding initial local model is respectively deployed by each BBU in the second cluster; and
the second cluster performing a task includes:
performing at least one training operation on an initial aggregation model in the Mobile-Edge Computing (MEC) server until a training ending condition is met, and taking the initial aggregation model meeting the training ending condition as a trained aggregation model,
wherein the training operations include:
acquiring the initial local model deployed by each BBU in the second cluster;
performing model aggregation on a plurality of initial local models to obtain a first aggregation model, and updating the initial aggregation model based on the first aggregation model; and
in a case that a loss function of the updated initial aggregation model does not meet the training ending condition, sending the updated initial aggregation model to each BBU in the second cluster respectively so that each BBU takes the updated initial aggregation model as an initial local model for a next training operation.
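The training operations above follow a FedAvg-style structure: collect local models from the BBUs, aggregate them on the MEC server, check a loss-based ending condition, and redistribute the aggregated model. The sketch below is illustrative only; the element-wise averaging rule, the loss criterion, and the demo `_DemoBBU` local-update step are assumptions, not the disclosed training procedure.

```python
# Illustrative FedAvg-style sketch of the aggregation loop on the MEC server.
# Averaging rule, loss criterion and demo BBU behavior are assumptions.

def aggregate(local_models):
    """Element-wise average of local model parameter vectors."""
    n = len(local_models)
    return [sum(params) / n for params in zip(*local_models)]

def train_aggregation_model(bbus, loss_fn, loss_target, max_rounds=100):
    for _ in range(max_rounds):
        local_models = [bbu.local_model for bbu in bbus]   # collect from BBUs
        global_model = aggregate(local_models)
        if loss_fn(global_model) <= loss_target:           # ending condition
            return global_model
        for bbu in bbus:                                   # redistribute model
            bbu.local_model = bbu.local_update(global_model)
    return global_model

class _DemoBBU:
    """Hypothetical BBU stand-in for the sketch."""
    def __init__(self, model):
        self.local_model = model
    def local_update(self, global_model):
        # toy local training step: move halfway toward an all-ones optimum
        return [(g + 1.0) / 2.0 for g in global_model]

bbus = [_DemoBBU([0.0, 0.0]), _DemoBBU([0.5, 0.5])]
loss = lambda m: sum((x - 1.0) ** 2 for x in m)    # distance to optimum
model = train_aggregation_model(bbus, loss, loss_target=1e-4)
```

Each round thus plays the role of one "training operation" in the claim language: aggregation, loss check, and redistribution of the updated model as the next initial local model.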
According to another aspect of the embodiments of the present disclosure, an electronic device is provided. The electronic device includes a memory and a processor, the memory being configured to store computer programs which, when executed by the processor, cause the processor to perform the following operations:
in response to at least one task request for a target application, acquiring at least one task respectively corresponding to the at least one task request;
determining a second cluster respectively corresponding to each task from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task, wherein the second cluster is a cluster matched with the task information of the task; and
for each task, allocating the task to the second cluster corresponding to the task so that the second cluster performs the task based on a calculation computing power resource,
wherein the first clusters are determined based on the following operations:
acquiring predicted idle computing power respectively corresponding to at least one baseband unit (BBU) during task execution time, and determining the calculation computing power resource corresponding to the predicted idle computing power from candidate computing power resources;
clustering each BBU based on predicted idle computing power respectively corresponding to each BBU to obtain at least one cluster;
acquiring at least one application to be processed, wherein the at least one application includes the target application; and
determining at least one first cluster respectively corresponding to each application from the at least one cluster, based on a computing power requirement respectively corresponding to each application, wherein the first clusters are clusters matched with the computing power requirement of the application.
Optionally, the determining at least one first cluster respectively corresponding to each application from the at least one cluster, based on a computing power requirement respectively corresponding to each application includes:
determining the computing power requirement respectively corresponding to each application and a cluster feature respectively corresponding to each cluster, wherein the cluster feature is used to represent the computing power resource supply level of the cluster;
determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster; and
determining at least one first cluster respectively corresponding to each application based on the first mapping relationship.
Optionally, the determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster includes:
inputting the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster to an orchestration model to obtain a plurality of candidate orchestration policies output by the orchestration model;
determining a target orchestration policy from the plurality of candidate orchestration policies based on a first policy evaluation index; and
determining the first mapping relationship based on the target orchestration policy.
Optionally, the determining a second cluster respectively corresponding to each task from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task includes:
inputting the task information respectively corresponding to each task and each of the first clusters to a scheduling model to obtain a plurality of candidate scheduling policies output by the scheduling model;
determining a target scheduling policy from the plurality of candidate scheduling policies based on a second policy evaluation index; and
determining the second cluster respectively corresponding to each task based on the target scheduling policy.
Optionally, the acquiring predicted idle computing power respectively corresponding to at least one baseband unit (BBU) during task execution time includes:
for each BBU, acquiring a predicted use amount of computing power resource for communication services of the BBU during the task execution time; and
determining the predicted idle computing power of the BBU during the task execution time, based on the predicted use amount of computing power resource for communication services of the BBU, a resource constraint corresponding to the BBU and a scaling-out threshold.
Optionally, the scaling-out threshold is determined based on the following operations:
determining an initial scaling-out threshold; and
performing at least one optimizing operation on the initial scaling-out threshold until a preset ending condition is met, and taking the initial scaling-out threshold meeting the preset ending condition as the scaling-out threshold,
wherein the optimizing operations include:
for each BBU, acquiring a current status of computing power resource for communication services and a historical status of computing power resource for communication services of the BBU;
determining a first predicted use amount of computing power resource for communication services in a preset time domain, based on the current status of computing power resource for communication services and the historical status of computing power resource for communication services;
for any preset time in the preset time domain, determining a second predicted use amount of computing power resource for communication services at the preset time from the first predicted use amount of computing power resource for communication services;
obtaining predicted idle computing power of the BBU at the preset time, based on the second predicted use amount of computing power resource for communication services, the resource constraint corresponding to the BBU and the initial scaling-out threshold;
determining a predicted error, based on a difference between predicted idle computing power at each preset time and actual idle computing power at each preset time in the preset time domain; and
in a case that the predicted error does not meet the preset ending condition, modifying the initial scaling-out threshold and taking the modified initial scaling-out threshold as an initial scaling-out threshold for a next optimization.
Optionally, the target application includes a Federated Learning application; and a corresponding initial local model is respectively deployed by each BBU in the second cluster; and
the second cluster performing a task includes:
performing at least one training operation on an initial aggregation model in the MEC server until a training ending condition is met, and taking the initial aggregation model meeting the training ending condition as a trained aggregation model,
wherein the training operations include:
acquiring the initial local model deployed by each BBU in the second cluster;
performing model aggregation on a plurality of initial local models to obtain a first aggregation model, and updating the initial aggregation model based on the first aggregation model; and
in a case that a loss function of the updated initial aggregation model does not meet the training ending condition, sending the updated initial aggregation model to each BBU in the second cluster respectively so that each BBU takes the updated initial aggregation model as an initial local model for a next training operation.
According to yet another aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided. The computer-readable storage medium stores computer programs which, when executed by a processor, implement the steps of the artificial intelligence-based data processing method.
The beneficial effects brought by the technical solutions provided by embodiments of the present disclosure are as follows:
In an application deployment stage, the predicted idle computing power respectively corresponding to at least one BBU during task execution time is obtained, the BBUs are clustered based on their respective predicted idle computing power to obtain at least one cluster, and at least one first cluster respectively corresponding to each application is determined based on the computing power requirement respectively corresponding to each application. Each application is thus allocated to a cluster matched with its computing power requirement, the idle computing power of the network devices in the computing power network is sufficiently utilized, reasonable allocation of idle computing power in the computing power network is achieved, and the utilization of the computing power resources in the computing power network is improved.
In a task execution stage, by acquiring at least one task of a target application, a second cluster matched with the task information of each task is determined from the at least one first cluster based on the task information respectively corresponding to each task, so that the second cluster performs the task based on the calculation computing power resource. No additional hardware device is required: computing power support is provided for the computing application by decoupling the idle computing power of a large number of BBUs from the communication service, and a flexible, low-cost computing power supply is realized.
Further, the second cluster matched with the task information of each task is determined based on the task information respectively corresponding to each task, so that different tasks can be allocated reasonably among clusters, the computing efficiency of each task is improved, and an efficient computing service can be provided.
To more clearly describe the technical solutions in the embodiments of the present disclosure, the accompanying drawings required for use in the description of the embodiments will be briefly described below.
The embodiments of the present disclosure will be described below in combination with the drawings of the present disclosure. It should be understood that the following embodiments described in combination with the drawings are exemplary descriptions to explain the technical solutions of the embodiments of the present disclosure, and do not constitute a limitation to the technical solutions of the embodiments of the present disclosure.
Those skilled in the art can understand that the singular forms such as “a”, “an” and “the” used herein may also include the plural forms unless otherwise stated. It should be further understood that the terms “including” and “comprising” used in the embodiments of the present disclosure mean that corresponding features may be implemented as the presented features, information, data, steps, operations, elements and/or components, but do not exclude implementation as other features, information, data, steps, operations, elements, components and/or combinations thereof supported by the technical field. It should be understood that when an element is referred to as being “connected” or “coupled” to another element, it may be directly connected or coupled to the other element, or may be connected to the other element through an intermediate element. In addition, “connected” or “coupled” used herein may include a wireless connection or wireless coupling. The term “and/or” used herein refers to at least one of the items defined by the term; for example, “A and/or B” may be implemented as “A”, implemented as “B”, or implemented as “A and B”.
To clarify the objective, technical solutions and advantages of the present disclosure, the embodiments of the present disclosure will be further described in detail below in conjunction with the accompanying drawings.
The technical solutions of the embodiments of the present disclosure and the technical effects generated by the technical solutions of the present disclosure are described below through the description of several exemplary embodiments. It is to be noted that the following implementations may refer to or learn from each other or be combined with each other, and the same terms, similar features and similar implementation steps in different implementations will not be repeated.
The 5G BBU separates idle computing power from the computing power of the communication service through a virtualizer based on hypervisor technology, so that idle computing power resources can be provided for various application services, for example, a Federated Learning (FL) application and a face recognition application, while the 3GPP 5G radio access network (RAN) network service continues to be provided.
As shown in
The 5G MEC performs idle computing power resource orchestration and task scheduling, and provides various application management capabilities, such as FL coordination; and the 5GC may provide a basic network service and a network computing power coordination service.
It should be noted that the embodiments of the present disclosure are described with respect to a 5th generation mobile communication technology (5G) computing power network system. Those skilled in the art will appreciate that, in other application scenarios, the solution may also be suitable for other mobile communication network systems, for example, a future network, that is, a new generation of network, such as a beyond 5G (B5G) or 6th generation mobile communication technology (6G) network.
Step S110: in response to at least one task request for a target application, at least one task respectively corresponding to the at least one task request is acquired.
Specifically, the computing power network system may include one MEC server and a plurality of BBUs. The artificial intelligence-based data processing method provided by the embodiment of the present disclosure may be performed by the MEC server, the plurality of BBUs, or a combination of the MEC server and the plurality of BBUs in the computing power network system.
The main function of the computing power network system is to realize mobile communication, but in addition to including the computing power for the communication service, the network device (for example, the BBU) may include extra idle computing power, which may also be referred to as native computing (NC).
For example, one BBU may include the computing power resources of 10 CPUs, where when 6 CPUs provide the computing power for the communication service, the computing power resources of the remaining 4 CPUs are idle. To fully utilize the idle computing power of the network device, a plurality of applications may be pre-deployed on a plurality of network devices by orchestration of the computing power resources, and the method for orchestration of the computing power resources will be described below in detail.
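The numeric example above reduces to a simple subtraction, sketched below for concreteness (the function name and CPU-count granularity are illustrative assumptions):

```python
# Minimal sketch of the idle computing power notion from the example above:
# a BBU's idle capacity is its total CPU resource minus the portion
# currently serving the communication service.

def idle_cpus(total_cpus, communication_cpus):
    return total_cpus - communication_cpus

# BBU with 10 CPUs, 6 of which serve the communication service:
idle = idle_cpus(10, 6)   # 4 CPUs of native computing (NC) available
```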
After the application deployment is completed, the MEC server may receive at least one task request for the target application, and acquire at least one task respectively corresponding to the at least one task request.
The target application may be any application of a plurality of applications deployed on a plurality of BBUs. For example, the target application may be an FL application, a face recognition application and the like. The target application may be specifically set according to the actual application scenario, which is not limited by the embodiment of the present disclosure.
Step S120: a second cluster respectively corresponding to each task is determined from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task, where the second cluster is a cluster matched with the task information of the task.
Specifically, when a plurality of task requests for the target application are received, it is necessary to schedule a plurality of tasks respectively corresponding to the plurality of task requests.
After at least one task respectively corresponding to at least one task request is obtained, the task information respectively corresponding to each task may be determined, where the task information may be information related to the task, and the task information may include a task requirement, conditions related to task restriction and the like.
The second cluster respectively corresponding to each task is determined from at least one first cluster based on the task information respectively corresponding to each task, where the second cluster may be a cluster matched with the task information of the task, and the second cluster may be configured to perform a corresponding task; and the first clusters may be clusters corresponding to the target application, and the target application may correspond to at least one first cluster.
The first clusters are determined based on the following operations:
Specifically, before Step S110 is performed, the MEC server may predict the predicted idle computing power of at least one BBU during the task execution time, where the task execution time may be execution time of a task corresponding to the target application, and a method for acquiring the predicted idle computing power will be described below in detail.
The calculation computing power resource corresponding to the predicted idle computing power is determined from the candidate computing power resources based on the predicted idle computing power. The calculation computing power resource and the computing power resource for communication services are separated by hypervisor technology, so that the calculation computing power resource is used to process the application service and the computing power resource for communication services is used to process the communication service. Because the two are separated, a stable computing power service can be provided by the calculation computing power resource.
After the predicted idle computing power respectively corresponding to each BBU is obtained, the BBUs may be clustered by computing power according to their respective predicted idle computing power; that is, BBUs with similar idle computing power are classified into one category, namely one computing power cluster. A plurality of computing power clusters are obtained by traversing all BBU devices.
One cluster includes at least one BBU, and clustering may be performed based on a grouping method, a K-means algorithm, or a K-medoids clustering algorithm. The specific method for BBU clustering is not limited by the present disclosure.
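As one of the clustering options mentioned above, a one-dimensional K-means over the predicted idle computing power values can be sketched as follows. The values, the number of clusters, and the centroid initialization are illustrative assumptions; a library implementation (e.g. scikit-learn's KMeans) could be used instead.

```python
# Illustrative 1-D K-means clustering of BBUs by predicted idle computing
# power, so that BBUs with similar idle power land in the same cluster.

def kmeans_1d(values, k, iters=50):
    # initialize centroids spread evenly across the value range (assumed)
    lo, hi = min(values), max(values)
    centroids = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # recompute each centroid as the mean of its assigned values
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return clusters

# predicted idle power (e.g. in CPU cores) of six BBUs (illustrative values)
idle_power = [4.0, 4.2, 3.9, 9.8, 10.1, 10.3]
clusters = kmeans_1d(idle_power, k=2)
```

The six BBUs split into a low-idle-power cluster and a high-idle-power cluster, which is the grouping the subsequent orchestration operates on.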
By clustering the plurality of BBUs based on their respective predicted idle computing power, BBUs with similar predicted idle computing power are divided into one cluster, so that the computing powers of the BBUs within a cluster are closer to each other. This avoids the “bucket effect” in resource allocation and is beneficial to the balance of subsequent resource allocation and to computing efficiency.
At least one application to be processed may be acquired after BBUs are clustered. The at least one application may be an application that is required to be processed by the idle computing power of the plurality of BBUs. The at least one application may include the target application.
For each application, the computing power requirement respectively corresponding to the application may be determined, the computing power resources are orchestrated based on the computing power requirement respectively corresponding to each application, and at least one first cluster respectively corresponding to each application is determined from the at least one cluster obtained through clustering, where the first clusters may be clusters matched with the computing power requirement of the application. The specific orchestration method for the computing power resources will be described below in detail.
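The requirement-to-cluster matching described above can be sketched as a simple coverage check: each application is mapped to every cluster whose supply level covers its computing power requirement. The scalar supply-level representation and the greedy rule are assumptions for illustration; the disclosed orchestration uses an orchestration model and a policy evaluation index.

```python
# Illustrative sketch of the first mapping relationship between
# applications and clusters. Scalar supply levels are an assumption.

def map_applications(app_requirements, cluster_supply):
    """Return {application: [matching first clusters]}."""
    return {
        app: [c for c, supply in cluster_supply.items() if supply >= need]
        for app, need in app_requirements.items()
    }

# computing power requirement per application (illustrative units)
app_requirements = {"federated_learning": 8.0, "face_recognition": 3.0}
# cluster feature: aggregate predicted idle power per cluster (assumed)
cluster_supply = {"cluster_a": 12.0, "cluster_b": 4.0}
mapping = map_applications(app_requirements, cluster_supply)
```

An application may thus correspond to more than one first cluster, which matches the claim language "at least one first cluster respectively corresponding to each application".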
Step S130: for each task, the task is allocated to a second cluster corresponding to the task so that the second cluster performs the task based on the calculation computing power resource.
Specifically, after task scheduling is completed, for each task, the task may be allocated to the second cluster corresponding to the task, and the second cluster may perform the task by using the calculation computing power resource in the second cluster.
In the embodiment of the present disclosure, in an application deployment stage, the predicted idle computing power respectively corresponding to at least one BBU during task execution time is obtained, the BBUs are clustered based on their respective predicted idle computing power to obtain at least one cluster, and at least one first cluster respectively corresponding to each application is determined based on the computing power requirement respectively corresponding to each application. Each application is thus allocated to a cluster matched with its computing power requirement, the idle computing power of the network devices in the computing power network is sufficiently utilized, reasonable allocation of idle computing power in the computing power network is achieved, and the utilization of the computing power resources in the computing power network is improved.
In a task execution stage, at least one task of a target application is acquired, and a second cluster matched with the task information of each task is determined from the at least one first cluster based on the task information respectively corresponding to each task, so that the second cluster performs the task based on the calculation computing power resource. Therefore, no additional hardware device is required, computing power support is provided for the computing application by decoupling the idle computing power of a large number of BBUs from the communication service, and flexible and low-cost computing power supply is realized.
Further, the second cluster matched with the task information of each task is determined based on the task information respectively corresponding to each task, so that different tasks can be allocated reasonably between clusters, the computing efficiency of each task is improved, and efficient computing service can be provided.
As an optional embodiment,
It should be noted that the plurality of clusters obtained through clustering are logically separated from each other, but are not necessarily physically separated from each other. That is, two clusters may include overlapping BBUs, as shown in
As an optional embodiment, the process of determining at least one first cluster respectively corresponding to each application from at least one cluster based on the computing power requirement respectively corresponding to each application includes:
determining the computing power requirement respectively corresponding to each application and a cluster feature respectively corresponding to each cluster, where the cluster feature is used to represent the computing power resource supply level of the cluster;
determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster; and
determining at least one first cluster respectively corresponding to each application based on the first mapping relationship.
Specifically, after a plurality of applications to be deployed are determined, the plurality of applications may be analyzed to obtain the computing power requirement respectively corresponding to each application. The computing power requirement may be used to represent the required computing power resource, for example, the number of CPU cores and the size of the memory.
A plurality of clusters are obtained after the plurality of BBUs are clustered. For each cluster, the cluster feature of the cluster may be acquired. For example, the cluster feature of the cluster may be obtained by performing feature extraction on the predicted idle computing power of the plurality of BBUs in the cluster, where the cluster feature may be used to represent the computing power resource supply level of the cluster.
The first mapping relationship between each application and each cluster may be determined based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster, where the first mapping relationship may be used to reflect a matched relationship between the application and the cluster, and one application may be matched with at least one cluster.
For each application, based on the first mapping relationship, at least one cluster matched with the application may be taken as at least one first cluster corresponding to the application.
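The matching between an application's computing power requirement and the supply levels of the clusters can be sketched as follows. This is an illustrative Python sketch only, not the disclosed orchestration model; the `Requirement` and `Cluster` types, the `first_clusters` function and the CPU/memory fields are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass

@dataclass
class Requirement:
    cpu_cores: int    # number of CPU cores required by the application
    memory_gb: float  # size of the memory required by the application

@dataclass
class Cluster:
    name: str
    cpu_cores: int    # predicted idle CPU cores supplied by the cluster
    memory_gb: float  # predicted idle memory supplied by the cluster

def first_clusters(req: Requirement, clusters: list[Cluster]) -> list[Cluster]:
    """Return every cluster whose supply level covers the requirement,
    i.e. the candidate first clusters of the application."""
    return [c for c in clusters
            if c.cpu_cores >= req.cpu_cores and c.memory_gb >= req.memory_gb]
```

As described above, one application may be matched with several clusters, so the function returns a list rather than a single cluster.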
As an optional embodiment, the process of determining the first mapping relationship between each application and each cluster based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster includes:
inputting the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster to an orchestration model to obtain a plurality of candidate orchestration policies output by the orchestration model;
determining a target orchestration policy from the plurality of candidate orchestration policies based on a first policy evaluation index; and
determining the first mapping relationship based on the target orchestration policy.
Specifically, to match each application with each cluster, an orchestration model may be established based on an orchestration algorithm, the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster are input into the orchestration model, and each application is respectively allocated to clusters through the orchestration model and based on the orchestration algorithm to obtain a plurality of candidate orchestration policies output by the orchestration model.
The orchestration algorithm includes but is not limited to a random algorithm, and the candidate orchestration policies may include allocation modes of allocating each application to each cluster.
For each of the candidate orchestration policies, a first policy evaluation index corresponding to the candidate orchestration policy is determined. The first policy evaluation index may be used to reflect the computing performance of the candidate orchestration policy, and the first policy evaluation index may be determined based on at least one of indexes, such as computing expected time, computing cost and computing energy consumption.
Based on the first policy evaluation index respectively corresponding to each of the candidate orchestration policies, an optimal candidate orchestration policy is determined from the plurality of candidate orchestration policies, and the optimal candidate orchestration policy is taken as a target orchestration policy.
Optionally, in a case that the first policy evaluation index includes at least one of computing expected time, computing cost and computing energy consumption, the candidate orchestration policy with the smallest numerical value of the first policy evaluation index may be taken as the optimal candidate orchestration policy.
For example, taking computing expected time as an example, wtij is the time of the application i running on the cluster j, and tmax=max{wtij} is the first policy evaluation index of the candidate orchestration policy. Based on this, the candidate orchestration policy with the smallest value of tmax among the plurality of candidate orchestration policies is taken as the optimal candidate orchestration policy, that is, the target orchestration policy.
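The tmax criterion above can be sketched in a few lines of Python; this is an illustrative sketch under the assumption that a candidate policy is a mapping from applications to clusters and that the running times wtij are known, with all names hypothetical.

```python
def t_max(policy, wt):
    """First policy evaluation index: the largest expected running time
    wt_ij over the (application i, cluster j) pairs of a candidate policy.
    policy: {application: cluster}; wt: {(application, cluster): time}."""
    return max(wt[(app, cluster)] for app, cluster in policy.items())

def select_target_policy(candidates, wt):
    """Take the candidate orchestration policy with the smallest t_max
    as the target orchestration policy."""
    return min(candidates, key=lambda p: t_max(p, wt))
```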
Optionally, in a case that the first policy evaluation index includes at least two of computing expected time, computing cost and computing energy consumption, taking computing expected time and computing cost as an example, in one example, filtering may be performed based on the computing expected time first, and then based on the computing cost. For example, a plurality of first candidate orchestration policies with the computing expected time less than a time threshold may be filtered from the plurality of candidate orchestration policies, and then the first candidate orchestration policy with the minimum computing cost is taken as the optimal candidate orchestration policy.
In another example, corresponding weights may be set respectively for the computing expected time and the computing cost, weighted calculation may be performed based on the computing expected time and the computing cost to obtain the first policy evaluation index, and the candidate orchestration policy with the smallest value of the first policy evaluation index is taken as the optimal candidate orchestration policy.
It should be noted that the above example does not constitute a limitation to a setting method for a first policy evaluation index, and the specific setting method for the first policy evaluation index is not limited by the embodiment of the present disclosure.
After the target orchestration policy is obtained, a first mapping relationship between each application and each cluster may be determined based on the target orchestration policy.
For example,
where C represents a cluster, C⃗(t, t+τ) represents the clustering result output by the clustering algorithm in a time interval τ starting from time t, such as Ci=[BBU1, BBU3, BBU4, . . . , BBUk, . . . ], i=1, 2, 3, . . . , M; and P is a data feature matrix or vector of C, P=[(u11, u12, . . . ), (u31, u32, . . . ), . . . ], where (u11, u12, . . . ) represents a predicted idle computing power feature vector of the BBU1.
The orchestration model is established based on the orchestration algorithm, and N requirements are allocated to M clusters through the orchestration model to obtain the orchestration policy. A requirement set is represented as R⃗(t, t+τ): {R1, R2, . . . , Rj, . . . , RN}, where Rj (j=1, 2, . . . , N) represents the computing power resource requirement of the jth application; a cluster set is {C1, C2, . . . , CM}, and the feature set corresponding thereto is represented as {P1, P2, . . . , PM}. There may be a plurality of clusters meeting the computing power resource requirement of one application.
The candidate orchestration policies are evaluated based on the first policy evaluation index so as to select the optimal orchestration policy. The first policy evaluation index is determined according to the computing expected time, the computing cost or the computing energy consumption of the application. Taking the computing expected time as an example, wtij is the time of the application i running on the cluster j, tmax=max{wtij} is the first policy evaluation index of the candidate orchestration policy, and the candidate orchestration policy with the smallest value of tmax is the target orchestration policy t. As shown in
the target orchestration policy t includes: R1 is allocated to the cluster C1, R2 is allocated to the cluster C3, and R3 is allocated to the cluster C2, so that the first mapping relationship between each application and each cluster can be determined based on the target orchestration policy t.
In the embodiments of the present disclosure, the computing power resources of each cluster are orchestrated based on the computing power resource requirement respectively corresponding to each application, and the requesting side is efficiently matched with the supplying side, so that reasonable allocation of clusters for different computing power requirements is achieved.
As an optional embodiment, the process of determining the second cluster respectively corresponding to each task from at least one first cluster based on the task information respectively corresponding to each task includes:
inputting task information respectively corresponding to each task and each of the first clusters into a scheduling model to obtain a plurality of candidate scheduling policies output by the scheduling model;
determining a target scheduling policy from the plurality of candidate scheduling policies based on a second policy evaluation index; and
determining the second cluster respectively corresponding to each task based on the target scheduling policy.
Specifically, to schedule a plurality of tasks of the target application, at least one first cluster corresponding to the target application may be determined, a scheduling model is established based on a scheduling algorithm, the task information respectively corresponding to each task and each of the first clusters respectively corresponding to the target application are input to the scheduling model, and each task is scheduled to each of the first clusters through the scheduling model based on the scheduling algorithm to obtain a plurality of candidate scheduling policies output by the scheduling model.
The scheduling algorithm includes but is not limited to a first in first out (FIFO) algorithm, an ant colony algorithm and the like. The candidate scheduling policy may include a scheduling mode for scheduling each task to each of the first clusters corresponding to the target application.
For each candidate scheduling policy, a second policy evaluation index of the candidate scheduling policy is determined. The second policy evaluation index may be used to reflect the computing performance of the candidate scheduling policy. The second policy evaluation index may be determined based on at least one of indexes such as computing expected time, computing cost and computing energy consumption.
Based on the second policy evaluation index respectively corresponding to each candidate scheduling policy, an optimal candidate scheduling policy is determined from a plurality of candidate scheduling policies, and the optimal candidate scheduling policy is taken as a target scheduling policy.
It should be noted that the specific setting of the second policy evaluation index may be referenced to the setting method for the first policy evaluation index mentioned above, which is not repeated herein.
After the target scheduling policy is obtained, for each task, the first cluster to which the task is scheduled to in the target scheduling policy may be taken as a second cluster corresponding to the task.
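The task scheduling stage described above can be sketched end to end: candidate scheduling policies are enumerated by assigning every task to one of the first clusters, each candidate is scored by the largest expected running time, and the best candidate directly gives the second cluster of each task. This is an illustrative sketch, not the disclosed scheduling model; exhaustive enumeration is only practical for small examples, and the runtime table `wt` is an assumption.

```python
import itertools

def schedule(tasks, clusters, wt):
    """Enumerate candidate scheduling policies (task -> cluster maps),
    score each by max expected running time, and return the best one.
    wt: {(task, cluster): expected running time}."""
    candidates = [dict(zip(tasks, assign))
                  for assign in itertools.product(clusters, repeat=len(tasks))]
    return min(candidates,
               key=lambda pol: max(wt[(t, pol[t])] for t in tasks))
```

For each task, the cluster it maps to in the returned policy is its second cluster.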
Firstly, the MEC server receives a plurality of task requests for a target application to obtain a task queue T⃗(t): {T1, T2, T3, . . . , Tn}, where Tj (j=1, 2, . . . , n) represents the jth task. Tj may be a single task vector, the specific form of Tj may be [tj1, tj2, tj3, . . . ], and tji represents indexes such as a task requirement and conditions related to task restriction. For example, tj1 represents the number of CPU cores required by the task, tj2 represents the size of the memory required by the task, and tj3 represents the task completion time.
A scheduling model is established based on a scheduling algorithm, and the n tasks in the task queue T⃗(t) are allocated by the scheduling model to be performed by the m clusters corresponding to the target application to obtain a plurality of candidate scheduling policies. The clusters are represented as {C1, C2, . . . , Cm}, and Ck represents the kth cluster.
The candidate scheduling policies are evaluated based on a second policy evaluation index so as to select the optimal scheduling policy. The second policy evaluation index is determined according to the computing expected time, the computing cost or the computing energy consumption of the task. Taking the computing expected time as an example, wt′ij is the time of the task i running on the cluster j, t′max=max{wt′ij} is the second policy evaluation index of the candidate scheduling policies, and the candidate scheduling policy with the smallest value of t′max is the target scheduling policy t′. As shown in
the target scheduling policy t′ includes: T1 is allocated to the cluster C1, T2 is allocated to the cluster C3, and T3 is allocated to the cluster C2. Taking the task T1 as an example, the second cluster corresponding to the task T1 may be determined as C1 based on the target scheduling policy t′, that is, T1 is allocated to C1, and T1 is performed through the calculation computing power resource of C1.
In the embodiments of the present disclosure, each task is scheduled based on the task information respectively corresponding to each task, so that clusters can be reasonably allocated for different tasks, thereby improving the computing efficiency of each task and providing an efficient computing service.
As an optional embodiment, the process of acquiring predicted idle computing power respectively corresponding to at least one base band unit (BBU) during task execution time includes:
for each BBU, acquiring a predicted use amount of computing power resource for communication services of the BBU during the task execution time; and
determining the predicted idle computing power of the BBU during the task execution time, based on the predicted use amount of computing power resource for communication services of the BBU, a resource constraint corresponding to the BBU and a scaling-out threshold.
Specifically, for each BBU, to predict the idle computing power of the BBU during the task execution time, the predicted use amount of computing power resource for communication services of the BBU during the task execution time may be acquired, where the predicted use amount of computing power resource for communication services may be a computing power resource consumed by communication during the task execution time. For example, the current utilization rate of the CPU is 30%, the use amount of the memory is 4.7 GB, and the determining approach for the predicted use amount of computing power resource for communication services will be described below in detail.
After the predicted use amount of computing power resource for communication services of the BBU is obtained, a separation threshold of the resource may be obtained through a computing power separation model based on the predicted use amount of computing power resource for communication services of the BBU, the resource constraint corresponding to the BBU and the scaling-out threshold, and the predicted idle computing power of the BBU during the task execution time may be determined after the threshold is sent to the BBU for execution. The calculation formula of the computing power separation model will be described below in detail.
Optionally, the value of the scaling-out threshold is not less than the maximum use amount of computing power for communication services within the jitter interval of the computing power for communication services.
In view of the above situation, in the embodiments of the present disclosure, the calculated predicted idle computing power may always be available by setting the scaling-out threshold, so that the situation that the actual computing power resources are insufficient due to an excessively high predicted idle computing power is avoided, and strong robustness is achieved in the scenarios of periodic changes and spikes (sudden fluctuations) of the computing power resources.
As an optional embodiment, the scaling-out threshold is determined based on the following operations:
determining an initial scaling-out threshold; and
performing at least one optimizing operation on the initial scaling-out threshold until a preset ending condition is met, and taking the initial scaling-out threshold meeting the preset ending condition as the scaling-out threshold.
The optimizing operation includes:
for each BBU, acquiring a current status of computing power resource for communication services and a historical status of computing power resource for communication services of the BBU;
determining a first predicted use amount of computing power resource for communication services in a preset time domain, based on the current status of computing power resource for communication services and the historical status of computing power resource for communication services;
for any preset time in the preset time domain, determining a second predicted use amount of computing power resource for communication services at the preset time from the first predicted use amount of computing power resource for communication services;
obtaining predicted idle computing power of the BBU at the preset time, based on the second predicted use amount of computing power resource for communication services, the resource constraint corresponding to the BBU and the initial scaling-out threshold;
determining a predicted error based on a difference between predicted idle computing power at each preset time and actual idle computing power at each preset time in the preset time domain; and
in a case that the predicted error does not meet the preset ending condition, modifying the initial scaling-out threshold and taking the modified initial scaling-out threshold as an initial scaling-out threshold for a next optimization.
Specifically, to determine an optimal scaling-out threshold, an initial scaling-out threshold may be determined first, for example, the initial scaling-out threshold may be determined according to experience or historical data.
After the initial scaling-out threshold is obtained, the idle computing power may be predicted through a computing power separation model based on the initial scaling-out threshold.
For each BBU, the MEC server may acquire the current resource use amount of communication computing power xi(t) of the BBU at the current time, and the current status of computing power resource for communication services x̂i(t) is obtained by a status estimation method according to the current resource use amount of communication computing power xi(t). The status estimation method includes but is not limited to maximum likelihood, maximum a posteriori, distributed Kalman filter, distributed particle filter, covariance consistency and other estimation algorithms.
Optionally, the calculation formula of the current status of computing power resource for communication services x̂i(t) is as follows:

x̂i(t)=xi(t)+ΔLi(t)

in the formula, xi(t) represents the computing power resource for communication services use amount of BBUi at the time t, ΔLi(t) represents the modified amount of the computing power resource for communication services used by BBUi at the time t, and x̂i(t) represents the status of computing power resource for communication services of BBUi at the time t.
After the current status of computing power resource for communication services is obtained, the predicted use amount of computing power resource for communication services at one or more times in the preset time domain may be obtained by a timing prediction algorithm based on the current status of computing power resource for communication services and the historical status of computing power resource for communication services, so that a first predicted use amount of computing power resource for communication services in the preset time domain is obtained.
The timing prediction algorithm includes but is not limited to an arithmetic average method, an exponential smoothing method, an autoregressive integrated moving average (ARIMA) model and the like; and the historical status of computing power resource for communication services may be acquired from a preset database.
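One of the timing prediction algorithms named above, simple exponential smoothing, can be sketched for a single BBU's communication computing power series. The smoothing factor `alpha` is an illustrative assumption; the disclosure does not fix a particular parameterization.

```python
def exponential_smoothing(history, alpha=0.5):
    """One-step-ahead predicted use amount from a non-empty list of
    historical status values, using simple exponential smoothing:
    level <- alpha * observation + (1 - alpha) * level."""
    level = history[0]
    for x in history[1:]:
        level = alpha * x + (1 - alpha) * level
    return level
```

Applying the predictor at each time in the preset time domain yields the first predicted use amount over that domain.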
For any preset time in the preset time domain, the predicted use amount of computing power resource for communication services at the preset time in the first predicted use amount of computing power resource for communication services is taken as a second predicted use amount of computing power resource for communication services. The predicted idle computing power of the BBU at the preset time is calculated by the computing power separation model based on the second predicted use amount of computing power resource for communication services, the resource constraint corresponding to the BBU and the initial scaling-out threshold.
Each BBU has a resource constraint R. R herein may be written in the form of a vector to represent different resources, such as CPU and memory.
In a case that the preset time domain includes K time periods, the formula of the computing power separation model for each time period [tk, tk+1] is as follows:

ui(t)=Ri−max{x̂i(tk), x̂i(tk+1), β}, t∈[tk, tk+1]

in the formula, ui(t) represents the predicted idle computing power of the ith BBU at the time t, Ri represents the resource constraint of the ith BBU, β is the initial scaling-out threshold, x̂i(t:t+τ) represents the first predicted use amount of computing power resource for communication services in the preset time domain [t, t+τ], x̂i(tk) represents the second predicted use amount of computing power resource for communication services at the time tk, and x̂i(tk+1) represents the second predicted use amount of computing power resource for communication services at the time tk+1.
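A separation step for one period can be sketched as follows. This rests on one plausible reading of the separation model, namely that the BBU reserves the larger of the predicted communication use amounts at the period endpoints and the scaling-out threshold, and treats the remainder of its resource constraint as idle; the function and parameter names are hypothetical.

```python
def predicted_idle(resource_constraint, beta, use_tk, use_tk1):
    """Predicted idle computing power of one BBU in a period [tk, tk+1].
    resource_constraint: total resource R_i of the BBU (e.g. CPU cores)
    beta: scaling-out threshold reserved against communication jitter
    use_tk, use_tk1: second predicted use amounts at the period endpoints."""
    reserved = max(use_tk, use_tk1, beta)   # never reserve less than beta
    return max(resource_constraint - reserved, 0.0)
```

Because the reservation is floored at β, the separated idle computing power stays available even when the communication prediction underestimates a jitter spike.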
Based on the above steps, the predicted idle computing power at each preset time in the preset time domain is obtained through calculation, and a predicted error is determined based on the difference between the predicted idle computing power at each preset time and the actual idle computing power at each preset time.
In a case that the predicted error does not meet the preset ending condition, the initial scaling-out threshold is modified, the modified initial scaling-out threshold is taken as an initial scaling-out threshold for the next optimization, the optimizing operation is repeated until the preset ending condition is met, and the initial scaling-out threshold meeting the preset ending condition is taken as the scaling-out threshold for actual prediction.
The preset ending condition may include the predicted error being less than a preset error threshold.
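The optimizing operation for the scaling-out threshold can be sketched as a simple loop that repeats until the preset ending condition is met. The step size, iteration cap and error model are illustrative assumptions; the disclosure does not prescribe a specific modification rule.

```python
def optimize_threshold(initial_beta, predict_error, error_threshold,
                       step=0.1, max_iters=1000):
    """Repeat the optimizing operation: evaluate the predicted error for
    the current threshold and, if the ending condition is not met,
    modify the threshold for the next optimization."""
    beta = initial_beta
    for _ in range(max_iters):
        err = predict_error(beta)       # predicted vs. actual idle power error
        if err < error_threshold:       # preset ending condition met
            return beta
        beta += step                    # modified threshold for next round
    return beta
```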
The initial scaling-out threshold is continuously modified by continuously performing the above optimizing operation, so that the finally obtained scaling-out threshold may be adaptively adjusted according to the change of the computing power for communication services and is as close as possible to the maximum of the computing power for communication services use amount in the computing power for communication services jitter interval, where the computing power for communication services jitter interval is shown in
The current status x̂i(t) of computing power resource for communication services is obtained by the status estimation algorithm. The current status and the historical status of computing power resource for communication services of BBUi at the current time t are processed by the timing prediction algorithm to obtain the first predicted use amount of computing power resource for communication services in the preset time domain [t, t+τ]. The predicted idle computing power ui(t) of BBUi at the time t is obtained based on the computing power separation model, and the predicted error z(t)=Σ_{t′=t}^{t+τ}|ui(t′)−u*i(t′)| is determined, where u*i(t′) represents the actual idle computing power at the time t′.
The idle computing power in the preset time domain is predicted by the computing power separation model based on the optimal scaling-out threshold. Through the perception of the computing power resource, that is, the prediction of the future available idle computing power resource, the future changing trend is predicted based on the historical available idle computing power resource. The common trends include: a long-term trend, a seasonal change, a cyclical fluctuation, an irregular fluctuation and the like.
It can be seen from the right block diagram in
As an optional embodiment,
after a user applies for a computing power resource from the MEC server, each BBU initiates a registration request of incorporating computing power into management, and the MEC completes the registration of the BBU incorporating computing power into management and responds with a success message.
The MEC initiates a computing power resource query request to the successfully registered BBU, and each BBU transmits the current and historical idle computing power resource statuses to the MEC.
For each BBU, the MEC performs future idle computing power resource prediction on the current and historical idle computing power resource statuses, and performs computing power resource separation on each BBU according to the prediction result.
After the MEC applies for computing power resource separation to each BBU, the BBU separates the BBU idle resource by a hypervisor technology and informs the MEC through a message, and the MEC automatically incorporates the BBU idle computing power into management for subsequent use by applications.
The MEC clusters the BBU nodes incorporated into management into clusters. Each cluster obtains the application orchestration result adapted to its computing power resource according to the computing power requirements of different applications, and obtains the task scheduling result adapted to the cluster according to the computing power requirements of the tasks.
The specific implementation process of each step may be referenced to the description of the corresponding embodiments above, and details are not repeated here.
In the embodiments of the present disclosure, the capability of an existing computing power network is enhanced by the idle computing power in the network device on the premise of not changing the basic function and architecture of the network, thereby providing more flexible and low-cost computing power supply for a computing application. Compared with an existing scheme, the network device in a computing native network (CNN) can have communication and calculation capabilities without adding an additional computing power hardware device, and has the advantages of ultralow time delay, ultrahigh reliability, ultrahigh security and a high performance-price ratio.
As an optional embodiment, the target application includes a Federated Learning application; and a corresponding initial local model is respectively deployed by each BBU in the second cluster; and
the second cluster performs a task, including:
performing at least one training operation on an initial aggregation model in the MEC server until a training ending condition is met, and taking the initial aggregation model meeting the training ending condition as a trained aggregation model.
The training operation includes:
acquiring the initial local model deployed for each BBU in the second cluster;
performing model aggregation on a plurality of initial local models to obtain a first aggregation model, and updating the initial aggregation model based on the first aggregation model; and
in a case that a loss function of the updated initial aggregation model does not meet the training ending condition, sending the updated initial aggregation model to each BBU in the second cluster respectively so that each BBU takes the updated initial aggregation model as an initial local model for a next training operation.
Specifically, the training or inference of the Federated Learning (FL) model may be calculated by the idle computing power of the plurality of BBUs in the computing power network. The FL is a method for performing machine learning in a distributed environment, which allows a plurality of devices or entities to jointly train or infer a model without transmitting original data to a centralized server, so that data privacy can be protected.
At least one first cluster matched with a Federated Learning application may be allocated for the Federated Learning application through computing power resource orchestration, and a second cluster corresponding to the Federated Learning task may be determined from the at least one first cluster through task scheduling. The computing power resource orchestration process and the task scheduling process of the Federated Learning application may be referenced to the above, which will not be repeated herein.
The embodiments of the present disclosure take the training task of the Federated Learning model as an example for detailed description. To perform the training task of the Federated Learning model, a corresponding initial local model may be deployed on each BBU in the second cluster. The process of performing the Federated Learning model training task by the second cluster may include:
determining an initial aggregation model in the MEC server, and performing the training operation on the initial aggregation model repeatedly until a preset training ending condition is met.
The training operation includes:
uploading, by each BBU in the second cluster, the corresponding initial local model to the MEC server; obtaining, by the MEC server, the plurality of initial local models; aggregating the plurality of initial local models to acquire a first aggregation model, and replacing the initial aggregation model with the first aggregation model; determining the loss function of the updated initial aggregation model; and in a case that the loss function of the updated initial aggregation model does not meet the training ending condition, e.g., the loss function of the updated initial aggregation model is not less than a preset threshold, sending, by the MEC server, the updated initial aggregation model (i.e., the first aggregation model) to each BBU in the second cluster respectively. Each BBU may update its initial local model based on the updated initial aggregation model, that is, the updated initial aggregation model is taken as a new initial local model for the next training operation.
The above training operation is performed repeatedly until the loss function of the updated initial aggregation model meets the training ending condition, that is, until the loss function is less than the preset threshold. The initial aggregation model meeting the training ending condition is taken as the trained aggregation model; meanwhile, for each BBU, the initial local model at the end of training is taken as the trained local model.
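The training operation above follows the general pattern of federated averaging. A minimal sketch with toy parameter vectors (the simple mean aggregation and the quadratic loss here are illustrative assumptions; the disclosure does not fix a specific aggregation rule or loss function):

```python
import numpy as np

def aggregate(local_models):
    # MEC server: aggregate the local model parameters uploaded by the BBUs
    # (plain element-wise mean, a hypothetical aggregation rule)
    return np.mean(local_models, axis=0)

def training_round(local_models, loss_fn, threshold):
    # One training operation: aggregate, update the aggregation model,
    # then check the training ending condition (loss below the preset threshold)
    aggregated = aggregate(local_models)
    loss = float(loss_fn(aggregated))
    done = loss < threshold
    return aggregated, done

# Two BBUs with toy 2-parameter local models
models = [np.array([1.0, 2.0]), np.array([3.0, 4.0])]
agg, done = training_round(models, loss_fn=lambda m: np.sum(m ** 2), threshold=100.0)
```

If `done` is false, the MEC would send `agg` back to each BBU as the new initial local model for the next round, as described above.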
It should be noted that those skilled in the art may know that the method provided by the embodiments of the present disclosure may be suitable for the training and inference of the Federated Learning model, and may also be suitable for the training and inference of other distributed machine learning models.
In the embodiments of the present disclosure, the Federated Learning application is processed by the idle computing power of the plurality of BBUs in the computing power network, thereby ensuring that local data is not propagated externally, realizing multi-party joint model training and inference that meets the data privacy protection requirement, and breaking down "data islands".
As an optional embodiment, an FL server and clients are deployed on an MEC node and a selected BBU cluster, thereby providing a computing-native FL service.
When a user sends an FL service request to the MEC server, each BBU initiates a registration request for incorporating its computing power and data features into management, and the MEC completes the registration of the BBU and responds with a success message. The MEC initiates a computing power resource and data feature query request to each successfully registered BBU, and each BBU sends its current and historical idle computing power resource and data feature statuses to the MEC. For each BBU, the MEC predicts the idle computing power resource during the FL task execution time according to the current and historical idle computing power resource statuses, and performs computing power resource separation on the BBUs according to the prediction result.
After the MEC applies for computing power resource separation to each BBU, the BBU separates its idle computing power resource through hypervisor technology and informs the MEC through a message, and the MEC automatically incorporates the BBU idle computing power into management for subsequent use by applications. The MEC clusters the BBU nodes incorporated into management into a cluster 1 and a cluster 2. Each computing power cluster uploads its computing power resource and data feature to the MEC, and the MEC allocates the FL application to the cluster 1 through FL computing power resource orchestration.
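The grouping of managed BBU nodes into cluster 1 and cluster 2 by predicted idle computing power can be sketched with a naive one-dimensional two-means split (a hypothetical choice; the disclosure does not mandate a specific clustering algorithm):

```python
def cluster_bbus(idle_power, k=2, iters=10):
    """Naive 1-D k-means over predicted idle computing power values."""
    # Spread the initial centers across the sorted values
    ordered = sorted(idle_power)
    centers = ordered[:: max(1, len(ordered) // k)][:k]
    groups = [[] for _ in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for x in idle_power:
            # Assign each BBU to its nearest cluster center
            j = min(range(k), key=lambda c: abs(x - centers[c]))
            groups[j].append(x)
        # Move each center to the mean of its group (keep it if the group is empty)
        centers = [sum(g) / len(g) if g else centers[j] for j, g in enumerate(groups)]
    return groups

# Four BBUs with predicted idle power (hypothetical units): two clear clusters
groups = cluster_bbus([1.0, 2.0, 9.0, 10.0])
```

With these toy values, the low-idle BBUs end up in one cluster and the high-idle BBUs in the other, mirroring the cluster 1 / cluster 2 split described above.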
After the MEC server establishes a scheduling model for the FL task and the cluster 1 and determines a scheduling policy, the cluster 1 is selected in combination with a FIFO scheduling algorithm and the FL task is performed. The time t′j of the FL task running on cluster j is calculated, and t′max = max{w·t′j} is taken as the evaluation of the scheduling scheme; the optimizing target is to meet t′max < t0, where t0 is a specified optimizing target, and an optimal scheduling scheme is finally obtained.
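The evaluation of a scheduling scheme described above can be sketched as follows (interpreting w as a per-cluster weight on the running time, which is an assumption; the disclosure does not define w further):

```python
def evaluate_schedule(run_times, weights, t0):
    # t'_max = max over clusters j of w_j * t'_j;
    # the scheme meets the optimizing target if t'_max < t0
    t_max = max(w * t for w, t in zip(weights, run_times))
    return t_max, t_max < t0

# Two candidate clusters with hypothetical running times and weights
t_max, ok = evaluate_schedule(run_times=[4.0, 6.0], weights=[1.0, 0.5], t0=5.0)
```

A scheduler would compute this evaluation for each candidate scheduling scheme and keep the one with the smallest t′max that satisfies t′max < t0.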
When a model training task is performed, the BBU in the cluster 1 uploads local model information, the MEC performs model aggregation, the aggregation model is sent to the BBU, the BBU updates the local model, and the above steps are repeated until the FL task requirement is met, for example, the loss function is less than the preset threshold, and finally, the MEC returns the calculation result to the user.
In the embodiments of the present disclosure, native computing is combined with Federated Learning to design a computing-native system architecture based on FL, and FL model training or inference is performed through the coordination of the idle computing power resources of the BBUs, so that the utilization rate of the computing power resources is increased and an efficient computing service is provided.
The embodiments of the present disclosure further provide an artificial intelligence-based data processing apparatus, including:
a task acquisition module 210, configured to, in response to at least one task request for a target application, acquire at least one task respectively corresponding to the at least one task request;
a task scheduling module 220, configured to determine a second cluster respectively corresponding to each task from at least one first cluster corresponding to the target application, based on task information respectively corresponding to each task, wherein the second cluster is a cluster matched with task information of the task;
a task performing module 230, configured to, for each task, allocate the task to the second cluster corresponding to the task so that the second cluster performs the task based on a calculation computing power resource,
wherein the first clusters are determined based on the following operations:
acquiring predicted idle computing power respectively corresponding to at least one base band unit (BBU) during task execution time, and determining the calculation computing power resource corresponding to the predicted idle computing power from candidate computing power resources;
clustering each BBU based on predicted idle computing power respectively corresponding to each BBU to obtain at least one cluster;
acquiring at least one application to be processed, wherein the at least one application includes the target application; and
determining at least one first cluster respectively corresponding to each application from the at least one cluster, based on a computing power requirement respectively corresponding to each application, wherein the first clusters are clusters matched with the computing power requirement of the application.
As an optional embodiment, the apparatus further includes a first cluster determining module, configured for:
determining the computing power requirement respectively corresponding to each application and a cluster feature respectively corresponding to each cluster, wherein the cluster feature is used to represent the computing power resource supply level of the cluster;
determining a first mapping relationship between each application and each cluster, based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster; and
determining at least one first cluster respectively corresponding to each application based on the first mapping relationship.
As an optional embodiment, when determining a first mapping relationship between each application and each cluster based on the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster, the first cluster determining module is specifically configured for:
inputting the computing power requirement respectively corresponding to each application and the cluster feature respectively corresponding to each cluster to an orchestration model to obtain a plurality of candidate orchestration policies output by the orchestration model;
determining a target orchestration policy from the plurality of candidate orchestration policies based on a first policy evaluation index; and
determining the first mapping relationship based on the target orchestration policy.
As an optional embodiment, the task scheduling module is specifically configured for:
inputting the task information respectively corresponding to each task and each of the first clusters to a scheduling model to obtain a plurality of candidate scheduling policies output by the scheduling model;
determining a target scheduling policy from the plurality of candidate scheduling policies based on a second policy evaluation index; and
determining the second cluster respectively corresponding to each task based on the target scheduling policy.
As an optional embodiment, the apparatus further includes a computing power prediction module, configured to:
acquire predicted idle computing power respectively corresponding to at least one base band unit (BBU) during task execution time.
The computing power prediction module is specifically configured for:
for each BBU, acquiring a predicted use amount of computing power resource for communication services of the BBU during the task execution time; and
determining the predicted idle computing power of the BBU during the task execution time, based on the predicted use amount of computing power resource for communication services of the BBU, a resource constraint corresponding to the BBU and a scaling-out threshold.
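The relation among the three inputs named above can be sketched as follows. The exact formula is an assumption: the disclosure states that the predicted idle computing power is determined from the predicted communication-service usage, the resource constraint, and the scaling-out threshold, but does not give the arithmetic, so a simple subtractive model with the threshold acting as a reserve is used here for illustration:

```python
def predicted_idle_power(predicted_use, resource_constraint, scaling_out_threshold):
    # Hypothetical relation: idle power is whatever the resource constraint leaves
    # after the predicted communication-service usage plus a scaling-out reserve,
    # clamped at zero so a fully loaded BBU contributes no idle power.
    return max(0.0, resource_constraint - predicted_use - scaling_out_threshold)

# A BBU with a 100-unit constraint, 60 units predicted for communication
# services, and a 15-unit scaling-out reserve (all units hypothetical)
idle = predicted_idle_power(predicted_use=60.0,
                            resource_constraint=100.0,
                            scaling_out_threshold=15.0)
```

The scaling-out threshold keeps a safety margin so that a sudden rise in communication traffic does not immediately collide with computing tasks running on the separated idle resource.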
As an optional embodiment, the apparatus further includes a scaling-out threshold determining module, configured for:
determining an initial scaling-out threshold; and
performing at least one optimizing operation on the initial scaling-out threshold until a preset ending condition is met, and taking the initial scaling-out threshold meeting the preset ending condition as the scaling-out threshold,
wherein the optimizing operations include:
for each BBU, acquiring current status information of computing power resource for communication services and historical status information of computing power resource for communication services of the BBU;
determining a first predicted use amount of computing power resource for communication services in a preset time domain, based on the current status information of computing power resource for communication services and the historical status information of computing power resource for communication services;
for any preset time in the preset time domain, determining a second predicted use amount of computing power resource for communication services at the preset time from the first predicted use amount of computing power resource for communication services;
obtaining predicted idle computing power of the BBU at the preset time, based on the second predicted use amount of computing power resource for communication services, the resource constraint corresponding to the BBU and the initial scaling-out threshold;
determining a predicted error, based on a difference between predicted idle computing power at each preset time and actual idle computing power at each preset time in the preset time domain; and
in a case that the predicted error does not meet the preset ending condition, modifying the initial scaling-out threshold and taking the modified initial scaling-out threshold as an initial scaling-out threshold for a next optimization.
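The optimizing operation above can be sketched as an iterative loop. The error metric (mean absolute error), the fixed adjustment step, and the direction rule are hypothetical choices; the disclosure only requires that the threshold be modified until the predicted error meets a preset ending condition:

```python
def optimize_threshold(initial_threshold, predict_idle, actual_idle, times,
                       max_error=1.0, step=0.5, max_iters=100):
    """Repeat the optimizing operation until the prediction error meets the
    preset ending condition, adjusting the scaling-out threshold each round."""
    threshold = initial_threshold
    for _ in range(max_iters):
        preds = [predict_idle(t, threshold) for t in times]
        error = sum(abs(p - a) for p, a in zip(preds, actual_idle)) / len(times)
        if error <= max_error:  # preset ending condition met
            return threshold
        # If we systematically over-predict idle power, reserve more
        # (raise the threshold); otherwise reserve less
        bias = sum(p - a for p, a in zip(preds, actual_idle)) / len(times)
        threshold += step if bias > 0 else -step
    return threshold

# Hypothetical BBU: 100-unit constraint, 60 units used by communication
# services, so the actual idle power observed at every preset time is 25
result = optimize_threshold(
    initial_threshold=10.0,
    predict_idle=lambda t, th: 100.0 - 60.0 - th,
    actual_idle=[25.0, 25.0, 25.0],
    times=[0, 1, 2],
)
```

Starting from a threshold of 10, the loop raises the threshold in 0.5-unit steps until the prediction error drops within the allowed bound.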
As an optional embodiment, the target application includes a Federated Learning application; and a corresponding initial local model is respectively deployed by each BBU in the second cluster; and
the second cluster in the task performing module performs the task, for:
performing at least one training operation on an initial aggregation model in the MEC server until a training ending condition is met, and taking the initial aggregation model meeting the training ending condition as a trained aggregation model,
wherein the training operations include:
acquiring the initial local model deployed by each BBU in the second cluster;
performing model aggregation on a plurality of initial local models to obtain a first aggregation model, and updating the initial aggregation model based on the first aggregation model; and
in a case that a loss function of the updated initial aggregation model does not meet the training ending condition, sending the updated initial aggregation model to each BBU in the second cluster respectively so that each BBU takes the updated initial aggregation model as an initial local model for a next training operation.
The apparatus provided by the embodiments of the present disclosure may perform the method provided by the embodiments of the present disclosure, and the implementation principle is similar. The action performed by each module in the apparatus provided by the embodiments of the present disclosure corresponds to the steps in the method provided by each embodiment of the present disclosure. The detailed functional description of each module of the apparatus may be referenced to the description of the corresponding method mentioned above, which will not be repeated herein.
The embodiments of the present disclosure provide an electronic device, including a memory, a processor and computer programs stored in the memory. The processor executes the computer programs to implement the steps of the artificial intelligence-based data processing method. Compared with the prior art, in the embodiments of the present disclosure, in an application deployment stage, the predicted idle computing power respectively corresponding to at least one BBU during task execution time is obtained, each BBU is clustered based on its predicted idle computing power to obtain at least one cluster, and at least one first cluster respectively corresponding to each application is determined based on the computing power requirement respectively corresponding to each application, so that each application is allocated to a cluster matched with its computing power requirement, the idle computing power of the network devices in the computing power network is utilized sufficiently, reasonable allocation of the idle computing power in the computing power network is achieved, and the utilization of the computing power resources in the computing power network is improved. In a task execution stage, at least one task of a target application is acquired, and a second cluster matched with the task information of each task is determined from the at least one first cluster based on the task information respectively corresponding to each task, so that the second cluster performs the task based on the calculation computing power resource; no additional hardware device is required, computing power support is provided for the computing application by decoupling the idle computing power of a large number of BBUs from the communication service, and flexible and low-cost computing power supply is realized.
Further, the second cluster matched with the task information of each task is determined based on the task information respectively corresponding to each task, so that different tasks can be allocated reasonably between clusters, the computing efficiency of each task is improved, and efficient computing service can be provided.
In an optional embodiment, an electronic device is provided. As shown in
The processor 4001 may be a central processing unit (CPU), a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof. The processing unit may implement or execute various illustrative logical blocks, modules, and circuits described in combination with contents disclosed in the present disclosure. The processor 4001 may also be a combination for realizing calculation functions, for example, the processor 4001 may include one or more microprocessor combinations, a combination of a DSP and a microprocessor and the like.
A bus 4002 may include a pathway for transmitting information between the above components. The bus 4002 may be a peripheral component interconnect (PCI) bus or an extended industry standard architecture (EISA) bus and the like. The bus 4002 may be divided into an address bus, a data bus, and a control bus, etc. For ease of representation, only one thick line is shown in
The memory 4003 may be a read only memory (ROM) or other type of static storage device that may store static information and instructions, a random access memory (RAM) or other type of dynamic storage device that may store information and instructions, an electrically erasable programmable read only memory (EEPROM), a compact disc read only memory (CD-ROM) or other optical disk storage (including compact disk, laser disk, optical disk, digital versatile disk, Blu-ray disk and the like), magnetic disk storage media or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited thereto.
The memory 4003 is configured to store computer programs for executing the embodiments of the present disclosure, and the execution is controlled by the processor 4001. The processor 4001 is configured to execute computer programs stored in the memory 4003 to implement the steps shown in the foregoing method embodiments.
Embodiments of the present disclosure provide a computer-readable storage medium. The computer-readable storage medium stores computer programs. When the computer programs are executed by a processor, the steps of the method embodiments and the corresponding content may be implemented.
The terms “first”, “second”, “third”, “fourth”, “1st”, “2nd” and the like (in a case that they are present) in the specification, the claims and the drawings of the present disclosure are used to distinguish between similar objects and not necessarily to describe a specific sequence or order. It should be understood that the data so used are interchangeable under appropriate circumstances, such that the embodiments of the present disclosure described herein can be implemented in a sequence other than those illustrated or described herein.
It should be understood that although the steps in the flowchart of the embodiments of the present disclosure are sequentially indicated by the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated in the present disclosure, the steps in each flowchart may be performed in other sequences according to requirements in some implementation scenarios of the embodiments of the present disclosure. Moreover, depending on the actual implementation scenario, some or all of the steps in each flowchart may include a plurality of sub-steps or a plurality of stages. Part or all of the sub-steps or stages may be performed at the same time, and each of these sub-steps or stages may also be performed at a different time. In scenarios with different execution times, the execution order of these sub-steps or stages may be flexibly configured according to requirements, which is not limited in the embodiments of the present disclosure.
The foregoing descriptions are merely some implementations of the present disclosure. It should be noted that, to a person of ordinary skill in the art, without departing from the technical concept of the solutions of the present disclosure, the use of other similar implementation means based on the technical concept of the present disclosure also belongs to the protection scope of the embodiments of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202410177101.X | Feb 2024 | CN | national |