This application is based upon and claims priority to Chinese Patent Application No. 202310012159.4, filed on Jan. 5, 2023, the entire contents of which are incorporated herein by reference.
The present disclosure relates to the technical field of federated learning (FL), and in particular, to a semi-federated learning (semiFL) method based on a next-generation multiple access (NGMA) technology.
With rapid development and integration of machine learning (ML) and wireless communication, massive distributed devices can generate a large amount of real-time information and multimodal data. In a large-scale wireless Internet of Things (IoT) scenario, scarce spectrum resources lead to a communication bottleneck. In addition, the massive distributed devices have different computing capabilities. These factors cause intelligent IoT that relies upon traditional ML technology to face severe challenges.
Although FL can significantly reduce communication overheads and training time of traditional centralized learning (CL), the distributed characteristic of FL compromises training accuracy of a model. In addition, an important feature of FL different from CL is that powerful computing resources of a base station (BS) are not easily used for model training in FL, user data is locally stored, and all nodes perform model training on local devices. However, an actual large-scale wireless IoT scenario cannot meet this requirement because devices have heterogeneous computing capabilities and it is difficult for devices with weak computing capabilities to cooperate with devices with strong computing capabilities to train a shared model. The foregoing limitations of IoT make existing ML paradigms (such as CL and FL) inefficient when the paradigms are directly combined with traditional communication technologies. Therefore, it is highly desirable to develop a new learning-oriented network technology for high-efficient model training in wireless IoT.
In addition, it is necessary to realize that communication and computing of devices in a system require considerable energy. IoT devices with limited battery capacity are difficult to support normal operation of a distributed system for a long time. Furthermore, some devices may be deployed in inaccessible or dangerous locations, making periodic charging of the batteries very difficult. Therefore, it is very important to design a high-efficient power control strategy to prolong a life cycle of a wireless IoT network.
To resolve a problem that intelligent performance of a network edge is reduced due to heterogeneous computing capabilities of devices and limited resources in an existing intelligent IoT scenario, the present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of a global model. In addition, a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) is deployed to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users and computing-centric FL users can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of a BS, and improves accuracy of the global model. The method provided in the present disclosure also integrates a strategy for jointly optimizing user power allocation and a configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.
To achieve the foregoing objective, the present disclosure provides the following technical solutions:
The present disclosure provides a semiFL method based on an NGMA technology, including the following steps:
Further, the NGMA technology in S4 provides services for all users in a same frequency band in a non-orthogonal manner such that all users are capable of communicating in parallel on a same time-frequency resource.
Further, the STAR-RIS deployed in S4 modifies an amplitude and a phase of an incident signal to reshape a wireless transmission environment and adjust channel gains of different users.
Further, S5 specifically includes:
Further, before each round of communication, user transmit power allocation and a configuration of the STAR-RIS are jointly optimized with an objective of minimizing total transmit power consumption of the round, and an optimization problem and constraints are constructed as follows:
Further, the optimization problem is decoupled into two subproblems. Alternating optimization is performed on the transmit power {pn} of the user and the configuration {Θu} of the STAR-RIS of the user.
Further, during the alternating optimization, when {Θu} is fixed, for the {pu} subproblem, the constraints are rewritten by using an uplink communication rate expression
of the CL user and a computation distortion expression
of the FL user, to equivalently express the user power allocation subproblem.
For a transformed expression, power allocation {pk} of the FL users is fixed, and the following closed-form expression of optimal power allocation {p*n} of the CL users is derived through mathematical induction:
Power allocation {pn} of the CL users is fixed, {circumflex over (p)}k=√{square root over (pk)}, the optimization problem is reorganized, and the following closed-form expression of optimal power allocation {p*k} of the FL users is obtained through a Lagrange duality method:
Further, during the alternating optimization, when {pn} is fixed, the {Θn} subproblem is a feasibility check problem and is expressed as follows:
are introduced. A joint uplink channel coefficient is rewritten. The subproblem is further expressed. Qu±0, Diag(Qn)=βu, a non-convex rank-one constraint rank(Qu)=1 exists, and a transformed expression also has a binary variable.
∥Qu∥0−∥Qu∥2=0, ∀u∈U and βmχ−(βmχ)2=0, ∀χ∈{R, T}, ∀m∈M are introduced to transform the non-convex rank-one constraint and the binary variable into penalty terms in an objective function. Because the penalty terms are non-convex, convex upper bounds of the penalty terms are obtained through first-order Taylor expansion in an th iteration as follows:
The convex upper bounds are introduced to the objective function as penalty functions to obtain a convex semidefinite programming (SDP) problem.
Further, solving the convex SDP problem includes: continuously updating penalty factors η1 and η2 of the penalty terms, and solving the SDP problem through an iterative method until the penalty terms satisfy a predefined maximum violation or a predefined maximum quantity of outer iterations is reached.
Further, performing alternating optimization on the user power allocation subproblem and the STAR-RIS configuration subproblem specifically includes: initializing {pu[0]}, {Qu[0]}, {βu[0]}, and preset accuracy Ò3; and setting a current iteration index 3=0, given {Qu[3]} and {βu[3]}, computing {pu[3+1]} by using a closed-form expression of optimal user power allocation, given {pu[3+1]}, updating {Qu[3+1]} and {Bu[+1]} through a penalty-based successive convex approximation (SCA) method, updating 3=3+1, and repeating the foregoing process until a value of an objective function decreases to the preset accuracy or a preset maximum quantity L3 of iterations is reached.
Compared with the prior art, the present disclosure has the following beneficial effects:
The present disclosure provides the semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of the global model. During uplink transmission, channel conditions of users are flexibly adjusted through the STAR-RIS to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users with weak local computing capabilities and computing-centric FL users with strong local computing capabilities can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of the BS, and improves accuracy of the global model.
In a large-scale wireless IoT scenario, an underlying device may face a dilemma of limited battery capacity and inconvenient periodic charging, which seriously affects a life cycle of an entire system. To resolve this problem, the present disclosure constructs a mixed integer non-linear programming problem for jointly optimizing power allocation and the configuration of the STAR-RIS with an objective of minimizing user transmit power consumption. The proposed non-convex optimization problem can be decoupled into two subproblems. For the user power allocation subproblem, the closed-form expressions of the optimal power allocation can be derived through mathematical induction and the Lagrange duality method. For the STAR-RIS configuration subproblem, the original feasibility check problem can be transformed into the convex SDP problem through a penalty function method and the SCA method, and the SDP problem is solved through CVX. In summary, the method provided in the present disclosure integrates a strategy for jointly optimizing the user power allocation and the configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.
To describe the technical solutions in embodiments of the present application or in the prior art more clearly, the following briefly describes the accompanying drawings required for describing the embodiments. Apparently, the accompanying drawings in the following description show merely some embodiments of the present disclosure, and persons of ordinary skill in the art may still derive other accompanying drawings from these accompanying drawings.
The present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities in a large-scale wireless IoT scenario can also participate in training of a global model. During uplink transmission, a channel environment of a user is flexibly adjusted through a STAR-RIS such that CL users with weak computing capabilities and FL users with strong computing capabilities can communicate in parallel on a shared time-frequency resource. Then, the present disclosure studies how to minimize total user transmit power consumption, constructs a mixed integer non-linear programming problem for jointly optimizing user power allocation and a configuration of the STAR-RIS, and obtains an optimal suboptimal solution through alternating optimization. Specifically, for the user power allocation subproblem, closed-form expressions of optimal power are obtained through mathematical induction and a Lagrange duality method. For the STAR-RIS configuration subproblem, a feasibility check problem is transformed into a convex SDP problem through a penalty function method and an SCA method.
To better understand the technical solutions, the following describes in detail a method in the present disclosure with reference to the accompanying drawings. Apparently, the described embodiments are merely some rather than all of the embodiments of the present disclosure. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
Referring to
A STAR-RIS-assisted wireless network that supports collaborative learning for heterogeneous users is considered, as shown in
A channel gain of a wireless link can be obtained by multiplying a path loss by small-scale fading. Specifically, a link distance from the STAR-RIS to the BS and the user is denoted as Λu, ∀u∈Ū≙{0}∪U. u=0) represents the BS. u∈U represents the user. Large-scale fading is denoted as Lu=ζ0(Λu)−α, ∀u∈Ū≙{0}∪U. ζ0 is a path loss at a reference distance of 1 m. α≥2 is a path loss index. For small-scale fading, the present disclosure assumes that a channel between the user and the BS is subject to Rayleigh fading due to blocking and extensive scattering. Because the STAR-RIS is deployed at height, it can be assumed that all RIS-related links are subject to Ricean fading. Therefore, the present disclosure can represent a channel coefficient of all RIS-related links as follows:
κ is a Ricean factor. ruLoS∈M is a deterministic line-of-sight channel component. ruNLoS∈M is a Rayleigh fading channel component. The STAR-RIS divides effective coverage of the BS into refraction space and reflection space. In a system in which the STAR-RIS is deployed, a joint channel coefficient of an up link from the uth user to the BS is as follows:
hu∈ represents a direct link from the uth user to the BS. r0HΘuru represents a double fading reflection/refraction link provided by the STAR-RIS. Specifically, if the uth user is located in the reflection space, Θu=diag(qR). If the uth user is located in the refraction space, Θu=diag(qT). The present disclosure assumes that instantaneous CSI of all channels is available at the BS.
Referring to
In each round of communication, the CL user maps the local data set {dn} to the communication symbol {sn} and the FL user maps and the gradient {gk} to the computation symbol {sk} first. The NGMA technology provides services for all users in a same frequency band in a non-orthogonal manner such that the users can concurrently perform uplink communication. The communication symbol {sn} is transmitted through a power domain NOMA technology. The computation symbol {sk} is transmitted through the AirComp technology. Therefore, a superimposed signal received by the BS is as follows:
pn (pk) is transmit power of the nth (kth) user. z0˜CN (0,σ2) is additive white Gaussian noise (AWGN) at the BS. The present disclosure assumes that all symbols {su}={sn}∪{sk} are statistically independent and have zero mean and normalized variance, that is, E[su2]=1, ∀u∈U and E[suHsν]=0, ∀u≠#ν∈U.
With the help of the STAR-RIS, it can be ensured that parallel communication between the two groups of users is smooth even if some direct links between the BS and the users are blocked. The CL users and the FL users have different communication objectives and performance indicators. For example, the communication-centric CL users expect that the local data sets sent to the BS can be perfectly decoded by the BS with an objective of maximizing a data transmission rate. The computing-centric FL users expect that local model parameters sent to the BS can be carefully aggregated by the BS with an objective of minimizing a computation distortion MSE. Considering the plurality of objectives involved in NGMA, the present disclosure expects to improve throughput of the CL users by suppressing interference, while completing weighting computation of the model parameters of the FL users by utilizing the superimposition characteristic of channels. Therefore, there is a need to design a collaborative transmission mechanism with an efficient interference management capability in such a joint communication and learning framework.
Heterogeneous users are classified into strong users and weak users based on different path losses. The BS separates the superimposed signal through SIC to perfectly decode a signal from the strong users. The strong signal is modulated and subtracted from the received superimposed signal V to obtain a signal from the weak users. This leaves the weak signal for collaborative over-the-air computation. To implement the foregoing process, the present disclosure needs to reshape a wireless transmission environment by modifying an amplitude and a phase of an incident signal through the STAR-RIS such that channel gains of the two groups of users are adjusted in the following order:
Specifically, a channel coefficient is adjusted through the STAR-RIS to arrange all CL users as strong users for communication message decoding and all FL users as weak users for model aggregation. Based on the superimposed signal and a decoding order, an uplink communication rate of the nth CL user is as follows:
B is available bandwidth of the system.
The communication symbols {sn} sent by the CL users may be decoded and modulated by the BS, and subtracted from the superimposed signal y received by the BS to obtain a residual signal
that contains gradient information of the models of the FL users. For aggregation of the FL models, linear computation output estimated by the BS is as follows:
A computation distortion may be quantified by a mean square error (MSE) as follows:
Specifically, referring to
is an objective function used to train a model parameter w∈Q, and ƒ(w;Dn(i)) is a loss function of the model with respect to the ith sample Dn(i) of the nth CL user.
Then, the BS subtracts the decoded information symbols of the CL users from the received superimposed signal to obtain the residual signal
that contains model information of the FL users, performs averaging on the residual signal, restores the local gradients {gk} of the FL users from the computation symbols {sk}, and finally obtains an estimated average gradient of the FL users as follows:
Finally, after obtaining the gradients (
The global model is updated by using w:=w−λ{tilde over (g)}, where λ>0 is a learning rate.
The semiFL method based on an NGMA technology provided in the present disclosure considers reducing total uplink transmit power consumption while meeting requirements for computation distortion tolerance of the FL users and data transmission rates of the CL users. Specifically, before each round of communication, an optimization problem and constraints are constructed with an objective of minimizing total transmit power consumption of the round. User transmit power allocation and a configuration of the STAR-RIS are jointly optimized through alternating optimization.
The objective of the present disclosure is to minimize the total transmit power consumption of each round of communication by jointly optimizing uplink power allocation for all users and the configuration of the STAR-RIS. Considering a QoS requirement of the CL users and the computation distortion tolerance of the FL users, the considered optimization problem can be expressed as follows:
Q={βmR, βmT, θmR, θmT|βmR, βmT∈{0, 1}; θmR, θmT∈[0,2π]; βmR+βmT=1} is a feasible set of refraction and reflection coefficients of the STAR-RIS. Rmin is a minimum data transmission rate for meeting the requirement of the CL users. The constraint |
Due to non-convexity of the constraints, directly solving the constructed optimization problem faces the following difficulties: First, optimization of the configuration of the STAR-RIS is more complex than that of a traditional RIS with only reflection coefficients. Second, existence of the discrete variable {βm(χ)} and other continuous variables makes the optimization problem a mixed integer programming problem. It is difficult to find optimal solutions of the highly coupled variables {pu} and {Θu} in polynomial time complexity. To effectively solve the proposed problem, the present disclosure decouples the problem into two subproblems: a power allocation subproblem and a STAR-RIS configuration subproblem. Alternating optimization is performed on the two subproblems.
Given the configuration of the STAR-RIS, the related constraints are rewritten by using the expression of the uplink communication rate of the CL user and the expression of the computation distortion MSE of the FL user, to equivalently express the power allocation subproblem in the optimization problem as follows:
ζ=2R
Σ*1 is an optimal dual variable related to the QoS constraint. Σ*2 is an optimal dual variable related to the MSE constraint.
A specific proof process for the optimal transmit power {p*n} and {p*k} is as follows:
The solution of the transmit power {pu} is composed of {pn} of the CL user and {pk} of the FL user.
First, given {pk}, the power allocation subproblem degenerates to:
Based on an analytical structure of the problem, it can be proved through reduction to absurdity that
is a valid constraint for the optimal solution that meets a minimum QoS requirement. Therefore, given {pk}, optimal transmit power of the Nth CL user can be written as follows:
Optimal transmit power of the (N−1)th CL user can be written as follows:
(e) can be derived by substituting the expression of p*N into the expression of p*N-1, with some simple algebraic operations. Similarly, optimal transmit power of the (N−2)th CL user is as follows:
In summary, a closed-form solution of {p*n} can be obtained through induction.
Then, given {pn}, {circumflex over (p)}k=√{square root over (pk)} and the power allocation subproblem can be reformulated as follows:
Σ1≥0 and Σ2≥0 represent Lagrange multipliers, and a Lagrange function of the transformed subproblem is as follows:
A dual function of the Lagrange function is as follows:
Therefore, a dual problem of the transformed subproblem is as follows:
Given the dual variables τ1 and τ2, an optimal solution
of the dual function can be obtained by finding an extreme point that satisfies
The dual function D(τ1, τ2) is obtained by substituting the optimal solution into the expression of the dual function. For the proposed dual problem, optimal Lagrange multipliers τ*1 and τ*2 are found through a subgradient descent method, that is:
is an iteration index. {tilde over (μ)}1, {tilde over (μ)}2>0 is a step, which is a constant. The optimal transmit power of the FL user can be obtained by replacing {τ1, τ2} in the expression of {circumflex over (p)}*k with the obtained optimal dual variables {τ*1, τ*2} and using p*k={circumflex over (p)}*22 to restore {p*k}.
The proof is completed.
Given the user transmit power, because an objective function in the optimization problem is independent of {Θu}, the STAR-RIS configuration subproblem is a feasibility check problem, which can be equivalently expressed as follows:
The present disclosure further defines Ru=diag{
Similarly,
are defined.
In this case, in the present disclosure, the following can be obtained:
Based on the foregoing transformation, the first three non-convex constraints in the STAR-RIS configuration subproblem can be re-expressed as follows:
Based on the foregoing approximations, the STAR-RIS configuration subproblem can be approximated as follows:
In the present disclosure, the non-convex rank-one constraint and the binary constraint in the foregoing problem can be equivalently transformed into the following form:
∥⋅∥* represents a kernel norm. ∥⋅∥2 represents a spectral norm.
Then, the present disclosure adds the foregoing two equations to the objective function of the transformed subproblem as penalty terms, to obtain the following problem:
η1 and η2 are two non-negative penalty factors that penalize the objective function if a rank of {Qu} is not 1 or {βmχ} is not binary. However, these penalty terms make the objective function of the foregoing problem non-convex. In the present disclosure, a suboptimal solution is obtained through continuous iterations by using an SCA method. Specifically, in the present disclosure, ∥Qu∥2 and (βmχ)2 are linearized by using fixed points Qu[] and βmχ[] through first-order Taylor expansion in an th iteration, that is:
Then, the non-convex penalty terms in the th iteration are replaced by solved convex upper bounds. A transformed problem is a convex SDP problem, which can be effectively solved through CVX. A penalty-based SCA algorithm proposed to solve the STAR-RIS configuration subproblem is composed of two loops: an inner loop used to iteratively solve an approximate SDP problem of the configuration subproblem and an outer loop used to continuously update the penalty factors and determine whether an iteration termination condition is met. A constraint violation is defined as follows:
The algorithm is described in detail below.
Specifically, {Qu[0]}, {βu[0]}, the penalty factors η1 and η2, corresponding scale factors ñ1 and ñ2, and preset accuracy ò1 and ò2 are initialized first. An outer iteration index 1=0 is set. The constraint violation ō[1] is computed. An inner iteration index 2=0 is set. An objective function value Etot[2] of the approximate SDP problem of the STAR-RIS configuration subproblem is computed. 2=2+1 is updated. The approximate SDP problem is solved to update Qu[2] and βu[2]. The objective function value Etot[2] is updated. The foregoing process is repeated until
or a quantity of inner iterations 2≥L2. 1=1+1, Qu[1]=Qu[2], βu[1]=βu[2], the constraint violation ō[1], η1=ñ1η1, and η2=ñ2η2 are updated. The foregoing process is repeated until 1]≤1 or 1≥L1.
The present disclosure proposes an alternating optimization algorithm to jointly optimize the user power allocation and the configuration of the STAR-RIS, to minimize the total uplink transmit power consumption.
Specifically, {pu[0]}, {Qu[0]}, {βu[0]}, and preset accuracy ò3 are initialized. A current iteration index 3=0 is set. Given {Qu[3]} and {βu[3]}, {pu[3+1]} is computed by using a closed-form expression of optimal user power allocation. Given {pu[3+1]}, {Qu[3+1]} and {βu[3+1]} are updated through the penalty-based SCA algorithm. 3=3+1 is updated. The foregoing process is repeated until a value of the objective function decreases to the preset accuracy or a preset maximum quantity L3 of iterations is reached.
Because the total transmit power decreases as a quantity of iterations increases in the alternating optimization algorithm, and there is a lower bound constraint, the proposed alternating optimization algorithm can ensure convergence. In each iteration, computational complexity of the algorithm mainly depends on the step of solving the approximate SDP problem, with complexity of O(LoLi(M2+M)3). L0=min {L1, log(1/ò1)} and Li=min{L2, log(1/ò2)} respectively represent quantities of outer and inner iterations required for the convergence of the penalty-based SCA algorithm.
Simulation results show that the method provided in the present disclosure can effectively reduce communication overheads and transmission delay in comparison with CL and can improve learning accuracy in comparison with FL.
The foregoing embodiments are used only to describe the technical solutions of the present disclosure, and are not intended to limit same. Although the present disclosure is described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still modify the technical solutions described in the foregoing embodiments, or make equivalent substitutions to some technical features therein. These modifications or substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202310012159.4 | Jan 2023 | CN | national |