SEMI-FEDERATED LEARNING METHOD BASED ON NEXT-GENERATION MULTIPLE ACCESS TECHNOLOGY

CROSS REFERENCE TO THE RELATED APPLICATIONS

This application is based upon and claims priority to Chinese Patent Application No. 202310012159.4, filed on Jan. 5, 2023, the entire contents of which are incorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to the technical field of federated learning (FL), and in particular, to a semi-federated learning (semiFL) method based on a next-generation multiple access (NGMA) technology.

BACKGROUND

With rapid development and integration of machine learning (ML) and wireless communication, massive distributed devices can generate a large amount of real-time information and multimodal data. In a large-scale wireless Internet of Things (IoT) scenario, scarce spectrum resources lead to a communication bottleneck. In addition, the massive distributed devices have different computing capabilities. These factors cause intelligent IoT that relies upon traditional ML technology to face severe challenges.

Although FL can significantly reduce communication overheads and training time of traditional centralized learning (CL), the distributed characteristic of FL compromises training accuracy of a model. In addition, an important feature of FL different from CL is that powerful computing resources of a base station (BS) are not easily used for model training in FL, user data is locally stored, and all nodes perform model training on local devices. However, an actual large-scale wireless IoT scenario cannot meet this requirement because devices have heterogeneous computing capabilities and it is difficult for devices with weak computing capabilities to cooperate with devices with strong computing capabilities to train a shared model. The foregoing limitations of IoT make existing ML paradigms (such as CL and FL) inefficient when the paradigms are directly combined with traditional communication technologies. Therefore, it is highly desirable to develop a new learning-oriented network technology for high-efficient model training in wireless IoT.

In addition, it is necessary to realize that communication and computing of devices in a system require considerable energy. IoT devices with limited battery capacity are difficult to support normal operation of a distributed system for a long time. Furthermore, some devices may be deployed in inaccessible or dangerous locations, making periodic charging of the batteries very difficult. Therefore, it is very important to design a high-efficient power control strategy to prolong a life cycle of a wireless IoT network.

SUMMARY

To resolve a problem that intelligent performance of a network edge is reduced due to heterogeneous computing capabilities of devices and limited resources in an existing intelligent IoT scenario, the present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of a global model. In addition, a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) is deployed to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users and computing-centric FL users can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of a BS, and improves accuracy of the global model. The method provided in the present disclosure also integrates a strategy for jointly optimizing user power allocation and a configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.

To achieve the foregoing objective, the present disclosure provides the following technical solutions:

The present disclosure provides a semiFL method based on an NGMA technology, including the following steps:

- S1: reporting, by users, state information to a BS, where the state information includes instantaneous channel state information (CSI) and available central processing unit (CPU) frequency state information;
- S2: after receiving the reported state information, classifying, by the BS, the users into communication-centric CL users and computing-centric FL users based on computing capabilities of local devices of the users, and broadcasting a classification result to all users after classification;
- S3: training, by each FL user, a local model through a local data set based on a global model W obtained in a previous round, and computing a local gradient g_k; and preparing, by each CL user, a local data set D_nto be uploaded to the BS;
- S4: encoding the local data set of each CL user into a communication symbol {s_n}, processing the gradient of the local model trained by each FL user into a computation symbol {s_k}, and sending, by all users, the information bearing symbols of the users to the BS by using the NGMA technology in combination with a STAR-RIS;
- S5: receiving, by the BS, a superimposed signal from the two types of users, decoding the local data sets from the CL users to perform centralized training and obtain an average gradient, aggregating the gradients of the local models from the FL users, and aggregating a global model by using the obtained gradients:
- S6: after each round of communication is completed, broadcasting, by the BS, the latest global model w∈^Qto all FL users for gradient computation in a next round; and
- S7: repeating the foregoing steps until convergence or a maximum quantity of rounds of communication is reached.

Further, the NGMA technology in S4 provides services for all users in a same frequency band in a non-orthogonal manner such that all users are capable of communicating in parallel on a same time-frequency resource.

Further, the STAR-RIS deployed in S4 modifies an amplitude and a phase of an incident signal to reshape a wireless transmission environment and adjust channel gains of different users.

Further, S5 specifically includes:

- S51: detecting, by the BS, communication output {s_n} of each CL user through successive interference cancellation (SIC), decoding the communication output to generate training samples {D_n} for CL, and training the model through a gradient descent method to obtain the average gradient g∈^Qof the CL users as follows:

$CL : \overline{g} = \frac{1}{N} \sum_{n \in N} g_{n} = \frac{1}{N} \sum_{n \in N} ▽ F_{n} (w; D_{n})$

- where N is a quantity of the CL users, g_n=∇F_n(w;D_n)∈^Qrepresents a gradient of the n^thCL user computed by the BS,

$F_{n} (w; D_{n}) = \frac{1}{❘ D_{n} ❘} \sum_{i = 1}^{❘ D_{n} ❘} f (w; D_{n}^{(i)})$

- is an objective function used to train a model parameter w∈^Q, and ƒ(w;D_n⁽ⁱ⁾) is a loss function of the model with respect to the i^thsample D_n⁽ⁱ⁾of the n^thCL user;
- S52: assuming that all symbols from the CL users are successfully decoded in S51, subtracting, by the BS, signals of the CL users from the received superimposed signal to obtain a residual signal

$\hat{y} = \sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} s_{k} + z_{0}$

- that contains signals of the FL users, performing averaging on the residual signal that contains the signals of the FL users, restoring the local gradients {g_k} of the FL users from the computation symbols {s_k}, and finally obtaining an estimated average gradient of the FL users as follows:

$FL : \hat{g} = \frac{\hat{y}}{K} = \frac{1}{K} (\sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} g_{k} + z_{0})$

- where K is a quantity of the FL users, z₀˜ N (0,σ²I)∈^Qis a noise vector at the BS, and σ²is noise power;
- S53: after obtaining the gradients {g,ĝ}, updating, by the BS, a global gradient as follows:

$SemiFL : \tilde{g} = \frac{N}{N + K} \overline{g} + \frac{K}{N + K} \hat{g}$

- updating the global model by using w:=w−λ{tilde over (g)}, where λ>0 is a learning rate.

Further, before each round of communication, user transmit power allocation and a configuration of the STAR-RIS are jointly optimized with an objective of minimizing total transmit power consumption of the round, and an optimization problem and constraints are constructed as follows:

$s . t .$

$\min_{{p_{u}}, {Θ_{u}}} \sum_{u = 1}^{N + K} p_{u}$

${❘ {\overline{h}}_{1} ❘}^{2} \geq \dots {❘ {\overline{h}}_{N} ❘}^{2} \geq {❘ {\overline{h}}_{k} ❘}^{2}, \forall k \in K,$

$R_{n} ({p_{u}}, {Θ_{u}}) \geq R_{\min}, \forall n \in N,$

$MSE ({p_{k}}, {Θ_{k}}) \leq E_{0},$

$p_{u} \geq 0, Θ_{u} \in Q, \forall u \in U,$

- where U=N∪K is a set of all users, N≙32 {1, 2, . . . , N} is a set of the CL users. K≙{N+1, N+2, . . . , N+K} is a set of the FL users, p_uis transmit power of the u^thuser, Θ_uis a coefficient matrix of the STAR-RIS of the u^thuser, h_uis a joint channel of the BS, the STAR-RIS and the user, Q={β_m^R, β_m^T, θ_m^R, θ_m^T|β_m^R, β_m^T∈{0, 1}; θ_m^R, θ_m^T∈[0,2π]; β_m^R+β_m^T=1} is a feasible set of refraction and reflection coefficients of the STAR-RIS, β_m^χ∈{0,1} and θ_m^χ∈[0,2π] are respectively an amplitude and a phase shift of an m^thelement in χ∈{R, T} mode, R_minis a minimum data transmission rate for meeting a quality of service (Qos) requirement of the CL users, E₀is a maximum computation distortion that the FL users can tolerate, R_n({p_u}, {Θ_u}) is a data transmission rate of the n^thCL user, and MSE({p_k}, {Θ_k}) is a computation distortion of the k^thFL user.

Further, the optimization problem is decoupled into two subproblems. Alternating optimization is performed on the transmit power {p_n} of the user and the configuration {Θ_u} of the STAR-RIS of the user.

Further, during the alternating optimization, when {Θ_u} is fixed, for the {p_u} subproblem, the constraints are rewritten by using an uplink communication rate expression

$R_{n} = B \log_{2} (1 + \frac{{❘ {\overline{h}}_{n} ❘}^{2} p_{n}}{\sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} + σ^{2}}),$

$\forall n \in N$

of the CL user and a computation distortion expression

$MSE \hat{=} E [{❘ \hat{s} - \frac{1}{K} \sum_{k \in K} s_{k} ❘}^{2}] = \frac{1}{K^{2}} (\sum_{k \in K} {❘ {\overline{h}}_{k} \sqrt{p_{k}} - 1 ❘}^{2} + σ^{2})$

of the FL user, to equivalently express the user power allocation subproblem.

For a transformed expression, power allocation {p_k} of the FL users is fixed, and the following closed-form expression of optimal power allocation {p*_n} of the CL users is derived through mathematical induction:

$p_{n}^{*} = {ζ (ζ + 1)}^{N - n} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k}^{*} + σ^{2}) {❘ {\overline{h}}_{n} ❘}^{- 2}$

Power allocation {p_n} of the CL users is fixed, {circumflex over (p)}_k=√{square root over (p_k)}, the optimization problem is reorganized, and the following closed-form expression of optimal power allocation {p*_k} of the FL users is obtained through a Lagrange duality method:

$p_{k}^{*} = τ_{2}^{* 2} {❘ {\overline{h}}_{k} ❘}^{2} {(1 + τ_{1}^{*} ζ {❘ {\overline{h}}_{k} ❘}^{2} + τ_{2}^{*} {❘ {\overline{h}}_{k} ❘}^{2})}^{- 2}$

- where τ*₁is an optimal dual variable related to the QoS constraint, and τ*₂is an optimal dual variable related to the MSE constraint.

Further, during the alternating optimization, when {p_n} is fixed, the {Θ_n} subproblem is a feasibility check problem and is expressed as follows:

$find {Θ_{u}}$

$s . t . {❘ {\overline{h}}_{1} ❘}^{2} \geq \dots {❘ {\overline{h}}_{N} ❘}^{2} \geq {❘ {\overline{h}}_{k} ❘}^{2},$

$\forall k \in K,$

$R_{n} ({p_{u}}, {Θ_{u}}) \geq R_{\min},$

$\forall n \in N,$

$MSE ({p_{k}}, {Θ_{k}}) \leq E_{0},$

$Θ_{u} \in Q,$

$\forall u \in U,$

$R_{u} = diag {{\overline{r}}^{H}} r_{u},$

${\overline{R}}_{u} = [\begin{matrix} R_{u} R_{u}^{H} & R_{u} h_{u}^{H} \\ h_{u} R_{u}^{H} & 0 \end{matrix}],$

${\overline{q}}_{u} = [\begin{matrix} q_{u} \\ 1 \end{matrix}],$

$and$

$Q_{u} = {\overline{q}}_{u} {\overline{q}}_{u}^{H}$

are introduced. A joint uplink channel coefficient is rewritten. The subproblem is further expressed. Q_u±0, Diag(Q_n)=β_u, a non-convex rank-one constraint rank(Q_u)=1 exists, and a transformed expression also has a binary variable.

∥Q_u∥₀−∥Q_u∥₂=0, ∀u∈U and β_m^χ−(β_m^χ)²=0, ∀_χ∈{R, T}, ∀m∈M are introduced to transform the non-convex rank-one constraint and the binary variable into penalty terms in an objective function. Because the penalty terms are non-convex, convex upper bounds of the penalty terms are obtained through first-order Taylor expansion in an custom-character ^thiteration as follows:

${ Q_{u} }_{*} - { Q_{u} }_{2} \leq { Q_{u} }_{*} - {{ Q_{u} [ℓ] }_{2} + tr [{\overline{q}}_{\max} [ℓ] {({\overline{q}}_{\max} [ℓ])}^{H} (Q_{u} - Q_{u} [ℓ])]},$

$β_{m}^{χ} - {(β_{m}^{χ})}^{2} \leq β_{m}^{χ} - [{(β_{m}^{χ} [ℓ])}^{2} + 2 β_{m}^{χ} [ℓ] (β_{m}^{χ} - β_{m}^{χ} [ℓ])]$

The convex upper bounds are introduced to the objective function as penalty functions to obtain a convex semidefinite programming (SDP) problem.

Further, solving the convex SDP problem includes: continuously updating penalty factors η₁and η₂of the penalty terms, and solving the SDP problem through an iterative method until the penalty terms satisfy a predefined maximum violation or a predefined maximum quantity of outer iterations is reached.

Further, performing alternating optimization on the user power allocation subproblem and the STAR-RIS configuration subproblem specifically includes: initializing {p_u[0]}, {Q_u[0]}, {β_u[0]}, and preset accuracy Ò₃; and setting a current iteration index custom-character ₃=0, given {Q_u[₃]} and {β_u[₃]}, computing {p_u[₃+1]} by using a closed-form expression of optimal user power allocation, given {p_u[₃+1]}, updating {Q_u[₃+1]} and {B_u[+1]} through a penalty-based successive convex approximation (SCA) method, updating ₃=₃+1, and repeating the foregoing process until a value of an objective function decreases to the preset accuracy or a preset maximum quantity L₃of iterations is reached.

Compared with the prior art, the present disclosure has the following beneficial effects:

The present disclosure provides the semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of the global model. During uplink transmission, channel conditions of users are flexibly adjusted through the STAR-RIS to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users with weak local computing capabilities and computing-centric FL users with strong local computing capabilities can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of the BS, and improves accuracy of the global model.

In a large-scale wireless IoT scenario, an underlying device may face a dilemma of limited battery capacity and inconvenient periodic charging, which seriously affects a life cycle of an entire system. To resolve this problem, the present disclosure constructs a mixed integer non-linear programming problem for jointly optimizing power allocation and the configuration of the STAR-RIS with an objective of minimizing user transmit power consumption. The proposed non-convex optimization problem can be decoupled into two subproblems. For the user power allocation subproblem, the closed-form expressions of the optimal power allocation can be derived through mathematical induction and the Lagrange duality method. For the STAR-RIS configuration subproblem, the original feasibility check problem can be transformed into the convex SDP problem through a penalty function method and the SCA method, and the SDP problem is solved through CVX. In summary, the method provided in the present disclosure integrates a strategy for jointly optimizing the user power allocation and the configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.

BRIEF DESCRIPTION OF THE DRAWINGS

To describe the technical solutions in embodiments of the present application or in the prior art more clearly, the following briefly describes the accompanying drawings required for describing the embodiments. Apparently, the accompanying drawings in the following description show merely some embodiments of the present disclosure, and persons of ordinary skill in the art may still derive other accompanying drawings from these accompanying drawings.

FIG. 1 is a flowchart of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure;

FIG. 2 is an architectural diagram of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure;

FIG. 3 is an application structural diagram of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure; and

FIG. 4 is a principle diagram of a STAR-RIS-assisted NGMA technology according to an embodiment of the present disclosure.

DETAILED DESCRIPTION OF THE EMBODIMENTS

The present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities in a large-scale wireless IoT scenario can also participate in training of a global model. During uplink transmission, a channel environment of a user is flexibly adjusted through a STAR-RIS such that CL users with weak computing capabilities and FL users with strong computing capabilities can communicate in parallel on a shared time-frequency resource. Then, the present disclosure studies how to minimize total user transmit power consumption, constructs a mixed integer non-linear programming problem for jointly optimizing user power allocation and a configuration of the STAR-RIS, and obtains an optimal suboptimal solution through alternating optimization. Specifically, for the user power allocation subproblem, closed-form expressions of optimal power are obtained through mathematical induction and a Lagrange duality method. For the STAR-RIS configuration subproblem, a feasibility check problem is transformed into a convex SDP problem through a penalty function method and an SCA method.

To better understand the technical solutions, the following describes in detail a method in the present disclosure with reference to the accompanying drawings. Apparently, the described embodiments are merely some rather than all of the embodiments of the present disclosure. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.

Referring to FIG. 1 and FIG. 2, in a semiFL method based on an NGMA technology provided in the present disclosure, all users participate in training of a global model and can transmit data in parallel on a shared time-frequency resource. The method includes the following steps:

- S1: Before each round of communication starts, state information such as instantaneous CSI and available CPU frequency state information is estimated and reported to a BS by all local users.
- S2: After the reported state information is received, the users are classified by the BS into communication-centric CL users and computing-centric FL users based on computing capabilities of local devices of the users, and a classification result is broadcast to all users after classification.

A STAR-RIS-assisted wireless network that supports collaborative learning for heterogeneous users is considered, as shown in FIG. 3. A heterogeneous user set U=N∪K is divided into communication-centric CL users N=≙{1, 2, . . . , N} with weak computing capabilities and computing-centric FL users K ≙{N+1, N+2, . . . , N+K} with strong computing capabilities based on different computing capabilities of the local devices of the users. There are M passive reflective/refractive elements in a STAR-RIS. Each element relays (refracts or reflects) an incident signal to a desired direction. To avoid energy leakage and facilitate synchronous communication of all users, the STAR-RIS uses a mode switching protocol to select at least one of the elements to work in total refraction mode (namely, T mode) and the other elements to work in total reflection mode (namely, R mode). q_χ=√{square root over ([β₁^χ)} e^jθ¹^χ, √{square root over (β₂^χ)} e^jθ²^χ, . . . , √{square root over (β_M^χ)} e^jθ^M^χ]^Hrepresents a reflection (χ=R) and refraction (χ=T) vector. β_m^χ∈{0,1} and θ_m^χ∈[0,2π] respectively represent an amplitude and a phase shift of the m^thelement in χ∈{R, T} mode. Because each element can work in only one mode in a specific timeslot, a constraint on mode switching is β_m^R+β_m^T=1, ∀m∈M≙{1, 2, . . . , M}.

A channel gain of a wireless link can be obtained by multiplying a path loss by small-scale fading. Specifically, a link distance from the STAR-RIS to the BS and the user is denoted as Λ_u, ∀u∈Ū≙{0}∪U. u=0) represents the BS. u∈U represents the user. Large-scale fading is denoted as L_u=ζ₀(Λ_u)^−α, ∀u∈Ū≙{0}∪U. ζ₀is a path loss at a reference distance of 1 m. α≥2 is a path loss index. For small-scale fading, the present disclosure assumes that a channel between the user and the BS is subject to Rayleigh fading due to blocking and extensive scattering. Because the STAR-RIS is deployed at height, it can be assumed that all RIS-related links are subject to Ricean fading. Therefore, the present disclosure can represent a channel coefficient of all RIS-related links as follows:

$r_{u} = \sqrt{\frac{ς_{0}}{Λ_{u}^{α}}} (\sqrt{\frac{κ}{κ + 1}} r_{u}^{LoS} + \sqrt{\frac{1}{κ + 1}} r_{u}^{NLoS}),$

$\forall u \in \overline{U}$

κ is a Ricean factor. r_u^LoS∈ custom-character ^Mis a deterministic line-of-sight channel component. r_u^NLoS∈^Mis a Rayleigh fading channel component. The STAR-RIS divides effective coverage of the BS into refraction space and reflection space. In a system in which the STAR-RIS is deployed, a joint channel coefficient of an up link from the u^thuser to the BS is as follows:

${\overline{h}}_{u} = ? + r_{0}^{H} Θ_{u} r_{u},$

$\forall u \in U$

$? indicates text missing or illegible when filed$

h_u∈ custom-character represents a direct link from the u^thuser to the BS. r₀^HΘ_ur_urepresents a double fading reflection/refraction link provided by the STAR-RIS. Specifically, if the u^thuser is located in the reflection space, Θ_u=diag(q_R). If the u^thuser is located in the refraction space, Θ_u=diag(q_T). The present disclosure assumes that instantaneous CSI of all channels is available at the BS.

- S3: A local gradient g_kis computed by each FL user based on a model W obtained in a previous round, and a local data set D_nto be uploaded to the BS is prepared by each CL user.
- S4: The local data set of each CL user is encoded into a communication symbol {s_n}, the gradient of the local model trained by each FL user is processed into a computation symbol {s_k}, and the information bearing symbols of all users are sent to the BS by using an NGMA technology.

Referring to FIG. 4, the present disclosure designs an NGMA technology integrating a non-orthogonal multiple access (NOMA) technology and an AirComp technology by utilizing a superimposition characteristic of signals in a wireless channel. Wireless access services are provided for CL and FL users through a STAR-RIS-assisted multiple access channel such that uplink communication of an original data set of the CL user and over-the-air computation of a model parameter (such as gradient information) of the FL user are multiplexed in a same frequency band.

In each round of communication, the CL user maps the local data set {d_n} to the communication symbol {s_n} and the FL user maps and the gradient {g_k} to the computation symbol {s_k} first. The NGMA technology provides services for all users in a same frequency band in a non-orthogonal manner such that the users can concurrently perform uplink communication. The communication symbol {s_n} is transmitted through a power domain NOMA technology. The computation symbol {s_k} is transmitted through the AirComp technology. Therefore, a superimposed signal received by the BS is as follows:

$y = \underset{CL users}{\underset{︸}{\sum_{n = 1}^{N} {\overline{h}}_{n} \sqrt{p_{n}} s_{n}}} + \underset{FL users}{\underset{︸}{\sum_{k = N + 1}^{N + K} {\overline{h}}_{k} \sqrt{p_{k}} s_{k}}} + \underset{Noise}{\underset{︸}{z_{0}}}$

p_n(p_k) is transmit power of the n^th(k^th) user. z₀˜CN (0,σ²) is additive white Gaussian noise (AWGN) at the BS. The present disclosure assumes that all symbols {s_u}={s_n}∪{s_k} are statistically independent and have zero mean and normalized variance, that is, E[s_u²]=1, ∀u∈U and E[s_u^Hs_ν]=0, ∀u≠#ν∈U.

With the help of the STAR-RIS, it can be ensured that parallel communication between the two groups of users is smooth even if some direct links between the BS and the users are blocked. The CL users and the FL users have different communication objectives and performance indicators. For example, the communication-centric CL users expect that the local data sets sent to the BS can be perfectly decoded by the BS with an objective of maximizing a data transmission rate. The computing-centric FL users expect that local model parameters sent to the BS can be carefully aggregated by the BS with an objective of minimizing a computation distortion MSE. Considering the plurality of objectives involved in NGMA, the present disclosure expects to improve throughput of the CL users by suppressing interference, while completing weighting computation of the model parameters of the FL users by utilizing the superimposition characteristic of channels. Therefore, there is a need to design a collaborative transmission mechanism with an efficient interference management capability in such a joint communication and learning framework.

Heterogeneous users are classified into strong users and weak users based on different path losses. The BS separates the superimposed signal through SIC to perfectly decode a signal from the strong users. The strong signal is modulated and subtracted from the received superimposed signal V to obtain a signal from the weak users. This leaves the weak signal for collaborative over-the-air computation. To implement the foregoing process, the present disclosure needs to reshape a wireless transmission environment by modifying an amplitude and a phase of an incident signal through the STAR-RIS such that channel gains of the two groups of users are adjusted in the following order:

$\underset{Strong users}{\underset{︸}{{❘ {\overline{h}}_{1} ❘}^{2} \geq {❘ {\overline{h}}_{2} ❘}^{2} \geq \dots \geq {❘ {\overline{h}}_{N} ❘}^{2}}} \geq \underset{Weak users}{\underset{︸}{{❘ {\overline{h}}_{k} ❘}^{2}, \forall k \in K}}$

Specifically, a channel coefficient is adjusted through the STAR-RIS to arrange all CL users as strong users for communication message decoding and all FL users as weak users for model aggregation. Based on the superimposed signal and a decoding order, an uplink communication rate of the n^thCL user is as follows:

$R_{n} = B \log_{2} (1 + \frac{{❘ {\overline{h}}_{n} ❘}^{2} p_{n}}{\sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} + σ^{2}}),$

$\forall n \in N$

B is available bandwidth of the system.

The communication symbols {s_n} sent by the CL users may be decoded and modulated by the BS, and subtracted from the superimposed signal y received by the BS to obtain a residual signal

$\hat{y} = \sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} s_{k} + z_{0}$

that contains gradient information of the models of the FL users. For aggregation of the FL models, linear computation output estimated by the BS is as follows:

$\hat{s} = \frac{\hat{y}}{K} = \frac{1}{K} (\sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} s_{k} + z_{0})$

A computation distortion may be quantified by a mean square error (MSE) as follows:

$MSE \hat{=} E [{❘ \hat{s} - \frac{1}{K} \sum_{k \in K} s_{k} ❘}^{2}] = \frac{1}{K^{2}} (\sum_{k \in K} {❘ {\overline{h}}_{k} \sqrt{p_{k}} - 1 ❘}^{2} + σ^{2})$

- S5: The superimposed signal from the two groups of users is received by the BS, the local data sets from the CL users are decoded to perform centralized training and obtain an average gradient, the gradients of the local models from the FL users are aggregated, and a global model is aggregated by using the obtained gradients.

Specifically, referring to FIG. 4, the BS first detects communication output {s_n} of each CL user through SIC, decodes the communication output to generate training samples {D_n} for CL, and trains the model through a gradient descent method to obtain the average gradient g∈ custom-character ^Qof the CL users as follows:

$CL : \bar{g} = \frac{1}{N} \sum_{n \in N} g_{n} = \frac{1}{N} \sum_{n \in N} \nabla F_{n} (w; D_{n})$

- where N is a quantity of the CL users, g_n=∇F_n(w;D_n)∈^Qrepresents a gradient of the n^thCL user computed by the BS,

$F_{n} (w; D_{n}) = \frac{1}{❘ D_{n} ❘} \sum_{i = 1}^{❘ D_{n} ❘} f (w; D_{n}^{(i)})$

is an objective function used to train a model parameter w∈ custom-character ^Q, and ƒ(w;D_n⁽ⁱ⁾) is a loss function of the model with respect to the i^thsample D_n⁽ⁱ⁾of the n^thCL user.

Then, the BS subtracts the decoded information symbols of the CL users from the received superimposed signal to obtain the residual signal

$\hat{y} = \sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} s_{k} + z_{0}$

that contains model information of the FL users, performs averaging on the residual signal, restores the local gradients {g_k} of the FL users from the computation symbols {s_k}, and finally obtains an estimated average gradient of the FL users as follows:

$FL : \hat{g} = \frac{\hat{y}}{K} = \frac{1}{K} (\sum_{k \in K} {\overline{h}}_{k} \sqrt{p_{k}} g_{k} + z_{0})$

- where K is a quantity of the FL users, z₀˜N (0,σ²I)∈^Qis a noise vector at the BS, and σ²is noise power.

Finally, after obtaining the gradients (g,ĝ), the BS updates a global gradient as follows:

$SemiFL : \tilde{g} = \frac{N}{N + K} \bar{g} + \frac{K}{N + K} \hat{g}$

The global model is updated by using w:=w−λ{tilde over (g)}, where λ>0 is a learning rate.

- S6: After each round of communication is completed, the latest global model w∈^Qis broadcast by the BS to all FL users for gradient computation in a next round.
- S7: The foregoing steps are repeated until convergence or a maximum quantity of rounds of communication is reached.

The semiFL method based on an NGMA technology provided in the present disclosure considers reducing total uplink transmit power consumption while meeting requirements for computation distortion tolerance of the FL users and data transmission rates of the CL users. Specifically, before each round of communication, an optimization problem and constraints are constructed with an objective of minimizing total transmit power consumption of the round. User transmit power allocation and a configuration of the STAR-RIS are jointly optimized through alternating optimization.

The objective of the present disclosure is to minimize the total transmit power consumption of each round of communication by jointly optimizing uplink power allocation for all users and the configuration of the STAR-RIS. Considering a QoS requirement of the CL users and the computation distortion tolerance of the FL users, the considered optimization problem can be expressed as follows:

$\min_{{p_{u}}, {Θ_{u}}} \sum_{u = 1}^{N + K} p_{u}$

$s . t . {❘ {\overline{h}}_{1} ❘}^{2} \geq \dots {❘ {\overline{h}}_{N} ❘}^{2} \geq {❘ {\overline{h}}_{k} ❘}^{2}, \forall k \in K,$

$R_{n} ({p_{u}}, {Θ_{u}}) \geq R_{\min}, \forall n \in N,$

$MSE ({p_{k}}, {Θ_{k}}) \leq E_{0},$

$p_{u} \geq 0, Θ_{u} \in Q, \forall u \in U,$

Q={β_m^R, β_m^T, θ_m^R, θ_m^T|β_m^R, β_m^T∈{0, 1}; θ_m^R, θ_m^T∈[0,2π]; β_m^R+β_m^T=1} is a feasible set of refraction and reflection coefficients of the STAR-RIS. R_minis a minimum data transmission rate for meeting the requirement of the CL users. The constraint |h₁|²≥ . . . |h_N|²≥|h_k|², ∀k∈K represents a decoding order that ensures successful separation of the communication symbols and the computation symbols. The constraint R_n({p_u}, {Θ_u})≥R_min, ∀n∈N is the QoS requirement of the CL users. The constraint MSE({p_k}, {Θ_k})≤ E₀ensures that the computation distortion of the FL users does not exceed E₀<1/K.

Due to non-convexity of the constraints, directly solving the constructed optimization problem faces the following difficulties: First, optimization of the configuration of the STAR-RIS is more complex than that of a traditional RIS with only reflection coefficients. Second, existence of the discrete variable {β_m^(χ)} and other continuous variables makes the optimization problem a mixed integer programming problem. It is difficult to find optimal solutions of the highly coupled variables {p_u} and {Θ_u} in polynomial time complexity. To effectively solve the proposed problem, the present disclosure decouples the problem into two subproblems: a power allocation subproblem and a STAR-RIS configuration subproblem. Alternating optimization is performed on the two subproblems.

Given the configuration of the STAR-RIS, the related constraints are rewritten by using the expression of the uplink communication rate of the CL user and the expression of the computation distortion MSE of the FL user, to equivalently express the power allocation subproblem in the optimization problem as follows:

$\min_{{p_{u} \geq 0}} \sum_{u = 1}^{N + K} p_{u}$

$s . t . {❘ {\overline{h}}_{n} ❘}^{2} p_{n} \geq ζ (\sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} + σ^{2}), \forall n \in N,$

$\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} \sqrt{p_{k}} - 1 ❘}^{2} + σ^{2} \leq E_{0} K^{2},$

ζ=2^R^min^/B−1 is a constant. In the present disclosure, optimal solutions of the problem are obtained by using an analytical structure and a Lagrange duality method. Finally obtained closed-form expressions of optimal transmit power {p*_n} of the CL user and optimal transmit power {p*_k} of the FL user are as follows:

$p_{n}^{*} = {ζ (ζ + 1)}^{N - n} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k}^{*} + σ^{2}) {❘ {\overline{h}}_{n} ❘}^{- 2},$

$p_{k}^{*} = τ_{2}^{* 2} {❘ {\overline{h}}_{k} ❘}^{2} {(1 + τ_{1}^{*} ζ {❘ {\overline{h}}_{k} ❘}^{2} + τ_{2}^{*} {❘ {\overline{h}}_{k} ❘}^{2})}^{- 2} .$

Σ*₁is an optimal dual variable related to the QoS constraint. Σ*₂is an optimal dual variable related to the MSE constraint.

A specific proof process for the optimal transmit power {p*_n} and {p*_k} is as follows:

The solution of the transmit power {p_u} is composed of {p_n} of the CL user and {p_k} of the FL user.

First, given {p_k}, the power allocation subproblem degenerates to:

$\min_{{p_{u} \geq 0}} \sum_{u = 1}^{N + K} p_{n}$

$s . t . {❘ {\overline{h}}_{n} ❘}^{2} p_{n} \geq ζ (\sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} + σ^{2}), \forall n \in N,$

Based on an analytical structure of the problem, it can be proved through reduction to absurdity that

${❘ {\overline{h}}_{n} ❘}^{2} p_{n} \geq ζ (\sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} + σ^{2}), \forall n \in N$

is a valid constraint for the optimal solution that meets a minimum QoS requirement. Therefore, given {p_k}, optimal transmit power of the N^thCL user can be written as follows:

$p_{N}^{*} = \frac{ζ}{{❘ {\overline{h}}_{N} ❘}^{2}} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k} + σ^{2})$

Optimal transmit power of the (N−1)^thCL user can be written as follows:

$\begin{matrix} p_{N - 1}^{*} & = \frac{ζ}{{❘ {\overline{h}}_{N - 1} ❘}^{2}} ({❘ {\overline{h}}_{N} ❘}^{2} p_{N}^{*} + \sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k} + σ^{2}) \\ \overset{(e)}{=} \frac{ζ (ζ + 1)}{{❘ {\overline{h}}_{N - 1} ❘}^{2}} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k} + σ^{2}) \end{matrix}$

(e) can be derived by substituting the expression of p*_Ninto the expression of p*_N-1, with some simple algebraic operations. Similarly, optimal transmit power of the (N−2)^thCL user is as follows:

$p_{N - 2}^{*} = \frac{{ζ (ζ + 1)}^{2}}{{❘ {\overline{h}}_{N - 2} ❘}^{2}} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} p_{k} + σ^{2})$

In summary, a closed-form solution of {p*_n} can be obtained through induction.

Then, given {p_n}, {circumflex over (p)}_k=√{square root over (p_k)} and the power allocation subproblem can be reformulated as follows:

$\min_{{{\hat{p}}_{k} \geq 0}} \sum_{k = N + 1}^{N + K} {\hat{p}}_{k}^{2}$

$s . t . ζ \sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} {\hat{p}}_{k}^{2} \leq I_{\min},$

$\sum_{k = N + 1}^{N + K} {❘ \bar{h_{k}} {\hat{p}}_{κ} - 1 ❘}^{2} + σ^{2} \leq E_{0} K^{2},$

$where I_{\min} = \min_{n \in N} {{❘ {\overline{h}}_{n} ❘}^{2} p_{n} - ζ \sum_{u = n + 1}^{N + K} {❘ {\overline{h}}_{u} ❘}^{2} p_{u} - ζ σ^{2}} .$

Σ₁≥0 and Σ₂≥0 represent Lagrange multipliers, and a Lagrange function of the transformed subproblem is as follows:

$L ({{\hat{p}}_{k}}, τ_{1}, τ_{2}) = \sum_{k = N + 1}^{N + K} {\hat{p}}_{k}^{2} + τ_{1} (ζ \sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} ❘}^{2} {\hat{p}}_{k}^{2} - I_{\min}) + τ_{2} (\sum_{k = N + 1}^{N + K} {❘ {\overline{h}}_{k} {\hat{p}}_{k} - 1 ❘}^{2} + σ^{2} - E_{0} K^{2})$

A dual function of the Lagrange function is as follows:

$D (τ_{1}, τ_{2}) = \min_{{{\hat{p}}_{k} \geq 0}} L ({{\hat{p}}_{k}}, τ_{1}, τ_{2})$

Therefore, a dual problem of the transformed subproblem is as follows:

$\max_{τ_{1}, τ_{2} \geq 0} D (τ_{1}, τ_{2})$

Given the dual variables τ₁and τ₂, an optimal solution

${\hat{p}}_{k}^{*} = \frac{τ_{2} ❘ {\overline{h}}_{k} ❘}{1 + τ_{1} ζ {❘ {\overline{h}}_{k} ❘}^{2} + τ_{2} {❘ {\overline{h}}_{k} ❘}^{2}}$

of the dual function can be obtained by finding an extreme point that satisfies

$\frac{\partial L}{\partial {\hat{p}}_{k}^{*}} = 2 {\hat{p}}_{k}^{*} + 2 τ_{1} ζ {❘ {\overline{h}}_{k} ❘}^{2} {\hat{p}}_{k}^{*} + 2 τ_{2} ❘ {\overline{h}}_{k} {\hat{p}}_{k} - 1 ❘ ❘ {\overline{h}}_{k} ❘ = 0.$

The dual function D(τ₁, τ₂) is obtained by substituting the optimal solution into the expression of the dual function. For the proposed dual problem, optimal Lagrange multipliers τ*₁and τ*₂are found through a subgradient descent method, that is:

$τ_{1} [ℓ + 1] = {[τ_{1} [ℓ] + {\tilde{μ}}_{1} (ζ \sum_{k \in K} {❘ {\overline{h}}_{k} ❘}^{2} {\hat{p}}_{k}^{* 2} - I_{\min})]}^{+}$

$τ_{2} [ℓ + 1] = {[τ_{2} [ℓ] + {\tilde{μ}}_{2} (\sum_{k \in K} {❘ {\overline{h}}_{k} {\hat{p}}_{k}^{*} - 1 ❘}^{2} + σ^{2} - E_{0} K^{2})]}^{+}$

custom-character is an iteration index. {tilde over (μ)}₁, {tilde over (μ)}₂>0 is a step, which is a constant. The optimal transmit power of the FL user can be obtained by replacing {τ₁, τ₂} in the expression of {circumflex over (p)}*_kwith the obtained optimal dual variables {τ*₁, τ*₂} and using p*_k={circumflex over (p)}*₂²to restore {p*_k}.

The proof is completed.

Given the user transmit power, because an objective function in the optimization problem is independent of {Θ_u}, the STAR-RIS configuration subproblem is a feasibility check problem, which can be equivalently expressed as follows:

$find {Θ_{u}}$

$s . t . {❘ {\overline{h}}_{1} ❘}^{2} \geq \dots {❘ {\overline{h}}_{N} ❘}^{2} \geq {❘ {\overline{h}}_{k} ❘}^{2}, \forall k \in K,$

$R_{n} ({p_{u}}, {Θ_{u}}) \geq R_{\min}, \forall n \in N,$

$MSE ({p_{k}}, {Θ_{k}}) \leq E_{0},$

$Θ_{u} \in Q, \forall u \in U,$

The present disclosure further defines R_u=diag{r^H}r_u. In this case, r^HΘ_ur_u=q_u^HR_u. If the u^thuser is located in the reflection space, q_u=q_R. If the u^thuser is located in the refraction space. q_u=q_T. Correspondingly, in the present disclosure, the following can be obtained:

$\begin{matrix} {❘ {\overline{h}}_{u} ❘}^{2} & = {❘ h_{u} + {\bar{r}}^{H} Θ_{u} r_{u} ❘}^{2} = ❘ h_{u} + q_{u}^{H} R_{u} |^{2} \\ = q_{u}^{H} R_{u} R_{u}^{H} q_{u} + q_{u}^{H} R_{u} h_{u}^{H} + h_{u} R_{u}^{H} q_{u} + {❘ h_{u} ❘}^{2} \\ = {\overline{q}}_{u}^{H} {\overline{R}}_{u} {\overline{q}}_{u} + {❘ h_{u} ❘}^{2} \end{matrix}$

- where

${\bar{R}}_{u} = [\begin{matrix} R_{u} R_{u}^{H} & R_{u} h_{u}^{H} \\ h_{u} R_{u}^{H} & 0 \end{matrix}], {\overline{q}}_{u} = [\begin{matrix} q_{u} \\ 1 \end{matrix}] .$

- It can be found that q_u^HR_uq_u=tr(R_uq_uq_u^H). Q_u=q_uq_u^His defined, and Q_u±0, rank(Q_u)=1, and Diag(Q_u)=β_uneed to be satisfied. The vector Diag(Q_u) represents an element extracted from a main diagonal of a matrix Q_u. If the u^thuser is located in the reflection space, β_u=β_R=[β₁^R, β₂^R, . . . , β_M^R]*. Otherwise. β_u=β_T=[β₁^T, β₂^T, . . . , β_M^T]*. Next, a joint uplink channel coefficient may be further organized as follows:

${❘ {\overline{h}}_{u} ❘}^{2} = tr ({\overline{R}}_{u} {\overline{q}}_{u} {\bar{q}}_{u}^{H}) + {❘ h_{u} ❘}^{2} = tr ({\overline{R}}_{u} Q_{u}) + {❘ h_{u} ❘}^{2}$

Similarly,

${\overset{◦}{R}}_{k} = R_{k} \sqrt{p_{k}}, {\hat{h}}_{k} = h_{k} \sqrt{p_{k}} - 1, and {\hat{R}}_{k} = [\begin{matrix} {\overset{◦}{R}}_{k} {\overset{◦}{R}}_{k}^{H} & {\overset{◦}{R}}_{k} {\hat{h}}_{k}^{H} \\ {\hat{h}}_{k} {\overset{◦}{R}}_{k}^{H} & 0 \end{matrix}]$

are defined.

In this case, in the present disclosure, the following can be obtained:

${❘ {\overline{h}}_{k} \sqrt{p_{k}} - 1 ❘}^{2} = tr ({\hat{R}}_{k} Q_{k}) + {❘ {\hat{h}}_{k} ❘}^{2}$

Based on the foregoing transformation, the first three non-convex constraints in the STAR-RIS configuration subproblem can be re-expressed as follows:

$tr ({\bar{R}}_{1} Q_{1}) + {❘ h_{1} ❘}^{2} \geq \dots \geq tr ({\bar{R}}_{N} Q_{N}) + {❘ h_{N} ❘}^{2}$

$\geq tr ({\bar{R}}_{k} Q_{k}) + {❘ h_{k} ❘}^{2}, \forall k \in K,$

$[tr ({\bar{R}}_{n} Q_{n}) + {❘ h_{n} ❘}^{2}] p_{n}$

$\geq ζ σ^{2} + ζ \sum_{u = n + 1}^{N + K} [tr ({\bar{R}}_{u} Q_{u}) + {❘ h_{u} ❘}^{2}] p_{u}, \forall n \in N,$

$\sum_{k \in K} [tr ({\hat{R}}_{k} Q_{k}) + {❘ {\hat{h}}_{k} ❘}^{2}] + σ^{2} \leq E_{0} K^{2}$

Based on the foregoing approximations, the STAR-RIS configuration subproblem can be approximated as follows:

$find {Q_{u}}, {β_{u}}$

$s . t . Q_{u} \pm 0, \forall u \in U,$

$rank (Q_{u}) = 1, \forall u \in U,$

$Diag (Q_{u}) = β_{u}, \forall ι ι \in U,$

$β_{m}^{R} + β_{m}^{T} = 1, \forall m \in M,$

$β_{m}^{R}, β_{m}^{T} \in {0, 1}, \forall m \in M$

$tr ({\bar{R}}_{1} Q_{1}) + {❘ h_{1} ❘}^{2} \geq \dots \geq tr ({\bar{R}}_{N} Q_{N}) + {❘ h_{N} ❘}^{2} \geq tr ({\bar{R}}_{k} Q_{k}) + {❘ h_{k} ❘}^{2}, \forall k \in K,$

$[tr ({\bar{R}}_{n} Q_{n}) + {❘ h_{n} ❘}^{2}] p_{n} \geq {ζσ}^{2} + ζ \sum_{u = n + 1}^{N + K} [tr ({\bar{R}}_{u} Q_{u}) + {❘ h_{u} ❘}^{2}] p_{u}, \forall n \in N,$

$\sum_{k \in K} [tr ({\hat{R}}_{k} Q_{k}) + {❘ {\hat{h}}_{k} ❘}^{2}] + σ^{2} \leq E_{0} K^{2}$

In the present disclosure, the non-convex rank-one constraint and the binary constraint in the foregoing problem can be equivalently transformed into the following form:

$rank (Q_{u}) = 1 \Leftrightarrow { Q_{u} }_{*} - { Q_{u} }_{2} = 0, \forall u \in U,$

$β_{m}^{χ} \in {0, 1} \Leftrightarrow β_{m}^{χ} - {(β_{m}^{χ})}^{2} = 0, \forall χ \in {R, T}, \forall m \in M$

∥⋅∥_*represents a kernel norm. ∥⋅∥₂represents a spectral norm.

Then, the present disclosure adds the foregoing two equations to the objective function of the transformed subproblem as penalty terms, to obtain the following problem:

$\min_{{Q_{u}}, {β_{u}}} η_{1} \sum_{u} ({ Q_{u} }_{*} - { Q_{u} }_{2}) + η_{2} \sum_{m} \sum_{χ} (β_{m}^{χ} - {(β_{m}^{χ})}^{2})$

$s . t . Q_{u} \pm 0, \forall u \in U,$

$Diag (Q_{u}) = β_{u}, \forall ι ι \in U,$

$β_{m}^{R} + β_{m}^{T} = 1, \forall m \in M,$

$tr ({\bar{R}}_{1} Q_{1}) + {❘ h_{1} ❘}^{2} \geq \dots \geq tr ({\bar{R}}_{N} Q_{N}) + {❘ h_{N} ❘}^{2} \geq tr ({\bar{R}}_{k} Q_{k}) + {❘ h_{k} ❘}^{2}, \forall k \in K,$

$[tr ({\bar{R}}_{n} Q_{n}) + {❘ h_{n} ❘}^{2}] p_{n} \geq {ζσ}^{2} + ζ \sum_{u = n + 1}^{N + K} [tr ({\bar{R}}_{u} Q_{u}) + {❘ h_{u} ❘}^{2}] p_{u}, \forall n \in N,$

$\sum_{k \in K} [tr ({\hat{R}}_{k} Q_{k}) + {❘ {\hat{h}}_{k} ❘}^{2}] + σ^{2} \leq E_{0} K^{2}$

η₁and η₂are two non-negative penalty factors that penalize the objective function if a rank of {Q_u} is not 1 or {β_m^χ} is not binary. However, these penalty terms make the objective function of the foregoing problem non-convex. In the present disclosure, a suboptimal solution is obtained through continuous iterations by using an SCA method. Specifically, in the present disclosure, ∥Q_u∥₂and (β_m^χ)²are linearized by using fixed points Q_u[ custom-character ] and β_m^χ[] through first-order Taylor expansion in an ^thiteration, that is:

${ Q_{u} }_{*} - { Q_{u} }_{2} \leq { Q_{u} }_{*} - {{ Q_{u} [ℓ] }_{2} + tr [{\bar{q}}_{\max} [ℓ] {({\bar{q}}_{\max} [ℓ])}^{H} (Q_{u} - Q_{u} [ℓ])]},$

$β_{m}^{χ} - {(β_{m}^{χ})}^{2} \leq β_{m}^{χ} - [{(β_{m}^{χ} [ℓ])}^{2} + 2 β_{m}^{χ} [ℓ] (β_{m}^{χ} - β_{m}^{χ} [ℓ])]$

Then, the non-convex penalty terms in the custom-character ^thiteration are replaced by solved convex upper bounds. A transformed problem is a convex SDP problem, which can be effectively solved through CVX. A penalty-based SCA algorithm proposed to solve the STAR-RIS configuration subproblem is composed of two loops: an inner loop used to iteratively solve an approximate SDP problem of the configuration subproblem and an outer loop used to continuously update the penalty factors and determine whether an iteration termination condition is met. A constraint violation is defined as follows:

$\bar{o} = \max {{ Q_{u} }_{*} - { Q_{u} }_{2}, β_{m}^{χ} - {(β_{m}^{χ})}^{2}}$

The algorithm is described in detail below.

Specifically, {Q_u[0]}, {β_u[0]}, the penalty factors η₁and η₂, corresponding scale factors ñ₁and ñ₂, and preset accuracy ò₁and ò₂are initialized first. An outer iteration index custom-character ₁=0 is set. The constraint violation ō[₁] is computed. An inner iteration index ₂=0 is set. An objective function value E_tot[₂] of the approximate SDP problem of the STAR-RIS configuration subproblem is computed. ₂=₂+1 is updated. The approximate SDP problem is solved to update Q_u[ custom-character ₂] and β_u[₂]. The objective function value E_tot[₂] is updated. The foregoing process is repeated until

$\frac{❘ E_{t o t} [ℓ_{2}] - E_{tot} [ℓ_{2} - 1] ❘}{E_{tot} [ℓ_{2^{-}} 1]} \leq ò_{2}$

or a quantity of inner iterations custom-character ₂≥L₂. ₁=₁+1, Q_u[₁]=Q_u[₂], β_u[₁]=β_u[₂], the constraint violation ō[₁], η₁=ñ₁η₁, and η₂=ñ₂η₂are updated. The foregoing process is repeated until ₁]≤₁or ₁≥L₁.

The present disclosure proposes an alternating optimization algorithm to jointly optimize the user power allocation and the configuration of the STAR-RIS, to minimize the total uplink transmit power consumption.

Specifically, {p_u[0]}, {Q_u[0]}, {β_u[0]}, and preset accuracy ò₃are initialized. A current iteration index custom-character ₃=0 is set. Given {Q_u[₃]} and {β_u[₃]}, {p_u[₃+1]} is computed by using a closed-form expression of optimal user power allocation. Given {p_u[₃+1]}, {Q_u[₃+1]} and {β_u[₃+1]} are updated through the penalty-based SCA algorithm. ₃=₃+1 is updated. The foregoing process is repeated until a value of the objective function decreases to the preset accuracy or a preset maximum quantity L₃of iterations is reached.

Because the total transmit power decreases as a quantity of iterations increases in the alternating optimization algorithm, and there is a lower bound constraint, the proposed alternating optimization algorithm can ensure convergence. In each iteration, computational complexity of the algorithm mainly depends on the step of solving the approximate SDP problem, with complexity of O(L_oL_i(M²+M)³). L₀=min {L₁, log(1/ò₁)} and L_i=min{L₂, log(1/ò₂)} respectively represent quantities of outer and inner iterations required for the convergence of the penalty-based SCA algorithm.

Simulation results show that the method provided in the present disclosure can effectively reduce communication overheads and transmission delay in comparison with CL and can improve learning accuracy in comparison with FL.

The foregoing embodiments are used only to describe the technical solutions of the present disclosure, and are not intended to limit same. Although the present disclosure is described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still modify the technical solutions described in the foregoing embodiments, or make equivalent substitutions to some technical features therein. These modifications or substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present disclosure.

SEMI-FEDERATED LEARNING METHOD BASED ON NEXT-GENERATION MULTIPLE ACCESS TECHNOLOGY

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)