SEMI-FEDERATED LEARNING METHOD BASED ON NEXT-GENERATION MULTIPLE ACCESS TECHNOLOGY

Information

  • Patent Application
  • 20240232719
  • Publication Number
    20240232719
  • Date Filed
    November 21, 2023
    a year ago
  • Date Published
    July 11, 2024
    5 months ago
  • CPC
    • G06N20/00
  • International Classifications
    • G06N20/00
Abstract
A semi-federated learning (semiFL) method based on a next-generation multiple access (NGMA) technology is provided. Centralized learning (CL) and FL are integrated such that devices with weak computing capabilities can also participate in training of a global model. A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) is deployed to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users and computing-centric FL users can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of a base station (BS), and improves accuracy of the global model. The semiFL method also integrates a strategy for jointly optimizing user power allocation and a configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent Internet of Things (IoT) network.
Description
CROSS REFERENCE TO THE RELATED APPLICATIONS

This application is based upon and claims priority to Chinese Patent Application No. 202310012159.4, filed on Jan. 5, 2023, the entire contents of which are incorporated herein by reference.


TECHNICAL FIELD

The present disclosure relates to the technical field of federated learning (FL), and in particular, to a semi-federated learning (semiFL) method based on a next-generation multiple access (NGMA) technology.


BACKGROUND

With rapid development and integration of machine learning (ML) and wireless communication, massive distributed devices can generate a large amount of real-time information and multimodal data. In a large-scale wireless Internet of Things (IoT) scenario, scarce spectrum resources lead to a communication bottleneck. In addition, the massive distributed devices have different computing capabilities. These factors cause intelligent IoT that relies upon traditional ML technology to face severe challenges.


Although FL can significantly reduce communication overheads and training time of traditional centralized learning (CL), the distributed characteristic of FL compromises training accuracy of a model. In addition, an important feature of FL different from CL is that powerful computing resources of a base station (BS) are not easily used for model training in FL, user data is locally stored, and all nodes perform model training on local devices. However, an actual large-scale wireless IoT scenario cannot meet this requirement because devices have heterogeneous computing capabilities and it is difficult for devices with weak computing capabilities to cooperate with devices with strong computing capabilities to train a shared model. The foregoing limitations of IoT make existing ML paradigms (such as CL and FL) inefficient when the paradigms are directly combined with traditional communication technologies. Therefore, it is highly desirable to develop a new learning-oriented network technology for high-efficient model training in wireless IoT.


In addition, it is necessary to realize that communication and computing of devices in a system require considerable energy. IoT devices with limited battery capacity are difficult to support normal operation of a distributed system for a long time. Furthermore, some devices may be deployed in inaccessible or dangerous locations, making periodic charging of the batteries very difficult. Therefore, it is very important to design a high-efficient power control strategy to prolong a life cycle of a wireless IoT network.


SUMMARY

To resolve a problem that intelligent performance of a network edge is reduced due to heterogeneous computing capabilities of devices and limited resources in an existing intelligent IoT scenario, the present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of a global model. In addition, a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) is deployed to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users and computing-centric FL users can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of a BS, and improves accuracy of the global model. The method provided in the present disclosure also integrates a strategy for jointly optimizing user power allocation and a configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.


To achieve the foregoing objective, the present disclosure provides the following technical solutions:


The present disclosure provides a semiFL method based on an NGMA technology, including the following steps:

    • S1: reporting, by users, state information to a BS, where the state information includes instantaneous channel state information (CSI) and available central processing unit (CPU) frequency state information;
    • S2: after receiving the reported state information, classifying, by the BS, the users into communication-centric CL users and computing-centric FL users based on computing capabilities of local devices of the users, and broadcasting a classification result to all users after classification;
    • S3: training, by each FL user, a local model through a local data set based on a global model W obtained in a previous round, and computing a local gradient gk; and preparing, by each CL user, a local data set Dn to be uploaded to the BS;
    • S4: encoding the local data set of each CL user into a communication symbol {sn}, processing the gradient of the local model trained by each FL user into a computation symbol {sk}, and sending, by all users, the information bearing symbols of the users to the BS by using the NGMA technology in combination with a STAR-RIS;
    • S5: receiving, by the BS, a superimposed signal from the two types of users, decoding the local data sets from the CL users to perform centralized training and obtain an average gradient, aggregating the gradients of the local models from the FL users, and aggregating a global model by using the obtained gradients:
    • S6: after each round of communication is completed, broadcasting, by the BS, the latest global model w∈custom-characterQ to all FL users for gradient computation in a next round; and
    • S7: repeating the foregoing steps until convergence or a maximum quantity of rounds of communication is reached.


Further, the NGMA technology in S4 provides services for all users in a same frequency band in a non-orthogonal manner such that all users are capable of communicating in parallel on a same time-frequency resource.


Further, the STAR-RIS deployed in S4 modifies an amplitude and a phase of an incident signal to reshape a wireless transmission environment and adjust channel gains of different users.


Further, S5 specifically includes:

    • S51: detecting, by the BS, communication output {sn} of each CL user through successive interference cancellation (SIC), decoding the communication output to generate training samples {Dn} for CL, and training the model through a gradient descent method to obtain the average gradient gcustom-characterQ of the CL users as follows:







CL
:


g
_


=



1
N






n

N




g
n



=


1
N






n

N








F
n

(

w
;

D
n


)











    • where N is a quantity of the CL users, gn=∇Fn(w;Dn)∈custom-characterQ represents a gradient of the nth CL user computed by the BS,











F
n

(

w
;

D
n


)

=


1



"\[LeftBracketingBar]"


D
n



"\[RightBracketingBar]"








i
=
1




"\[LeftBracketingBar]"


D
n



"\[RightBracketingBar]"





f

(

w
;

D
n

(
i
)



)









    •  is an objective function used to train a model parameter w∈custom-characterQ, and ƒ(w;Dn(i)) is a loss function of the model with respect to the ith sample Dn(i) of the nth CL user;

    • S52: assuming that all symbols from the CL users are successfully decoded in S51, subtracting, by the BS, signals of the CL users from the received superimposed signal to obtain a residual signal










y
^

=








k

K





h
_

k




p
k




s
k


+

z
0








    •  that contains signals of the FL users, performing averaging on the residual signal that contains the signals of the FL users, restoring the local gradients {gk} of the FL users from the computation symbols {sk}, and finally obtaining an estimated average gradient of the FL users as follows:










FL
:


g
^


=



y
^

K

=


1
K



(





k

K






h
_

k




p
k




g
k



+

z
0


)









    • where K is a quantity of the FL users, z0˜ N (0,σ2I)∈custom-characterQ is a noise vector at the BS, and σ2 is noise power;

    • S53: after obtaining the gradients {g,ĝ}, updating, by the BS, a global gradient as follows:










SemiFL
:


g
~


=



N

N
+
K




g
_


+


K

N
+
K




g
^









    • updating the global model by using w:=w−λ{tilde over (g)}, where λ>0 is a learning rate.





Further, before each round of communication, user transmit power allocation and a configuration of the STAR-RIS are jointly optimized with an objective of minimizing total transmit power consumption of the round, and an optimization problem and constraints are constructed as follows:






s
.
t
.







min


{

p
u

}

,

{

Θ
u

}









u
=
1


N
+
K




p
u












"\[LeftBracketingBar]"



h
_

1



"\[RightBracketingBar]"


2










"\[LeftBracketingBar]"




h
_



N



"\[RightBracketingBar]"




2








"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


,




k

K


,









R
n

(


{

p
u

}

,

{

Θ
u

}


)



R
min


,




n

N


,








MSE

(


{

p
k

}

,

{

Θ
k

}


)



E
0


,








p
u


0

,


Θ
u


Q

,




u

U


,






    • where U=N∪K is a set of all users, N≙32 {1, 2, . . . , N} is a set of the CL users. K≙{N+1, N+2, . . . , N+K} is a set of the FL users, pu is transmit power of the uth user, Θu is a coefficient matrix of the STAR-RIS of the uth user, hu is a joint channel of the BS, the STAR-RIS and the user, Q={βmR, βmT, θmR, θmTmR, βmT∈{0, 1}; θmR, θmT∈[0,2π]; βmRmT=1} is a feasible set of refraction and reflection coefficients of the STAR-RIS, βmχ∈{0,1} and θmχ∈[0,2π] are respectively an amplitude and a phase shift of an mth element in χ∈{R, T} mode, Rmin is a minimum data transmission rate for meeting a quality of service (Qos) requirement of the CL users, E0 is a maximum computation distortion that the FL users can tolerate, Rn ({pu}, {Θu}) is a data transmission rate of the nth CL user, and MSE({pk}, {Θk}) is a computation distortion of the kth FL user.





Further, the optimization problem is decoupled into two subproblems. Alternating optimization is performed on the transmit power {pn} of the user and the configuration {Θu} of the STAR-RIS of the user.


Further, during the alternating optimization, when {Θu} is fixed, for the {pu} subproblem, the constraints are rewritten by using an uplink communication rate expression








R
n

=

B




log
2

(

1
+






"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2



p
n







u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u



+

σ
2




)



,








n

N





of the CL user and a computation distortion expression






MSE

=
^



E
[




"\[LeftBracketingBar]"



s
^

-


1
K






k

K



s
k






"\[RightBracketingBar]"


2

]

=


1

K
2




(





k

K






"\[LeftBracketingBar]"





h
_

k




p
k



-
1



"\[RightBracketingBar]"


2


+

σ
2


)







of the FL user, to equivalently express the user power allocation subproblem.


For a transformed expression, power allocation {pk} of the FL users is fixed, and the following closed-form expression of optimal power allocation {p*n} of the CL users is derived through mathematical induction:







p
n
*

=



ζ

(

ζ
+
1

)


N
-
n




(







k
=

N
+
1






N
+
K








"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k
*



+

σ
2


)






"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"



-
2







Power allocation {pn} of the CL users is fixed, {circumflex over (p)}k=√{square root over (pk)}, the optimization problem is reorganized, and the following closed-form expression of optimal power allocation {p*k} of the FL users is obtained through a Lagrange duality method:







p
k
*

=


τ
2

*
2







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2




(

1
+


τ
1
*


ζ





"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


+


τ
2
*






"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



)


-
2









    • where τ*1 is an optimal dual variable related to the QoS constraint, and τ*2 is an optimal dual variable related to the MSE constraint.





Further, during the alternating optimization, when {pn} is fixed, the {Θn} subproblem is a feasibility check problem and is expressed as follows:






find



{

Θ
u

}









s
.
t
.





"\[LeftBracketingBar]"



h
_

1



"\[RightBracketingBar]"


2











"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


,









k

K


,









R
n

(


{

p
u

}

,

{

Θ
u

}


)



R
min


,









n

N


,








MSE

(


{

p
k

}

,

{

Θ
k

}


)



E
0


,








Θ
u


Q

,









u

U


,










R
u

=

diag


{


r
_

H

}



r
u



,









R
_

u

=

[





R
u



R
u
H






R
u



h
u
H








h
u



R
u
H




0



]


,









q
_

u

=

[




q
u





1



]


,





and






Q
u

=



q
_

u




q
_

u
H






are introduced. A joint uplink channel coefficient is rewritten. The subproblem is further expressed. Qu±0, Diag(Qn)=βu, a non-convex rank-one constraint rank(Qu)=1 exists, and a transformed expression also has a binary variable.


∥Qu0−∥Qu2=0, ∀u∈U and βmχ−(βmχ)2=0, ∀χ∈{R, T}, ∀m∈M are introduced to transform the non-convex rank-one constraint and the binary variable into penalty terms in an objective function. Because the penalty terms are non-convex, convex upper bounds of the penalty terms are obtained through first-order Taylor expansion in an custom-characterth iteration as follows:












Q
u



*

-




Q
u



2








Q
u



*

-

{






Q
u

[

]



2

+

tr
[




q
_

max

[

]




(



q
_

max

[

]

)

H



(


Q
u

-


Q
u

[

]


)


]


}



,








β
m
χ

-


(

β
m
χ

)

2





β
m
χ

-

[



(


β
m
χ

[

]

)

2

+

2



β
m
χ

[

]



(


β
m
χ

-


β
m
χ

[

]


)



]






The convex upper bounds are introduced to the objective function as penalty functions to obtain a convex semidefinite programming (SDP) problem.


Further, solving the convex SDP problem includes: continuously updating penalty factors η1 and η2 of the penalty terms, and solving the SDP problem through an iterative method until the penalty terms satisfy a predefined maximum violation or a predefined maximum quantity of outer iterations is reached.


Further, performing alternating optimization on the user power allocation subproblem and the STAR-RIS configuration subproblem specifically includes: initializing {pu[0]}, {Qu[0]}, {βu[0]}, and preset accuracy Ò3; and setting a current iteration index custom-character3=0, given {Qu[custom-character3]} and {βu[custom-character3]}, computing {pu[custom-character3+1]} by using a closed-form expression of optimal user power allocation, given {pu[custom-character3+1]}, updating {Qu[custom-character3+1]} and {Bu[custom-character+1]} through a penalty-based successive convex approximation (SCA) method, updating custom-character3=custom-character3+1, and repeating the foregoing process until a value of an objective function decreases to the preset accuracy or a preset maximum quantity L3 of iterations is reached.


Compared with the prior art, the present disclosure has the following beneficial effects:


The present disclosure provides the semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities can also participate in training of the global model. During uplink transmission, channel conditions of users are flexibly adjusted through the STAR-RIS to dynamically change a channel environment such that a system can meet different task requirements of heterogeneous users. Communication-centric CL users with weak local computing capabilities and computing-centric FL users with strong local computing capabilities can transmit data in parallel on a same time-frequency resource. This avoids a waste of data resources, enriches data obtaining of the BS, and improves accuracy of the global model.


In a large-scale wireless IoT scenario, an underlying device may face a dilemma of limited battery capacity and inconvenient periodic charging, which seriously affects a life cycle of an entire system. To resolve this problem, the present disclosure constructs a mixed integer non-linear programming problem for jointly optimizing power allocation and the configuration of the STAR-RIS with an objective of minimizing user transmit power consumption. The proposed non-convex optimization problem can be decoupled into two subproblems. For the user power allocation subproblem, the closed-form expressions of the optimal power allocation can be derived through mathematical induction and the Lagrange duality method. For the STAR-RIS configuration subproblem, the original feasibility check problem can be transformed into the convex SDP problem through a penalty function method and the SCA method, and the SDP problem is solved through CVX. In summary, the method provided in the present disclosure integrates a strategy for jointly optimizing the user power allocation and the configuration of the STAR-RIS to reduce total uplink transmit power consumption of the system and prolong a life cycle of an intelligent IoT network.





BRIEF DESCRIPTION OF THE DRAWINGS

To describe the technical solutions in embodiments of the present application or in the prior art more clearly, the following briefly describes the accompanying drawings required for describing the embodiments. Apparently, the accompanying drawings in the following description show merely some embodiments of the present disclosure, and persons of ordinary skill in the art may still derive other accompanying drawings from these accompanying drawings.



FIG. 1 is a flowchart of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure;



FIG. 2 is an architectural diagram of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure;



FIG. 3 is an application structural diagram of a semiFL method based on an NGMA technology according to an embodiment of the present disclosure; and



FIG. 4 is a principle diagram of a STAR-RIS-assisted NGMA technology according to an embodiment of the present disclosure.





DETAILED DESCRIPTION OF THE EMBODIMENTS

The present disclosure provides a semiFL method based on an NGMA technology. CL and FL are integrated such that devices with weak computing capabilities in a large-scale wireless IoT scenario can also participate in training of a global model. During uplink transmission, a channel environment of a user is flexibly adjusted through a STAR-RIS such that CL users with weak computing capabilities and FL users with strong computing capabilities can communicate in parallel on a shared time-frequency resource. Then, the present disclosure studies how to minimize total user transmit power consumption, constructs a mixed integer non-linear programming problem for jointly optimizing user power allocation and a configuration of the STAR-RIS, and obtains an optimal suboptimal solution through alternating optimization. Specifically, for the user power allocation subproblem, closed-form expressions of optimal power are obtained through mathematical induction and a Lagrange duality method. For the STAR-RIS configuration subproblem, a feasibility check problem is transformed into a convex SDP problem through a penalty function method and an SCA method.


To better understand the technical solutions, the following describes in detail a method in the present disclosure with reference to the accompanying drawings. Apparently, the described embodiments are merely some rather than all of the embodiments of the present disclosure. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.


Referring to FIG. 1 and FIG. 2, in a semiFL method based on an NGMA technology provided in the present disclosure, all users participate in training of a global model and can transmit data in parallel on a shared time-frequency resource. The method includes the following steps:

    • S1: Before each round of communication starts, state information such as instantaneous CSI and available CPU frequency state information is estimated and reported to a BS by all local users.
    • S2: After the reported state information is received, the users are classified by the BS into communication-centric CL users and computing-centric FL users based on computing capabilities of local devices of the users, and a classification result is broadcast to all users after classification.


A STAR-RIS-assisted wireless network that supports collaborative learning for heterogeneous users is considered, as shown in FIG. 3. A heterogeneous user set U=N∪K is divided into communication-centric CL users N=≙{1, 2, . . . , N} with weak computing capabilities and computing-centric FL users K ≙{N+1, N+2, . . . , N+K} with strong computing capabilities based on different computing capabilities of the local devices of the users. There are M passive reflective/refractive elements in a STAR-RIS. Each element relays (refracts or reflects) an incident signal to a desired direction. To avoid energy leakage and facilitate synchronous communication of all users, the STAR-RIS uses a mode switching protocol to select at least one of the elements to work in total refraction mode (namely, T mode) and the other elements to work in total reflection mode (namely, R mode). qχ=√{square root over ([β1χ)} e1χ, √{square root over (β2χ)} e2χ, . . . , √{square root over (βMχ)} eMχ]H represents a reflection (χ=R) and refraction (χ=T) vector. βmχ∈{0,1} and θmχ∈[0,2π] respectively represent an amplitude and a phase shift of the mth element in χ∈{R, T} mode. Because each element can work in only one mode in a specific timeslot, a constraint on mode switching is βmRmT=1, ∀m∈M≙{1, 2, . . . , M}.


A channel gain of a wireless link can be obtained by multiplying a path loss by small-scale fading. Specifically, a link distance from the STAR-RIS to the BS and the user is denoted as Λu, ∀u∈Ū≙{0}∪U. u=0) represents the BS. u∈U represents the user. Large-scale fading is denoted as Lu0u)−α, ∀u∈Ū≙{0}∪U. ζ0 is a path loss at a reference distance of 1 m. α≥2 is a path loss index. For small-scale fading, the present disclosure assumes that a channel between the user and the BS is subject to Rayleigh fading due to blocking and extensive scattering. Because the STAR-RIS is deployed at height, it can be assumed that all RIS-related links are subject to Ricean fading. Therefore, the present disclosure can represent a channel coefficient of all RIS-related links as follows:








r
u

=




ς
0


Λ
u
α





(




κ

κ
+
1





r
u
LoS


+



1

κ
+
1





r
u
NLoS



)



,








u


U
_






κ is a Ricean factor. ruLoScustom-characterM is a deterministic line-of-sight channel component. ruNLoScustom-characterM is a Rayleigh fading channel component. The STAR-RIS divides effective coverage of the BS into refraction space and reflection space. In a system in which the STAR-RIS is deployed, a joint channel coefficient of an up link from the uth user to the BS is as follows:









h
_

u

=


?

+


r
0
H



Θ
u



r
u




,








u

U








?

indicates text missing or illegible when filed




hucustom-character represents a direct link from the uth user to the BS. r0HΘuru represents a double fading reflection/refraction link provided by the STAR-RIS. Specifically, if the uth user is located in the reflection space, Θu=diag(qR). If the uth user is located in the refraction space, Θu=diag(qT). The present disclosure assumes that instantaneous CSI of all channels is available at the BS.

    • S3: A local gradient gk is computed by each FL user based on a model W obtained in a previous round, and a local data set Dn to be uploaded to the BS is prepared by each CL user.
    • S4: The local data set of each CL user is encoded into a communication symbol {sn}, the gradient of the local model trained by each FL user is processed into a computation symbol {sk}, and the information bearing symbols of all users are sent to the BS by using an NGMA technology.


Referring to FIG. 4, the present disclosure designs an NGMA technology integrating a non-orthogonal multiple access (NOMA) technology and an AirComp technology by utilizing a superimposition characteristic of signals in a wireless channel. Wireless access services are provided for CL and FL users through a STAR-RIS-assisted multiple access channel such that uplink communication of an original data set of the CL user and over-the-air computation of a model parameter (such as gradient information) of the FL user are multiplexed in a same frequency band.


In each round of communication, the CL user maps the local data set {dn} to the communication symbol {sn} and the FL user maps and the gradient {gk} to the computation symbol {sk} first. The NGMA technology provides services for all users in a same frequency band in a non-orthogonal manner such that the users can concurrently perform uplink communication. The communication symbol {sn} is transmitted through a power domain NOMA technology. The computation symbol {sk} is transmitted through the AirComp technology. Therefore, a superimposed signal received by the BS is as follows:






y
=









n
=
1




N





h
_

n




p
n




s
n






CL


users


+








k
=

N
+
1






N
+
K






h
_

k




p
k




s
k






FL


users


+



z
0



Noise






pn (pk) is transmit power of the nth (kth) user. z0˜CN (0,σ2) is additive white Gaussian noise (AWGN) at the BS. The present disclosure assumes that all symbols {su}={sn}∪{sk} are statistically independent and have zero mean and normalized variance, that is, E[su2]=1, ∀u∈U and E[suHsν]=0, ∀u≠#ν∈U.


With the help of the STAR-RIS, it can be ensured that parallel communication between the two groups of users is smooth even if some direct links between the BS and the users are blocked. The CL users and the FL users have different communication objectives and performance indicators. For example, the communication-centric CL users expect that the local data sets sent to the BS can be perfectly decoded by the BS with an objective of maximizing a data transmission rate. The computing-centric FL users expect that local model parameters sent to the BS can be carefully aggregated by the BS with an objective of minimizing a computation distortion MSE. Considering the plurality of objectives involved in NGMA, the present disclosure expects to improve throughput of the CL users by suppressing interference, while completing weighting computation of the model parameters of the FL users by utilizing the superimposition characteristic of channels. Therefore, there is a need to design a collaborative transmission mechanism with an efficient interference management capability in such a joint communication and learning framework.


Heterogeneous users are classified into strong users and weak users based on different path losses. The BS separates the superimposed signal through SIC to perfectly decode a signal from the strong users. The strong signal is modulated and subtracted from the received superimposed signal V to obtain a signal from the weak users. This leaves the weak signal for collaborative over-the-air computation. To implement the foregoing process, the present disclosure needs to reshape a wireless transmission environment by modifying an amplitude and a phase of an incident signal through the STAR-RIS such that channel gains of the two groups of users are adjusted in the following order:













"\[LeftBracketingBar]"



h
_

1



"\[RightBracketingBar]"


2






"\[LeftBracketingBar]"



h
_

2



"\[RightBracketingBar]"


2








"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2





Strong


users










"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2

,



k

K






Weak


users






Specifically, a channel coefficient is adjusted through the STAR-RIS to arrange all CL users as strong users for communication message decoding and all FL users as weak users for model aggregation. Based on the superimposed signal and a decoding order, an uplink communication rate of the nth CL user is as follows:








R
n

=

B




log
2

(

1
+






"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2



p
n







u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u



+

σ
2




)



,








n

N





B is available bandwidth of the system.


The communication symbols {sn} sent by the CL users may be decoded and modulated by the BS, and subtracted from the superimposed signal y received by the BS to obtain a residual signal







y
^

=







k

K






h
_

k




p
k




s
k



+

z
0






that contains gradient information of the models of the FL users. For aggregation of the FL models, linear computation output estimated by the BS is as follows:







s
^

=



y
^

K

=


1
K



(







k

K






h
_

k




p
k




s
k



+

z
0


)







A computation distortion may be quantified by a mean square error (MSE) as follows:






MSE

=
^



E
[




"\[LeftBracketingBar]"



s
^

-


1
K






k

K



s
k






"\[RightBracketingBar]"


2

]

=


1

K
2




(





k

K






"\[LeftBracketingBar]"





h
_

k




p
k



-
1



"\[RightBracketingBar]"


2


+

σ
2


)









    • S5: The superimposed signal from the two groups of users is received by the BS, the local data sets from the CL users are decoded to perform centralized training and obtain an average gradient, the gradients of the local models from the FL users are aggregated, and a global model is aggregated by using the obtained gradients.





Specifically, referring to FIG. 4, the BS first detects communication output {sn} of each CL user through SIC, decodes the communication output to generate training samples {Dn} for CL, and trains the model through a gradient descent method to obtain the average gradient gcustom-characterQ of the CL users as follows:







CL
:


g
¯


=



1
N






n

N



g
n



=


1
N






n

N






F
n

(

w
;

D
n


)











    • where N is a quantity of the CL users, gn=∇Fn(w;Dn)∈custom-characterQ represents a gradient of the nth CL user computed by the BS,











F
n

(

w
;

D
n


)

=


1



"\[LeftBracketingBar]"


D
n



"\[RightBracketingBar]"








i
=
1




"\[LeftBracketingBar]"


D
n



"\[RightBracketingBar]"




f

(

w
;

D
n

(
i
)



)







is an objective function used to train a model parameter w∈custom-characterQ, and ƒ(w;Dn(i)) is a loss function of the model with respect to the ith sample Dn(i) of the nth CL user.


Then, the BS subtracts the decoded information symbols of the CL users from the received superimposed signal to obtain the residual signal







y
ˆ

=







k

K






h
_

k




p
k




s
k



+

z
0






that contains model information of the FL users, performs averaging on the residual signal, restores the local gradients {gk} of the FL users from the computation symbols {sk}, and finally obtains an estimated average gradient of the FL users as follows:







FL
:


g
ˆ


=



y
ˆ

K

=


1
K



(





k

K





h
_

k




p
k




g
k



+

z
0


)









    • where K is a quantity of the FL users, z0˜N (0,σ2I)∈custom-characterQ is a noise vector at the BS, and σ2 is noise power.





Finally, after obtaining the gradients (g,ĝ), the BS updates a global gradient as follows:







SemiFL
:


g
~


=



N

N
+
K




g
¯


+


K

N
+
K




g
ˆ







The global model is updated by using w:=w−λ{tilde over (g)}, where λ>0 is a learning rate.

    • S6: After each round of communication is completed, the latest global model w∈custom-characterQ is broadcast by the BS to all FL users for gradient computation in a next round.
    • S7: The foregoing steps are repeated until convergence or a maximum quantity of rounds of communication is reached.


The semiFL method based on an NGMA technology provided in the present disclosure considers reducing total uplink transmit power consumption while meeting requirements for computation distortion tolerance of the FL users and data transmission rates of the CL users. Specifically, before each round of communication, an optimization problem and constraints are constructed with an objective of minimizing total transmit power consumption of the round. User transmit power allocation and a configuration of the STAR-RIS are jointly optimized through alternating optimization.


The objective of the present disclosure is to minimize the total transmit power consumption of each round of communication by jointly optimizing uplink power allocation for all users and the configuration of the STAR-RIS. Considering a QoS requirement of the CL users and the computation distortion tolerance of the FL users, the considered optimization problem can be expressed as follows:







min


{

p
u

}

,

{

Θ
u

}









u
=
1


N
+
K




p
u









s
.
t
.





"\[LeftBracketingBar]"



h
_

1



"\[RightBracketingBar]"


2










"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


,



k

K


,









R
n

(


{

p
u

}

,

{

Θ
u

}


)



R
min


,




n

N


,








MSE

(


{

p
k

}

,

{

Θ
k

}


)



E
0


,








p
u


0

,


Θ
u


Q


,




u

U


,




Q={βmR, βmT, θmR, θmTmR, βmT∈{0, 1}; θmR, θmT∈[0,2π]; βmRmT=1} is a feasible set of refraction and reflection coefficients of the STAR-RIS. Rmin is a minimum data transmission rate for meeting the requirement of the CL users. The constraint |h1|2≥ . . . |hN|2≥|hk|2, ∀k∈K represents a decoding order that ensures successful separation of the communication symbols and the computation symbols. The constraint Rn({pu}, {Θu})≥Rmin, ∀n∈N is the QoS requirement of the CL users. The constraint MSE({pk}, {Θk})≤ E0 ensures that the computation distortion of the FL users does not exceed E0<1/K.


Due to non-convexity of the constraints, directly solving the constructed optimization problem faces the following difficulties: First, optimization of the configuration of the STAR-RIS is more complex than that of a traditional RIS with only reflection coefficients. Second, existence of the discrete variable {βm(χ)} and other continuous variables makes the optimization problem a mixed integer programming problem. It is difficult to find optimal solutions of the highly coupled variables {pu} and {Θu} in polynomial time complexity. To effectively solve the proposed problem, the present disclosure decouples the problem into two subproblems: a power allocation subproblem and a STAR-RIS configuration subproblem. Alternating optimization is performed on the two subproblems.


Given the configuration of the STAR-RIS, the related constraints are rewritten by using the expression of the uplink communication rate of the CL user and the expression of the computation distortion MSE of the FL user, to equivalently express the power allocation subproblem in the optimization problem as follows:







min

{


p
u


0

}








u
=
1


N
+
K




p
u










s
.
t
.





"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2




p
n




ζ

(








u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u


+

σ
2


)


,



n

N


,















k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"





h
_

k




p
k



-
1



"\[RightBracketingBar]"


2


+

σ
2





E
0



K
2



,




ζ=2Rmin/B−1 is a constant. In the present disclosure, optimal solutions of the problem are obtained by using an analytical structure and a Lagrange duality method. Finally obtained closed-form expressions of optimal transmit power {p*n} of the CL user and optimal transmit power {p*k} of the FL user are as follows:








p
n
*

=



ζ

(

ζ
+
1

)


N
-
n




(








k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k
*


+

σ
2


)







"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"



-
2




,







p
k
*

=


τ
2

*
2







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2





(

1
+


τ
1
*


ζ





"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


+


τ
2
*






"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



)


-
2


.






Σ*1 is an optimal dual variable related to the QoS constraint. Σ*2 is an optimal dual variable related to the MSE constraint.


A specific proof process for the optimal transmit power {p*n} and {p*k} is as follows:


The solution of the transmit power {pu} is composed of {pn} of the CL user and {pk} of the FL user.


First, given {pk}, the power allocation subproblem degenerates to:







min

{


p
u


0

}








u
=
1


N
+
K




p
n










s
.
t
.





"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2




p
n




ζ

(








u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u


+

σ
2


)


,



n

N


,




Based on an analytical structure of the problem, it can be proved through reduction to absurdity that












"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2



p
n




ζ

(








u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u


+

σ
2


)


,



n

N






is a valid constraint for the optimal solution that meets a minimum QoS requirement. Therefore, given {pk}, optimal transmit power of the Nth CL user can be written as follows:







p
N
*

=


ζ




"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2




(








k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k


+

σ
2


)






Optimal transmit power of the (N−1)th CL user can be written as follows:









p

N
-
1

*




=


ζ




"\[LeftBracketingBar]"



h
_


N
-
1




"\[RightBracketingBar]"


2




(






"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2



p
N
*


+







k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k


+

σ
2


)












=

(
e
)





ζ

(

ζ
+
1

)





"\[LeftBracketingBar]"



h
_


N
-
1




"\[RightBracketingBar]"


2




(








k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k


+

σ
2


)









(e) can be derived by substituting the expression of p*N into the expression of p*N-1, with some simple algebraic operations. Similarly, optimal transmit power of the (N−2)th CL user is as follows:







p

N
-
2

*

=




ζ

(

ζ
+
1

)

2





"\[LeftBracketingBar]"



h
_


N
-
2




"\[RightBracketingBar]"


2




(








k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2



p
k


+

σ
2


)






In summary, a closed-form solution of {p*n} can be obtained through induction.


Then, given {pn}, {circumflex over (p)}k=√{square root over (pk)} and the power allocation subproblem can be reformulated as follows:







min

{



p
ˆ

k


0

}








k
=

N
+
1



N
+
K





p
^

k
2










s
.
t
.

ζ








k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2




p
^

k
2




I
min


,















k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"





h
k

¯




p
ˆ

κ


-
1



"\[RightBracketingBar]"


2


+

σ
2





E
0



K
2



,







where



I
min


=


min

n

N




{






"\[LeftBracketingBar]"



h
_

n



"\[RightBracketingBar]"


2



p
n


-

ζ







u
=

n
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2



p
u



-

ζ


σ
2



}

.






Σ1≥0 and Σ2≥0 represent Lagrange multipliers, and a Lagrange function of the transformed subproblem is as follows:







L

(


{


p
ˆ

k

}

,

τ
1

,

τ
2


)

=





k
=

N
+
1



N
+
K




p
ˆ

k
2


+


τ
1

(


ζ





k
=

N
+
1



N
+
K







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2




p
^

k
2




-

I
min


)

+



τ
2

(





k
=

N
+
1



N
+
K






"\[LeftBracketingBar]"





h
_

k




p
^

k


-
1



"\[RightBracketingBar]"


2


+

σ
2

-


E
0



K
2



)






A dual function of the Lagrange function is as follows:







D

(


τ
1

,

τ
2


)

=


min

{



p
^

k


0

}


L



(


{


p
^

k

}

,

τ
1

,

τ
2


)






Therefore, a dual problem of the transformed subproblem is as follows:







max


τ
1

,


τ
2


0





D

(


τ
1

,

τ
2


)





Given the dual variables τ1 and τ2, an optimal solution








p
^

k
*

=



τ
2





"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"




1
+


τ
1


ζ





"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


+


τ
2






"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2








of the dual function can be obtained by finding an extreme point that satisfies









L





p
^

k
*



=



2



p
^

k
*


+

2


τ
1


ζ





"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2




p
^

k
*


+

2


τ
2





"\[LeftBracketingBar]"





h
_

k




p
^

k


-
1



"\[RightBracketingBar]"






"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"




=
0.





The dual function D(τ1, τ2) is obtained by substituting the optimal solution into the expression of the dual function. For the proposed dual problem, optimal Lagrange multipliers τ*1 and τ*2 are found through a subgradient descent method, that is:








τ
1

[


+
1

]

=


[



τ
1

[

]

+



μ
~

1

(


ζ







k

K








"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2




p
^

k

*
2





-

I
min


)


]

+









τ
2

[


+
1

]

=


[



τ
2

[

]

+



μ
~

2

(







k

K







"\[LeftBracketingBar]"





h
_

k




p
^

k
*


-
1



"\[RightBracketingBar]"


2


+

σ
2

-


E
0



K
2



)


]

+






custom-character is an iteration index. {tilde over (μ)}1, {tilde over (μ)}2>0 is a step, which is a constant. The optimal transmit power of the FL user can be obtained by replacing {τ1, τ2} in the expression of {circumflex over (p)}*k with the obtained optimal dual variables {τ*1, τ*2} and using p*k={circumflex over (p)}*22 to restore {p*k}.


The proof is completed.


Given the user transmit power, because an objective function in the optimization problem is independent of {Θu}, the STAR-RIS configuration subproblem is a feasibility check problem, which can be equivalently expressed as follows:






find



{

Θ
u

}









s

.

t

.








"\[LeftBracketingBar]"



h
_

1



"\[RightBracketingBar]"


2










"\[LeftBracketingBar]"



h
_

N



"\[RightBracketingBar]"


2







"\[LeftBracketingBar]"



h
_

k



"\[RightBracketingBar]"


2


,



k

K


,









R
n

(


{

p
u

}

,

{

Θ
u

}


)



R
min


,



n

N


,








MSE

(


{

p
k

}

,

{

Θ
k

}


)



E
0


,








Θ
u


Q

,



u

U


,




The present disclosure further defines Ru=diag{rH}ru. In this case, rHΘuru=quHRu. If the uth user is located in the reflection space, qu=qR. If the uth user is located in the refraction space. qu=qT. Correspondingly, in the present disclosure, the following can be obtained:












"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2




=





"\[LeftBracketingBar]"



h
u

+



r
¯

H



Θ
u



r
u





"\[RightBracketingBar]"


2

=



"\[LeftBracketingBar]"




h
u

+


q
u
H



R
u




|
2













=



q
u
H



R
u



R
u
H



q
u


+


q
u
H



R
u



h
u
H


+


h
u



R
u
H



q
u


+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2











=




q
_

u
H




R
_

u




q
_

u


+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2











    • where












R
¯

u

=

[





R
u



R
u
H






R
u



h
u
H








h
u



R
u
H




0



]


,



q
_

u

=


[




q
u





1



]

.








    •  It can be found that quHRuqu=tr(RuququH). Qu=ququH is defined, and Qu±0, rank(Qu)=1, and Diag(Qu)=βu need to be satisfied. The vector Diag(Qu) represents an element extracted from a main diagonal of a matrix Qu. If the uth user is located in the reflection space, βuR=[β1R, β2R, . . . , βMR]*. Otherwise. βuT=[β1T, β2T, . . . , βMT]*. Next, a joint uplink channel coefficient may be further organized as follows:













"\[LeftBracketingBar]"



h
_

u



"\[RightBracketingBar]"


2

=



tr

(



R
_

u




q
_

u




q
¯

u
H


)

+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2


=


tr

(



R
_

u



Q
u


)

+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2







Similarly,









R


k

=


R
k




p
k




,



h
ˆ

k

=



h
k




p
k



-
1


,


and




R
^

k


=

[






R


k




R


k
H







R


k




h
^

k
H









h
^

k




R


k
H




0



]






are defined.


In this case, in the present disclosure, the following can be obtained:










"\[LeftBracketingBar]"





h
_

k




p
k



-
1



"\[RightBracketingBar]"


2

=


tr

(



R
ˆ

k



Q
k


)

+




"\[LeftBracketingBar]"



h
^

k



"\[RightBracketingBar]"


2






Based on the foregoing transformation, the first three non-convex constraints in the STAR-RIS configuration subproblem can be re-expressed as follows:








tr

(



R
¯

1



Q
1


)

+




"\[LeftBracketingBar]"


h
1



"\[RightBracketingBar]"


2







tr

(



R
¯

N



Q
N


)

+




"\[LeftBracketingBar]"


h
N



"\[RightBracketingBar]"


2












tr

(



R
¯

k



Q
k


)

+




"\[LeftBracketingBar]"


h
k



"\[RightBracketingBar]"


2



,




k

K


,







[


tr

(



R
¯

n



Q
n


)

+




"\[LeftBracketingBar]"


h
n



"\[RightBracketingBar]"


2


]



p
n











ζ


σ
2


+

ζ





u
=

n
+
1



N
+
K




[


tr

(



R
¯

u



Q
u


)

+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2


]



p
u






,




n

N


,











k

K



[


tr

(



R
^

k



Q
k


)

+




"\[LeftBracketingBar]"



h
ˆ

k



"\[RightBracketingBar]"


2


]


+

σ
2





E
0



K
2






Based on the foregoing approximations, the STAR-RIS configuration subproblem can be approximated as follows:







find



{

Q
u

}


,

{

β
u

}









s
.
t
.


Q
u


±
0

,



u

U


,








rank



(

Q
u

)


=
1

,



u

U


,








Diag



(

Q
u

)


=

β
u


,




ι

ι


U


,









β
m
R

+

β
m
T


=
1

,



m

M


,







β
m
R

,


β
m
T



{

0
,
1

}


,



m

M











tr

(



R
¯

1



Q
1


)

+




"\[LeftBracketingBar]"


h
1



"\[RightBracketingBar]"


2







tr

(



R
¯

N



Q
N


)

+




"\[LeftBracketingBar]"


h
N



"\[RightBracketingBar]"


2





tr

(



R
¯

k



Q
k


)

+




"\[LeftBracketingBar]"


h
k



"\[RightBracketingBar]"


2



,



k

K


,









[


tr

(



R
¯

n



Q
n


)

+




"\[LeftBracketingBar]"


h
n



"\[RightBracketingBar]"


2


]



p
n





ζσ
2

+

ζ





u
=

n
+
1



N
+
K




[


tr

(



R
¯

u



Q
u


)

+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2


]



p
u






,



n

N


,











k

K



[


tr

(



R
^

k



Q
k


)

+




"\[LeftBracketingBar]"



h
ˆ

k



"\[RightBracketingBar]"


2


]


+

σ
2





E
0



K
2






In the present disclosure, the non-convex rank-one constraint and the binary constraint in the foregoing problem can be equivalently transformed into the following form:








rank
(

Q
u

)

=


1






Q
u



*

-




Q
u



2



=
0


,



u

U


,









β
m
χ




{

0
,
1

}




β
m
χ

-


(

β
m
χ

)

2




=
0

,



χ


{

R
,
T

}



,



m

M






∥⋅∥* represents a kernel norm. ∥⋅∥2 represents a spectral norm.


Then, the present disclosure adds the foregoing two equations to the objective function of the transformed subproblem as penalty terms, to obtain the following problem:








min


{

Q
u

}

,

{

β
u

}





η
1





u


(





Q
u



*

-




Q
u



2


)



+


η
2





m




χ


(


β
m
χ

-


(

β
m
χ

)

2


)












s
.
t
.


Q
u


±
0

,



u

U


,








Diag



(

Q
u

)


=

β
u


,




ι

ι


U


,









β
m
R

+

β
m
T


=
1

,



m

M


,









tr

(



R
¯

1



Q
1


)

+




"\[LeftBracketingBar]"


h
1



"\[RightBracketingBar]"


2







tr

(



R
¯

N



Q
N


)

+




"\[LeftBracketingBar]"


h
N



"\[RightBracketingBar]"


2





tr

(



R
¯

k



Q
k


)

+




"\[LeftBracketingBar]"


h
k



"\[RightBracketingBar]"


2



,



k

K


,









[


tr

(



R
¯

n



Q
n


)

+




"\[LeftBracketingBar]"


h
n



"\[RightBracketingBar]"


2


]



p
n





ζσ
2

+

ζ





u
=

n
+
1



N
+
K




[


tr

(



R
¯

u



Q
u


)

+




"\[LeftBracketingBar]"


h
u



"\[RightBracketingBar]"


2


]



p
u






,



n

N


,











k

K



[


tr

(



R
^

k



Q
k


)

+




"\[LeftBracketingBar]"



h
ˆ

k



"\[RightBracketingBar]"


2


]


+

σ
2





E
0



K
2






η1 and η2 are two non-negative penalty factors that penalize the objective function if a rank of {Qu} is not 1 or {βmχ} is not binary. However, these penalty terms make the objective function of the foregoing problem non-convex. In the present disclosure, a suboptimal solution is obtained through continuous iterations by using an SCA method. Specifically, in the present disclosure, ∥Qu2 and (βmχ)2 are linearized by using fixed points Qu[custom-character] and βmχ[custom-character] through first-order Taylor expansion in an custom-characterth iteration, that is:












Q
u



*

-




Q
u



2








Q
u



*

-

{






Q
u

[

]



2

+

tr
[




q
¯

max

[

]




(



q
¯

max

[

]

)

H



(


Q
u

-


Q
u

[

]


)


]


}



,








β
m
χ

-


(

β
m
χ

)

2





β
m
χ

-

[



(


β
m
χ

[

]

)

2

+

2



β
m
χ

[

]



(


β
m
χ

-


β
m
χ

[

]


)



]






Then, the non-convex penalty terms in the custom-characterth iteration are replaced by solved convex upper bounds. A transformed problem is a convex SDP problem, which can be effectively solved through CVX. A penalty-based SCA algorithm proposed to solve the STAR-RIS configuration subproblem is composed of two loops: an inner loop used to iteratively solve an approximate SDP problem of the configuration subproblem and an outer loop used to continuously update the penalty factors and determine whether an iteration termination condition is met. A constraint violation is defined as follows:







o
¯

=

max


{






Q
u



*

-




Q
u



2


,


β
m
χ

-


(

β
m
χ

)

2



}






The algorithm is described in detail below.


Specifically, {Qu[0]}, {βu[0]}, the penalty factors η1 and η2, corresponding scale factors ñ1 and ñ2, and preset accuracy ò1 and ò2 are initialized first. An outer iteration index custom-character1=0 is set. The constraint violation ō[custom-character1] is computed. An inner iteration index custom-character2=0 is set. An objective function value Etot[custom-character2] of the approximate SDP problem of the STAR-RIS configuration subproblem is computed. custom-character2=custom-character2+1 is updated. The approximate SDP problem is solved to update Qu[custom-character2] and βu[custom-character2]. The objective function value Etot[custom-character2] is updated. The foregoing process is repeated until










"\[LeftBracketingBar]"




E

t

o

t


[


2

]

-


E
tot

[



2

-
1

]




"\[RightBracketingBar]"




E
tot

[




2
-



1

]




ò
2





or a quantity of inner iterations custom-character2≥L2. custom-character1=custom-character1+1, Qu[custom-character1]=Qu[custom-character2], βu[custom-character1]=βu[custom-character2], the constraint violation ō[custom-character1], η11η1, and η22η2 are updated. The foregoing process is repeated until custom-character1]≤1 or custom-character1≥L1.


The present disclosure proposes an alternating optimization algorithm to jointly optimize the user power allocation and the configuration of the STAR-RIS, to minimize the total uplink transmit power consumption.


Specifically, {pu[0]}, {Qu[0]}, {βu[0]}, and preset accuracy ò3 are initialized. A current iteration index custom-character3=0 is set. Given {Qu[custom-character3]} and {βu[custom-character3]}, {pu[custom-character3+1]} is computed by using a closed-form expression of optimal user power allocation. Given {pu[custom-character3+1]}, {Qu[custom-character3+1]} and {βu[custom-character3+1]} are updated through the penalty-based SCA algorithm. custom-character3=custom-character3+1 is updated. The foregoing process is repeated until a value of the objective function decreases to the preset accuracy or a preset maximum quantity L3 of iterations is reached.


Because the total transmit power decreases as a quantity of iterations increases in the alternating optimization algorithm, and there is a lower bound constraint, the proposed alternating optimization algorithm can ensure convergence. In each iteration, computational complexity of the algorithm mainly depends on the step of solving the approximate SDP problem, with complexity of O(LoLi(M2+M)3). L0=min {L1, log(1/ò1)} and Li=min{L2, log(1/ò2)} respectively represent quantities of outer and inner iterations required for the convergence of the penalty-based SCA algorithm.


Simulation results show that the method provided in the present disclosure can effectively reduce communication overheads and transmission delay in comparison with CL and can improve learning accuracy in comparison with FL.


The foregoing embodiments are used only to describe the technical solutions of the present disclosure, and are not intended to limit same. Although the present disclosure is described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still modify the technical solutions described in the foregoing embodiments, or make equivalent substitutions to some technical features therein. These modifications or substitutions do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present disclosure.

Claims
  • 1. A semi-federated learning (semiFL) method based on a next-generation multiple access (NGMA) technology, comprising the following steps: S1: reporting, by users, state information to a base station (BS), wherein the state information comprises instantaneous channel state information (CSI) and available central processing unit (CPU) frequency state information;S2: after receiving the state information, classifying, by the BS, the users into communication-centric centralized learning (CL) users and computing-centric FL users based on computing capabilities of local devices of the users, and broadcasting a classification result to all users after classification;S3: training, by each FL user, a local model through a local data set based on a global model w obtained in a previous round, and computing a local gradient gk; and preparing, by each CL user, a local data set Dn to be uploaded to the BS;S4: encoding the local data set of each CL user into a communication symbol {sn}, processing the gradient of the local model trained by each FL user into a computation symbol {sk}, and sending, by all users, the information bearing symbols of the users to the BS by using the NGMA technology in combination with a simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS);S5: receiving, by the BS, a superimposed signal from the CL users and the FL users, decoding the local data sets from the CL users to perform centralized training and obtain an average gradient, aggregating the gradients of the local models from the FL users, and aggregating a global model by using the obtained gradients;S6: after each round of communication is completed, broadcasting, by the BS, the latest global model w∈Q to all FL users for gradient computation in a next round; andS7: repeating the foregoing steps until convergence or a maximum quantity of rounds of communication is reached.
  • 2. The semiFL method based on the NGMA technology according to claim 1, wherein the NGMA technology in S4 provides services for all users in a same frequency band in a non-orthogonal manner such that all users are allowed to communicate in parallel on a same time-frequency resource.
  • 3. The semiFL method based on the NGMA technology according to claim 1, wherein the STAR-RIS deployed in S4 modifies an amplitude and a phase of an incident signal to reshape a wireless transmission environment and adjust channel gains of different users.
  • 4. The semiFL method based on the NGMA technology according to claim 1, wherein S5 comprises: S51: detecting, by the BS, communication output {sn} of each CL user through successive interference cancellation (SIC), decoding the communication output to generate training samples {Dn} for CL, and training a CL model through a gradient descent method to obtain the average gradient g∈Q of the CL users as follows:
  • 5. The semiFL method based on the NGMA technology according to claim 1, wherein before each round of communication, user transmit power allocation and a configuration of the STAR-RIS are jointly optimized with an objective of minimizing total transmit power consumption of the round, and an optimization problem and constraints are constructed as follows:
  • 6. The semiFL method based on the NGMA technology according to claim 5, wherein the optimization problem is decoupled into two subproblems, and alternating optimization is performed on the transmit power {pu} of the user and the configuration {Θu} of the STAR-RIS of the user.
  • 7. The semiFL method based on the NGMA technology according to claim 6, wherein during the alternating optimization, when {Θu} is fixed, for the {pu} subproblem, the constraints are rewritten by an uplink communication using rate expression
  • 8. The semiFL method based on the NGMA technology according to claim 6, wherein during the alternating optimization, when {pu} is fixed, the {Θu} subproblem is a feasibility check problem and is expressed as follows:
  • 9. The semiFL method based on the NGMA technology according to claim 8, wherein solving the convex SDP problem comprises: continuously updating penalty factors η1 and η2 of the penalty terms, and solving the SDP problem through an iterative method until the penalty terms satisfy a predefined maximum violation or a predefined maximum quantity of outer iterations is reached.
  • 10. The semiFL method based on the NGMA technology according to claim 6, wherein performing alternating optimization on the user power allocation subproblem and the STAR-RIS configuration subproblem comprises: initializing {pu[0]}, {Qu[0]}, {βu[0]}, and preset accuracy ò3; and setting a current iteration index 3=0, given {Qu[3]} and {βu[3]}, computing {pu[3+1]} by using a closed-form expression of optimal user power allocation, given {pu[3+1]}, updating {Qu[3+1]} and {βu[3+1]} through a penalty-based successive convex approximation (SCA) method, updating 3=3+1, and repeating the foregoing process until a value of an objective function decreases to the preset accuracy or a preset maximum quantity L3 of iterations is reached.
Priority Claims (1)
Number Date Country Kind
202310012159.4 Jan 2023 CN national