Embodiments presented herein relate to a method, a server entity, a computer program, and a computer program product for performing an iterative learning process with agent entities. Embodiments presented herein further relate to a method, an agent entity, a computer program, and a computer program product for performing the iterative learning process with the server entity.
The increasing concerns for data privacy have motivated the consideration of collaborative machine learning systems with decentralized data where pieces of training data are stored and processed locally by edge user devices, such as user equipment. Federated learning (FL) is one non-limiting example of a decentralized learning topology, where multiple (possible very large number of) agents, for example implemented in user equipment, participate in training a shared global learning model by exchanging model updates with a centralized parameter server (PS), for example implemented in a network node.
FL is an iterative process where each global iteration, often referred to as communication round, is divided into three phases: In a first phase the PS broadcasts the current model parameter vector to all participating agents. In a second phase each of the agents performs one or several steps of a stochastic gradient descent (SGD) procedure on its own training data based on the current model parameter vector and obtains a model update. In a third phase the model updates from all agents are sent to the PS, which aggregates the received model updates and updates the parameter vector for the next iteration based on the model updates according to some aggregation rule. The first phase is then entered again but with the updated parameter vector as the current model parameter vector.
A common baseline scheme in FL is named Federated SGD, where in each local iteration, only one step of SGD is performed at each participating agent, and the model updates contain the gradient information. A natural extension is so-called Federated Averaging, where the model updates from the agents contain the updated parameter vector after performing their local iterations.
Analog modulation as used for the transmission of the model updates with over-the-air computation is susceptible to fading, interference, and other types of channel disturbances caused by transmission over a radio propagation channel.
This can be mitigated by using FL with dedicated agent-to-PS channels. Communication over dedicated agent-to-PS channels can be achieved by using digital modulation and coding. However, using dedicated agent-to-PS channels might cause communication latency and require the need for overhead signalling. Using dedicated agent-to-PS channels thus comes at a cost of an increased need for network resources and computational resources at both the PS and the agents. There could thus be scenarios in which using communication over dedicated agent-to-PS channels for the transmission of the model updates is unfeasible and should be avoided.
An object of embodiments herein is to address the above issues in order to enable efficient communication between the agents (hereinafter denoted agent entities) and the PS (hereinafter denoted server entity) in scenarios impacted by channel disturbances, without resorting using communication over dedicated agent-to-PS channels.
According to a first aspect there is presented method for performing an iterative learning process with agent entities. The method is performed by a server entity. The server entity communicating with the agent entities over a radio propagation channel. The method comprises selecting precoding vectors, one individual precoding vector for each of the agent entities. The precoding vectors are to be used by the agent entities when reporting computational results of a computational task to the server entity. The precoding vectors are selected as a function of uplink channel estimates of the radio propagation channel between antennas of all the agent entities to antennas of the server entity. The method comprises configuring the agent entities with the computational task and the precoding vectors. The method comprises performing the iterative learning process with the agent entities until a termination criterion is met.
According to a second aspect there is presented a server entity for performing an iterative learning process with agent entities. The server entity comprises processing circuitry. The processing circuitry is configured to cause the server entity to select precoding vectors, one individual precoding vector for each of the agent entities. The precoding vectors are to be used by the agent entities when reporting computational results of a computational task to the server entity. The precoding vectors are selected as a function of uplink channel estimates of the radio propagation channel between antennas of all the agent entities to antennas of the server entity. The processing circuitry is configured to cause the server entity to configure the agent entities with the computational task and the precoding vectors. The processing circuitry is configured to cause the server entity to perform the iterative learning process with the agent entities until a termination criterion is met.
According to a third aspect there is presented a server entity for performing an iterative learning process with agent entities. The server entity comprises a select module configured to select precoding vectors, one individual precoding vector for each of the agent entities. The precoding vectors are to be used by the agent entities when reporting computational results of a computational task to the server entity. The precoding vectors are selected as a function of uplink channel estimates of the radio propagation channel between antennas of all the agent entities to antennas of the server entity. The server entity comprises a configure module configured to configure the agent entities with the computational task and the precoding vectors. The server entity comprises a process module configured to perform the iterative learning process with the agent entities until a termination criterion is met.
According to a fourth aspect there is presented a computer program for performing an iterative learning process with agent entities, the computer program comprising computer program code which, when run on processing circuitry of a server entity, causes the server entity to perform a method according to the first aspect.
According to a fifth aspect there is presented a method for performing an iterative learning process with a server entity. The method is performed by an agent entity. The method comprises obtaining a precoding vector to be used by the agent entity when reporting computational results of a computational task to the server entity. The method comprises obtaining configuration in terms of the computational task from the server entity. The method comprises performing the iterative learning process with the server entity until a termination criterion is met. The agent entity as part of performing the iterative learning process applies the precoding vector to the computational results when sending the computational results to the server entity.
According to a sixth aspect there is presented an agent entity for performing an iterative learning process with a server entity. The agent entity comprises processing circuitry. The processing circuitry is configured to cause the agent entity to obtain a precoding vector to be used by the agent entity when reporting computational results of a computational task to the server entity. The processing circuitry is configured to cause the agent entity to obtain configuration in terms of the computational task from the server entity. The processing circuitry is configured to cause the agent entity to perform the iterative learning process with the server entity until a termination criterion is met. The agent entity as part of performing the iterative learning process applies the precoding vector to the computational results when sending the computational results to the server entity.
According to a seventh aspect there is presented an agent entity for performing an iterative learning process with a server entity. The agent entity comprises an obtain module configured to obtain a precoding vector to be used by the agent entity when reporting computational results of a computational task to the server entity. The agent entity comprises an obtain module configured to obtain configuration in terms of the computational task from the server entity. The agent entity comprises a process module configured to perform the iterative learning process with the server entity until a termination criterion is met. The agent entity as part of performing the iterative learning process applies the precoding vector to the computational results when sending the computational results to the server entity.
According to an eighth aspect there is presented a computer program for performing an iterative learning process with a server entity, the computer program comprising computer program code which, when run on processing circuitry of an agent entity, causes the agent entity to perform a method according to the fifth aspect.
According to a ninth aspect there is presented a computer program product comprising a computer program according to at least one of the fourth aspect and the eighth aspect and a computer readable storage medium on which the computer program is stored. The computer readable storage medium could be a non-transitory computer readable storage medium.
Advantageously, these aspects provide efficient communication between the server entity and the agent entities so that the server entity can obtain updates from all agent entities, even in situations of high channel disturbances.
Advantageously, these aspects provide robust communication between the server entity and the agent entities in the presence of high channel disturbances.
Advantageously, application of these aspects achieves faster convergence of the iterative learning process than in the absence of these aspects
Advantageously, these aspects provide improved accuracy and resilience to fading and noise in federated learning with over-the-air computation. This will lead to less signaling overhead when training an iterative learning model, both in terms of time-frequency overhead and number of iterations.
Advantageously, the use of precoding improves the link-budget, thus reducing the susceptibility to noise and out-of-cell interference, which translates directly into improved resilience and accuracy of the gradient updates.
Advantageously, the use of precoding reduces the risk of having an erroneous update rounds.
Other objectives, features and advantages of the enclosed embodiments will be apparent from the following detailed disclosure, from the attached dependent claims as well as from the drawings.
Generally, all terms used in the claims are to be interpreted according to their ordinary meaning in the technical field, unless explicitly defined otherwise herein. All references to “a/an/the element, apparatus, component, means, module, step, etc.” are to be interpreted openly as referring to at least one instance of the element, apparatus, component, means, module, step, etc., unless explicitly stated otherwise. The steps of any method disclosed herein do not have to be performed in the exact order disclosed, unless explicitly stated.
The inventive concept is now described, by way of example, with reference to the accompanying drawings, in which:
The inventive concept will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments of the inventive concept are shown. This inventive concept may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided by way of example so that this disclosure will be thorough and complete, and will fully convey the scope of the inventive concept to those skilled in the art. Like numbers refer to like elements throughout the description. Any step or feature illustrated by dashed lines should be regarded as optional.
The wording that a certain data item, piece of information, etc. is obtained by a first device should be construed as that data item or piece of information being retrieved, fetched, received, or otherwise made available to the first device. For example, the data item or piece of information might either be pushed to the first device from a second device or pulled by the first device from a second device. Further, in order for the first device to obtain the data item or piece of information, the first device might be configured to perform a series of operations, possible including interaction with the second device. Such operations, or interactions, might involve a message exchange comprising any of a request message for the data item or piece of information, a response message comprising the data item or piece of information, and an acknowledge message of the data item or piece of information. The request message might be omitted if the data item or piece of information is neither explicitly nor implicitly requested by the first device.
The wording that a certain data item, piece of information, etc. is provided by a first device to a second device should be construed as that data item or piece of information being sent or otherwise made available to the second device by the first device. For example, the data item or piece of information might either be pushed to the second device from the first device or pulled by the second device from the first device. Further, in order for the first device to provide the data item or piece of information to the second device, the first device and the second device might be configured to perform a series of operations in order to interact with each other. Such operations, or interaction, might involve a message exchange comprising any of a request message for the data item or piece of information, a response message comprising the data item or piece of information, and an acknowledge message of the data item or piece of information. The request message might be omitted if the data item or piece of information is neither explicitly nor implicitly requested by the second device.
The communication network 100 comprises a transmission and reception point 140 configured to provide network access to user equipment 170a, 170k, 170K in an (radio) access network 110 over a radio propagation channel 150. The access network 110 is operatively connected to a core network 120. The core network 120 is in turn operatively connected to a service network 130, such as the Internet. The user equipment 170a:170K is thereby, via the transmission and reception point 140, enabled to access services of, and exchange data with, the service network 130. Each user equipment 170a:170K and/or the transmission and reception point 140 is assumed to be equipped with at least two antennas, or antenna elements. In some examples, each of the user equipment 170a:170K as well as the transmission and reception point 140 are equipped with a plurality of antennas, or antenna elements.
Operation of the transmission and reception point 140 is controlled by a network node 160. The network node 160 might be part of, collocated with, or integrated with the transmission and reception point 140.
Examples of network nodes 160 are (radio) access network nodes, radio base stations, base transceiver stations, Node Bs (NBs), evolved Node Bs (eNBs), gNBs, access points, access nodes, and integrated access and backhaul nodes. Examples of user equipment 170a:170K are wireless devices, mobile stations, mobile phones, handsets, wireless local loop phones, smartphones, laptop computers, tablet computers, network equipped sensors, network equipped vehicles, and so-called Internet of Things devices.
It is assumed that the user equipment 170a:170K are to be utilized during an iterative learning process and that the user equipment 170a:170K as part of performing the iterative learning process are to report computational results to the network node 160. The network node 160 therefore comprises, is collocated with, or integrated with, a server entity 200. Each of the user equipment 170a:170K comprises, is collocated with, or integrated with, a respective agent entity 300a:300K.
Reference is next made to the signalling diagram of
The server entity 200 updates its estimate of the learning model (maintained as a global model θ in step S0), as defined by a parameter vector θ(i), by performing global iterations with an iteration time index i. The parameter vector θ(i) is assumed to be an N-dimensional vector. At each iteration i, the following steps are performed: Steps S1a, S1b: The server entity 200 broadcasts the current parameter vector of the learning model, θ(i), to the agent entities 300a, 300b.
Steps S2a, S2b: Each agent entity 300a, 300b performs a local optimization of the model by running T steps of a stochastic gradient descent update on θ(i), based on its local training data;
where ηk is a weight and fk is the objective function used at agent entity k (and which is based on its locally available training data).
Steps S3a, S3b: Each agent entity 300a, 300b transmits to the server entity 200 their model update θk (i);
where θk(i, 0) is the model that agent entity k received from the server entity 200. Steps S3a, S3b may be performed sequentially, in any order, or simultaneously.
Step S4: The server entity 200 updates its estimate of the parameter vector θ(i) by adding to it a linear combination (weighted sum) of the updates received from the agent entities 300a, 300b;
where wk are weights.
Assume now that there are K agent entities and hence K model updates. When the model updates {δ1, . . . , δK} (where the time index has been dropped for simplicity) from the agent entities 300a:300K over a wireless communication channel, there are specific benefits of using direct analog modulation. For analog modulation, the k:th agent entity could transmit the N components of δk directly over N resource elements (REs). Here an RE could be, for example: (i) one sample in time in a single-carrier system, or (ii) one subcarrier in one orthogonal frequency-division multiplexing (OFDM) symbol in a multicarrier system, or (iii) a particular spatial beam or a combination of a beam and a time/frequency resource.
One benefit of direct analog modulation is that the superposition nature of the wireless communication channel can be exploited to compute the aggregated update, δ1+δ2+ . . . +δK. More specifically, rather than sending δ1, . . . , δK to the server entity 200 on separate channels, the agent entities 300a:300K could send the model updates {δ1, . . . , δK} simultaneously, using N REs, through linear analog modulation. The server entity 200 could then exploit the wave superposition property of the wireless communication channel, namely that {δ1, . . . , δK} add up “in the air”. Neglecting noise and interference, the server entity 200 would thus receive the linear sum, δ1+δ2+ . . . +δK, as desired. That is, the server entity 200 ultimately is interested only in the aggregated model update δ1+δ2+ . . . +δK, but not in each individual parameter vector {δ1, . . . , δK}. This technique can thus be referred to as iterative learning with over-the-air computation.
The over-the-air computation assumes that appropriate power control is applied (such that all transmissions of {δk} are received at the server entity 200 with the same power), and that each transmitted δk is appropriately phase-rotated prior to transmission to pre-compensate for the phase rotation incurred by the channel from agent entity k to the server entity 200.
One benefit of the thus described over-the-air computation is the savings of radio resources. With two agents (K=2), 50% resources are saved compared to standard FL since the two agent entities can send their model updates simultaneously in the same RE. With K agent entities, only a fraction 1/K of the nominally required resources are needed.
As disclosed above, analog modulation as used for the transmission of the model updates with over-the-air computation is susceptible to channel disturbances. As further disclosed above, there could be scenarios in which using communication over dedicated agent-to-PS channels for the transmission of the model updates is unfeasible and should be avoided.
Further in this respect, due to fading and interference, the received aggregated vector δ1+δ2+ . . . +δK may be corrupted. An antenna array with a plurality of antennas, say M antennas, at the server entity 200 can be used to receive δ1+δ2+ . . . +δK. However, when the k:th agent entity 300k transmits, the phase rotations incurred by the radio propagation channel 150 from this agent entity 300b to each of the M antennas at the server entity 200 will be different. This means that there is no universal rotation e−jθ
As an introductory non-limiting and illustrative example, assume that there are K=3 agent entities 300a, 300b, 300c, that there are M=2 antennas at the server entity 200, and that there are L=2 antennas per agent entity. Assume further that channel coefficients are defined as in Table 1.
Assume that agent entity 300a is to communicate update δa, that agent entity 300b is to communicate update δb, and that agent entity 300c is to communicate update S. These three values thus represent one component of the corresponding gradient update vectors. In order for its data to properly reach the first antenna at server entity 200, agent entity 300a pre-processes δa with √{square root over (P)}a*/|a|2, where P is a parameter selected such that the transmit power constraint is satisfied for all agent entities 300a, 300b, 300c. Similarly, for δb to properly reach the first antenna at server entity 200, agent entity 300b phase rotates δb with √{square root over (P)}b*/|b|2. Similarly, for S, to properly reach the first antenna at server entity 200, agent entity 300c rotates δc by √{square root over (P)}c*/|c|2. In this way, up to an irrelevant scaling factor δa+δb+δc is received at the first antenna at server entity 200, as desired. However, up to an irrelevant scaling factor, δa−δb−δc is received at the second antenna at server entity 200, and this contribution does not contain any information about δa+δb+δc. Thus, the second antenna at the server entity 200 is not useful at all. In addition, it could be that there are significant imbalances in the path losses, say, |c|<<|a| and |c|<<|b|. In that case, agent entity 300c will dictate how large P can be, and hence eventually dictate the signal-to-noise ratio in the signal received at the first antenna at server entity 200.
The embodiments disclosed herein therefore relate to mechanisms for performing an iterative learning process with agent entities 300a:300K and performing an iterative learning process with a server entity 200. In order to obtain such mechanisms there is provided a server entity 200, a method performed by the server entity 200, a computer program product comprising code, for example in the form of a computer program, that when run on processing circuitry of the server entity 200, causes the server entity 200 to perform the method. In order to obtain such mechanisms there is further provided an agent entity 300k, a method performed by agent entity 300k, and a computer program product comprising code, for example in the form of a computer program, that when run on processing circuitry of agent entity 300k, causes agent entity 300k to perform the method.
Reference is now made to
S102: The server entity 200 selects precoding vectors. One individual precoding vector is selected for each of the agent entities 300a:300K. The precoding vectors are to be used by the agent entities 300a:300K when reporting computational results of a computational task to the server entity 200. The precoding vectors are selected as a function of uplink channel estimates of the radio propagation channel 150 between antennas of all the agent entities 300a:300K to antennas of the server entity 200. That is, the precoding vector for agent entity 300k is thus selected as function of the radio propagation channel 150 between the antennas of agent entity 300k to the antennas of the server entity 200. Hence, it might be assumed that each of the agent entities 300a:300K as well as the server entity 200 has access to a respective set of antennas, as in
S104: The server entity 200 configures the agent entities 300a:300K with the computational task and the precoding vectors.
S106: The server entity 200 performs the iterative learning process with the agent entities 300a:300K until a termination criterion is met.
As an introductory non-limiting and illustrative example, assume that there are K=3 agent entities 300a, 300b, 300c, that there are M=2 antennas at the server entity 200, and that there are L=2 antennas per agent entity. Assume further that agent entities 300a, 300b have good fading (path loss) coefficients to both antennas at the server entity 200, but that agent entity 300c has a good fading channel only to the second antenna at the server entity 200. The server entity 200 therefore selects precoding vectors for the agent entities 300a, 300b such that the respective data (δa and δb respectively) as transmitted by the agent entities 300a, 300b add up with a scaling of 0.5 at each of the antennas at the server entity 200. This can be accomplished by setting wa=Ga−1[0.5,0.5]T and wa=Ga−1[0.5,0.5]T. Furthermore, the server entity 200 selects precoding vectors for agent entity 300c such that agent entity 300c transmit its data δc with a spatial null into the direction towards the first antenna at the server entity 200 and a beam (with received amplitude 1) towards the second antenna at the server entity 200. This can be accomplished by setting wc=Gc−1[0,1]T. This way, at the first antenna the server entity 200 receives 0.5(δ1+δ2), and at the second antenna the server entity 200 receives 0.5(δ1+δ2)+δ3. Summing up the signals at both antennas, using a linear combining vector v=[1,1]T, yields δ1+δ2+δ3 (plus noise), as desired.
Embodiments relating to further details of performing an iterative learning process with agent entities 300a:300K as performed by the server entity 200 will now be disclosed.
The termination criterion in some non-limiting examples can be when a pre-determined number of iterations have been reached or when the computational results differ less than a threshold value from one iteration to the next.
There may be different ways for the server entity 200 to select the precoding vectors in step S102. Different embodiments relating thereto will now be described in turn.
In some embodiments, as part of performing the iterative learning process, the server entity 200 sums the computational results reported by all the agent entities 300a:300K per iteration into a sum, and the precoding vectors are determined with respect to an unbiasedness constraint of the sum of the computational results. For example, for two agent entities, the unbiasedness constraint can be formulated as E[δ2]=δ1+δ2. The precoding vectors might then be selected to minimize the variance var(δ2).
In general terms, the precoding vectors can be selected by solving an optimization problem subject to the server entity 200 receiving the computational results from the agent entities 300a:300K on all the antennas of the server entity 200.
For example, the selected based on power constraint at each agent entity 300k, such that ∥wk∥2 is less than some predetermined constant. In particular, in some embodiments, each of the precoding vectors is represented by a weighting coefficient vector wk with one weighting coefficient per antenna at agent entity k, and wherein ∥wk∥2<ξk, for all k∈{1, 2, . . . , K}, where ξk are power constraint values. These power constraint values may all be equal, or different, depending on the characteristics of the agent entities 300a:300K.
In some aspects, the selection of the precoding vectors is based on utilizing a linear combining vector v. In particular, in some embodiments, the precoding vectors are selected as a function of a linear combining vector v, where:
where z represents received noise, where Gk represents the uplink channel estimates and denotes the radio propagation channel from agent entity k to the server entity 200, and where δk denotes the computational result reported from agent entity k.
There could be different ways to select the linear combining vector v. In some embodiments, vTGkwk=1 for all E {1, 2, . . . , K}. In this respect, the linear combining vector v might be selected such that vTGkwk is equal to 1 within some pre-determined margin epsilon. In some embodiments, the linear combining vector v is determined by solving an optimization problem. For example, the optimization problem might be defined as:
As above, the equality might be within some pre-determined margin epsilon.
To find the combining vector v that results in an unbiased estimate, the constraints vTGkwk=1 should be satisfied for all k=1, . . . , K. Since IWO are to be selected in a later stage, consider them fixed for now. The matrix Gk is of dimension M×L. Hence {Gkwk} are M-vectors. Thus, there are K linear constraints, each of which confine v to an (M−1)-dimensional subspace. A solution is guaranteed to exist for any {wk} as long as M is at least as large as K. But for many choices of {wk}, a solution for v will exist even if M<K, see the examples below. If v is instead selected to minimize the mean-square error, then the problem of finding v is always tractable. For example, with Gaussian priors the mean-square error is a quadratic function of v, and thus finding v is an unconstrained optimization problem. For k=2, the optimization problem corresponds to minimizing var(δ2) with constraints on power with respect to w1 and w2, and with a constraint such that E[δ2]=δ1+δ2.
Different ways to solve the optimization problem will be disclosed next.
One way of solving the optimization problem is to search through a predetermined (fixed) set of combinations of beams (defined by the precoding vectors), that satisfy the power constraints. These combinations of beams may have been pre-obtained, for example, by laying out beams with angle-of-arrivals on a grid, for some assumed antenna topology. Hence, in some embodiments, the precoding vectors are selected from a predetermined set of precoding vectors. Alternatively, the precoding vectors are found by applying a clustering algorithm to a set of empirically obtained precoding vectors that have been found to be good for the locations that the agent entities 300a:300K are at. Alternatively, the precoding vectors may have been constructed by selecting beam vectors at random.
For each candidate beam vector combination, all wk are fixed and the problem of minimizing ∥v∥2 under the unbiasedness constraint is a linearly constrained quadratic problem, which can be solved by any method in the state-of-the-art, yielding the vector v along with the resulting variance. The server entity 200 Might then select the combination of precoding vectors (and the associated v), among the predetermined set of combinations, for which the resulting variance is the smallest.
Once a solution to the optimization problem has been found, in case the power constraints are not met with equality, {wk, v} can be renormalized as follows. First wk are scaled up with a factor that makes at least one power constraint satisfied with equality (and the other satisfied—with or without equality). Then, v is scaled down correspondingly so that the unbiasedness constraint is still satisfied.
In some embodiments, the optimization problem is solved by alternatingly and iteratively determining the weighting coefficient vectors wk and the linear combining vector v. Hence, alternating optimization can be used by cycling between the choice of {wk} and the choice of v. For example, the server entity 200 might start with an initial guess of {wk}, and then find the best v as disclosed above. Once the optimal v (for these wk has been found, then the so-obtained v is kept fixed, and the precoding vectors wk are adjusted in a way such that they still satisfy the unbiasedness constraint and at the same time the total sum of all power values ∥wk∥2 is minimized. Optionally, a random perturbation to wk may be added in this step as well. This may be accomplished by solving a linearly constrained quadratic optimization problem. Once the solution is found, {wk,v} can be re-normalized as described above. Next, the so-obtained wk are kept fixed, and ∥v∥2 is minimized under the unbiasedness constraint. This again amounts to solving a linearly constrained quadratic problem, which can be done by using any methods in state of the art. Next, wk are adjusted, and so forth.
Combinations of any of the above disclosed embodiments (e.g., using a pre-determined set of combinations for {wk} and using cyclic/alternating optimization) are possible. For example, a pre-determined set of combinations for {wk} can be used to find an initial solution for {wk} which is then used as the initial guess for cyclic/alternating optimization. A yet further approach is to use a neural network to return the optimal {wk,v} for given power constraints and given Gk. The network may be trained by using the methods described above to obtain the ground truth.
The precoding vectors might also be selected based on further information, such as information of the agent entities 300a:300K and/or of the user equipment 170a:170K in which the agent entities 300a:300K are provided. One non-limiting example of such information is beamforming capability information. Hence, in some embodiments, the precoding vectors are selected based on beamforming capability information received from the agent entities 300a:300K. The beamforming capability information for agent entity k specifies which precoding vectors that can be applied at agent entity k, where k−{1, 2, . . . , K}. Further in this respect, the server entity 200 might receive, from the agent entities 300a:300K, information associated to the device capabilities associated to training models as part of a device capability information message. The server entity 200 might from this information determine one or more models according to which the iterative learning process is to be performed. The device capability information message might be transmitted via radio resource control (RRC) signaling, for instance during an initial registration process of the user equipment 170a:170K with the network node 160 or of the agent entities 300a:300K with the server entity 200. The device capability information message could comprise information associated to the device capabilities for participation in the iterative learning process.
In some aspects the precoding vector comprises a beamforming direction vector (which could be represented as an index pointing to a codebook, or found as disclosed above). Hence, in some embodiments, each of the precoding vectors identifies a beamforming direction vector. Each component of the precoding vectors might have an amplitude component and a phase component. The amplitude component and the phase component are common for all elements per precoding vector but are individual per each of the agent entities 300a:300K.
Intermediate reference is next made to the flowchart of
S106a: The server entity 200 provides a parameter vector of the computational task to the agent entities 300a:300K.
S106b: The server entity 200 receives the computational results as a function of the parameter vector from the agent entities 300a:300K.
S106c: The server entity 200 updates the parameter vector as a function of an aggregate of the received computational results.
In some examples, the agent entities 300a:300K are configured to repeat the transmission of each component of {δk} using different precoding vectors. For each agent entities 300a:300K the server entity 200 might therefore determine a family of J precoding vectors {wki}, j=1, . . . , J, and have the k:th agent entity transmit each component of δkJ times, the j:th time using the precoding vector wkj. Hence, in some embodiments, the server entity 200 configures the agent entities 300a:300K to apply at least two different precoding vectors to each of the computational results reported to the server entity 200.
Further in this respect, the server entity 200 might determine to keep the selected precoding vectors for the agent entities 300a:300K until a condition is triggered. In this case, new uplink channel estimates need not to be acquired by the server entity 200 for each iteration round, but rather on a per need basis, as determined by one or more triggering conditions.
The triggering condition can be based on performance. The precoding vectors can be based on the performance of the model. The server entity 200 might, for example, test the model on a dataset located in the server entity 200, and check whether the current global model meets the prediction requirements (for example the CSI compression loss is within a certain threshold in the first below example, or the server entity 200 has proper understanding of the mapping between two carriers in the below second example). The server entity 200 might also check the model improvements after each iteration round, for example with respect to an increase or decrease in training or testing error. The performance can also be represented by performance feedback as reported by the agent entities 300a:300K. The performance feedback might, for example, be provided in terms of prediction accuracies for the local dataset for each agent entity 300a:300K. An agent entity 300k might, for example, indicate that the model is not improving the prediction performance on its local dataset. The server entity 200 might then configure the agent entity 300k to report in case the improvements are not within a certain threshold range over a number of training rounds.
The triggering condition can be based on channel state information. In case the server entity 200 has access to uncertain channel state information, for example due to operating in low signal to interference plus noise ratio (SINR) region for multiple agent entities 300a:300K, the server entity 200 can configure these agent entities 300a:300K to send more frequent uplink pilots based on which the uplink channel estimates of the radio propagation channel 150 can be obtained and thus the precoding vectors be selected. Further, the server entity 200 might, in case the number of agent entities 300a:300K is large, group the agent entities 300a:300K based on their variations in channel state information. In this way, agent entities 300a:300K with comparatively large channel variations might be instructed to send uplink pilots more frequent in time than agent entities 300a:300K with comparatively small channel variations.
In some aspects, the herein disclosed embodiments are applied only for a subset of all agent entities 300a:300K participating in the iterative learning process with the server entity 200. That is, some agent entities will transmit their model updates using unicast while other agent entities will participate in the iterative learning process using FL and with over-the-air computation.
Reference is now made to
S202: Agent entity 300k obtains a precoding vector to be used by agent entity 300k when reporting computational results of a computational task to the server entity 200.
S204: Agent entity 300k obtains configuration in terms of the computational task from the server entity 200.
S206: Agent entity 300k performs the iterative learning process with the server entity 200 until a termination criterion is met. Agent entity 300k as part of performing the iterative learning process applies the precoding vector to the computational results when sending the computational results to the server entity 200.
Embodiments relating to further details of performing an iterative learning process with a server entity 200 as performed by agent entity 300k will now be disclosed.
The termination criterion in some non-limiting examples can be when a pre-determined number of iterations have been reached or when the computational results differ less than a threshold value from one iteration to the next.
As disclosed above, in some embodiments, the precoding vector identifies a beamforming direction vector.
As disclosed above, in some embodiments, each component of the precoding vector has an amplitude component and a phase component, and the amplitude component and the phase component are common for all elements of the precoding vector but are individual for the agent entity 300k.
As disclosed above, in some embodiments, the precoding vector is selected from a predetermined set of precoding vectors.
As disclosed above, in some embodiments, agent entity 300k is configured by the server entity 200 to apply at least two different precoding vectors to each of the computational results reported to the server entity 200.
Intermediate reference is next made to the flowchart of
S206a: Agent entity 300k obtains a parameter vector of the computational problem from the server entity 200.
S206b: Agent entity 300k determines the computational result of the computational task as a function of the obtained parameter vector for the iteration and of data locally obtained by agent entity 300k.
S206c: Agent entity 300k reports the transformed computational result to the server entity 200. The precoding vector is applied to the computational results when the computational results are sent towards the server entity 200.
One particular embodiment of performing an iterative learning process with agent entities 300a:300K as performed by the server entity 200 and of performing an iterative learning process with a server entity 200 as performed by agent entity 300k based on at least some of the above disclosed embodiments will now be disclosed in detail with reference to the signalling diagram of
S300: The server entity 200 maintains a global model vector θ of dimension N.
S301a, S301b: The agent entities 300a, 300b transmit uplink pilots. The uplink pilots might be transmitted using e.g. sounding reference signals (SRS). Specifically, agent entity 300a transmits a first sequence of T L-dimensional vectors {ϕ11, . . . , ϕ1T} and agent entity 300b transmits a second sequence of T L-dimensional vectors {ϕ21, . . . , ϕ1T}, where the rows of the matrix [ϕk1, . . . , ϕkT] comprise the pilot sequences (of length T). In some examples, all pilot sequences are mutually orthogonal with the same norm, which results in good estimation accuracy. In this case, ΦkΦkH=αI for k=1, 2 and Φ1Φ2H=0, for some positive constant α. In another example, the pilot sequences are mutually orthogonal and have different norms, in which case ΦkΦkH are diagonal matrices. Then the antenna array at the server entity 200 receives T M-dimensional vectors {yp1, . . . , ypT}, where
where zpt is noise, and t∈1,2, . . . , T. For future use, define Yp=[yp1, . . . , ypT] and Φk=[ϕk1, . . . , ϕkT].
S302: Based on {yp1, . . . , ypT}, the server entity 200 estimates G1 and G2 using any suitable technique, for example based on least-square estimates or minimum mean squared error (MMSE) estimates. In some examples, where ϕ1t and ϕ2t are orthogonal with the same norm, the least-squares channel estimates are obtained by projecting ypt onto ϕ1t and ϕ2t, respectively: =YpΦkH/α. In some examples, of statistics of the noise (zpt) is known, for example that the noise contains in addition to thermal noise also interference (e.g., from other cells or out-of-system interference), this statistics can be incorporated to form, for instance, MMSE estimates. Based on the estimated G1 and G2, the server entity 200 determines two L-dimensional precoding vectors w1 (for agent entity 300a) and w2 (for agent entity 300b). The precoding vectors are determined as disclosed above.
S303a, S303b: The server entity 200 broadcasts the model θ to the agent entities 300a, 300b. This can be done using any suitable technique for wireless transmission. The server entity 200 further communicates w1 to agent entity 300a and w2 to agent entity 300b, possibly over a control channel. These precoding vectors are scaled such that they also incorporate power control and phase rotations for each agent entity 300a, 300b. In some examples, the server entity 200 includes the iterative learning parameters (for example number of iterations of gradient update and weights denoted by Ek, αk, for agent Ak, k∈{1, 2} respectively).
S304a, S304b: The agent entities 300a, 300b obtain, based on the broadcasted global model θ and locally available training data, gradient updates represented by a N-dimensional vectors. That is, agent entity 300a obtains 81 and agent entity 300b obtains 81 based on their local data and the global model θ.
S305a, S305b: Agent entity 300a and agent entity 300b simultaneously transmit 81 and 82, respectively, using precoding vectors w1 (for agent entity 300a) and w2 (for agent entity 300b). One component of the gradient update vectors is transmitted at a time, so the complete transmission takes N channel uses.
S306: The server entity 200, in the n:th channel use (corresponding to the n:th component of the gradient update vectors), receives the M-dimensional vector:
where δkn is the n:th component of δk (corresponding to agent entity k) and zn is noise. Based on yn, the server entity 200 now determines an estimate of δ1n+δ2n. This processing is identical for all components n∈1, . . . , N, so henceforth the component index n is dropped. For example, the server entity 200 might use a linear, unbiased estimator of δ1+δ2. This can be achieved by the server entity 200 first determining a linear combining vector v (of dimension M) and then computing
In some examples, v is selected such that E[δ2=δ1+δ2. The resulting estimate is unbiased, correct on average, where the average refers to the statistical average over the noise z. This is achieved if v satisfies:
In some examples, v is selected such that these relations hold, and in addition, to minimize the variance of the estimate, which equals var(δ2)=∥v∥2σ2, where σ2 is the variance of each component of z (assuming these noise components are uncorrelated). Such a linear combining vector v can be found in closed form by solving a linearly constrained quadratic optimization problem.
In some examples, v is selected to minimize the mean-square error, or some other Bayesian cost of the error,
S307: The server entity 200 updates the global model θ based on the estimate obtained in step S306.
The process is repeated from step S303a (or even S301a) if the termination criterion is not met.
Illustrative examples where the herein disclosed embodiments apply will now be disclosed.
According to a first example, the computational task pertains to prediction of best secondary carrier frequencies to be used by user equipment 170a:170K in which the agent entities 300a:300K are provided. The data locally obtained by agent entity 300k can then represent a measurement on a serving carrier of the user equipment 170k. In this respect, the best secondary carrier frequencies for user equipment 170a:170K can be predicted based on their measurement reports on the serving carrier. The secondary carrier frequencies as reported thus defines the computational result. In order to enable such a mechanism, the agent entities 300a:300K can be trained by the server entity 200, where each agent entity 300k takes as input the measurement reports on the serving carrier(s) (among possibly other available reports such as timing advance, etc.) and as outputs a prediction of whether the user equipment 170k in which agent entity 300k is provided has coverage or not in the secondary carrier frequency. The device capability information message for agent entity 300k could indicate the frequencies for which agent entity 300k has local training data. The device capability information message for agent entity 300k could indicate device measurement accuracies, for example the reference signal received power; RSRP measurement accuracies, for agent entity 300k.
According to a second example, the computational task pertains to compressing channel-state-information using an auto-encoder, where the server entity 200 implements a decoder of the auto-encoder, and where each of the agent entities 300a:300K implements a respective encoder of the auto-encoder. An autoencoder can be regarded as a type of neural network used to learn efficient data representations (denoted by code hereafter). Instead of transmitting raw Channel Impulse Response (CIR) values from the user equipment 170a:170K to the network node 160, the agent entities 300a:300K encodes the raw CIR values using the encoders and report the resulting code to the server entity 200. The code as reported thus defines the computational result. The server entity 200, upon reception of the code from the agent entities 300a:300K, reconstructs the CIR values using the decoder. Since the code can be sent with fewer information bits, this will result in significant signaling overhead reduction. The reconstruction accuracy can be further enhanced if as many independent agent entities 300a:300K as possible are utilized. This can be achieved by enabling each agent entity 300k to contribute to training a global model preserved at the server entity 200. The device capability information message for agent entity 300k could indicate the frequencies and corresponding bandwidth that agent entity 300k has used during its dataset logging.
According to a third example, the computational task pertains to signal quality drop prediction. The signal quality drop prediction is based on measurements on wireless links used by user equipment 170a:170K in which the agent entities 300a:300K are provided. In this respect, based on received data, in terms of computational results, in the reports, the server entity 200 can learn, for example, what sequence of signal quality measurements (e.g. RSRP) that results in a large signal quality drop. After a model is trained, for instance using the iterative learning process, the server entity 200 can provide the model to the agent entities 300a:300K. The model can be provided either to agent entities 300a:300K having taken part in the training, or to other agent entities 300a:300K. The agent entities 300a:300K can then apply the model to predict future signal quality values. This signal quality prediction can then be used in the context of any of: initiating inter-frequency handover, setting handover and/or reselection parameters, changing device scheduler priority so as to schedule the user equipment 170a:170K when the expected signal quality is good. The data for training such a model is located at the device-side where the agent entities 300a:300K reside, and hence an iterative learning process as disclosed herein can be used to efficiently learn the future signal quality prediction. The device capability information message for agent entity 300k could indicate the forecasted time, i.e. for how many (milli-)seconds in time the model predicts. The device capability information message for agent entity 300k could indicate the dataset information, for example the measured channel state information signal (CSI-RS), synchronization signal blocks (SSBs), etc. used in the predictions.
There are also further examples of computational tasks where the herein disclosed embodiments for performing an iterative learning process can be applied, such as distributed training of a language model using keyboard input from users, distributed training of an object recognition algorithm using camera streams from many devices.
Particularly, the processing circuitry 210 is configured to cause the server entity 200 to perform a set of operations, or steps, as disclosed above. For example, the storage medium 230 may store the set of operations, and the processing circuitry 210 may be configured to retrieve the set of operations from the storage medium 230 to cause the server entity 200 to perform the set of operations. The set of operations may be provided as a set of executable instructions. Thus the processing circuitry 210 is thereby arranged to execute methods as herein disclosed.
The storage medium 230 may also comprise persistent storage, which, for example, can be any single one or combination of magnetic memory, optical memory, solid state memory or even remotely mounted memory.
The server entity 200 may further comprise a communications interface 220 for communications with other entities, functions, nodes, and devices, either directly or indirectly. As such the communications interface 220 may comprise one or more transmitters and receivers, comprising analogue and digital components.
The processing circuitry 210 controls the general operation of the server entity 200 e.g. by sending data and control signals to the communications interface 220 and the storage medium 230, by receiving data and reports from the communications interface 220, and by retrieving data and instructions from the storage medium 230. Other components, as well as the related functionality, of the server entity 200 are omitted in order not to obscure the concepts presented herein.
The server entity 200 may be provided as a standalone device or as a part of at least one further device. For example, the server entity 200 may be provided in a node of the radio access network or in a node of the core network. Alternatively, functionality of the server entity 200 may be distributed between at least two devices, or nodes. These at least two nodes, or devices, may either be part of the same network part (such as the radio access network or the core network) or may be spread between at least two such network parts. In general terms, instructions that are required to be performed in real time may be performed in a device, or node, operatively closer to the cell than instructions that are not required to be performed in real time. Thus, a first portion of the instructions performed by the server entity 200 may be executed in a first device, and a second portion of the instructions performed by the server entity 200 may be executed in a second device; the herein disclosed embodiments are not limited to any particular number of devices on which the instructions performed by the server entity 200 may be executed. Hence, the methods according to the herein disclosed embodiments are suitable to be performed by a server entity 200 residing in a cloud computational environment. Therefore, although a single processing circuitry 210 is illustrated in
Particularly, the processing circuitry 310 is configured to cause agent entity 300k to perform a set of operations, or steps, as disclosed above. For example, the storage medium 330 may store the set of operations, and the processing circuitry 310 may be configured to retrieve the set of operations from the storage medium 330 to cause agent entity 300k to perform the set of operations. The set of operations may be provided as a set of executable instructions. Thus the processing circuitry 310 is thereby arranged to execute methods as herein disclosed.
The storage medium 330 may also comprise persistent storage, which, for example, can be any single one or combination of magnetic memory, optical memory, solid state memory or even remotely mounted memory.
Agent entity 300k may further comprise a communications interface 320 for communications with other entities, functions, nodes, and devices, either directly or indirectly. As such the communications interface 320 may comprise one or more transmitters and receivers, comprising analogue and digital components.
The processing circuitry 310 controls the general operation of agent entity 300k e.g. by sending data and control signals to the communications interface 320 and the storage medium 330, by receiving data and reports from the communications interface 320, and by retrieving data and instructions from the storage medium 330. Other components, as well as the related functionality, of agent entity 300k are omitted in order not to obscure the concepts presented herein.
In the example of
Telecommunication network 410 is itself connected to host computer 430, which may be embodied in the hardware and/or software of a standalone server, a cloud-implemented server, a distributed server or as processing resources in a server farm. Host computer 430 may be under the ownership or control of a service provider, or may be operated by the service provider or on behalf of the service provider. Connections 421 and 422 between telecommunication network 410 and host computer 430 may extend directly from core network 414 to host computer 430 or may go via an optional intermediate network 420. Intermediate network 420 may be one of, or a combination of more than one of, a public, private or hosted network; intermediate network 420, if any, may be a backbone network or the Internet; in particular, intermediate network 420 may comprise two or more sub-networks (not shown).
The communication system of
Communication system 500 further includes radio access network node 520 provided in a telecommunication system and comprising hardware 525 enabling it to communicate with host computer 510 and with UE 530. The radio access network node 520 corresponds to the network node 160 of
Communication system 500 further includes UE 530 already referred to. Its hardware 535 may include radio interface 537 configured to set up and maintain wireless connection 570 with a radio access network node serving a coverage area in which UE 530 is currently located. Hardware 535 of UE 530 further includes processing circuitry 538, which may comprise one or more programmable processors, application-specific integrated circuits, field programmable gate arrays or combinations of these (not shown) adapted to execute instructions. UE 530 further comprises software 531, which is stored in or accessible by UE 530 and executable by processing circuitry 538. Software 531 includes client application 532. Client application 532 may be operable to provide a service to a human or non-human user via UE 530, with the support of host computer 510. In host computer 510, an executing host application 512 may communicate with the executing client application 532 via OTT connection 550 terminating at UE 530 and host computer 510. In providing the service to the user, client application 532 may receive request data from host application 512 and provide user data in response to the request data. OTT connection 550 may transfer both the request data and the user data. Client application 532 may interact with the user to generate the user data that it provides.
It is noted that host computer 510, radio access network node 520 and UE 530 illustrated in
In
Wireless connection 570 between UE 530 and radio access network node 520 is in accordance with the teachings of the embodiments described throughout this disclosure. One or more of the various embodiments improve the performance of OTT services provided to UE 530 using OTT connection 550, in which wireless connection 570 forms the last segment. More precisely, the teachings of these embodiments may reduce interference, due to improved classification ability of airborne UEs which can generate significant interference.
A measurement procedure may be provided for the purpose of monitoring data rate, latency and other factors on which the one or more embodiments improve. There may further be an optional network functionality for reconfiguring OTT connection 550 between host computer 510 and UE 530, in response to variations in the measurement results. The measurement procedure and/or the network functionality for reconfiguring OTT connection 550 may be implemented in software 511 and hardware 515 of host computer 510 or in software 531 and hardware 535 of UE 530, or both. In embodiments, sensors (not shown) may be deployed in or in association with communication devices through which OTT connection 550 passes; the sensors may participate in the measurement procedure by supplying values of the monitored quantities exemplified above, or supplying values of other physical quantities from which software 511, 531 may compute or estimate the monitored quantities. The reconfiguring of OTT connection 550 may include message format, retransmission settings, preferred routing etc.; the reconfiguring need not affect network node 520, and it may be unknown or imperceptible to radio access network node 520. Such procedures and functionalities may be known and practiced in the art. In certain embodiments, measurements may involve proprietary UE signalling facilitating host computer's 510 measurements of throughput, propagation times, latency and the like. The measurements may be implemented in that software 511 and 531 causes messages to be transmitted, in particular empty or ‘dummy’ messages, using OTT connection 550 while it monitors propagation times, errors etc.
The inventive concept has mainly been described above with reference to a few embodiments. However, as is readily appreciated by a person skilled in the art, other embodiments than the ones disclosed above are equally possible within the scope of the inventive concept, as defined by the appended patent claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/083160 | 11/26/2021 | WO |