Today, Wi-Fi networks connect hundreds of millions of people worldwide. Wi-Fi is so ubiquitous that cellular operators are expected to offload 63% of their traffic to Wi-Fi by 2022. Attesting to the need for higher data rates, the IEEE is currently standardizing 802.11be (Wi-Fi 7), which will support throughput of up to 46 Gbps through wider signal bandwidths and the use of multi-user multiple-input and multiple-output (MU-MIMO) techniques. MU-MIMO will also become fundamental to decongesting the increasingly saturated unlicensed spectrum bands through spatial reuse. To correctly beamform transmissions, MU-MIMO requires access points (APs) to periodically collect channel state information (CSI) from each connected station. According to the IEEE 802.11 standard, the beamforming feedback (BF) is constructed by (i) measuring the CSI through pilot signals and (ii) computing the BF through singular value decomposition (SVD). The BF is then decomposed into Givens rotation (GR) angles that produce the beamforming matrix (BM).
Example embodiments include a wireless communications network. A transmitter may be configured to generate beamformed signals and omnidirectional signals. A receiver may be communicatively coupled to the transmitter via a wireless channel and may be configured to 1) generate channel state information (CSI) based on a received signal from the transmitter; 2) generate, via a first neural network (NN), a compressed representation of beamforming feedback as a function of the CSI; and 3) transmit the compressed representation to the transmitter via the wireless channel. The transmitter may then determine, via a second NN, a beamforming matrix as a function of the compressed representation, and generate a subsequent beamformed signal toward the receiver as a function of the beamforming matrix.
The first and second NNs may each operate a distinct subset of a common NN model, the common NN model being configured to output the beamforming matrix in response to an input comprising the CSI. The distinct subsets of the common NN model may include 1) a first subset configured to output the compressed representation in response to an input comprising the CSI, and 2) a second subset configured to output the beamforming matrix in response to an input comprising the compressed representation. The common NN model may include an intermediate layer between the distinct subsets, the intermediate layer outputting the compressed representation in response to an input comprising the CSI.
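As an illustrative sketch of this partitioning—assuming a PyTorch-style framework and hypothetical layer and bottleneck sizes not specified by the embodiments—a common NN model may be split into a head subset, run at the receiver and ending at the bottleneck layer, and a tail subset, run at the transmitter:

```python
import torch
import torch.nn as nn

# Hypothetical dimensions: flattened CSI input, beamforming-matrix output,
# and a bottleneck far smaller than either (here 256, i.e., 12.5%).
CSI_DIM, BM_DIM, BOTTLENECK_DIM = 2048, 2048, 256

# The common NN model: CSI in, beamforming matrix out. The intermediate
# bottleneck layer outputs the compressed representation.
common_model = nn.Sequential(
    nn.Linear(CSI_DIM, 1024), nn.ReLU(),
    nn.Linear(1024, BOTTLENECK_DIM),       # bottleneck (compressed repr.)
    nn.Linear(BOTTLENECK_DIM, 1024), nn.ReLU(),
    nn.Linear(1024, BM_DIM),
)

# Distinct subsets of the common model: the receiver runs the head
# (the first NN); the transmitter runs the tail (the second NN).
head = common_model[:3]    # up to and including the bottleneck layer
tail = common_model[3:]    # from the bottleneck output to the BM

csi = torch.randn(1, CSI_DIM)
compressed = head(csi)                  # fed back over the wireless channel
beamforming_matrix = tail(compressed)   # reconstructed at the transmitter
print(compressed.shape, beamforming_matrix.shape)
```

Because both subsets come from one common model trained end-to-end, the compressed representation the receiver transmits is exactly the intermediate-layer activation the tail expects as input.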
The transmitter may be further configured to update a beamforming configuration based on the beamforming matrix, and may generate the subsequent beamformed signal via an antenna array. The transmitter may generate the subsequent beamformed signal via a multiple-input and multiple-output (MIMO) process. The compressed representation may be less than 50% of the data size of the beamforming feedback. The received signal may be an omnidirectional signal or a beamformed signal.
Further embodiments include a method of wireless communication. At a receiver communicatively coupled to a transmitter via a wireless channel, channel state information (CSI) may be generated based on a received signal from the transmitter. Via a first neural network (NN), a compressed representation of beamforming feedback may be generated as a function of the CSI. The compressed representation may then be transmitted to the transmitter via the wireless channel. At the transmitter, a beamforming matrix may be determined, via a second NN, as a function of the compressed representation. A subsequent beamformed signal may then be generated toward the receiver as a function of the beamforming matrix.
Further embodiments include a network transmitter. A transceiver may be configured to generate a beamformed signal toward a receiver, and receive a compressed representation of beamforming feedback from the receiver, the compressed representation being a function of channel state information (CSI) determined by the receiver. A neural network (NN) may be configured to generate the beamforming matrix as a function of the compressed representation. The transceiver may be further configured to generate a subsequent beamformed signal toward the receiver as a function of the beamforming matrix.
The foregoing will be apparent from the following more particular description of example embodiments, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating embodiments.
A description of example embodiments follows. The teachings of all patents, published applications and references cited herein are incorporated by reference in their entirety.
A key challenge in MIMO systems is that the size of the beamforming feedback (BF) grows with the number of subcarriers and of transmitting and receiving antennas. For example, in an 8×8 network at 160 MHz of bandwidth, the BF in 802.11 will be of size (486 subcarriers × 56 angles/subcarrier × 16 bits/angle =) 435,456 bits ≃ 54.43 kB, if the maximum angle resolution is used. If BFs are sent back every 10 ms, the airtime overhead is 435,456/0.01 ≃ 43.55 Mbit/s. Moreover, the BF computation imposes a significant burden on the stations, which may become intolerable for low-power devices. Specifically, the complexities of SVD and GR are

O_SVD = O((4·N_t·N_r^2 + 22·N_t^3)·S) and O_GR = O(N_t^3·N_r^3·S),
wherein N_t, N_r, and S denote the numbers of transmitting antennas, receiving antennas, and subcarriers, respectively. Because Wi-Fi 7 will support more spatial streams (up to 16) and more bandwidth (up to 320 MHz), a thorough revision of how MIMO is performed in Wi-Fi is essential to keep this complexity under control.
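The feedback-size and airtime figures above can be reproduced with a short calculation; this sketch simply reuses the example's own parameters:

```python
# BF size and airtime overhead for the 8x8, 160 MHz example above.
subcarriers = 486
angles_per_subcarrier = 56
bits_per_angle = 16                    # maximum angle resolution

bf_bits = subcarriers * angles_per_subcarrier * bits_per_angle
print(f"BF size: {bf_bits} bits = {bf_bits / 8 / 1000:.2f} kB")   # ~54.43 kB

feedback_period_s = 0.010              # one BF every 10 ms
print(f"Airtime overhead: {bf_bits / feedback_period_s / 1e6:.2f} Mbit/s")  # ~43.55
```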
Existing approaches to reducing MIMO complexity come with excessive computation overhead and/or performance loss, and most are not compliant with the IEEE 802.11 standard. Example embodiments, described below, take a different approach, providing an IEEE 802.11 standard-compliant framework that leverages split computing to drastically decrease both the computational load and the BF size while maintaining adequate beamforming accuracy.
An advantage of example embodiments is that the complexity of the head model and the size of the BF representation can be adjusted by modifying the bottleneck placement and size. Indeed, the bottleneck can trade off computational load, feedback size, and beamforming accuracy, a trade-off not available in previous approaches. This is crucial for constrained Wi-Fi devices and for systems that will cater to heterogeneous devices with different processing capacities.
Example embodiments provide a novel framework for BF compression and station computation reduction in MU-MIMO Wi-Fi networks. A complexity analysis, described below, shows that example embodiments reduce the station computational load and the BF size by 92% and 91%, respectively, when compared to the standardized 802.11 algorithm.
A bottleneck optimization problem (BOP) is formulated below to determine the bottleneck placement and size with the goal of minimizing airtime and computation overhead, while ensuring that the bit error rate (BER) and end-to-end delay remain below the application's desired levels. Given the complexity of the BOP, a heuristic algorithm is introduced, and a customized training procedure for the resulting deep neural network (DNN) is described herein.
Example embodiments may leverage off-the-shelf Wi-Fi equipment to collect CSI data in two different propagation environments, and the performance of example embodiments compares favorably with the IEEE 802.11ac/ax CSI feedback algorithm (henceforth called 802.11) and the state-of-the-art DNN-based compression technique, LB-SciFi. Example embodiments have been demonstrated to reduce computational load and feedback size by up to 84% and 81%, respectively, with respect to 802.11. Also, at the same compression rate, the computational load is reduced by up to 89% compared to LB-SciFi.
Example embodiments may be synthesized in field-programmable gate array (FPGA) hardware by using a customized library to show the feasibility of such embodiments in real-world Wi-Fi systems. Our experimental results show that the maximum end-to-end latency incurred by example embodiments is less than 7 milliseconds (ms) in the case of 4×4 MIMO operating at 160 MHz and the lowest compression rate, which is well below the suggested threshold of 10 ms in MU-MIMO Wi-Fi systems.
Example embodiments are described below with the following notation for mathematical expressions. Boldface uppercase letters denote matrices. The superscripts T and † denote the transpose and the complex conjugate transpose (i.e., the Hermitian). The symbol ∠C denotes the matrix containing the phases of the complex-valued matrix C. Moreover, diag(c_1, . . . , c_j) indicates the diagonal matrix with elements (c_1, . . . , c_j) on the main diagonal. The (c_1, c_2) entry of matrix C is denoted by [C]_{c_1,c_2}, while I_c refers to an identity matrix of size c×c and I_{c×d} is a c×d generalized identity matrix. The notations R and C indicate the sets of real and complex numbers, respectively.
The received signal at station i may be written as

y_i = √ρ · H_i W_i x_i + √ρ · Σ_{j≠i} H_i W_j x_j + N_i,   (1)

where H_i is the channel matrix of station i, W_i and x_i are the beamforming matrix and transmitted symbol vector of station i, ρ denotes the signal-to-noise ratio (SNR) and is assumed equal for all users, and N_i is the complex additive white Gaussian noise (AWGN) for station i, distributed as CN(0, 1). To simplify notation, equation (1) is given in the frequency domain for a single subcarrier, and the subcarrier index s is omitted. We assume the number of transmit antennas equals the sum of all the used spatial streams:

N_t = Σ_i N_ss,i.
The first term in (1) is the desired signal and the second term is the inter-user interference, which can be eliminated through beamforming: ideally, H_i W_j = 0 when i ≠ j. The received signal then reduces to

y_i = √ρ · H_i W_i x_i + N_i.   (2)

The beamforming matrices W_i can be calculated using a multi-user channel sounding mechanism, as shown in the accompanying drawings.
First, the access point may begin the process by transmitting a null data packet (NDP) announcement frame, used to gain control of the channel and identify the stations. The access point follows the NDP announcement frame with an NDP for each spatial stream. Second, upon reception of the NDP, each station i analyzes the NDP training fields—for example, the VHT-LTF (Very High Throughput Long Training Field) in 802.11ac—and estimates the channel matrix H_i(s) for all subcarriers s, which is then decomposed by using SVD:

H_i(s) = U_i(s) S_i(s) Z_i(s)^†,
where U_i(s) ∈ C^{N_r,i × N_r,i} and Z_i(s) ∈ C^{N_t × N_t} are unitary matrices, while the singular values are collected in the N_r,i × N_t diagonal matrix S_i(s). With this notation, the complex-valued BM V_i(s) is defined by collecting the first N_ss,i columns of Z_i(s). To simplify the notation, the i subscript can be dropped in favor of a generic receiver. To reduce the channel overhead, V(s) is converted into polar coordinates as detailed in Algorithm 1, provided below. The output is the matrices D_{s,t} and G_{s,l,t}, defined as:

D_{s,t} = diag(I_{t−1}, e^{jφ_{t,t}(s)}, . . . , e^{jφ_{N_t−1,t}(s)}, 1),   (3)

G_{s,l,t} =
\begin{bmatrix}
I_{t−1} & 0 & 0 & 0 & 0 \\
0 & \cos ψ_{l,t}(s) & 0 & \sin ψ_{l,t}(s) & 0 \\
0 & 0 & I_{l−t−1} & 0 & 0 \\
0 & −\sin ψ_{l,t}(s) & 0 & \cos ψ_{l,t}(s) & 0 \\
0 & 0 & 0 & 0 & I_{N_t−l}
\end{bmatrix}.   (4)
The above equations (3) and (4) allow rewriting V(s) as:

V(s) = Ṽ(s) D̃(s), with Ṽ(s) = [ ∏_{t=1}^{min(N_ss, N_t−1)} ( D_{s,t} ∏_{l=t+1}^{N_t} G_{s,l,t}^T ) ] · I_{N_t × N_ss},   (5)

where D̃(s) is the diagonal matrix collecting the phases of the last row of V(s).
In the Ṽ(s) matrix, the last row (i.e., the feedback for the N_t-th transmitting antenna) consists of non-negative real numbers by construction. With this transformation, the station is only required to transmit the φ and ψ angles to the access point. Moreover, the beamforming performance is equivalent whether V(s) or Ṽ(s) is used; thus, D̃(s) is not fed back to the access point.
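For illustration, the following numpy sketch performs the access-point-side reconstruction of Ṽ(s) from fed-back φ and ψ angles according to equation (5). The angle ordering is one plausible convention and is not necessarily that of Algorithm 1:

```python
import numpy as np

def reconstruct_bm(phi, psi, Nt, Nss):
    """Rebuild V-tilde (Nt x Nss) from the fed-back phi/psi angle lists
    by accumulating the matrix product in equation (5)."""
    phi, psi = list(phi), list(psi)
    P = np.eye(Nt, dtype=complex)
    for t in range(min(Nss, Nt - 1)):
        # D_{s,t}: phases on rows t..Nt-2; the last diagonal entry stays 1.
        D = np.eye(Nt, dtype=complex)
        for r in range(t, Nt - 1):
            D[r, r] = np.exp(1j * phi.pop(0))
        P = P @ D
        # Givens rotations G_{s,l,t}^T for l = t+1, ..., Nt (0-indexed below).
        for l in range(t + 1, Nt):
            p = psi.pop(0)
            G = np.eye(Nt)
            G[t, t] = G[l, l] = np.cos(p)
            G[t, l], G[l, t] = np.sin(p), -np.sin(p)
            P = P @ G.T
    return P[:, :Nss]    # right-multiplication by I_{Nt x Nss}

# Example: a 4x2 BM is described by 5 phi and 5 psi angles.
Nt, Nss = 4, 2
n_angles = sum(Nt - 1 - t for t in range(min(Nss, Nt - 1)))
V_tilde = reconstruct_bm(np.random.uniform(0, 2 * np.pi, n_angles),
                         np.random.uniform(0, np.pi / 2, n_angles), Nt, Nss)
print(np.allclose(V_tilde.conj().T @ V_tilde, np.eye(Nss)))   # True
```

Because the rebuilt matrix is a product of unitary factors, its columns remain orthonormal, as the final check confirms.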
Third, the access point transmits a beamforming report poll (BRP) frame to retrieve the angles from each station. To further reduce the channel occupancy, the angles are quantized using b_φ ∈ {7, 9} bits for φ and b_ψ = b_φ − 2 bits for ψ. The quantized values—q_φ ∈ {0, . . . , 2^{b_φ} − 1} and q_ψ ∈ {0, . . . , 2^{b_ψ} − 1}—are packed into a compressed beamforming frame (CBF). Each CBF contains A angles for each of the S OFDM subcarriers, for a total of S·A angles. For comparison, without compression, a 16×16 system with 320 MHz channels requires 256 complex channel elements for each of the 996 subcarriers; at 8 bits for each real and imaginary component, as in the 802.11 standard, this results in roughly 510 kB.
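The quantization step and the size arithmetic may be sketched as follows. The uniform quantizer is illustrative only and may not reproduce the exact 802.11 codepoint mapping:

```python
import numpy as np

b_phi = 7                      # bits for phi, with b_phi in {7, 9}
b_psi = b_phi - 2              # bits for psi

def quantize(angle, bits, max_angle):
    """Map an angle in [0, max_angle) to q in {0, ..., 2**bits - 1}."""
    step = max_angle / (1 << bits)
    return int(np.clip(angle // step, 0, (1 << bits) - 1))

q_phi = quantize(3.9, b_phi, 2 * np.pi)   # phi lies in [0, 2*pi)
q_psi = quantize(0.7, b_psi, np.pi / 2)   # psi lies in [0, pi/2)
print(q_phi, q_psi)            # values packed into the CBF

# Size of the uncompressed 16x16, 320 MHz example: 256 complex elements
# per subcarrier, 996 subcarriers, 8 bits per real and imaginary part.
raw_bits = 996 * 256 * 2 * 8
print(f"{raw_bits / 8 / 1000:.0f} kB")    # ~510 kB
```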
The size of the beamforming feedback (BF) grows with the number of transmitting antennas, receiving antennas, and subcarriers, i.e., as O(N_t · N_r · S). This growth implies the following challenges: (i) an increasing airtime overhead for feeding the BF back to the access point, and (ii) an increasing computational load at the stations for computing the BF.
The placement and size of the bottleneck ultimately determine the head network architecture, and thus (i) the station computational load, (ii) the BF size, and (iii) the beamforming accuracy. Indeed, there is a trade-off between the complexity of the head model, the BF compression rate, and the accuracy of inference. While placing the bottleneck early, with a low number of nodes, reduces the station computational load and airtime overhead, it decreases beamforming accuracy, which ultimately increases the BER. Therefore, the bottleneck placement and size must be adjusted according to the application-specific requirements.
The original DNN can be modeled as a function M that maps the channel matrix H_i ∈ C^{N_r × N_t × S} to the BF V_i ∈ C^{N_r × N_t × S} as M(H; θ): C^H → C^V, through L layer transformations:

M(H; θ) = (f_θ^{(L)} ∘ f_θ^{(L−1)} ∘ · · · ∘ f_θ^{(1)})(H).
Let L_i^H(e, N) be the overhead of station i, which consists of three components: (i) the computational cost (i.e., the power consumption and memory required for executing the model), denoted by L_i^c(e, N); (ii) the execution time for BF compression through the head model, denoted by T_i^H(e, N); and (iii) the power consumption of transmitting the compressed BF to the access point, denoted by L_i^tx(e, N). Also, T_i^A(e, N) represents the compressed BF feedback airtime. Finally, T^T(e, N) denotes the time required for reconstructing the BF at the access point. Notice that the compression, decompression, and airtime overheads depend on the placement e and size N of the bottleneck. The BOP may be defined such that it minimizes the station computation overhead and feedback airtime as

minimize_{e, N}  Σ_i [ μ_i^H · L_i^H(e, N) + μ_i^A · T_i^A(e, N) ]   (7a)
subject to  e ∈ {1, . . . , L},  N ≥ 1,   (7b)
            BER_i ≤ γ  for each station i,   (7c)
            T_i^H(e, N) + T_i^A(e, N) + T^T(e, N) ≤ t  for each station i,   (7d)
where μ_i^H and μ_i^A parameterize the relative importance of reducing the station overhead versus the feedback airtime. In applications where stations are resource-constrained, it is crucial to reduce the stations' load, i.e., μ_i^H > μ_i^A. On the other hand, in dynamic propagation environments such as crowded rooms, where the channel coherence time is short, high feedback airtime cannot be tolerated; thus, reducing the feedback airtime must be prioritized, i.e., μ_i^H < μ_i^A. BER_i represents the bit error rate (BER) of station i. As described herein, the accuracy of the generated BF is measured at the access point in terms of the BER achievable by the stations; the BER is the number of erroneous bits divided by the total number of transferred bits. Condition (7c) guarantees that the BER experienced by each station does not exceed the maximum BER threshold γ. Condition (7d) indicates that the maximum end-to-end delay of the BF cannot exceed the maximum tolerable delay, denoted by t. In practice, these two conditions ensure that the bottleneck placement does not significantly impact the inference accuracy and latency. The maximum tolerable BER and delay can be specified according to the application requirements.
The BOP is a particular instance of the extremely complex neural architecture search (NAS) problem. Example embodiments implement a heuristic algorithm, specific to this context, to search for proper hyperparameters. Specifically, to limit the search space, candidate bottleneck placements and sizes may be enumerated and checked against the BER and delay constraints, as in the sketch following this paragraph.
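A minimal sketch of such a heuristic follows. The evaluator functions cost_head, airtime, ber, and delay are hypothetical placeholders—for example, profiling the head model and validating BER on held-out data—and are not defined by the embodiments:

```python
def solve_bop(placements, sizes, mu_h, mu_a, gamma, t_max,
              cost_head, airtime, ber, delay):
    """Grid search over bottleneck placement e and size N, minimizing
    the weighted objective (7a) subject to conditions (7c) and (7d)."""
    best, best_obj = None, float("inf")
    for e in placements:                  # candidate bottleneck layers
        for n in sizes:                   # candidate bottleneck widths
            if ber(e, n) > gamma or delay(e, n) > t_max:
                continue                  # violates (7c) or (7d)
            obj = mu_h * cost_head(e, n) + mu_a * airtime(e, n)
            if obj < best_obj:
                best, best_obj = (e, n), obj
    return best
```

Restricting the search to a small grid of placements and widths keeps the number of candidate architectures, and hence the training cost, manageable.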
Because H and V are complex matrices, their real and imaginary components can be decoupled so that the matrices are treated as double-sized real matrices. Each dataset can be split into training, validation, and test splits with an 8:1:1 ratio. Example embodiments may be trained offline for various network configurations and do not require retraining. The stations select the proper trained DNN according to the network configuration information acquired from the NDP preamble.
Loss Function: One goal is to deploy exactly the same model for each station without fine-tuning its parameters to each station's environment. The training process may be done offline (i.e., on a single computer). Given a channel matrix H_i, the DNN model M estimates the corresponding BF V_i, i.e., V_i = M(H_i, θ). We formulate the loss function L as follows:

L(θ) = (1/b) Σ_{j=1}^{b} || V_i^j − M(H_i^j; θ) ||_1,   (8)
where b indicates the training batch size and ||·||_1 represents the L1-norm. H_i^j and V_i^j indicate the j-th channel matrix and BF for station i, respectively. By minimizing the loss in equation (8), the parameters θ of our DNN model M can be optimized. Stochastic gradient descent (SGD) and Adam can be used to train on the synthetic and experimental datasets, respectively. Unless specified, the models in this example are trained for 40 epochs, using the training split of the dataset with a batch size of 16 and an initial learning rate of 10^−3. The learning rate is decreased by a factor of 10 after the 20th and 30th epochs. Using the validation split, the model can be assessed in terms of achieved BER at the end of every epoch, and the best parameters θ*—those achieving the lowest BER on the validation split—can be saved. The trained model is then assessed with the best parameters on the held-out test split, and the test BER is reported.
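The recipe above may be sketched in PyTorch as follows. Here evaluate_ber is a hypothetical helper that measures validation BER, the loaders are assumed to yield (channel matrix, ground-truth BF) pairs, and Adam is shown, per the experimental-dataset setting:

```python
import torch
import torch.nn as nn

def train(model, train_loader, val_loader, epochs=40):
    criterion = nn.L1Loss()    # mean L1 error, proportional to equation (8)
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    # Decay the learning rate by 10x after the 20th and 30th epochs.
    scheduler = torch.optim.lr_scheduler.MultiStepLR(
        optimizer, milestones=[20, 30], gamma=0.1)
    best_ber, best_state = float("inf"), None
    for _ in range(epochs):
        model.train()
        for H, V in train_loader:          # batch size 16 assumed in loader
            optimizer.zero_grad()
            loss = criterion(model(H), V)
            loss.backward()
            optimizer.step()
        scheduler.step()
        ber = evaluate_ber(model, val_loader)   # hypothetical BER evaluator
        if ber < best_ber:                 # keep the best parameters theta*
            best_ber, best_state = ber, model.state_dict()
    return best_state
```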
Difference with Autoencoders: Although an autoencoder (AE) is similar in terms of model architecture, its training objective is different. AEs are trained to reconstruct their input in an unsupervised manner (e.g., to estimate Ṽ_i given V_i). Conversely, a task-specific model may be trained in a supervised fashion to estimate the BF V_i given a channel matrix H_i.
Computational Overhead: The complexity of the SVD operation for decomposing the BF V in 802.11 is O((4·N_t·N_r^2 + 22·N_t^3)·S). The BF is further transformed into a set of angles using the Givens rotation (GR) matrix multiplications, which have a complexity of O(N_t^3 · N_r^3 · S). Conversely, the complexity of an example embodiment is O(K · N_t^2 · N_r^2 · S^2), where K < 1 denotes the head model's compression level.
Airtime Overhead: In 802.11, the size of the compressed BF report is

B_MR = 8 × N_t + N_a × S × (b_φ + b_ψ)/2,

where N_a denotes the number of Givens angles, and b_φ and b_ψ are the numbers of bits required for the angle quantization. Therefore, the 802.11 compression ratio can be written as

η_802.11 = B_MR / (b · N_t · N_r · S),

where b = 16 is the number of bits required for transmitting each complex channel element over each subcarrier. Conversely, the compression rate of example embodiments is K, which is constant and does not grow with the size of the channel matrix.
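For concreteness, the ratio can be evaluated for the earlier 8×8, 160 MHz example using the expressions above; b_φ = 9 is one of the two allowed choices:

```python
# 802.11 compression ratio for the 8x8, 160 MHz example.
Nt, Nr, S = 8, 8, 486
Na = 56                        # Givens angles per subcarrier
b_phi, b_psi, b = 9, 7, 16     # angle quantization bits; b bits per element

bmr_bits = 8 * Nt + Na * S * (b_phi + b_psi) / 2
raw_bits = b * Nt * Nr * S
print(f"802.11 compression ratio: {bmr_bits / raw_bits:.2%}")   # ~44%

# The embodiments' compression rate K, by contrast, stays constant as
# Nt, Nr, and S grow.
```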
While example embodiments have been particularly shown and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the embodiments encompassed by the appended claims.
This application claims the benefit of U.S. Provisional Application No. 63/362,524, filed on Apr. 5, 2022. The entire teachings of the above application are incorporated herein by reference.
This invention was made with government support under Grant Numbers 2120447 and 2134567 awarded by the National Science Foundation. The government has certain rights in the invention.
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/US2023/065300 | 4/4/2023 | WO | |

| Number | Date | Country |
|---|---|---|
| 63362524 | Apr 2022 | US |