The present invention relates to a method of obtaining channel state information in a wireless communication network which includes an artificial wave transformer having electronically reconfigurable antennas, and more particularly to such a method in which machine learning is used to approximate a statistical model of the communication channel, based on which the channel state information is determined.
Millimeter-wave (mmWave) communication is one of the emerging technologies for 5G/6G communication systems and beyond to meet the high data rate and spectral efficiency requirements [2]. Although mmWave communications offer a significant gain in throughput thanks to the increased available bandwidth, they are more susceptible to blockages due to rapid signal attenuation and severe path loss. In this context, reconfigurable intelligent surfaces (RISs) have been proposed to mitigate the challenges in mmWave communication systems and also enable smart and reconfigurable wireless environments [3], [4]. An RIS is a two-dimensional (2D) array consisting of a large number of passive low-cost reflecting elements that redirect the impinging electromagnetic waves following a specific phase shift pattern to create a favorable environment for the propagation of the signals [5], [6]. By manipulating the signals' phases and amplitudes, the RIS can create constructive or destructive interference, amplify or attenuate the signals, and improve the communication link quality and coverage [7]. This technology has many potential benefits, including improving the signal-to-noise ratio (SNR), increasing coverage and capacity, reducing power consumption, and enhancing security and privacy [8]-[10]. In contrast to the non-regenerative relays (also called repeaters), the RIS operates efficiently in full-duplex without self-interference or noise amplification [11], [12]. As a passive structure, the RIS introduces no additional noise beyond the environmental thermal noise level, similar to other passive scattering objects in the system. This stands as a notable advantage over active repeaters [13].
To achieve the desired performance through passive and active beamforming, it is crucial to accurately estimate the channel state information (CSI) between the RIS and the transceivers [14], [15]. This is a challenging problem since (i) passive RISs are unable to transmit or receive training sequences, restricting the estimation to the pilot signals at the receiver, and (ii) the number of channel coefficients to estimate increases with the number of RIS elements, limiting the feasibility of CSI acquisition within a practical coherence time.
The existing literature may be categorized into two groups: cascaded channel estimation [16]-[22] and separate channel estimation [23]-[25].
Cascaded channel estimation focuses on estimating the channel between the user equipment (UE) and the base station (BS) through the RIS (UE-RIS-BS) from the training signal. For instance, a compressed sensing-based method, exploiting the sparse structure of the channels, was proposed for a single-user narrowband setup [16]. Additionally, a channel estimation scheme was developed for an RIS-aided multi-user broadband communication system by leveraging the shared channel between the RIS and BS (RIS-BS) among the users, which improves the training efficiency [17]. In mmWave communication, the channel has a low-rank structure and is modeled by using a small number of paths compared to the number of antennas at the transceivers where each path is distinguished by a direction of departure (DoD) and a direction of arrival (DoA). For the high dimensional RIS-BS and UE-RIS channels, a two-stage non-iterative downlink channel estimation framework can be adopted by first estimating the DoDs and DoAs for the RIS-BS and UE-RIS channels, respectively. Next, the cascaded channel UE-RIS-BS is directly estimated using the estimated DoDs and DoAs [18].
Several data-driven techniques have been proposed in RIS-aided systems and showed their effectiveness in the cascaded channel estimation problems [19]-[22]. For instance, a deep residual learning based approach was adopted to denoise the least square (LS) estimates by exploiting their spatial features with a conventional neural network (CNN) [19]. However, the LS estimator suffers from high training overhead due to the large number of channel coefficients to estimate. Addressing this shortcoming, previous work combines the super-resolution CNNs with deep denoising CNNs (DnCNNs) to estimate the cascaded channel and denoise the estimates in a MIMO OFDM communication system [20]. For semi-passive RIS where a small number of active elements are implemented in the RIS to receive the training sequence at the RIS from the transmitter, a hybrid method used compressed-sensing to estimate the cascaded channel coefficients from a low-resolution channel matrix and a DnCNN to further denoise and improve the estimation quality. Another line of work trained a neural network to compute the optimal locations of the active RIS elements, afterward the full channel matrix was extrapolated from the estimated channels of the selected active antennas using a CNN [22].
The knowledge of the cascaded channel enables the RIS configuration and optimal precoding. However, this approach has various drawbacks: (i) it is not suitable for user tracking due to the coupling of DoDs and DoAs at the RIS [26], [27], and (ii) it does not exploit the slow-varying feature of the RIS-BS channel to reduce the training overhead [3]. Acquiring separate channels, i.e., RIS-BS and UE-RIS channels, addresses these limitations as it decouples the cascaded channel and allows the identification of the channels' behavior in each part.
Separate channel estimation has granted attention in the existing literature. The decomposition of the cascaded UE-RIS-BS channel into two separate channels (i.e., UE-RIS and RIS-BS channels) has been studied in RIS-aided systems with fully-passive RIS elements. It was shown in [23] that the received signal follows the parallel factor tensor model which is used to develop an iterative alternating estimation scheme to obtain estimates of the UE-RIS and RIS-BS channels separately based on the Khatri-Rao factorization of the cascaded channel. However, the training overhead is still considerably high for a fully passive RIS. The use of semi-passive setup with active sensing elements at the RIS was proposed to estimate the RIS-BS channels as an initial step. Then, using the slow-varying property of the RIS-BS channel, only the UE-RIS channel is estimated in the training time of the subsequent coherence blocks [24]. In the same context of semi-passive RISs, a variational inference (VI)-based method was developed to reduce the training overhead and estimate the channels using only the uplink training signals [25].
The aforementioned works focused on estimating the I-CSI of either the cascaded channel or the separate channels. Estimating the I-CSIs is practical in scenarios involving static users, where the coherence time is sufficiently high. Although RIS phase-shifts optimization based on I-CSI achieves optimal performance in terms of achievable rate, it can be impractical in different scenarios such as high user mobility and large RISs. Indeed, I-CSI estimates and phase-shifts are updated in every coherence block, thereby leading to high training overhead and signaling complexity to control the RIS. Besides, the channel conditions in mmWave frequencies can change rapidly since the mmWave signals are more susceptible to blockages and attenuation [28]. Therefore, the coherence block of the channels is very limited for mobile users.
Statistical CSI (S-CSI) has recently emerged as an approach in addressing the active and passive beamforming in RIS-assisted wireless systems reducing the overhead of the channel estimation and extending the coverage for practical use [29], [30]. For example, The S-CSI was employed in a two-timescale beamforming design to reduce the training overhead and signal processing for acquiring the I-CSI with a specific transmission protocol [31]. The main idea relies on optimizing the phase-shifts based on the S-CSI while computing the downlink beamforming vectors based on the I-CSI of the effective channel between the UEs and the BS through the RIS (i.e., UE-RIS-BS channel including the phase-shifts optimized). A more sophisticated algorithm was proposed in [32] to cover a more general fading channel with discrete phase-shifts in both single-user and multi-user cases. In mmWave scenarios, the S-CSI was exploited for joint hybrid and passive precoder design using block-coordinate descent-based algorithms to maximize the ergodic capacity [33]. However, an approach of direct S-CSI estimation was not well studied in the literature for the RIS-aided systems. Typically, the S-CSI is characterized by the spatial channel covariance matrix (CCM) [34]. However, the estimation of the spatial CCM is challenging since its size increases as a function of the number of RIS elements. To address this problem, a CCM estimation method for the cascaded UE-RIS-BS channel was proposed in [35] by exploiting the low-rank and the semi-definite three-level Toeplitz structure of the covariance matrix. Table 1 summarizes several works in the area of I-CSI and S-CSI estimation in RIS-aided systems.
Considering the challenges discussed above about the estimation of the cascaded channel, it becomes desirable to solve the separate channel estimation problem in fully-passive RIS-aided network. As mentioned before, several works proposed methods to estimate the separate channels [24], [25]. However, these works suffer from high power consumption due to the semi-passive setup adopted.
According to an aspect of the invention there is provided a method of obtaining, within a wireless communication network, channel state information of a communication channel between a user device and an access point having plural antennas and configured to wirelessly communicate with the user device; wherein the wireless communication network further includes a wave transformer located at a geographically intermediate location between the access point and the user device and configured to reflect electromagnetic signals between the access point and the user device, wherein the wave transformer has a plurality of electronically reconfigurable antennas; wherein the wireless communication network includes a central server having a processor and a non-transitory memory operatively connected to the processor and storing instructions to be executed thereon, wherein the central server is communicatively connected to the access point and configured to control the wireless communication network, wherein the central server is free of data connection with the wave transformer; the method comprising:
forming a statistical model of the communication channel, wherein the statistical model of the communication channel comprises separate statistical models representative of constituent portions of the communication channel, wherein the constituent portions of the communication include a first portion between the access point and the wave transformer and a second portion between the wave transformer and the user device;
wherein forming the statistical model of the communication channel comprises:
receiving, at one of the access point and the user device, a signal transmitted from another one of the access point and the user device;
using respective machine learning algorithms configured to determine parameters of a type of tractable statistical distribution selected to represent both the first and second portions of the communication channel, processing the signal to determine the parameters of a first tractable statistical distribution of the selected type and representative of the first portion of the communication channel and the parameters of a second tractable statistical distribution of the selected type and representative of the second portion of the communication channel, so as to form parametrized first and second tractable statistical distributions respectively defining the separate statistical models of the first and second portions of the communication channel;
wherein, to determine parameters of a type of tractable statistical distribution selected to represent both the first and second portions of the communication channel, the respective machine learning algorithms are configured to solve an optimization problem to minimize an objective function thereof based on a lower bound of a log-likelihood function of the received signal and including (i) a first divergence term representative of a statistical distance between a prior statistical distribution representative of the first portion of the communication channel and the separate statistical model of the first portion of the communication channel, (ii) a second divergence term representative of a statistical distance between a prior statistical distribution representative of the second portion of the communication channel and the separate statistical model of the second portion of the communication channel and (iii) a likelihood term based on a difference between the received signal and a reconstructed signal formed by the separate statistical models of the first and second portions of the communication channel; and
after forming the statistical model of the communication channel, determining, using the statistical model, the channel state information of the communication channel.
This provides an arrangement in which a statistical model of the communication channel, which is intractable, is approximated as plural tractable statistical distributions, one for each constituent portion of the communication channel, using variational inference-based machine learning.
In the illustrated arrangement, receiving, at one of the access point and the user device, a signal transmitted from another one of the access point and the user device comprises receiving, at the access point, a signal transmitted from the user device.
In the illustrated arrangement, the respective machine learning algorithms comprise neural networks.
In the illustrated arrangement, the first and second divergence terms are both of a Kullback-Leibler type.
In the illustrated arrangement, the type of tractable statistical distribution selected to represent both the first and second portions of the communication channel is one of Gaussian and Laplace.
The invention will be described in conjunction with the accompanying additional drawings in which:
In the drawings like characters of reference indicate corresponding parts in the different figures.
Referring to the accompany figures, there is disclosed a fully passive reconfigurable intelligent surfaces (RIS) arrangement to separately estimate the user equipment (UE)-RIS and RIS-base station (BS) channels from the uplink training signals. From a Bayesian inference perspective, the main challenge is the acquisition of the posterior distribution of the separate channels because of the passive nature of the RIS. Therefore, a variational inference (VI)-based framework is used to provide an approximation of the intractable posterior distribution with convenient distributions. Diverging from conventional deterministic models, VI introduces a probabilistic paradigm that seamlessly integrates uncertainties allowing for the incorporation of prior information. It has been widely applied in channel estimation making use of the knowledge of the channels' prior [36]-[38]. A joint channel estimation (JCE) method, where the intractable posterior distribution of the UE-RIS and RIS-BS channels are approximated by auxiliary distributions that are convenient and tractable, is disclosed. The amortized VI framework where neural networks are used to map the training signals to the parameters of the auxiliary distributions is employed therein. These neural networks are trained through the minimization of the Evidence Lower Bound (ELBO) that represents the Kullback-Leibler (KL) divergence between the true posterior distribution of the channels and the auxiliary distributions. Then, using the predicted parameters, the maximum a posteriori (MAP) is used to estimate the channels.
Optimizing the phase-shifts according to the instantaneous CSI (I-CSI) can incur substantial signaling overhead at the RIS. This arises from the necessity to update the RIS configuration in each coherence block, particularly inconvenient when considering the rapid and dynamic changes of the UE-RIS channel. To reduce the signaling overhead, RIS-BS channel and UE-RIS channel covariance matrix (CCM) are used for passive beamforming, as they are slow-varying compared to the dynamic UE-RIS channel. Therefore, additionally is disclosed a joint channel-covariance estimation (JCCE) method that extends the use of the VI-based framework to directly estimate the RIS-BS channel and UE-RIS CCM from the received training signal at BS. This uses the VI-based framework to effectively approximate the posterior distributions of the RIS-BS channel and the UE-RIS CCM. Like the methodology applied in the joint channel estimation method, the auxiliary distributions, whose parameters are predicted by the neural networks, are leveraged to obtain the MAP estimates. Considering the large size of the UE-RIS CCM resulting from the large number of elements at the RIS, the inherent low-rank structure of the covariance of the mmWave channels is exploited. Differing from traditional methods, our approach directly estimates the UE-RIS CCM from the training signals, eliminating multiple intermediary channel estimation steps before the CCM computation. Also, unlike prior art in which the covariance matrix of the cascaded channel was estimated [35], the novel methodology estimates the RIS-BS channel and UE-RIS CCM separately. Also, the phase-shifts are derived in closed form, that maximize the capacity based on the RIS-BS channel and the UE-RIS CCM.
The novel methods are flexible and take into account the sparsity of mmWave channels as they do not use foreknowledge of the number of paths prior to the estimation process. The proposed solutions can also be extended to other types of channels. To summarize, teachings of the disclosure are as follows:
The list of symbols that used hereinafter is given in Table 2. Scalars, vectors and matrices are denoted by x, x, and X, respectively. X* and XH denote the complex conjugate and conjugate transpose of X. The i-th element of a vector x is xi, while the (i,j)-th element of a matrix X is Xi,j. The n x n identity matrix is written as In. The diag(x) is the diagonal matrix with the elements of the vector a on the main diagonal. The element-wise product of X and Y is written as X∘Y, while the Khatri-Rao product between X and Y is written as X⊙Y. X⊗Y denotes the kronecker product between X and Y. Tr(X) and |X| represent the trace and determinant of the matrix X, respectively, and |x| represents the absolute value of a complex number x. The complex Gaussian random vector is denoted as x˜ CN(m, Σ) with mean m and covariance matrix Σ, whereas a complex Laplace random variable x is denoted as x˜(m, b) with mean m, scale b and probability density function (PDF) given by:
A Gamma distributed random variable with unit scale is denoted as x˜Gamma(k) with shape k, while an Exponentially distributed random variable with rate α is denoted by x˜Exp(α).
An RIS-assisted single-user communication system has M antennas at the BS, N passive reflecting elements at the RIS and a single-antenna user, as illustrated in
where ρ, x∈, and w∈M are, respectively, the SNR, the transmitted signal, and the additive white noise, i.e., w˜(0,IM). The phase shifts contributed by the RIS are represented by the diagonal matrix diag(v), where v=[ejθ
To optimize the phase-shifts based on the I-CSI realizations, the VI technique is used to jointly estimate the channels between the UE and the RIS h, as well as between the RIS and the BS G, relying on the pilot transmissions in the uplink and the sparse structure of the channels.
Although the use of the instantaneous channels may lead to optimal phase shift configuration, it is a challenging task in practice. First, the coherence time of the mmWave channels can be drastically shorter than that in sub-6 GHz channels [28], in particular for high mobile users. Hence, the channel estimation and phase optimization are performed repeatedly after every Th<<TG, which will entail a significant amount of training overhead and tremendous computational resources accompanied by spectral inefficiency due to the pilots sent in each coherence block. Furthermore, the system optimization based on the I-CSI requires frequent transmissions of control signals from the BS to the RIS, which involves a considerable amount of signaling overhead. Therefore, to mitigate the overhead due to the traditional channel estimation approaches, one promising direction for the RIS phase-shift design is to use only the S-CSI of the UE-RIS channel that is considered wide-sense stationary with an invariant covariance Rh=[hhH], and the RIS-BS channel which remains quasi-static given the static positions of the RIS and the BS. Therefore, no frequent updates will be required, thus reducing the signaling overhead and enhancing the efficiency of the RIS-aided communication system.
Hereinafter, the VI-based approach is described further, used to solve the joint RIS-BS and UE-RIS channel estimation, and the joint RIS-BS channel and UE-RIS CCM estimation problems in an RIS-aided mmWave wireless communication system.
The variational methods are a class of systematic approaches that approximate complex and intractable probability distributions with convenient tractable ones. VI is a specific case of variational methods that infers the marginal distributions or likelihood functions of hidden variables in a statistical model [40], [41]. For instance, for a communication model with two unknown inputs denoted z1 and z2 (e.g., RIS-BS and UE-RIS channels) and an observed output Y, the output is assumed to be obtained following a certain probability p(Y|z1, z2). If the goal is to infer {z1, z2} based on the evidence Y, there is interest in deriving the probability p(z1, z2|Y). When the direct evaluation of the posterior distribution p(z1, z2|Y) is infeasible, VI permits approximation of the posterior p(z1, z2|Y) with a parameterized tractable distribution qλ(z1, z2|Y).
The central concept in VI is the Evidence Lower Bound (ELBO), also known as the variational lower bound. It serves as a surrogate for the intractable log-likelihood of the data, and maximizing it corresponds to minimizing the Kullback-Leibler (KL) divergence between the true posterior p(z1, z2|Y) and the variational approximation qλ(z1, z2|Y). The ELBO is given by [42]:
Assuming that qλ(z1, z2|Y) belongs to a family of tractable distributions, the VI approach optimizes the parameters λ of the approximated distribution qλ(z1, z2|Y) such that the objective function (Y; λ) is minimized.
Furthermore, it is assumed that the approximated distribution can be factorized as qλ(z1, z2|Y)=qλ
Note that 1 and 2 in Eq. (4) represent the KL divergence between the auxiliary distributions, also known as variational distributions, qλ
After deriving the ELBO, one common approach is to use neural networks to parameterize the approximate posterior distribution [43]. In this approach, a neural network is used to map the observed data to the parameters of the auxiliary distribution, such as the mean and the scale parameters of a complex Laplace distribution. The neural network is typically trained using stochastic gradient descent or a related optimization algorithm to minimize the KL divergence between the auxiliary distribution and the true posterior distribution, as represented by the ELBO.
Therefore, the parameters of the two auxiliary distributions qλ
referred to by Encoder and Encoder as shown in
The channel information of the RIS-BS and the UE-RIS links in an RIS-aided mmWave wireless communication system with fully-passive elements are separately estimated, using the uplink training signals. First, the I-CSI for RIS-BS and UE-RIS links are separately estimated. Second, the RIS-BS channel and the UE-RIS CCM are separately estimated.
The RIS-BS channel G and the UE-RIS channel h are estimated based on the received training signal. The training signal is obtained by sending Np pilot signals by the user to the BS through the UE-RIS-BS channel. For different transmissions, different configurations of the RIS are maintained for each pilot signal, denoted by vl. The received training signals are given by:
where Y=y1, . . . , yN
In mmWave communication and due to the large number of elements in the RIS and the high path loss, the channels are sparse in the angular domain [3]. Specifically, only a small number of paths contribute to the received signal, and the other paths are negligible. The channels in the angular can be obtained by applying the Discrete Fourier Transform (DFT) as follows:
where FN and FM are the DFT matrices of size N×N and M×M, respectively. Gvir and hvir are the channels in the angular domain where the elements are independent and identically distributed and distributed according to a complex Laplace distribution with zero mean and scales αG
for any DFT matrix of size N×N, the received training signal for the l-th time slot is expressed as follows:
By applying the VI framework, the intractable true posterior distribution p(hvir, Gvir|Y) is approximated by a tractable parameterized distribution denoted qλ(hvir, Gvir|Y) that minimizes the ELBO function. Assuming a low-correlation between the channels hvir and Gvir conditioned on the training signal Y, by using the mean-field approximation, the auxiliary distribution is factorized as qλ(hvir, Gvir|Y)=qλ
The auxiliary distributions is assumed to follow complex Laplace distributions with independent elements:
where λ1={m, b} and λ2={M, B} are the parameters of the auxiliary distributions where the optimal values minimize the ELBO function expressed as follows:
The first loss 1I-CSI is the KL-divergence between the auxiliary distribution and the prior of hvir, which can be expressed as follows:
where H(qλ
The proof can be found in the section entitled ‘Supplementary Material’. he cross-entropy between two complex Laplace distributions can be obtained by using the Monte-Carlo method to approximate the expectation over hvir. Therefore, it is given by:
where the d-th sample is computed as =mi+bi×(0,1). Hence, 1I-CSI is expressed as:
Similarly, 2I-CSI is derived as:
where the Monte-Carlo samples are computed as =Mi,j+Bi,j×(0,1). The third loss consists of the expectation over the auxiliary distributions of the log-likelihood of the received training signal. It can be derived in closed-form as in Eq. (13), where C1 is a constant, Q and ∧ are the covariance matrix over the columns of Gvir and covariance matrix of hvir, respectively, which are diagonal matrices due to the independence of the elements according to the auxiliary distributions. The main diagonal elements are as follows (see the proof in the section entitled ‘Supplementary Material’):
The parameters m, b, M and B of the auxiliary distributions are obtained using the variational neural networks, as shown in Eq. (5). Specifically, Encoder is used to characterize qλ
In the s-th UE-RIS coherence block, by sending NP pilot signals while altering the configuration for each pilot, the received signal at the BS can be expressed as:
where hs is the UE-RIS channel during the s-th coherence block, Φ[v1, . . . , vN
where w=vec(W)˜(0,IMN
In various scenarios, the UE-RIS channel is highly correlated because of the small set of angles of arrivals (AoAs) contributing to the propagation [39]. Therefore, the covariance matrix Rh=[hhH] is considered as a low-rank matrix. Formally, the covariance matrix is expressed as follows:
R
h
=F
N
H
DF
N, (23)
where D=diag(d) is a diagonal matrix with a sparse main diagonal denoted as d. The focus is on estimating the sparse vector d, rather than estimating the full covariance matrix Rh which is typically a large matrix of size N×N, and the RIS-BS channel in the angular domain denoted as Gvir=FMGFN.
Applying the VI framework, the intractable true posterior distribution p(Gvir, d|{tilde over (Y)}) is approximated by two separate tractable parameterized distributions denoted by qλ
The auxiliary distribution for the RIS-BS channel in the angular domain Gvir is assumed to follow the complex Laplace distribution with independent elements, and the elements of d follow a Gamma distribution with unit scale:
where λ1={k} and λ2={M, B} are the parameters of the auxiliary distributions which are obtained by minimizing the ELBO function, which is given in general form in Eq. (4) expressed as follows:
Since the prior and the auxiliary posterior of Gvir align with the case addressed in the joint channel estimation, the second loss, expressed as 2S-CSI 2I-CSI, remains unchanged. However, the first loss, which involves the KL-divergence between an Exponential distribution and a Gamma distribution, can be expressed as follows:
where Γ(x) is the gamma function and ψ(x) is the digamma function. The third loss, denoted as 3S-CSI, is defined as the log-likelihood of the received training signal and can be expressed as follows:
where C2 is a constant. To compute the gradient with respect to the parameters of the auxiliary distribution of the RIS-BS channel link, qλ
After training the neural networks, denoted as Encoder and Encoder , that predict the distribution parameters k and {M,B} of qλ
The primary evaluation metric is the capacity of the RIS-assisted network obtained after deriving the phase-shifts based on the estimated quantities. Therefore, closed-form expressions are derived of the phase-shifts of the RIS that maximize the capacities for the two cases of channel information considered: the RIS-BS and the UE-RIS channels, and the RIS-BS channel and the UE-RIS CCM.
For the considered uplink RIS-assisted mmWave system, the received signal at the BS can be expressed as follows:
where x is the transmitted symbol satisfying (|x|2)=1, ρ is the SNR, and w˜(0, IM) denotes the additive white noise. The ergodic capacity is expressed by:
Based on the I-CSI, i.e., h and G, the phase-shifts are configured to maximize the capacity C, which is equivalent to solving the following problem:
Given the singular value decomposition (SVD) of G=USVH, the problem is equivalent
where r is the rank of G and si are the singular values in the descending order of G. A solution is to align the phase-shifts θk to the phases of the largest right singular vector of G, denoted as ϑmax, and the phases of the channel vector h. Specifically, the suboptimal phase-shifts are obtained as follows:
A closed-form expression of the phase-shifts that maximize the achievable rate of the UE-RIS-BS link based on the I-CSI of RIS-BS channel and the S-CSI (i.e., channel covariance matrix) of the UE-RIS channel, is disclosed. The problem is formulated as follows:
The problem in Eq. (37) is challenging to solve due to the lack of an explicit expression for the expectation over the logarithm. A strategy of maximizing a reliable upper bound on this expression [32] is adopted to address this difficulty:
The upper bound in Eq. (38) is highly accurate and serves as a reliable approximation of the original objective function, particularly for large values of ρ [32]. To maximize this upper bound, the subsequent optimization problem is formulated as follows:
The objective can be further expressed as follows:
Given the SVD of G=USVH and the eigenvalue decomposition of the covariance matrix Rh=PΣPH, the objective function can be expressed as follows:
where r′ is the rank of Rh and σj are the eigenvalues in the descending order. Therefore, The phases that align with the phases of the largest eigenvector of G and Rh, referred to as ϑmax and pmax, respectively, are taken to maximize the objective function and satisfy the unit modulus constraints, which are given by:
The performance of the two proposed CSI estimation methods was evaluated in RIS-aided SIMO mmWave wireless communication systems. The first approach to estimate the I-CSI is referred to as joint channel estimation (JCE) and the second method is referred to as joint channel and covariance Estimation (JCCE). In the example for evaluating performance, the setup of M=4 antennas at the BS and N=64 passive elements at the RIS is considered.
One evaluation metric is the capacity of the RIS-aided SIMO communication system. The estimated quantities, specifically the UE-RIS and RIS-BS channels for JCE, and the UE-RIS CCM and the estimated instantaneous RIS-BS channel gains for JCCE are leveraged to calculate the phase-shifts and determine the achieved capacity defined as C=log2(1+ρ∥G diag(v) h∥2). Moreover, the normalized mean square error (NMSE) defined by NMSE=∥{circumflex over (X)}−X∥2/∥X∥22 is evaluated, where Frobenius norm is used for matrices and l2 norm is used for vectors.
The methodologies are compared against the following baselines:
Following a hyperparameters tuning process, the performance of the VI-based neural networks has been significantly enhanced. The hyperparameters tuning was conducted using the Bayesian method [47] which includes optimization in the search for the optimal hyperparameters. The hyperparameters tuned consist of the architecture of the neural networks, the use of dropout layers and the learning rate. The architecture adopted for the JCE method features fully connected neural networks for both Encoder and Encoder . They consist of an input layer, two 300-unit hidden layers with Relu activation combined with a dropout layer and a batch normalization layer, and an output layer with two heads: the first outputs the mean after a Tanh activation and the second uses Softmax activation for scale. Conversely, for the JCCE method, the architecture of the encoders is maintained, and the output layer of Encoder that models the auxiliary distribution qλ
A mmWave channel model is adopted, as follows [46]:
where αp, θp, and ϕp/φp denote the complex gain, AoA, and azimuth/elevation of angle of departure (AoD) of the p-th path of RIS-BS channel. Similarly, βq and ϕq/φq denote the complex gain and azimuth/elevation AoA of the q-th path of the UE-RIS channel, respectively. Besides, aBS and aRIS denote the receive and transmit array response vectors at the BS and the RIS, respectively. Then, the array response vector of the half-wavelength spaced uniform linear array at the BS is given by:
In addition, the array response vector of the planar array at the RIS involving N elements is given by:
Two channel generation modes are used to train and test the novel methods:
The performance of the proposed JCE method was evaluated using mmWave channels generated according to Mode 1. To estimate the UE-RIS and the RIS-BS channels, Np=50 pilot symbols are sent over an uplink SIMO RIS-assisted mmWave communication system with number of paths Q=1 and P=3 for the UE-RIS and RIS-BS channels, respectively, and the training signals which are fed to the trained neural networks Encoder and Encoder are obtained.
Next, the estimation error of both channels UE-RIS and RIS-BS was investigated. As depicted in
Furthermore, the JCE method was evaluated under different number of paths investigating the effect of the level of sparsity on the estimation performance.
To compare the JCE and JCCE methods, the capacity was evaluated taking into account the number of pilots used to get the training signals that is expressed as Cp=(1−α)log2(1+ρ∥G diag(v)h∥2) where α=Npilots used/NTotal transmissions. The parameters Np=4 and Nb=200 were considered to obtain the training signal with channels generated in Mode 2. At coherence times TG and Th in the order of 100 ms and 0.1 ms, respectively,
The following parameter values were selected for simulations: Np=4 for the number of pilot symbols per UE-RIS coherence block and Nb=200 for the number of coherence blocks for UE-RIS channel. To evaluate the JCCE method, it was compared against the MO-EST estimation approach, where the channels are estimated at each coherence block and used to estimate the covariance matrix Rh. P=3 and Q=1 to represent the number of paths for the RIS-BS and UE-RIS channels, respectively.
The performance of each estimate was examined, aside from the baselines of the capacity with phase-shifts derived from the exact channels and the phase-shifts derived from the exact RIS-BS channel and UE-RIS CCM.
JCCE methods' performance was assessed in terms of the number of coherence blocks at SNR=5 db.
With respect to time-complexity analysis, the neural networks are trained in offline mode, and therefore are evaluated only is the inference mode, i.e., the forward propagation. The conventional method to evaluate the time-complexity of a neural network is the floating-point operations per second (FLOPs) [50]. For any fully connected layer Li of input size Ii and output size Oi that follows a dropout of rate 1−r and a batch normalization layers, the number of FLOPs is given by
Thus, the total number of FLOPs of the proposed neural network with 2 hidden layers yields
where H1 and H2 denote the size of the two hidden layers L1 and L2, respectively, r is the dropout rate applied before the two hidden layers, I represents the size of the input, and O the size of the output. Table 3 compares the order of complexity of inference of the proposed VI-based methods. Note that the input of the encoders are complex numbers, so the size of the input is multiplied by two considering the real and imaginary parts. That is, for the JCE method, the input to the neural networks is of size 2MNp. Moreover, a preprocessing is performed to the training signal for the JCCE method, i.e., {tilde over (Y)}{tilde over (Y)}H/Nb−IMN
Channel estimation poses a notable challenge for fully passive RIS-aided systems and the effectiveness of estimation schemes is dependent on the specific scenarios in which RIS systems are deployed. This disclosure relates to the CSI estimation problem in RIS-aided mmWave communication systems with fully-passive RIS elements using a VI-based framework to approximate the intractable posterior distribution of the channels with auxiliary distributions. In particular, there are two different novel approaches addressing two scenarios in which the RIS is deployed. The first method, named JCE, separately estimates the UE-RIS and RIS-BS I-CSI that is suitable for scenarios with low mobile users. This method is useful for decoupling the cascaded channels and allows the identification of the channels' behavior in each part. However, its main limitation lies in its susceptibility to high training and signaling overhead as the UR-RIS channel becomes more dynamic for high mobile users. To overcome this challenge, leveraging the slow-varying nature of the RIS-BS I-CSI and the UE-RIS S-CSI, a second method is disclosed, namely JCCE, that extends the VI-based framework used for JCE to estimate the RIS-BS channel and the UE-RIS CCM. Lastly, closed-form expressions of the phase-shifts are given, based on the obtained estimates for each use case considered in the methods. Sampling from the optimized auxiliary posterior distributions yields a capacity that is close to the one achieved with perfect CSI. Moreover, the JCCE provides an improvement of spectral efficiency through the reduction of the training overhead by relying on the slow-varying S-CSI of the UE-RIS channel rather than the I-CSI for the passive beamforming. Further development on the invention may include a more physically consistent RIS modeling, where the elements of the RIS experience mutual coupling, which leads to a non-diagonal reflection matrix. In addition, the multi-user scenario can be appropriately managed by employing identical phase-shifts for nearby users who share a similar covariance matrix.
A detailed derivation of the losses is provided, under the distributions investigated.
The entropy of a complex Laplace random variable z˜(m, b) with mean m and scale b is derived as:
Next, the closed-form of 3I-CSI (Eq. (13)) with complex Laplace priors is derived. In the first step, the expectation over hvir is computed, where
which is a constant with respect to hvir:
where C1 is a constant, m a vector of means of hvir following qλ
is the covariance matrix of hvir. The latter is a diagonal matrix with a main diagonal containing the variances of the elements. The variance of a complex Laplace is defined as follows:
Hence, the covariance matrix ∧ is expressed as follows:
∧i,j=6 diag(b)2 (52)
To compute Gvir, a constant matrix
i.e, A=FMHGvirC is defined. Hence:
Then, the property G
Further to the user device 2 and the access point 3, the wireless communication network includes a wave transformer 5 located at a geographically intermediate location between the access point 3 and the user device 2 and configured to reflect or redirect electromagnetic signals between the access point 3 and the user device 2. Typically, the wireless communication network includes a plurality of the wave transformers at geographically spaced locations from each other and from the access points. When there are a plurality of wave transformers, and a plurality of access points, then for a communication channel including a respective one of the user devices, the channel is formed between a respective one of the access points geographically closest to the user device and includes a respective one of the wave transformers which is geographically closest to an imaginary line interconnecting geographical locations of the respective access point and the respective user device between which the communication channel is formed.
The wave transformer 5 has a plurality of electronically reconfigurable antennas, so that it is a wave transformer of an artificial type, which is typically referred to in industry as a reconfigurable intelligent surface.
Yet further to the aforementioned elements of the network, the wireless communication network includes a central server 8 having a processor 9 and a non-transitory memory 10 operatively connected to the processor and storing instructions to be executed thereon. The central server 8 is communicatively connected to the access point 3, so as to be arranged to communicate or exchange data therebetween, and configured to control the wireless communication network. With respect to the wave transformer, however, the central server 8 is free of data connection with the wave transformer, such that data cannot be exchanged or communicated therebetween. The central server 8 may however be in control connection with the wave transformer, that is it may be communicatively connected to the wave transformer in such a manner as to transmit control or operational instructions, for example, regarding configuration of one or more of the antennas of the wave transformer.
The communication channel effectively has two constituent portions substantially defining a path of transmission of data between the access point and the user device. The constituent portions include a first portion between the access point and the wave transformer, which typically is static because both the access point and the wave transformer are stationary, and a second portion between the wave transformer and the user device, which is dynamic because the user device is movable and so a geographical location thereof may vary over time.
It will be appreciated that each tractable statistical distribution, which is based on a received signal (at a receiving one of the access point and the user device relative to a direction of data transmission) and not a transmitted signal (from a transmitting one of the access point and the user device relative to the direction of data transmission) is intended to model or represent a statistical distribution of the communication channel, that is the communication channel at the time or point of transmission of data. This is in part because an exact posterior statistical distribution of each constituent channel portion is intractable.
It will be appreciated that training of the neural networks, which is preferably unsupervised, may be performed or conducted using empirical data, that is data derived from actual measurements in the communication channel, or synthetic data.
It will be appreciated that ‘likelihood term’ may be alternatively referred to as ‘reconstruction error term.’
As described hereinbefore, the present disclosure relates to channel estimation in reconfigurable intelligent surfaces (RIS)-aided systems, which is used for optimal configuration of the RIS and various downstream tasks like user localization. In RIS-aided systems, channel estimation involves estimating two channels for the user-RIS (UE-RIS) and RIS-base station (RIS-BS) links. In the literature, two approaches are proposed: (i) cascaded channel estimation where the two channels are collapsed into a single one and estimated using training signals at the BS, and (ii) separate channel estimation that estimates each channel separately either in a passive or semi-passive RIS setting. In this disclosure, the separate channel estimation problem is investigated in a fully passive RIS-aided millimeter-wave (mmWave) single-user single-input multiple-output (SIMO) communication system. First, a variational-inference (VI) approach is adopted to jointly estimate the UE-RIS and RIS-BS instantaneous channel state information (I-CSI). Particularly, auxiliary posterior distributions of the I-CSI are learned through the maximization of the evidence lower bound. However, estimating the I-CSI for both links in every coherence block results in a high signaling overhead in scenarios with highly mobile users. Thus, our first approach is extended to go beyond the quasi-static assumption and leverage the slow-varying property of the RIS-BS channel. Our second method estimates the channel covariance matrix of the UE-RIS channel instead of the instantaneous channel. The simulation results demonstrate that maximum a posteriori channel estimation using the auxiliary posteriors approaches the capacity with perfect CSI. Leveraging the UE-RIS CCM enhances spectral efficiency by minimizing the pilot signaling to control the RIS, and exploiting its low-rank structure reduces training overhead compared to the maximum likelihood estimator.
The scope of the claims should not be limited by the preferred embodiments set forth in the examples but should be given the broadest interpretation consistent with the specification as a whole.
I-CSI, S-CSI
1
This application claims the benefit under 35 U.S.C. 119(e) of U.S. Provisional Application Ser. No. 63/446,646 filed Feb. 17, 2023.
Number | Date | Country | |
---|---|---|---|
63446646 | Feb 2023 | US |