The disclosure was made by or on behalf of the below listed parties to a joint research agreement. The joint research agreement was in effect on or before the date the disclosure was made and the disclosure was made as a result of activities undertaken within the scope of the joint research agreement. The parties to the joint research agreement are 1) Samsung Electronics Co., Ltd. and 2) Seoul National University R&DB Foundation.
The disclosure relates to a device and method for estimating a channel in a millimeter-wave (mmWave) communication system.
Since a millimeter-wave (mmWave) communication system employs a high frequency band (30 gigahertz (GHz) to 300 GHz) unlike existing fourth generation long term evolution (4G LTE) systems, the propagation of mmWave communication is highly attenuated depending on the distance. To offset such a disadvantage, a beamforming scheme through a multiple input multiple output (MIMO) system using a plurality of antennas is used in the mmWave communication system. Here, a channel coefficient that needs to be estimated in the MIMO system increases in proportion to the number of antennas in a receiver and a transmitter. To use a least square method or a minimum mean square error method that is an existing channel estimation method, in the MIMO system, pilot transmission overhead occurs because at least the same number of pilots as the number of channel coefficients need to be transmitted.
To reduce the pilot transmission overhead, research is being conducted on channel estimation methods using various schemes such as compressive sensing, MUSIC, deep learning schemes, and the like.
The above information is presented as background information only to assist with an understanding of the disclosure. No determination has been made, and no assertion is made, as to whether any of the above might be applicable as prior art with regard to the disclosure.
According to various embodiments disclosed in the disclosure, a channel may be estimated from pilot signals received in units of time slots, using a long short-term memory network and a fully connected network.
Aspects of the disclosure are to address at least the above-mentioned problems and/or disadvantages and to provide at least the advantages described below. Accordingly, an aspect of the disclosure is to provide a device and method for channel estimation using short/long-term memory network in millimeter-wave communication system.
Additional aspects will be set forth in part in the description which follows and, in part, will be apparent from the description, or may be learned by practice of the presented embodiments.
In accordance with an aspect of the disclosure, a channel estimation method is provided. The channel estimation method includes inputting a received pilot signal of a time slot to a long short-term memory network, extracting a time-varying channel feature embedding vector by estimating a change state of a channel by using the received pilot signal of the time slot as an input in the long short-term memory network, estimating a parameter of a channel model by using the time-varying channel feature embedding vector as an input in a fully connected network, and estimating a channel for the received pilot signal of the time slot, using the parameter of the channel model.
In accordance with another aspect of the disclosure, a channel estimation device is provided. The channel estimation device includes memory storing one or more computer programs and one or more processors communicatively coupled to the memory. The one or more computer programs include computer-executable instructions that, when executed by the one or more processors, cause the channel estimation device, when a received pilot signal of a time slot is input, extract, by a long short-term memory network, a time-varying channel feature embedding vector by estimating a change state of a channel by using the received pilot signal of the time slot as an input, estimate, by a fully connected network, a parameter of a channel model by using the time-varying channel feature embedding vector as an input, and estimate, by a channel reproduction unit, a channel for the received pilot signal of the time slot, using the parameter of the channel model are included.
According to an embodiment, a receiver includes a coupling unit configured to obtain a received pilot signal by multiplying a combining matrix by a signal received through an antenna, a channel estimation device configured to estimate a channel from pilot signals received in units of time slots, using a long short-term memory network and a fully connected network, an equalizer configured to correct a distortion in the pilot signals received in units of time slots, a demodulation unit configured to demodulate the corrected signal, and a decoding unit configured to decode the demodulated signal.
According to various embodiments, a device and method for estimating a channel from pilot signals received in units of time slots, using a long short-term memory network and a fully connected network are provided, and accurate parameters may be estimated in a situation in which a communication channel changes over time because a parameter of a geometric channel is estimated in a continuous domain through sequentially received pilot signals. In addition, since the parameters are estimated in a time domain instead of a frequency domain, channels for all subcarriers is estimated using a geometric channel model and Fourier transform.
Other aspects, advantages, and salient features of the disclosure will become apparent to those skilled in the art from the following detailed description, which, taken in conjunction with the annexed drawings, discloses various embodiments of the disclosure.
The above and other aspects, features, and advantages of certain embodiments of the disclosure will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
The same reference numerals are used to represent the same elements throughout the drawings.
The following description with reference to the accompanying drawings is provided to assist in a comprehensive understanding of various embodiments of the disclosure as defined by the claims and their equivalents. It includes various specific details to assist in that understanding but these are to be regarded as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the various embodiments described herein can be made without departing from the scope and spirit of the disclosure. In addition, descriptions of well-known functions and constructions may be omitted for clarity and conciseness.
The terms and words used in the following description and claims are not limited to the bibliographical meanings, but, are merely used by the inventor to enable a clear and consistent understanding of the disclosure. Accordingly, it should be apparent to those skilled in the art that the following description of various embodiments of the disclosure is provided for illustration purpose only and not for the purpose of limiting the disclosure as defined by the appended claims and their equivalents.
It is to be understood that the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a component surface” includes reference to one or more of such surfaces.
Hereinafter, a device and method for channel estimation using a long short-term memory network in a millimeter-wave (mmWave) communication system according to an embodiment of the disclosure is described in detail with reference to
It should be appreciated that the blocks in each flowchart and combinations of the flowcharts may be performed by one or more computer programs which include instructions. The entirety of the one or more computer programs may be stored in a single memory device or the one or more computer programs may be divided with different portions stored in different multiple memory devices.
Any of the functions or operations described herein can be processed by one processor or a combination of processors. The one processor or the combination of processors is circuitry performing processing and includes circuitry like an application processor (AP, e.g. a central processing unit (CPU)), a communication processor (CP, e.g., a modem), a graphics processing unit (GPU), a neural processing unit (NPU) (e.g., an artificial intelligence (AI) chip), a Wi-Fi chip, a Bluetooth® chip, a global positioning system (GPS) chip, a near field communication (NFC) chip, connectivity chips, a sensor controller, a touch controller, a finger-print sensor controller, a display drive integrated circuit (IC), an audio CODEC chip, a universal serial bus (USB) controller, a camera controller, an image processing IC, a microprocessor unit (MPU), a system on chip (SoC), an integrated circuit (IC), or the like.
Referring to
A receiver 100 of the mmWave communication system may be configured to include a coupling unit 110, a channel estimation device 120, an equalizer 130, a demodulation unit 150, and a decoding unit 150.
The coupling unit 110 may obtain a received pilot signal by multiplying a combining matrix by a signal received through an antenna.
The channel estimation device 120 may perform channel estimation from pilot signals received in units of time slots, using a long short-term memory network 122 and a fully connected network 124.
The equalizer 130 may correct a distortion in the pilot signals received in units of time slots.
The demodulation unit 140 may convert a corrected signal into a discrete signal through a demodulation process.
The decoding unit 150 may obtain accurate data transmitted by a transmitter, by performing error correction (channel decoding) on a demodulated signal and decoding the demodulated signal.
The channel estimation device 120 may be configured to include the long short-term memory network 122, the fully connected network 124, and a channel reproduction unit 126.
When a received pilot signal of a time slot is input, the long short-term memory network 122 may extract a time-varying channel feature embedding vector by estimating a change state of a channel by using the received pilot signal of the time slot as an input. The long short-term memory network 122 will be further described below with reference to
The fully connected network 124 may estimate a parameter of a channel model by using the time-varying channel feature embedding vector as an input. Here, the parameter of the channel model may include a departure angle, an arrival angle, a path delay, and a path gain. The fully connected network 124 will be further described below with reference to
The channel reproduction unit 126 may estimate a channel for the received pilot signal of the time slot, using the parameter of the channel model.
The channel reproduction unit 126 may estimate a channel by applying the parameter of the channel model to Equation 1 shown below.
Here, Ĥl[k] denotes an estimated channel matrix of a k-th subcarrier, {circumflex over (α)}il denotes an estimated channel gain of an i-th path in an l-th time slot, fs denotes a spacing between subcarriers, {circumflex over (τ)}il denotes an estimated path delay of the i-th path in the l-th time slot, {circumflex over (θ)}il denotes an estimated arrival angle of the i-th path in the l-th time slot, {circumflex over (ϕ)}il denotes an estimated departure angle of the i-th path in the l-th time slot, aR({circumflex over (θ)}il) denotes a steering vector for the estimated departure angle of the i-th path in the l-th time slot, and aT({circumflex over (ϕ)}il) denotes a steering vector for the estimated arrival angle of the i-th path in the l-th time slot.
A process of deriving Equation 1 is described below.
A transmitter may include “NT” antennas and “NRF” RF chains, and may transmit “M” pilot training symbols in total. Among them, a pilot signal sm[k] may be transmitted by multiplying a k-th subcarrier of an m-th symbol by a pre-coding matrix Fm. The receiver 100 may be assumed to include “NR” antennas and “NRF” RF chains.
The receiver 100 receives a signal rm[k] obtained by adding a noise vector vm[k] and passing through a channel H[k] to a pilot signal Fmsm[k] transmitted by the transmitter. Therefore, the signal rm[k] in a frequency domain received by the receiver 100 may be expressed as shown in Equation 2 below.
Here, H[k] denotes a channel matrix corresponding to a k-th subcarrier in a frequency domain, Fm denotes a precoding matrix of an m-th symbol, sm[k] denotes a pilot signal of the k-th subcarrier of the m-th symbol in the frequency domain, and vm[k] denotes noise of the k-th subcarrier of the m-th symbol in the frequency domain.
The coupling unit 110 of the receiver 100 may obtain a combined signal ym[k] by multiplying the received signal rm[k] by a combiner matrix Wm. Accordingly, a final received signal ym[k] in the frequency domain may be expressed as shown in Equation 3 below.
Here, Wm* denotes a transpose conjugate matrix of a combiner matrix, H[k] denotes a channel matrix corresponding to a k-th subcarrier in a frequency domain, Fm denotes a precoding matrix of an m-th symbol, sm[k] denotes a pilot signal of the k-th subcarrier of the m-th symbol in the frequency domain, and vm[k] denotes noise of the k-th subcarrier of the m-th symbol in the frequency domain.
In addition, if the geometric channel model is used, a channel matrix H(τ) in a time domain may be configured as shown in Equation 4 below.
Here, Np denotes the number of channel paths, αi denotes a channel gain of an i-th path, τi denotes a path delay of the i-th path, θi denotes an arrival angle of the i-th path, and ϕi denotes a departure angle of the i-th path. In addition, aR and aT denote steering vectors defined as shown in Equation 5 below.
The channel matrix H(τ) of the time domain through the channel matrix H[k] of the k-th subcarrier may be calculated from Fourier transform. Accordingly, H[k] may be expressed as shown in Equation 6 below.
Here, H[k] denotes an estimated channel matrix of a k-th subcarrier, αi denotes a channel gain of an i-th path, fs denotes a spacing between subcarriers, τi denotes a path delay of the i-th path, θi denotes an arrival angle of the i-th path, ϕi denotes a departure angle of the i-th path, aR(θi) denotes a steering vector for the departure angle of the i-th path, and aT(ϕi) denotes a steering vector for the arrival angle of the i-th path.
Therefore, Equation 1 may be derived from Equation 6.
The channel estimation device 120 with an artificial intelligence neural network structure proposed in the disclosure may be trained by an Adam optimization algorithm, to minimize a loss function.
A loss function of the channel estimation device 120 with the artificial intelligence neural network structure may be defined as shown in Equation 7 below.
Here, Ĥl[k] denotes a channel matrix corresponding to a k-th subcarrier and an estimated l-th time slot.
Meanwhile, since “M” training pilot symbols share the same parameters in a time domain, an input y of a long short-term memory network may be generated by concatenating them. In other words, the input y of the network may be expressed as shown in Equation 8 below by concatenating the “M” training pilot symbols.
is satisfied, which indicates a collection of all received signals for “K” subcarriers in an m-th training symbol.
To estimate a channel that changes for a period of L times over time, inputs y corresponding to L times are called y1, y2, . . . , yl, . . . , yL in an order. Since an l-th input yl is a complex number, it may be converted into a real number and input to the long short-term memory network 122.
In other words, an input yin of the long short-term memory network 122 is yin=[(yl), (yl)], which connects a real part converted to a real number and an imaginary part.
Referring to
The long short-term memory network 122 may calculate an output of the input gate 202 that determines a degree to which a candidate state information cell is reflected in a final state information cell based on a received pilot signal of a time slot and an output of the long short-term memory network of a previous time slot. Here, the output of the input gate 202 may be calculated as shown in Equation 9 below.
Here, IC denotes an output of an input gate, σ denotes a sigmoid function, WI and UI denote weight matrices of the input gate, bI denotes a deviation of the input gate, yinl denotes an input of an l-th time slot, and zl-1 denotes an output of the long short-term memory network of an (l−1)-th time slot.
The long short-term memory network 122 may calculate the forget gate 201 that determines a degree to which a final state information cell of the previous time slot is reflected in the final state information cell based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot. Here, an output of the forget gate 201 may be calculated as shown in Equation 10 below.
Here, FC denotes an output of a forget gate, σ denotes a sigmoid function, WF and UF denote weight matrices of the forget gate, bF denotes a deviation of the forget gate, yinl denotes an input of an l-th time slot, and zl-1 denotes an output of the long short-term memory network of an (l−1)-th time slot.
The long short-term memory network 122 may calculate an output of the output gate 203 that determines a degree to which the final state information cell is reflected in an output of the long short-term memory network based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot. Here, the output of the output gate 203 may be calculated as shown in Equation 11 below.
Here, OC denotes an output of an output gate, σ denotes a sigmoid function, WO and UO denote weight matrices of the output gate, bO denotes a deviation of the output gate, yinl denotes an input of an l-th time slot, and zl-1 denotes an output of the long short-term memory network of an (l−1)-th time slot.
The long short-term memory network 122 may calculate the candidate state information cell based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot, using the hyperbolic tangent 204. Here, the candidate state information cell may be calculated as shown in Equation 12 below.
Here, {tilde over (C)} denotes a value of the candidate state information cell, tan h( ) denotes a hyperbolic tangent function, WC and UC denote weight matrices of the candidate state information cell, bC denotes a deviation of the candidate state information cell, yinl denotes an input of an l-th time slot, and zl-1 denotes an output of the long short-term memory network of an (l−1)-th time slot.
The long short-term memory network 122 may calculate the final state information cell by adding, using an adder 207, a value obtained by multiplying the output of the forget gate 201 by the final state information cell of the previous time slot using a first multiplier 205 and a value obtained by multiplying the output of the input gate 202 by the candidate state information cell using a second multiplier 206. Here, the final state information cell may be calculated as shown in Equation 13 below.
Here, Cl denotes a value of the final state information cell of an l-th time slot, FC denotes an output of a forget gate, Cl-1 denotes a value of the final state information cell of an (l−1)-th time slot, IC denotes an output of an input gate, and {tilde over (C)} denotes a value of a candidate state information cell.
The long short-term memory network 122 may calculate and output a time-varying channel feature embedding vector by multiplying an output gate by a value obtained by applying the hyperbolic tangent 208 to the final state information cell using a third multiplier 209. Here, the time-varying channel feature embedding vector may be calculated as shown in Equation 14 below.
Here, zl denotes a time-varying channel feature embedding vector of an l-th time slot, OC denotes an output of an output gate, tank( ) denotes a hyperbolic tangent function, and Cl denotes a value of the final state information cell of an l-th time slot.
Referring to
The input layer 310 may function to adjust a time-varying channel feature embedding vector to a dimension of the hidden layer 320.
The input layer 310 may output a value calculated by Equation 15 shown below.
Here, x0 denotes an output of an input layer, W0 denotes a weight matrix of the input layer, b0 denotes a deviation of the input layer, zl denotes a time-varying channel feature embedding vector that is an output of a long short-term memory network, and factivation denotes an activation function.
The output x0 of the input layer 310 is input to “Nhidden” hidden layers 320.
The hidden layer 320 may output a value calculated by Equation 16 shown below.
Here, xi denotes an output of an i-th hidden layer, Wi denotes a weight of the i-th hidden layer, bi denotes a deviation of the i-th hidden layer, and factivation denotes an activation function used in each hidden layer.
An output xN
The output layer 330 may output a value calculated by Equation 17 shown below.
Here, Ψl denotes a final output that yields estimates of a departure angle, an arrival angle, a path delay, and a path gain for each channel path in an l-th time slot, tan h( ) denotes a hyperbolic tangent function, Wout denotes a weight of an output layer, xN
In other words, Ψl corresponds to estimates {circumflex over (ϕ)}l, {circumflex over (θ)}l, {circumflex over (τ)}l, and {circumflex over (α)}l of the departure angle, the arrival angle, the path delay, and the path gain in the l-th time slot. Here, the estimate {circumflex over (α)}l of the path gain includes a real part ({circumflex over (α)}l) and an imaginary part ({circumflex over (α)}l).
Hereinafter, a method according to the disclosure configured as described above will be described with reference to the drawings.
Referring to
In operation 420, the long short-term memory network of the channel estimation device may extract a time-varying channel feature embedding vector by estimating a change state of a channel by using the received pilot signal of the time slot as an input.
In operation 420, the long short-term memory network may extract the time-varying channel feature embedding vector, using the received pilot signal of the time slot, a final state information cell of the long short-term memory network of a previous time slot, and an output of the long short-term memory network of the previous time slot. An operation of the long short-term memory network will be further described below with reference to
In operation 430, a fully connected network of the channel estimation device may estimate a parameter of a channel model by using the time-varying channel feature embedding vector as an input. Here, the parameter of the channel model may include a departure angle, an arrival angle, a path delay, and a path gain.
In operation 430, the fully connected network may estimate the parameter of the channel model by matching the time-varying channel feature embedding vector and the parameter of the channel model, using an input layer, at least one hidden layer, and an output layer.
In operation 440, the channel estimation device may estimate a channel for the received pilot signal of the time slot, using the parameter of the channel model.
In operation 440, the channel estimation device may perform estimation by applying the parameter of the channel model to Equation 1 described above.
Referring to
In operation 520, the long short-term memory network may calculate an output of a forget gate that determines a degree to which a final state information cell of the previous time slot is reflected in the final state information cell based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot.
In operation 530, the long short-term memory network may calculate an output of an output gate that determines a degree to which the final state information cell is reflected in an output of the long short-term memory network based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot.
In operation 540, the long short-term memory network may calculate the candidate state information cell based on the received pilot signal of the time slot and the output of the long short-term memory network of the previous time slot.
In operation 550, the long short-term memory network may calculate the final state information cell by adding a value obtained by multiplying the output of the forget gate by the final state information cell of the previous time slot and a value obtained by multiplying the output of the input gate by the candidate state information cell.
In operation 560, the long short-term memory network may calculate and output the time-varying channel feature embedding vector by multiplying the output of the output gate by a value obtained by applying a hyperbolic tangent to the final state information cell.
While the disclosure has been shown and described with reference to various embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the disclosure as defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
10-2021-0112977 | Aug 2021 | KR | national |
This application is a continuation application, claiming priority under § 365(c), of an International Application No. PCT/KR2022/009832 filed on Jul. 7, 2022, which is based on and claims the benefit of a Korean patent application number 10-2021-0112977 filed on Aug. 26, 2021 in the Korean Intellectual Property Office, the disclosure of which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/KR2022/009832 | Jul 2022 | WO |
Child | 18584561 | US |