Communication systems and methods in which vector based deep learning is applied for enhanced performance are provided.
5G millimeter wave (mmWave) systems promise higher bandwidth, and thus greater connectivity, than previous networks. However, 5G brings several challenges, including the higher path losses that 5G mmWaves experience from obstructions and the atmosphere, losses that are inherent to the frequencies used by 5G systems (28-39 GHz). To address the path loss issues, 5G communication cells are smaller than those of systems that operate at lower frequencies, and beamforming strategies are often employed for transmitter-to-receiver efficiency. As a result, the need for complex signal routing and frequent hand-off of signals between base stations (BS) can lead to safety-critical impacts to quality of service (QoS) and latency (e.g., for autonomous cars). Furthermore, beamforming (that is, establishing the proper signal routing instructions) requires considerable training overhead when using predefined codebooks. In addition, 5G heterogeneous networks (HetNets) comprise a mixture of base stations and their lower-tier companions (such as micro base stations (MBS), pico base stations (PBS), femto base stations (FBS), and remote radio heads (RRH)), as well as a multitude of user equipment (UE) served by these BSs. Such HetNets suffer from significant co-channel and intra-cell interference, impacts to achievable data rate (ADR), and load balancing (LB) coordination challenges.
Alkhateeb et al., “Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems,” IEEE Access, 2018, considered a single UE supported by multiple BSs. Each signal from a UE received at a BS carries a unique RF signature of the UE's location and its interaction with the transmission environment. This signature, which the authors referred to as the unique RF-signal signature (URS), was used to establish beamforming vectors for transmission from a BS to a UE. Using a simple 2-dimensional representation of the URS (e.g., real and imaginary parts) as input, Alkhateeb et al. then employed a deep learning (DL) technique, the convolutional neural network (CNN), to learn and predict the optimum beamforming vectors at a BS that optimized a UE's ADR. The 2D representation of the URS exploited the CNN's strength in image recognition tasks.
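By way of illustration only, the following sketch (in Python, with hypothetical array shapes and sample values, not the data format of Alkhateeb et al.) shows how complex uplink samples can be stacked into the kind of two-channel real/imaginary "image" that a CNN can accept as input:

```python
import numpy as np

# Hypothetical complex uplink samples, e.g., (subcarriers, antenna elements).
rng = np.random.default_rng(0)
uplink_samples = (rng.standard_normal((64, 32))
                  + 1j * rng.standard_normal((64, 32)))

# Two-channel "image" of the signature: channel 0 = real, channel 1 = imaginary.
urs_image = np.stack([uplink_samples.real, uplink_samples.imag], axis=0)
print(urs_image.shape)  # (2, 64, 32), suitable as input to a 2D CNN
```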
Systems such as that contemplated by Alkhateeb et al. showed that CNNs could be extremely fast and provide a 2× improvement in performance over the conventional codebook-search approach to beamforming vector determination. The speed and performance of these CNN systems were attained by exploiting two types of CNN layers: (1) “max pooling” and (2) convolution (both of which result in some level of invariance). Max pooling is a strategy that uses the maximum value of clustered CNN nodes to reduce model complexity and enable model generalization. Convolution is the application of a convolution kernel (much like blurring) to an image. The use of these types of layers results in invariance, or a “blindness” of the network to certain transformations. For example, CNNs ignore the spatial relationships between objects in an image by assuming that different internal representations of the image (i.e., the re-positioning of objects within an image) do not change the properties of the image (i.e., the whole is viewed as invariant). While CNNs are generally effective at image classification tasks, this invariance resulting from expediency measures (such as max pooling and convolution) makes networks employing such measures prone to misclassification.
Embodiments of the present disclosure provide systems and methods that use vector-based deep learning to improve the performance of a communications network, including but not limited to a 5G communications network. In particular, embodiments of the present disclosure employ vector based deep learning (VBDL) models. In contrast to CNNs, whose efficiency strategies result in invariance, VBDL methods preserve vector information (such as object spatial relationships), which results in the model demonstrating equivariance instead. Equivariance means that a transformation of the input propagates through the model to produce a corresponding transformation of the output. Accordingly, VBDL approaches can improve prediction accuracy relative to the prior art by overcoming various limitations of CNNs.
In an example scenario, a single BS serves multiple UE units. Embodiments of the present disclosure overcome CNN limitations of the prior art by employing VBDL models. By employing a VBDL model to predict optimal beamforming vectors, a much higher quality result can be obtained, as this approach is not prone to the same invariance as a CNN. In accordance with at least some embodiments of the present disclosure, a VBDL model is implemented using novel methods to optimize a neural network called a Hinton capsule network (HCN). An HCN improves image-classification performance over CNNs by preserving vector-based information such as the pose (i.e., x-y position and orientation) and the relative spatial relationships of objects (i.e., parts) in an image (i.e., a whole), as well as time-dependent spatial behavior such as velocity. As such, an HCN can correctly classify images that suffer from invariance when processed by a conventional CNN.
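For context, one widely published building block of capsule networks is the "squash" non-linearity, which limits a capsule's output vector length to the range [0, 1) while preserving its direction, so that the pose information carried by the vector survives. The sketch below illustrates only that generic operation, not the novel HCN optimizations of the present disclosure:

```python
import numpy as np

def squash(s, eps=1e-9):
    """Capsule 'squash' non-linearity: scale the vector's length into [0, 1)
    while keeping its direction, so pose information is preserved."""
    norm_sq = np.sum(s ** 2, axis=-1, keepdims=True)
    norm = np.sqrt(norm_sq + eps)
    return (norm_sq / (1.0 + norm_sq)) * (s / norm)

capsule_out = squash(np.array([3.0, 4.0]))  # direction of [3, 4] is retained
print(np.linalg.norm(capsule_out))          # length is just under 1
```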
In accordance with further embodiments of the present disclosure, a VBDL model is implemented using novel methods to optimize a neural network employing coordinate convolution (CoordConv) methods. CoordConv improves the image classification accuracy of CNNs by giving the input to the standard CNN convolutional filters a positional reference. This is done by adding at least two channels to the convolutional layer input, one for each of the Cartesian coordinates (here i and j). This additional information ensures that the CNN output is not invariant to pose, thus improving the classification accuracy. As applied in connection with at least some embodiments of the present disclosure, the CoordConv accuracy is enhanced by removing the max pooling layer and optimizing the filter size to capture the correlation between co-channel uplink signals. These enhancements will inevitably reduce the CoordConv runtime speed.
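A minimal sketch of the coordinate-channel idea (illustrative shapes and [-1, 1] normalization assumed; the filter-size optimization described above is not shown) is as follows:

```python
import numpy as np

def add_coord_channels(x):
    """Append two channels holding the i and j pixel coordinates, scaled to
    [-1, 1], to a (channels, height, width) input, as in CoordConv."""
    _, h, w = x.shape
    i_coords, j_coords = np.meshgrid(np.linspace(-1.0, 1.0, h),
                                     np.linspace(-1.0, 1.0, w),
                                     indexing="ij")
    return np.concatenate([x, i_coords[None], j_coords[None]], axis=0)

urs_image = np.zeros((2, 64, 32))      # e.g., real/imaginary URS channels
coordconv_input = add_coord_channels(urs_image)
print(coordconv_input.shape)           # (4, 64, 32), fed to the convolutional filters
```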
A challenge with VBDL models or approaches is that the inclusion of vector information results in a high-dimensional parameter space, and a much longer training time is therefore needed. Even after the VBDL model is trained, the prediction time can be long, making it unsuitable for real-time tasks. In accordance with at least some embodiments of the present disclosure, the VBDL model is optimized by novel pruning methods for runtime performance and reduced neural network size (e.g., to fit on small devices with limited memory and processing capabilities) in real-world applications.
Additional features and advantages of embodiments of the present disclosure will become more readily apparent from the following discussion, particularly when taken together with the accompanying drawings.
A communications system 100 such as may be operated in connection with embodiments of the present disclosure is depicted in
A base station 104 can include, can be in communication with, or can otherwise be associated with a processing system 200 for implementing a VBDL model or network as described herein. Components that can be included in such a processing system 200 are depicted in
The processor 204 may comprise a general purpose programmable processor, a set of processors, a field programmable gate array (FPGA), a controller, or the like. In accordance with embodiments of the present disclosure, the processor 204 is in the form of, includes, or is associated with a graphics processing unit (GPU). The GPU can be operated to efficiently process image data depicting RF signal signatures (URS) used to establish beamforming vectors in connection with a VBDL network as described herein. The processor 204 can operate to execute application programming 236 stored in the memory 208 and/or in the data storage 212. The application programming 236 can include firmware or software that implements methods in accordance with embodiments of the present disclosure, including implementing a VBDL model as described herein. In addition, the processor 204 can operate to process data, including signal signature data regarding signals passed between devices 108 and the base station 104.
The memory 208 may comprise solid-state, volatile, and/or non-volatile memory. Examples of suitable memory devices include RAM, DRAM, SDRAM, etc. The memory 208 can provide short and/or long term storage for the application programming 236, operating system software, data received from various sources, intermediate data products, beamforming code book data, and calculated data. The data storage 212 may comprise a hard disk drive, solid-state drive, solid-state memory, or other device for the storage of application programming and data.
The communication interface 216 may support communications between the processing system 200 and other systems. For example, the processing system 200 may be in communication with one or more base stations 104, another processing system 200, a network authority, or the like, via the communication interface 216. As examples, but without limitation, the communication interface 216 can include an Ethernet interface, a microwave communication interface, a satellite communication interface, a WiFi interface, a fiber optic network interface, or the like.
The beamforming network 220 generally includes feed lines for carrying signals between individual antenna elements 228 of the phased array antenna 224 and the transceiver 232. The beamforming network 220 can further include phase shifters, amplifiers, delay lines, attenuators, and the like, at least some of which are variable. As can be appreciated by one of skill in the art after consideration of the present disclosure, by selectively controlling the elements of the beamforming network 220, the attributes of RF beams produced by the phased array antenna 224 can be controlled. For example, the direction in three-dimensional space towards which a beam 112 is pointed can be controlled.
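By way of a simple, textbook illustration (a uniform linear array with hypothetical parameters, not the specific hardware of the beamforming network 220), the per-element phase shifts that steer a beam toward an angle measured from broadside can be computed as:

```python
import numpy as np

def steering_phases(num_elements, spacing_m, wavelength_m, theta_rad):
    """Per-element phase shifts (radians) that steer a uniform linear
    array's main beam toward the angle theta measured from broadside."""
    n = np.arange(num_elements)
    return -2.0 * np.pi * n * spacing_m * np.sin(theta_rad) / wavelength_m

# Example: 16 elements at half-wavelength spacing, 28 GHz carrier, 20 degree steer.
wavelength = 3e8 / 28e9
phases = steering_phases(16, wavelength / 2.0, wavelength, np.radians(20.0))
```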
The improvements in performance over the conventional codebook-search approach to beamforming vector determination that are available using a conventional CNN can be attained by exploiting two types of CNN layers: (1) “max pooling” and (2) convolution (both of which result in some level of invariance). Max pooling is a strategy that uses the maximum value of clustered CNN nodes to reduce model complexity and enable model generalization. Convolution is the application of a convolution kernel (much like blurring) to an image. The use of these types of layers results in invariance, or a “blindness” of the network to certain transformations. For example, CNNs ignore the spatial relationships between objects in an image by assuming that different internal representations of the image (i.e., the re-positioning of objects within an image) do not change the properties of the image (i.e., the whole is viewed as invariant).
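The local translation invariance introduced by max pooling can be seen in a toy example (illustrative only): shifting a feature within a single pooling window leaves the pooled output unchanged, so downstream layers cannot distinguish the two inputs.

```python
import numpy as np

def max_pool_2x2(x):
    """Non-overlapping 2x2 max pooling over a (height, width) array."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

a = np.zeros((4, 4)); a[0, 0] = 1.0   # feature at position (0, 0)
b = np.zeros((4, 4)); b[1, 1] = 1.0   # same feature shifted within the 2x2 window
print(np.array_equal(max_pool_2x2(a), max_pool_2x2(b)))  # True: the shift is invisible
```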
An example of a VBDL model that is capable of maintaining an awareness of spatial relationships, and therefore of providing a more robust neural network, is a novel, optimized version of a neural network called a Hinton Capsule Network (HCN) (see
In accordance with other embodiments of the present disclosure, a second VBDL model comprising a novel, optimized version of a neural network called coordinate convolution (CoordConv) is applied (see
In accordance with embodiments of the present disclosure, the accuracy of CoordConv can be improved by removing the max pooling layer and optimizing the filter size to capture the correlation between co-channel uplink signals.
In accordance with other embodiments of the present disclosure, output information from the convolutional filters may be used in backpropagation or other schemes to optimize weights during training.
In accordance with at least some embodiments of the present disclosure, VBDL models comprising an HCN or a CoordConv network are provided that are optimized by novel pruning methods for runtime performance and reduced network size (e.g., to fit on small devices with limited memory and processing capabilities) in real-world applications. Aspects of such models in accordance with embodiments of the present disclosure are depicted in
In accordance with other embodiments of the present disclosure, the pruning methods described in
In further embodiments of the present disclosure shown in
In accordance with other embodiments of the present disclosure, pruning is performed using weight similarity (e.g., cosine distance between weights) to remove redundant branches of the network.
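A minimal sketch of such similarity-based pruning (hypothetical layer shape and threshold; not the complete pruning procedure of the present disclosure) flags filters whose weight vectors are nearly parallel to a filter already kept:

```python
import numpy as np

def redundant_filters(weights, threshold=0.95):
    """Indices of filters whose cosine similarity to an earlier-kept filter
    exceeds the threshold (i.e., low cosine distance), making them pruning
    candidates. `weights` has shape (num_filters, filter_size)."""
    unit = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    similarity = unit @ unit.T
    kept, pruned = [], []
    for i in range(len(weights)):
        if any(similarity[i, j] > threshold for j in kept):
            pruned.append(i)
        else:
            kept.append(i)
    return pruned

layer_weights = np.random.default_rng(1).standard_normal((8, 27))  # e.g., eight 3x3x3 filters
print(redundant_filters(layer_weights))
```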
As can be appreciated by one of skill in the art after consideration of the present disclosure, signals used for training a VBDL network as disclosed herein can be real or simulated. For example, as depicted in
In accordance with embodiments of the present disclosure, during the VBDL network learning phase (
In accordance with further embodiments of the present disclosure, during the model prediction phase (
In accordance with further embodiments of the present disclosure, multiple UEs connected to multiple BSs, MBSs, PBSs, FBSs, etc., are modeled (
(a) The relative spatial or spatio-temporal relationships of the model-applicable elements in the HetNets, such as the relative locations and velocities of different UE or BS elements at a given time,
(b) The operational state of the model-applicable elements in the HetNets such as the number and the configuration of active UEs connected to a BS (i.e., the BS beamforming-vector-space including the number of antennas that are actively in use and the number that is available for use) and the sleep-or-active operating mode of other model-relevant UEs or BS-tiers at a given time,
(c) BS state values such as its power range (min-max),
(d) The state of the model environment such as obstacles and weather conditions,
(e) Inter/co-channel and intra-cell interferences characterized by metrics such as the Signal-to-Interference-plus-Noise Ratio (SINR),
(f) Positive or negative Load Balancing Power Bias (LBPB) applied to different BS-tiers (e.g., PBS) aimed at balancing the load among them in HetNets, and
(g) Network quality characterized by metrics such as a UE's QoS.
The EURS (the model input) necessarily contains more information than the URS alone. Judicious pruning and model optimization are therefore applied to expedite model training without loss of performance.
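Purely as an illustration of how an EURS input might be assembled (all field names, shapes, and values below are hypothetical, not a definitive format), the URS channels can be concatenated with the additional HetNet state summarized in items (a) through (g):

```python
import numpy as np

urs_image = np.zeros((2, 64, 32))      # real/imaginary uplink signal signature
hetnet_state = np.array([
    12.5, 3.0, 0.8,   # (a) relative location/velocity features (example values)
    24, 64,           # (b) active UEs, antennas available for use
    40.0,             # (c) BS maximum transmit power (dBm)
    1.0, 0.2,         # (d) obstacle flag, rain attenuation (dB/km)
    17.3,             # (e) SINR (dB)
    3.0,              # (f) load balancing power bias, LBPB (dB)
    0.96,             # (g) QoS indicator
])
eurs = np.concatenate([urs_image.ravel(), hetnet_state])  # single model input vector
```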
In accordance with embodiments of the present disclosure, the VBDL model operates to learn and predict the optimum beamforming vectors at a BS, MBS, PBS, FBS, etc., that optimize ADR and other real-world UE metrics, such as SINR and QoS, by using the EURS as input to the VBDL model. The VBDL model can also provide BS-tier coordination. For example, a PBS can optimally learn its positive LBPB while satisfying its own UEs' QoS requirements. At the same time, for PBS-MBS coordination, a PBS would inform an MBS (e.g., using backhaul fiber communications or direct RF links) of the MBS's potential to cause load balancing and interference issues, so that the MBS can serve its own UEs while reducing its LBPB and adhering to the PBS's SINR constraints.
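In use, the model's per-candidate predictions can be reduced to a single beam selection. The sketch below assumes a hypothetical `model.predict` interface and a hypothetical set of candidate beamforming vectors, and simply selects the candidate with the highest predicted ADR; it is one possible way to apply the model output, not the definitive method of this disclosure:

```python
import numpy as np

def select_beamforming_vector(model, eurs, candidates):
    """Pick the candidate beamforming vector with the highest predicted ADR.
    `model.predict` is a hypothetical interface returning one score per candidate."""
    predicted_adr = model.predict(eurs, candidates)   # shape: (len(candidates),)
    best = int(np.argmax(predicted_adr))
    return candidates[best], float(predicted_adr[best])
```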
In accordance with embodiments of the present disclosure, the beamforming vector predicted by the VBDL model to optimize the various metrics is provided as an output. That output can then be applied, for example by a beamforming network 220, to control the phased array antenna 224 of a base station 104 to produce the desired beam or beams for communications with one or more devices 108. Moreover, the VBDL model can be executed by a processing system 200 provided as part of a single base station 104, or can be executed by a processing system 200 provided separately from a base station 104. Whether integrated with a base station or not, a processing system 200 in accordance with embodiments of the present disclosure can serve one or more base stations 104.
As discussed herein, training and deployment of a VBDL model can include pruning. Pruning can be achieved by identifying parts of the trained model that are not activated after training is complete, and by identifying those parts of the trained network that do not contribute to the specific vector outputs of interest selected for a particular application. In general, pruning reduces the number of operations for optimized runtime performance and concentrates the model on the signal attributes used to determine an optimal beamforming vector.
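A minimal sketch of the first of these criteria, identifying units that are rarely activated (hypothetical threshold; activations would be collected over a representative validation set), is shown below. Units flagged this way are candidates for removal before deployment:

```python
import numpy as np

def inactive_units(activations, threshold=1e-3):
    """Indices of units whose mean absolute activation over a validation set
    falls below the threshold, i.e., parts of the network that are effectively
    unused. `activations` has shape (num_samples, num_units)."""
    mean_activity = np.mean(np.abs(activations), axis=0)
    return np.where(mean_activity < threshold)[0]

val_activations = np.abs(np.random.default_rng(2).standard_normal((100, 16)))
val_activations[:, [3, 7]] = 0.0        # two units that never fire
print(inactive_units(val_activations))  # -> [3 7]
```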
The foregoing discussion of the disclosed systems and methods has been presented for purposes of illustration and description. Further, the description is not intended to limit the disclosed systems and methods to the forms disclosed herein. Consequently, variations and modifications commensurate with the above teachings, within the skill or knowledge of the relevant art, are within the scope of the present disclosure. The embodiments described herein are further intended to explain the best mode presently known of practicing the disclosed systems and methods, and to enable others skilled in the art to utilize the disclosed systems and methods in such or in other embodiments and with various modifications required by the particular application or use. It is intended that the appended claims be construed to include alternative embodiments to the extent permitted by the prior art.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 62/854,152, filed May 29, 2019, the entire disclosure of which is hereby incorporated herein by reference.