The invention relates to wireless communication and, more specifically, to spectrum cartography.
All wireless transmissions use a portion of the radio frequency spectrum. Cellular phones, broadcast television, satellite, and short-distance wireless networks such as Bluetooth and wireless local area networks (WLAN), e.g., Wi-Fi, typically use different portions of the wireless frequency spectrum. Often it is important to coordinate the use of the various technologies and frequency ranges to ensure that the technologies do not interfere with each other or with planned future services.
This disclosure describes techniques for constructing power spectral density (PSD) maps representative of the distribution of radio frequency (RF) power as a function of both frequency and space (geographic location). For example, the disclosure describes techniques for constructing PSD maps using a robust basis pursuit form of signal expansion.
To further address the challenges encountered with this multi-dimensional sensing vision, this disclosure presents a collaborative sensing scheme whereby cognitive radios (CRs) cooperate to localize the actively transmitting primary users (PUs) and estimate their PSD across space in the presence of model uncertainties.
One example of the techniques described herein is a novel sensing scheme based on a parsimonious system model accounting for the scarce presence of active PUs in the same frequency band(s), in the monitored area, due to mutual interference. Using a virtual grid-based approach for the potential PU transmitter locations, a form of spatial-domain sparsity emerges because actual PU transmitters are present in only a few of the potential (grid) locations. A basis expansion model is then adopted to approximate the PU transmit-PSD distribution in frequency, which renders the sensing objective tantamount to estimating the PSD basis coefficients corresponding to each grid point. Since individual PU transmissions are narrow-band relative to the large swath of frequencies a CR can sense, only a few of the PSD basis coefficients are nonzero, a fact giving rise to frequency-domain sparsity.
This parsimonious system model thus entails a form of hierarchical dual-domain sparsity in the PSD basis coefficients that are to be estimated, in the sense that groups of coefficients corresponding to locations with no PUs will be collectively zero. In addition, some of the basis coefficients within groups corresponding to active PU locations will be zero. Capitalizing on this form of hierarchical sparsity, a group sparse regression problem is formulated, which is to be solved centrally either by a fusion center (FC) or by individual CRs using their measured PSDs. A novel low-complexity algorithm for solving such a problem is developed using the alternating direction method of multipliers (ADMoM).
One challenge addressed by the techniques is acquiring the grid-to-CR channel gains present in the underlying regression matrix. One way to acquire such information is through the channel gain cartography approach. However, possibly inaccurate channel gains or adoption of a shadowing-agnostic path loss-only model could deteriorate the performance of the sensing algorithm. Also, a grid-based approach itself introduces possible model offsets, as the actual PU locations may not coincide with points of the grid. To account for these uncertainties, a robust version of the group sparse (GS) least-absolute-shrinkage-and-selection operator (Lasso) is described herein. One technique described herein is an extension of the sparse total least-squares (TLS) framework to incorporate the hierarchical sparsity inherent to this sensing application. Combining the merits of Lasso, group Lasso, and TLS, the proposed group sparse (GS-)TLS approach described herein yields hierarchically-sparse PSD estimates that are also robust to model uncertainties induced by the random channel, grid offsets, and basis approximation errors. In spite of the non-convexity of the proposed GS-TLS criterion, an iterative solver with guaranteed convergence to at least a locally-optimal solution is described.
Additional factors that may compromise the accuracy of PSD estimates at the CRs are abrupt changes in shadow fading that may be due to, e.g., moving obstacles or moving CRs, and possible failures of the sensing modules themselves. A robust GS-TLS formulation is proposed here that is capable of discerning and removing such so-called model outliers, which in turn leads to reliable PSD estimates. Moreover, sorting out unreliable measurements not only promotes estimation accuracy, but also leads to self-healing and re-organization mechanisms for the CR network.
This disclosure introduces the basis expansion model, and describes the PSD observations used for the model fitting approach. A centralized algorithm for solving GS-Lasso problems is described, and perturbations in the channel (regression) matrices are then considered. An outlier-resilient sensing algorithm is devised, and numerical results are provided.
In one example, a method comprises sensing local radio-frequency (RF) interference spectrum at each of a plurality of sensors positioned at a plurality of locations within a geographic region, and computing a basis expansion model from the sensed RF interference spectrum at each of the sensors to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location. The basis expansion model is computed as a plurality of functions having a corresponding coefficient, each of the functions representing a power emitted by an RF-enabled device on a corresponding frequency band. Moreover, the coefficients of the basis expansion model are computed using a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso).
In another example, a system comprises a plurality of sensors to sense a local radio-frequency (RF) interference spectrum at each of a plurality of locations within a geographic region. A processor computes a basis expansion model from the sensed RF interference spectrum at each of the sensors to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location. The processor computes the basis expansion model as a plurality of functions having a corresponding coefficient, each of the functions representing a power emitted by an RF-enabled device on a corresponding frequency band. The processor computes the coefficients of the basis expansion model using a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso).
In another example, a mobile device comprises a sensor to sense a local radio-frequency (RF) interference spectrum at locations within a geographic region. The mobile device further comprises a processor that computes a basis expansion model from the sensed RF interference spectrum at each of the sensors to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location. The processor computes the basis expansion model as a plurality of functions having a corresponding coefficient, each of the functions representing a power emitted by an RF-enabled device on a corresponding frequency band. The processor computes the coefficients of the basis expansion model using a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso).
The details of one or more examples are set forth in the accompanying drawings and the description below.
In general, FC 16 is a computing system of one or more computing devices that applies the techniques described herein to compute one or more PSD maps for the geographic region. In one example, FC 16 applies robust parametric basis pursuit as a form of signal expansion for constructing PSD maps based on the observations. FC 16 may, for example, compute the basis expansion model from the sensed RF interference spectrum at each of the CRs 12 to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location. In general, a basis expansion model consists of a superposition of shifted and scaled versions of reference basis functions. The reference basis functions represent the frequency distribution of the RF power, i.e., RF power present at different frequency slots. The basis expansion model may be computed as a plurality of reference basis functions having corresponding coefficients, each of the reference basis functions representing a power emitted by an RF-enabled device 15 on a corresponding frequency band. As used herein, a parametric basis expansion model refers to a basis expansion model where scaling coefficients of the reference basis functions are computed as scalars.
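As a minimal illustration of the basis expansion model just described, the following Python sketch evaluates a transmit PSD as a superposition of shifted and scaled copies of one reference basis function. The rectangular pulse shape, the function names, and all numeric values are assumptions chosen for illustration only; the disclosure does not prescribe a particular basis.

```python
from typing import Callable, Sequence


def rect_basis(center: float, width: float) -> Callable[[float], float]:
    """Return a rectangular reference basis function shifted to `center`.

    The rectangular shape is an illustrative assumption; other pulse
    shapes could be substituted without changing the structure.
    """
    def b(f: float) -> float:
        return 1.0 if abs(f - center) <= width / 2.0 else 0.0
    return b


def psd_from_expansion(theta: Sequence[float],
                       bases: Sequence[Callable[[float], float]],
                       f: float) -> float:
    """Evaluate sum_v theta[v] * b_v(f) at frequency f."""
    return sum(t * b(f) for t, b in zip(theta, bases))


# Ten evenly spaced adjacent bands (hypothetical center frequencies).
centers = [100.0 + 10.0 * v for v in range(10)]
bases = [rect_basis(c, 10.0) for c in centers]

# Only the band centered at 130 is occupied, so the scalar
# coefficient vector is sparse.
theta = [0.0] * 10
theta[3] = 2.5
```

Evaluating `psd_from_expansion(theta, bases, 130.0)` returns the single nonzero coefficient, while frequencies in unoccupied bands evaluate to zero; this is the frequency-domain sparsity that the sensing scheme exploits.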
Moreover, as described below, FC 16 may apply a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso) in computing the coefficients of the basis expansion model. In general, the least-absolute-shrinkage-and-selection operator (Lasso) is one example of a compressive sensing operator. Group sparse (GS) Lasso techniques are described in which a group sparsity structure for the basis expansion model is encoded and enforced. In this way, as further described below, a parametric PSD map representative of the distribution of the RF power can be recovered, i.e., computed, with fewer measurements. The GS-Lasso operator, in effect, is applied to select which coefficients in the model are non-zero and are to be estimated by FC 16. For example, in accordance with the techniques described herein, the GS-Lasso identifies the locations in space where an RF-enabled device 15 is actively transmitting, and the frequency band(s) where transmission occurs. Specifically, by application of the GS-Lasso described herein, non-zero coefficients in the basis expansion model for a transmitter's location correspond to occupied bands. In this way, FC 16 may apply the GS-Lasso operator described herein to enforce group sparsity within the basis expansion model, thereby reflecting sparsity within system 10 that may arise due to geographic sparsity of RF-enabled devices 15 within the measured environment and sparsity in the frequency domain, since individual transmissions may occupy small portions of the spectrum. In some cases, FC 16 may use a most parsimonious sparse signal expansion over an overcomplete basis set to construct the PSD maps.
Consider an incumbent PU system comprising Ns transmitters (sources) located in a geographical area A⊂R2. Their activity over a frequency band B is to be monitored via cooperation of Nr CRs, also located in A. Let S:={xsεA}s=1N
Let us(t) be the (unknown) signal transmitted by PU s at time t. Then, the signal received at CR position xr at time t can be expressed as
where hx
(As1) Sources {us(t)} are stationary, mutually uncorrelated, independent of {hx
Consider approximating the transmit-PSD of PU s using the following basis expansion model:
where Nb is assumed sufficiently large, and {Θsv}v=1N
Channel {hx
where γx
for every s and r.
Received samples {yr(t)} are parsed into N-dimensional blocks, where N is chosen equal to (or smaller than) the coherence interval of the small-scale fading, over which hx
Shadowing and small-scale fading are characterized by different dynamics. The following is assumed regarding channel propagation and modeling.
(As3) The coherence interval of shadow fading exceeds that of fx
Based on (As1)-(As3), it is possible to express the PSD measured at location x due to Ns simultaneous PU transmissions as
where σv2 denotes noise variance at the receiver, and gx
As neither the number of PU sources nor their locations are known to the CRs, a set of Ng candidate transmit-PUs is postulated on a grid of locations G:={xgεA}g=1N
Define the Nb×1 vector Θg:=[Θg1, . . . , ΘgN
φx
The sensing objective of revealing PU locations and the available portions (sub-bands) of B is tantamount to estimating Θ. To this end, CRs rely on the periodogram estimate of Φx
with αε(0,1] denoting the so-called forgetting factor. As shown gives an estimate of the PSD measured at point xr, and frequency f as
̂Φx
with (asymptotic) variance bounded as
After dropping τ for notational brevity, let ̂φx
Based on the linear model (3), the sensing objective is to estimate Θ from the received-PSD estimate
gathered at CR locations {xr}r=1N
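The periodogram averaging with a forgetting factor described above can be sketched as follows. This is a minimal sketch under stated assumptions: the normalized exponential weighting, the function names, and the block layout are illustrative choices, since the exact recursion is not reproduced in the text; with a forgetting factor of 1 the sketch reduces to a plain sample average.

```python
import cmath
from typing import List, Sequence


def periodogram(block: Sequence[complex]) -> List[float]:
    """Periodogram |DFT(block)|^2 / N at the N DFT frequencies."""
    n = len(block)
    out = []
    for k in range(n):
        acc = sum(block[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                  for t in range(n))
        out.append(abs(acc) ** 2 / n)
    return out


def weighted_psd(blocks: Sequence[Sequence[complex]],
                 alpha: float) -> List[float]:
    """Exponentially weighted average of per-block periodograms.

    Block t (1-based) receives weight alpha**(tau - t), so the most
    recent block has weight 1. The normalization by the sum of the
    weights is an assumption of this sketch.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    tau = len(blocks)
    n = len(blocks[0])
    num = [0.0] * n
    den = 0.0
    for t, block in enumerate(blocks, start=1):
        w = alpha ** (tau - t)
        den += w
        for k, p in enumerate(periodogram(block)):
            num[k] += w * p
    return [v / den for v in num]
```

A smaller forgetting factor discounts older blocks more heavily, which trades estimator variance for the ability to follow slow changes in the received PSD.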
The following describes spectrum sensing via Group Sparse Lasso. The number of active PUs transmitting over the same spectral band in a given area is naturally limited by mutual interference. As a consequence, the number of PU sources (Ns) is far smaller than Ng, for a sufficiently dense grid. Absence of PU sources in most grid locations gives rise to a group sparsity of the vector Θ, since Θg=0N
One criterion for estimating Θ is the (non-negative) least-squares (LS). However, LS fails to provide a parsimonious model estimate involving only the prominent variables. The Lasso and the so-called group Lasso on the other hand, were proposed to overcome such a limitation of LS. In the Lasso criterion, the LS cost is augmented with the l1-norm |Θ|1 to encourage sparsity at the single-coefficient level; while in the group Lasso, the regularization term
is referred to herein as a group sparse (GS) penalty that enforces group sparsity within the model.
Combining Lasso with group Lasso, the so-called group sparse (GS-)Lasso provides a parsimonious model estimate, where sparsity is accounted for both at the group- and at the single-coefficient levels. This hierarchical sparsity is possible by regularizing the conventional LS cost with the term
combined with |Θ|1.
Taking also into account the non-negativity of PU power spectra, Θ can be estimated by solving the following sparse regression problem, where Θ indicates the location in space and the frequency band for any active transmitter (RF-enabled device 15):
where the coefficient λ1≧0 enforces sparsity at individual entries, whereas λ2≧0 promotes group sparsity. For λ2=0 (λ1=0), (6) reduces to the Lasso (group Lasso) based estimate. In this way, the group sparse Lasso (GS-Lasso) identifies the locations in space where an RF-enabled device 15 is actively transmitting, and the frequency band(s) where transmission occurs. Specifically, non-zero coefficients in the basis expansion model for a transmitter's location correspond to occupied bands. PU localization and PSD estimation have previously been viewed as a sparse linear regression problem; here, the formulation is considerably broadened by taking into account both individual and group sparsity.
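To make the roles of the two regularization coefficients concrete, the following Python sketch evaluates a GS-Lasso style cost for a candidate coefficient vector. It is a sketch under assumptions: the unscaled least-squares term, the contiguous grouping of Nb coefficients per grid point, and the function name `gs_lasso_cost` are illustrative and are not taken verbatim from criterion (6).

```python
import math
from typing import Sequence


def gs_lasso_cost(phi: Sequence[float],
                  B: Sequence[Sequence[float]],
                  theta: Sequence[float],
                  n_b: int,
                  lam1: float,
                  lam2: float) -> float:
    """LS fit + lam1 * ||theta||_1 + lam2 * sum_g ||theta_g||_2.

    theta is split into consecutive groups of n_b coefficients, one
    group per candidate grid location (assumed layout).
    """
    resid = [p - sum(b * t for b, t in zip(row, theta))
             for p, row in zip(phi, B)]
    ls = sum(r * r for r in resid)
    l1 = sum(abs(t) for t in theta)
    groups = [theta[i:i + n_b] for i in range(0, len(theta), n_b)]
    l2 = sum(math.sqrt(sum(t * t for t in g)) for g in groups)
    return ls + lam1 * l1 + lam2 * l2
```

With two grid points and two bases each, a vector whose second group is entirely zero pays the group penalty only for the first group, which is how the criterion favors explanations that place transmitters at few grid locations.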
To obtain regression matrices {Bx
With respect to the PSD atlas, it is worth re-iterating that identifying the support of the vector Θ reveals not only the primary sub-bands occupied, but also the locations where the active PU transmitters reside. Complementing this information with either the PUs' channel gain maps or a simple path loss-based propagation model, CRs can readily reconstruct the PSD atlas; that is, estimate PSD maps at any location of the monitored area as
with ĝx
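A PSD map reconstruction of the kind described above can be sketched in Python as follows. The sketch assumes the simple path loss-based propagation option; the specific gain law, its exponent and reference distance, and all function names are hypothetical, and the receiver noise term is omitted.

```python
import math
from typing import Callable, Sequence, Tuple

Point = Tuple[float, float]


def path_loss_gain(x: Point, xg: Point,
                   d0: float = 1.0, exponent: float = 3.0) -> float:
    """Hypothetical path-loss gain, capped at 1 near the transmitter."""
    d = math.dist(x, xg)
    return 1.0 if d <= d0 else (d0 / d) ** exponent


def psd_map(x: Point, f: float,
            grid: Sequence[Point],
            theta_hat: Sequence[Sequence[float]],
            bases: Sequence[Callable[[float], float]]) -> float:
    """Estimated PSD at location x and frequency f.

    theta_hat[g] holds the estimated basis coefficients for grid
    point g; all-zero groups correspond to locations with no PU.
    """
    total = 0.0
    for xg, theta_g in zip(grid, theta_hat):
        if not any(theta_g):
            continue  # no active transmitter at this grid point
        tx_psd = sum(t * b(f) for t, b in zip(theta_g, bases))
        total += path_loss_gain(x, xg) * tx_psd
    return total
```

Only grid points in the support of the estimated coefficient vector contribute, so the cost of evaluating the map scales with the number of detected transmitters rather than with the grid size.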
ADMoM-based Solver: In this section, a reduced-complexity algorithm attaining the optimal solution of GS-Lasso problems will be developed using the alternating direction method of multipliers (ADMoM). This disclosure shows that (6) admits an equivalent reformulation that can be solved via ADMoM. Before doing so, the following lemmas are needed.
Lemma 1 Consider the following convex minimization problem in the variable yεRN
Albeit non differentiable, (8) admits a closed-form solution. Specifically, the global minimizer y* is given by the following soft-thresholding vector operation expressed in terms of [a]+:=max{0,a} as
Proof. It will be argued that the solver of (8) takes the form y=za for some scalar z≧0. This is because among all y with the same l2-norm, the Cauchy-Schwarz inequality implies that the maximizer of aTy is collinear with (and in the same direction as) a. Substituting y=za into (8) renders the problem scalar in z≧0, with solution z*=(∥a∥−λ)+/(c∥a∥2), which completes the proof.
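The vector soft-thresholding operation of Lemma 1 can be sketched in Python as follows. The sketch assumes that problem (8) has the form min over y of (c/2)·∥y∥² − aᵀy + λ∥y∥ with the l2-norm, which is the form consistent with the proof above; the function name is an illustrative choice.

```python
import math
from typing import List, Sequence


def group_soft_threshold(a: Sequence[float], lam: float,
                         c: float) -> List[float]:
    """Minimizer of (c/2)*||y||^2 - a^T y + lam*||y||_2 (assumed form).

    The whole vector is set to zero when ||a||_2 <= lam; otherwise a is
    scaled by z = (||a||_2 - lam) / (c * ||a||_2).
    """
    norm_a = math.sqrt(sum(x * x for x in a))
    if norm_a <= lam:
        return [0.0] * len(a)
    z = (norm_a - lam) / (c * norm_a)
    return [z * x for x in a]
```

For a=[3, 4], λ=1 and c=2 the norm is 5, the scale is 0.4, and the result is approximately [1.2, 1.6]; raising λ above 5 zeroes the entire group at once, which is the mechanism behind group sparsity.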
Lemma 2 Consider the following non-smooth convex problem in the vector variable yεRN
Using operator Tλ(·) defined as
Tλ(a):=[sgn(a1)[|a1|−λ]+, . . . , sgn(aN)[|aN|−λ]+]T
the global minimizer of (10) can be written as
If non-negativity of the entries in y is imposed, the solution of (10) subject to y≧0N is obtained by using the vector soft-thresholding operator
Proof. Note first that (10) can be solved element-wise; specifically, each entry yi of y is found by solving the scalar problem y*i=arg miny C(y):=(c/2)y2−aiy+λ|y|, which has a non-differentiable cost. The necessary and sufficient condition for y*i to minimize C(y) is [27, p. 92]
which is satisfied by y*i=(1/c)sgn(ai)[|ai|−λ]+; see also [21]. When y is enforced to be non-negative, solution (12) can be easily derived from (13) element-wise. □
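The element-wise soft-thresholding of Lemma 2 can be sketched in Python as follows, for the scalar cost C(y)=(c/2)y²−ay+λ|y| stated in the proof. The function name and the `nonneg` flag are illustrative assumptions; the flag selects the variant in which each entry is constrained to be non-negative.

```python
from typing import List, Sequence


def soft_threshold(a: Sequence[float], lam: float,
                   c: float = 1.0, nonneg: bool = False) -> List[float]:
    """Entry-wise minimizer of (c/2)*y^2 - a_i*y + lam*|y|.

    With nonneg=True the minimization is restricted to y >= 0, so
    negative inputs are mapped to zero.
    """
    out = []
    for ai in a:
        if nonneg:
            out.append(max(0.0, ai - lam) / c)
        else:
            mag = max(0.0, abs(ai) - lam) / c
            out.append(mag if ai >= 0 else -mag)
    return out
```

Entries whose magnitude does not exceed λ are set exactly to zero, which is what produces sparsity at the single-coefficient level.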
Consider now the NbNg×1 auxiliary vector variables γ and ξ, and neglect irrelevant terms to re-write the GS-Lasso problem as:
where R:=Σr=1N
Letting η and μ denote the Lagrange multipliers associated with the equality constraints Θ=γ and Θ=ξ, respectively, the quadratically augmented Lagrangian function of problem (14) is
where c1, c2>0 are arbitrary constants. Then, for any initial vectors γ(0), ξ(0), η(0), μ(0), the ADMoM algorithm entails the following primal-dual iterative updates
where j=1, 2, . . . is the iteration index. The first step updates the primal vector Θ(j) by using the values of the auxiliary variables and the Lagrange multipliers obtained at the previous iteration j−1; since L(·) is quadratic in Θ, the convex optimization problem (16a) can be solved in closed form as
θ(j)=(R+(c1+c2)IN
Next, variables γ and ξ can be updated using the newly computed vector θ(j), with the Lagrange multipliers fixed from the previous iteration. Inspection of the function L(θ(j), γ, ξ, η(j-1), μ(j-1)) reveals that (16b) can be split into two sub-problems, where minimization over γ and ξ can be performed separately. After neglecting irrelevant terms, minimization of (16b) wrt γ reduces to the following non-differentiable convex problem
which, in turn, can be separated in following Ng sub-problems
where γg and ηg are Nb×1 sub-vectors of γ and η, respectively, collecting elements {γn}n=N
From Lemma 1, the global minimizer of each sub-problem (19) is given by
Upon neglecting constant terms, minimization of (16b) wrt the non-negative variable ξ can be obtained after solving the following non-smooth convex problem
which, from Lemma 2, admits the following closed-form solution
The overall ADMoM-based solver for GS-Lasso problems is tabulated as Algorithm 1.
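The primal-dual iterations described above can be sketched in pure Python as follows. This is a sketch under assumptions rather than a reproduction of Algorithm 1: the least-squares term is scaled by 1/2, the groups are consecutive blocks of n_b coefficients, and the helper `solve_linear` and all names are illustrative.

```python
import math
from typing import List, Sequence, Tuple


def solve_linear(A: List[List[float]], b: List[float]) -> List[float]:
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= f * M[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def gs_lasso_admm(B: Sequence[Sequence[float]], phi: Sequence[float],
                  n_b: int, lam1: float, lam2: float,
                  c1: float = 1.0, c2: float = 1.0,
                  iters: int = 500) -> Tuple[List[float], List[float]]:
    """ADMM sketch for min over theta >= 0 of
    0.5*||phi - B theta||^2 + lam1*||theta||_1 + lam2*sum_g ||theta_g||_2,
    using the splitting theta = gamma (group term), theta = xi (l1 term).
    """
    n = len(B[0])
    # R = B^T B and r = B^T phi define the quadratic part of the cost.
    R = [[sum(row[i] * row[j] for row in B) for j in range(n)]
         for i in range(n)]
    r = [sum(row[i] * p for row, p in zip(B, phi)) for i in range(n)]
    A = [[R[i][j] + (c1 + c2 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    theta = [0.0] * n
    gamma, xi, eta, mu = ([0.0] * n for _ in range(4))
    for _ in range(iters):
        # Primal update of theta: closed-form solve of a linear system.
        rhs = [r[i] - eta[i] - mu[i] + c1 * gamma[i] + c2 * xi[i]
               for i in range(n)]
        theta = solve_linear(A, rhs)
        # gamma update: group-wise vector soft-thresholding (Lemma 1).
        for g in range(0, n, n_b):
            a = [c1 * theta[i] + eta[i] for i in range(g, g + n_b)]
            norm_a = math.sqrt(sum(v * v for v in a))
            scale = (max(0.0, norm_a - lam2) / (c1 * norm_a)
                     if norm_a > 0 else 0.0)
            gamma[g:g + n_b] = [scale * v for v in a]
        # xi update: non-negative scalar soft-thresholding (Lemma 2).
        xi = [max(0.0, c2 * theta[i] + mu[i] - lam1) / c2
              for i in range(n)]
        # Dual (Lagrange multiplier) updates.
        eta = [eta[i] + c1 * (theta[i] - gamma[i]) for i in range(n)]
        mu = [mu[i] + c2 * (theta[i] - xi[i]) for i in range(n)]
    return theta, xi
```

Every step is either a fixed linear solve or a closed-form thresholding, so the per-iteration cost is dominated by the linear system, whose matrix does not change across iterations and could be factored once in a more careful implementation.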
The distinct feature of the proposed ADMoM-based algorithm for solving the GS-Lasso problem (14) is its computationally affordable implementation, offered by the closed-form expressions for the primal variable updates; as well as the simple updates of the dual variables η(j) and μ(j). Furthermore, since ADMoM has provable convergence to the global minimizer when the considered problem is convex, convergence of the proposed algorithm to ̂Θ in (6) is ensured as stated next.
Proposition 1
For any c1,c2>0 and any initializing vectors γ(0), ξ(0), η(0) and μ(0), the iterates (17) for Θ(j), (20) for {γg(j)}, (22) for ξ(j), and (16c)-(16d) for η(j) and μ(j), respectively, are convergent. Also, Θ(j) converges to the solution of the GS-Lasso (6); i.e.,
Shadow fading as well as possible slow temporal variations of the PU transmit-PSDs lead to time-varying {Φx
where βτ,tε(0,1] is the so-called forgetting factor, and index τ=1, . . . , t emphasizes the temporal variability of channels and received PSDs. Also, to address the need for real-time processing, the estimation of Θ in (23) can be performed on-line, where each iteration of the ADMoM algorithm is performed after acquiring new estimates {φx
Algorithm 1 is centralized, meaning that the whole set of PSD estimates {φx
Techniques for spectrum sensing under channel uncertainties are now described, including a Group Sparse Total Least-Squares (TLS) technique.
Uncertainty in the matrices {Bx
TLS is the workhorse used for estimating non-sparse vectors obeying an over-determined linear system of equations with uncertainty present in both the regression matrix and the observations (fully-perturbed model). Sparsity in the estimate has also been taken into account by extending the TLS framework to solve sparse under-determined fully-perturbed linear systems. The sparse TLS approach is broadened here to account for sparsity present both at individual entries and at groups of entries.
Define
and E an NNr×NgNb matrix capturing perturbations corrupting the matrix B. Consider now estimating Θ as follows:
Relative to the classical TLS, the cost in the group sparse (GS-)TLS problem (24) is augmented with the regularization terms accounting for the two forms of sparsity inherent to Θ. Compared to the sparse TLS formulation, problem (24) also includes the term λ2RG(Θ).
Problem (24) is generally non-convex due to the presence of the product EΘ; thus, it is in general difficult to obtain a globally-optimal solution. However, a novel reduced-complexity algorithm with provable convergence to a stationary point of (24) will be developed in the ensuing section.
An alternating descent solver is now described. The cost in (24) will be optimized here iteratively using a block coordinate descent algorithm, which cyclically minimizes it wrt E (keeping Θ fixed), and wrt Θ after fixing E. Specifically, the following two steps are performed at the i-th iteration:
By fixing ̂E(i), (24) boils down to a GS-Lasso problem; thus, ̂Θ(i) can be computed by using the ADMoM-based solver after replacing (17) with the following update, where j still represents the index for the (inner) ADMoM iterations:
The quadratic convex problem (26) admits the following closed form solution
Ê(i)=(1+∥θ̂(i)∥22)−1[
which can be obtained after equating the derivative of the cost in (26) with zero. The overall solver for GS-TLS is tabulated as Algorithm 2.
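The closed-form perturbation update can be sketched in Python as follows. Because the closed-form expression is not fully reproduced above, the sketch assumes the standard total least-squares fit term ∥E∥F² + ∥φ − (B+E)θ∥², for which the minimizer over E is the rank-one matrix (1+∥θ∥²)⁻¹(φ − Bθ)θᵀ; the function names are illustrative.

```python
from typing import List, Sequence


def tls_perturbation_update(B: Sequence[Sequence[float]],
                            phi: Sequence[float],
                            theta: Sequence[float]) -> List[List[float]]:
    """E = (1 + ||theta||^2)^(-1) * (phi - B theta) theta^T (assumed form)."""
    resid = [p - sum(b * t for b, t in zip(row, theta))
             for p, row in zip(phi, B)]
    scale = 1.0 / (1.0 + sum(t * t for t in theta))
    return [[scale * ri * tj for tj in theta] for ri in resid]


def tls_fit_cost(B: Sequence[Sequence[float]],
                 E: Sequence[Sequence[float]],
                 phi: Sequence[float],
                 theta: Sequence[float]) -> float:
    """||E||_F^2 + ||phi - (B + E) theta||^2, regularizers omitted."""
    frob = sum(e * e for row in E for e in row)
    resid = [p - sum((b + e) * t for b, e, t in zip(rb, re, theta))
             for p, rb, re in zip(phi, B, E)]
    return frob + sum(r * r for r in resid)
```

Under this assumed cost, the update lowers the fit term from ∥φ − Bθ∥² (for E=0) to ∥φ − Bθ∥²/(1+∥θ∥²), which is consistent with each block coordinate step not increasing the overall cost.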
Under certain conditions, the block coordinate descent algorithm is known to converge (at least) to a local optimum point, as asserted next.
Proposition 2. For any initialization {θ̂(0), Ê(0)}, the iterates {θ̂(i), Ê(i)} in (25)-(26) converge monotonically to a stationary point of problem (24).
The following describes outlier-resilient spectrum sensing techniques, including Robust GS-TLS.
The problem dealt with in the previous section accounts for uncertainty in the entries of the regression matrix B. However, due to particularly abrupt local shadow fading, failures of the sensing modules, or unexpected narrow-band impulsive noise and/or interference, CR observations may be affected by abundant errors. This section develops schemes for discerning and removing the observations that largely deviate from the underlying model (a.k.a. outliers).
A simple heuristic to detect unreliable data could be to estimate Θ via (24), compute the residuals and, then, reject the PSD observations whose residuals exceed a certain threshold. A more systematic method accounts for possible outliers by augmenting the underlying linear regression model with an auxiliary outlier vector. Using this model, the received PSD at the CR locations can be expressed as
where the nonzero entries of the NNr×1 real vector o capture outliers; and e is a proper vectorization of the periodogram estimation errors. Since few outliers are expected compared to the total number of data collected, the vector o is sparse.
Capitalizing on the three forms of sparsity emerging from (i) the grid-based model (group sparsity), (ii) the PSD basis expansion (single-coefficient sparsity) and (iii) the outliers (single-coefficient sparsity), and accounting for perturbations in the regression matrix, the following robust GS-TLS is considered. In this way, a robust GS-TLS technique is described that addresses errors introduced by uncertainties, such as estimate errors, grid offsets, and BEM approximation errors, and also handles measurement corruption by inclusion of outliers.
where λ3≧0 promotes the (single-coefficient) sparsity of vector ô. The nature of the perturbations captured in E and o is in general different: o collects unmodeled errors, whereas E describes (small) perturbations. It is also worth noticing that the support of ô reveals the unreliable CR data.
An alternating descent algorithm is described. Although (30) is generally a non-convex problem, a block coordinate descent algorithm can still be employed. In this case, the cost in (30) will be iteratively minimized wrt E and {Θ,o}; that is, the following two updates are performed at the i-th iteration:
The quadratic problem (32) can be solved in closed form, to obtain
Ê(i)=(1+∥θ̂(i)∥22)−1[
As for (31), the ADMoM can be employed to find its optimal solution. To this end, (31) can be re-formulated as:
with γ and ξ denoting auxiliary vector variables. Letting again η and μ denote the Lagrange multipliers associated with the constraints Θ=γ and Θ=ξ, respectively, the quadratically augmented Lagrangian function of (34) is given by
Starting from any initial vectors o(0), γ(0), ξ(0), η(0), μ(0), each iteration j of the ADMoM (within each iteration i of the block coordinate descent) proceeds in these steps:
Since (34) is convex and satisfies the requirements for the ADMoM to be convergent, iterates {Θ(i,j),o(i,j)} will converge to the solution {̂Θ(i),̂o(i)} of (31).
Problem (36a) admits the closed-form solution given by
where γ(j) and ξ(j) are still computed via (20) and (22). Using Lemma 2, it is possible to show that the solution of (36d) is computed via soft-thresholding as
o(i,j)=ℑλ
At each step of the ADMoM algorithm the soft-thresholding in (38) tags as outliers the (current) residuals
With arguments similar to Proposition 2, the following result can be asserted.
Proposition 3. For any initialization {θ̂(0), ô(0), Ê(0)}, the iterates {θ̂(i), ô(i), Ê(i)} in (31)-(32) converge monotonically to a stationary point of problem (30).
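The outlier-tagging step of the robust scheme can be sketched in Python as follows. The sketch assumes a fit term of the form (1/2)∥φ − Bθ − o∥² plus λ3∥o∥1 with the perturbation E already absorbed into B, so that the outlier vector is obtained by soft-thresholding the current residuals; the scaling and the function name are assumptions.

```python
from typing import List, Sequence, Tuple


def tag_outliers(phi: Sequence[float],
                 B: Sequence[Sequence[float]],
                 theta: Sequence[float],
                 lam3: float) -> Tuple[List[float], List[int]]:
    """Soft-threshold the residuals phi - B theta at level lam3.

    Returns the outlier vector o and the indices of its nonzero
    entries, i.e. the observations flagged as unreliable.
    """
    resid = [p - sum(b * t for b, t in zip(row, theta))
             for p, row in zip(phi, B)]
    o = []
    for r in resid:
        mag = max(0.0, abs(r) - lam3)
        o.append(mag if r >= 0 else -mag)
    flagged = [i for i, v in enumerate(o) if v != 0.0]
    return o, flagged
```

Residuals smaller than λ3 in magnitude are attributed to ordinary estimation noise and left in the fit, while larger residuals are moved into the outlier vector, so the support of that vector identifies the unreliable CR measurements.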
Simulations are now described. Consider a set of Nr=50 CRs uniformly distributed in an area of 100 m×100 m, cooperating to localize Ns=2 active PUs and estimate their PSD map. CRs and PU sources are marked with blue circles and red triangles, respectively, in
Transmitted signals are searched over a grid of Nb=10 evenly spaced center frequencies fc=95+vW, vε{1, . . . , 10}. Each CR computes periodogram samples at N=64 frequencies at a signal-to-noise ratio (SNR) of −5 dB, and averages them across τ=100 time-slots to form ̂Φx
In the first experiment, the PSD generated by PU s experiences only small-scale fading in its propagation from xs to any location x, where it is measured in the presence of noise with variance σv2=0.1. To simulate small-scale fading {hx
In
The enhanced localization and power estimation capabilities also impact the subsequent CR power allocation task, which relies on the estimated coverage region of the PU transmitters to re-use the licensed bands without causing harmful interference to any potential PU receiver. GS-Lasso will be preferable if a coarse description of the “interference-heavy” areas in terms of PU activity is desired over, say, accurate localization and transmit-power estimation of the PUs. Such a coarse description can be useful for, e.g., temporal (rather than spatio-temporal) frequency re-use purposes.
As described, abrupt local shadow fading may severely compromise the PSD estimates at CRs, and thus degrade the sensing performance. This is the case considered in
As described above, the radio frequency (RF) spectrum is a precious resource that must be utilized efficiently. Fixed spectrum allocation, which confers exclusive access rights on spectrum license holders, has resulted in significant under-utilization of the valuable spectral resource, depending on time and locations. The cognitive radio (CR) strategy aims at alleviating this inefficiency by allowing unlicensed secondary users to opportunistically transmit, provided that the transmissions do not disturb the communication of licensed primary users (PUs). To achieve the necessary protection of PU systems, CR systems provide spectrum sensing and intelligent resource allocation. Spectrum sensing is used to identify unused spectral resources in the frequency, time and space domains. The “spectrum holes” can then be exploited through agile resource allocation.
A simplifying assumption often made for spectrum sensing is that the spectrum occupancy is more or less invariant over the deployment region of the CR systems. Based on this, spectrum sensing is often performed in a collaborative fashion, where the band occupancy by a common set of PU transmitters is detected using observations fused from multiple CRs. This effectively mitigates fading and shadowing, which impede reliable detection of PU presence.
However, the assumption might not hold when PU systems employ a small RF footprint for significant spatial reuse, or when the CR network grows in size and is deployed in a broader geographical region. An instrumental concept in this case is RF cartography, which provides a map of RF power distribution over space and reveals the spatial variation of spectrum occupancy. Such a construct is useful for optimizing CR network operations, not only in the PHY/MAC, but also in higher layers.
Techniques are described herein that allow the interference power present at each CR node in the network to be acquired, and its future levels to be predicted. Some example techniques may address the challenge that CRs do not have prior information on the number of PU emitters and the corresponding PU-CR channel gains, which may be needed for combining the measurements from different sensors. Moreover, CRs might not be able to report their measurements every time, due to energy-saving sleep modes or congested signaling channels. A network controller operating in accordance with the techniques described herein accounts for missing observations by performing appropriate interpolation. The future spectrum state must be inferred based on past measurements. Machine learning and compressive sensing techniques are employed herein to tackle these challenges. A general dictionary learning framework is adapted here to learn the spatial and temporal patterns of the RF power distribution. To aid in spatial interpolation, the topology information of the CR network is exploited in a semi-supervised learning fashion. Batch and online algorithms are developed. The online techniques can track the slow variation of the RF power distribution, and feature low computational complexity and modest memory requirements.
Spatio-temporal spectrum prediction algorithms for cognitive radios (CRs) are developed using the framework of dictionary learning and compressive sensing. The interference power levels at each CR node location are predicted using the measurements from a subset of CR nodes without a priori knowledge of the primary transmitters. A semi-supervised dictionary learning approach for the interpolation of missing observations in space is described. The proposed techniques are also extended for spatio-temporal prediction. Batch and online alternatives are presented, where the online algorithm features low complexity and memory requirements. Numerical tests verify the performance of the proposed novel methods.
Returning to
Suppose that there are K PU transmitters (RF-enabled devices 15) in the area, with the k-th PU transmitting at power $p_k(t)$. Let $g_{mk}(t)$ denote the channel gain from the k-th PU to the m-th CR. Then, the interference power level $\pi_m(t)$ perceived at the m-th CR can be modeled as
$\pi_m(t)=\sum_{k=1}^{K}g_{mk}(t)\,p_k(t)$. (1.1)
Upon defining vectors $\pi(t):=[\pi_1(t),\ldots,\pi_M(t)]^\top$ (where $(\cdot)^\top$ denotes transposition) and $p(t):=[p_1(t),\ldots,p_K(t)]^\top$, as well as the matrix $G(t)$ whose (m,k)-entry is $g_{mk}(t)$, the matrix-vector counterpart of (1.1) can be expressed as
$\pi(t)=G(t)\,p(t)$. (1.2)
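At each time t, a subset $\mathcal{M}_{obs}(t)\subset\mathcal{M}:=\{1,\ldots,M\}$ of CRs observes the interference power levels. The measurements from these CRs can be stacked in the vector $y_{obs}(t)\in\mathbb{R}^{|\mathcal{M}_{obs}(t)|}$, given by
$y_{obs}(t)=O(t)\,\pi(t)+z(t)$ (1.3)
where $O(t)\in\mathbb{R}^{|\mathcal{M}_{obs}(t)|\times M}$ selects the entries of $\pi(t)$ that are observed at time t, and $z(t)$ denotes the measurement noise.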
The problem of estimating $G(t)$ and $p(t)$, given the past and current measurements $y_{obs}(\tau)$ for $\tau=1,2,\ldots,t$, when there are no missing observations was tackled using dictionary learning in [13]. Here, the goal is to predict the missing interference levels $\pi_m(t)$ for $m\in\mathcal{M}_{miss}(t):=\mathcal{M}\setminus\mathcal{M}_{obs}(t)$, given $\{y_{obs}(\tau)\}_{\tau=1}^{t}$.
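As a minimal illustrative sketch of the model in (1.2) and (1.3), the following Python example (assuming NumPy) forms the interference vector and keeps only the entries reported by a subset of CRs. The function name `simulate_observation`, the argument `obs_idx` (indices of the reporting CRs), the Gaussian noise, and the numerical values are hypothetical and are used for illustration only.

```python
import numpy as np

def simulate_observation(G, p, obs_idx, noise_std=0.0, rng=None):
    """Return (pi, y_obs) with pi = G p and y_obs = O pi + z."""
    rng = np.random.default_rng() if rng is None else rng
    pi = G @ p                          # (1.2): interference power at all M CRs
    O = np.eye(G.shape[0])[obs_idx]     # rows of the identity that select reporting CRs
    z = noise_std * rng.standard_normal(len(obs_idx))  # assumed Gaussian noise
    return pi, O @ pi + z               # (1.3)

# Hypothetical example with M = 3 CRs and K = 2 PUs; CR 1 does not report.
G = np.array([[1.0, 0.2],
              [0.5, 0.5],
              [0.1, 0.9]])
p = np.array([2.0, 1.0])
pi, y_obs = simulate_observation(G, p, obs_idx=[0, 2])
```

In this noiseless example the goal stated above corresponds to recovering the second entry of `pi` from `y_obs` and past observations.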
Techniques for spatial spectrum prediction using semi-supervised dictionary learning are described. Prompted by (1.2), suppose that $\pi$ can be represented as a linear combination of a small number of bases (atoms) taken from a dictionary. Let $D\in\mathbb{R}^{M\times Q}$ denote a dictionary with Q atoms. Then, the preceding assumption amounts to
$\pi=Ds$ (1.4)
where the vector $s\in\mathbb{R}^{Q}$ is sparse. Fourier and wavelet bases are among the dictionaries often used for a variety of natural or man-made signals. Based on this model, the techniques described below leverage compressive sensing and machine learning to predict the unobserved interference levels in space. In the following, a two-stage approach is first considered, in which the dictionary learned in the training phase is used for the desired prediction task in the operational phase. An alternative algorithm capable of performing the dictionary learning and spatial prediction simultaneously is presented subsequently.
An example two-phase batch implementation is described. Instead of using off-the-shelf bases such as Fourier or wavelet bases, $D$ can be directly learned from training data, which can be collected through a sounding procedure. In the presence of missing entries in the data, it is helpful to augment this learning process with additional structural information. In this work, the network topology information, which is typically maintained for various network control tasks such as routing, is leveraged in the framework of semi-supervised learning.
Let $A\in\{0,1\}^{M\times M}$ denote the adjacency matrix of the CR network topology. Thus, the (m,m′)-th entry $a_{m,m'}$ of $A$ for $m,m'\in\mathcal{M}$ is 1 if nodes m and m′ are neighbors, and 0 otherwise. Then, the Laplacian matrix is defined as $L:=\mathrm{diag}(A\mathbf{1})-A$, where $\mathbf{1}$ is the all-one vector, and diag(v) is a diagonal matrix with the entries of vector v on its diagonal.
Training Phase:
In the training phase, given a training set $\{y_n^{obs}\}_{n=1}^{N}$, which may contain missing entries, the goal is to form an estimate $\hat{D}$ of $D$ such that $y_n^{obs}\approx O_n\hat{D}s_n$ for n=1, 2, . . . , N, where the coefficients $s_n$ are sparse, and the matrix $O_n$ discards the missing entries. Specifically, the following optimization problem is solved to obtain $\hat{D}$.
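One form of the training problem that is consistent with the three terms discussed next, and with the definition $\tilde{L}_n:=O_n^\top O_n+\lambda_L L$ used in the implementation, can be sketched as follows. The 1/2 scalings and the unit-norm constraint on the atoms $d_q$ (the columns of $D$) are assumptions of this sketch and are not taken from the text.

```latex
(\hat{D},\{\hat{s}_n\}) = \arg\min_{D,\{s_n\}} \sum_{n=1}^{N}
\left[ \tfrac{1}{2}\,\| y_n^{obs} - O_n D s_n \|_2^2
 + \lambda_s \| s_n \|_1
 + \tfrac{\lambda_L}{2}\, s_n^\top D^\top L D s_n \right]
\quad \text{subject to} \quad \| d_q \|_2 \le 1,\; q = 1,\ldots,Q
```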
Here, the first term in (1.6) promotes fitness of the reconstruction to the training datum in a least-squares (LS) sense, and the l1-norm-based regularization term encourages sparsity in s with λs>0 playing the role of a tuning parameter. The third term in (1.6) can be re-written as
$s^\top D^\top L D s=\frac{1}{2}\sum_{m=1}^{M}\sum_{m'=1}^{M}a_{m,m'}(\pi_m-\pi_{m'})^2$ (1.8)
indicating that it encourages the interference levels experienced at neighboring nodes to be similar, with λL>0 being a tuning parameter.
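The smoothness interpretation of the Laplacian term can be checked numerically. The following Python sketch (assuming NumPy) builds the Laplacian from an adjacency matrix and compares the quadratic form with the sum of squared differences across neighboring nodes, in which each undirected link is counted once (equivalently, one half of the double sum over ordered node pairs). The four-node path topology, the values in `pi`, and the function names are hypothetical.

```python
import numpy as np

def graph_laplacian(A):
    """L = diag(A 1) - A for a symmetric 0/1 adjacency matrix A."""
    A = np.asarray(A, dtype=float)
    return np.diag(A.sum(axis=1)) - A

def pairwise_smoothness(A, pi):
    """0.5 * sum over ordered pairs (m, m') of a_{m,m'} (pi_m - pi_m')^2."""
    diff = pi[:, None] - pi[None, :]
    return 0.5 * np.sum(np.asarray(A) * diff ** 2)

# Hypothetical 4-node path topology: 0 - 1 - 2 - 3
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]])
pi = np.array([1.0, 1.2, 3.0, 2.9])   # interference levels at the four nodes
L = graph_laplacian(A)
quad = pi @ L @ pi                     # quadratic form pi^T L pi
```

Here the quadratic form is dominated by the link between nodes 1 and 2, whose levels differ the most, which is the kind of variation the regularizer penalizes.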
To appreciate the role of this Laplacian matrix-based regularization, suppose that a CR never reports an observation during the entire training period. Then, without the last term, the corresponding row in $D$ cannot be estimated, making it impossible to predict the interference level at this CR's location. The presence of the Laplacian term allows one to estimate the missing entry relying on neighbors' measurements.
Operational Phase: Once $\hat{D}$ has been obtained as in (1.6), the operational phase predicts the interference levels. First, a sparse coding step is performed at each time t to estimate the sparse coefficient vector $\hat{s}(t)$ corresponding to the measurement $y_{obs}(t)$; that is,
Then, the desired interference levels in $\pi(t)$, including the missing entries, can be recovered as $\hat{\pi}(t):=\hat{D}\hat{s}(t)$.
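The operational phase can be illustrated with a short Python sketch (assuming NumPy). It assumes that the sparse coding cost is the sum of a least-squares fit on the reported entries, an l1 penalty weighted by λs, and the Laplacian penalty weighted by λL/2; this form, the use of the iterative shrinkage-thresholding algorithm (ISTA) as the solver, and all names and numerical values are assumptions made for illustration, not the solver of this disclosure.

```python
import numpy as np

def soft_threshold(v, thr):
    """Entry-wise soft-thresholding, the proximal operator of thr * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - thr, 0.0)

def objective(s, y_obs, obs_idx, D, L, lam_s, lam_L):
    """Assumed cost: 0.5||y_obs - O D s||^2 + lam_s ||s||_1 + 0.5 lam_L s'D'LDs."""
    r = y_obs - D[obs_idx] @ s
    pi = D @ s
    return 0.5 * r @ r + lam_s * np.abs(s).sum() + 0.5 * lam_L * pi @ L @ pi

def sparse_code(y_obs, obs_idx, D, L, lam_s, lam_L, n_iter=500):
    """ISTA iterations on the assumed cost with a fixed dictionary D."""
    DO = D[obs_idx]                          # O D: dictionary rows at reporting CRs
    H = DO.T @ DO + lam_L * D.T @ L @ D      # Hessian of the smooth part
    step = 1.0 / max(np.linalg.eigvalsh(H)[-1], 1e-12)  # 1 / largest eigenvalue
    s = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = H @ s - DO.T @ y_obs          # gradient of the smooth part
        s = soft_threshold(s - step * grad, step * lam_s)
    return s

# Hypothetical example: M = 6 nodes on a path graph, Q = 4 atoms.
rng = np.random.default_rng(0)
M, Q = 6, 4
D = rng.standard_normal((M, Q))
A = np.diag(np.ones(M - 1), 1)
A = A + A.T
L = np.diag(A.sum(axis=1)) - A
s_true = np.array([0.0, 1.5, 0.0, 0.0])
obs_idx = [0, 2, 3, 5]                        # CRs 1 and 4 do not report
y_obs = (D @ s_true)[obs_idx]
s_hat = sparse_code(y_obs, obs_idx, D, L, lam_s=0.05, lam_L=0.01)
pi_hat = D @ s_hat                            # prediction at all M nodes
```

Entries 1 and 4 of `pi_hat` are the imputed interference levels at the CRs that did not report.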
Implementation:
Problem (1.9) is convex, and various specialized algorithms are available for solving problems of this sort very quickly. On the other hand, (1.6) is nonconvex, and it is difficult to obtain globally optimal solutions. However, the problem is convex with respect to $D$ and $\{s_n\}$ individually. Thus, to find a locally optimal solution, a block-coordinate descent (BCD) algorithm can be employed until convergence.
Specifically, at the k-th iteration, updates are done as:
where $\hat{D}^{(k)}$ and $\{\hat{s}_n^{(k)}\}$ are the k-th iterates. Note that (1.10) can be solved separately per n=1, 2, . . . , N using the same solver as the one for (1.9). To solve (1.11), a BCD algorithm can once again be employed over the columns of $D$. Define $\tilde{L}_n:=O_n^\top O_n+\lambda_L L$, and let $s_{n,q}$ denote the q-th entry of vector $s_n$. Then, the overall dictionary training algorithm is presented in Table 1.
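A column-wise dictionary update of the kind described above can be sketched in Python (assuming NumPy). The sketch minimizes, over one column at a time, an assumed cost consisting of the least-squares fit on the observed entries plus the Laplacian penalty weighted by λL/2, with the sparse coefficients held fixed. It omits any norm constraint on the atoms and is not a reproduction of the steps of Table 1; the function names, the zero-filled data matrix `Y`, and the boolean `mask` of observed entries are hypothetical.

```python
import numpy as np

def dictionary_objective(D, S, Y, mask, L, lam_L):
    """Assumed cost: sum_n 0.5||O_n(y_n - D s_n)||^2 + 0.5 lam_L s_n'D'LDs_n."""
    P = D @ S                                    # predicted levels, one column per sample
    resid = np.where(mask, Y - P, 0.0)           # only observed entries contribute
    return 0.5 * np.sum(resid ** 2) + 0.5 * lam_L * np.sum(P * (L @ P))

def update_dictionary(D, S, Y, mask, L, lam_L, n_sweeps=1):
    """Exact minimization over each column of D in turn, with S held fixed."""
    D = D.copy()
    W = np.asarray(mask, dtype=float)            # column n is the diagonal of O_n' O_n
    Yz = np.where(mask, Y, 0.0)                  # O_n' y_n (zero-filled measurements)
    for _ in range(n_sweeps):
        for q in range(D.shape[1]):
            w = S[q]                             # coefficients s_{n,q}, n = 1..N
            if not np.any(w):
                continue                         # atom unused: nothing to update
            R = D @ S - np.outer(D[:, q], w)     # contribution of the other atoms
            # sum_n s_{n,q}^2 (O_n'O_n + lam_L L)
            lhs = np.diag(W @ (w ** 2)) + lam_L * (w @ w) * L
            # sum_n s_{n,q} (O_n'y_n - (O_n'O_n + lam_L L) r_n)
            rhs = (Yz - W * R - lam_L * (L @ R)) @ w
            D[:, q] = np.linalg.lstsq(lhs, rhs, rcond=None)[0]
    return D

# Hypothetical training data: M = 5 nodes on a path graph, Q = 3 atoms, N = 40 samples.
rng = np.random.default_rng(1)
M, Q, N, lam_L = 5, 3, 40, 0.05
A = np.diag(np.ones(M - 1), 1)
A = A + A.T
L = np.diag(A.sum(axis=1)) - A
S = rng.standard_normal((Q, N)) * (rng.random((Q, N)) < 0.4)   # sparse coefficients
Y = rng.standard_normal((M, Q)) @ S
mask = rng.random((M, N)) < 0.7                  # True where a CR reported
D0 = rng.standard_normal((M, Q))
D1 = update_dictionary(D0, S, Y, mask, L, lam_L, n_sweeps=5)
```

Because each column update is an exact minimization of a convex quadratic, the assumed cost cannot increase from `D0` to `D1`; in a full training loop this step would alternate with sparse coding of each training sample.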
Example online techniques are now described. To track time-varying statistics of the interference patterns, an online algorithm can be derived in which dictionary training and spatial interference prediction are performed jointly. Compared to the batch training discussed above, the online algorithm performs the computation recursively, resulting in significant savings in complexity and memory.
Specifically, the following formulation is adopted, which weights recent observations more heavily.
where $\beta\in(0,1]$ is a forgetting factor. Instead of solving problem (1.12) in a batch fashion for the entire time horizon $\tau=1,2,\ldots,t$ whenever a new observation $y_{obs}(t)$ arrives at time t, an online approach updates only the “current” coefficient vector $\hat{s}(t)$, while the past ones $\hat{s}(t-1),\ldots,\hat{s}(1)$ are held fixed. Nevertheless, it can be shown under mild conditions that the $\hat{D}$ so obtained converges as $t\to\infty$ to the same $\hat{D}$ as would be obtained from a batch approach.
Although the dictionary update depends on the entire observation history, a recursive computation can avoid storing the past observations and calculations. For this, it is useful to maintain the following quantities:
where $\mathbb{1}\{\cdot\}$ is an indicator function equal to 1 if the condition inside the braces is satisfied, and 0 otherwise.
Similar to the batch case described above, the dictionary update amounts to solving (1.12) for $\hat{D}(t)$ with $\{\hat{s}(\tau)\}_{\tau=1}^{t}$ fixed. Let $\hat{s}_j(\tau)$ denote the j-th entry of $\hat{s}(\tau)$, and $A_{m,jq}(t)$ and $A_{jq}(t)$ the (j,q)-th entries of matrices $A_m(t)$ and $A(t)$, respectively. Also, let $b_j(t)$ represent the j-th column of $B(t)$. Then, upon defining
the column-wise BCD leads to the following update for the j-th column of $\hat{D}(t)$
An example method for online spectrum prediction is given in Table II.
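The recursive bookkeeping behind an online dictionary update can be sketched in Python (assuming NumPy). The forms of the maintained quantities used here, namely exponentially weighted sums of ŝŝᵀ (overall in `A` and per CR in `Am`, the latter counted only when that CR reports) and of the zero-filled measurements times ŝᵀ in `B`, are assumptions chosen to be consistent with the description above, as is the cost with 1/2 and λL/2 scalings. The class and method names are hypothetical, and the sketch does not reproduce the steps of Table II.

```python
import numpy as np

class OnlineStats:
    """Exponentially weighted statistics for an online dictionary update (assumed forms)."""

    def __init__(self, M, Q, beta):
        self.beta = beta
        self.A = np.zeros((Q, Q))       # sum of beta^(t-tau) s s'
        self.Am = np.zeros((M, Q, Q))   # same sum, counted only when CR m reports
        self.B = np.zeros((M, Q))       # sum of beta^(t-tau) O' y_obs s'

    def update(self, s, y_obs, obs_idx):
        """Fold in one time step; obs_idx must hold distinct CR indices."""
        ss = np.outer(s, s)
        self.A = self.beta * self.A + ss
        self.Am *= self.beta
        self.Am[obs_idx] += ss
        self.B *= self.beta
        self.B[obs_idx] += np.outer(y_obs, s)

    def update_column(self, D, j, L, lam_L):
        """Exact minimizer over column j of D with all other columns fixed."""
        lhs = np.diag(self.Am[:, j, j]) + lam_L * self.A[j, j] * L
        rhs = self.B[:, j].copy()
        for q in range(D.shape[1]):
            if q != j:
                rhs -= self.Am[:, j, q] * D[:, q] + lam_L * self.A[j, q] * (L @ D[:, q])
        return np.linalg.lstsq(lhs, rhs, rcond=None)[0]

# Hypothetical run: M = 5 nodes on a path graph, Q = 3 atoms, 30 time steps.
rng = np.random.default_rng(0)
M, Q, beta, lam_L = 5, 3, 0.9, 0.1
A_adj = np.diag(np.ones(M - 1), 1)
A_adj = A_adj + A_adj.T
L = np.diag(A_adj.sum(axis=1)) - A_adj
stats = OnlineStats(M, Q, beta)
history = []                            # kept only to cross-check against a batch computation
for t in range(30):
    obs = np.sort(rng.choice(M, size=3, replace=False))
    s, y = rng.standard_normal(Q), rng.standard_normal(3)
    stats.update(s, y, obs)
    history.append((s, y, obs))
D = rng.standard_normal((M, Q))
D[:, 1] = stats.update_column(D, 1, L, lam_L)
```

The memory held in `stats` does not grow with time, which reflects the storage saving described above; `history` is stored here only so that the result can be compared with a batch computation.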
Techniques for spatio-temporal spectrum prediction are now described. The algorithms developed so far provide imputations for missing measurements of spatial interference distributions, given the (incomplete) measurements of the current and the past time instants. However, to predict the future interference status, for which not even partial measurements are available, one has to incorporate temporal correlation structures into the model, or learn such structures from the data. In this disclosure, the data-driven dictionary learning framework is leveraged to learn temporal dynamics from the data, even in an online fashion.
The observations over T consecutive intervals are concatenated into a super-vector, and the algorithms described above are applied. That is, define
$\bar{y}_{obs}(t):=[y_{obs}^\top(t),\ldots,y_{obs}^\top(t-T+1)]^\top$ (1.20)
$\bar{O}(t):=[O^\top(t),\ldots,O^\top(t-T+1)]^\top$ (1.21)
$\bar{L}:=I_T\otimes L$ (1.22)
where $\otimes$ denotes the Kronecker product. These quantities are used in place of $y_{obs}(t)$, $O(t)$, and $L$, respectively, in the method of Table II.
To perform prediction for π(t+1), after executing line 14 in Table II, compute sparse coefficient ŝf(t+1) for a fictitious observation yobs,f(t+1):=[yobs
$\hat{\pi}(t+1)=\hat{D}(t)[1:M,:]\,\hat{s}_f(t+1)$ (1.23)
where $\hat{D}(t)[1:M,:]$ denotes the first M rows of $\hat{D}(t)$.
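The windowed construction can be sketched in Python (assuming NumPy). The sketch concatenates the T most recent observation vectors (newest first), forms the Kronecker-structured Laplacian, and extracts a prediction from the first M rows of a stacked dictionary. It assumes that each selection matrix acts on its own length-M block of the stacked interference vector, so that dimensions agree with a dictionary of MT rows; this block-wise indexing, the function names, and the numerical values are assumptions of the sketch.

```python
import numpy as np

def stack_window(y_list, obs_list, L, M):
    """Stack T observation vectors (newest first) and build I_T kron L."""
    T = len(y_list)
    y_bar = np.concatenate(y_list)
    # Position of every observed entry inside the length-M*T stacked vector.
    idx_bar = np.concatenate([i * M + np.asarray(obs) for i, obs in enumerate(obs_list)])
    L_bar = np.kron(np.eye(T), L)          # (1.22)
    return y_bar, idx_bar, L_bar

def predict_next(D_bar, s_f, M):
    """Prediction from the first M rows of the stacked dictionary, as in (1.23)."""
    return D_bar[:M] @ s_f

# Hypothetical example: M = 3 nodes on a path graph, window of T = 2 intervals.
M = 3
A = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A
y_list = [np.array([1.0, 2.0]), np.array([3.0])]   # measurements at t and t-1
obs_list = [[0, 2], [1]]                            # reporting CRs at t and t-1
y_bar, idx_bar, L_bar = stack_window(y_list, obs_list, L, M)
```

The stacked quantities can then be passed to a spatial routine in place of the per-interval measurement vector, selection indices, and Laplacian.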
The performance of the proposed techniques was evaluated via numerical tests. A CR network consisting of M=20 nodes with the topology depicted in
where d was the distance, d0=0.01 and α=2.5. The number of atoms of the dictionary was set to Q=50.
First, the two-phase algorithm was tested. Each of the PUs turned on with a 30% chance, and transmitted at a power level $p_k(t)$ chosen from a uniform distribution with support. Each CR made a measurement with a 70% chance. The measurements were corrupted by additive noise generated from a zero-mean Gaussian distribution with variance $10^{-5}$, which was then clipped to ensure non-negativity. The measurements were also normalized by the maximum amplitude observed in the training set. No shadowing or small-scale fading was considered, signifying a quasi-static scenario. N=300 samples were used to train the dictionary, and then another 300 samples were supplied for the operational phase. The values of $\lambda_s$ and $\lambda_L$ were set to 0.1 and 0.005, respectively. To compensate for the bias inherent in Lasso-type estimators, de-biasing was performed in the operational phase; that is, after performing sparse coding in (1.9) to obtain $\hat{s}(t)$, (1.9) was re-solved without the $\ell_1$ regularization term only for the non-zero entries in $\hat{s}(t)$.
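The de-biasing step can be illustrated with a brief Python sketch (assuming NumPy). It keeps the support found by sparse coding and refits only those coefficients by ordinary least squares on the reported entries. Dropping every regularization term in the refit, the tolerance `tol`, the function name `debias`, and the example data are assumptions made for illustration.

```python
import numpy as np

def debias(s_hat, y_obs, obs_idx, D, tol=1e-8):
    """Refit the nonzero entries of s_hat by unregularized least squares."""
    support = np.flatnonzero(np.abs(s_hat) > tol)
    s_db = np.zeros_like(s_hat, dtype=float)
    if support.size:
        sub = D[np.ix_(obs_idx, support)]      # observed rows, active atoms only
        s_db[support] = np.linalg.lstsq(sub, y_obs, rcond=None)[0]
    return s_db

# Hypothetical example: a Lasso-type estimate has the right support but shrunken values.
rng = np.random.default_rng(0)
D = rng.standard_normal((6, 5))
s_true = np.array([0.0, 2.0, 0.0, -1.0, 0.0])
obs_idx = [0, 1, 3, 5]
y_obs = (D @ s_true)[obs_idx]                  # noiseless measurements
s_shrunk = np.array([0.0, 1.7, 0.0, -0.8, 0.0])
s_db = debias(s_shrunk, y_obs, obs_idx, D)
```

In this noiseless example the refit removes the shrinkage on the two active coefficients while leaving the inactive ones at zero.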
To test the online algorithm, the Rayleigh fading coefficient hmk(t) for the channel from PU k to CR m at time t was generated using a first-order autoregressive model
$h_{mk}(t)=\alpha h_{mk}(t-1)+\sqrt{1-\alpha^2}\,w_{mk}(t)$ (1.24)
where α=0.9995 was used, and $w_{mk}(t)$ was circularly symmetric zero-mean complex Gaussian noise with variance 1. The overall channel gain $g_{mk}(t)$ was formed by multiplying the pathloss with $|h_{mk}(t)|^2$. The forgetting factor was β=0.95, and $\lambda_s=0.25$ and $\lambda_L=0.005$ were used. The transmit power of all PUs was fixed to 150 so that the tracking performance would be clearly visible.
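The autoregressive fading model in (1.24) can be simulated with a few lines of Python (assuming NumPy). The function name `simulate_fading`, the number of links and time steps, and the choice to draw the initial coefficient from the stationary unit-variance distribution are assumptions of this sketch.

```python
import numpy as np

def simulate_fading(n_links, n_steps, alpha=0.9995, rng=None):
    """First-order autoregressive Rayleigh fading coefficients, as in (1.24)."""
    rng = np.random.default_rng() if rng is None else rng

    def cn(size):
        # Circularly symmetric zero-mean complex Gaussian samples with variance 1.
        return (rng.standard_normal(size) + 1j * rng.standard_normal(size)) / np.sqrt(2)

    h = np.empty((n_steps, n_links), dtype=complex)
    h[0] = cn(n_links)                     # assumed stationary initialization
    for t in range(1, n_steps):
        h[t] = alpha * h[t - 1] + np.sqrt(1 - alpha ** 2) * cn(n_links)
    return h

h = simulate_fading(n_links=20000, n_steps=50, rng=np.random.default_rng(0))
power = np.abs(h) ** 2                     # multiply by the pathloss to obtain the channel gain
```

The scaling by the square root of 1 − α² keeps the variance of each coefficient at 1 over time, and with α close to 1 the gains drift slowly from one time step to the next.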
The normalized root mean square error (RMSE) for the missing observations, averaged over 20 CRs, is depicted in
In order to test the temporal prediction, certain traffic patterns were assumed. That is, at each time interval t, PU 1 tossed a coin and transmitted with probability 0.1. If PU 1 did transmit, PU 2 transmitted in the next time slot, followed by PU 3's transmission in the third time slot. Likewise, at each time t, PU 3 started transmission with probability 0.15, followed by PU 2 in the second time slot, and PU 1 in the third.
Spectrum prediction algorithms for CR networks have been described. Using a dictionary learning framework, the techniques can predict the interference power experienced at each CR node based on the current and the past measurements collected from a subset of nodes in the network. Exploiting the fact that the spatial variation of interference is smooth, a regularization term based on the CR network topology was also incorporated. Batch and online algorithms were derived, where the online alternative possessed a tracking capability at lower complexity and memory requirements. Temporal prediction was also discussed. Numerical tests verified the efficacy of the techniques.
In this example, a computer 500 includes a processor 510 that is operable to execute program instructions or software, causing the computer to perform various methods or tasks. Processor 510 is coupled via bus 520 to a memory 530, which is used to store information such as program instructions and other data while the computer is in operation. A storage device 540, such as a hard disk drive, nonvolatile memory, or other non-transient storage device, stores information such as program instructions, data files of the multidimensional data and the reduced data set, and other information. The computer also includes various input-output elements 550, including parallel or serial ports, USB, Firewire or IEEE 1394, Ethernet, and other such ports to connect the computer to external devices such as a printer, video camera, surveillance equipment or the like. Other input-output elements include wireless communication interfaces such as Bluetooth, Wi-Fi, and cellular data networks.
The computer itself may be a traditional personal computer, a rack-mount or business computer or server as shown in
The techniques described herein may be implemented in hardware, software, firmware, or any combination thereof. Various features described as modules, units or components may be implemented together in an integrated logic device or separately as discrete but interoperable logic devices or other hardware devices. In some cases, various features of electronic circuitry may be implemented as one or more integrated circuit devices, such as an integrated circuit chip or chipset.
If implemented in hardware, this disclosure may be directed to an apparatus such as a processor or an integrated circuit device, such as an integrated circuit chip or chipset. Alternatively or additionally, if implemented in software or firmware, the techniques may be realized at least in part by a computer readable data storage medium comprising instructions that, when executed, cause one or more processors to perform one or more of the methods described above. For example, the computer-readable data storage medium may store such instructions for execution by a processor. Any combination of one or more computer-readable medium(s) may be utilized.
A computer-readable medium may form part of a computer program product, which may include packaging materials. A computer-readable medium may comprise a computer data storage medium such as random access memory (RAM), read-only memory (ROM), non-volatile random access memory (NVRAM), electrically erasable programmable read-only memory (EEPROM), flash memory, magnetic or optical data storage media, and the like. In general, a computer-readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device. Additional examples of computer readable medium include computer-readable storage devices, computer-readable memory, and tangible computer-readable medium. In some examples, an article of manufacture may comprise one or more computer-readable storage media.
In some examples, the computer-readable storage media may comprise non-transitory media. The term “non-transitory” may indicate that the storage medium is not embodied in a carrier wave or a propagated signal. In certain examples, a non-transitory storage medium may store data that can, over time, change (e.g., in RAM or cache).
The code or instructions may be software and/or firmware executed by processing circuitry including one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), or other equivalent integrated or discrete logic circuitry. Accordingly, the term “processor,” as used herein may refer to any of the foregoing structure or any other processing circuitry suitable for implementation of the techniques described herein. In addition, in some aspects, functionality described in this disclosure may be provided within software modules or hardware modules.
As described herein, spatio-temporal and dynamic re-use of the licensed bands calls for collaborative CR network sensing algorithms able to portray the ambient power spectral density at arbitrary locations in space, frequency, and time. The present paper addressed this ambitious task through a parsimonious model of the PSD in frequency and space, which reduces the sensing task to estimating a sparse vector of unknown parameters. An estimator of the model parameters was developed based on the GS-Lasso, and a low-complexity solver based on the ADMoM was presented. The location and transmit-PSD information conveyed by the estimated model parameters, complemented with either a path loss-based or a more elaborate propagation model, was shown to allow CRs to accurately reconstruct the PSD atlas of the primary system. To cope with uncertainty in the regression matrix, a provably convergent sensing algorithm was introduced which combines the merits of the TLS framework with the hierarchical sparsity inherent to the network-level sensing problem. To account for outliers, a robust algorithm able to discern and reject unreliable PSD data was also developed. The novel robust GS-TLS approach capitalizes on the sparsity of the unknown parameters and the outliers, and offers systematic estimation of the spectrum holes jointly in space, frequency, and time while taking into account channel uncertainties and unmodeled errors.
Further exemplary details are described in: Dall'Anese, “Group sparse Lasso for cognitive network sensing robust to model uncertainties and outliers,” Physical Communication, November, 2011; Dall'Anese, “Group sparse Lasso for cognitive network sensing robust to model uncertainties and outliers,” IEEE 12th International Workshop on Signal Processing Advances in Wireless Communications, Jun. 26, 2011; Dall'Anese, “Distributed Cognitive Spectrum Sensing via Group Sparse Total Least-Squares,” 4th IEEE International Workshop on Computational Advances in Multi-sensor Adaptive Processing, Dec. 13, 2011; and Kim, “Joint Link Learning and Cognitive Radio Sensing,” in Proc. of the 45th Asilomar Conf. on Signals, Systems, and Computers, Pacific Grove, Calif., Nov. 6-9, 2011, the contents of all of which are incorporated herein by reference.
The following clauses illustrate additional examples:
Clause 1. A method comprising: sensing local radio-frequency (RF) interference spectrum at each of a plurality of sensors positioned at a plurality of locations within a geographic region; and computing a basis expansion model from the sensed RF interference spectrum at each of the sensors to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location, wherein the basis expansion model is computed as a plurality of functions having a corresponding coefficient, each of the functions representing a power emitted by an RF-enabled device on a corresponding frequency band, and wherein computing the basis expansion model comprises applying group sparse (GS) total least-squares (TLS) method to compute the coefficients.
Clause 2. The method of clause 1, wherein computing estimates for coefficients of the basis expansion model comprises applying an alternating direction method of multipliers (ADMoM) to compute the coefficients.
Clause 3. The method of clause 1, wherein computing a basis expansion model comprises computing the coefficients of the basis expansion model using a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso).
Clause 4. A system comprising: a plurality of sensors to sense a local radio-frequency (RF) interference spectrum at each of a plurality of locations within a geographic region; and a processor that computes a basis expansion model from the sensed RF interference spectrum at each of the sensors to construct a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location, wherein the basis expansion model is computed as a plurality of functions having a corresponding coefficient, each of the functions representing a power emitted by an RF-enabled device on a corresponding frequency band, and wherein the processor applies a group sparse (GS) total least-squares (TLS) method to compute the coefficients.
Clause 5. The system of clause 4, wherein the processor applies an alternating direction method of multipliers (ADMoM) to compute the coefficients.
Clause 6. The system of clause 4, wherein the processor computes the coefficients of the basis expansion model using a group sparse least-absolute-shrinkage-and-selection operator (GS-Lasso) to identify the coefficients for computation.
Clause 7. A method comprising: sensing, over a period of time, local radio-frequency (RF) interference spectrum at a subset of a plurality of sensors positioned at a plurality of locations within a geographic region; and applying dictionary learning to compute predicted interference power levels at the plurality of sensors for a current time based on current and past measurements collected from the subset of sensors in the network.
Clause 8. The method of clause 7, further comprising constructing a power spectral density (PSD) map representative of the distribution of RF power throughout the geographic region as a function of frequency and location based on the predicted interference power at each of the sensors.
Various embodiments of the invention have been described. These and other embodiments are within the scope of the following claims.
This application claims the benefit of U.S. Provisional Application No. 61/661,109, filed Jun. 18, 2012, the entire contents of which are incorporated herein by reference.
Number | Date | Country
---|---|---
61661109 | Jun 2012 | US