This invention relates generally to wireless communications, and more particularly to shaping the spectrum of transmitted signals in ultra wide bandwidth communication systems.
With the release of the “First Report and Order,” Feb. 14, 2002, by the Federal Communications Commission (FCC), interest in ultra wide bandwidth (UWB) communication systems has increased. The IEEE 802.15 standards organization, which is responsible for personal area networks (PAN), has established a task group, TG3a, to standardize a high-data-rate physical layer based on UWB. One of the most important requirements is compliance with the FCC spectral mask for indoor channels shown in
In addition to this requirement, it is also required by IEEE 802.15 that UWB systems do not interfere with existing wireless systems, such as 802.15.1 (Bluetooth), 802.15.3 (personal area networks), 802.15.4 (Zigbee) and 802.11a and 802.11b (wireless local area networks).
Furthermore, UWB systems should also be resilient against interference from these existing wireless systems. In addition, UWB systems should also resist interference from home appliances, such as microwave ovens, and other narrowband interferers.
All these requirements impose additional constraints on the spectrum shaping. Minimizing interference, in both receivers and transmitters, requires placing minima on the poser in the transmission spectrum. Rigorously speaking, the spectral mask demands very low power in certain frequency bands, as well as matching filters. A null in the transmit spectrum also implies a null in the transfer function of the matching filter, and thus interference suppression at this frequency.
While some interference, e.g., microwave radiation, is at fixed frequencies, other interference is at variable frequencies, e.g., the different bands of 802.11a, or at frequencies that cannot be predicted a priori. Thus, it is necessary to be able to shape the spectrum adaptively.
For impulse radio signals, pulse-position modulation, (PPM) and pulse amplitude modulation (PAM) are the most popular signaling techniques. These techniques are combined with time hopping (TH) for multiple-access format. In TH, each symbol is represented by a series of “basis pulses.” The location or delay of the pulses is determined by a time-hopping code. The time-hopping code is generated as a pseudo-random sequence.
In general, the spectrum with random code generation can be quite different from the FCC mask. As a result, in order to be compliant with FCC regulation, with a scaling of transmission power, the maximum allowed transmission power can be significantly restricted. Therefore, any signal shaping method that uses a basic, single short pulse must explicitly take the FCC mask into consideration.
Under certain conditions the spectrum of the transmit signal becomes identical to the spectrum of the basis pulse. Therefore, there is a need to modify the spectrum of the basis pulse to fit the requirements of both the FCC mask and industry standards.
The invention shapes the spectrum of a transmitted signal in an ultra wide bandwidth (UWB) system so that the transmitted signal meets regulatory constraints, as well as practical requirements of interference suppression.
The system according to the invention uses a linear combination of weighted, time-shifted (delayed) basis pulses in order to achieve a desired spectrum. Characteristically, the system uses a bank of basic pulses, preferably with different time-frequency duration, to approximate a given mask of general shape with a small number of parameters.
The weights and delays are selected according to criteria related to spectral efficiency, adherence to the FCC spectral mask, and other restrictions like maximum number of rake fingers.
The method for determining the weights is based on a non-linear search, initialized by the solution of a quadratic optimization problem. Two ways of approximating the original problem with a quadratic formulation are described. Additionally, the invention provides combinatorial optimization to perform discrete optimization over pulse positions (delays), as well as off-line selection from a basic pulse set.
As shown in
Prior to the combining 110, a set of optimized filter and delay lines 120, hl(t), l=1, . . . , L, is applied to the pulses to weight and delay the generated basis pulses to conform to a predetermine spectral mask.
The combination of the filtered pulses can be achieved by a combination of analogue delay lines, adders, and programmable pulse generators.
As shown in
An important part of our method is the determination of the pulse weights and delays. We determine the weight and delays on either fixed or time-varying criteria for the spectrum shape of the resulting signal 102. These criteria can stem from the FCC spectral mask (fixed), from the necessity to avoid interference to other users, which can be pre-defined or time-varying, or following an instantaneous or averaged determination of the emissions of users in the current environment, or other criteria. In any case, these criteria are mapped onto an “instantaneous” spectral mask that has to be satisfied by the resulting signal 102.
Selecting Pulses Off-Line
The selection of the basis pulses 101 can be based on two criteria: implementation complexity, and approximation capability.
We prefer the basis pulses to be Gaussian in form. Gaussian pulses are relatively easy to generate, and their differentiations and integrations can be implemented with differentiation and integral circuits well known in the art.
The approximation can be performed with multi-resolution time-frequency analysis of signals using, for example, wavelets, and multi-resolution time-frequency approximation.
The signal of interest for the invention conforms to a desired spectral mask. Whereas a signal may be analyzed with respect an infinite number of possible bases in signal space, the choice of bases matters critically when one is restricted to a given number of bases available to approximate the spectrum of the desired signal.
A high-level description of the problem is given as follows. Assume there is a set of candidate of basis functions C. For example, the basis functions correspond to differentiations of a Gaussian pulse of various orders. In addition, the basis functions can be orthogonalized, e.g., by a sequential Gram-Schmidt procedure, to facilitate the subsequent optimization.
A set of typical masks available for training purpose is denoted by S={M(Ω)}, and one particular set of chosen bases as denoted by p(t). The particular set is obtained by stacking the selected basis functions into a column vector.
We define a function ƒ(p(t), S) to evaluate the fulfillment of the two considerations. For example, the function is a weighted combination of two parts: the first part models a cost of generating the pulse, and the second part models an approximation error or some other efficiency metric. This function can be expressed as the formulation:
where α and β are predetermined constants. This formulation is a combinatorial optimization problem, described below.
In comparison, the problem of optimal pulse locations is mainly for on-line applications, where the choices of bases are fewer, the implementation cost is fixed, and there is only one target mask, instead of a set of training pulses.
Formulation of the Problem
The individual basic pulses are denoted by pl(t), l=1, . . . L, and their Fourier transforms by Pl(jΩ), l=1, . . . L. The set of shaping filter 120 is FIR, with the impulse response being the sum of δ-functions placed at different delays, τ, and weighted or scaled, s, differently. Corresponding to
We use the following notations:
The elements of p(t) constitute a pool of bases pulses in the signal space. Then, the single user spectrum shaping problem can now be formulated as follows:
subject to |S(jΩ|2<M(Ω), ∀Ωε[−∞,+∞],
where M(Ω) is an upper-bound on the squared magnitude response regulated by FCC.
This is equivalent to a min-max formulation:
subject to sTRs=1.
In addition, structural constraints can be put on the parameter vector s due to implementation concerns. For example, the number of non-zero elements in certain sub-vectors of the parameter vector s can be constrained.
In certain scenarios, the inner product of the desired signal <s(t), s(t)>can be replaced by the maximum of the signal energy within a certain frequency range, e.g., the “in-band” proportions of energy are maximized. This only changes the definition of the matrix R.
In certain scenarios, the constraints |S(jΩ)|2<M(Ω), ∀Ωε[−∞,+∞] are extended to include integral spectrum constraints, e.g., for “out-of-band” signals, which produces interference to other devices. These constraints are not as strict as the fixed envelop definition of the FCC mask.
The modifications corresponding to the system shown in
Initializing Pulse Parameters
The min-max formulation as described above, or a robust ∞-norm minimization is known to be a difficult problem. Several existing game-theoretic techniques rely on the existence of saddle points, which unfortunately are not satisfied in this case.
Therefore, we minimize with an approximate 2-norm formulation instead. In other words, we replace the maximum over all frequency by the minimum of a weighted integral formulation, i.e.,
We also note that:
≡sTWs.
Provided that there is no structural constraint on s, the quadratic approximation leads to:
subject to sTRs=1,
for which the solution is an eigenvector corresponding to the smallest eigenvalue λ of the matrix w−λR. When there are structural constraints on s and the non-zero positions have been fixed, e.g., as one elementary step in a combinatorial optimization procedure, the problem bears a similar form except that W and R are now replaced by their corresponding principal sub-matrices.
Optimization Over Pulse Positions
One notable difference of our invention, compared with traditional least-squares FIR filter design, is that the pulse positions, i.e., delays of the non-zero elements in s are also optimized.
In prior art UWB systems, the random pulse-positive modulation relies essentially on a sparse arrangement of the pulses to support asynchronous transmissions, to provide better multi-path resolution, and to avoid catastrophic collisions.
The optimization over the pulse positions or delays is a combinatorial optimization problem, which determines an optimal subset selected from a given set of basis functions p(t).
As stated above, this problem is in similar form as the optimal choice of the initial basis functions, i.e., p1(t−τli), from a larger set of candidate pulses. The second cost function ƒ2(p(t), M(Ω)) can be selected as an extreme value of the min-max, or the approximate min-integral formulation.
Here, we describe three selection procedures, with the first two being greedy.
Iterative Greedy Addition
We initialize with an empty set of basis functions.
We search for a basis function from the candidate pool, which maximizes the cost function when added to the existing set of basis functions.
We select this basis function, and we repeat the search until the desired number of basis functions has been selected.
Iterative Greedy Removal
We initialize with the entire set of basis functions.
We search for a basis function from the candidate pool, which maximizes the cost function when removed from the existing set of basis functions.
We select this basis function, and we repeat the search until the desired number of basis functions has been selected.
Branch-and-Bound
This approach assumes that p1(t−τli) have already been orthogonalized and normalized. Additionally, this approach uses a quadratic approximation rather than the min-max formulation. In this case, R=I, and the minimum generalized eigenvalue of matrix pair (W, R) reduces to the minimum eigenvalue of W, which is the cost function to be minimized.
Let K denote the maximum number of non-zero elements. Let N denote the number of candidate pulses. Denote one arrangement of the pulse positions (delays) by a vector with K 1's, e.g., (1, 0, 0, 1, . . . , 0).
In the following, we construct an enumeration tree of all the allowed combinations, where each node n is associated with a positioning vector with each element from the set {0, 1, *} where * means the position may be a 1 or 0. The enumeration tree is defined as follows:
The root of the tree is (*,*, . . . ,*), referring to all possible selections.
The first layer of the tree enumerates all the vectors with one 0 and N−1*'s by considering the placement of a 0.
The children of any node with less than N−K zeros enumerates all the possibilities of putting another 0. To avoid duplications, ( . . . , 0,*, . . . ,*) is by convention associated with one possible father node ( . . . ,*,*, . . . ,*), where the first dotted part is the same as the current child vector.
At each visited node, the procedure maintains a lower bound and an upper bound of the cost function among all the descendant combinations. The procedure visits the enumeration tree with certain order, e.g., depth-first, or breadth-first. When visiting a certain node, the procedure updates the upper-bound and lower-bound of the cost function. If the lower-bound is greater than the minimum upper-bound among the visited nodes, the sub-tree, with its root being the current node, is pruned from further consideration. The bounds are determined by Cauchy's interlacing theorem of eigenvalues for symmetric matrices:
Cauchy's Interlacing Theorem:
Let Ar denote a r×r submatrix of a n×n real symmetric matrix A. The eigenvalues λ1≦λ2≦ . . . ≦λn of A, and the eigenvalues μ1≦μ2≦ . . . ≦μr of Ar satisfy the following relations: λi≦μi≦λi+n−r, i=1, . . . , r.
The procedure can terminate after visiting all the nodes. In this case, the procedure reaches a global optimal selection of pulse delays. Alternatively, the procedure can terminate when the cost function evaluated at a visited node is within ε of the current upper-bound of all the possible combinations.
Quadratic Optimization for L=1
For the special case of L=1 and τli=iΔσ, i=1, . . . M1, a distinct quadratic approximation is pursued. In this formulation, the optimization over the pulse positions is directly solvable.
We introduce
φ(Ω,s)≡|s10+s11e−jΩΔτ+ . . . +slM
Then,
Note,
Now, we select a substantially large constraint, i.e., sTRs=b, such that the above equation is equal to the ∞-norm. Although it does not make any difference in the original ∞-norm formulation, it does affect the 2-norm approximation. With the 2-norm approximation, the problem becomes a FIR filter design problem with a least-squares formulation:
subject to sTRs=b with b↑∞
This equation can be approximated by
subject to sTRs=b with b↑∞,
where gn is a real sequence that produces an amplitude response
In particular, we match the non-zero positions to the taps of gn with largest magnitudes. After, the positions of the taps have been fixed, the solutions to the tap weights are computed as s∝uu1u1Tg, where un is the principal component of R, and g are the vectors constructed from elements of gn having largest magnitudes among those complying with the structural constraints of s.
Note, the above two quadratic approximations are only two out of the many possible quadratic approximations. In essence, without FIR constraints and the degree of freedom constraints, the optimal solution is an IIR filter, which uniformly matches the FCC spectral mask.
The various quadratic approximations can be viewed as minimizing some distance measure between the pursued FIR solution and the IIR solution. We prefer the above quadratic approximation mainly because the joint optimizations over the weights and delays are easily solvable due to its special structure.
Non-linear Optimization with Neural Networks
We initialize our solution with the above quadratic approximation, and further exploit non-linear optimization techniques to gradually refine the solutions. The description in this section refers to the original min-max formulation described above.
A back-propagated (BP) multi-layer perceptron (MLP) possesses adaptive learning abilities to estimate sampled functions, represent these samples, encode structural knowledge, and inference inputs to outputs via association. Its main strength lies in its substantially large number of hidden units, and thus, a large number of interconnections. The MLP neural networks enhance the ability to learn and generalize from training data.
We describe the MLP optimization for the special case L=1. Optimization of the scaling coefficients, with fixed positions in the general case L>1, follows similarly. Because L=1, all quantities that depends on L are simplified to shortened notations with the dependence removed.
In addition, we re-parameterize the problem so that s1 . . . , sk represent the scaling or weighting coefficients at the pulse locations (delays) σ1, . . . σK. Therefore, we define σ≡[σ1, . . . σK]T. The optimization over pulse positions for L=1 can be treated by MLP, for L>1. We refer again to the three combinatorial optimization approaches discussed earlier.
In order to put the current problem into the general framework of MLP, we uniformly quantize the frequency range to arrive at
rthrh
Then, we replace the max-function with a differentiable soft-max. Given a function ƒ(x)>0, xεS, for a sufficiently large positive number α, we have the following soft-max approximation:
Then, we replace the max-function with a differentiable soft-max. Given a function ƒ(x)>0,xεS, for a sufficiently large positive number α, we have the following soft-max approximation:
With these two simplifications, the problem becomes
subject to sTR(σ)s=1,
with R(σ) being the principal submatrix of R corresponding to delays σ and
φi(s,τ)≡φ(Ωi,s,τ).
While it is theoretically possible to simultaneously adjust both the positions or delays τ401 and the weighting or scaling coefficients s402, practically we prefer to decouple their tuning by adopting a conditional maximization approach, i.e., optimizing one with the other fixed. In addition, this decoupling may be justified by the different nature of the two parameter sets. In preferred embodiment, with the allowed positions quantized, the slight change σ→σ+μδ2τ is always normalized to “hop” to the nearest valid quantization point on the multi-dimensional grid.
Typically, numerical non-linear optimizations can only be assured to arrive at a local optima rather than a global one. Simulated annealing can be used to avoid local optima. One approach to escape from possible local minima is to begin the optimization with several randomly distributed initial solutions, and select the best solution among the different trial paths.
We note that this is only one possible neural network, which again is just one of the methods for implementing non-linear optimization.
Although the invention has been described by way of examples of preferred embodiments, it is to be understood that various other adaptations and modifications may be made within the spirit and scope of the invention. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.
A claim of priority is made to U.S. provisional application Ser. No. 60/451,577, “UWB Communication system with shaped signal spectrum by pulse combination,” filed Mar. 3, 2003.
Number | Name | Date | Kind |
---|---|---|---|
5742814 | Balasa et al. | Apr 1998 | A |
20020034191 | Shattil | Mar 2002 | A1 |
Number | Date | Country | |
---|---|---|---|
20070286260 A1 | Dec 2007 | US |
Number | Date | Country | |
---|---|---|---|
60451577 | Mar 2003 | US |