Systems and methods for adaptively decoding transmitted frames

Information

  • Patent Application
  • 20030223512
  • Publication Number
    20030223512
  • Date Filed
    June 04, 2002
    22 years ago
  • Date Published
    December 04, 2003
    21 years ago
Abstract
The present invention provides systems and methods for adaptively decoding transmitted frames efficiently in non-Gaussian, non-stationary environments. One such system comprises a decoder adapted to decode a received transmission frame using a decoding scheme, a channel assessment unit for sensing channel characteristics, and a tuning unit for adjusting the decoding scheme based, at least in part, on channel characteristics sensed by the channel assessment unit.
Description


TECHNICAL FIELD

[0001] The present invention relates to the field of communications, and more particularly, systems and methods for adaptively decoding transmitted frames in the presence of impulsive noise and interference.



BACKGROUND OF THE INVENTION

[0002] As the world has become more reliant on computers and information exchange, the need for the reliable transmission of data has become increasingly important. One key element in information exchange is the accurate and efficient transmission and reception of data across noisy transmission channels.


[0003] Signal processing methods implemented in practical communications systems are usually designed under the assumption that any underlying noise and interference is Gaussian. Although this assumption finds strong theoretical justification in the central limit theorem, the noise and interference patterns commonly present in modem mobile communications systems are far from Gaussian. Noise and interference generally exhibit “impulsive” behavior. In typical mobile communication systems, noise and interference sources often include motor-vehicle ignition noise, switching noise from electromechanical equipment, thunderstorms, and heavy bursts of interference. Current signal processing systems are not designed to handle these non-Gaussian noise sources. Accordingly, these systems may perform poorly, and might even fail, in the presence of impulsive noise.


[0004] Channel noise and interference can be effectively modeled as the superposition of many small and independent effects. In practice, these effects do not always follow a Gaussian distribution. This situation appears to contradict the central limit theorem. For many years, engineers have been unable to explain this apparent contradiction. Consequently, many of the techniques developed to cope with impulsive noise were mainly ad hoc, largely based on signal clipping and filtering prior to application of a Gaussian-based technique.


[0005] Clipping the amplitude of an input signal is only effective if the amplitude of the input signal is above or below the specific threshold values. These threshold values are typically determined by the limits of the hardware used in a receiver in a communications system. Accordingly, the threshold values are often chosen to take advantage of the full dynamic range of the analog to digital (A/D) converter(s) of the receiver. However, if impulsive noise, added to the input signal, does not cause the amplitude of the signal to exceed a specific threshold, clipping will not remove the noise. Additionally, even when noise does cause the signal to exceed the threshold, the clipping solution only removes the noise to the extent that the magnitude of the signal plus the noise is above the threshold. Accordingly, noise is not actually removed, but its effects are somewhat reduced.


[0006] When individual signals within a sequence are contaminated by noise, the sequence may not be properly decoded and efficient communications may be difficult. In typical communication systems, decoding is used to identify potential communication errors. Additionally, decoding may be able to correct some, or even most, errors. Errors may be corrected by one of many error detection and correct schemes known to those skilled in the art. Typical coding and decoding schemes are able to correct errors by inserting controlled redundancy into the transmitted information stream. This is typically performed by adding additional bits or using an expanded channel signal set. These schemes allow the receiver to detect and possibly correct errors.


[0007] In its most simple form, one problem with noisy transmission environments is that, a certain percentage of the time, a transmitted ‘1’ is received as a ‘0’ or vice versa. There are many methods of encoding data that allow received errors to be detected or even corrected. These encoding and decoding schemes are typically optimized based on a set of underlying assumptions. Preferably, these assumptions are designed to match the conditions of a real-world communications environment. Generally, decoding systems are designed under the assumption that the underlying noise and interference is Gaussian. When these assumptions do not match real-world conditions, the performance of such schemes may no longer be optimal. In real-world environments, this effect is ever-present because conditions are constantly changing. Even those systems in existence that try to accommodate impulsive noise are based on average conditions. These systems fall short of optimal performance when conditions stray from the average. These problems are compounded in a mobile system because conditions change even more rapidly and more often than stationary systems.


[0008] When a receiver is able to detect, but not correct, errors in received information, the receiver may request that the transmitter resend the information. In a noisy environment, this may lead to highly inefficient communications.


[0009] There is a need in the art for systems and methods for processing signals to alleviate impulsive noise distortion.


[0010] Additionally, there is a need in the art for systems and methods for providing an adaptive decoding system for efficient communications in non-Gaussian, non-stationary environments.



SUMMARY OF THE INVENTION

[0011] The present invention overcomes the limitations of the existing technology by providing systems and methods for providing adaptive decoding for efficient communications in non-Guassian, non-stationary environments. An exemplary adaptive decoding system in accordance with the present invention comprises a decoder having a decoding scheme for decoding a received transmission frame, a channel assessment unit for sensing channel characteristics, and a tuning unit for adjusting the decoding scheme of the decoder at least in part on the channel characteristics sensed by the channel assessment unit.


[0012] Other objects, features, and advantages of the present invention will become apparent upon reading the following detailed description of the embodiments of the invention, when taken in conjunction with the accompanying drawings and appended claims.







BRIEF DESCRIPTION OF THE DRAWINGS

[0013]
FIG. 1 is an illustration of several plots of a metric ρ(x) for several values of k, which may be used in an exemplary embodiment of the present invention.


[0014]
FIG. 2 is a plot of a general α-k plot.


[0015]
FIG. 3 is a block diagram illustrating an adaptive decoding system in accordance with an exemplary embodiment of the present invention.


[0016]
FIG. 4 is a flow diagram illustrating a method of adjusting the metric of a decoding scheme in accordance with an exemplary embodiment of the present invention.







DETAILED DESCRIPTION OF THE INVENTION

[0017] Referring now to the drawings, in which like numerals refer to like techniques throughout the several views, exemplary embodiments of the present invention are described.


[0018] As discussed in the background of the present invention, noise and interference patterns are comprised of small and independent effects. Placing upper and lower boundaries on the variance of these effects may allow greater reliance on the central limit theorem. From a conceptual perspective, an unbounded or infinite variance is feasible as a model of highly dispersed or impulsive phenomena. Without the finite variance constraint, a converging sum of normalized random variables can be proven to belong to a wider class of random variables known as “α-stable”. Thus, similar to Gaussian processes, α-stable processes can appear in practice as the result of physical principles. Furthermore, all non-Gaussian α-stable processes are “heavy-tailed” processes with infinite variance, explaining the often found impulsive nature of practical signals.


[0019] “Symmetric” α-stable random variables are commonly described through their characteristic function:


Φ(ω)=e−γ|ω|α,  (1)


[0020] where α is the index or characteristic exponent, and γ is the dispersion. Analogous to the variance in a Gaussian process, γ is a measure of signal strength. The shape of the distribution is determined by α From the equation above, it can be shown that α is restricted to values in the interval (0,2]. Qualitatively, smaller values of α correspond to more impulsiveness in the distribution. The limiting case α=2 corresponds to a Gaussian distribution, which is the least impulsive α-stable distribution and the only one with finite variance. A value of α=1 results in a random variable with a Cauchy distribution.


[0021] A theory of estimation in α-stable environments can be derived from the tools of robust statistics. In general, let ρ(x) be a symmetric cost function or metric which is monotonically non-decreasing on [0, ∞). For a set of samples x1, x2, . . . ,xN, the M-estimator of the location parameter, β, is defined as
1β^=argminβi=1Nρ(xi-β).(2)


[0022] In the theory of M-estimators, the shape of the cost function ρ determines the characteristics of the estimate, {circumflex over (β)}. For example, if ρ(x)=x2, {circumflex over (β)} becomes the least-squares estimate (i.e. the sample mean). For ρ(x)=|x|, {circumflex over (β)} is the sample median. The cost function:


ρ(x)=log(k2+x2),  (3)


[0023] where k is a constant, possesses important optimality properties along the whole range of α-stable distributions.


[0024] The importance of the cost function described in the above equation is that the value of k can be tuned to give optimal estimation performance depending on the parameters of the underlying distribution. Cost-functions may be tuned to optimize performance for a set of conditions. FIG. 1 shows ρ(x) for different values of the tuned parameter k. Note that as k takes on a very large value, the shape of the cost function is similar to that used by the least squares estimator. On the other hand, for small values of k, ρ(x) grows slowly for large values of x, which is desirable since it adds robustness to the metric function. Given that parameters α and γ of an α-stable distribution generate an independently and identically distributed (i.i.d.) sample, the optimal value of k is given by a function of the form:




k
(α, γ)=k(α)γ1/α.  (4)



[0025] The above expression indicates a “separability” property of the optimal value of k in terms of the parameters α and γ. This reduces the problem of the functional form of k (α,γ) to that of determining the simpler form:




k
(α)=k(α, 1), 0<α≦2.  (5)



[0026] This function describes “the α-k plot” of α-stable distributions. Under the “maximum likelihood” optimality criterion, it can be proven that the α-k plot touches three fundamental points:


[0027] 1. For α=2 (i.e., the Gaussian distribution), the optimal value of k is k=∞, which, for the location estimation problem, makes {circumflex over (β)} equal to the sample mean.


[0028] 2. With α=1 (i.e., the Cauchy distribution), the optimal value is k=1. This is a direct consequence of the definition of the cost function in Equation (3), and the fact that the resulting M-estimator is equivalent to the maximum likelihood estimator for a Cauchy distribution.


[0029] 3. When α→0 (i.e., the most impulsive distribution), the optimal value of k converges to k=0.


[0030] The above points suggest the general shape of the α-k plot illustrated in FIG. 2. Although finding an exact expression matching the shape of the α-k plot is difficult, the following approximations may be used:
2k(α)=tan(πα4),and(6)k(α)=(α2-α)P,(7)


[0031] where P is in general a positive constant. Due to its relative simplicity and efficient results, the following tuning function is often preferred:
3k(α)=α2-α.(8)


[0032] One general goal of using encoding and decoding for the transmission of data, is to minimize the probability of error. In the situation where various coded sequences are equally likely, this may be accomplished using a “maximum likelihood” decoder. For hard decision decoding, it is well known that the maximum likelihood decoder selects the codeword that is closest in Hamming distance to the received sequence.


[0033] It is also well known that soft decision decoding offers a performance advantage over hard decision decoding. Soft decision decoding preserves information contained in the received sequence and passes that information on to a decoding scheme. The task is to choose a cost function appropriate for soft decision decoding. For a channel with underlying noise and interference that is Gaussian, maximum likelihood decoding is achieved using a Euclidean distance (ρ(x)=x2) as the cost function. Furthermore, if the channel can be accurately modeled as a stationary additive-noise channel with noise density function ƒ(x), an optimal cost function can be easily designated as ρ(x)=−log(ƒ(x)). In real world applications, however, the statistical behavior of a channel is not stationary. Accordingly, finding a model for the channel noise is a significant challenge. This situation is aggravated by the fact that even a very small deviation from model channel assumptions can lead to the design of highly inefficient cost functions. Thus, the choice of an appropriate cost function that gives satisfactory robust performance is important.


[0034] The present invention envisions decoders that are adapted to be adjusted according to present channel characteristics. In an exemplary embodiment of the present invention, a Viterbi decoder is used in which a metric function is changed according to channel characteristics. The present invention is optimized for use in channels having any of the following characteristics:


[0035] Non-Gaussian/impulsive channels


[0036] Time-varying channels


[0037] Systems with varying impulsiveness such as cellular systems under varying environment conditions or changing multiuser interference patterns.


[0038]
FIG. 3 illustrates a block diagram of an adaptive decoder in accordance with an exemplary embodiment of the present invention. The decoder 315 comprises a channel assessment unit 325 adapted to “sense” channel behavior, and to provide instructions to a metric generator 310 of a Viterbi decoder 315. The generator 310 is adjusted based, at least in part, on channel characteristics.


[0039] The metric generator 310 is adapted to use a finite number of cost functions ρ1, ρ2, . . . ρN,. The channel assessment unit 325 is, in general, adapted to use a discrete function that, based on soft data encoding from a demodulator (not shown), decides which cost function should be applied. The channel assessment unit 325 may be adapted to make use of a pilot signal or may be further adapted to work directly on the received data sequence. It may also work on a continuous basis, or only during a percentage of the transmission time in the same way as multiplexed pilots.


[0040] In an exemplary embodiment of the present invention, the channel assessment may be implemented as the estimator of a vector of channel parameters {circumflex over (θ)}=[{circumflex over (θ)}1, {circumflex over (θ)}2, . . . , {circumflex over (θ)}p]. Then, the parameter space p, is partitioned into N disjoint regions, S1, S2, . . . , SN. Selection adaptation is then performed according to the simple rule:


select ρi when {circumflex over (θ)}εSi.  (9)


[0041] In an exemplary embodiment of the present invention, the operation of the channel assessment unit 325 may be further explained using the following example of Gaussian/Laplacian selective decoding or “hard” decoding. It should be understood that this is only one example and in no way should be construed as a limitation to the present invention. When performing Gaussian/Laplacian hard adaptation, the following cost functions may be used:


ρ1(x)=x2 (Optimum for Gaussian noise)  (10)


ρ2(x)=|x|(Optimum for Laplacian noise).  (11)


[0042] Alternatively, a channel assessment unit 325 may be adapted to perform a Gaussianity test in place of the channel assessment scheme. The Gaussianity test may have different structures. One commonly used Gaussianity test is based on Bispectral estimation. In this method, the 2-D Fourier Transform of the third order cumulant sequence yields the bispectrum of the process. Essentially the bispectrum of a Gaussian process is zero over all frequencies (zero cumulant matrix). The Gaussian test hypothesizes whether there is sufficient statistical deviation from zero to imply a non-Guassian source. The main goal of the Gaussianity test is to give information about the current impulsiveness of the channel. Typically, the test will be comprised of a real function λ, applied to the demodulated history:


λ=λ({circumflex over (β)}M, {circumflex over (β)}M-1, {circumflex over (β)}M-2, . . . {circumflex over (β)}1),  (12)


[0043] where {circumflex over (β)}M is the current demodulated symbol, and {circumflex over (β)}M-1 is the output of the demodulator i symbols ago. If λ is larger than a predetermined constant C, then the test determines that the channel can be considered Gaussian, and ρ1 is chosen. On the contrary, if λ≦C, the test indicates a tail heavier than Gaussian, calling for the use of a more impulse-resistant cost function, namely ρ2.


[0044] It should be noted that the performance of the system depends strongly on the quality of the test. Also, the adaptation speed and/or responsiveness of the system is determined by the relative importance that the function λ gives to recent and old data. In an exemplary embodiment of the present invention, λ is a “weighted” test function assigning more importance or weight to the most recent data. The variation of the weight distribution determines the responsiveness of the system. Furthermore, λ can be defined to operate only on the M most recent data. The size of M, in this case, will determine the system responsiveness (or adaptivity), with small values of M corresponding to more responsive systems.


[0045] When transitions from one cost function to the other occur, the memory of the decoder is adapted to store information in terms of the old cost function. As part of the normal operation of the decoder 315, the metrics associated with the new cost function are added to old accumulated metrics. Hence, metrics may be scaled in order to guarantee comparability or compatibility of ρ1 and ρ2 when both contribute information to the accumulated metric.


[0046] Noting that ρ2 and cρ2 (for c>0) are equivalent cost functions, an optimal value of c may be designed to guarantee compatibility and optimal performance during transitions. Such values of c are referred to as the transition constants of the system. Although only one non-unit transition constant is needed for the example outlined, c1 and c2 may be referred to as the associated transition constants in the sense that the implementation of the decoder 315 uses the equivalent metrics c1ρ1 and c2ρ2.


[0047] The present invention may also be used for soft adaptation implementations. When operating using soft adaptations, a family of cost functions are defined: ρk,kεL. The tuning of the system (i.e., the selection of the cost function to be used) is performed using a tuning function:






k
=k({circumflex over (θ)}).

  (13)



[0048] The difference between hard and soft adaptation (and associated hard and soft data) can be seen as
1hardk is a quantized functionsoftk is continuous


[0049] In hard adaptation, a firm (or hard) decision is made regarding each bit as it is received. In soft adaptation, decisions may be changed if other received bits (typically received later in time) indicate that the earlier decision was likely incorrect.


[0050] A third function, the transition function, (in addition to the previously introduced cost and tuning functions) is used in soft adaptations. Instead of the constants c1, c2, . . . cN used in hard adaptation, a function c(k) scales each ρk appropriately to allow an efficient comparison among the different cost functions.


[0051] The use of different cost functions may be illustrated using a Myriad-based decoder. In an exemplary embodiment of the present invention, a Myriad-based decoder is adapted to use the following cost function and parameters:


ρκ(x)=log(k2+x2), kε




θ
=[α, γ] (The parameters characterizing α-stable noise)





θ
=[{circumflex over (α)}, {circumflex over (γ)}] (Any specific known estimator)





k
(θ)=k({circumflex over (α)},{circumflex over (γ)})=k({circumflex over (α)})γ1/α,



[0052] where k({circumflex over (α)}) is a predetermined α-k plot such as the one shown in FIG. 3.


[0053] The optimal form of c(k) may be designed through simulations or from theoretical analysis. For instance, c(k)=k2 is an exemplary sample which generally provides good performance. Alternatively, simulations may be performed to identify a function c(k) that produces optimal results for a specified system.


[0054]
FIG. 4 is a flow diagram of a method for adjusting a decoding scheme in accordance with an exemplary embodiment of the present invention. In step 405 of an exemplary adaptive decoding method, the system first senses the channel characteristics of the current transmission channel using one of many methods known in the art. Among the sensed channel characteristics are noise characteristics. Channel noise has an associated α, which is representative of the impulsiveness of channel noise. Assuming that α is stable (generally a valid assumption), α may be estimated using a standard α estimator in step 410.


[0055] Once a value of α is estimated, k(α) may be calculated in step 415 using one of the approximation functions for k(α) shown above, or k(α) may be taken directly from the α-k plot shown in FIG. 2. After calculating the appropriate value for k(α), the new metric shape may be calculated in step 420 according to the following function:


ρk(x)=log(k2+x2)  (14)


[0056] After the new metric is calculated, the decoding scheme may be adjusted to the new metric in step 425.


[0057] While this invention has been described with reference to embodiments thereof, it should be understood that variations and modifications can be made without departing from the spirit and scope of the present invention as defined by the claims that follow.


Claims
  • 1. An adaptive decoding system comprising: a decoder adapted to decode a received transmission sequence using a decoding scheme; a channel assessment unit for sensing channel characteristics; and a tuning unit for adjusting the decoding scheme of the decoder based at least in part on the channel characteristics sensed by the channel assessment unit.
  • 2. The system of claim 1, wherein the channel assessment unit is adapted to estimate a value for an indicator of the impulsiveness of the channel.
  • 3. The system of claim 1, wherein the decoder is further adapted to adjust the decoding scheme based at least in part upon the impulsiveness of the channel.
  • 4. The system of claim 2, wherein the channel assessment unit is further adapted to estimate α, an indicator of the impulsiveness of the channel.
  • 5. The system of claim 2, wherein the tuning unit is further adapted to adjust the decoding scheme of the decoder based at least in part on a value k that is calculated from the estimated value of the indicator of the impulsiveness of the channel.
  • 6. The system of claim 5, wherein the turning unit is adapted to adjust the decoding scheme using k, calculated from an α-k plot, wherein a is an indicator of the impulsiveness of the transmission channel.
  • 7. The system of claim 5, wherein the tuning unit is adapted to adjust the decoding scheme using k, calculated in accordance with the following equation:
  • 8. The system of claim 1, wherein the decoding scheme comprises a decoding metric calculated in accordance with the following equation:
  • 9. The system of claim 1, wherein the decoding scheme uses a first metric, the system further comprising: a metric generator adapted to calculate a second metric for use with the decoding scheme, the second metric being selected based on the sensed channel characteristics of the predetermined transmission channel, the metric generator being further adapted to scale the second metric to increase compatibility between the first metric and the second metric, and the metric generator being further adapted to apply the scaled second metric to the decoding scheme.
  • 10. The system of claim 9, wherein the metric generator is further adapted to use a transition constant to scale the second metric.
  • 11. The system of claim 9, wherein the metric generator and the tuning unit are a single unit.
  • 12. A method for adapting a decoding scheme of a decoding system, the method comprising: sensing the channel characteristics of a predetermined transmission channel; and shaping the decoding scheme based at least in part on the channel characteristics of the predetermined transmission channel.
  • 13. The method of claim 12, further comprising the step of: estimating a value representative of the impulsiveness of the predetermined transmission channel based on the sensed channel characteristics.
  • 14. The method of claim 13, further comprising the step of: computing a value k based on the estimated value of the impulsiveness of the predetermined transmission channel.
  • 15. The method of claim 14, wherein the step of computing the value k comprises selecting the value from an α-k plot.
  • 16. The method of claim 14, wherein the step of computing the value k is performed in accordance with the following equation:
  • 17. The method of claim 14, wherein the step of shaping the decoding scheme is performed using a cost function calculated in accordance with the following equation:
  • 18. The method of claim 14, wherein the step of computing the value k is performed in accordance with the following equation:
  • 19. The method of claim 14, wherein the step of computing the value k is performed in accordance with the following equation:
  • 20. The method of claim 13, wherein the step of shaping the decoding scheme comprises: shaping the decoding scheme based at least in part on the impulsiveness of the predetermined transmission channel.
  • 21. The method of claim 12, wherein the steps are performed repeatedly throughout the reception of a transmitted sequence.
  • 22. The method of claim 12, wherein the decoding scheme has a first metric, and wherein the step of shaping the decoding scheme comprises: calculating a second metric for use with the decoding scheme, the second metric being selected based on the sensed channel characteristics of the predetermined transmission channel; scaling the second metric to increase compatibility between the first metric and the second metric; and applying the scaled second metric to the decoding scheme.
  • 23. The method of claim 22, wherein the step of scaling the second metric comprises using a transition constant to scale the second metric.
  • 24. A method for adapting a decoding scheme of a decoding system, the method comprising: sensing the channel characteristics of a predetermined transmission channel; determining a transmission channel statistic representative of the statistical properties of the transmission channel; and selecting a cost function for a decoding scheme based, at least in part, on the transmission channel statistic.
  • 25. The method of claim 24, wherein the steps of the method are performed repeatedly throughout the reception of a transmitted sequence.
  • 26. The method of claim 24, wherein the step of sensing the transmission channel characteristics comprises senseing the impulsiveness of the transmission channel.
  • 27. The method of claim 26, wherein the step of determining a transmission channel statistic representative of the statistical properties of the transmission channel comprises estimating a value α representative of the impulsiveness of the transmission channel.
  • 28. The method of claim 27, wherein the step of selecting a cost function comprises calculating a cost function from the estimated value α.
  • 29. The method of claim 28, wherein the cost function is calculated in accordance with the following equation:
  • 30. An adaptive decoding system comprising: a decoder adapted to decode a received transmission sequence using a first decoding scheme; a channel assessment unit for sensing channel characteristics; and a metric generator for selecting a cost function for a second decoding scheme based on the sensed channel characteristics.
  • 31. The system of claim 30, wherein the channel assessment unit and the metric generator are adapted to repeatedly sense channel characteristics and calculate cost functions.
  • 32. The system of claim 30, wherein the channel assessment unit is adapted to sense the impulsiveness of a transmission channel.
  • 33. The system of claim 32, wherein the channel assessment unit is further adapted to estimate α, a value representative of the impulsiveness of the transmission channel.
  • 34. The system of claim 33, wherein the metric generator is adapted to calculate a cost function from the estimated value α.
  • 35. The system of claim 34, wherein the metric generator is further adapted to calculate a cost function in accordance with the following equation: