The following articles provide a brief description of the prior art:
There is a growing need to improve the carrier andrfrequency recovery of OFDM signals.
There are provided an OFDM receiver, and a method for receiving and processing OFDM symbols according to various embodiments of the invention.
There may be provided an orthogonal frequency division multiplexing (OFDM) receiver, may include: an input port that may be configured to receive a stream of OFDM symbols; a timing circuit that may be configured to search, in the stream of OFDM symbols, for a training sequence that may include a first Golay codeword and a second Golay codeword and to process the training sequence and extract timing information about a timing of reception of OFDM symbols, out of the stream of OFDM symbols, that convey data; wherein the sum of an autocorrelation of the first Golay codeword and an autocorrelation of the second Golay codeword consists essentially of a delta function.
The first Golay codeword and the second Golay codeword may be separated from each other by at least one padding bit.
The first Golay codeword and the second Golay codeword may not be separated from each other by any padding bits.
At least ninety percent of energy of the sum may belong to the delta function.
The sum may consist only of the delta function.
The delta function may have a peak that equals twice a length of the first Golay codeword.
The timing circuit may not include multiplication units.
The OFDM symbol stream may include multiple interleaved sequences of oversampled data symbols; wherein each sequence of oversampled data symbols may include a training sequence candidate; wherein the timing circuit may be configured to select a selected training sequence out of multiple training sequence candidates of the OFDM sequence stream.
The timing circuit may be configured to calculate cross-correlations peaks by cross correlating between each of the multiple training sequence candidate and a reference training sequence that may include the first Golay codeword and the second Golay codeword.
The timing circuit may be configured to select the selected training sequence in response to the cross-correlation peaks.
The timing circuit may be configured to select as the selected training sequence a selected training sequence candidate having a biggest cross correlation peak out of the cross correlation peaks.
The timing circuit may be configured to define a timing reference point as a location of the cross correlation peak of the selected training sequence.
The timing circuit may be configured to compare the cross correlation peak of the selected training sequence to a cross correlation peak of at least one training sequence candidate that differs from the selected training sequence to provide a comparison result; and to determine a fractional timing offset based upon the comparison.
The OFDM receiver may include a frequency offset determination circuit that may be configured to determine a frequency offset of the OFDM sequence in response to a value of the cross correlation peak of the selected training sequence.
The frequency offset determination circuit may be configured to: divide the first Golay codeword of the selected training sequence into multiple first subsets; calculate first averages of cross correlations between the multiple first subsets and a corresponding reference First Golay codeword subsets; divide the second Golay codeword of the selected training sequence into multiple second subsets; calculate second averages of cross correlations between the multiple second subsets and a corresponding reference Second Golay codeword subsets; extract phase difference between first averages and corresponding averages; and determine the frequency offset of the OFDM sequence in response to the phase differences.
The circuit may include: a first cross-correlation circuit that may include first taps and may be configured to search for the first Golay codeword; a second cross-correlation circuit that may include second taps and may be configured to search for the second Golay codeword; wherein the frequency offset determination circuit may include multiple phase detectors that are configured to calculate phase differences between output signals of different taps of the first taps and the second taps; wherein the frequency offset determination circuit may be configured to determine the frequency offset of the OFDM sequence in response to the phase differences.
The multiple phase detectors comprise first phase detectors that are configured to calculate phase differences between output signals of different first taps; second phase detectors that are configured to calculate phase differences between output signals of different second taps.
The multiple phase detectors comprise a phase detector that may be configured to calculate a phase difference between a first output signal of a first tap and a second output signal of a second tap.
A method for receiving and processing orthogonal frequency division multiplexing (OFDM) signals, the method may include: receiving a stream of OFDM symbols; searching, by a timing circuit, in the stream of OFDM symbols, for a training sequence that may include a first Golay codeword and a second Golay codeword; processing, by a timing circuit, the training sequence and extracting timing information about a timing of reception of OFDM symbols, out of the stream of OFDM symbols, that convey data; wherein a sum of an autocorrelation of the first Golay codeword and an autocorrelation of the second Golay codeword consists essentially of a delta function.
The subject matter regarded as the invention is particularly pointed out and distinctly claimed in the concluding portion of the specification. The invention, however, both as to organization and method of operation, together with objects, features, and advantages thereof, may best be understood by reference to the following detailed description when read with the accompanying drawings in which:
In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, and components have not been described in detail so as not to obscure the present invention.
The subject matter regarded as the invention is particularly pointed out and distinctly claimed in the concluding portion of the specification. The invention, however, both as to organization and method of operation, together with objects, features, and advantages thereof, may best be understood by reference to the following detailed description when read with the accompanying drawings.
It will be appreciated that for simplicity and clarity of illustration, elements shown in the figures have not necessarily been drawn to scale. For example, the dimensions of some of the elements may be exaggerated relative to other elements for clarity. Further, where considered appropriate, reference numerals may be repeated among the figures to indicate corresponding or analogous elements.
Because the illustrated embodiments of the present invention may for the most part, be implemented using electronic components and circuits known to those skilled in the art, details will not be explained in any greater extent than that considered necessary as illustrated above, for the understanding and appreciation of the underlying concepts of the present invention and in order not to obfuscate or distract from the teachings of the present invention.
Any reference in the specification to a method should be applied mutatis mutandis to a system capable of executing the method.
Any reference in the specification to a system should be applied mutatis mutandis to a method that may be executed by the system.
There is provided a system and a method for an improved Coarse Timing Offset (CTO) and Carrier Frequency Offset (CFO) recovery algorithm for OFDM and its variants, based on the so-called Golay Complementary Sequences (GCC). The scheme is suitable for filter-bank based (optionally DFT-Spread OFDM) the signal processing entirely multiplier-free and the autocorrelation features a distinct single-sample peak well above the sidelobes even in the presence of multiple strong channel impairments.
Although OFDM-based optical transmission has not yet been commercially deployed, intense research continues in this promising direction as the OFDM approach continually evolves and improved multiple variants are introduced, such as DFT-spread (DFT-S) OFDM, filter-bank based sub-banded OFDM [SPM, Hauske] and combinations thereof.
In all OFDM variants, training sequences are used for coarse timing offset (CTO) recovery as well carrier frequency offset (CFO) recovery, i.e. estimation and correction of the timing window to perform the receiver FFT for OFDM detection, as well as the estimation of the CFO and its subsequent cancellation (note: “coarse” in the CTO term means up to one integer sample—as fractional delay is up to the OFDM one-tap OFDM equalizer to mitigate). The transmitted signal consists of (optionally DFT-S) OFDM data symbols interspersed with training symbols launched at low repetition rate. The received signal is subjected to Delay&Correlate (D&C) (moving window cross-correlation) processing, continually searching for the presence of an auto-correlation (ACOR) peak of the transmitted training symbol. Ideally, sharp ACOR peaks should be generated even in the presence of noise and impairments. Inserting a sequence with good ACOR properties within identical sections of the training symbol enables both D&C processing for CTO estimation, as well as related Delay&Phase-compare (D&PC) processing for CFO estimation. The pioneering work in this area [1], [2] yielded the Schmidl-Cox [S-C] algorithm based on transmitting the training symbol (A, A), with A some pseudorandom sequence, and applying D&C to the two halves.
Improved algorithms such as Minn's followed suite in the wireless area. In our previous works on digitally sub-banded OFDM systems as recently reviewed in a tutorial [SPM], we have ported the Minn algorithm from wireless communication to optical transmission for the purposes of CTO and CFO recovery sub-band by sub-band; We have also applied the Minn scheme for Chromatic Dispersion (CD) estimation [3]. Our Minn estimator is based on a training sequence of the for, (A, A, −A, −A) wherein A is a Zadoff-Chu CAZAC-type finite sequence with “good” ACOR (narrow-peak (no plateau) and high mainlobe-to-sidelobes rejection ratio (MSRR)). There already exists extensive optical communication literature on using CAZAC sequences for CD and Polarization (2×2 MIMO channel) estimation purposes, rather than for CTO and CFO estimation as advocated in our previous Minn-based work. However, usage of Golay sequences has been suggested as an alternative to using CAZAC sequences. In the framework of optical communication, GCC have already been demonstrated, albeit in the frequency-domain for Polarization (POL) 2×2 MIMO channel estimation as well as CD estimation. There have been other instances of the useful tool of Golay sequences in the field optical sensing.
There is provided an improved CTO and CFO recovery algorithm for OFDM and its variants, based on the so-called Golay Complementary Codes (GCC), which are pairs of sequences yielding a discrete delta-sequence upon summing up their a-periodic autocorrelations. The novel proposed GCC-based scheme will be shown to outperform the state-of-the-art Minn-based scheme for CTO and CFO estimation purposes as well as provide even lower complexity than D&C techniques—since Golay cross-correlation is multiplier-free, just performing additions, as Golay sequences are ±1-valued.
Our concept has been inspired by our work on fiber sensing, specifically Optical Time Domain Reflectometry (OTDR) [4], wherein the fiber is sequentially probed by pairs of Golay sequences and the optical backscatter “echo” is cross-correlated with the two Golay codewords in turn, separated in time by a guard interval, followed by suitable digital signal processing (DSP) consisting of adding up the cross-correlations of the received signals with the two codewords. In contrast, our multiplier-free GCC estimator is based on time-domain processing, reducing to adding multiplications by ±1 (i.e. signed additions) which is far less complex than frequency-domain Golay processing (which requires complex multipliers with arbitrary values, as the Golay spectra are pseudo-random).
Extra advantages of the proposed GCC estimator beyond being multiplier-free and sharp-peaked (high MSRR and narrow mainlobe, ideally single distinct peak): When used in conjunction with filter-bank based DFT-S OFDM, which is our main interest, the GCC estimator is highly tolerant of CD, PMD impairments but is less tolerant of CFO relative to the Minn, scheme, although its CFO tolerance is sufficient for sub-band processing); Our scheme features a ˜3 dB OSNR advantage with respect to ASE-induced white noise while displays about the same tolerance with respect to laser phase noise; Our scheme supports twice oversampling (as required in under-decimated filter-bank based digital sub-banding) without incurring increased computational complexity due to the processing of interpolated values. It further features decoupled variable frame sizes operation in both its CTO and CFO estimation modes, for flexible overhead.
I. Golay Complementary Codes Review
Given two complex-valued sequences, both assumed infinite, A={Ak}k=−∞∞, B={Bk}k=−∞∞, i.e., defined over the domain of integers, R, then their (a-periodic) cross-correlation (XCOR) is the following sequence:
where {circle around (x)} denotes convolution and denotes XCOR. In particular, the autocorrelation (ACOR) of a sequence A is:
ΓA≡ΓAA=AA=Σk′=−∞∞Ak′+kAk′* (2)
If the two sequences are finite, say, each containing L points, e.g., with support {0, 1, . . . , L−1}, then they are assumed zero-padded to become infinite. Then the support of their XCOR is {−(L−1), . . . , −1, 0, 1, . . . , L−1}, containing 2L−1 points.
Evident properties of the XCOR (and ACOR) are that the XCOR is associative, distributive but not commutative:
BA
†=(AB)†=A†*B (3)
where the conjugate-reflection or para-conjugation operation on a sequence is defined as follows:
s
k
†
[k]≡s*[−k] (4)
The delay property of the XCOR reads (with Dk
(Dk
In particular
(Dk
(Dk
Complementary sequences (CS) are pairs of sequences with the useful property that their out-of-phase aperiodic autocorrelation coefficients sum to zero.
A complementary pair a, b may be encoded as polynomials A(z)=a(0)+a(1)z+ . . . +a(N−1)zN−1 and similarly for B(z). The complementarity property of the sequences is equivalent to the condition |A(z)|2+|B(z)|2=2N for all z on the unit circle, that is, |z|=1. If so, A and B form a Golay pair of polynomials. Examples include the Shapiro polynomials, which give rise to complementary sequences of length a power of 2.
A Golay Complementary Code (GCC) is defined as a pair (GL(1), GL(2)) of sequences of length L (the Golay codewords) with unimodular elements,
|GL(i)[k]|=1,i=1,2;k=0, 1, . . . , L−1
satisfying the following complementary ACOR property:
G
L
(1)
[k]G
L
(1)
[k]+G
L
(2)
[k]G
L
(2)
[k]=2Lδ[k] (7)
with δ[k] the discrete-time impulse. Note that while the ACOR of each GCC codeword may have non-zero sidelobes, once the two ACORs are summed up, their sidelobes perfectly cancel out, while the peak doubles up.
For a power-of-two length L, a ±1-valued GCC may be recursively constructed by concatenating half-length Golay pairs as follows:
G
L
(1)
≡[G
L/2
(1)
,G
L/2
(2)
];G
1
(2)
=[G
L/2
(1)
−G
L/2
(2)] (8)
initialized as G1(1)=[1] and G1(2)=[1].
II. If (G(1), G(2)) is a GCC, then so are (G(2), G(1), (±G(1), ±G(2)), (G(1)†, G(2)†) and (Dk
The suggested GCC-based timing and CFO estimator, illustrated below is referred to as Golay Estimator (G-EST). The transmitter (Tx) repeatedly (at low duty cycle) launches a training sequence (TS) of length NTS, of the form:
g≡{g[k]}
0
N
−1
=D
L
G
L
(1)
+D
L
+L+L
G
L
(2) (9)
where
2Ledg+Lc+2L=NTS (10)
Thus, the proposed TS 40 (
The TS is positioned between data frames 61.
Note that we formally include the two null edge segments of length Ledg each, within the definition of the 4L-points (pnt) support of g, although the elements of these segments are null.
In the preferred implementations, the lengths of the TS as well as that of each GCC codeword are powers-of-two. Thus, to maximize the duty cycle of the GCC codewords within the overall TS, the following length constraints must be satisfied,
N
TS=4L,2Ledg+Lc=2L. (11)
Implying that the TS has 50% duty cycle. It will be shown that the overall G-EST performance is somewhat sensitive to the “guardbands ratio” rGB ≡Lc/Ledg determining the partition between the two guardband types; in the sequel we optimize over this ratio.
We initially analyze the case of ideal noise-free and distortion-free transmission, without oversampling (the Rx samples at baudrate). Singling out a particular TS, we assume a lone TS has been transmitted, preceded and followed by data frames. Initially let us ignore channel impairments, both distortions (assume the sampled linear impulse response is impulsive and assume no nonlinear distortion and noise). Therefore the received sequence r[k] coincides with the transmitted sequence. Let us then express the received signal into the G-EST module as a juxtaposition of three components, namely {data, TS, data}:
r={r[k]}
k=−∞
∞
=d
−
†
+g+D
4L
d
+, (12)
where d−, g, d+ are zero-padded sequences, extending over all R, and we expressed the data subsequences, d−†, D4Ld+ (data respectively preceding and following the TS) in terms of underlying causal sequences d−, d+. We recall that the support of g has duration NTS=4L (see (11)), ranging over {0, 1, . . . , 4L−1}.
The G-EST module cross-correlates the received signal against the transmitted TS, generating the following statistic:
ρ[k]=r[k]g[k]=r[k]{circle around (x)}g†[k]=r[k]{circle around (x)}g[−k] (13)
Very Low-Complexity Implementation
This XCOR operation, referred to as the TS cross-correlator (TS-XCOR) may be simply realized in real-time by means of an FIR filter, with impulse response g[−k] (
The signal ρ[k] generated by the FIR filter (the TS-XCOR output) is split to feed the timing and CFO detector sub-modules. The timing detector consists of a peak position extractor, finding the peaks of the absolute value (squared) of the cross-correlation of the streaming received signal and the Golay Training sequence (G-TS)
The output ρ[k] will be shown to consist of a distinct single-sample peaks, corresponding to the G-TS locations, embedded in some low-level sidelobes. Assuming single shot TS transmission there is a single peak, but as the G-TS is periodically repeated, say every several hundred frames, there will be repeated peaks indicative of the G-TS positions.
Timing Detector
The timing detector may be robustly realized by binary decisions with a certain threshold onto the absolute value |ρ[k]| samples of the TS-XCOR output sequence (or alternatively the absolute-value squared, which may be easier to evaluate). The discrete-time instants when the absolute value of the correlator output exceeds the threshold are declared as timing estimates, indicative of the positions of the TS embedded within the data stream. Other peak finding algorithms are possible.
This completes the description of the GCC based timing estimator, the signal analysis of which is carried out next.
Signal Analysis
Using (12), the TS-XCOR output is expressed as:
ρ=rg=d−†g+gg+D4Ld+g (14)
The presence of guardbands now implies that the supports of the three terms in the RHS of (14) are just partially or not at all overlapping, provided the autocorrelation lag (argument) is not taken to have excessive value. To begin with, let us evaluate the main term gg by expressing the GCC TS as per (9) and using (5) and (3), yielding for the TS ACOR term:
Thus, the Rx additively superposes the sum of autocorrelations with the sum of the cross-correlations. It is also useful to visualize this result graphically (
Now, by virtue of the complementary ACORs property (7), the last equation yields a key result for the TS ACOR:
gg=2Lδ[k]+D−(L
The useful term, used to extract the timing is evidently the impulsive peak 2Lδ[k], whereas the XCOR terms yield a background of sidelobes, which will be shown to be at relatively small levels in comparison with the peak. As the supports of GL(1)GL(2) and its reflection are symmetric around the origin: {−(L−1), . . . , −1, 0, 1, . . . , L−1} and the centers of the two cross-terms in the last expression are offset ±(Lmid+L) from the origin, having respective supports symmetrically positioned around the origin: {Lc−1, Lc, . . . , Lc+2L−1} and {−(Lc+2L−1), . . . , −Lc, −(Lc−1)}.
Thus, the gap separating the two cross-term supports equals
[L−1]−[−(Lc−1)]−1=2Lc−3. (17)
It is within this gap that the peak 2Lδ[k] is embedded, (for Lc<2 there is no gap at all, but even for Lc=0, 1 the peak typically dominates the value of the sum of autocorrelation terms at zero lag). Having evaluated gg and shown that it is essentially impulsive, we must also consider the propagation of the two additional DATA× TS terms in (13) via the TS-XCOR. It turns out that the response due to these terms at the cross-correlator output may have partial or no overlap with the XCOR terms of gg, as may be seen by explicitly evaluating these DATA×TS terms:
Simulations 201 and 203 of
Effect of Channel Memory (Delay Spread)
Heretofore, we have assumed that the discrete-time impulse response h[k] of the linear optical channel is impulsive, h[k]∝δ[k] where cc denotes proportionality. This “digital” impulse response is related to the analog impulse response ha(t) by h[k]=ha(kTs) where Ts is the sampling interval.
The analog impulse response does not have to be an impulse, δ[k] in order for the digital impulse response to be a discrete-time impulse; rather, the support of ha(t) must satisfy support{ha(t)}<Ts i.e., the analog delay-spread must be less than a sample interval. This indicates that the proposed method would work best when this condition is satisfied, but it must not be strictly satisfied, as discussed next. Let us now assume that we have support{ha(t)}≧Ts, i.e. the delay spread exceeds the sampling interval. Now, h[k] contains several non-zero samples. A model taking into account non-impulsive h[k] is readily formulated. The received signal is r[k]=s[k]{circle around (x)}h[k] where s[k] is the overall transmitted signal (TS embedded in data). Eq. (12) is now replaced by S={s[k]}k=−∞∞=d−†+g+D4Ld+, thus the received discrete-time sequence is given by
r=h{circle around (x)}d
−
†
+h{circle around (x)}g+h{circle around (x)}D
4L
d
+ (20)
The output of the TS-XCOR is then given by
Now the cross-terms are convolved with the channel impulse response (which does not appreciably modify them as these terms are essentially pseudorandom) but more significantly, the useful term, containing timing information within (21) is now (using (16)) given by:
Within this term the dominant sub-term is 2Lh, which replaces the 2Lδ term in the memoryless channel case. This indicates that when the discrete-time channel has memory, the Golay timing estimation method actually yields for channel impulse response identification. This may still result in a timing estimate. E.g., in case h[k] peaks at k=0, then finding the maximum may still provide timing information. Notwithstanding this analysis, which indicates that the proposed method may still work for channels with memory, here we are primarily interested in filter-bank based receivers, wherein the proposed timing recovery method is applied on a sub-band basis. In this case the sampling rate of each relatively narrowband sub-band is substantially lower than the sampling rate of the overall channel, hence the condition support{ha(t)}<Ts typically holds, making timing recovery more robust for a sub-bands based receiver, than for a full channel which may reveal multiple densely taken samples of the non-impulsive impulse response h[k].
It is further evident that the proposed Golay based timing recovery is also suitable for channels which exhibit discrete-multipath (channel impulse response consists of a superposition of multiple impulses with various delays and amplitudes).
III. Twice-Oversampled Golay Timing Recovery
Heretofore the analysis was conducted for a baud-rate sampled Rx. Consider now a K-fold fractionally oversampled Rx, in the simplest case a twice-oversampled (2×OS) Rx. A simple conceptual block diagram for the transmission chain comprises an interpolator (a K-fold up-sampler followed by shaping filter) and by an ideal DAC at the Tx, an analog channel, an discrete-time equivalent channel, represented as a linear time invariant analog filter with impulse response ha(t), an ideal ADC followed by a decimator (a V-fold down-sampler preceded by an anti-aliasing filter). The DAC and ADC operate at the elevated sampling rate of V times the symbol rate.
Thus, the cascade ADC->analog_channel_filter->DAC amounts to a discrete time channel hc[n]=hd(nTc) where fs=Ts−1 is the sampling rate at the Tx input and Rx output; the sampling rate within the channel is K times larger,
f
c
=T
c
−1=(Ts/K)−1=Kfs
Now suppose that the sampled impulse response satisfies:
h[k]=h
a(kTs)=ha(kKTc)=hc[kK]∝δ[k] (23)
A sufficient condition for this is that the analog impulse response satisfy support {ha(t)}<Ts, however this is not necessary. Indeed, if the zero-crossings of ha(t) occur at regular intervals, {kTs}k≠0 then (23) is satisfied even when the support of ha(t) is arbitrarily large. In digital terms it is apparent that K-fold sub-sampling hc[n] should yield a response h[k]≡hc[kK] which is essentially a discrete impulse. Under these conditions the K-fold up-sampler and K-fold down-sampler are essentially back-to-back and cancel out, thus the overall transmission chain becomes an identity. Thus, the transmitted baud-rate symbols get reconstructed at the receiver. To the extent that the Nyquist condition (23) is not strictly satisfied then there will be Inter-Symbol Interference (ISI). However if the taps of the impulse response h[k] are not precisely zero for k≠0 but are close to zero, then the ISI is small and in the current context it may modify the complementary ACOR sidelobes somewhat but hardly affect the distinct mainlobe peak which may still remain dominant.
Our proposed Golay timing method was tested to operate using a receiver which oversamples the data by the factor of two, as the filter bank receiver does within each sub-band, provided suitable measures are taken as described below. We also propose an extension for an oversampling receiver which operates with any integer oversampling ratio by simply cross-correlating each of the polyphases of the received K-fold oversampled signal and selecting the XCOR with largest mainlobe. Another case when there may be oversampling used, hence the proposed Golay based timing method is applicable, is for fractionally-equalized single-carrier receiver using a receiver sampling rate which is an integer multiple of that of the transmitted symbol rate.
For definiteness we describe how any twice-oversampled receiver may be equipped with a Golay based timing estimator.
Golay-CTO&FTO Estimator 301 is configured to separate the stream of OFDM symbols that is twice-oversampled separated into even and odd OFDM symbols (that form odd and even polyphase subsequences), by means of a 1:2 serial-to-parallel (S/P) module (a two-state commutator) 250. The even and odd polyphase sequences are input into respective Golay-Training-Sequence Cross-correlators (G-TS-XCOR) modules 100. These resulting cross-correlation streams are input in the coarse timing estimator 360 that may choose the largest-time index, which selects the largest time index both over time and between the two XCOR sequences. Thus, the highest XCOR mainlobe (peak) of the two is selected (typically the XCOR mainlobes of the even and odd sequences will occur at the same time index (or one index off), and the index corresponding to the largest of the two XCOR peaks provides the CTO estimate. We conclude that the CTO timing is determined by the stronger G-TS-XCOR peak in the two even and odd data sub-streams.
Curves 421m 422, 423 and 424 of
To describe the internals of the fractional timing offset estimator 370, one may develop an analytic expression or numerical graph for the difference between the two XCOR peaks of the even and odd polyphases as a function of the FTO and this functional dependence may be inverted to obtain an FTO estimate as a function of the difference in absolute values or absolute values squared of the two mainlobe peaks. The FTO estimate may then be used to set up the linear phase vs. frequency index slope in the 2×2 MIMO equalizer of the sub-band receiver.
However, typically it suffices to obtain an odd function of the difference between the absolute squares of the two peaks, which describes an odd-valued function of the FTO in order to effectively stabilize the FTO by feeding a PLL-like loop controlling the ADC sampling clocks by means of the difference between the absolute squares of the two peaks.
The timing estimator described in the sequel is then proposed for the following generic oversampled system with under-decimation as depicted in
The proposed GCC-based timing estimator consists of a 1:K serial-to-parallel (S/P) module, followed on each of its parallel arms by a TS-XCOR module, performing the correlation with the GCC-based TS at the slowed down sampling rate (at baud-rate). The outputs of all cross-correlators are compared and the path with the highest absolute value of cross-correlation is selected (first the peak over discrete time is selected for each path, then the highest absolute value is selected among all polyphases (S/P outputs)). Thus, this method may be described as per-polyphase-cross-correlation.
In the special case that K=2 (twice-under-decimated system) then the 1:2 S/P effectively extracts the even and odd sub-sequences of the K-down-sampled output and baud-rate TS-XCORs are performed on each of these two sub-sequences. In this case, it is possible to estimate the fractional delay of the channel from the relative levels of the mainlobes of even and odd TS-XCORs. As shown by the simulation of
Simulation Results for Golay Timing of Twice-Oversampled Rx
Graphs 203, 204, 205, 206, 205′, 206′ of
[GHz]. For our novel training sequence 32 tones out of the 64 were used as two Golay sequences as described before, whereas the rest of the tones were zeros as per Eq. (2).
These simulations were conducted for the twice-oversampled receiver of
The three figures differ in their usage of linear or dB scale and in the horizontal range. It is apparent that the Minn algorithm timing peak is not very sharp and one can images that values slightly off-peak may be affected by noise spikes such that they are mistaken for the peak. Another notable effect is that the Golay sidelobes are depressed in the local spectral environment of the peak facilitating peak discrimination.
IV. CFO Tolerance
In this sub-section we consider the tolerance of the proposed timing estimator to the presence of Carrier Frequency Offset (CFO). In this case a simple analytical model may be set up, indicating that CFO reduces the level of the correlation mainlobe (peak) and accurately predicting the dependence of the peak level on ΔvCFO. It turns out that modification of the sidelobes level is quite small in the wake of CFO, as borne by simulations, hence the CFO tolerance is essentially determined by the peak level roll-off with increased ΔvCFO. First we present the result of CFO tolerance simulations (
The precise mainlobe expression is:
To understand this result intuitively, consider first what occurs at the moment of alignment of the transmitted and received G-TS sequences (
for the first Golay codeword and
for the second Golay codeword. Summing all the complex phase factors and taking the absolute value squared yields the result of (25).
One operational conclusion from this expression for the mainlobe power degradation as a function of the CFO, is that it is worth decreasing the gap Lc between the two Golay codes in order to make the cosine function roll-off more mildly (graphs 214 and 215 of
V. Tolerance to Additive White Noise (Amplified Spontaneous Emission) and to Laser Phase Noise.
AWGN (Additive White Gaussian Noise)
In D&C methods are based on a cross-correlation between parts of the received frame. A result of one multiplication between samples of time k and k−L the are three noise terms:
When applying the suggested method there is only one noise term g[k]*n[k].
On the other hand just half the frame is filled up.
For low SNR the term n[k]n*[k−L] is dominant
The tolerance to white noise is about the same as for the Minn algorithm, with an advantage for Golay timing in low SNR (in which case the noise×noise term, present in Minn but not in Golay, becomes more pronounced). The reason the two methods have the same white noise tolerance in high SNR is that on one hand Minn presents two noise×signal terms (contributing equal noise powers), whereas Golay presents a single such term, but on the other hand Golay is sparse, with just half the frame being filled up, hence there is a factor of two less noise averaging in the Golay case, which offsets the fact that it comprises a single noise×signal term. In detail, the conjugate products arising in the Minn case are of the form
which yields 3 noise terms (for each multiplication):
Our proposed method uses an internal noise free sequence, g[n], thus one multiplication of the correlation process becomes (x[k]+n[k])g[k] yielding only one noise term: g[k]n[k].
As for tolerance to laser phase noise (LPN) (graph 216 of
VI. Golay Based CFO Estimation
The CFO active estimation using the Golay based system is related to the analysis of CFO degradation analysis. We have seen that at the instant when the correlation peak occurs, the received time samples {−L/2−1, . . . , −1, 0, 1 . . . L/2−1} are perfectly aligned with the TS version stored in the taps of the cross-correlator. If there were no noise and distortion, then the received signal would equal the transmitted signal, i.e., the TS. The 2L non-zero taps of the FIR filter implementing the TS XCOR, would then invariably generate (±1)(±1)=1. Thus we would obtain two records of all-ones, each of length L, corresponding to the two GCCs. However, if there is CFO, then the received TS over the support of L points corresponding to each GCC, would consist of successive samples (±1) multiplied by the CFO time-domain phase-ramp factor ejθ
θCFO=2πΔvCFOTs=2πΔvCFO/fs (26)
thus, the received sequence would be (±1)ejθ
(±1)ejθ
Thus, at the optimal timing instant, we ideally obtain a constant (unity in this example) amplitude sequence with a phase tilt. To determine the CFO, it remains to estimate the phase increment per sample, θCFO, as then (26) may be used to extract
ΔvCFO=fs(θCFO/2π) (28)
Various methods may be used to extract the CFO phase increment, e.g. one may evaluate the phase angles of the TS-XCOR samples at the alignment time (L samples associated with the two respective Golay codewords, separated by Ledg samples), by inputting each complex sample into an angle extractor (e.g. the CORDIC algorithm) and “passing a line”, based on a least-squares solution, through the measured phases, albeit after the raw measured phases have been unwrapped. An alternative preferred method is to divide each of the L points into S subsets of L/S points (where L/S is integer). Each such subset is averaged (its samples are summed up and divided by L/S) and the sub-sets are grouped in successive pairs and the phase of each even subset is compared with that of the odd subset within that pair, i.e. the phase difference is extracted, e.g. by taking conjugate products and evaluating the phase angle, or alternatively by evaluating the phase-angles and subtracting.
In the particular case that S=L, then the “subsets” becomes singletons (single samples). In this case the phase of each complex odd sample is subtracting from its preceding odd sample, and the phases are averaged out. It is also possible to further average over the average phases corresponding to the two Golay codewords and in a sub-banded filter bank based context to further average across sub-bands, as all sub-bands are assumed to be affected by common CFO. This method requires long averaging and is relatively impervious to laser phase noise as the subtraction of successive phases of the subgroups effectively whitens the phase noise, hence the whitened phase noise is amenable to averaging. The longer the averaging window or the more windows are averaged at the top level (averages of averages) the better the laser phase noise and white noise tolerance of the CFO estimator. Another measure which may improve laser phase noise tolerance in the sub-banded context, is to use “non-redundant interleaving” of the sub-bands as described in the sequel of this patent application.
Comparison with Delay&Correlate Methods Such as Minn
The operation principle of CFO estimation using D&C methods is by taking the angle of the correlation, from which the CFO can be estimated. Denoting s [n] as the complex received symbols, the correlation of
identical parts of the frame is given by:
Thanks to the conjugate operation the common phase of the entire frame is cancelled, and the phase rotation due to CFO remains. The phase rotation is proportional to the time difference between the two samples s[n] and
which is
regardless of n (Ts is the receiver sample time). CFO estimation is possible by finding the phase of the vector
in the I-Q plane:
where the last equality is true in case there is no noise (fCFO denotes the CFO). The extraction of the CFO is done by dividing equation (3) by the factor
Dividing by
has a beneficial outcome as it reduces the variance of the noise, yet it also reduces the dynamic range of the CFO estimation. The largest positive phase that can be detected is π, which leads to the following equality:
The largest N is, the smaller fCFOMax becomes (smaller dynamic range). D&C methods have a fix dynamic range.
At the moment of perfect alignment between the transmitted training sequence and the internal training sequence, the multiplication of the two frames results only in the values of ±1. We can choose using any set of the identical symbols, which implies having a much more flexible dynamic range of the CFO estimation, than in Minn's or S-C's. Controlling the number of symbols,
in (5), gives freedom in compromising between dynamic range and noise averaging.
We note that our timing correlation is less resistant to CFO the Minn's, since we use an internal frame and not correlating two halves of a received frame, yet it shows one distinct peak even for CFO values of 100 [MHz], as will be elaborated next.
Golay Based Methods of CFO Estimation
In
Next, figures illustrate 25-27 CFO estimation circuits in which the Golay codewords G1 and G2 records are combined together for the CFO estimation
These multiple embodiments demonstrate that the Golay based timing has large leeway in how to organize the extraction of the incremental phases. This is in contrast with Minn CFO estimation which is quite constrained to operate with strict separations for the PDs.
The various embodiments 20-32 represent various tradeoffs between ASE and laser phase noise performance, CFO dynamic range and realization complexities.
VII. Polarization-Diverse Operation
It suffices to launch the G-TS in just one state of polarization (say along the X or Y polarization axis) in the Tx, since the polarization transformation along the fiber implies that the powers and relative phases of the two received G-TS components in the two polarizations (POL), will be randomized anyway. To attain resilience of the timing to the POL evolution in the fiber, it is proposed to use a POL-diversity technique, whereby in each sub-band that G-TS are launched is received at the filter-bank outputs for both POL corresponding to the particular sub-band and coherent combining of the two XCORs is performed in order to increase the SNR prior to determining the position of the peak (mainlobe).
The proposed novel timing offset estimation is manifested in a highly distinct correlation peak, impervious of channel impairments, allows for CFO estimation over a large dynamic range and outperformance previous method in the sense of the clear peak it displays and the AWGN resilience. The implementation requires no multipliers which is a benefit in both area and cost. To recap the Golay based channel estimation algorithm advantages:
for filter-bank receivers
than Minn and Schmidl-Cox, enabling a variety of tradeoffs between dynamic range, white noise and laser phase noise tolerances and complexity.
Method 400 may start by step 410 of include receiving a stream of OFDM symbols.
Step 410 may be followed by step 420 of searching, by a timing circuit, in the stream of OFDM symbols, for a training sequence that comprises a first Golay codeword and a second Golay codeword.
The sum of an autocorrelation of the first Golay codeword and an autocorrelation of the second Golay codeword consists essentially of a delta function. See, for example,
Step 420 may be followed by step 440 of calculating frequency offset.
To estimate timing we are going to send 2 complimentary Golay sequences. Golay sequence can be built recursively in the following way:
A
n
=[A
n−1
B
n−1
]B
n
=[A
n−1
−B
n−1]
For example:
An=1, Bn=−1
A2=[1 −1], B2=[1 1]
A3=[1 −1 1 1], B3=[1 −1 −1 −1]
We send the following training frame over all sub-band of the X polarization: [A32B32].
We send the following training frame over all sub-band of the Y polarization: [A32−B32].
During channel propagation X-polarization and Y-polarization are mixed. This is the reason we must deal with both polarization to get reliable timing estimation.
The receiver includes square absolute value unit 502, running average module 504, multipliers 510, first adder 521, second adder 522, first square absolute value unit 531, second square absolute value unit 532, adder 540 and peak search unit 550.
Let's define the following values:
To achieve that in order to achieve the perfect autocorrelation property of the Golay sequence we must add the correlation over different Golay sequences:
C
X
□A
corr
+B
corr
C
Y
□A
corr
+B
corr
But in real system the signal over different polarizations are multiplexed. We must perform correlation with X-pol sequence and Y-pol sequence at each receiver polarization. To reduce complexity we propose the following overall cost function:
Corr□CX2+CY2=(Acorr+Bcorr)2+(Acorr−Bcorr)2=Acorr2+Bcorr2
From the following equation it is evident that we only need single operation to get reliable correlation value over any polarization rotation angle.
Graph 610 of
As we can see Golay based (PN sequence) based timing estimation gives as much clearer results compared to previously implemented Minn timing method.
Graphs 620, 630, 640 and 650 of
Peak detect mechanism should be able to give reliable peak reading in all these cases. The following algorithm is proposed: Choose the first peak higher than given threshold, and
The threshold value should be adaptive and be determined according to the average power of the input data signal.
Moreover, the terms “front,” “back,” “top,” “bottom,” “over,” “under” and the like in the description and in the claims, if any, are used for descriptive purposes and not necessarily for describing permanent relative positions. It is understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the invention described herein are, for example, capable of operation in other orientations than those illustrated or otherwise described herein.
Those skilled in the art will recognize that the boundaries between logic blocks are merely illustrative and that alternative embodiments may merge logic blocks or circuit elements or impose an alternate decomposition of functionality upon various logic blocks or circuit elements. Thus, it is to be understood that the architectures depicted herein are merely exemplary, and that in fact many other architectures may be implemented which achieve the same functionality.
Any arrangement of components to achieve the same functionality is effectively “associated” such that the desired functionality is achieved. Hence, any two components herein combined to achieve a particular functionality may be seen as “associated with” each other such that the desired functionality is achieved, irrespective of architectures or intermediate components. Likewise, any two components so associated can also be viewed as being “operably connected,” or “operably coupled,” to each other to achieve the desired functionality.
Furthermore, those skilled in the art will recognize that boundaries between the above described operations merely illustrative. The multiple operations may be combined into a single operation, a single operation may be distributed in additional operations and operations may be executed at least partially overlapping in time. Moreover, alternative embodiments may include multiple instances of a particular operation, and the order of operations may be altered in various other embodiments.
Also for example, in one embodiment, the illustrated examples may be implemented as circuitry located on a single integrated circuit or within a same device. Alternatively, the examples may be implemented as any number of separate integrated circuits or separate devices interconnected with each other in a suitable manner.
However, other modifications, variations and alternatives are also possible. The specifications and drawings are, accordingly, to be regarded in an illustrative rather than in a restrictive sense.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word ‘comprising’ does not exclude the presence of other elements or steps then those listed in a claim. Furthermore, the terms “a” or “an,” as used herein, are defined as one or more than one. Also, the use of introductory phrases such as “at least one” and “one or more” in the claims should not be construed to imply that the introduction of another claim element by the indefinite articles “a” or “an” limits any particular claim containing such introduced claim element to inventions containing only one such element, even when the same claim includes the introductory phrases “one or more” or “at least one” and indefinite articles such as “a” or “an.” The same holds true for the use of definite articles. Unless stated otherwise, terms such as “first” and “second” are used to arbitrarily distinguish between the elements such terms describe. Thus, these terms are not necessarily intended to indicate temporal or other prioritization of such elements The mere fact that certain measures are recited in mutually different claims does not indicate that a combination of these measures cannot be used to advantage.
While certain features of the invention have been illustrated and described herein, many modifications, substitutions, changes, and equivalents will now occur to those of ordinary skill in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.
This application claims priority from U.S. provisional patent Ser. No. 62/020,467 filing date Jul. 3, 2014, which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IL2015/050688 | 7/2/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62020467 | Jul 2014 | US |