The present invention relates to the field of optical data communication. More particularly, the invention relates to a method and system for performing efficient blind maximum-likelihood channel and data equalization in high speed optical communication channels.
The task of joint channel and data estimation without a training sequence is of high importance for Maximum Likelihood Sequence Estimation (MLSE) processing, as widely described in prior art. MLSE processing is suitable for fiber optical communication systems. Especially, due to the presence of Polarization Mode Dispersion (PMD—modal dispersion where two different polarizations of light in a waveguide, propagate at different speeds, causing random spreading of the optical pulses), the optical channel is considered to be non-stationary and adaptive equalization is required. When the histograms, which serve as channel estimators, are updated faster than the channel variation rate, successful variations tracking can be achieved.
Maximum Likelihood Sequence Estimation (MLSE) is considered to be a non-linear equalization technique. To explain the main idea that is behind the MLSE processing, trellis diagrams are often used. The example of 4-state trellis diagram is presented in
An MLSE processor chooses the path with the smallest metric, and produces the most likely sequence by tracing the trellis back. For a sequence of length N, there are 2N possible paths in the trellis. Therefore, an exhaustive comparison of the received sequence with all valid paths is a cumbersome task, becoming non feasible for channels with a long memory. However, since not all paths have the similar probabilities (or metrics) when proceeding through the trellis, there is an efficient known algorithm, called Viterbi algorithm (is a dynamic programming algorithm for finding the most likely sequence of hidden states), that limits the comparison to 2K “surviving paths”, where K is the channel memory length, independent of N, making the maximum likelihood principles to be practically feasible.
From two paths entering the trellis node, the path with the smallest metric is the most probable. Such a path is called the “surviving path”, and only surviving paths with their running metrics need to be stored.
Maximum likelihood sequence detection is the most effective technique for mitigating optical channel impairments, such as Chromatic Dispersion (CD—the dependency of the phase velocity of an optical signal on its wavelength) and Polarization Mode Dispersion (PMD). In order to successfully apply this technique, it is mandatory to estimate some key channel parameters, needed by the Viterbi processor.
Conventional channel estimation methods can be classified as parametric and non-parametric. Parametric methods assume that the functional form of the Probability Density Function (PDF—a function that describes the relative likelihood for this random variable to take on a given value) is known, and only its parameters should be estimated. However, non-parametric methods do not assume any knowledge about the PDF functional form or its parameters. There are two most common methods, used in practice for channel estimation: Method of Moments (MoM—a way of proving convergence in distribution by proving convergence of a sequence of moment sequences) and Histogram Method (of estimation).
Method of Moments is considered to be parametric, and therefore, it assumes that the functional form of the PDF is known and only its moments need to be estimated. When the dominant noise mechanism in the optical system is thermal, like in optically unamplified links, the conditional PDF of the received sample xn, given that μk is transmitted, is assumed to be Gaussian with σn2 being the variance of the noise:
In this case, only first and second moments need to be estimated. Another case of interest is Amplified Spontaneous Emission (ASE) limited channel. The noise in such a channel becomes signal dependent and the functional form of the conditional PDF of the received sample xn, given that μk is transmitted, can be approximated by a non-central Chi-square distribution with v degrees of freedom:
Where I{•} is the modified Bessel function of the first kind and N0 is power spectral density of the ASE noise given by:
where nsp is the spontaneous emission factor (or population inversion factor), G represents the EDFA gain,
is the photon energy at the wavelength λ0, h being the Plank constant and c being the speed of light. It is clear that in Eq. b, N0 and v need to be estimated.
The histogram method does not assume anything about the PDF of the received samples. According to this method MN
The branch metrics are obtained by taking the natural logarithm of the estimated/assumed PDF. For a transmitted sequence of length N the MLSE decoder chooses between MN possible sequences that minimize the (path) metric:
The estimated bit sequence is determined by tracing the trellis back, based on the minimal path metric of Eq. c.
In optical fiber systems, the purpose of the MLSE is to overcome ISI stemming from CD and from PMD. While CD is a deterministic phenomena for a given link, PMD is stochastic in nature, and therefore, an adaptive equalizer that performs PMD tracking is required. Moreover, the adaptation properties of the MLSE can be also exploited for CD compensation when the amount of CD is not accurately known. Basically, expensive tunable optical dispersion compensation may be replaced by the adaptive MLSE. This type of operation, without knowing any initial information about the channel parameters and distortion is called “blind equalization”.
The constant growth in the demand for high bandwidth data transmission leads to higher challenges that should be resolved in the physical layer, and particularly by optical transmission technology.
The current high end transmission data rates are in the range of hundreds of Gbits/sec. One emerging technology that can support such bitrates for long distances (hundreds of kilometers and above) is coherent transmission and detection. On the other hand, direct detection technology offers the use of lower cost optoelectronic components, consumes less power and enables overall lower latency solution. These advantages may be critical for short reach applications such as sub-hundred kilometers networks of metro-edge and data centers interconnections.
The simpler alternative, (non-coherent) direct detection optical technology is of lower cost, but is limited to lower bit rates and/or shorter distances. For example, increasing the bitrate from 10 Gbit/sec to 25 Gbit/sec, results in distance reduction from ˜80 km to ˜15 km, for the same Bit Error Rate (BER) performance. The main reason for this limitation is the inter-symbol interference (ISI—a form of distortion of a signal in which one symbol interferes with subsequent symbols) caused by chromatic dispersion (CD) and Polarization Mode Dispersion (PMD). Two techniques are commonly used in order to overcome this ISI. The first technique is based on advanced modulation formats, together with partial response signaling, while the other approach is based on Digital Signal Processing (DSP), applying Electronic Dispersion Compensation (EDC—a method for mitigating the effects of chromatic dispersion in fiber-optic communication links with electronic components in the receiver). The EDC implementations with Maximum Likelihood Sequence Estimation (MLSE) at the receiver (Rx) side, is theoretically the optimal tool to combat ISI, this was very popular for 10 Gbis/sec. The combination of the two approaches is also possible, and was also theoretically investigated for 4×25 Gbits transmission with the use of reduced bandwidth components.
The prior art methods described above for performing equalization of an optical channel are not suitable for blind equalization, since they either require training sequence, or have a low convergence rate, or involve high implementation complexity. Moreover, they require using real data signals to converge, which results in a relatively high initial Bit-Error-Rate (BER—the percentage of bits that have errors relative to the total number of bits received in a transmission) and data loss.
Also, blind channel estimation for the MLSE receiver for direct detection systems allows upgrading the current 10G systems to 100G (4×25G) systems, with extended reach of up to 40 km uncompensated links. The task of joint channel and data blind estimation without a training sequence in hand for optical communication with direct detection is of high importance for MLSE processing. Most of the prior art methods deal with a steady state operation, i.e. the tracking mode. However, the blind estimation of the optical channel suitable for the acquisition/initialization stage is less covered. Although various MLSE acquisition methods exist, most of them either require a training sequence, or have a low convergence rate, or involve high implementation complexity.
It is therefore an object of the present invention to provide a method for performing blind equalization of an optical channel without requiring a training sequence.
It is another object of the present invention to provide a method for performing blind equalization of an optical channel with a high convergence rate.
It is still another object of the present invention to provide a method for performing blind equalization of an optical channel which is easy to implement.
It is a further object of the present invention to provide a method for performing blind equalization of an optical channel which does not require using real data signals to converge.
Other objects and advantages of the invention will become apparent as the description proceeds.
The present invention is directed to a method for performing blind channel estimation for an MLSE receiver in high speed optical communication channel, according to which Initial Metrics Determination Procedure (IMDP) is performed using joint channel and data estimation in a decision directed mode. This is done by generating a bank of initial metrics that assures convergence, based on initial coarse histograms estimation, representing the channel and selecting a first metrics set M from the predefined bank. Then an iterative decoding procedure is activated during which, a plurality of decision-directed adaptation learning loops are carried out to perform an iterative histograms estimation procedure for finely tuning the channel estimation. Data is decoded during each iteration, based on a previous estimation of the channel during the previous iteration. After checking whether the resulting metrics are converged (e.g., by using a Z-test), if convergence is not achieved, the next metrics set is selected from the bank. If convergence is achieved, ISI optimization that maximizes the amount of ISI that is compensated by the MLSE is performed. If the initial metrics bank is run out of metrics sets, the IMDP is repeated. Finally, a tracking mode is performing, using the decision-directed adaptation loops as tracking loops.
The bank of initial metrics may generated by transmitting a training sequence and generating several metric sets for different channel conditions, based on Method of Moments (MoM) or by a set of FIR filters, each of which having a FIR length which is determined by the memory depth of the MLSE engine.
Data may be decoded by a decoder that is designed such that the channel memory length is at most as the memory length of the decoder.
The FIR filter may be approximated by either pre-cursor dominating ISI filter, post-cursor dominating ISI filter or symmetrical ISI filters.
The convergence tendency of the histogram set may be monitored by using the sampled standard deviation of the central moments after a predetermined number of iterations, or based on training a sequence.
The ISI optimization may be performed by collecting several channel estimates, while each time, setting a different Match Point (MP)-shift between the stream of ADC samples and the stream of the corresponding decision bits and then, selecting the MP-shift that yields the minimal variances-average of the histograms.
The above and other characteristics and advantages of the invention will be better understood through the following illustrative and non-limitative detailed description of preferred embodiments thereof, with reference to the appended drawings, wherein:
a (prior art) illustrates an example of 4-state trellis diagram
b shows the architecture of blind MLSE equalizer;
a-3c show FIR filter examples for generation of metrics bank;
a and 5b illustrate two normalized histograms during Phase #1 of IMDP, for a channel memory depth of one symbol;
a-7c show IMDP convergence monitoring during phase #2;
d shows IMDP convergence monitoring during phase #3;
a-8e show the results of ISI optimization during phase #4;
a-10d present Phase #2 of the IMDP, which consists of 8 iterations;
a-11d show the convergence process during phase #2 of the IMDP, for a 40 km long optical link;
a-12e show the histogram sets representing the outcome of the ISI optimization Phase #4 of IMDP for a 40 km long optical channel; and
The present invention proposes a novel, simple and fast blind channel estimation method for direct-detection optical systems, based on blind channel acquisition algorithm, for MLSE equalization in high speed optical communications. It performs joint channel and data estimation in decision directed mode.
The blind channel acquisition algorithm is referred herein as Initial Metrics Determination Procedure (IMDP). The initialization of the IMDP is based on the approximate Discrete Time Equivalent (DTE) model, exploiting the most relevant physical properties of the fiber and the nonlinear photo-detector.
For a non-coherent system, maximum likelihood sequence estimation is proven to be the most effective stochastic technique for mitigating optical channel impairments such as chromatic dispersion and polarization mode dispersion. While CD is a deterministic phenomenon for a given link, PMD is stochastic in nature, and therefore an adaptive equalizer that performs PMD tracking is required for proper estimation. Moreover, the adaptation properties of the MLSE can be also exploited for CD compensation when the amount of CD is not perfectly known. Basically, expensive tunable optical dispersion compensation may be replaced by the adaptive MLSE. To ensure sufficient tracking, the adaptation rate must be fast enough, comparing to temporal variations of the channel. Since PMD changes in the scale of 100 μsec-1 msec, the adaptation rate must be at least ten times faster, meaning that every 10 μsec a new channel estimation must be obtained.
The channel estimates are called metrics, and are obtained by taking the (negative) logarithm of the conditional probability density functions (PDFs) of the received samples rn given the transmitted sequence [an, an-1, . . . an-N
M
l(rn|an,an-1, . . . ,an-N
where V represents the vocabulary size at the receiver (Rx) side.
The key idea of the MLSE processor is to choose the path γopt with the smallest running metric γl(k) among VN candidate sequences of length N:
and produce the most likely sequence by tracing the trellis back. Practical implementations often resort to the computationally efficient Viterbi algorithm. Here, the Histogram Method is used to approximate the PDFs in [Eq. 1]. Since blind equalization is pursued, the histograms are collected in decision directed manner, as shown on
In
H={H
l(rn,|an,an-1, . . . ,an-N
H
l(rn|an,an-1, . . . ,an-N
The signal is quantized to NADC bits; therefore, each histogram consists of at most 2N
M={M
l(rn|an,an-1, . . . ,an-N
are obtained, thereby forming the current channel estimate. In the steady state (tracking mode), the histograms, and thus the metrics, are updated iteratively, based on the observed data.
The algorithmic flowchart of the blind MLSE acquisition stage, referred herein as Initial Metrics Determination Procedure (IMDP), is illustrated in
Direct detection optical channel systems are nonlinear in nature, mainly due to the square-law operation in the photo-detector and the intensity dependence of the fiber refractive index (the Kerr effect—a change in the refractive index of a material in response to an applied electric field.). Thus, the noiseless incoming sample is represented by a nonlinear combination of transmitted symbol an and past Nisi(channel) symbols:
For the purposes of coarse channel estimation, it is assumed that the predominant nonlinearity comes from the square-law detection, and the fiber non-linearity Kerr effect can be neglected. At the photo-detection input point, the Discrete Time Equivalent model (DTE) accounting for the transmitter shaping, Optical Fiber (OF), CD and first order PMD, is given by:
is the discrete Kronecker delta function, and ‘*’ denotes the convolution operation. The effect of first order PMD in [Eq. 5] is represented by a discrete time 2×2 diagonal matrix with power splitting coefficient γ and Differential Group Delay (DGD the difference in propagation time between the two eigenmodes X and Y polarizations.) τ. In order to be compatible with the DTE model, τ in [Eq. 5] is rounded up to the nearest value which is multiple of the symbol duration. It should be stressed here, that the latter adjustment does not represent the exact PMD behavior, but is certainly sufficient for the purpose of coarse channel estimation, pursued here to obtain only a starting point for the initial MLSE metrics. Chromatic dispersion can also be represented by a Finite Impulse Response (FIR) filter with NCD taps (the filter length):
where c is the speed of light, λ0 is the wavelength of the optical carrier, CD is the amount of chromatic dispersion and fs is the sampling frequency. By denoting the scalar part of HDTETx+fiber [n] by
ψ[n]hCD[n]*hTx[n]*hOF[n] [Eq. 7]
the signal at the photo-detector input can be written as:
x[n]=(HDTETx+fiber[n]*an)+zASE[n]=sn+zASE[n] [Eq.8]
where zASE[n] is the Amplified Spontaneous Emission (ASE—light, produced by spontaneous emission, that has been optically amplified by a laser source) noise vector coming from optical amplifiers (in both polarizations) and sn is the DTE signal component given by:
where NCh represents the length of scalar impulse response ψ[n] in units of symbol duration:
N
Ch
=N
CD
+N
Tx
+N
OF−2 [Eq. 10]
where NTx and NOF are the impulse response lengths of the transmitter (Tx) and optical filter respectively. Similarly, the overall length of the channel impulse response (including the PMD effect is Noverall=Nch+τ.
The recorded signal at the Photo-Detector (PD) output, is given by:
u
n
=R·(Tr{sn·snH})*hRx[n]+wn=rn+wn [Eq.11]
where ‘Tr’ denotes the trace operation, ‘H’ represents the Hermitian conjugate operation, R is the PD responsivity, hRx[n] is the photo-detector electronic impulse response, and wn represents all the noises present in the system: signal-spontaneous, spontaneous-spontaneous, thermal, shot and dark current. The expanded expression of the signal term accounting for the trace operation:
y
n
Tr{s
n
·s
n
H} [Eq.12]
is given by:
where e is the real part of the complex signal, and “′” designates the complex conjugate. Thus, according to [Eqs.11-14] the operator
in [Eq.4] is given by.
Equations [Eqs.11-14] will be used in the following sections to derive a coarse FIR approximation of the function
which is shown to be a good initial guess for the initialization of the MLSE acquisition process.
The key function that enables the blind MLSE processing is the proper definition of the metrics bank {M(j), j=0, . . . , Jmax−1}, which allows operation in decision directed mode. These can be obtained by preparing a predetermined metrics bank, for example by transmitting a known data (training sequence) followed by generating and storing several metric sets for different channel conditions, as described in
The present invention proposes a novel approach for the definition of the metrics bank based on Method of Moments (MoM), combined with knowing the physical behavior of the optical fiber. Since only coarse channel representation is needed, it may be assumed that the branch histograms Hl (rn|an, an-1, . . . an-N
In this case, there are VN
Based on the argumentation above, the problem of selecting the proper set of metrics bank can be formulated as follows: Finding the set of VN
M
(j)=√{square root over (2π)}σj−({tilde over (r)}n−μj)2·/(2σj2),j=0, . . . ,Jmax−1 [Eq.16]
where the ‘·/’ represents the element-wise (Matlab-like) vector division operation.
The values μj can be determined by the FIR approximation of the operator Γ(•) given by [Eq.15]. Without loss of generality, the following analysis is restricted to the simplest On-Off-Keying (OOK) modulation format, i.e., V=2.
It is assumed that the Non-Return-to-Zero (NRZ) shaping pulse at the transmitter (Tx) is represented by the following impulse response hTx[n]=K1δn in the DTE model, where NTx=1 in [Eq.10], and the constant K1 depends on the transmitted power. It is also assumed that the bandwidth of the optical filter is wide enough, such that at the sampling point, the DTE impulse response of the OF is hOF[n]=K2δn, where NOF=1 and K2 depends on the OF shape.
In practice, the length of hOF[n], NOF, may be longer than a single symbol duration, especially in the environment of concatenated optical filtering (with optical add drop multiplexers). Consequently, according to [Eq.10] the length of the scalar impulse response is dominated by the length of hCD [n], NCD and [Eq.7] can be rewritten as:
Using similar argumentation, it can be assumed that hRx[n]=K3δn. For an OOK format an=|an|2, and substituting [Eq.6], [Eq.14] and [Eq.17] to [Eq.15] yields:
where KRK12K22K3W, K>0 is the non-negative proportionality constant that depends on the responsivity and shapes of the transmitter (Tx), optical and receiver (Rx) filters.
The first two terms of [Eq.18] represent the linear part of rn, and can be regarded as the sum of the responses of two FIR filters with rectangular shapes, relatively delayed by τ:
Thus, as a first order approximation of Γ(•), a metrics bank may be defined by quantizing γ and building all possible combinations of the coefficients, corresponding to various delays τ. The last two terms of [Eq.18] account for nonlinear interaction between the transmitted symbols, and may be viewed as a data dependent FIR filters, whose coefficients are proportional to cos(π·W·(k−l)2):
These two terms contribute to the overall sum only when the corresponding data-dependent coefficients are non-zero. On average this filter can be approximated as:
where E{•} represents the mathematical expectation operator. For OOK modulation format E{an}=0.5, thus on average the contribution of the last two terms in [Eq.18], is at most half of the first two terms.
[Eq. 19] and [Eq. 20] summarize the exact mathematical model of the overall channel DTE FIR. For pragmatic acquisition purposes a coarse approximation is proposed. A closer examination of [Eq. 19] and [Eq. 20] reveals that while [Eq.19] represents rectangular shape, [Eq.20] represents the sum of half period cosine terms multiplied by the random data samples. It can be shown empirically (by plotting the sum of [Eq.19] and [Eq.20] for various data, CD and PMD values) that the FIR-equivalent filter can be approximated by either pre-cursor dominating ISI (next bit effect), post-cursor dominating ISI (previous bit effect) or symmetrical ISI filters.
Consequently, the bank of metrics , can be generated by the following set of FIR filters, {bj, j=0, . . . , Jmax}, where bj is given by:
The indexes in [Eq.22] are summarized as follows: j represents the serial number of each element in the proposed set of FIR set , n represents the discrete time axis of the impulse response bj, and m is related to the impulse response length. The number of coefficients in each element of (the FIR length) is determined by the memory depth of the MLSE engine, Nisi. The design parameter c in [Eq.22] describes the distribution of ISI in each element of (the FIR shape), which, in turn, determines the value of the mean vector μj in [Eq.16]. The actual FIR shape is found to be less critical since only coarse channel model is required for the acquisition stage. Therefore, its value is selected to optimize implementation complexity. In this work, c=2 was used, and satisfactory results are obtained as presented in the below examples.
a-3c illustrate examples of FIRs corresponding to each line in [Eq.22] for c=2 and Nisi=4, for j=0 (increasing exponent) j=4 (decaying exponent and for j=8 (symmetrically decaying exponent), respectively.
Practically, the MLSE decoder memory length is typically small (Nisi<5), and the number of elements in is finite and not too large. For example, in the ASIC, Nisi=4, resulting in Jmax=10 matrices in the bank as dictated by [Eq.22]: 4 matrices with pre-cursor ISI, 4 with post-cursor ISI and 2 with symmetric ISI behavior. The overall acquisition time of the IMDP, in the worst case (when all matrices in the bank should be examined) increases linearly with Jmax.
The mean vectors μj in [Eq.16] can be obtained using [Eq.22] as follows:
μj=A·bj·(2N
where, in the simplest case, A is the VN
Similarly, the vector of variances values σj2 is calculated as follows:
σj2=S·bj2 [Eq.24]
where S is the VN
For example, taking V=2, Nisi=1, the matrices A and S have the following form:
where Var(‘0’) and Var(‘1’) are determined according to the worst case OSNR the system is designed to tolerate (typically slightly below the pre-Forward Error Correction value), that depend both on signal and noise power in the system.
In order to verify whether the X learning loops during phase #2, a channel estimate M that describes the channel reliably enough is provided, such that successful operation in decision directed mode is possible (BER<10−2) and the histograms in the corresponding histogram set H must possess certain statistical properties.
The only assumption that forms the basis of derivation of these properties is that the transmitted symbols are equiviprobable, i.e.:
That this assumption is also needed for using MLSE instead of Maximum A Posteriori Probability (MAP—a mode of the posterior distribution.) algorithm, and generally hold in practical systems which employ source coding and scrambling. In turn, [Eq.26] implies that the probability to transmit any combination of Nisi+1 consecutive symbols is
Therefore, if the decoder works correctly and the channel estimate M is reliable, there are Nbr=VN
P(unεΓi)=p,i=0, . . . ,Nbr−1 [Eq.28]
Thus the total number of events in each branch is given by
is a Gaussian random variable with an expectation value Np and variance Np(1−p):
where N is the total number of observations, used to build the whole histogram set H. Hence, a widely used Z-test (a statistical test for which the distribution of the test statistic under the null hypothesis can be approximated by a normal distribution) is proposed here as a convergence criterion for each branch HiεH, 0≦i≦Nbr−1. Based on [Eq.30] the null hypothesis is
m
0
(i)
=Np,i=0, . . . ,Nbr−1 [Eq.31]
and the Z-statistics is given by:
The two-tailed P-value (the probability of obtaining a test statistic result at least as extreme as the one that was actually observed), or the probability that successfully converged metrics would be classified as non-converged is given by:
Thus based on ε, the practical convergence test translates into:
thr
low
≦m
0
(i)
≦thr
high
,i=0, . . . ,Nbr−1 [Eq.34]
i.e., to check whether the obtained event count in each branch lies between the two threshold values, defined by [Eq.34], where:
Therefore, meeting the conditions in [Eq.35] indicates that the detected symbols obey the equiviprobability assumption of [Eq.26].
Based on the argumentation in section 4.3, it is possible to use the sampled standard deviation of the central moments after d-th iteration, designated as std (m0)[d], in order to monitor the convergence tendency of the histogram set H during phase #2:
The idea behind [Eq.36] is that if during phase #2 H has the tendency to converge, after X iterations, each branch will have a similar number of events, and std(m0)[X−1] will go to zero.
In addition, an additional figure of merit is proposed, based on training a sequence, for illustrational purposes only. In this case the histogram set, Htraining is known, and one can measure the closeness of the obtained set Hblind by means of sample Kullback-Leibler (KL—a non-symmetric measure of the difference between two probability distributions P and Q) distance:
After treating all the KL-distances in Hblind as a vector in a linear space, a Euclidian norm of KL distances DKL(i) can be used to monitor the convergence process during phase #2:
Furthermore, the Bit Error Rate (BER), obtained by direct error counting will be used to illustrate that the proposed figure of merit behaves correctly, i.e. convergence in terms of std (m0) results also in BER convergence.
According to [Eq.4], the incoming sample is a nonlinear combination of a current symbol an and Nisi(channel) previous symbols. The MLSE equalizer operates perfectly, if the memory length of the decoder Nisi is greater than the channel memory, i.e. Nisi≧Nisi(channel). However, in practical scenarios, the opposite statement holds, i.e. Nisi<Nisi(channel). In this case, the MLSE equalizer performs sub-optimally, since it takes care only for the first Nisi terms, leaving some portion of residual ISI uncompensated. This residual ISI is treated by the decoder as noise, and is reflected into the variances of the branch histograms:
σl2=σnoise2(l)+σADC2+σresidual ISI2,0≦l≦Nbr [Eq.39]
where σnoise2(l) is the receiver random noise (both thermal and optical induced noises), and σADC2 is ADC related noise that includes quantization, jitter, etc. Usually, if the decoder is designed correctly, the amount of the residual ISI is small and the effect on the performance is negligible. i.e., σresidual ISI2σnoise2(l)+σADC2, 0≦l≦Nbr. But, when the amount of the impairments in the channel is high the residual ISI may dominate. If a simple FIR channel with Nisi(channel) coefficients is used, the noiseless received sample rn is given by:
The ISI in the system can be divided into two groups: the ISI handled by the MLSE with memory of Nisi symbols, and the residual ISI. The handled ISI should be selected according to a peak-distortion criterion:
Thus, there is a subset of L=Nisi(channel)−Nisi taps that is not compensated, and generates the residual ISI noise with variance σresidual ISI2. For the system with V equiprobable symbols (symbols with equal probabilities)
where σa2 the variance of the transmitted constellation, is given by:
In the case of an OOK system, the received signal is described in [Eq.18], and the peak-distortion criteria can be extended:
where {circumflex over (b)}n is the given by [Eq.19] and {tilde over (b)}n is the data-dependent FIR that can be approximated by [Eq.20]. Consequently, σresidual ISI2 can be approximated by [Eq.42] where bn is replaced by E{{circumflex over (b)}n+{tilde over (b)}n}.
The optimal n0 is called Match Point (MP), and in practice the ISI optimization is done by collecting several channel estimates (histograms), while each time, a different MP-shift n0 is set between the stream of ADC samples and the stream of the corresponding decision bits. Thus, each histogram represents a selection of a different subset of the channel ISI to be compensated by the MLSE. The contribution of σnoise2(l) and σADC2 in [Eq.39] is the same, averagely. Therefore, the variances-average of the histograms' changes between these n0 shifts, and is determined by the σresidual ISI2. Hence, the selected n0 (the correct MP-shift) is the one that yields the minimal variances-average of the histograms:
The proposed IMDP method was implemented within the Q ASIC and was verified experimentally using the following optical setup, shown in
The ASIC has an ADC with nominal resolution of NADC=5 bits and an Effective Number Of Bits (ENOB) of 3.8 bits. An analog Phase-Lock Loop (PLL) was used to recover the symbols clock, while the data was sampled at the symbol rate of 28 Gsymbol/sec. The MLSE equalizer memory depth is Nisi=4 symbols, the principle architecture of which is shown in
The operation of the proposed blind channel acquisition algorithm (IMDP), the outcome of the intermediate procedure phases (
The phases of the IMDP for a Back to back channel are illustrated in
a and 5b illustrate normalized histograms (a) H#1(0) and H#1(1) during Phase #1 of IMDP, for a channel memory depth of one symbol. Two different histogram sets H#1(0) and H#1(1), that can represent PDFs describing a channel with memory depth of one symbol, are shown. The histograms on the left hand side correspond to the increasing exponent j=0 in [Eq. 22], whereas the histograms on the right hand side represent the decreasing exponent channel j=4 in [Eq.22] for Nisi=4. In fact, the branch (or histogram) labeled ‘01’ in
On the other hand, the edge branches ‘00’ and ‘11’ have the same mean and variance values due to the symmetry presented in [Eq.22]. Both histogram sets H#1(0) and H#1(1) contain 32 branches each, which are divided into 4 groups, whereas each group is described by its mean and variance values (which coincide with the 4 histograms shown in
a-6c illustrate normalized histogram sets obtained after phase #2 of IMDP.
By comparing
a-7d show IMDP convergence monitoring during phase #2.
b shows a standard deviation of central moments [Eq.36], presenting the convergence rate in terms of standard deviation of the central moments, as defined in [Eq.35]. Similarly, convergence of H#1(1) (squares) and divergence of H#1(0) (circles) are observed. The (final) value of std (m0) for H#2(1) also goes to zero, indicating that all the histograms in H#2(1) have similar number of observations.
c shows convergence in terms of BER, presenting the bit error rate convergence during phase #2, compared to the BER obtained by training (solid line). The convergence in terms of BER is slower during the acquisition phase #2, as compared to and to std (m0) convergences. However, it is shown in the following figures that the final BER convergence, at the end of the IMDP, is similar to the training case.
d shows Phase #3 of IMDP during which, the convergence criterion of [Eq.34] is checked, presenting the validation of convergence criterion of [Eq.34], which forms phase #3 of the IMDP, as well as the values of the moments m0(1) for various histograms in H#2(0) (circles) and H#2(1) (squares). It is clearly seen that m0(1) of H#2(1) lie within the upper and lower thresholds defined by of [Eq.35], as opposed to the H#2(0) counterpart. In hardware implementation, [Eq.34] is applied only to the m0(1) of H#2(1), to save complexity and the duration of the acquisition process. The zero-th moments of H#2(0) are presented only for clarity and comparison insight.
a-8e show the results of ISI optimization during phase #4 of IMDP, for the histograms set H#2(1).
Histogram sets are obtained for different shifts:
MP
shift=−2,BER=4.26·10−1,σi2=89.21, (a)
MP
shift=−1,BER=3.15·10−3,σi2=12.21, (b)
MP
shift=0,BER=1.77·10−3,σi2=10.4, (c)
MP
shift=1,BER=1.89·10−3,σi2=10.41, (d)
MP
shift=2,BER=1.82·10−3,σi2=10.53 (e)
In each sub-plot, the titles contain the MP-shift, the BER and the average histograms variance calculated according to [Eq.45]. In this “simple” back-to-back case, the major portion of ISI comes from the frequency response of the analog front-end of the ASIC. It can be seen in
The phases of the IMDP for a 40 km optical link are illustrated in
a-10c present Phase #2 of the IMDP, which also consists of 8 iterations, resulting in the histogram sets H#2(0), H#2(1) and H#2(2), respectively. The histogram set obtained by using the training sequence Htraining is shown in
a-11d show the convergence process during phase #2 of the IMDP, for a 40 km long optical link. The convergence in terms of DED (defined by [Eq.45]) for the three histogram sets H#1(0) (circles) H#1(1) (squares) and H#1(2) (triangles) is shown on
All the three sets appear to stabilize around a constant DED value, but as already known from
The fact that H#1(0) and H#1(1) stabilize around a constant DED value, does not imply that they converged to a correct channel estimation. Rather the opposite is true, and a closer look at
Despite the fact that H#2(2) converges, the final DED value (after 8 iterations) for H#2(2) is higher than for H#2(0) and H#2(1) which eventually diverge. The reason for this is that H#1(2) converged to a suboptimal solution. H#2(2) is indeed quantitatively ‘far’ from Htraining, since the DED between them is not close to zero. This suboptimal solution will be improved during phase #4 of the IMDP. Thus, the KL-distance does not immediately show success, since several suboptimal solutions are possible, and only the optimal reference PDF (or its histogram representative) is relevant for comparison.
On the other hand, by observing the intermediate values of the std(m0) criterion (shown in
The practical way to conclude whether a given histogram set is converged to a valid (possibly suboptimal) solution, without observing the resulting histogram sets H#2(0), H#2(1) and H#2(2), is to assure that all the zero-th moments of the resulting histograms within the set lie within a predefined range, given by [Eq.34].
a-12e show the histogram sets representing the outcome of the ISI optimization Phase #4 of IMDP for a 40 km long optical channel. As indicated by both BER and the average standard deviations (ref formula for mean (vars)), the MP-shift of one symbol (shown in) obtains the best performance.
Histogram sets, obtained for different shifts are the following:
FIG. 12a: MPshift=2,BER=3.01·10−1,σi2=54.36,
FIG. 12b: MPshift=−1,BER=2.1·10−1,σi2=30.40,
FIG. 12c: MPshift=0,BER=2.16·10−3,σi2=7.07,
FIG. 12d: MPshift=1,BER=1.15·10−3,σi2=6.32,
FIG. 12e: MPshift=2,BER=2.94·10−3,σi2=8.11
In addition, in
The proposed IPMD requires neither additional hardware nor additional complicated calculations. The full blind equalization scheme was implemented in an Application Specific Integrated Circuit (ASIC) and was validated experimentally at the full data rate of 4×28 Gbit/sec. The overall blind channel acquisition time is measured to be a few milliseconds, which makes it suitable for use in reconfigurable optical network environment that requires 50 msec recovery time.
The above examples and description have of course been provided only for the purpose of illustration, and are not intended to limit the invention in any way. As will be appreciated by the skilled person, the invention can be carried out in a great variety of ways, employing more than one technique from those described above, all without exceeding the scope of the invention.
This application is a non-provisional of and claims priority to U.S. Provisional Application No. 61/863,474, filed on Aug. 8, 2013, the entire contents of which are hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61863474 | Aug 2013 | US |