Maximum Likelihood Sequence Estimation in Optical Fibre Communication Systems

Abstract
A method of and a receiver(20) for detection of a received signal in an optical fibre communication system using Viterbi algorithm methodology in which branch metrics are obtained using approximated expressions to calculate the branch metrics. Use of the expressions results in practically the same performance as a receiver based on exact metrics.
Description
FIELD OF INVENTION

This invention relates to a method and apparatus for maximum likelihood sequence estimation (MLSE) in optical fibre communication systems.


BACKGROUND

In recent years, receivers based on sophisticated electronic processing techniques have received much attention in the design of high-speed optical fibre communication systems (40 Gb/s and more). In a linear regime, group velocity dispersion (GVD) and polarization mode dispersion (PMD) are the most severe sources of signal distortion and system penalty. Although the effects of GVD can be compensated by means of dispersion compensating fibres, it is known that tolerance to GVD decreases as the square of the bit rate. Hence, a compensation that would be adequate for 10 Gb/s systems might not be sufficient when upgrading to a higher bit rate because of a non-negligible residual dispersion.


Moreover, the increased sensitivity to engineering tolerances of higher transmission capacity networks can lead to unpredictable and often variable effects on the signal due to residual GVD which, in addition, can combine with the PMD, an intrinsically stochastic phenomenon. In a first order approximation the effect of PMD is considered as a differential group delay (DGD) Δτ between the two principal states of polarization (PSP) of the fibre, resulting in Inter Symbol Interference (ISI).


Usually the PMD is described by a vector {right arrow over (Ω)}, which, in a first-order approximation is assumed to be independent of frequency. Higher order effects arise when the PMD vector {right arrow over (Ω)} is frequency dependent. In a common second order approximation, {right arrow over (Ω)} is assumed to be a linear function of the frequency, {right arrow over (Ω)}={right arrow over (Ω)}0+{right arrow over (Ω)}1(ω−ω0), where {right arrow over (Ω)}1 is the derivative of {right arrow over (Ω)} evaluated at the carrier frequency ω0. Second order effects are mainly signal distortion and broadening. It has been demonstrated that with different optical compensation techniques such as, for example, a cascade of polarization controllers and polarization maintaining fibres, planar wave guide circuits or other optical devices, it is possible to recover heavy penalties caused by first or second order effects.


The techniques mentioned above, whilst effective, are often impractical because of their cost due to the use of advanced optical technologies. As a consequence much effort has been devoted to apply classical or innovative electrical equalization methods to the case of optical fibre communication systems.


One of the first electrical equalization techniques proposed for optical systems is a Feed Forward Equalizer (FFE) whose purpose is to combat the ISI induced by chromatic dispersion. Non-linear cancellation has also been postulated, since the photo detection process implies a non-linear transformation of the signal. More recently, comparisons between these compensation methods and optical compensation techniques have been presented, showing the benefits and disadvantages of both solutions.


In addition to FFE equalization and decision feedback equalization (DFE) interest is growing for Maximum Likelihood Sequence Estimation (MLSE), realized through the Viterbi algorithm (VA) by virtue of its potentially optimal performance.


In the early Nineties MLSE receivers based on the Viterbi algorithm were proposed for optical fibre systems which did not include the presence of optical amplification. Consequently, the amplified spontaneous emission (ASE) noise introduced by optical amplifiers was not taken into consideration and the statistics of the received signal, required to calculate the branch metrics of the Viterbi algorithm (VA), were assumed to be Gaussian since they are caused by the thermal and shot noise generated after the photo detection process.


In current optical systems optical amplifiers are widely used, hence the signal in the fibre is affected by noise that in the linear regime, can be modelled as additive white Gaussian noise (AWGN). However since the photo detection process performs the action of a square law detector the post detection noise statistic changes and cannot be considered Gaussian any longer. Hence, in the case of MLSE, assuming Gaussian statistics for noise after photo detection is neither realistic nor correct and leads to inaccurate results.


Accordingly, adaptive electric compensation techniques of the PMD based on the MLSE criterion have been proposed in which the statistics of the received signal are measured and updated in real time during transmission using the detected symbols and assuming no decision errors. This method, which assumes specific constraints such as, for example, sample quantization, memory length, filter type and parameters, or even the absence of filtering, has been compared with classical equalization schemes, shows an improved performance.


We have realised that it would be desirable to provide an expression of the VA branch metrics which implements the MLSE criterion for realistic values of the system parameters, whether by sampling the signal with a period equal to the symbol time or at higher rates, given that the oversampling ensures obtaining sufficient statistics of the signal received.


In particular, through numerical evaluation in accordance with a preferred embodiment of the invention, a practically exact expression of the signal statistics is derived in the case of a receiver working at a rate equal to the symbol time.


In the case of oversampling, however, at present there is no expression (exact or approximate) for the statistics of the samples. It is however possible to recur to an adaptive type receiver based on histograms. In accordance with a preferred embodiment of the invention, a method is specified based on an approximate expression in closed form of the VA metrics, which entails a negligible loss of performance compared with an optimal expression.


SUMMARY OF THE INVENTION

According to a first aspect of the present invention there is provided a method of detection of a received signal in an optical fibre communication system using a Viterbi algorithm methodology in which branch metrics are obtained using substantially either of the expressions:







λ


(


a
k

,

σ
k


)






-


v
-
1

2




ln


[


s
R



(


a
k

,

σ
k


)


]



-



s
R



(


a
k

,

σ
k


)



N
0


+

ln


[


I

v
-
1


(


2




z
k




s
R



(


a
k

,

σ
k


)






N
0


)

]







or,







λ


(


a
k

,

σ
k


)





2




z
k




s
R



(


a
k

,

σ
k


)





-


s
R



(


a
k

,

σ
k


)


-



N
0

2



(

v
-

1
2


)



ln


[


s
R



(


a
k

,

σ
k


)


]








in which Iv−1 is the modified Bessel function of the first type and order v−1, σk is the state of the receiver at the kth bit interval, sR(ak, σk) is the noise-free received sample of the kth bit interval, v is the number of degrees of freedom, ak is a possible bit transmitted at the kth bit interval, zk is the sample received at kh bit interval, No is the power spectral density.


According to a second aspect of the invention there is provided a method of detection of a received signal in an optical fibre communication system using a Viterbi algorithm methodology in which branch metrics are obtained using substantially the expression







λ


(


a
k

,

σ
k


)







i
=
0

1



ln






p


(



z

k
,
i


|

a
k


,

σ
k


)








in which p(zk,i|akk) is the probability density function (PDF) of the ith received sample zk,i for the kth bit interval, σk is a possible bit transmitted at the kth bit interval, and σk is the state of the receiver at the kth bit interval, and the method further comprising taking multiple samples per bit interval time of the received signal.


According to a further aspect of the invention there is also provided a receiver for an optical transmission system using as sequence detection one of the above methods.


According to yet a further aspect of the invention there is provided a machine-readable data carrier which comprises instructions to implement the method of any of the first and second aspects of the invention when the instructions are loaded onto a data processor.





BRIEF DESCRIPTION OF THE DRAWINGS

Various embodiments of the invention will now be described, by way of example only, in which:



FIG. 1 shows a block diagram of a model of the transmission system considered,



FIG. 2 shows a low-pass equivalent of the system in FIG. 1, and



FIG. 3 shows a block diagram of a model of the transmission system considered in the case of a receiver based on over-sampling.





DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

With reference to FIG. 1 there is shown diagrammatically and designated by reference number 10 an optical transmission system which comprises a receiver assembly 20. A signal generated from a standard laser source is modulated by a modulator 11 using on-off keying (OOK) modulation, is transmitted over a single-mode fibre (SMF) 12. The signal is optically amplified at the receiver 20 by an optical amplifier 13 and filtered in the optical domain by a band-pass filter 14. The optical amplifier 13 has a high gain G so that the amplified spontaneous emission (ASE) noise is dominant compared to the thermal and shot noise of the receiver 20.


The optical signal is then converted into the electric domain by a photo detector 15 and the signal thus obtained is filtered electrically by a low-pass filter 16, sampled by a sampler 17, and lastly processed in Viterbi processor 18 by application of the Viterbi algorithm to realize the MLSE strategy. The Viterbi processor 18 comprises data processing means and memory means.



FIG. 2 shows the low-pass equivalent diagram of the system in FIG. 1. The low-pass equivalent transfer functions (matrix) of the fibre and the optical filters and post-detection are indicated respectively as HF(ω), HO(ω) and HR(ω).


The signal w(t)=[w1(t),w2(t)]T represents the additive white Gaussian noise (AWGN) with independent complex noise components w1(t), w2(t) responsible for the ASE on the two orthogonal states of polarization (SOP) each with two-sided power spectral density (PSD) equal to N0.


At the output of the optical filter the components of the two-dimensional complex vectors s(t)=(s1(t),s2(t)) and n(t)=(n1(t),n2(t)) represent respectively the signal components and the useful noise in each SOP. The noise components are Gaussian but not white since they are obtained by filtering of the AWGN w(t).


At the output of the photodiode the signal detected can be described as the sum of two contributions, one for each SOP, as follows.






y(t)=∥s(t)+n(t)∥2=∥s1(t)+n1(t)|2+|s2(t)+n2(t)|2  [1]


After photo detection the noise becomes dependent on the signal and its statistics change. Subsequently, since the receiver proposed is independent of a particular choice of filter form and bandwidth, the parameters of the optical filters and post-detection can be chosen arbitrarily.


With the constraints of the described receiver structure 20, and assuming a single sample per bit interval is extracted at the receiver, the optimal MLSE detection strategy can be expressed as follows:










a
^

=

arg







max
a



p


(


z
_

|

a
_


)








[
2
]







in which a={ak} is a possible transmitted bit sequence and with z={zk} the related received sequence.


The samples received are zk=z(t0+kT), to being an adapted time delay (offset) and T the bit interval.


It is assumed that, conditionally upon the transmitted sequence, the samples {zk} can be considered independently. This assumption was verified numerically for optical and electrical filters of commonly used band, form, and amplitude. Hence, the combined probability density function (PDF) p(z|a) of the received samples, conditioned to the transmitted symbols, can be written as follows










p


(


z
_

|

a
_


)


=



k







p


(


z
k

|

a
_


)







[
3
]







and, assuming that the system is causal and with finite memory L, the following can be written.






p(zk|a)=p(zk|ak−1, . . . , ak−L)  [4]


Therefore, the optimal MLSE strategy can be realized by means of the Viterbi algorithm using the following branch metrics.





λk(akk)=lnp(zk⊕akk)  [5]


where σk=(ak−1,ak−2, . . . , ak−L)identifies the state of the receiver on the trellis diagram (trellis state). Consequently, the number of states is S=2L and therefore the complexity of the receiver increases exponentially with the channel memory L.


A closed form expression for the PDF in [4] is not known for an arbitrary signal format and filtering. In fact, despite the samples at the output of the photo detector, are characterized by a non-central chi-square distribution being the sum of squared Gaussian random variables, the presence of the electrical filter modifies such statistics.


We have realised that an appropriate characterization of this PDF would be beneficial in order to avoid the performance of the Viterbi processor being degraded. This PDF can be evaluated almost exactly by means of numerical methods and stored in a look-up table which can be addressed in order to calculate the branch metrics, from the received signal samples and from the transitions between the trellis states considered. This enables a comparison to be made between the exact values and the values resulting from the approximate expressions set out below.


The most efficient numerical method to obtain the PDF in [4] is based on the knowledge of the moment generating function Ψzk|a(s) of the samples, whose expression in closed form can be obtained by expanding the noise on a proper Karhunen-Loève basis. Then, using the saddle-point approximation, the PDF can be evaluated as follows.










p


(


z
k

|

a
_


)





exp


[


Φ


z
k

|

a
_





(

s
0

)


]




2



πΦ


z
k

|

a
_






(

s
0

)









[
6
]







where so is the saddle-point of Ψzk|a(s)exp(-szk) on the real axis,





Φzk|a(s)=log [Ψzk|a(s)e−szk]  [7]


and Φ″zk|ais the second derivative of Φzk|awhich is always positive at the saddle-point.


This approach, although giving an extremely accurate closed form approximation for the PDF, requires a search for the saddle-point. However this is a simple and rapid process, easily imaginable to one skilled in the art.


It was verified that it is possible to approximate the conditional PDF of the received sample (and hence the branch metrics expression) as follows:










p


(


z
k

|


a
k



σ
k



)





1

N
0





(


z
k



s
R



(


a
k

,

σ
k


)



)



(

v
-
1

)

/
2


×

exp


(

-



z
k

+


s
R



(


a
k

,

σ
k


)




N
0



)





I

v
-
1


(


2




z
k




s
R



(


a
k

,

σ
k


)






N
0


)






[
8
]









    • where the number of degrees of freedom v is given by the ratio of the optical and electrical filter noise equivalent bandwidths, SR(akσk) is the noise-free received sample, which is estimated using known methodology, which depends on the present and past transmitted symbols, according to the length of the channel memory, and Iv−1(x) is a modified Bessel function of the first kind and order v−1. All terms in the branch metrics independent from ak and σk can be ignored and, in addition, the branch metrics can be arbitrarily multiplied by a positive constant. Hence, ignoring the irrelevant terms in the maximization, and inserting [8] into [5], a simplified expression of the branch metrics is given by:













λ


(


a
k

,

σ
k


)






-


v
-
1

2




ln


[


s
R



(


a
k

,

σ
k


)


]



-



s
R



(


a
k

,

σ
k


)



N
0


+

ln





I

v
-
1


(


2




z
k




s
R



(


a
k

,

σ
k


)






N
0


)









[
9
]







Advantageously when the processor 18 is configured to implement [9] to calculate the branch metrics this results in practically the same performance as a receiver based on exact metrics.


It is to be noted that [8] is used both when ak=1 or when ak=0, since, given the finite extinction ratio, filtering and signal sR(ak, σk) distortion, is not exactly equal to zero even when ak=0. Clearly when ak=0, [8] is accurate only if the electrical filter impulse response is always positive or if any negative values have a negligible impact such as for example in the case of Gaussian or Bessel type filters.


A further simplification of the branch metrics of [9] can be obtained using the following crude approximation:








I

v
-
1




(
x
)






x



2

π





x







by means of which [8] is further approximated as:










p


(


z
k

|


a
k



σ
k



)






exp
(


-



(



z
k


-


s
R



)

2


N
0



+

(

v
-

3
2


)


)



(



z
k


-
1

)




4

π






N
0



s
R

v
-

1
/
2










[
10
]







The resulting branch metrics are thus:










λ


(


a
k

,

σ
k


)





2




z
k




s
R



(


a
k

,

σ
k


)





-


s
R



(


a
k

,

σ
k


)


-



N
0

2



(

v
-

1
2


)



ln


[


s
R



(


a
k

,

σ
k


)


]








[
11
]







In use the processor of the receiver assembly 20 is configured to determine estimates of the values of SR(ak, σk) for each received sample in respect of each pair (ak, σk). The processor is then operative to use either expression [9] or [11] to calculate each corresponding branch metric and then use the branch metrics in the Viterbi algorithm to reach a decision.


As already mentioned, the number of trellis states and hence the complexity of the receiver depends exponentially on the channel memory L.


In addition, the application of reduced-state sequence detection (RSSD) techniques allows a substantial reduction of the number of trellis states.


In particular, a reduced state can be defined σ′k=(ak−1, ak−2, . . . , ak−L′), with L′<L. The resulting number of states is therefore reduced to 2L′<2L. For the purpose of calculating the branch metrics [5] in the case of reduced trellis, the necessary symbols, not included in the state definition, can be found in the survivor history of the path according to known techniques.


It is noted that in the limiting case of L′=0 the trellis diagram degenerates and a detection is obtained using symbol-by-symbol with decision feedback. The resulting receiver can be considered as a non-linear equalizer with decision feedback.


Since the PMD is a time-varying phenomenon, the values of sR(ak, σk) should be updated adaptively. As a change in the PMD occurs, the receiver must merely identify in an adaptive manner the term sR(ak, σk), employed in the closed form expressions of the above mentioned branch metrics [9] or [11]. This can be easily done using a gradient adaptation algorithm using as cost function the expression of the branch metrics or the mean square error.


Although the MLSE receiver in accordance with this invention is the best post-detection technique in the case of synchronous sampling, one sample per bit time may not represent a sufficient statistic, because of the non-linear nature introduced by the photodiode 15. A sufficient statistic can be obtained by over-sampling as explained below with reference to FIG. 3.


Assuming that n samples are used per bit time (signalling interval) the following notation will be used to indicate the received samples. The n received samples in the kth bit interval will be indicated as zk,i=z(t0+kT+iT/n), i=0, 1, . . . , n−1. As previously, the received sequence is indicated with z={zk,i}. In addition, the n signal samples related to the kth bit are indicated with zk, that is to say zk={zk,i}i=0n−1.


When the samples are spaced at intervals less than the bit time (always conditioned to the sequence of symbols transmitted, namely the transmitted bit sequence) they cannot be considered as independent. Consequently, their joint PDF cannot be expressed as the product of the marginal PDFs. For this reason, the chain rule is used to factorize the joint PDF p(z|a) necessary to implement the MLSE strategy. Assuming as above that the received samples which differ by at least one bit interval are independent, we have:













p


(


z
_

|

a
_


)


=





k







p


(




z
_

k

|


z
_


k
-
1



,

a
_


)









=





k










i
=
0


n
-
1








p


(



z

k
,
i


|

z

k
,

i
-
1




,





,

z

k
,
0


,


z
_


k
-
1


,

a
_


)











[
12
]







where in the last expression, it is implicitly assumed that in case i=0, then the terms zk,i-1, . . . , zk,0 all disappear. Hence, with an appropriate definition of the receiver state σk, the branch metrics of the Viterbi algorithm implementing the MLSD strategy can be calculated as follows:













λ


(


a
k

,

σ
k


)


=



ln





p






(



z
k

|

z

k
-
1



,

a
k

,

σ
k


)








=






i
=
0


n
-
1








ln






p


(



z

k
,
i


|

z

k
,

i
-
1




,





,

z

k
,
0


,

z

k
-
1


,

a
k

,

σ
k


)











[
13
]







Thus, in the case of over sampling, the system memory will be M≧L. In this case the state is defined as σk=(ak−1, ak−2, . . . , ak−M)). The structure of this receiver in the case of over sampling (n>1) is shown in FIG. 3.


It has been determined that a value of n=2 is sufficient in practice to obtain optimal performance. In this case, although there is a correlation between the received samples, considering them as independent in the expression of the branch metrics does not deteriorate the performance obtained using the optimal correlated metrics. Therefore, the simplified branch metrics below can be used without deterioration of the receiver performance.










λ


(


a
k

,

σ
k


)







i
=
0

1



ln






p


(



z

k
,
i


|

a
k


,

σ
k


)








[
14
]







where p(zk,i|ak, σk) is given by [6]. The PDFs which appear in [14] can be further simplified using the approximate expression [8] or [10] as easily imaginable for one skilled in the art.


The above described embodiments advantageously provide an extremely robust and efficient method of implementing the Viterbi algorithm. Advantageously there is no requirement to store a look-up table with various PDF values which need to be addressed. Use of such look-up tables can require significant memory and processing capabilities. Rather, by use of the above expressions the branch metrics can be determined quickly and accurately without requiring overly substantial processing and storage means.


Naturally the above description of embodiments applying the innovative principles of this invention are given by way of non-limiting example of said principles within the scope of the exclusive right claimed here.

Claims
  • 1-13. (canceled)
  • 14. A method of detecting a received signal in an optical fiber communication system using Viterbi algorithm methodology, the method comprising: obtaining branch metrics using one of the following expressions:
  • 15. The method of claim 14 further comprising determining estimates of SR(ak, σk).
  • 16. The method of claim 15 further comprising determining estimates of SR(ak, σk) for each pair (ak, σk.
  • 17. The method of claim 14 further comprising obtaining samples at a rate of substantially one per bit interval.
  • 18. The method of claim 14 further comprising adaptively determining values of SR(ak, σk) using a gradient algorithm as a cost function of the branch metrics.
  • 19. The method of claim 14 further comprising adaptively determining values of SR(ak, σk) using mean square error methodology.
  • 20. A method of detecting a signal received in an optical fiber communication system using the Viterbi algorithm, the method comprising: obtaining branch metrics using the expression:
  • 21. The method of claim 20 further comprising determining the probability density functions p(Zk,i|ak, σk) using one of the following expressions:
  • 22. The method of claim 20 further comprising obtaining substantially two samples of the received per bit interval time.
  • 23. The method of claim 21 further comprising adaptively determining values of SR(ak, σk) using a gradient algorithm as a cost function of the branch metrics.
  • 24. The method of claim 21 further comprising adaptively determining values of SR,i(ak, σk) using mean square error methodology.
  • 25. A receiver for an optical transmission system, the receiver being configured to detect a received signal in an optical fiber communication system using Viterbi algorithm methodology by: obtaining branch metrics using one of the following expressions:
  • 26. A receiver for an optical transmission system, the receiver being configured to detect a received signal in an optical fiber communication system using Viterbi algorithm methodology by: obtaining branch metrics using the expression:
  • 27. A machine-readable data carrier comprising instructions configured to cause a data processor to: detect a received signal in an optical fiber communication system using Viterbi algorithm methodology by: obtaining branch metrics using one of the following expressions:
  • 28. A machine-readable data carrier comprising instructions configured to cause a data processor to: detect a received signal in an optical fiber communication system using Viterbi algorithm methodology by: obtaining branch metrics using the expression:
Priority Claims (1)
Number Date Country Kind
0606745.8 Apr 2006 GB national
PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/EP2006/069168 11/30/2006 WO 00 5/26/2009