This application is cross-related to application Ser. No. 10/194,660 entitled “E2PR4 VITERBI DETECTOR AND METHOD FOR ADDING A BRANCH METRIC TO THE PATH METRIC OF THE SURVIVING PATH AFTER SELECTING THE SURVIVING PATH”, which was filed on the same day as the present application and which is incorporated by reference.
Viterbi detectors are used in many of today's data receivers to recover digital data from samples of a data signal having a relatively low signal-to-noise ratio (SNR). For example, Viterbi detectors are used in disk-drive read channels to recover the sequence of data values read from a magnetic disk, and are used in cell phones to recover the sequence of data values from a digitized voice signal. Basically, a Viterbi detector considers all of the possible data-value sequences that the data signal can represent and determines from the samples of the data signal which of the possible sequences is most likely to be the correct, i.e., surviving, sequence. Because the complexity of the Viterbi detector is independent of the length of the recovered sequence, it has proven to be one of the most effective circuits for recovering digital-data sequences from signals having relatively low SNRs.
Unfortunately, as discussed below, the add-compare-select (ACS) algorithm that many Viterbi detectors implement often requires fast circuitry having a relatively large number of transistors so that such a Viterbi detector does not unduly limit the rate at which a receiver can process received data. Such a Viterbi detector executes the ACS algorithm for each data-signal sample or group of data-signal samples, and must finish executing the algorithm for one sample or group of samples before moving on to the next sample or sample group. Consequently, the rate at which the receiver samples the data signal and recovers data therefrom is limited to the speed at which the Viterbi detector can execute the ACS algorithm. Unfortunately, the ACS algorithm includes a relatively large number of steps that require a relatively long time for the Viterbi detector to execute. To speed up execution of the ACS algorithm, one can design the Viterbi detector to include fast circuitry that performs many of these steps in parallel. But such circuitry typically includes a relatively large number of transistors that increase the layout area, and thus the cost, of the Viterbi detector.
And although engineers have discovered a compare-select-add (CSA) algorithm that allows a Viterbi detector to have fewer transistors than or to be faster than a Viterbi detector that executes the ACS algorithm, one cannot implement the CSA algorithm in an E2PR4 Viterbi detector.
Referring to
1+2D−2D3−D4 (1)
where D represents a delay of one sample period, D3 represents a delay of three sample periods, and D4 represents a delay of four sample periods. Therefore, the sample Yk of a data signal at a sample time k has an ideal (no noise) value that is given by the following equation:
Yk=Xk+2Xk−1−2Xk−3−Xk−4
where Xk is the binary value of the data signal at sample time k, Xk−1 is the binary value at sample time k−1, etc. Because each sample Y is calculated from four binary values X, the sequence of binary values X has one of 42=16 potential states S0-S15 for each sample time k. Two respective branches 20 (e.g., 20a, 20b, 20c, and 20d) originating from two states S prior to sample time k each terminate at respective states S after the sample time k. For example, the branches 20a and 20b originate at S0 and S8 prior to time k, respectively, and terminate at S0 after time k. Table I includes the ideal sample values Y and the L2 branch metrics as a function of Y for each of the branches 20.
Still referring to
Unfortunately, there is no such CSA unit available to replace the ACSU 42 of the full-rate E2PR4 Viterbi detector 40 (
Referring to
Referring to
Still referring to
Referring to
Referring to
PMXo=bk+ck+1+ak+2+bk+3+dk+4 4)
PMYo=ck+bk+1+dk+2+ck+3+bk+4 5)
Similarly, referring to
PMXn=ak−ak+bk+ck+1+ak+2+ak+3−ak+3+bk+3+ck+4+ak+4−bk+4−ck+4+dk+4 6)
PMYn=ck+ak+1−ak+1+bk+1+ck+2+ak+2−bk+2−ck+2+dk+2−ak+2+bk+2+ck+3+ak+4 7)
Canceling common terms, one obtains:
PMXn=bk+ck+1+ak+2+bk+3+ak+4−bk+4+dk+4 8)
PMYn=ck+bk+1+dk+2+ck+3+ak+4 9)
It is well known that if A>B, then A+C>B+C, and if A<B, then A+C<B+C. Therefore, if both PMXn and PMYn respectively differ from PMXo and PMYo by the same value C, then the relationship between PMXn and PMYn is the same as the relationship between PMXo and PMYo. That is, if PMXo>PMYo, then (PMXo+C=PMXn)>(PMYo+C=PMYn). Likewise, if PMXo<PMYo, then (PMXo+C=PMXn)<(PMYo+C=PMYn). Here, referring to equations (4), (5), (8), and (9), C=ak+4−bk+4. Consequently, the branch-shifted trellis 90 preserves the relationships between the path metrics with respect to the trellis 80, and is thus mathematically equivalent to the trellis 80.
Unfortunately, the above-described branch-shifting technique does not allow one to replace the ACSU 42 (
One embodiment of the invention is an E2PR4 Viterbi detector that receives a signal that represents a sequence of values, the sequence having a potential state. The detector includes a recovery circuit that is operable to recover the sequence from the signal by identifying the surviving path to the potential state and adding a modified branch metric to the path metric of the surviving path while identifying the surviving path.
By adding a modified branch metric to the path metric of the surviving path while the surviving path is being identified, such an E2PR4 Viterbi detector can operate faster than an E2PR4 Viterbi detector that adds a branch metric to a path metric before identifying the surviving path.
The following discussion is presented to enable a person skilled in the art to make and use the invention. Various modifications to the embodiments will be readily apparent to those skilled in the art, and the generic principles herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention as defined by the appended claims. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
Referring to
Still referring to
Furthermore, one can also derive the half-rate L2 branch metrics for each of the branches 112 by summing the full-rate L2 branch metrics (Table I) for the two respective branches 20 (
UPM0=BM0+PM0 10)
UPM4=BM4+PM4 11)
UPM8=BM8+PM8 12)
UPM12=BM12+PM12 13)
The comparators 124a-124f respectively determine the following differences:
UPM12−UPM8 14)
UPM12−UPM4 15)
UPM12−UPM0 16)
UPM8−UPM4 17)
UPM8−UPM0 18)
UPM4−UPM0 19)
From these differences, the logic 126 determines which of the updated path metrics is the smallest, and causes the multiplexer 128 to load this smallest updated path metric into an SMU (
Unfortunately, the circuit 120 includes a relatively large number of transistors so that the ACSU to which it belongs does not limit the data-recovery rate of the Viterbi detector more than is necessary. Specifically, the adders 122a-122d and comparators 124a-124f are designed to be as fast as possible so that the sample rate, and thus the Viterbi detector's recovery rate, can be as fast as possible. Unfortunately, designing the adders 122a-122d and the comparators 124a-124f to be fast typically entails using a relatively large number of logic gates, and thus a large number of transistors. This causes the circuit 120 to occupy a relatively large area of the integrated circuit on which it resides.
Still referring to
Branch 142: 0 20)
Branch 144: 1−2Ys 21)
Branch 146: 5−2Yf−4Ys 22)
Branch 148: 10−2Yf−6Ys 23)
Branch 150: 5+4Yf+2Ys 24)
Branch 152: 4+4Yf 25)
Branch 154: 2+2Yf−2Ys 26)
Branch 156: 5+2Yf−4Ys 27)
In the second step, one adds −2Yf−2Ys to the branch metric (here 0) of the outer branch 158, and thus subtracts this value from the branch metrics of the inner branches 142, 144, 146, and 148 to obtain the following intermediate branch metrics for these inner branches:
Branch 142: 2Yf+2Ys 28)
Branch 144: 1+2Yf 29)
Branch 146: 5−2Ys 30)
Branch 148: 10−4Ys 31)
In the third step, one adds 2Yf+2Ys, 2Yf, −2Ys, and −4Ys, respectively, to the branch metrics of the outer branches 160, 162, 164, and 166 such that the modified branch metrics for the inner branches 142, 144, 146, and 148 are constants. The modified branch metrics for these branches are:
Branch 142: 0 32)
Branch 144: 1 33)
Branch 146: 5 34)
Branch 148: 10 35)
Branch 160: 2Yf+2Ys 36)
Branch 162: 2Yf 37)
Branch 164: −2Ys 38)
Branch 166: −4Ys 39)
In the fourth step, because 2Yf+2Ys, 2Yf, −2Ys, and −4Ys were respectively added to the branch metrics of the external branches 160, 162, 164, and 166, one subtracts these values from the branch metrics of the inner branches 150, 152, 154, and 156 to give:
Branch 150: 5+2Yf 40)
Branch 152: 4+2Yf 41)
Branch 154: 2+2Yf 42)
Branch 156: 5+2Yf 43)
In the fifth and final step, one adds 2Yf to the branch metric of the external branch 168 such that the modified branch metrics for the inner branches 150, 152, 154, and 156 are constants. Therefore, the modified branch metrics for these branches are:
Branch 150: 5 44)
Branch 152: 4 45)
Branch 154: 2 46)
Branch 156: 5 47)
Branch 168: 2Yf 48)
Consequently, the modified branch metrics for all the branches in the trellis portion 140 are:
Branch 142: 0 49)
Branch 144: 1 50)
Branch 146: 5 51)
Branch 148: 10 52)
Branch 150: 5 53)
Branch 152: 4 54)
Branch 154: 2 55)
Branch 156: 5 56)
Branch 158: −2Yf−2Ys 57)
Branch 160: 2Yf+2Ys 58)
Branch 162: 2Yf 59)
Branch 164: −2Ys 60)
Branch 166: −4Ys 61)
Branch 168: 2Yf 62)
As discussed above in conjunction with
Referring to
Referring to
PMXo=10−2YfK−6YsK+8−4YfK+1+4YsK+1+10+6YfK+2+2YsK+2 63)
PMYo=2+2YfK−2YsK+4−4YfK+1+5+4YfK+2+2YsK+2 64)
Similarly, referring to
PMXn=−2YfK−2YsK+10−4YsK−2YfK+1−2YsK+1+8−2YfK+1+6YsK+1+4YfK+2+10 65)
PMYn=−2YsK+2+2YfK−2YfK+1−2YsK+1+4−2YfK+1+2YsK+1+2YfK+2+5
As discussed above in conjunction with
Referring to
(PM4+CMBM4—0[[−0]])−(PM0+CMBM0—0[[−0]]) 67)
(PM8+CMBM8—0[[−0]])−(PM0+CMBM0—0[[−0]]) 68)
(PM12+CMBM12—0[[−0]])−(PM0+CMBM0—0[[−0]]) 69)
(PM8+CMBM8—0[[−0]])−(PM4+CMBM4—0[[−0]]) 70)
(PM12+CMBM12—0[[−0]])−(PM4+CMBM4—0[[−0]]) 71)
(PM12+CMBM12—0[[−0]])−(PM8+CMBM8—0[[−0]]) 72)
The terms of these differences can be rearranged as:
(PM4−PM0)+(CMBM4—0[[−0]]−CMBMO—0[[−0]]) 73)
(PM8−PM0)+(CMBM8—0[[−0]]−CMBM0—0[[−0]]) 74)
(PM12−PM0)+(CMBM12—0[[−0]]−CMBM0—0[[−0]]) 75)
(PM8−PM4)+(CMBM8—0[[−0]]−CMBM4—0[[−0]]) 76)
(PM12−PM4)+(CMBM12—0[[−0]]−CMBM4—0[[−0]]) 77)
(PM12−PM8)+(CMBM12—0[[−0]]−CMBM8—0[[−0]]) 78)
Because CMBM0_0[[−0]], CMBM4_0[[−0]], CMBM8_0[[−0]], and CMBM12_0[[−0]] are constants, the comparators 202a-202f can be hardwired or programmed to respectively perform the following calculations:
PM4−PM0+Ka 79)
PM8−PM0+Kb 80)
PM12−PM0+Kc 81)
PM8−PM4+Kd 82)
PM12−PM4+Ke 83)
PM12−PM8+Kf 84)
where
Ka=CMBM4—0[[−0]]−CMBM0_0[[−0]]=5 85)
Kb=CMBM8—0[[−0]]−CMBM0—0[[−0]]=1 86)
Kc=CMBM12—0[[−0]]−CMBM0—0[[−0]]=10 87)
Kd=CMBM8—0[[−0]]−CMBM4—0[[−0]]=−4 88)
Ke=CMBM12—0[[−0]]−CMBM4—0[[−0]]=5 89)
Kf=CMBM12—0[[−0]]−CMBM8—0[[−0]]=9 90)
From the differences (85)-(90), the logic 204 selects via the multiplexer 206 the smallest modified path metric MPMsel=Min(PM12+CMBM12_0, PM8+CMBM8_0, PM4+CMBM4_0, PM0+CMBM0_0). The multiplexer 206 is hardwired to add the proper CMBMsel value to the MPMsel value. For example, if PM12+CMBM12_0 is the smallest modified path metric, then the multiplexer 206 generates the sum PM12+CMBM12_0 in response to a corresponding signal from the select logic 204. The adder 208 then sums (PMsel+CMBMsel) and the modified branch metric for state S0, MBM0=2Yfk+2Ysk−2Yfk+1−2Ysk+1 (
The other circuits (not shown) of the CSAU that execute the CSA algorithm for the states S1-S15 are similar to and operate in parallel with the circuit 200. Table IV includes the values of Ka-Kf for all of these other circuits.
Implementing the CSA algorithm using the modified branch metrics of
Like the CSA circuit 200 of
But while the comparators 202a-202f are determining the smallest modified path metric, the adders 208a-208d are respectively generating updated path metrics for the paths having the path metrics PM0, PM4, PM8, and PM12. Therefore, the circuit 220 can provide the updated path metric UPM0 sooner because unlike the circuit 200 (
PM0+CMBM0—0[[−0]]+MBM0[[—0−0]] 91)
PM4+CMBM4—0[[4−0]]+MBM0[[—0−0]] 92)
PM8+CMBM8—0[[8−0]]+MBM0[[—0−0]] 93)
PM12+CMBM12—0[[12−0]]+MBM0[[—0−0]] 94)
where CMBM0_0[[−0]], CMBM4_0[[4−0]], CMBM8_0[[8−0]], and CMBM12_0[[12−0]] are listed in Table III and MBM0[[—0−0]] is shown in
Once the select logic 204 identifies the smallest of the modified path metrics PM0+CMBM0_0, PM4+CBM4_0, PM8+CMBM8_0, and PM12+CMBM12_0, it causes the multiplexer 206 to select as the updated path metric UPM0 the output of the adder 208a-208d that updated the smallest modified path metric MPM, and to load UPM0 into the SMU (
Still referring to
As stated above, because the comparators 202a-202f and the adders 208a-208d operate in parallel and not serially as in the circuit 200 of
The detector 230 includes a recovery circuit 232, which includes a BMU 234 and the CSAU 236, which includes one or more of the circuits 200 or 220—typically sixteen of either the circuits 200 or 220, one for each state S0-S15. The detector 230 also includes a SMU 238, which includes surviving-path-metric registers 240 and surviving-path registers 242, typically one register 240 and one register 242 for each state S0-S15.
In operation, the detector 230 receives a pair of consecutive samples YfK and YsK, and the BMU 234 calculates the modified branch metrics (
The disk-drive system 250 also includes write and read interface adapters 278 and 280 for respectively interfacing the disk-drive controller 266 to a system bus 282, which is specific to the system used. Typical system busses include ISA, PCI, S-Bus, Nu-Bus, etc. The system 250 typically has other devices, such as a random access memory (RAM) 284 and a central processing unit (CPU) 286 coupled to the bus 282.
Number | Name | Date | Kind |
---|---|---|---|
5313495 | Kim | May 1994 | A |
5430744 | Fettweis et al. | Jul 1995 | A |
6081210 | Nikolic et al. | Jun 2000 | A |
6201840 | Rub et al. | Mar 2001 | B1 |
6259749 | Andoh | Jul 2001 | B1 |
Number | Date | Country | |
---|---|---|---|
20040010748 A1 | Jan 2004 | US |