The present invention relates generally to data storage and communication systems. More particularly, the invention relates to the conversion of data to bit sequences satisfying certain constraints, in particular, preventing the local and global imbalance of zeros and ones in a coded binary sequence. The invented method, codes and apparatus are also related to the area of the weakly constrained DC-free (WDCF) codes.
In storage systems such as optical and magnetic storage systems, as well as in some communication systems, user data are usually first encoded by an outer Error Correcting Code (ECC), then encoded by a modulation encoder, and finally optionally encoded by an inner channel encoder. The modulation encoder could be of the Run Length Limiting (RLL) type, the Running Digital Sum limiting (RDS) type or the Direct Current Free (DCF) type. The output of the modulation encoder (or the inner channel encoder if used) can be pre-coded before being recorded onto the media and read by the detector. On the detection side, a Viterbi algorithm is usually used to reconstruct the coded bits. The primary task of the modulation code is to facilitate the front-end stages of the channel, such as a preamp, a timing circuit, an equalizer and others. At the same time, the modulation encoder modifies the distance properties of the output code words of the channel, and therefore can also improve the Bit Error Rate (BER) and Sector Failure Rate (SFR) characteristics of the system.
The conventional modulation codes described in the literature usually employ a state transition diagram. In a finite state encoder, arbitrary user data are encoded to a constrained data sequence via a finite-state machine. The encoder is said to have rate m/n if at each step of the encoding process, one m-tuple of user data is encoded to one n-tuple of constrained data in such a way that the concatenation of the encoded n-tuples obeys the given constraint. The finite-state machine has multiple states, and the encoder or decoder moves from one state to another after the generation of each output tuple. A single error in the received sequence can trigger the generation of wrong states in the decoder, and as a result produce a long sequence of errors. This phenomenon is called “error propagation”, and is often associated with modulation codes constructed from finite-state machines. For the purpose of limiting error propagation, decoding can be implemented via a sliding-block decoder. The state-splitting algorithm can be used for designing finite state encoders for small and moderate values of n, but for large values of n, it usually requires the use of large tables assigning data to codewords in the encoding graph, and is not feasible from a practical point of view.
Recently, various types of iterative detection schemes based on turbo codes, low density parity check (LDPC) codes and turbo product codes (TPC) were developed for application in storage and communication systems. They provide very low BER, but usually require the use of an interleaver after the modulation encoder. An interleaver changes the order of the already coded bits, and as a result nullifies the operation of the modulation encoder. Since encoders based on finite state machines transform the data bits using mapping tables without special structure, the use of such codes in channels with interleaved coded bits is impossible or severely restricted, especially when they are applied to encoding the parity bits of the TPC or LDPC codes.
Embodiments of the present invention provide solutions to these and/or other problems, and offer other advantages over the prior art.
A method of additive encoding of data words includes receiving a plurality of data words, searching a trellis representation of patterns to be used for subsequent modification/encoding of the data words, and additive encoding of the data words using an optimal or sub-optimal sequence of patterns found by the trellis search. The states of the trellis are associated with groups of flags representing different matching patterns. An optimal or sub-optimal sequence of patterns is identified by a survived path between the initial and final states of a trellis representing all possible combinations of matching patterns. The trellis representation can comprise all possible combinations of matching patterns which can be used in a sequence of additive code words. In other words, the trellis representation of matching patterns is searched to identify a sequence of flags representing the sequence of patterns to be used to encode the data words.
Other features and benefits that characterize embodiments of the present invention will be apparent upon reading the following detailed description and review of the associated drawings.
As noted above, various types of iterative detection schemes based on turbo codes, LDPC codes and TPC have been developed for use in storage and communication systems. These iterative detection schemes usually require the use of an interleaver after the modulation encoder. An interleaver changes the order of the already coded bits, and as a result nullifies the operation of the modulation encoder. Since encoders based on finite state machines transform the data bits using mapping tables without special structure, the use of such codes in channels with interleaved coded bits is impossible or severely restricted, especially when they are used to encode parity bits of TPC or LDPC codes. The present invention is directed to a new class of modulation codes and corresponding encoders and decoders. The resulting encoding methods and apparatus are well suited for such applications, since they are based on simple component-wise modulo q operations.
The above-mentioned problems are addressed in the current invention, where a method of, and apparatus for, additive trellis encoding are provided. The encoding method uses a sequence of matching patterns, chosen from a predefined set, which is added component-wise modulo q to the original data. A sequence of flags is also sent to the receiver to identify the sequence of matching patterns that was used. A trellis search algorithm for the optimized choice of the sequence of flags is also provided. A new metric utilizing the two variances corresponding to the positive and negative values of the RDS is used to search for the “best” sequence of flags (matching patterns). Results of tests and simulations have demonstrated the efficiency of the encoding schemes, the trellis search algorithms and the new metric of the present invention.
The present invention can be used, for example, in data storage systems and/or communication systems. For example,
While a particular disc drive is shown, disc drive 100 is intended to represent any of a variety of data storage devices in which the methods and apparatus of the present invention can be implemented. For example, in other embodiments, disc drive 100 can be other types of magnetic disc drives, or can be other types of non-magnetic disc drives such as an optical disc drive, a magneto-optical disc drive, etc. The methods and apparatus disclosed herein can also be used in other data storage devices, for example in magnetic tape storage devices. Further still, the methods and apparatus of the present invention can be used in environments other than data storage systems. For instance, the methods and apparatus of the present invention can also be used in communication systems. The following discussion, though directed specifically to data storage systems at times, is intended to be applicable to all such uses of the present invention, and disc drive 100 is intended to generally represent various types of data storage systems and communication systems in which the present invention can be practiced.
1. Example Channel Circuitry
The present invention is particularly useful in read/write channels of data storage systems such as magnetic and/or optical disc drives.
Channel 200 includes a number of different encoding/decoding circuits, each encoding or decoding data in different manners for different purposes. The various circuits shown in the blocks of
The following discussion of channel 200 provides a general understanding of a typical environment in which the modulation encoder methods of the present invention can be implemented, but is not intended to limit the invention to any particular channel configuration or functionality. Assume that data bits of a message word to be recorded on the recording media (heads/media 225) are provided to Reed-Solomon (RS) error correcting code (ECC) circuit 205. Error correcting code circuit 205 introduces additional bits to the message data bits. The additional bits improve the ability of the system to recover the signal when the encoded signal has been corrupted by noise introduced by the recording channel. Also, channel circuitry 200 shown in
Within the inner sub-channel 207, modulation encoder 210 is included to implement an additive coding scheme for encoding the data received from ECC encoder 205. Note that ECC encoder 205 is optional in some embodiments, and that generally modulation encoder 210 performs additive encoding on data it receives, without limitation to ECC encoded data. As used herein, references to data words received at the input of modulation encoder 210 are intended to include either un-encoded data or previously encoded data to which DCF additive encoding is to be applied. As will be described below in detail, modulation encoder 210 is configured to implement a trellis search algorithm to search code pattern flags for a best pattern to add to a message or data word to provide a code word as an output. Modulation decoder 245 determines, based on the flags of the code words, which patterns to again add to the code words to retrieve the data originally encoded by modulation encoder 210.
Channel encoder(s) 215 and interleaver/precoder 220 represent optional additional encoders and interleavers of the types known in the art which use known encoding schemes to encode the data from modulation encoder 210. For example, channel encoder(s) 215 can include TPC encoders, LDPC code encoders, etc. As a further example, circuitry 220 can optionally include a precoder used to implement a code of rate 1/1. Generally, a precoder is used to eliminate catastrophic error events and/or to convert the data from binary to another format such as bipolar. Front-end and timing circuit 230 filters and converts an analog read back signal from the heads/media 225 into a digital signal, providing timing for sampling of the read back signal. Channel detector 235 and outer decoder 240 can, in some embodiments, function together to convert the digital signal into a binary (i.e., 1's and 0's) signal. Again, the modulation encoder 210 and corresponding additive trellis encoding and search methods of the present invention, which the encoder is configured to implement, are not limited to use with the other components or configurations shown in
2. Generic Additive Coding Scheme
The necessity of data modification usually arises during encoding when an encoder gets side information from a channel or other source which defines new temporary requirements for a code word to be generated at the current moment of time. The side information could be a constraint of an RLL type, an accumulated RDS, a maximum value of the RDS in the past, or another metric. When the side information is quantified, it can be considered as a current state of a channel. In an additive coding scheme, an encoder 210 uses a predefined set of special words called “patterns,” and, given a state of the channel, chooses one of the patterns in the set for each data word of the same length according to some criteria. The different patterns in the set are identified or distinguished using one or more flag or prefix bits. Then, the data word and pattern are added component-wise modulo 2 if the input and output alphabets of the channel are binary, and the result is sent to the channel. A standard sector of the magnetic recording system can consist of N>70 code words, each of length n bits with 20<n<100. Therefore, even for single-bit flags, when two patterns are used to encode one code word, the total number of flag combinations to encode N data or message words is 2^N, i.e., greater than 2^70 (each of the N data or message words can be encoded using one of the two patterns to obtain the resulting code word), and cannot be searched in a brute force manner due to processing and time limitations. It is a non-trivial problem to find what patterns are to be used, and how to optimize their choice in a long sequence of additive coding steps. The present invention provides a search method using a trellis which is similar to the Viterbi algorithm usually used for decoding purposes, but in fact is different from the Viterbi algorithm. To better understand the invention, a formal description of a generic additive coding scheme is provided.
Let q be a positive integer, and E={0,1,2, . . . , q−1}. An additive code B={B1,B2, . . . ,BM} for the transmission of M=q^m messages through a channel with the input and output alphabets E^n is defined by:
In order to increase the rate of an additive code R=(n−r)/n, the minimal possible number of matching patterns should be used. The Lemma from Appendix A included below provides the answer to the question of how many patterns are required for the absolutely reliable transmission of m q-ary symbols over a constrained deterministic channel with a given set of states S known to the encoder, but unknown to the decoder. Given a masking set C satisfying the conditions of the Lemma from Appendix A, one can encode and decode as follows:
Encoding step. According to the Lemma from Appendix A, Bu∩Ys≠∅ for any u∈U and s∈S. Therefore, for any message u and any state s there is a pattern {overscore (c)}∈C such that the code word x(u,s)={overscore (u)}⊕{overscore (c)}∈Bu∩Ys. In other words, by simple component-wise additive operations one can always generate a code word x(u,s) which belongs to the set of words Ys transmitted through the channel without errors. This code word x(u,s) represents the message u.
Decoding step. According to Equation 2, Bi∩Bj=∅ for all 1≤i≠j≤M. Therefore, all {overscore (v)}∈Bu are simply decoded into the output message u. In practice, in the additive coding scheme this can be done in the following manner:
If the number of patterns in the set C is not sufficient to guarantee the error-free transmission through the constrained channel, or to satisfy the channel input constraints at the current moment of time, the best possible pattern in the given set C must be chosen, and a metric is required to determine which combination of the 2^N possibilities is “the best”. Examples of such metrics are provided below.
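The encoding and decoding steps of the generic additive scheme can be illustrated with a minimal sketch. The function names, the example patterns and the data words below are hypothetical and chosen only for illustration; the sketch assumes the flag is delivered to the decoder alongside the code word. For q=2 the subtraction in the decoder coincides with adding the pattern again.

```python
from typing import List, Sequence, Tuple


def add_mod_q(a: Sequence[int], b: Sequence[int], q: int) -> List[int]:
    """Component-wise addition modulo q."""
    return [(x + y) % q for x, y in zip(a, b)]


def sub_mod_q(a: Sequence[int], b: Sequence[int], q: int) -> List[int]:
    """Component-wise subtraction modulo q."""
    return [(x - y) % q for x, y in zip(a, b)]


def additive_encode(data: Sequence[int], patterns: Sequence[Sequence[int]],
                    flag: int, q: int) -> Tuple[int, List[int]]:
    """Add the pattern selected by `flag` to the data word; return (flag, code word)."""
    return flag, add_mod_q(data, patterns[flag], q)


def additive_decode(flag: int, codeword: Sequence[int],
                    patterns: Sequence[Sequence[int]], q: int) -> List[int]:
    """Remove the pattern identified by `flag` to recover the data word."""
    return sub_mod_q(codeword, patterns[flag], q)


# Binary example (q = 2) with two hypothetical patterns: all-zero and alternating.
binary_patterns = [[0, 0, 0, 0, 0, 0], [1, 0, 1, 0, 1, 0]]
binary_data = [1, 1, 1, 1, 0, 0]
binary_flag, binary_codeword = additive_encode(binary_data, binary_patterns, 1, 2)
binary_decoded = additive_decode(binary_flag, binary_codeword, binary_patterns, 2)

# Ternary example (q = 3) with a single hypothetical nonzero pattern.
ternary_patterns = [[0, 0, 0], [1, 2, 2]]
ternary_data = [2, 1, 0]
ternary_flag, ternary_codeword = additive_encode(ternary_data, ternary_patterns, 1, 3)
ternary_decoded = additive_decode(ternary_flag, ternary_codeword, ternary_patterns, 3)
```

How the flag is chosen is deliberately left out of this sketch; it only shows that the decoder needs nothing beyond the flag and the shared pattern set.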
3. Trellis Encoder for the Search of Flag Bits.
Now, a description is provided of an application of additive coding in a magnetic recording channel (or other type of channel) when the suppression of the DC content of a signal is required. As shown in
In magnetic recording channels it is difficult to compensate for the rate loss, and for this reason the use of a high rate code is desirable. Although the technique described below works well with an arbitrary number of redundant bits r, an encoding scheme with one redundant bit per code word of length n, i.e., with the rate R=(n−1)/n, is described in detail. One redundant bit can be used to enumerate two patterns. When one of the patterns consists of all zeros and the other pattern is nonzero, it can also be said that the redundant bit is a flag showing whether or not a nonzero pattern is used. Note that the addition of the all-zero pattern to the original data leaves the original data “as is”. All generalizations to multiple matching patterns are straightforward.
4. Description of a Trellis.
Let N be the number of additive code words in a sector. In the specific implementation of an additive trellis scheme with single-bit flags (only one nonzero pattern is used), the number of flags in the sector is also equal to N. Let v be a small integer parameter. By partitioning the sequence of flags into blocks consisting of v bits (the last block may have fewer than v bits), the task of flag searching can be reduced to determining ⌈N/v⌉ blocks of flags (here, ⌈x⌉ denotes the smallest integer greater than or equal to x). When the blocks of flags are represented by the nodes of the trellis they are also called states of the trellis. In other words, the state of a trellis is a group of v flag bits, and there are 2^v different states for the given parameter v. Therefore, in order to represent all possible combinations of N single-bit flags in the coded sector, we need a full trellis with ⌈N/v⌉ sections, each consisting of 2^v states connected by edges with the neighbor states. In the full trellis each state is connected by edges with all directly preceding and following states. For v=2 the trellis is shown in
Note that if an additive code uses L>2 patterns, the flag consists of r>1 bits. The total number of bits in N flags is equal to rN. In this case, it is convenient to choose v as a multiple of r, so that an integer number of flags forms a state of a trellis. The trellis will have the same number 2^v of states, but rN/v sections.
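The trellis dimensions described above follow directly from N, r and v. The short sketch below computes them and enumerates the states for a given v; the function names and the numeric values are illustrative assumptions, not values taken from the specification.

```python
import math
from itertools import product
from typing import List, Tuple


def trellis_shape(num_words: int, flag_bits: int, v: int) -> Tuple[int, int]:
    """Return (sections, states per section) for N code words with r-bit flags.

    The rN flag bits are split into blocks of v bits; the last block may be short,
    hence the ceiling.
    """
    total_flag_bits = num_words * flag_bits
    sections = math.ceil(total_flag_bits / v)
    return sections, 2 ** v


def trellis_states(v: int) -> List[Tuple[int, ...]]:
    """Enumerate all blocks of v flag bits, i.e. the states of one full section."""
    return list(product((0, 1), repeat=v))


shape_single_bit = trellis_shape(num_words=75, flag_bits=1, v=2)
shape_two_bit = trellis_shape(num_words=72, flag_bits=2, v=4)
states_v2 = trellis_states(2)
```

With single-bit flags, N=75 and v=2 the last section holds only one flag, which is why the section count is rounded up.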
As described, in the example trellis 600 shown in
As described, each of the trellis states is connected by edges with all directly preceding and following states. As an example, edges 610-1 through 610-4 are the edges between trellis state 615-4 and the four possible following states of the trellis (labeled 617-1 through 617-4). For simplicity of illustration, not all edges in trellis 600 are represented in
5. Metrics for the Trellis Search of Flags.
An additive coding scheme can be used for different purposes, in particular, generation of sequences of bits satisfying certain input constraints. Some important constraints used in theory and practice of magnetic recording (or other types of recording) are related to the notions of the disparity and the RDS of the binary or bipolar word as described below. The disparity of a binary word is the number of ones minus the number of zeros in the word. The disparity of a bipolar word with components +1 and −1 is defined in the same way, i.e., the number of “1”s minus the number of “−1”s in the word. The disparity of a bipolar subsequence from a given instant (t=1) to the current position or time moment t is called the RDS at this time moment, and is denoted as zt=RDS(t). Therefore, RDS(n) at the end of the bipolar word of length n is the disparity of this word. Formally, an RDS function is defined as shown in the following discussion.
Let {overscore (x)}=(x1, x2, . . . , xi, . . . ) be a bipolar sequence. Note that the bipolar values −1 and +1 of xi are often represented by their logical equivalents “0” and “1”, respectively. The RDS function is then defined as shown in Equation 3:
zt=RDS(t)=x1+x2+ . . . +xt, Equation 3
where z0=0. The RDS function of the second order is defined by Equation 4:
RDS2(t)=RDS(t)−RDS(t−1). Equation 4
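As a small illustration of the definitions of disparity and RDS given in this section, the following sketch maps a binary word to bipolar form and accumulates its running digital sum. The helper names and the example word are hypothetical.

```python
from itertools import accumulate
from typing import List, Sequence


def to_bipolar(bits: Sequence[int]) -> List[int]:
    """Map logical 0/1 to the bipolar values -1/+1."""
    return [1 if b else -1 for b in bits]


def running_digital_sum(bipolar: Sequence[int]) -> List[int]:
    """RDS(t) for t = 1..len(bipolar): the disparity of the prefix ending at t."""
    return list(accumulate(bipolar))


def disparity(bipolar: Sequence[int]) -> int:
    """Number of +1 symbols minus number of -1 symbols in the word."""
    return sum(bipolar)


example_bits = [1, 1, 0, 1, 0, 0, 0, 1]
example_bipolar = to_bipolar(example_bits)
example_rds = running_digital_sum(example_bipolar)
example_disparity = disparity(example_bipolar)
```

The last entry of the RDS list equals the disparity of the whole word, as stated in the text.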
The accumulated variance of RDS is defined by Equation 5:
In many practical magnetic or other recording systems with AC-coupled preamps, it is not desirable to have signals with the low frequency components that determine the DC-content of a signal. The standard metrics used for the suppression of the DC content include:
The present invention includes an algorithm for the search of flags using a trellis such as described above. The algorithm operates or moves sequentially from one section of the trellis to the next until it reaches the last section. While moving along the trellis, the algorithm updates the metrics of the states in each section using the following three memory buffers:
In contrast to a conventional Viterbi algorithm using additive metrics in the decoding process, the main metrics used in the present invention, in particular and for example, the metrics described in the previous section (except the RDS itself and its accumulated variance), are not additive. Therefore, in order to operate with non-additive metrics, multiple real values (called metric components) are stored for each state in buffers A and B. Examples of the metric components are the maximum value of the RDS in the past, the pair of accumulated variances (var_0(t), var_1(t)) and the final value of the RDS defined earlier. These metrics are calculated for the different states in the trellis as clarified below.
Let {overscore (s)}t=(s1,s2, . . . ,st) be a sequence of states in the trellis originating at the state s1 at the time moment t=1 and terminating at the state st in the section corresponding to the time moment t. This sequence of states, connected by edges, is also called a path in the trellis. The state of a trellis was defined as a block of v flag bits, and therefore when the path {overscore (s)}t is given, the first vt flags and the corresponding matching patterns {overscore (c)}1, {overscore (c)}2, . . . , {overscore (c)}vt are known. These patterns and the data words {overscore (u)}1,{overscore (u)}2, . . . , {overscore (u)}vt define the output binary code word {overscore (v)}1, {overscore (v)}2, . . . , {overscore (v)}vt corresponding to the path {overscore (s)}t, i.e.,
{overscore (v)}({overscore (s)}t)=({overscore (u)}1⊕{overscore (c)}1,{overscore (u)}2⊕{overscore (c)}2, . . . , {overscore (u)}vt⊕{overscore (c)}vt) Equation 8
The metrics of the path originating at the state s1 and terminating at the state st are calculated for the code word defined by Equation 8 using the relationships illustrated in Equations 3-7, where {overscore (x)}=(x1,x2, . . . , xi, . . . ) is the code word {overscore (v)}({overscore (s)}t) in the bipolar form.
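The relationship between a path, its code word (Equation 8) and the metric components can be sketched as follows. This is a simplified illustration under stated assumptions: single-bit flags, two hypothetical patterns (all-zero and all-one), and only two of the metric components mentioned above (the maximum absolute RDS and the final RDS); the variance components are not reproduced, and the flag bit itself is not counted in the RDS.

```python
from typing import List, Sequence, Tuple


def codewords_for_path(path: Sequence[Tuple[int, ...]],
                       data_words: Sequence[Sequence[int]],
                       patterns: Sequence[Sequence[int]]) -> List[List[int]]:
    """Equation 8: XOR each data word with the pattern selected by its flag.

    Each state on the path is a block of v flag bits; concatenating the states
    gives one flag per data word.
    """
    flags = [flag for state in path for flag in state]
    return [
        [d ^ c for d, c in zip(word, patterns[flag])]
        for flag, word in zip(flags, data_words)
    ]


def metric_components(codewords: Sequence[Sequence[int]]) -> Tuple[int, int]:
    """Return (max |RDS|, final RDS) of the concatenated code words in bipolar form."""
    rds, peak = 0, 0
    for word in codewords:
        for bit in word:
            rds += 1 if bit else -1
            peak = max(peak, abs(rds))
    return peak, rds


# Hypothetical example: v = 2, four data words of length 4.
patterns = [[0, 0, 0, 0], [1, 1, 1, 1]]
data_words = [[1, 1, 1, 0], [1, 1, 0, 1], [0, 0, 0, 1], [1, 0, 1, 1]]
path = [(0, 1), (1, 0)]  # flags 0, 1, 1, 0
path_codewords = codewords_for_path(path, data_words, patterns)
path_components = metric_components(path_codewords)
```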
An embodiment of the search algorithm of the present invention can be described as follows.
Step 1. At the time moment t=1 for each state s∈S, the algorithm calculates and stores in the memory buffer A:
In summary, as described above, the present invention includes a method or technique for the suppression of the DC-content of signals. It is based on an additive coding scheme combined with one or more concepts or solutions, such as:
As shown at block 710, the method includes searching the full trellis representation of additive code words to identify the best or optimized path between sequences of states in the trellis. This path is the sequence of survivor states identified as discussed above, for example with reference to
It must be noted that additive coding (i.e., adding patterns to the data words) can be started before the search algorithm terminates at the final state, in which case the flags already produced at the earlier stages of the trellis search are used to choose the matching patterns. Such an implementation can be accomplished using a short path memory, and, by the proper choice of the decision delay, the degradation of the characteristics can be kept to a minimum.
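A survivor-path search over blocks of flags, of the general kind described in this section, can be sketched as follows. This is not the metric or buffer layout of the invention: it is a minimal illustration that assumes single-bit flags, one hypothetical nonzero pattern, a full trellis, and a simplified non-additive metric (the maximum absolute RDS, with ties broken by the absolute final RDS). Because the metric is not additive, the surviving path is not guaranteed to be globally optimal.

```python
from itertools import product
from typing import Dict, List, Sequence, Tuple


def extend(rds: int, peak: int, bits: Sequence[int]) -> Tuple[int, int]:
    """Update the running RDS and its peak absolute value over new code bits."""
    for b in bits:
        rds += 1 if b else -1
        peak = max(peak, abs(rds))
    return rds, peak


def search_flags(data_words: Sequence[Sequence[int]],
                 pattern: Sequence[int], v: int) -> Tuple[List[int], int, int]:
    """Return (flags, max |RDS|, final RDS) of the surviving path."""
    # Each survivor stores metric components: (peak, rds, flags so far).
    survivors: Dict[Tuple[int, ...], Tuple[int, int, List[int]]] = {(): (0, 0, [])}
    for start in range(0, len(data_words), v):
        block = data_words[start:start + v]  # the last block may be shorter than v
        new_survivors = {}
        for state in product((0, 1), repeat=len(block)):
            # Code bits produced when this block of flags is applied.
            bits = [d ^ (p & f)
                    for f, word in zip(state, block)
                    for d, p in zip(word, pattern)]
            best = None
            for peak0, rds0, flags0 in survivors.values():
                rds, peak = extend(rds0, peak0, bits)
                cand = (peak, abs(rds), rds, flags0 + list(state))
                if best is None or cand[:2] < best[:2]:
                    best = cand
            new_survivors[state] = (best[0], best[2], best[3])
        survivors = new_survivors
    peak, rds, flags = min(survivors.values(), key=lambda s: (s[0], abs(s[1])))
    return flags, peak, rds


# Hypothetical example: inversion pattern, four data words, v = 2.
example_words = [[1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 1, 0], [1, 1, 1, 1]]
found_flags, found_peak, found_rds = search_flags(example_words, [1, 1, 1, 1], v=2)
unflagged_rds, unflagged_peak = extend(0, 0, [b for w in example_words for b in w])
```

In this example the unflagged sector reaches an RDS of 14, while the surviving path keeps the absolute RDS at or below 4 by inverting every second word.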
Referring now to
7. Protection of Flags by a Local ECC
The disclosed encoding scheme uses multiple flags that are transmitted through the channel and used by the DCF decoder to recover matching patterns. A single bit error in the flag causes the use of a wrong pattern in the DCF decoder, and therefore can produce w output errors, where w is the Hamming weight of the sum modulo q of the correct and wrong matching patterns. This does not result in catastrophic error propagation, but still is not a desirable feature in any coding scheme. In the proposed encoding the number of flags is relatively small, and they are separated from each other by a span of “non-flags” of length n−1, where n is the length of a single matching pattern. Therefore, a typical burst of errors at the output of the Viterbi detector produces only a single bit flag error, and a short Hamming code is sufficient to correct all single errors. In the low SNR region double errors are also possible, and more powerful ECC codes are to be used to protect flags.
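The idea of protecting a group of flags with a short single-error-correcting code can be illustrated with a standard Hamming (7,4) code. The choice of the (7,4) code, the grouping of four flags per code word and the function names are assumptions made for this sketch; the specification does not state which Hamming code is used.

```python
from typing import List, Sequence


def hamming74_encode(flags: Sequence[int]) -> List[int]:
    """Encode 4 flag bits into 7 bits; parity bits sit at positions 1, 2 and 4."""
    d1, d2, d3, d4 = flags
    p1 = d1 ^ d2 ^ d4  # covers positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # covers positions 2, 3, 6, 7
    p4 = d2 ^ d3 ^ d4  # covers positions 4, 5, 6, 7
    return [p1, p2, d1, p4, d2, d3, d4]


def hamming74_decode(received: Sequence[int]) -> List[int]:
    """Correct at most one bit error and return the 4 flag bits."""
    word = list(received)
    syndrome = 0
    for position, bit in enumerate(word, start=1):
        if bit:
            syndrome ^= position  # XOR of the 1-based positions of all set bits
    if syndrome:
        word[syndrome - 1] ^= 1  # the syndrome is the position of the error
    return [word[2], word[4], word[5], word[6]]


def error_weight(correct: Sequence[int], wrong: Sequence[int]) -> int:
    """Binary case of w: Hamming weight of the XOR of correct and wrong patterns."""
    return sum(a ^ b for a, b in zip(correct, wrong))


protected = hamming74_encode([1, 0, 1, 1])
recovered = []
for i in range(7):
    corrupted = list(protected)
    corrupted[i] ^= 1  # inject a single bit error at each position in turn
    recovered.append(hamming74_decode(corrupted))
w_example = error_weight([0, 0, 0, 0], [1, 1, 1, 1])
```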
Referring now to FIG., shown is a block diagram 750 illustrating more particular embodiments of the methods and systems of the present invention. As illustrated, data words are provided at input 751 to the trellis search implementing circuitry 755, with the result being the flags 756 which are identified in the trellis search. The flags represent the patterns used to additively encode the data words, and thus additive coding circuitry 760 uses the flags to generate additive code words 761 as described above. The additive code words are generated, or are selected from previously generated additive code words, to represent the additive combination of data words with the patterns identified by the flags. As discussed above, the flags 756 can also be provided to a local ECC encoder 765 to generate ECC encoded flags 766, thus protecting the flags. The additive code words 761 and the ECC encoded flags 766 are multiplexed using multiplexer 770 to produce an encoded sector 770.
8. Results of Tests and Simulations
Spectral Properties.
Results of the Error Propagation Test A.
Two types of tests were run to evaluate error propagation in the designed DCF decoder (modulation decoder 245) of the present invention. In the first test (designated test “A” for discussion purposes), single bit errors are artificially created at the input of the DCF decoder at different positions of the coded sector. Each single bit input error can create multiple output errors. The number of errors produced by the DCF decoder at its output is counted. A simple Hamming code was used in an implementation of the DCF encoder to protect flag bits in the data sector. This solution results in close to zero error propagation. As can be seen from
Results of the Error Propagation Test B.
In the second test (designated test “B”), the BER was directly estimated at the input and output of the DCF decoder (modulation decoder 245) by simulation of a complete perpendicular magnetic recording system. In the simulations, the received signal was equalized using a generalized partial response target of length 4 (GPR4). An AC-coupled preamp was modeled by a high-pass filter with the cut-off frequency set to 1/1000 of the baud rate. The user normalized linear density (uND) is equal to 2.3, while the channel bit density (cbd) is adjusted according to the code rate R using the formula cbd=uND/R.
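As a worked instance of the formula cbd=uND/R, the sketch below evaluates the channel bit density for a rate (n−1)/n code. The code word length n=50 is a hypothetical value chosen only for the arithmetic; the specification does not state the length used in the simulations.

```python
def code_rate(n: int, r: int = 1) -> float:
    """Rate R = (n - r) / n of an additive code with r redundant bits per code word."""
    return (n - r) / n


def channel_bit_density(und: float, rate: float) -> float:
    """cbd = uND / R: the channel density grows as the code rate drops."""
    return und / rate


n_example = 50  # hypothetical code word length
rate_example = code_rate(n_example)
cbd_example = channel_bit_density(2.3, rate_example)
```

For this assumed length the rate is 0.98, so the channel bit density is about 2.35, slightly above the user density of 2.3.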
Appendix A.
Let |Ys| be the number of different words that can be transmitted without a single error through the constrained deterministic channel set to the state s∈S.
Lemma. For an arbitrary constrained deterministic channel with an “informed” encoder, receiving the side information, and any integer m such that
there exists a set of matching patterns C with L=q^(n−m) words, such that for any u∈U and s∈S,
Bu∩Ys≠∅.
It is to be understood that even though numerous characteristics and advantages of various embodiments of the invention have been set forth in the foregoing description, together with details of the structure and function of various embodiments of the invention, this disclosure is illustrative only, and changes may be made in detail, especially in matters of structure and arrangement of parts within the principles of the present invention to the full extent indicated by the broad general meaning of the terms in which the appended claims are expressed. For example, the particular elements may vary depending on the particular application for the encoding system while maintaining substantially the same functionality without departing from the scope and spirit of the present invention. In addition, although embodiments described herein are directed toward use in a data storage system, it will be appreciated by those skilled in the art that the teachings of the present invention can be applied to other data storage systems and communication systems, for example, without departing from the scope and spirit of the present invention.
Further, while in some embodiments the present invention is described with reference to a search of a “full trellis”, it is not necessary that the trellis representation be a full trellis representation. The present invention can also be used in a “partial trellis” representation, for example one in which some edges are deleted from the full trellis. This deletion of edges (pruning of the trellis) can in some embodiments result in increased efficiency or other processing benefits.