The present disclosure generally relates to error correction in communication systems, and particularly to an Error Correction Coding (ECC) scheme that implements Bayes estimators with general loss functions to take into account a bit significance.
The information bit error rate (BER) in classical error coding is based on the Hamming distance, i.e. the number of bits that are different between the symbols at the transmitter and the decoded symbols at the receiver. For this metric, the positions of the source bits where the error occurred are not significant. This is unsatisfactory since in many scenarios all source bits are not equal and have different significance. For instance, representing an integer in binary, the bits will have different significance with the difference increasing exponentially with its position, and an error in the most significant bit is much more problematic than an error in the least significant bit.
Furthermore, the relationship between the difference |i−j| of two integers i and j and the Hamming distance when expressed as bitstrings is nonlinear and nonmonotonic. For instance, the difference between the numbers 8 and 7 is 1, but expressed as bits, their Hamming distance is 4. On the other hand, the difference between 0 and 2k is 2k, but have a Hamming distance of 1. In image compression, the discrete cosine transform (DCT) coefficients for lower frequencies are more important than the higher frequencies coefficients and luminance coefficients are more important than chrominance coefficients as the human visual system exhibits low-pass behavior and is more sensitive to luminance changes. In stored program computers, the program code is more critical than data as an erroneous instruction can cause the machine to crash whereas incorrect data typically leads to incorrect results, but not cause a machine to crash.
A system and method for Error Correction Coding (ECC) scheme for a communication system that transmits symbols containing bits over a communications channel and which encoding scheme takes into account the difference in significance of the source bits.
A system and method for Error Correction Coding (ECC) scheme for a data storage system that stores symbols containing bits for storage and which encoding scheme takes into account the difference in significance of the source bits.
In one embodiment, the error metric measures the distance in the symbol space, e.g., for 2 symbols i, j expressed as integers, the distance can be (i−j){circumflex over ( )}2 or |i−j|.
According to one aspect of the invention, there is provided an error correction method in a data symbol communication system. the error correction method comprises: receiving a signal at a decoder device, the signal representing an encoded source symbol having source bits; decoding, at the decoding device, the encoded source symbol to obtain a decoded source symbol, and mapping the decoded source symbol to a corresponding source symbol, the encoded source symbol based on an encoding of the source symbol having bits that account for differences in significance of the bits.
According to a further aspect of the invention, there is provided a system for correcting error corrected encoded signals. The system comprises: a decoder device for receiving a signal, the signal representing an encoded source symbol having source bits, the decoder processing the received signal to obtain a decoded source symbol; a mapping device for mapping the decoded source symbol to a corresponding source symbol, the encoded source symbol encoded based on an encoding of source symbol having bits that account for differences in significance of the bits.
In a further aspect, a data storage system is provided for storing encoded symbols. The data storage system comprises a memory for receiving and storing encoded source symbols at memory locations, the encoded source symbols based on an encoding of symbol source having bits that account for differences in significance of the bits.
Objects, features, and advantages of the present disclosure will become more clearly apparent when the following description is taken in conjunction with the accompanying drawings, in which:
The present disclosure relates to devices and methods employing an Error Correction Coding (ECC) scheme for a communication system that communicates data symbols over a communications channel and which applies an ECC encoding scheme to source bits that takes into account the difference in significance of the source bits.
Additionally, the present disclosure relates to devices and methods employing an Error Correction Coding (ECC) scheme for a data storage system that receives and stores data symbols having source bits encoded according to an ECC encoding scheme that takes into account the difference in significance of the source bits.
ECC symbol encoder/decoder circuitry, memory devices, and data methods are described that facilitate the detection and correction of data in both communication systems and memory systems and devices (e.g., memory controllers, memory systems, and/or non-volatile memory devices) and which ECC symbol encoder circuitry applies an ECC encoding scheme to source bits to be communicated over a channel or stored in a memory device that takes into account the difference in significance of the source bits.
In
In further embodiments, the encoding scheme depicted in
According to the embodiments herein, the codebook 108 implements an ECC encoder based on an encoding scheme that optimizes an error metric e(s) such that the codeword (after transmitting through the channel or stored in a memory storage device) become optimally decoded using a corresponding ECC decoder and inverse codebook. According to the invention, an error metric is a measure of the distance in the source (or information) symbol space, e.g., for 2 symbols i, j expressed as integers, the distance can be (i−j)2 or |i−j|. In an implementation, an optimal decoder embodied as a Bayes estimator (from estimation theory) uses the error metric in symbol space as a “Loss” function.
Further, heuristics are used to find codebook codewords that optimizes error metric e(s). In one embodiment, a genetic algorithm is used to find an optimized codebook based on an objective function that incorporate the error metric e(s) and a characteristic of the channel (communications channel over which the encoded bits are transmitted). A penalty function is added to the objective function to ensure all codewords are distinct.
In anther embodiment, hill climbing or variants such as stochastic hill climbing or hill climbing with random restarts are used to optimize the objective function with optional penalty function in order to find the optimized codebook.
In the following description of the error correction encoding application, an integer is denoted as a value n where 0≤n<2k, and for an integer 0≤n<2k, bk (n) denotes the k-bit representation of n, e.g., b4(9)=1001. The bijection between symbols s∈S (representing information or source symbols) and codewords c∈C is denoted by Φ. Each symbol in S is assumed to occur with equal probability in the data stream, i.e. ∀s∈S,
With reference to the communications system 100 of
Further, with reference to
In a further embodiment, the decoded symbol s*=Φ−1(c*) can be retrieved efficiently by implementing a hash table to facilitate the inverse codebook lookup. For example, a hash table and a constructed dictionary indexed by the decoded codewords can be used to map the decoded codeword 117 to the corresponding source symbols.
Given a transmitted codeword c 111 and a received word c′ 113 (that may or may not be a codeword in C), the ideal observer decoding returns the codeword c* such that c*=argmaxw∈Cp(w|c′) where p(w|c′) is the probability that codeword w is sent given that c′ is received. Given the assumption of a uniform prior distribution for codewords c, this is equivalent to maximum likelihood decoding. For certain channel models (e.g. binary symmetric channel with p<0.5), this is equivalent to minimum distance decoding.
To take into account the difference in significance of the bits of the source symbols, instead of bit error rate (BER), an error metric is used that measures the difference between source symbols. The following error rate eδ is considered:
where s(i) is the i-th (transmitted) source symbol and s*(i) is the i-th decoded symbol. The function δ measures the difference between the transmitted symbol and the decoded symbol. In an example of comparing integers, the following two functions are defined:
δ1=|bk−1(s)−bk−1(s*)| and
δ2=(bk−1(s)−bk−1(s*))2
These represent the difference (resp. squared difference) between s and s* when expressed as integers.
In order to minimize this error rate, there is optimized 1) the decoding method D; and 2) the codebook Φ.
In one embodiment, the minimizing of the error rate include applying a Bayes estimator that returns the decoded symbol s*, or equivalently the decoded codeword c*=Φ(s*), that minimizes the expected loss E(L(s*,s)|c′), where L(s*,s) is a loss function measuring the difference between the decoded symbol s* and the transmitted symbol s. In one aspect, the loss function can take into account the different significance of the source or information bits. Consider the posterior distribution p(s|c′) For an additive white Gaussian noise (AWGN) channel, p(s|c′) is proportional to g(μ,σ)(c′−Φ(s)) where g is the multivariate Gaussian pdf with mean μ and variance σ−2. Typically, the mean μ is 0. For a general loss function on a finite discrete symbol space, the Bayes estimator can be implemented by comparing all possible codebook candidates to the received codeword. For the case when s is a scalar quantity and for some special forms of the loss function, the following explicit forms of the Bayes estimator can be used:
In one embodiment, the loss function is the 0-1 loss function, i.e. L(s*,s)=0 if s*=s and 1 otherwise. If the loss function is the 0-1 loss function, then the Bayes estimator is the mode of the posterior distribution and is equivalent to the maximum-a-posteriori estimate or the ideal observer.
In one embodiment, the loss function is the squared error loss function, i.e. L(s*,s)=(s*−s)2. If the loss function is the squared error loss function, then the Bayes estimator is the mean of the posterior distribution.
In one embodiment, the loss function is the absolute value loss function, i.e. L(s*,s)=|s*−s|. If the loss function is the absolute value loss function, then the Bayes estimator is the median of the posterior distribution.
In one embodiment, the loss function is a function L that explicitly weights the bits according to their significance described as follows: assuming that x and y are integers and letting ai(x) and ai(y) denote the i-th bit of x and y respectively, with the 0-th bit denoting the least significant bit. Then the loss function is:
One choice of the coefficients αi are: αi=2i. For these coefficients, L(x,y) is equal to the integer value of the bitwise XOR of x and y.
The Bayes risk is defined as Eπ(L(s*,s)), where π is the prior distribution of s and the Bayes estimator is the estimator that minimizes the Bayes risk. Since π is the uniform distribution by the assumption,
is an unbiased estimator of the Bayes risk. Thus the Bayes estimator is the appropriate choice in order to minimize
which is equal to error rate eδ if
L(s*,s)=δ(s,s*) (1)
This analysis shows that the loss function is selected according to Eq. (1).
Having determined the optimal decoding scheme for a specific form of eδ and a specific codebook, the next step is to find an optimal codebook Φ that minimizes eδ among all codebooks. In one embodiment, the following heuristic is used. In particular, there is found a codebook 108 that minimizes:
v=Σi≠jδ(si,sj)pc(Φ(si)|(Φ(sj))
where pc(x|y) for x, y∈C is the probability of receiving codeword x when codeword y is transmitted. For the AWGN channel with zero mean and variance σ2, pc(x|y) is proportional to
where dH is the Hamming distance. Thus, there is obtained an objective function:
wherein δ (si,sj) is a difference (loss) function and si and sj are the codewords to be constructed for columns i and j and of the optimized codebook. In one embodiment, a genetic algorithm may be used to find codebooks that minimizes the objective function v. To ensure the codewords are distinct, there is added a penalty term to the objective function whenever a candidate codebook satisfies dH(Φ(si),Φ(sj))=0 for some i≠j. Note that the optimal notebook depends on the error function eδ and the channel probability pc. In a preferred embodiment, after adding the penalty function, the objective function is of the form
where g(x)=−w if x=0 and g(x)=x otherwise. The value of w is chosen to be a large number. In a preferred embodiment, w=2 k.
In further embodiments, it is understood that different codebooks can be used (and the codebook choice communicated between the transmitter and the receiver) based on the noise models and/or parameters. That is, based on possible types of noise models and parameters that are likely to be encountered in the communication channel and the error metric of interest, a set of codebooks and Bayes decoder are optimized off-line for different types of noise models and parameters. During implementation of the transmission system, the noise models and parameters of the transmission channel are estimated and the codebook and Bayes decoders whose corresponding noise model and parameters matches the best with the estimated parameters will be chosen. The choice of codebook and Bayes decoder may change as noise conditions change over time, and such choice of codebook and decoder need to be communicated to the receiver and the inverse codebook need to be matched to the codebook in the transmitter.
This ECC encoding and decoding framework illustrated in
Block Code of Rate 4/7
For a (n,k) block code, consider an example case n=7, k=4. Letting the 16=24 symbols be 0, 1, . . . , 15 and each symbol is mapped to a length 7 binary string, there is constructed an optimized codebook assuming an AWGN channel with σ=1. Simulating the performance of this codebook and that of the Hamming (7,4) code, there is considered three decoding schemes:
1. Hard decoding: the received signal is quantized and the decoded codeword is the closest codeword with respect to the Hamming distance.
2. Soft decoding: the decoded codeword is the nearest codeword with respect to the Euclidean distance.
3. Bayes decoding, as described above.
For an AWGN channel the received noisy word is c′=c+noise and the Bayes estimator requires knowing (or estimating) the variance of the noise. With the assumption that the codewords are equiprobable, the variance of the codewords bits are computed as the variance of the bits of codebook Φ. Since the noise is uncorrelated with the codewords, the variance of the noise can be estimated by subtracting the variance of the codewords bits from the estimated variance of the received noisy bits.
Considering both error metrics eδ
This is further illustrated in
Simulating the system of
The following is observed in both figures: hard decoding is worse than soft decoding which is worse than Bayes decoding. In all three decoding schemes, the optimized codebook performs better than the Hamming code. For eδ
It is further seen that the benefits of using the optimized codebook and Bayes decoding are more significant for eδ
Block Code of Rate 3/8
The same simulation as performed in connection with block code of rate 4/7 is performed with k=3 and 8-bit codewords and compared the optimized codebook with the Hadamard (8,3) code. The simulation results for 82 have similar trends as shown in example simulation results depicted in
Block Code of Rate 8/12
The same simulation as performed in connection with a block code rate 8/12 code that maps 8-bit integers to 12-bit codewords. Simulation results comparing an optimized codebook with the modified Hamming (12, 8) code show that results for δ2 have similar trends as shown in example simulation results depicted in
Signed Integers
While embodiments herein have considered nonnegative integers represented as bitstrings, the same approach can be applied to signed integers.
Considering the case of representing source symbols as signed integers (positive and negative) integers −2k−1≤i<2k−1 in k-bit 2's complement format, the optimal codebook may be very different. Note that in this case the Hamming distance between 0 and −1 is k. For the case k=4, rate 4/7, the optimized codebook 800 for eδ
Thus, there is thus provided a framework for error correction coding that takes into account the difference in bit significance in the source symbols by using an appropriate error metric and minimizing it using a Bayes decoder and an optimized codebook.
It is demonstrated that the Bayes decoder performs better than standard soft and hard minimum distance decoding and that the optimized codebook performs better than classical linear block codes such as Hamming codes. The error metric based on the difference |i−j| is similar to assigning an exponential weight 2d to the d-th bit.
The same approach can be applied to other ways of assigning significance to the various bits in the source bit stream by defining 8 appropriately. Furthermore, the framework has considered the AWGN channel model, but the Bayes estimator can be defined and the codebook can be optimized for other channel models (such as a binary symmetric channel (BSC)) as well. In the examples above, the symbol space S has size 2k. This is not a necessary requirement; the optimized codebook can be built to map any number of m symbols to n-bit codewords, resulting in a rate log2 (m)/n code, thus providing a more flexible tradeoff between rate and distortion. The cases of a nonuniform prior distribution for the source symbols or the addition of source (entropy) coding result in a more complicated source symbol posterior distribution, Bayes estimator and the function v used in the heuristic. In addition, the function v can be amended to optimize for multiple noise SNR's.
Furthermore, noise models and parameters can be estimated by sending intermittent probe bits through the channel.
In further embodiments, different codebooks can be used (and the codebook choice communicated between the transmitter and the receiver) based on the noise models and/or parameters.
The system of
As depicted, the components of the system of
Bus 14 may represent one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnects (PCI) bus.
Computer system may include a variety of computer system readable media. Such media may be any available media that is accessible by computer system, and it may include both volatile and non-volatile media, removable and non-removable media.
System memory 16 can include computer system readable media in the form of volatile memory, such as random access memory (RAM) and/or cache memory or others. Computer system may further include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, storage system 18 can be provided for reading from and writing to a non-removable, non-volatile magnetic media (e.g., a “hard drive”). Although not shown, a magnetic disk drive for reading from and writing to a removable, non-volatile magnetic disk (e.g., a “floppy disk”), and an optical disk drive for reading from or writing to a removable, non-volatile optical disk such as a CD-ROM, DVD-ROM or other optical media can be provided. In such instances, each can be connected to bus 14 by one or more data media interfaces.
Computer system may also communicate (via Input/Output (I/O) interfaces 20) with one or more external devices 26, such as a keyboard, a pointing device, a display 28, and/or one or more other devices that enable a user to interact/interface with the computer system; and/or any devices (e.g., network card, modem, network adaptor 22, etc.) that enable computer system to communicate with one or more other computing systems/devices. Such communication can occur via Input/Output (I/O) interfaces 20 and/or network adaptor 22.
By way of further example, the computer system of
The present invention may be a system, a method, and/or a computer program product at any possible technical detail level of integration. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
The descriptions of the various embodiments of the present invention have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
Number | Name | Date | Kind |
---|---|---|---|
6654926 | Raphaeli et al. | Nov 2003 | B1 |
6990626 | Ashikhmin | Jan 2006 | B2 |
8301963 | Hou et al. | Oct 2012 | B2 |
8358685 | Siti et al. | Jan 2013 | B2 |
8548027 | Zeng et al. | Oct 2013 | B2 |
20090245083 | Hamzeh | Oct 2009 | A1 |
20130300591 | Marpe | Nov 2013 | A1 |
20180018590 | Szeto | Jan 2018 | A1 |
Number | Date | Country |
---|---|---|
WO2004084417 | Sep 2004 | WO |
Entry |
---|
Berger, “Error-Correcting Output Coding for Text Classification”, IJCAI Workshop: Machine Learning for Information Filtering, Aug. 1, 1999, Stockholm, Sweden, 8 pages. |
Russell et al., “Artificial Intelligence a Modern Approach”, Second Edition, Pearson Eduction, Inc., Upper Saddle River, New Jersey 07458, Book, 2003, Chapter 4, pp. 111-114. |
Number | Date | Country | |
---|---|---|---|
20190312666 A1 | Oct 2019 | US |