A field of the invention is data coding and compression. Embodiments of the invention provide WOM (Write Once Memory) coding methods and devices.
A Write Once Memory (WOM) is a storage medium with binary memory elements, called cells, that can change from the zero state to the one state only once, except, in some types of memory, upon a block erase. WOM-codes were originally designed for memories that consist of binary memory elements that could physically only be changed from a zero state to a one state. Examples of such memories are punch cards and optical disks. More recently, WOM-codes have been designed for general usage in different types of memories, including flash memories. See, e.g., A. Jiang, “On the Generalization of Error-Correcting WOM-codes,” in Proc. IEEE Int. Symp. Inform. Theory, pp. 1391-1395, Nice, France (2007); A. Jiang and J. Bruck, “Joint coding for flash memory storage,” in Proc. IEEE Int. Symp. Inform. Theory, pp. 1741-1745, Toronto, Canada (July 2008); H. Mahdavifar, P. H. Siegel, A. Vardy, J. K. Wolf, and E. Yaakobi, “A Nearly Optimal Construction of Flash Codes,” in Proc. IEEE Int. Symp. Inform. Theory, pp. 1239-1243, Seoul, Korea (July 2009).
A WOM-code allows the reuse of a write-once medium by introducing redundancy into the recorded bit sequence and, in subsequent write operations, observing the state of the medium before determining how to update the contents of the memory with a new bit sequence.
A simple example enables the recording of two bits of information in 3 memory elements, twice. The encoding and decoding rules for this WOM-code are described in a tabular form in the table below. It is easy to verify that after the first 2-bit data vector is encoded into a 3-bit codeword, if the second 2-bit data vector is different from the first, the 3-bit codeword into which it is encoded does not change any code bit 1 into a code bit 0, ensuring that it can be recorded in the write-once medium.
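The two-write example can be sketched in a few lines of Python. The particular assignment of 2-bit messages to 3-bit codewords below is an assumption (one common presentation of this code); the table referred to in the text may order its rows differently. The names `FIRST`, `SECOND`, `encode`, `decode`, and `is_monotone` are illustrative only.

```python
# First-write codewords (weight <= 1). The assignment is an assumed, common presentation.
FIRST = {"00": "000", "01": "100", "10": "010", "11": "001"}
# Second-write codewords are the bitwise complements of the first-write codewords.
SECOND = {m: "".join("1" if b == "0" else "0" for b in cw) for m, cw in FIRST.items()}


def decode(state):
    """Recover the 2-bit message from the 3-cell state."""
    table = FIRST if state.count("1") <= 1 else SECOND
    return next(m for m, cw in table.items() if cw == state)


def encode(message, state="000", second_write=False):
    """Return the new cell state; cells only ever change from 0 to 1."""
    if not second_write:
        return FIRST[message]
    if decode(state) == message:  # same data: leave the cells untouched
        return state
    return SECOND[message]


def is_monotone(old, new):
    """True if no cell goes from 1 back to 0."""
    return all(not (o == "1" and n == "0") for o, n in zip(old, new))
```

For instance, writing `"01"` and then `"10"` moves the cells from `100` to `101`, which only sets an additional cell.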
The sum-rate of the WOM-code is the sum of all the individual rates for each write. While there are different ways to analyze the efficiency of WOM-codes, we find that the appropriate figure of merit is to analyze the sum-rate under the assumption of a fixed number of writes. In general, the more writes the WOM-code can support, the better the sum-rate it can achieve. The goal is to give upper and lower bounds on the sum-rates of WOM-codes while fixing the number of writes t to a desired number.
An embodiment of the invention provides a family of 2-write WOM-codes, preferred embodiments of which provide improved WOM-rates. Embodiments of the invention provide constructs for linear codes C having a 2-write WOM-code. Embodiments of the invention provide 2-write WOM-codes that improve the best known WOM-rates known to the present inventors at the time of filing with two writes. Preferred WOM-codes are proved to be capacity achieving when the parity check matrix of the linear code C is chosen uniformly at random.
Preferred embodiments of the invention provide an electronic device utilizing an efficient coding scheme of WOM-codes with two write capability. The coding method is based on linear binary codes and allows the electronic device to write information to the memory twice before erasing it. This method can be applied for any kind of memory systems, and in particular for flash memories. The method is shown to outperform all well-known codes.
The invention addresses two problems related to 2-write WOM-codes: 1) the number of messages written to the memory on each write is the same; and 2) a different number of messages can be written on each write. For the case of 2-write WOM-codes, the theoretical bound on the WOM-rate is approximately 1.5458 for the first problem and approximately 1.58 for the second problem. Since the best known WOM-rate is approximately 1.34 for the first problem and 1.37 for the second problem, there is still room for improvement in closing these gaps.
Preferred embodiments of the invention are applicable to memories having cells that can change their state from zero to one but not from one to zero except upon an erase of the entire memory. Preferred embodiments of the invention are t-write WOM-codes that conform to Theorem 1 in the description below.
Preferred embodiments of the invention will now be discussed with respect to the drawings. The drawings may include schematic representations, which will be understood by artisans in view of the general knowledge in the art and the description that follows. Features may be exaggerated in the drawings for emphasis, and features may not be to scale.
Preferred embodiment methods and devices use a two-write WOM-code construction that reduces the gap between the upper and lower bounds on the sum-rates for both fixed- and unrestricted-rate WOM-codes. In Reference [28], “coset-coding” is used only on the second write in order to generate ε-error two-write WOM-codes. However, in ε-error two-write WOM-codes, the second write is not guaranteed in the worst case but only succeeds with high probability. Methods and codes of the invention derive a two-write WOM-code from every linear code. A “coset-coding” scheme is used only on the second write, as in Reference [28], but the first write is modified such that the second write is guaranteed in the worst case. Preferred specific embodiment WOM-codes have better sum-rates than the previously best known codes discussed above. When the parity-check matrix of the linear code is chosen uniformly at random, there exist WOM-codes that achieve all points in the capacity region of two-write WOM-codes. An example application of a preferred method generates from each two-write WOM-code a code for the Blackwell channel.
A. Two-Write WOM-Codes Construction
Let C[n,k] be a linear code with parity-check matrix $H$. For each $v \in \{0,1\}^n$ we define the matrix $H_v$ as follows. The i-th column of $H_v$, $1 \le i \le n$, is the i-th column of $H$ if $v_i = 0$, and otherwise it is the all-zero column. The set $V_C$ is defined to be
$$V_C = \{v \in \{0,1\}^n \mid \operatorname{rank}(H_v) = n-k\}. \qquad (1)$$
We first note the following property. If a vector $v$ belongs to $V_C$, its weight is at most $k$.
The support of a binary vector $v$, denoted by supp($v$), is the set $\{i \mid v_i = 1\}$. A vector $v$ covers a vector $c$ if supp($c$) is contained in supp($v$). The dual of the code C is denoted by $C^{\perp}$. The next lemma is a variation of a well known result (see, e.g., Reference [5]).
Lemma 1. Let C[n,k] be a linear code with parity-check matrix $H$. For each vector $v \in \{0,1\}^n$, $\operatorname{rank}(H_v) = n-k$ if and only if $v$ does not cover any non-zero codeword in $C^{\perp}$.
Lemma 1 implies that if two matrices are parity-check matrices of the same linear code C, then their corresponding sets $V_C$ are identical, and so we can define the set $V_C$ to be
$$V_C = \{v \in \{0,1\}^n \mid v \text{ does not cover any non-zero } c \in C^{\perp}\}.$$
The next theorem presents the preferred embodiment two-write WOM-codes.
Theorem 1.
Let C[n,k] be a linear code with parity-check matrix $H$ and let $V_C$ be the set defined in (1). Then there exists an $[n,2;\,|V_C|,\,2^{n-k}]$ two-write WOM-code of sum-rate
$$\frac{\log_2|V_C| + (n-k)}{n}.$$
The theorem is proven by showing the existence of the encoding and decoding maps on the first and second writes. First, let $\{v_1, v_2, \ldots, v_{|V_C|}\}$ be an ordering of the set $V_C$. The first and the second writes are implemented as follows.
1) On the first write, a symbol over an alphabet of size $|V_C|$ is written. The encoding and decoding maps $E_1$, $D_1$ are defined as follows. For each $m \in \{1, \ldots, |V_C|\}$, $E_1(m) = v_m$ and $D_1(v_m) = m$.
2) On the second write, we write a vector $s_2$ of $n-k$ bits. Let $v_1$ be the programmed vector on the first write and $s_1 = H \cdot v_1$. Then
$$E_2(s_2, v_1) = v_1 + v_2,$$
where $v_2$ is a solution of the equation $H_{v_1} \cdot v_2 = s_1 + s_2$ that is zero on supp($v_1$), so that $H \cdot v_2 = H_{v_1} \cdot v_2$ and no programmed cell is changed. For the decoding map $D_2$, if $c$ is the vector of programmed cells, then the decoded value of the $n-k$ bits is given by $D_2(c) = H \cdot c = H \cdot v_1 + H \cdot v_2 = s_1 + s_1 + s_2 = s_2$.
The success of the second write results from the condition that for every vector $v \in V_C$, $\operatorname{rank}(H_v) = n-k$.
There is no condition on the code C and therefore we can use any linear code in this construction, though we seek to find codes that maximize the sum-rate $(\log_2|V_C| + (n-k))/n$.
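The construction of Theorem 1 can be sketched for a small code as follows. This is a minimal brute-force illustration, not an efficient implementation: columns of the parity-check matrix are stored as integer bitmasks, and the helper names (`gf2_rank`, `masked`, `solve`, `first_write_set`, `second_write`) and the 2×3 example matrix are assumptions chosen for the sketch.

```python
from itertools import product
from math import log2


def gf2_rank(cols):
    """Rank over GF(2) of vectors given as integer bitmasks."""
    basis = []
    for c in cols:
        for b in basis:
            c = min(c, c ^ b)  # clears the leading bit of b from c if it is set
        if c:
            basis.append(c)
    return len(basis)


def masked(h_cols, v):
    """Columns of H_v: column i of H is replaced by zero wherever v_i = 1."""
    return [0 if vi else h for h, vi in zip(h_cols, v)]


def syndrome(h_cols, c):
    """H * c over GF(2), as an integer bitmask."""
    s = 0
    for h, ci in zip(h_cols, c):
        if ci:
            s ^= h
    return s


def first_write_set(h_cols, r):
    """All v with rank(H_v) = r = n - k (the set V_C), by exhaustive search."""
    n = len(h_cols)
    return [v for v in product((0, 1), repeat=n) if gf2_rank(masked(h_cols, v)) == r]


def solve(cols, target):
    """Bitmask of column indices whose XOR equals target, or None if unsolvable."""
    basis = []  # pairs (reduced vector, bitmask of original columns used)
    for j, c in enumerate(cols):
        used = 1 << j
        for bv, bu in basis:
            if c ^ bv < c:
                c, used = c ^ bv, used ^ bu
        if c:
            basis.append((c, used))
    used = 0
    for bv, bu in basis:
        if target ^ bv < target:
            target, used = target ^ bv, used ^ bu
    return used if target == 0 else None


def second_write(h_cols, v1, s2):
    """E_2(s2, v1): new state c >= v1 with H * c = s2."""
    s1 = syndrome(h_cols, v1)
    used = solve(masked(h_cols, v1), s1 ^ s2)  # zero columns are never selected
    if used is None:
        raise ValueError("v1 is not in V_C")
    return tuple(b | ((used >> i) & 1) for i, b in enumerate(v1))


# Example (assumed for illustration): a 2 x 3 parity-check matrix with columns 01, 10, 11.
H_COLS = [0b01, 0b10, 0b11]
R = 2  # n - k
V_C = first_write_set(H_COLS, R)
SUM_RATE = (log2(len(V_C)) + R) / len(H_COLS)
```

Decoding the second write is just `syndrome(H_COLS, c)`. With this small matrix the first write has four admissible vectors (all vectors of weight at most one), so the sketch reproduces a two-write code on three cells.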
Next, we show two examples of two-write WOM-codes that achieve better sum-rates than the previously best known ones.
This example demonstrates how Theorem 1 works for the [16,5,8] first-order Reed-Muller code and demonstrates a rate of 1.4566. Its dual code is the [16,11,4] second-order Reed-Muller code, which is the extended Hamming code of length 16. Hence, we are interested in the size of the set
$$V_1 = \{v \in \{0,1\}^{16} \mid v \text{ does not cover any non-zero } c \in [16,11,4]\}.$$
Because a vector in $V_C$ has weight at most $k$, the set $V_1$ does not contain vectors of weight greater than five. This extended Hamming code has 140 codewords of weight four and no codewords of weight five. The set $V_1$ consists of the following vector sets.
1) All vectors of weight at most three. There are $\sum_{i=0}^{3}\binom{16}{i} = 697$ such vectors.
2) All vectors of weight four that are not codewords. There are $\binom{16}{4} - 140 = 1680$ such vectors.
3) All vectors of weight five that do not cover a codeword of weight four. There are $\binom{16}{5} - 12 \cdot 140 = 2688$ such vectors. Each codeword of weight four is covered by 12 vectors of weight five and, since the minimum distance of the code is four, a vector of weight five can cover at most one codeword of weight four.
Therefore, we get $|V_1| = 697 + 1680 + 2688 = 5065$ and the sum-rate is
$$\frac{\log_2(5065) + 11}{16} = 1.4566.$$
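The count of 5065 can be checked by brute force. The sketch below builds one possible parity-check matrix for the [16,5,8] code, namely the evaluations of all monomials of degree at most two in four binary variables (an assumed but standard generator of the second-order Reed-Muller code), and counts the vectors of weight at most five whose remaining columns still have rank 11. The function names are illustrative.

```python
from itertools import combinations
from math import log2


def gf2_rank(vectors):
    """Rank over GF(2) of vectors given as integer bitmasks."""
    basis = []
    for c in vectors:
        for b in basis:
            c = min(c, c ^ b)
        if c:
            basis.append(c)
    return len(basis)


def rm2_4_columns():
    """Columns of an 11 x 16 matrix whose rows generate the [16,11,4] code."""
    cols = []
    for x in range(16):
        bits = [(x >> j) & 1 for j in range(4)]
        # Monomials 1, x1..x4, and all products xa*xb with a < b (11 in total).
        evals = [1] + bits + [bits[a] & bits[b] for a, b in combinations(range(4), 2)]
        cols.append(sum(e << r for r, e in enumerate(evals)))
    return cols


def count_first_write_vectors(cols, r, max_weight):
    """Number of v with wt(v) <= max_weight and rank(H_v) = r."""
    n = len(cols)
    count = 0
    for w in range(max_weight + 1):
        for support in combinations(range(n), w):
            zeroed = set(support)
            remaining = [c for i, c in enumerate(cols) if i not in zeroed]
            if gf2_rank(remaining) == r:
                count += 1
    return count


SIZE_V1 = count_first_write_vectors(rm2_4_columns(), r=11, max_weight=5)
SUM_RATE = (log2(SIZE_V1) + 11) / 16
```

Only supports of size at most five are enumerated, since a vector of larger weight leaves fewer than 11 non-zero columns.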
It is possible to modify this WOM-code such that on the first write only 11 bits are written. Thus, we achieve a two-write fixed-rate WOM-code and its sum-rate is 22/16=1.375, which is the best known fixed-rate WOM-code.
In this example we will use the [23,11,8] Golay code. Its dual code is the [23,12,7] Golay code, so we are interested in the size of the set $V_2 = \{v \in \{0,1\}^{23} \mid v \text{ does not cover any non-zero } c \in [23,12,7]\}$.
Because a vector in $V_C$ has weight at most $k$, there are no vectors of weight greater than 11 in the set $V_2$. The resulting WOM-code achieves a sum-rate of 1.4632. The [23,12,7] Golay code has $A_7 = 253$ codewords of weight seven, $A_8 = 506$ codewords of weight eight, and $A_{11} = 1288$ codewords of weight 11. The set $V_2$ consists of the following vector sets.
1) All vectors of weight at most 6. This number of vectors is $\sum_{i=0}^{6}\binom{23}{i} = 145499$.
2) All vectors of weight between 7 and 10 besides those that cover a codeword of weight 7 or 8. Since the minimum distance of the code is 7, every such vector can cover at most one codeword. Hence, this number of vectors is
$$\sum_{i=7}^{10}\left[\binom{23}{i} - A_7\binom{16}{i-7} - A_8\binom{15}{i-8}\right] = 2459160.$$
3) All vectors of weight 11 that are not codewords and do not cover a codeword of weight either 7 or 8. This number was shown in [6] to be 695520.
Therefore, for the [23,11,8] Golay code we get $|V_2| = 145499 + 2459160 + 695520 = 3300179$, and thus the sum-rate is
$$\frac{\log_2(3300179) + 12}{23} = 1.4632.$$
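The arithmetic of this example can be reproduced with a short calculation. The sketch assumes the counting argument that a vector of weight i covering a given codeword of weight w is obtained by choosing the remaining i − w positions outside that codeword; the weight-11 count is taken from Reference [6] as quoted and is not recomputed. Variable names are illustrative.

```python
from math import comb, log2

A7, A8 = 253, 506  # numbers of Golay codewords of weight 7 and 8

# Vectors of weight at most 6.
low = sum(comb(23, i) for i in range(7))

# Vectors of weight 7..10 that cover no codeword of weight 7 or 8.
mid = sum(
    comb(23, i)
    - A7 * comb(16, i - 7)
    - (A8 * comb(15, i - 8) if i >= 8 else 0)
    for i in range(7, 11)
)

top = 695520  # weight-11 count quoted from Reference [6]; not recomputed here

SIZE_V2 = low + mid + top
SUM_RATE = (log2(SIZE_V2) + 12) / 23
```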
B. Random Coding
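The preferred codes and coding methods consistent with Theorem 1 provide a two-write WOM-code for every linear code C[n,k]. For such a code we define
$$R_1(C) = \frac{\log_2|V_C|}{n}, \qquad R_2(C) = \frac{n-k}{n},$$
so the sum-rate of the generated WOM-code is $R_1(C) + R_2(C)$.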
Our goal in this subsection is to show that it is possible to achieve the capacity region $C_2$ of two-write WOM-codes by choosing uniformly at random the parity-check matrix of the linear code C. This is proven in the following theorem.
Theorem 2.
For any $(R_1, R_2) \in C_2$ and $\epsilon > 0$ there exists a linear code C satisfying $R_1(C) \ge R_1 - \epsilon$ and $R_2(C) \ge R_2 - \epsilon$.
Proof: Let $p \in [0, 0.5]$ be such that $R_1 \le h(p)$ and $R_2 \le 1 - p$. Let $k = \lceil np \rceil$ for n large enough and let us choose uniformly at random an $(n-k) \times n$ matrix $H$. The matrix $H$ will be the parity-check matrix of the linear code C that will be used to construct the two-write WOM-code. For each vector $v \in \{0,1\}^n$, let us define the indicator random variable $X_v(H)$ on the space of all matrices as follows:
$$X_v(H) = \begin{cases} 1, & v \in V_C, \\ 0, & \text{otherwise}, \end{cases}$$
where $V_C$ is the set defined in Equation (1). Note that choosing the matrix $H$ uniformly at random induces a probability distribution on the set $V_C$ and thus a probability distribution on the random variable $X_v(H)$. Then the number of vectors in $V_C$ is $X(H) = \sum_{v \in \{0,1\}^n} X_v(H)$, and
$$E[X(H)] = \sum_{v \in \{0,1\}^n} E[X_v(H)] = \sum_{v \in \{0,1\}^n} \Pr\{X_v(H) = 1\}. \qquad (2)$$
We maintain that $\Pr\{X_v(H) = 1\}$ depends on $v$ only through its weight, wt($v$). In this case, (2) simplifies to
$$E[X(H)] = \sum_{i=0}^{k} \binom{n}{i} \Pr\{X_v(H) = 1 \mid \operatorname{wt}(v) = i\},$$
because if wt($v$) $\ge k+1$ then $X_v(H) = 0$ (Equation (1)).
Now, let us determine the value of $\Pr\{X_v(H) = 1\}$ for a vector $v$ of weight $0 \le i \le k$. Note that $v \in V_C$ if and only if the sub-matrix of size $(n-k) \times (n - \operatorname{wt}(v))$ induced by the zero entries of the vector $v$ is full rank. It is well known that if we choose an $m \times n$ matrix, where $m \le n$, uniformly at random, then the probability that it is full rank is $\prod_{j=n-m+1}^{n}(1 - 2^{-j})$. Therefore, if we choose an $(n-k) \times (n-i)$ matrix uniformly at random, then the probability that it is full rank is $\prod_{j=k-i+1}^{n-i}(1 - 2^{-j})$. Note that
$$\prod_{j=k-i+1}^{n-i}(1 - 2^{-j}) \ge \prod_{j=1}^{\infty}(1 - 2^{-j}) \approx 0.2888 > \tfrac{1}{4},$$
and hence $\Pr\{X_v(H) = 1\} = \prod_{j=k-i+1}^{n-i}(1 - 2^{-j}) > \tfrac{1}{4}$. According to Lemma 4.8 in Reference [24],
and, therefore, we get
It follows that there exists a parity-check matrix of a linear code C, such that the size of the set Vc is at least
and
for n large enough.
Random coding was proved to be capacity-achieving by constructing a partition code (References [9], [14]). However, the present random coding method has more structure, which makes it possible to search for WOM-codes with a relatively small block length. We ran a computer search to look for such WOM-codes. The parity-check matrix of the linear code C was chosen uniformly at random and then the size of the set $V_C$ was computed. The results are shown in
We ran a computer search to find more two-write WOM-codes with high sum-rates. For fixed-rate WOM-codes, our best construction achieved by a computer search has sum-rate 1.4546, and for unrestricted-rate WOM-codes our best computer search construction achieved sum-rate 1.4928. The number of cells in these two constructions is 33.
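A search of this kind can be sketched as follows for a small block length. This is only an illustration under stated assumptions: the parameters (n = 10, k = 3, 20 trials), the seed, and all function names are hypothetical, and the exhaustive enumeration of supports is practical only for short codes, not for the 33-cell constructions mentioned in the text.

```python
import random
from itertools import combinations
from math import log2


def gf2_rank(vectors):
    """Rank over GF(2) of vectors given as integer bitmasks."""
    basis = []
    for c in vectors:
        for b in basis:
            c = min(c, c ^ b)
        if c:
            basis.append(c)
    return len(basis)


def size_of_vc(cols, k):
    """|V_C| for a parity-check matrix given by its n columns of n - k bits."""
    n = len(cols)
    r = n - k
    size = 0
    for w in range(k + 1):  # vectors in V_C have weight at most k
        for support in combinations(range(n), w):
            zeroed = set(support)
            if gf2_rank([c for i, c in enumerate(cols) if i not in zeroed]) == r:
                size += 1
    return size


def random_search(n, k, trials, seed=0):
    """Draw parity-check matrices uniformly at random; keep the largest |V_C|."""
    rng = random.Random(seed)
    best_size, best_cols = 0, None
    for _ in range(trials):
        cols = [rng.getrandbits(n - k) for _ in range(n)]
        size = size_of_vc(cols, k)  # zero if the drawn matrix is not full rank
        if size > best_size:
            best_size, best_cols = size, cols
    return best_size, best_cols


def sum_rate(size, n, k):
    return (log2(size) + (n - k)) / n
```

A call such as `random_search(10, 3, 20)` returns the best set size found and the corresponding columns, from which `sum_rate` gives the sum-rate of the resulting two-write code.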
The encoding and decoding maps of the second write are implemented by the parity-check matrix of the linear code C as described in the proof of Theorem 1. A naive scheme to implement the encoding and decoding maps of the first write is simply by a lookup table of the set Vc. However, this can be done more efficiently using algorithms to encode and decode constant weight binary codes. There are several works which efficiently encode and decode all binary vectors of length n and weight k and can be used; see for example References [2], [7], [19], [25], [26]. These works can be easily extended to construct efficient encoder and decoder maps to the set of all binary vectors of length n and weight at most k, denoted by
$$B(n,k) = \{v \in \{0,1\}^n \mid |\operatorname{supp}(v)| \le k\}.$$
The set $V_C$ is a subset of the set $B(n,k)$. Therefore, we can use these algorithms while constructing a smaller table, only for the vectors in the set $B(n,k) \setminus V_C$, as follows. Assume that $f: \{1, \ldots, |B(n,k)|\} \to B(n,k)$ is a one-to-one and onto map such that $f$ and $f^{-1}$ can be calculated efficiently. Assume we list all the vectors in $B(n,k) \setminus V_C$ such that we list for every vector $v \in B(n,k) \setminus V_C$ its value $f^{-1}(v)$, and this list is sorted according to the values of $f^{-1}(v)$. Then, a mapping $g: \{1, \ldots, |V_C|\} \to V_C$ is constructed such that for all $x \in \{1, \ldots, |V_C|\}$, $g(x) = f(x + a(x))$, where $a(x)$ is the number of vectors in $B(n,k) \setminus V_C$ of value less than $x$. The time complexity to calculate $a(x)$ is $O(\log_2(|B(n,k) \setminus V_C|))$ since this list is sorted. Similarly, for all $v \in V_C$, $g^{-1}(v) = f^{-1}(v) - a(f^{-1}(v))$.
In many cases, the size of the set $B(n,k) \setminus V_C$ will be significantly smaller than the size of $V_C$. For example, for the Golay code [23,11,8] the size of $V_C$ is 3300179 while the size of $B(23,11) \setminus V_C$ is
$$\sum_{i=0}^{11}\binom{23}{i} - 3300179 = 4194304 - 3300179 = 894125.$$
Similarly, for the Reed-Muller code [16,5,8] the size of the set $V_C$ is 5065 while the size of the set $B(16,5) \setminus V_C$ is 1820.
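The index mapping through the sorted exclusion list can be sketched with binary search. The sketch works on indices only: it treats the map ƒ as given and computes which index of B(n,k) the x-th vector of V_C corresponds to. It interprets the shift a(x) as the number of excluded indices that precede the x-th surviving index, which is an assumption about how the shift is meant to be evaluated; `make_index_maps` and the toy exclusion list are hypothetical.

```python
from bisect import bisect_left, bisect_right


def make_index_maps(excluded):
    """excluded: sorted 1-based indices f^{-1}(v) of the vectors in B(n,k) \\ V_C."""
    # keys[j] = excluded[j] - j is non-decreasing; the x-th surviving index lies
    # beyond excluded[j] exactly when keys[j] <= x.
    keys = [e - j for j, e in enumerate(excluded)]

    def g_index(x):
        """Index in B(n,k) of the x-th vector of V_C (1-based)."""
        return x + bisect_right(keys, x)

    def g_inv_index(y):
        """Position within V_C of the surviving index y (1-based)."""
        return y - bisect_left(excluded, y)

    return g_index, g_inv_index


# Toy example (assumed): ten indices in total, three of them excluded.
g_index, g_inv_index = make_index_maps([2, 3, 7])
```

Composing `g_index` with ƒ gives g, and composing ƒ⁻¹ with `g_inv_index` gives g⁻¹; both lookups cost one binary search over the exclusion list.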
C. Application to the Blackwell Channel
The Blackwell channel, introduced first by Blackwell [1], is one example of a deterministic broadcast channel. The channel is composed of one transmitter and two receivers. The input to the transmitter is ternary and the channel output to each receiver is a binary symbol. Let $u$ be the ternary input vector to the transmitter of length n. For $1 \le i \le n$, $f(u_i) = (f(u_i)_1, f(u_i)_2)$ is a binary vector of length two defined as follows:
ƒ(0)=(0,0), ƒ(1)=(0,1), ƒ(2)=(1,0).
The binary vectors ƒ1(u), ƒ2(u) are defined to be
ƒ1(u)=(ƒ(u1)1, ƒ(u2)1, . . . , ƒ(un)1),
ƒ2(u)=(ƒ(u1)2, ƒ(u2)2, . . . , ƒ(un)2),
and are the output vectors to the two receivers.
The capacity region of the Blackwell channel was found by Gel'fand [11] and consists of five sub-regions, given by their boundaries:
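1) $\{(R_1,R_2) \mid 0 \le R_1 \le \tfrac{1}{2},\ R_2 = 1\}$,
2) $\{(R_1,R_2) \mid R_1 = 1-p,\ R_2 = h(p),\ \tfrac{1}{3} \le p \le \tfrac{1}{2}\}$,
3) $\{(R_1,R_2) \mid R_1 + R_2 = \log_2 3,\ \tfrac{2}{3} \le R_1 \le \log_2 3 - \tfrac{2}{3}\}$,
4) $\{(R_1,R_2) \mid R_1 = h(p),\ R_2 = 1-p,\ \tfrac{1}{3} \le p \le \tfrac{1}{2}\}$,
5) $\{(R_1,R_2) \mid R_1 = 1,\ 0 \le R_2 \le \tfrac{1}{2}\}$.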
The connection between the Blackwell channel and two-write WOM-codes was identified by Roth [23]. The next theorem shows that from every two-write WOM-code of rate (R1, R2) it is possible to construct codes for the Blackwell channel of rates (R1,R2) and (R2,R1).
Theorem 3.
If (R1,R2) is an achievable rate of a two-write WOM-code, then (R1, R2) and (R2, R1) are achievable rates on the Blackwell channel. Proof Assume that there exists a [n, 2; 2nR
Similarly, it is possible to achieve the rate (R2, R1). Now we let v2=E2(m2) and v1=E1 (m1, v2). The vector u is defined as ui=ƒ−1(
It is possible to define the Blackwell channel differently such that the forbidden pair of bits is not (1, 1) but another combination. Our construction of the codes can be adjusted accordingly.
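The link between a two-write WOM-code and the Blackwell channel can be illustrated with a small sketch. It assumes one particular way of forming the ternary input: the i-th symbol is the preimage under ƒ of the pair (first-write bit, complement of the second-write bit). Because the second-write state never clears a programmed cell, this pair is never (1,1). This pairing is an assumption made for the sketch, and the function names are hypothetical.

```python
# Channel map f from the text: ternary input symbol -> pair of receiver outputs.
F = {0: (0, 0), 1: (0, 1), 2: (1, 0)}
F_INV = {pair: u for u, pair in F.items()}


def to_channel_input(v1, v2):
    """v1: cell state after the first write; v2: state after the second write.

    Requires v1 <= v2 cell-wise, as guaranteed by a WOM-code.
    Assumed pairing: (v1_i, complement of v2_i), which is never (1, 1).
    """
    return [F_INV[(a, 1 - b)] for a, b in zip(v1, v2)]


def receiver_outputs(u):
    """Binary vectors observed by receiver 1 and receiver 2."""
    y1 = [F[s][0] for s in u]
    y2 = [F[s][1] for s in u]
    return y1, y2


def recover_states(y1, y2):
    """Receiver 1 reads v1 directly; receiver 2 complements its output to get v2."""
    return list(y1), [1 - b for b in y2]
```

Each receiver then applies the decoding map of the corresponding write to its recovered state; swapping the roles of the two states gives the rate pair in the other order.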
Now, we can use our two-write WOM-codes in order to define codes for the Blackwell channel. By using time sharing, the achievable region is convex and hence we get in
The invention also provides WOM-code constructions which reduce the gaps between the upper and lower bounds on the sum-rates of WOM-codes for 3≦t≦10. First, we generalize the two-write WOM-code construction from above for non-binary cells. Then, we show how to use these non-binary two-write WOM-codes in order to construct binary multiple-write WOM-codes. We start with specific constructions for three- and four-write WOM-codes, and then show a general design approach that works for an arbitrary number of writes.
A. Non-Binary Two-Write WOM-Codes
Suppose now that each cell has q levels, where q is a prime number or a power of a prime number. We start by choosing a linear code C[n,k] over GF(q) with a parity-check matrix $H$ of size $(n-k) \times n$. For a vector $v$ of length n over GF(q), let $H_v$ be the matrix $H$ with zero columns replacing the columns that correspond to the positions of the non-zero values in $v$. Then we define
$$V_C^{(q)} = \{v \in (GF(q))^n \mid \operatorname{rank}(H_v) = n-k\}. \qquad (3)$$
Next, we construct a non-binary two-write WOM-code $[n,2;\,|V_C^{(q)}|,\,q^{n-k}]$ in a similar manner to the binary construction above. Since the proof of the next theorem is very similar to the proof of Theorem 1, we omit it. A complete proof can be found in [18].
Theorem 4.
Let C[n,k] be a linear code with parity-check matrix $H$ over GF(q) and let $V_C^{(q)}$ be the set defined in (3). Then there exists a q-ary $[n,2;\,|V_C^{(q)}|,\,q^{n-k}]$ two-write WOM-code of sum-rate
$$\frac{\log_2|V_C^{(q)}| + (n-k)\log_2 q}{n}.$$
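As was shown in the binary case, there is no restriction on the choice of the linear code C or the parity-check matrix $H$. Every such code/matrix generates a WOM-code. For a linear code C we define
$$R_1(C) = \frac{\log_2|V_C^{(q)}|}{n}, \qquad R_2(C) = \frac{(n-k)\log_2 q}{n},$$
so the sum-rate of the generated WOM-code is $R_1(C) + R_2(C)$. The capacity region of the achievable rates by this construction is
$$C_2^{(q)} = \{(R_1,R_2) \mid \exists\, p \in [0, 1 - 1/q],\ R_1 \le h(p) + p\log_2(q-1),\ R_2 \le (1-p)\log_2 q\}.$$
Theorem 5.
For any $(R_1, R_2) \in C_2^{(q)}$ and $\epsilon > 0$, there exists a linear code C satisfying $R_1(C) \ge R_1 - \epsilon$ and $R_2(C) \ge R_2 - \epsilon$.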
The next corollary provides the best achievable sum-rate of the construction.
Corollary.
For any q-ary WOM-code generated using our construction, the highest achievable sum-rate is log2 (2q−1).
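Proof: First, note that a rate pair produced by the construction with a fraction p of the cells programmed on the first write satisfies
$$R_1 + R_2 \le h(p) + p\log_2(q-1) + (1-p)\log_2 q = p\log_2\frac{q-1}{p} + (1-p)\log_2\frac{q}{1-p},$$
and since the function $f(x) = \log_2 x$ is a concave function,
$$p\log_2\frac{q-1}{p} + (1-p)\log_2\frac{q}{1-p} \le \log_2\left(p \cdot \frac{q-1}{p} + (1-p) \cdot \frac{q}{1-p}\right) = \log_2(2q-1).$$
Also, for
$$p = \frac{q-1}{2q-1}$$
the achievable sum-rate is $\log_2(2q-1)$. Therefore, there exists a WOM-code produced by our construction with sum-rate $\log_2(2q-1)$.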
On the other hand, any WOM-code resulting from our construction satisfies the property that every cell is programmed at most once. This model was studied in Reference [9] and the maximum achievable sum-rate was proved to be log2 (2q−1). Therefore, our construction cannot produce a WOM-code with a sum-rate that exceeds log2(2q−1).
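The value log2(2q−1) can be checked numerically. The sketch assumes that, with a fraction p of the cells programmed on the first write, the construction has first-write rate h(p) + p·log2(q−1) and second-write rate (1−p)·log2 q; under that assumption the sum is maximized at p = (q−1)/(2q−1). The function names are illustrative.

```python
from math import log2


def h(p):
    """Binary entropy function."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * log2(p) - (1 - p) * log2(1 - p)


def sum_rate(p, q):
    """Assumed sum-rate of the q-ary construction at first-write fraction p."""
    return h(p) + p * log2(q - 1) + (1 - p) * log2(q)


def best_p(q):
    """Fraction of programmed cells that maximizes the assumed sum-rate."""
    return (q - 1) / (2 * q - 1)
```

For q = 3 this gives p = 2/5 and a sum-rate of log2 5, and for q = 4 it gives p = 3/7 and log2 7.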
This construction does not achieve high sum-rates for non-binary two-write WOM-codes in general. While the best achievable sum-rate of the construction is $\log_2(2q-1)$, the upper bound on the sum-rate is $\log_2\binom{q+1}{2}$. The decrease in the sum-rate in our construction results from the fact that cells cannot be programmed twice. That is, if a cell was programmed on the first write, it cannot be reprogrammed on the second write even if it did not reach its highest level. In fact, it is possible to find non-binary two-write WOM-codes with better sum-rates. However, the goal is not to find efficient non-binary WOM-codes. Rather, the non-binary codes that we have constructed can be used in the design of binary multiple-write WOM-codes.
For the construction of binary multiple-write, we use WOM-codes over GF(3). We ran a computer search to find such a ternary two-write WOM-code of sum-rate 2.2205, and we will use this WOM-code in order to construct specific multiple-write WOM-codes.
B. Three-Write WOM-Codes
We start with a construction for binary three-write WOM-codes. The construction uses the WOM-codes found in the previous subsection over GF(3).
Theorem 6.
Let $C_3$ be an $[n,2;\,2^{nR_1},\,2^{nR_2}]$ two-write WOM-code over GF(3) constructed as above in Section A. Then, there exists a $[2n,3;\,2^{nR_1},\,2^{nR_2},\,2^n]$ three-write WOM-code of sum-rate
$$\frac{R_1 + R_2 + 1}{2}.$$
Proof: We denote by $E_{3,1}$ and $E_{3,2}$ the encoding maps of the first and second writes, and by $D_{3,1}$ and $D_{3,2}$ the decoding maps of the first and second writes of the WOM-code $C_3$, respectively. The 2n cells of the three-write WOM-code we construct are divided into n two-cell blocks, so the memory-state vector is of the form $((c_{1,1}, c_{1,2}), (c_{2,1}, c_{2,2}), \ldots, (c_{n,1}, c_{n,2}))$. In this construction we also use a map $\varphi: GF(3) \to GF(2) \times GF(2)$ defined as follows:
φ(0)=(0,0),
φ(1)=(1,0),
φ(2)=(0,1),
The map φ extends naturally to ternary vectors v=(v1, . . . , vn)εGF(3)n using the rule
φ(v)=(φ(v1), . . . , φ(vn)),
On the pairs $(c,c')$ in the image of $\varphi$, we define $\varphi^{-1}(c,c')$ to be the inverse function. The map $\varphi^{-1}$ is extended similarly to work over vectors of such bit pairs. We are now ready to describe the encoding and decoding maps for a three-write WOM-code.
1) On the first write, a message m from the set $\{1, \ldots, 2^{nR_1}\}$ is written in the 2n cells:
$$E_1(m) = \varphi(E_{3,1}(m)).$$
The decoding map is defined similarly, where c is the memory-state vector:
$$D_1(c) = D_{3,1}(\varphi^{-1}(c)).$$
2) On the second write, a message m from the set $\{1, \ldots, 2^{nR_2}\}$ is written in the 2n cells as follows. Let c be the programmed vector on the first write. Then,
$$E_2(m,c) = \varphi(E_{3,2}(m, \varphi^{-1}(c))).$$
That is, first the memory-state vector c is converted to a ternary vector. Then, it is encoded using the encoding $E_{3,2}$ and the new message, producing a new ternary memory-state vector. Finally, the last vector is converted to a 2n-bit vector. The decoding map is defined as on the first write:
$$D_2(c) = D_{3,2}(\varphi^{-1}(c)).$$
According to the construction of the WOM-code C3, no ternary cell is programmed twice and therefore each of the n pairs of bits is programmed at most once.
3) On the third write, an n-bit vector v is written. Let c=((c1,1, c1,2), . . . , (cn,1, cn,2)) be the current memory-state vector. Then,
E3(v,c)=((c′1,1,c′1,2), . . . ,(c′n,1,c′n,2))
is a vector, defined as follows. For $1 \le i \le n$, $(c'_{i,1}, c'_{i,2}) = (1,1)$ if $v_i = 1$ and otherwise $(c'_{i,1}, c'_{i,2}) = (c_{i,1}, c_{i,2})$. It is always possible to program the pair of bits to be (1, 1) since at most one cell in each pair was previously programmed. The decoding map $D_3(c)$ is defined to be
$$D_3(c) = (c_{1,1} \cdot c_{1,2}, \ldots, c_{n,1} \cdot c_{n,2}).$$
That is, the decoded value of each pair of bits is one if and only if the value of both of them is one.
Corollary.
The best achievable sum-rate of a three-write WOM-code using this construction is (log2 5+1)/2≈1.66.
Proof: Given a two-write WOM-code $C_3$ over GF(3) with rates $(R_1, R_2)$, the constructed binary three-write WOM-code has rates $(R_1/2, R_2/2, 1/2)$ and its sum-rate is $R = (R_1 + R_2 + 1)/2$. This sum-rate is maximized when $R_1 + R_2$ is maximized. But $R_1 + R_2$ is the sum-rate of the two-write WOM-code over GF(3), which was proven in the corollary to Theorem 5 to be maximized at $\log_2 5$. Then the maximum achievable sum-rate of the constructed binary three-write WOM-code is $(\log_2 5 + 1)/2 \approx 1.66$.
Using the construction of WOM-codes over GF(3) presented above, we can construct a three-write WOM-code of sum-rate (2.2205+1)/2=1.6102.
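The three writes can be sketched as wrappers around a ternary two-write code. To keep the example self-contained, the ternary code used here is a toy code on n = 2 ternary cells (the first write stores a symbol in cell 0, the second in cell 1); it is an assumption for illustration only and is far from the rates discussed in the text. All function names are hypothetical.

```python
PHI = {0: (0, 0), 1: (1, 0), 2: (0, 1)}
PHI_INV = {pair: s for s, pair in PHI.items()}


def phi(v):
    """Map a ternary vector to a vector of bit pairs."""
    return [PHI[s] for s in v]


def phi_inv(c):
    """Inverse map; valid only while no pair equals (1, 1)."""
    return [PHI_INV[p] for p in c]


# Toy ternary two-write code on two cells (assumed); each cell is programmed at most once.
def e31(m):
    return [m, 0]


def d31(v):
    return v[0]


def e32(m, v):
    return [v[0], m]


def d32(v):
    return v[1]


def write1(m):
    return phi(e31(m))


def read1(c):
    return d31(phi_inv(c))


def write2(m, c):
    return phi(e32(m, phi_inv(c)))


def read2(c):
    return d32(phi_inv(c))


def write3(v, c):
    """Third write: set pair i to (1, 1) wherever v_i = 1."""
    return [(1, 1) if vi else pair for vi, pair in zip(v, c)]


def read3(c):
    """A pair decodes to one if and only if both of its bits are one."""
    return [a & b for a, b in c]
```

Any ternary two-write code in which no cell is programmed twice can be substituted for the four toy maps without changing the wrappers.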
C. Four-Write WOM-Codes
We next present a construction for four-write binary WOM-codes.
Theorem 7.
Let $C_3$ be an $[n,2;\,2^{nR_{3,1}},\,2^{nR_{3,2}}]$ two-write WOM-code over GF(3) constructed as above. Let $C_2$ be an $[n,2;\,2^{nR_{2,1}},\,2^{nR_{2,2}}]$ binary two-write WOM-code. Then, there exists a $[2n,4;\,2^{nR_{3,1}},\,2^{nR_{3,2}},\,2^{nR_{2,1}},\,2^{nR_{2,2}}]$ four-write WOM-code of sum-rate
$$\frac{R_{3,1} + R_{3,2} + R_{2,1} + R_{2,2}}{2}.$$
Proof: The proof is very similar to the one used for three-write WOM-codes. We denote by $E_{3,1}$, $E_{3,2}$ the encoding maps of the first and second writes, and by $D_{3,1}$, $D_{3,2}$ the decoding maps of the first and second writes of the WOM-code $C_3$, respectively. Similarly, the encoding and decoding maps of the WOM-code $C_2$ for the first and second writes are denoted by $E_{2,1}$, $E_{2,2}$ and $D_{2,1}$, $D_{2,2}$, respectively. Using the encoding and decoding maps of $C_3$, we define the first and second writes of our constructed four-write WOM-code as we did for the first and second writes of the three-write WOM-codes. The third and fourth writes are defined in a similar way, as follows.
1) On the third write, a message m from the set $\{1, \ldots, 2^{nR_{2,1}}\}$ is written. Let $v = E_{2,1}(m) = (v_1, \ldots, v_n)$ and let $c = ((c_{1,1}, c_{1,2}), \ldots, (c_{n,1}, c_{n,2}))$ be the current memory-state vector. Then,
$$E_3(m,c) = ((c'_{1,1}, c'_{1,2}), \ldots, (c'_{n,1}, c'_{n,2})),$$
where for $1 \le i \le n$, $(c'_{i,1}, c'_{i,2}) = (1,1)$ if $v_i = 1$ and, otherwise, $(c'_{i,1}, c'_{i,2}) = (c_{i,1}, c_{i,2})$. The decoding map $D_3(c)$ is defined to be
$$D_3(c) = D_{2,1}(c_{1,1} \cdot c_{1,2}, \ldots, c_{n,1} \cdot c_{n,2}).$$
2) On the fourth write, a message m from the set $\{1, \ldots, 2^{nR_{2,2}}\}$ is written. Let
$$E_{2,2}(m, (c_{1,1} \cdot c_{1,2}, \ldots, c_{n,1} \cdot c_{n,2})) = v = (v_1, \ldots, v_n),$$
where $c = ((c_{1,1}, c_{1,2}), \ldots, (c_{n,1}, c_{n,2}))$ is the current memory-state vector. Then,
$$E_4(m,c) = ((c'_{1,1}, c'_{1,2}), \ldots, (c'_{n,1}, c'_{n,2})),$$
where for $1 \le i \le n$, $(c'_{i,1}, c'_{i,2}) = (1,1)$ if $v_i = 1$ and, otherwise, $(c'_{i,1}, c'_{i,2}) = (c_{i,1}, c_{i,2})$. The decoding map $D_4(c)$ is defined, as before, by
$$D_4(c) = D_{2,2}(c_{1,1} \cdot c_{1,2}, \ldots, c_{n,1} \cdot c_{n,2}).$$
The last theorem requires both the binary two-write and ternary two-write WOM-codes to have the same number of cells, n. However, we can construct a four-write binary WOM-code using any two such WOM-codes, even if they do not have the same number of cells. Suppose we have a WOM-code over GF(3) with $n_1$ cells and a binary WOM-code with $n_2$ cells. Both codes can be extended to use $\operatorname{lcm}(n_1, n_2)$ cells. Then, the construction above will give a four-write WOM-code.
Corollary.
The best achievable sum-rate of a four-write WOM-code using this construction is (log25+log23)/2≈1.95.
Proof: The maximum value of $R_{3,1} + R_{3,2}$ is $\log_2 5$ and the maximum value of $R_{2,1} + R_{2,2}$ is $\log_2 3$. Therefore, the maximum sum-rate of the constructed four-write WOM-code is $(\log_2 5 + \log_2 3)/2$.
If we use the WOM-code over GF(3) of sum-rate 2.2205 found in the previous subsection as the WOM-code C3 and the binary two-write WOM-code of sum-rate 1.4928 found as the WOM-code C2, then there exists a four-write WOM-code of sum-rate (2.2205+1.4928)/2=1.8566.
D. Multiple-Write WOM-Codes
The construction of three- and four-write WOM-codes can be easily generalized to an arbitrary number of writes. We state the following theorem and skip its proof since it is very similar to the proofs of the corresponding theorems for three- and four-write WOM-codes.
Theorem 8.
Let $C_3$ be an $[n,2;\,2^{nR_{3,1}},\,2^{nR_{3,2}}]$ two-write WOM-code over GF(3) constructed as above. Let $C_2$ be an $[n,t-2;\,2^{nR_{2,1}}, \ldots, 2^{nR_{2,t-2}}]$ binary (t−2)-write WOM-code. Then, there exists a
$$[2n,t;\,2^{nR_{3,1}},\,2^{nR_{3,2}},\,2^{nR_{2,1}}, \ldots, 2^{nR_{2,t-2}}]$$
t-write WOM-code of sum-rate
$$\frac{R_{3,1} + R_{3,2} + \sum_{i=1}^{t-2} R_{2,i}}{2}.$$
Theorem 8 implies that if there exists a (t−2)-write WOM-code of sum-rate $R_{t-2}$ then there exists a t-write WOM-code of sum-rate
$$R_t = \frac{\log_2 5 + R_{t-2}}{2}.$$
The following corollary summarizes the possible achievable sum-rates of t-write WOM-codes.
Corollary.
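For $t \ge 3$, there exists a t-write WOM-code of sum-rate
$$R_t = \begin{cases} \dfrac{(2^{(t-1)/2} - 1)\log_2 5 + 1}{2^{(t-1)/2}}, & t \text{ odd}, \\[2ex] \dfrac{(2^{(t-2)/2} - 1)\log_2 5 + \log_2 3}{2^{(t-2)/2}}, & t \text{ even}. \end{cases}$$
If we use again the two-write WOM-code over GF(3) of sum-rate 2.2205 and the binary two-write WOM-code of sum-rate 1.4928 from Section IV, then for $t \ge 3$ we obtain a t-write WOM-code of sum-rate $R'_t$ where
$$R'_t = \begin{cases} \dfrac{(2^{(t-1)/2} - 1) \cdot 2.2205 + 1}{2^{(t-1)/2}}, & t \text{ odd}, \\[2ex] \dfrac{(2^{(t-2)/2} - 1) \cdot 2.2205 + 1.4928}{2^{(t-2)/2}}, & t \text{ even}. \end{cases}$$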
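The sum-rates of this family can be tabulated with a short recursion. The sketch assumes the recursion in which a t-write code is built from a ternary two-write code and a (t−2)-write binary code, so that the t-write sum-rate is the average of the ternary sum-rate and the (t−2)-write sum-rate, starting from a single write of rate 1 and a two-write code. The function and parameter names are hypothetical.

```python
from math import log2


def family_sum_rate(t, ternary=log2(5), one_write=1.0, two_write=log2(3)):
    """Sum-rate of the t-write code obtained by repeated concatenation."""
    if t == 1:
        return one_write
    if t == 2:
        return two_write
    return (ternary + family_sum_rate(t - 2, ternary, one_write, two_write)) / 2


def closed_form(t, ternary=log2(5), one_write=1.0, two_write=log2(3)):
    """Unrolled version of the same recursion."""
    base, j = (one_write, (t - 1) // 2) if t % 2 else (two_write, (t - 2) // 2)
    return ((2**j - 1) * ternary + base) / 2**j


# Best achievable values and values for the codes found by computer search.
best = {t: family_sum_rate(t) for t in range(3, 11)}
found = {t: family_sum_rate(t, ternary=2.2205, two_write=1.4928) for t in range(3, 11)}
```

With the default arguments the first two entries are about 1.66 and 1.95, and with the searched codes they are about 1.61 and 1.86, in line with the three- and four-write figures given earlier.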
The construction presented in the previous section provides us with a family of WOM-codes for all t≧3. In this section, we will show a general scheme to construct more families of WOM-codes. In fact, the construction in the previous section is a special case of this general scheme.
Theorem 9.
Let C* be an $[m, t/2;\,q_1, \ldots, q_{t/2}]$ binary t/2-write WOM-code where t is an even integer. For $1 \le i \le t/2$, let $C_i$ be an $[n,2;\,2^{nR_{i,1}},\,2^{nR_{i,2}}]$ two-write WOM-code over GF($q_i$), as constructed above. Then, there exists an $[nm, t;\,2^{nR_{1,1}},\,2^{nR_{1,2}}, \ldots, 2^{nR_{t/2,1}},\,2^{nR_{t/2,2}}]$ binary t-write WOM-code of sum-rate
$$\frac{1}{m}\sum_{i=1}^{t/2}(R_{i,1} + R_{i,2}).$$
Proof: For $1 \le i \le t/2$, let $E_i^*$, $D_i^*$ be the encoding and decoding maps on the i-th write of the WOM-code C*, respectively. The definition of $E_i^*$, $D_i^*$ for $1 \le i \le t/2$ extends naturally to vectors by simply invoking the maps on each entry in the vector. Similarly, for $1 \le i \le t/2$, let us denote by $E_{i,1}$ and $E_{i,2}$ the encoding maps of the first and second writes, and by $D_{i,1}$ and $D_{i,2}$ the decoding maps of the first and second writes of the WOM-code $C_i$, respectively. We will present the specification of the encoding and decoding maps of the constructed t-write WOM-code.
In the following definitions of the encoding and decoding maps, we consider the memory-state vector c to have n symbols of m bits each, i.e. $c \in (GF(2)^m)^n$. For $1 \le i \le t/2$, the (2i−1)-st write and 2i-th write are implemented as follows.
On the (2i−1)-st write, a message $m_1 \in \{1, \ldots, 2^{nR_{i,1}}\}$ is written according to
$$E_{2i-1}(m_1) = E_i^*(E_{i,1}(m_1), c).$$
The memory-state vector c is decoded according to
$$D_{2i-1}(c) = D_{i,1}(D_i^*(c)).$$
On the 2i-th write, a message $m_2 \in \{1, \ldots, 2^{nR_{i,2}}\}$ is written according to
$$E_{2i}(m_2) = E_i^*(E_{i,2}(m_2, D_i^*(c)), c),$$
and the memory-state vector c is decoded according to
$$D_{2i}(c) = D_{i,2}(D_i^*(c)).$$
We will demonstrate how this construction works in the following example.
We choose a [3,3;4,3,2] three-write WOM-code as the code C*. This code is depicted in
For the code $C_1$ over GF(4), we ran a computer search to find a two-write WOM-code over GF(4) of sum-rate 2.6862. For the code $C_2$ over GF(3), we use the code with sum-rate 2.2205 which we found above, and we use the binary two-write WOM-code of sum-rate 1.4928 for the code $C_3$. Then, the sum-rate of the six-write WOM-code is
$$\frac{2.6862 + 2.2205 + 1.4928}{3} \approx 2.133.$$
It is possible to construct a five-write WOM-code by writing a vector of n bits in the last write, so its sum-rate is
$$\frac{2.6862 + 2.2205 + 1}{3} \approx 1.969.$$
Note that if one of the codes in the general construction is binary then we can actually use a WOM-code that allows more than two writes. That is, in this construction we can use any binary multiple-write WOM-code as the WOM-code $C_3$. Therefore, we can generate another family of WOM-codes for $t \ge 5$. Their maximum achievable sum-rates are given by the following formula
$$R_t = \frac{\log_2 7 + \log_2 5 + R_{t-4}}{3}$$
for $t \ge 5$, where $R_{t-4}$ is the maximum achievable sum-rate for a (t−4)-write WOM-code. Similarly, the constructed codes which we obtain using the WOM-codes found above have sum-rates
$$R'_t = \frac{2.6862 + 2.2205 + R'_{t-4}}{3},$$
where $R'_{t-4}$ is the best sum-rate of a constructed (t−4)-write WOM-code. Table IV summarizes these sum-rates.
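The sum-rate of a generalized concatenated code can be computed as the sum of the inner codes' sum-rates divided by the number of cells m of the code C*. The sketch below applies this to the example with m = 3. It assumes that the five-write variant replaces the binary two-write inner code by a single write of rate 1; the function name is hypothetical.

```python
def concatenated_sum_rate(inner_sum_rates, m):
    """Sum-rate of the concatenated code: inner sum-rates averaged over m cells."""
    return sum(inner_sum_rates) / m


# Inner codes over GF(4), GF(3), and GF(2) with the sum-rates quoted in the text.
six_write = concatenated_sum_rate([2.6862, 2.2205, 1.4928], m=3)

# Assumed five-write variant: the binary inner code is a single write of n bits.
five_write = concatenated_sum_rate([2.6862, 2.2205, 1.0], m=3)
```

Because the total is divided by m, a code C* with few cells is preferable when the inner sum-rates are fixed.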
Note that the construction in the previous section is a special case of the generalized concatenated WOM-codes construction in which the WOM-code C* is chosen to be a [2,2; 3, 2] binary two-write WOM-code.
The general method described in Theorem 9 provides us with many more families of WOM-codes. However, in order to construct WOM-codes with high sum-rates, the WOM-code C* has to be chosen very carefully. In particular, it is important to choose such a WOM-code with as few cells as possible, since the sum of all sum-rates of the non-binary two-write WOM-codes is averaged over the number of cells of the WOM-code C*. As the number of short WOM-codes is small, there are only a small number of possibilities to check. However, our search for better WOM-codes with between six and ten writes using WOM-codes with few cells did not lead to any better results.
The WOM-code construction for more than two writes improved the achieved sum-rates only in the unrestricted-rate case. In this section, we present a method to construct fixed-rate WOM-codes. The method is recursive and is based on the previously constructed unrestricted-rate WOM-codes.
Theorem 10.
Let C be an [n,t;2nR
Proof: For simplicity, let us assume that $R_1 \ge \dots \ge R_t$, as it will be clear from the proof how to generalize to the arbitrary case. Here $R_j$ denotes the rate of C on the j-th write and $\mathcal{R}_i$ denotes the sum-rate of the fixed-rate i-write WOM-code. First, we add $(R_{t-1} - R_t)n$ more cells in order to write $(R_{t-1} - R_t)n$ more bits on the last write. This guarantees that the rates on the last two writes are the same. Then, we add $2(R_{t-2} - R_{t-1})n/\mathcal{R}_2$ more cells in order to write $(R_{t-2} - R_{t-1})n$ more bits on each of the last two writes. This part of the last two writes is invoked using the fixed-rate two-write WOM-code of sum-rate $\mathcal{R}_2$, and therefore the additional number of cells is $2(R_{t-2} - R_{t-1})n/\mathcal{R}_2$. This addition of cells guarantees that the rates on the last three writes are all the same. In general, for $1 \le i \le t-1$ we add $i(R_{t-i} - R_{t-i+1})n/\mathcal{R}_i$ more cells such that $(R_{t-i} - R_{t-i+1})n$ more bits are written on each of the last i writes, and therefore the rates on the last i+1 writes are all the same. These bits are written using the fixed-rate i-write WOM-code, which is assumed to exist (for i = 1 this is a plain write of one bit per cell, so $\mathcal{R}_1 = 1$).
With the addition of these cells, the number of bits written on the i-th write for 1≦i≦t is
Thus, the rates on all writes are the same and the generated WOM-code is fixed-rate.
The total number of cells we add is
and thus the sum-rate is
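The displayed expressions of this proof are not reproduced in the text. Under the simplifying assumption of non-increasing rates they can be reconstructed from the construction itself; what follows is such a reconstruction, not the original typesetting. Here $R_j$ denotes the rate of C on the j-th write and $\mathcal{R}_i$ the sum-rate of the fixed-rate i-write WOM-code used in step i, with $\mathcal{R}_1 = 1$ assumed for a plain write of raw bits.

```latex
\begin{align*}
% Bits on the i-th write: the original R_i n plus the bits added by every
% step that touches write i; the sum telescopes.
\text{bits on write } i
  &= R_i n + \sum_{j=1}^{i-1} (R_j - R_{j+1})\, n = R_1 n,
  \qquad 1 \le i \le t, \\
% Cells added over all t-1 steps.
\text{cells added}
  &= \sum_{i=1}^{t-1} \frac{i\,(R_{t-i} - R_{t-i+1})\, n}{\mathcal{R}_i}, \\
% Total bits t R_1 n divided by the total number of cells.
\text{sum-rate}
  &= \frac{t R_1 n}{n + \sum_{i=1}^{t-1} \frac{i\,(R_{t-i} - R_{t-i+1})\, n}{\mathcal{R}_i}}
   = \frac{t R_1}{1 + \sum_{i=1}^{t-1} \frac{i\,(R_{t-i} - R_{t-i+1})}{\mathcal{R}_i}}.
\end{align*}
```

The first line is what makes the code fixed-rate: every write ends up storing $R_1 n$ bits, so the sum-rate is simply $t R_1 n$ divided by the enlarged number of cells.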
Let us demonstrate how to apply the last theorem. We start with the three-write WOM-code. Its rates on the first, second, and third writes are 0.6291, 0.4811, 0.5, respectively. We add 0.0189n more cells in order to guarantee that the rates on the last two writes are the same. Then we use the fixed-rate two-write WOM-code of sum-rate 1.4546. Hence we add
more cells, yielding a fixed-rate three-write WOM-code of sum-rate
If we used the best fixed-rate two-write WOM-code of sum-rate 1.546 and the best three-write WOM-code of sum-rate 1.66, then we get a fixed-rate three-write WOM-code of sum-rate 1.6263.
Note that we could use a two-write WOM-code such that 0.0189n bits are written on its first write and 0.1291n bits are written on its second write. This would indeed add another small improvement to the sum-rate; however, this scheme is not easy to generalize. Our goal here is to give a general method. We are aware that for each individual case it is possible to use other unrestricted-rate WOM-codes that will provide a WOM-code of the desired sum-rate with slightly fewer cells.
Now we move to consider the four-write WOM-code. Its component rates are 0.6291, 0.4811, 0.413, 1/3. We add three more groups of cells as follows:
1) (0.413−1/3)n=0.0797n more cells, so that the last two writes have the same rate.
2) 2·(0.4811−0.413)n/1.4546=0.0936n more cells, so that the last three writes have the same rate.
3) 3·(0.6291−0.4811)n/1.5731=0.2822n more cells, so that the last four writes have the same rate.
Then, we get a fixed-rate four-write WOM-code with sum-rate
If we used the best fixed-rate two- and three-write WOM-codes and the best unrestricted-rate four-write WOM-code, then we would obtain a fixed-rate four-write WOM-code of sum-rate 1.8249. Fixed-rate t-write WOM-codes for t>4 can be obtained similarly. We summarize the sum-rates that we actually found, and the best ones we could find with this method, in Table 2.
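The bookkeeping in these examples can be sketched in a few lines of Python. This is an illustrative reading of the recursive construction, not the authors' implementation: the function name `fixed_rate_construction` and its arguments are hypothetical, the write rates are assumed to be sorted in non-increasing order (the simplifying assumption of the proof), a plain single write is assumed to have rate 1, and the final sum-rate is taken to be the total number of bits divided by the enlarged number of cells.

```python
def fixed_rate_construction(write_rates, fixed_sum_rates):
    """Sketch of the recursive fixed-rate construction described above.

    write_rates: per-write rates of the unrestricted-rate t-write code,
        assumed sorted in non-increasing order.
    fixed_sum_rates: dict mapping i to the sum-rate of an available
        fixed-rate i-write WOM-code, for 2 <= i <= t - 1. A plain single
        write (i = 1) is taken to have rate 1 (assumption).

    Returns (added_cells, sum_rate), where added_cells[i - 1] is the
    number of cells added in step i, expressed as a fraction of n.
    """
    t = len(write_rates)
    if any(a < b for a, b in zip(write_rates, write_rates[1:])):
        raise ValueError("write_rates must be non-increasing")
    added_cells = []
    for i in range(1, t):
        # Rate gap between write t-i and write t-i+1 (1-indexed writes).
        gap = write_rates[t - i - 1] - write_rates[t - i]
        rate_i = 1.0 if i == 1 else fixed_sum_rates[i]
        # gap*n extra bits on each of the last i writes, stored with a
        # fixed-rate i-write code of sum-rate rate_i.
        added_cells.append(i * gap / rate_i)
    # Every write now stores write_rates[0]*n bits in (1 + sum)*n cells.
    sum_rate = t * write_rates[0] / (1.0 + sum(added_cells))
    return added_cells, sum_rate


# Four-write example, using the component rates and the fixed-rate
# two- and three-write sum-rates quoted in the text.
cells, rate = fixed_rate_construction(
    [0.6291, 0.4811, 0.413, 1 / 3],
    {2: 1.4546, 3: 1.5731},
)
print([round(c, 4) for c in cells], round(rate, 4))
```

The three entries of `cells` correspond to the three groups of added cells in the four-write example. For the three-write example, whose rates are not in decreasing order, passing the sorted rates `[0.6291, 0.5, 0.4811]` gives the 0.0189n cells of the first step.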
Tables 3 and 4 show a comparison of the sum-rates of unrestricted-rate and fixed-rate WOM-codes presented in this application and the best previously known sum-rates for 2≦t≦10. The column labeled “Best Prior” is the highest sum-rate achieved by a previously reported t-write WOM-code. The column “Achieved New Sum-rate” gives the sum-rates that we actually obtained through application of the new techniques. The column “Maximum New Sum-rate” lists the maximum possible sum-rates that can be obtained using our approach. Finally, the column “Upper Bound” gives the maximum possible sum-rates for t-write WOM-codes.
For unrestricted-rate two-write WOM-codes, the results were found by the computer search method. For three and four writes, we used the multiple-write WOM-code constructions described above, and for 5≦t≦10, we used the concatenated WOM-code constructions. For fixed-rate two-write WOM-codes, we again used the computer search method. The constructions for more than two writes were obtained by application of Theorem 10.
While specific embodiments of the present invention have been shown and described, it should be understood that other modifications, substitutions and alternatives are apparent to one of ordinary skill in the art. Such modifications, substitutions and alternatives can be made without departing from the spirit and scope of the invention, which should be determined from the appended claims.
Various features of the invention are set forth in the appended claims.
The application claims priority under 35 U.S.C. §119 and all applicable treaties and statutes from prior provisional application Ser. No. 61/353,419, which was filed Jun. 10, 2010, and is incorporated by reference herein.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2011/040036 | 6/10/2011 | WO | 00 | 12/6/2012 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2011/156750 | 12/15/2011 | WO | A |
Number | Date | Country | |
---|---|---|---|
20130080681 A1 | Mar 2013 | US |
Number | Date | Country | |
---|---|---|---|
61353419 | Jun 2010 | US |