Method for optimizing polar-RNNA quantizer of MLC-type NAND flash memory on basis of deep learning

Information

  • Patent Grant
  • 11966587
  • Patent Number
    11,966,587
  • Date Filed
    Monday, November 8, 2021
    2 years ago
  • Date Issued
    Tuesday, April 23, 2024
    23 days ago
Abstract
A method for optimizing a Polar-RNNA quantizer of MLC NAND flash based on deep learning comprises the following steps: Step S1: transforming an MLC flash detection task into a deep learning task, and obtaining three hard-decision read thresholds based on a neural network; Step S2: expanding six soft-decision read thresholds based on the three hard-decision read thresholds; Step S3: constructing an LLR mapping table, and obtaining new LLR information of MLC flash based on the LLR mapping table; Step S4: symmetrizing an MLC flash channel, and performing density evolution; and Step S5: optimizing the soft-decision read thresholds based on a genetic algorithm to obtain an optimal quantization interval. According to the invention, polar codes can be directly used for MLC flash channels without the arduous work of MLC flash channel modeling, so that the reliability of MLC flash is effectively improved.
Description
BACKGROUND OF THE INVENTION
1. Technical Field

The invention relates to the field of communication, in particular to a method for optimizing a Polar-RNNA quantizer of MLC NAND flash based on deep learning.


2. Description of Related Art

Compared with traditional hard disk drives, NAND flash-based solid state drives have lower power consumption, faster write/read rate and higher reliability, thus becoming more and more popular on the market. In order to improve the capacity of a data storage system, the MLC (Multi-Level Cell) storage technique and TLC (Trinary-Level Cell) storage technique have gradually replaced the SLC (Single-Level Cell) storage technique.


The work principle of MLC NAND flash is as follows: each memory cell of a memory stores two bits of information, corresponding to stored symbols 11, 10, 00 and 01 respectively. In order to read the data stored in the memory cell, the read-back voltage of the memory cell should be measured and compared with a fixed read threshold through a memory detection circuit. In order to distinguish four levels, at least three read thresholds {R1, R2, R3} are needed. However, during the data reading process, the channel distributions of the MLC flash will be distorted by various unpredictable noises and channel offsets, thus severely compromising the reliability of an MLC flash system.


In order to improve the reliability of the MLC system, error correction codes are considered. The polar code is a new code form given based on channel polarization, which was put forward for the first time by Erdal Arikan on the ISIT conference in 2008 and was expounded in further detail in the 23-page paper “A Method for Constructing Capacity-Achieving Codes for Symmetric Binary-Input Memoryless Channels” published in Journal “IEEE Transaction on Information Theory” in 2009. The polar code has become the unique code that can reach the Shannon limit at present due to its particular code pattern construction manner.


As for polar codes decoded by soft-decision, the decoding performance heavily depends, on the accuracy of the log-likelihood ratio (LLR) of coded bits after channel coding. However, the accurate calculation of LLR requires accurate memory cell sensing and high-precision memory induction, which often leads to an excessive expenditure and delay damage of a memory. Generally, more read thresholds of the memory cells are used to quantize a required LLR. For example, a maximum mutual information (MMI) quantizer is widely used to optimize quantization intervals by maximizing the quantity of mutual information of the MLC channel. However, an accurate model of the MLC channel is necessary for the MMI quantizer, which is impossible due to some unknown offsets and noises of the MLC channel.


BRIEF SUMMARY OF THE INVENTION

In view of this, the objective of the invention is to provide a method for optimizing a Polar-RNNA quantizer of MLC NAND flash based on deep learning, which directly uses polar codes for MLC flash channels without the arduous work of MLC flash channel modeling, thus effectively improving the reliability of MLC flash.


To fulfill the above objective, the invention adopts the following technical solution:


A method for optimizing a Polar-RNNA quantizer of MLC NAND flash based on deep learning comprises the following steps:


Step S1: transforming an MLC flash detection task into a deep learning task, and obtaining three hard-decision read thresholds based on a neural network;


Step S2: expanding six soft-decision read thresholds based on the three hard-decision read thresholds;


Step S3: constructing an LLR mapping table, and obtaining LLR soft information of the MLC flash based on the LLR mapping table;


Step S4: symmetrizing the MLC flash channel, and performing density evolution; and


Step S5: optimizing the soft-decision read thresholds based on a genetic algorithm to obtain optimal quantization intervals.


Further, Step S1 specifically comprises:


Devoting a read-back voltage of a kth memory cell of the MLC flash by vk, inputting v={v1, v2, . . . , vL} to the neural network, and setting L as the number of neurons of an input layer, and outputting a soft estimation {tilde over (x)}={{tilde over (x)}1, {tilde over (x)}2, . . . , {tilde over (x)}L} of a tag x by the neural network; as for the MLC flash, expressing {11, 10, 00, 01} by a tag {0, 1, 2, 3}, so that a hard estimation {circumflex over (x)}k ∈{0, 1, 2, 3} of the kth memory cell xk ∈{0, 1, 2, 3} is obtained by acquiring integers {tilde over (x)}k most approximate to {tilde over (x)} output by the neural network; then, obtaining a corresponding memory cell symbol through a mapping {0, 1, 2, 3}→{11, 10, 00, 01};


Setting a set of read thresholds {R1, R2, R3} for a read-back voltage {right arrow over (v)} of a given set of memory cells to obtain a hard estimation of the memory cell symbol, which is denoted by x; inputting {right arrow over (v)} to an RNN detector to obtain a hard estimation {circumflex over (x)} of a storage symbol based on a principle of proximity;


Setting, during a search process, N RNN output sequences with a length of L, and denoting an it RNN output sequence as {circumflex over (x)}i, so that {circumflex over (X)}={{circumflex over (x)}1, {circumflex over (x)}2, . . . {circumflex over (x)}N}, and X={x1, x2, . . . , xN};


Wherein, a formula for calculating optimal thresholds {R*1, R*2, R*3} is:










{


R
1
*

,

R
2
*

,

R
3
*


}

=



arg

min


{


R
1

,

R
2

,

R
3


}





d

(


X
¯

,

X
ˆ


)






(
1
)







Uniformly quantizing a search space into m interval boundaries {B0, B1, . . . , Bm}, wherein B0=−∞<B1=Vs11< . . . <Bm-1=Vs01<Bm=∞; and


Searching out the optimal thresholds {R*1, R*2, R*3} from {B1, B2, . . . , Bm-1} through an exhaustive search method to minimize a Hamming distance between two outputs.


Further, Step 2 specifically comprises:


Obtaining the overlapping areas of two adjacent Gaussian distribution functions based on the MLC flash decoding curve;


Sensing the overlapping areas with preset high quantization precision and sensing other areas with preset low quantization precision, that is, setting a difference between adjacent read thresholds of the overlapping area to be less than a difference between adjacent read thresholds of non-overlapping areas, so that a width W exits between each threshold obtained by extension and an adjacent optimal threshold; and obtaining six extended read thresholds according to formula (2):









{





B


2

i

-
1

*

=


R
i
*

-

W


2

i

-
1










B

2

i

*

=


R
i
*

+

W

2

i











(
2
)







Wherein, i=1, 2, 3, R*i ∈(B*2i-1, B*2i).


Further, Step S3 specifically comprises: determining seven corresponding quantization intervals based on the six extended read thresholds, wherein each memory cell of MLC flash stores two bits and has four voltage states correspondingly, a left bit is called a most significant bit and is denoted by MSB, and a right bit is called a least significant bits and is denoted by LSB;


Constructing the LLR mapping table, and obtaining an LLR corresponding to each bit C of the storage symbol based on the LLR mapping table.


Further, Step S4 specifically comprises:


Defining f0(L) and f1(L) as probability density functions of LLRs, corresponding to c=0 and c=1 in the memory cells respectively; and reversing all LLR symbols of c=0, wherein a probability density of the LLRs after channel symmetrization is:

fs({circumflex over (L)})=½(f0(−{circumflex over (L)})+f1({circumflex over (L)}))  (3)


Defining aN(i) as a probability density function of the LLR of an ith sub-channel;


Then, calculating aN(i) of each sub-channel according to formulas a2N(2i-1)=aN(i) ⊙ aN(i), a2N(2i)=aN(i)*aN(i), a1(1)=fs({circumflex over (L)}); calculating a code error probability of each sub-channel after aN(i) of each sub-channel is obtained; and










P
e

(
i
)


=


lim

ε


+
0




(





-



-
ε




a
N

(
i
)



d

x


+


1
2






-
ε


+
ε




a
N

(
i
)



dx




)






(
4
)







Adding K minimum Pe(i) through a discretized density evolution method to obtain a block error rate formula







P
B

=




i

A



P
e

(
i
)








of SC decoding, wherein A is an information bit set.


Further, Step S5 specifically comprises:


Obtaining an optimal set {W1*, W2*, . . . , W6*} through the genetic algorithm, wherein an objective function is PB; and










{


W
1
*

,

W
2
*

,


,

W
6
*


}

=



arg

min


{


W
1

,

W
2

,



,

W
6


}





P
B






(
5
)







Obtaining six optimal soft-decision read thresholds {B*1, B*2, . . . , B*6} according to the optimal set {W1*, W2*, . . . , W6*}, so that seven optimal quantization intervals for a Polar-code MLC flash channel are obtained.


Compared with the prior art, the invention has the following beneficial effects:


According to the invention, polar codes can be directly used for MLC flash channels without the arduous work of MLC flash channel modeling, so that the reliability of MLC flash is effectively improved.





BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS


FIG. 1 is a flow diagram of a method according to the invention;



FIG. 2 is an architecture diagram of the RNN detector in one embodiment of the invention;



FIG. 3 is a schematic diagram of six soft-read thresholds designed according to three hard-decision read thresholds in one embodiment of the invention.





DETAILED DESCRIPTION OF THE INVENTION

The invention will be further expounded below in conjunction with the accompanying drawings and embodiments.


Referring to FIG. 1, the invention provides a method for optimizing a Polar-RNNA quantizer of MLC NAND flash based on deep learning, comprising the following steps:


Step S1: a read-back voltage of a kth memory cell of MLC flash is expressed by vk, v={v1, v2, . . . , vL} is input to a neural network, L is set as the number of neurons of an input layer, and the neural network outputs a soft estimation {tilde over (x)}={{tilde over (x)}1, {tilde over (x)}2, . . . , {tilde over (x)}L} of a tag x (storage symbol);


As for the MLC flash, a tag {0, 1, 2, 3} is used to express {11, 10, 00, 01}, so that a hard estimation {circumflex over (x)}k ∈{0, 1, 2, 3} of a kth memory cell xk ∈{0, 1, 2, 3} is obtained by acquiring integers {tilde over (x)}k most approximate to {tilde over (x)} output by the neural network; then, a corresponding memory cell symbol is obtained according to a mapping {0, 1, 2, 3}→{11, 10, 00, 01}, wherein FIG. 2 is an architecture diagram of an RNN detector, and Table 1 illustrates parameter selection of an RNN network;











TABLE 1








Number of training
3 × 106



samples (symbols)




Mini-batch size
100



Loss function
MSE



Initializer
Xavier uniform



Optimizer
Adam









A set of read thresholds {R1, R2, R3} is set for a read-back voltage of {right arrow over (v)} a given set of memory cells to obtain a hard estimation of a memory cell symbol, which is denoted by x; {right arrow over (v)} is input to an RNN detector to obtain a hard estimation {circumflex over (x)} of the storage symbol based on the principle of proximity;


N RNN output sequences with a length L are set during the search process, and the ith output sequence is denoted by {circumflex over (x)}i, so that {circumflex over (X)}={{circumflex over (x)}1, {circumflex over (x)}2, . . . , {circumflex over (x)}N}, and X={x1, x2, . . . , xN};


Wherein, a formula for calculating optimal thresholds {R*1, R*2, R*3} is:










{


R
1
*

,

R
2
*

,

R
3
*


}

=



arg

min


{


R
1

,

R
2

,

R
3


}





d

(


X
¯

,

X
ˆ


)






(
1
)







A search space is uniformly quantized into m interval boundaries {B0, B1, . . . , Bm}, wherein B0=−∞<B1=Vs11< . . . <Bm-1=Vs01<Bm=∞;


Optimal thresholds {R*1, R*2, R*3} are searched out from {B1, B2, . . . , Bm-1} through an exhaustive search method to minimize a Hamming distance between two outputs.


Step 2: six soft-decision read thresholds are obtained by extension based on the three hard-decision read thresholds.


In this embodiment, as shown in FIG. 3, an overlapping area of two adjacent Gaussian distribution functions is obtained according to an MLC flash decoding curve;


The overlapping area is sensed with preset high quantization precision and other areas are sensed with preset low quantization precision, that is, a difference between adjacent read thresholds of the overlapping area is less than a difference between adjacent read thresholds of non-overlapping areas, so that a width W exits between each extended threshold and an adjacent optimal threshold; and six extended read thresholds are obtained according to formula (2):









{





B


2

i

-
1

*

=


R
i
*

-

W


2

i

-
1










B

2

i

*

=


R
i
*

+

W

2

i











(
2
)







Wherein, i=1, 2, 3, R*i ∈(B*2i-1, B*2i).


Step S3: an LLR mapping table is constructed, and LLR soft information of the MLC flash is obtained based on the LLR mapping table;


Seven corresponding quantization intervals are determined according to the six extended read thresholds, wherein each memory cell of MLC flash stores two bits and has four voltage stages correspondingly, a left bit is called a most significant bit (MSB), and a right bit is called a least significant bits (LSB); and


The LLR mapping table is constructed, and an LLR ({circumflex over (L)}MSB and {circumflex over (L)}LSB) corresponding to each bit C of the storage symbol is obtained based on the LLR mapping table.












TABLE 2






Quantization





intervals
{circumflex over (L)}MSB
{circumflex over (L)}LSB



















[B0*, B1*)
−5
−1



[B1*, B2*)
−3
0



[B2*, B3*)
−1
1



[B3*, B4*)
0
3



[B4*, B5*)
1
1



[B5*, B6*)
3
0



[B6*, B7*)
5
−1









Step S4: the precondition of density evolution of polar codes is channel symmetrization, so the MLC flash channel is symmetrized, f0({circumflex over (L)}) and f1({circumflex over (L)}) are defined as probability density functions of LLRs, wherein f0({circumflex over (L)}) and f1({circumflex over (L)}) respectively correspond to c=0 and c=1 in the memory cells; and all LLR symbols of c=0 are reversed, wherein the probability density of LLR after channel symmetrization is:

fs({circumflex over (L)})=½(f0(−{circumflex over (L)})+f1({circumflex over (L)}))  (3)


aN(i) is defined as a probability density function of the LLR of an ith sub-channel;


Then, aN(i) of each sub-channel is calculated according to formulas a2N(2i-1)=aN(i) ⊙ aN(i), a2N(2i)=aN(i)*aN(i), a1(1)=fs ({circumflex over (L)}); a code error probability of each sub-channel is calculated after aN(i) of each sub-channel is obtained; and










P
e

(
i
)


=


lim

ε


+
0




(





-



-
ε




a
N

(
i
)



d

x


+


1
2






-
ε


+
ε




a
N

(
i
)



dx




)






(
4
)







K minimum Pe(i) are added through a discretized density evolution method to obtain a block error rate formula







P
B

=




i

A



P
e

(
i
)








of SC coding, wherein A is an information bit set (K elements).


Step S5: an optimal set {W*1, W*2, . . . , W*6} is obtained through a genetic algorithm, wherein an objective function is PB










{


W
1
*

,

W
2
*

,


,

W
6
*


}

=



arg

min


{


W
1

,

W
2

,



,

W
6


}





P
B






(
5
)







Six optimal soft-decision read thresholds {B*1, B*2, . . . , B*6} are obtained according to the optimal set {W*1, W*2, . . . , W*6}, that is, seven optimal quantization intervals for a Polar-code MLC channel are obtained.


Those skilled in the art would appreciate that the embodiments of the application can be provided as a method, a system or a computer program product. So, the application may be implemented in the form of a completely hardware embodiment, a completely software embodiment, or an embodiment combining software and hardware. In addition, the application may be in the form of a computer program product to be implemented on one or more computer-available storage media (including, but not limited to, a disk memory, a CD-ROM, an optical memory, and the like) comprising computer-available program codes.


The application is described with reference to the flow diagram and/or block diagram of the method, device (system) and computer program product provided by the embodiments of the application. It should be understood that each process and/or block in the flow diagram and/or block diagram and the combinations of processes and/or blocks in the flow diagram and/or block diagram can be implemented by computer program instructions. These computer program instructions can be configured in a general-purpose computer, a special-purpose computer, an embedded processor, or a processor of other programmable data processing terminals to create a machine, so that the instructions can be executed by the computer or the processor of other programmable data processing terminals to create a device for realizing specific functions in one or more processes in the flow diagram and/or in one or more blocks in the block diagram.


These computer program instructions may also be stored in a computer-readable memory that can guide the computer or other program data processing terminals to work in a specific manner, so that the instructions stored in the computer-readable memory can create a product including an instruction device, and the instruction device implements specific functions in one or more processes of the flow diagram and/or one or more blocks in the block diagram.


These compute r program instructions may also be loaded on a computer or other programmable data processing terminal devices, so that the computer or other programmable terminal devices can perform a series of operation steps to carry out processing realized by the computer, and the instructions are executed on the computer or other programmable terminal devices to realize specific functions in one or more processes in the flow diagram and/or one or more block diagrams in the block diagram.


The above embodiments are merely preferred ones of the invention, and are not used to limit the invention in any form. Any skilled in the art can obtain other equivalent embodiments by changing or modifying the technical contents disclosed above. Any simple amendments and equivalent transformations and modifications made to the above embodiments according to the technical essence of the invention without departing from the contents of the technical solutions of the invention still fall within the protection scope of the technical solutions of the invention.

Claims
  • 1. A method for optimizing a Polar-RNNA quantizer of MLC NAND flash memory based on deep learning, comprising the following steps: Step S1: transforming an MLC flash detection task into a deep learning task, and obtaining three hard-decision read thresholds based on a neural network;Step S2: expanding six soft-decision read thresholds based on the three hard-decision read thresholds;Step S3: constructing an LLR mapping table, and obtaining LLR soft information of the MLC flash based on the LLR mapping table;Step S4: symmetrizing the MLC flash channel, and performing density evolution; andStep S5:optimizing the soft-decision read thresholds based on a genetic algorithm to obtain optimal quantization intervals;Step S2 specifically comprises:obtaining the overlapping areas of two adjacent Gaussian distribution functions based on the MLC flash decoding curve;sensing the overlapping areas with preset high quantization precision and sensing other areas with preset low quantization precision, that is, setting a difference between adjacent read thresholds of the overlapping area to be less than a difference between adjacent read thresholds of non-overlapping areas, so that a width W exits between each thresholds obtained by extension and an adjacent optimal threshold; and obtaining six extended read thresholds according to formula (2):
  • 2. The method for optimizing a Polar-RNNA quantizer of MLC NAND flash memory based on deep learning according to claim 1, wherein Step S1 specifically comprises: devoting a read-back voltage of a kth memory cell of the MLC flash by Vk, inputting v={v1, v2, . . . , vL} to the neural network, and setting L as the number of neurons of an input layer, and outputting a soft estimation {tilde over (x)} ={{tilde over (x)}1, {tilde over (x)}2, . . . , {tilde over (x)}L} of a tag x by the neural network;as for the MLC flash, expressing {11,10,00,01} by a tag {0,1,2,3}, so that a hard estimation {circumflex over (X)}k ∈{0,1,2,3} of the kth memory cell xk ∈{0,1,2,3} is obtained by acquiring integers {tilde over (X)}k most approximate to {tilde over (x)} output by the neural network; then, obtaining a corresponding memory cell symbol through a mapping {0,1,2,3}→{11,10,00,01};setting a set of read thresholds {R1, R2, R3} for a read-back voltage {right arrow over (v)} of a given set of memory cells to obtain a hard estimation of the memory cell symbol, which is denoted by {tilde over (x)}; inputting {right arrow over (v)} to an RNN detector to obtain a hard estimation {circumflex over (X)} of a storage symbol based on a principle of proximity;setting, during a search process, N RNN output sequences with a length of L, and denoting an ith RNN output sequence as {circumflex over (X)}1, so that {circumflex over (X)}={{circumflex over (x)}1, {circumflex over (x)}2, . . . , {circumflex over (x)}N}, and X={x1, x2, . . . , xN};wherein, a formula for calculating optimal thresholds {R*1, R*2, R*3} is:
  • 3. The method for optimizing a Polar-RNNA quantizer of MLC NAND flash memory based on deep learning according to claim 1, wherein Step S4 specifically comprises: defining f0({circumflex over (L)}) and f1({circumflex over (L)}) as probability density functions of LLRs, corresponding to c=0 and c=1 in the memory cells respectively; and reversing all LLR symbols of c=0, wherein a probability density of the LLRs after channel symmetrization is: fs({circumflex over (L)})=1/2 (f0(−{circumflex over (L)})+f1({circumflex over (L)}))   (3)defining aN(i) as a probability density function of the LLR of an ith sub-channel;then, calculating aN(i) of each sub-channel according to formulas a2N(2i-1)=aN(i) ⊙ aN(i), a2N(2i)=aN(i)*aN(i), a1(1)=fs({circumflex over (L)}); calculating a code error probability of each sub-channel after aN(i) of each sub-channel is obtained; and
Priority Claims (1)
Number Date Country Kind
202110101188.9 Jan 2021 CN national
PCT Information
Filing Document Filing Date Country Kind
PCT/CN2021/129180 11/8/2021 WO
Publishing Document Publishing Date Country Kind
WO2022/160847 8/4/2022 WO A
US Referenced Citations (6)
Number Name Date Kind
9431118 Bar Aug 2016 B1
9513982 Wu Dec 2016 B1
20150227403 Zhang Aug 2015 A1
20150256196 Alhussien et al. Sep 2015 A1
20180341543 Ha et al. Nov 2018 A1
20200177209 Ismail et al. Jun 2020 A1
Foreign Referenced Citations (2)
Number Date Country
16504796 Mar 2017 CN
112929033 Jun 2021 CN
Non-Patent Literature Citations (1)
Entry
Z. Mei, K. Cai and X. He, “Deep Learning-Aided Dynamic Read Thresholds Design for Multi-Level-Cell Flash Memories,” in IEEE Transactions on Communications, vol. 68, No. 5, pp. 2850-2862, May 2020, doi: 10.1109/TCOMM.2020.2974723. (Year: 2020).
Related Publications (1)
Number Date Country
20230127404 A1 Apr 2023 US