Low-density parity-check decoding with desaturation

Information

  • Patent Grant
  • 11251810
  • Patent Number
    11,251,810
  • Date Filed
    Friday, August 7, 2020
    4 years ago
  • Date Issued
    Tuesday, February 15, 2022
    2 years ago
Abstract
A saturation metric that represents a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation is determined. The saturation metric is compared against a saturation threshold. In the event the saturation metric exceeds the saturation threshold, at the end of a decoding iteration, a message is more aggressively attenuated compared to when the saturation metric does not exceed the saturation threshold in order to produce an attenuated message. In the event the saturation metric does not exceed the saturation threshold, at the end of the decoding iteration, the message is less aggressively attenuated compared to when the saturation metric does exceed the saturation threshold in order to produce the attenuated message.
Description
BACKGROUND OF THE INVENTION

Low-density parity-check (LDPC) codes are a popular choice for stored data, such as data stored on solid state storage. As storage density increases, the number of errors included in the LDPC-encoded data likewise increases. To compensate for this, new and more powerful LDPC decoding techniques would be desirable. Furthermore, it would be desirable if these new LDPC decoding techniques could be easily implemented in existing storage systems, do not consume substantial amounts of resources (e.g., additional processing, memory, power, etc.), and/or do not introduce noticeable (additional) processing delays.





BRIEF DESCRIPTION OF THE DRAWINGS

Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.



FIG. 1 is a flowchart illustrating an embodiment of min-sum decoding of low-density, parity-check (LDPC) codes with de-saturation.



FIG. 2A is a diagram illustrating an embodiment of a check node-to-variable node (CN-to-VN) message update.



FIG. 2B is a diagram illustrating an embodiment of a variable node-to-check node (VN-to-CN) message update.



FIG. 3 is a system diagram illustrating an embodiment of a de-saturated min-sum decoding.





DETAILED DESCRIPTION

The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.


A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.


Various embodiments of a technique to perform LDPC decoding (e.g., by message passing or more specifically min-sum decoding) with de-saturation are described herein. For example, without de-saturation, LDPC systems that use fixed-point number representation can become trapped in some cases if the messages become saturated relative to the fixed-point number representation. In some embodiments, a saturation metric that represents a degree of saturation in a low-density parity-check (LDPC) decoding system that uses fixed-point number representation is determined. The saturation metric is compared against a saturation threshold and it is determined if the saturation metric exceeds the saturation threshold. If so, LDPC decoding data (e.g., one or more messages in the system) is more aggressively attenuated compared to when the saturation metric does not exceed the saturation threshold at the end of a decoding iteration. If not, the LDPC decoding data is less aggressively attenuated at the end of the decoding iteration. The attenuated message may then be passed from a first type of node to a second type of node in a Tanner Graph, assuming more decoding iterations are required to properly decode the LDPC data.



FIG. 1 is a flowchart illustrating an embodiment of min-sum decoding of low-density, parity-check (LDPC) codes with de-saturation. For example, the process may be performed by an LDPC decoder which processes data which has been encoded using an LDPC code. In various applications, the LDPC-encoded data may have been sent over a (e.g., wireless) communications channel or may have been stored on storage media (e.g., a hard disk drive or solid state storage) before being decoded. As such, the exemplary LDPC decoder may be included in a wireless receiver or a storage read-back module in various embodiments.


At 100 a saturation metric that represents a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation is determined. In one example of step 100, the saturation metric is based on the number or percentage of messages (e.g., prior to any message attenuation) from check nodes to variable nodes that are saturated to the maximum fixed-precision magnitude. Alternatively, messages passed in the other direction (e.g., from variable nodes to check nodes) are used to determine the saturation metric at step 100. For example, lmin1(j) is the minimum variable node message to check node j and in some embodiments the saturation metric is based at least in part on the number of messages (e.g., across all check nodes j in the system) that are saturated (e.g., to the maximum fixed precision magnitude) at the end of a decoding iteration.


In another example of step 100, the saturation metric is based on a number of saturated nodes in the system. For example, the saturation metric may be the number of saturated check nodes at the end of a decoding iteration. For example, check node j may be declared saturated if Lmin1(j) (i.e., the minimum attenuated variable node message to check node j where lmin1(j) is the message before attenuation and Lmin1(j) is the message after attenuation) reaches the maximum fixed-precision magnitude.


At 102, the saturation metric is compared against a saturation threshold. In one example, the saturation metric is a percentage and the saturation threshold is within a range of 3/4 (75%)-15/16 (93.75%).


At 104, it is determined if the saturation metric exceeds the saturation threshold. If so, at the end of a decoding iteration, a message is more aggressively attenuated compared to when the saturation metric does not exceed the saturation threshold in order to produce an attenuated message at 106. If not, at the end of a decoding iteration, the message is less aggressively attenuated compared to when the saturation metric does exceed the saturation threshold in order to produce the attenuated message at 108.


In one example of step 106, the messages that are attenuated are the lmin1(j) and lmin2(j) messages, which are the minimum and second minimum variable node messages to check node j, respectively, before any attenuation or normalization. In this example of step 106, they are more aggressively attenuated using de-saturation attenuation factor αds and de-saturation attenuation rounding βds (e.g., lmin1(j)=└αds·lmin1(j)ds┘ and Lmin2(j)=└αds·lmin2(j)ds┘).


In a corresponding example of step 108, the same messages (i.e., lmin1(j) and lmin2(j)) are less aggressively attenuated using attenuation factor α and attenuation rounding β (e.g., Lmin1(j)=└α·lmin1(j)+β┘ and Lmin2(j)=└α·lmin2(j)+β┘) wherein the desaturation attenuation parameters satisfy the following:

αds
└αdsds┘≤└α+β┘
└αdsLmaxds┘<└αLmax+β┘

where Lmax denotes the maximum message magnitude. Furthermore, the parameters α, β, αds, βds can be optimized through density evolution method.


By more aggressively attenuating the messages if the system is saturated (e.g., as measured or otherwise determined using the saturation threshold), the saturation of correct messages is facilitated and error correction over small trapping sets is facilitated, but only if/when needed (e.g., only when the saturation metric exceeds the saturation threshold). More generally, by de-saturating LDPC data if needed, the performance of the LDPC decoder is improved in cases where the data (e.g., the passed messages) would otherwise have saturated and the decoder would trend towards an uncorrectable state.


The attenuated message (e.g., output by step 106 or 108) may then pass from a first type of node (e.g., a variable node) to a second type of node (e.g., a check node) in a Tanner graph, assuming additional LDPC decoding is required. The second type of nodes in the Tanner graph may then send back updated messages to the first type of nodes and the process of FIG. 1 may be repeated as or if needed.


To provide more context for the decoding technique with de-saturation described herein, it may be helpful to discuss check nodes and variable nodes which perform LDPC decoding by passing messages between the nodes. The following figures describe some such examples.



FIG. 2A is a diagram illustrating an embodiment of a check node-to-variable node (CN-to-VN) message update. In the example shown here, each check node (e.g., cj (200a)) in the graph receives log-likelihood ratio (LLR) information (e.g., Li→j (202a)) from all of its neighboring variable nodes (e.g., vi (204a)). Each check node generates an updated check-to-variable message (e.g., Li→j (206a)) for a given variable node using the inputs from all other neighboring variable nodes (not shown here).


Low-density parity-check (LDPC) codes are a type of error correction codes and are so named because they have a very low density of 1s in their parity-check matrix (H). This property makes it convenient to represent an LDPC code using a bipartite graph, called a Tanner Graph. There are two types of nodes in a Tanner graph: variable nodes (VNs) and check nodes (CNs). In the example shown here, vi (204a) is an example of a variable node and cj (200a) is an example of a check node. Each variable node (or check node) corresponds to a column (or row) of the parity-check matrix, H. As used herein, V={v1, . . . , vn} is the set of variable nodes and C={c1, . . . , cm} is the set of check nodes. Each row of H is indexed by C=(1, 2, . . . , m) and each column of H is indexed by V=(1, 2, . . . , n). In the Tanner graph, variable node vi is connected to check node cj via an edge if Hj,i=1 and the set of edges on the Tanner graph is denoted by set E.


Quasi-cyclic low-density parity-check (QC-LDPC) codes are a special class of the LDPC codes with structured H matrix which can be generated by the expansion of an mb×nb base matrix. Each 1s element in the base matrix can be expanded by a circularly right-shifted b×b identity sub-matrix. QC-LDPC codes have advantages over other types of LDPC codes in terms of hardware implementations on both the encoding and decoding side. Encoding of a QC-LDPC code can be (more) efficiently implemented (e.g., in hardware) using simple shift registers. In a hardware implementation of a QC-LDPC decoder, the QC structure of the code simplifies the wire routing for message passing.


It may be helpful to describe message passing in more detail. The following figure shows an example of a variable node-to-check node message update, which is part of message passing (i.e., a type of LDPC decoding).



FIG. 2B is a diagram illustrating an embodiment of a variable node-to-check node (VN-to-CN) message update. In the example shown here, each variable node (e.g., vi (204b)) in the graph receives LLR information (e.g., (206b)) from all of its neighboring check nodes (e.g., cj (200b)). Each variable node (e.g., vi (204b)) generates an updated variable-to-check message (e.g., Li→j (202b)) to a given check node (e.g., cj (200b)) using the inputs from all other neighboring check nodes (not shown).


Message-passing (MP) is an efficient technique to achieve near-optimal decoding of LDPC codes. For notational conciseness, a variable node is referred to subsequently simply as i (instead of vi) and j (instead of cj) is used to denote a check node. As shown in this example, a variable node i (204b) receives an input message Lich (210) from the channel. For example, this message from the channel may be the log-likelihood ratio (LLR) of the corresponding channel output, defined as follows:











L
i
ch

=

log


(


Pr


(


R
i

=



r
i

|

c
i


=
0


)



Pr


(


R
i

=



r
i

|

c
i


=
1


)



)



,




(
1
)








where ci∈{0,1} is the code bit and ri is the corresponding received symbol.


A conventional iterative message passing decoder alternates between two phases: a CN-to-VN phase (during which check nodes send messages to their adjacent variable nodes) and a VN-to-CN phase (during which variable nodes send messages to check nodes along their adjacent edges) which are depicted schematically in FIGS. 2A and 2B, respectively. In the initialization step of the decoding process, variable node i forwards the same message to all of its neighboring check nodes V(i), namely the LLR Lich derived from the corresponding channel output. In the CN-to-VN message update phase, check node j uses the incoming messages and check node update rule to compute and forward, to variable node i∈C(j), a new CN-to-VN message, Lj→i Variable node i then processes its incoming messages according to the variable node update rule and forwards to each adjacent check node, C(i), an updated VN-to-CN message, Li→j. After a pre-specified number of iterations, variable node i sums all of the incoming LLR messages to produce an estimate of the corresponding code bit i. Note that all of the CN-to-VN message updates can be done in parallel, as can all of the VN-to-CN message updates. This enables efficient, high-speed software and/or hardware implementations of iterative message-passing decoding processing.


Let Li→j and Lj→i represent the messages sent from variable node i to check node j and from check node j to variable node i, respectively. Let C(i) be the set of check nodes directly connected to variable node i and V(J) be the set of variable nodes directly connected to check node j. Then, the message sent from variable node i to check node j in sum-product decoding is given by:











L

i

j


=


L
i
ch

+





j





C


(
i
)



\

j





L


j



i





,




(
2
)








and the message from check node j to variable node i is computed as:










L

j

i


=

2




tanh

-
1


(





i





𝒱


(
j
)



\

i









tanh



L


i



j


2



)

.






(
3
)








Let Pi be a posterior probability (APP) message of variable node i where:










P
i

=


L
i
ch

+





j




𝒞


(
i
)







L


j



i


.







(
4
)







In this example, a variable node receives the log-likelihood ratios of received information from the channel as an initial input message (i.e., Li→j=Lich) and the following equivalent check node update rule is employed:











L

j

i


=


[





i





𝒱


(
j
)



\

i





sign


(

L


i



j


)



]

·




α
·


min


i





𝒱


(
j
)



\

i







L


i



j






+
β





,




(
5
)








where 0<α<1, β>0 is the attenuation factor and attenuation rounding, respectively, which can be either pre-fixed or dynamically adjusted. Herein we newly introduced the attenuation rounding parameter β, which satisfies the following:

1≤α+β<2

which prevents a minimum CN-to-VN message of 1 from being attenuated to zero (which would erase any information contained therein).


It is noted that channel LLR inputs may be conveniently scaled for min-sum decoding but preferably are precise for the original sum-product decoding. With that in mind, the following notations are used to simplify the above calculation. Let:

Si→jcustom charactersign(Li→j).  (7)

Let S(j) be the product sign of all variable nodes i to the check node j:










S

(
j
)




=
Δ







i




𝒱


(
j
)







S


i



j


.






(
8
)








Let lmin1(j) and imin1(j) be the minimum variable node message to check node j and its associated index, respectively:











l

min





1


(
j
)




=
Δ




min


i




𝒱


(
j
)








L


i



j






,


i

min





1


(
j
)




=
Δ



arg



min


i




𝒱


(
j
)








L


i



j











(
9
)








and let lmin2(j) be the second minimum variable node message to check node j:










l

min





2


(
j
)




=
Δ




min


i





𝒱


(
j
)



\


l

min





1


(
j
)










L


i



j




.






(
10
)








Furthermore, let Lmin1(j) and Lmin2(j) be the attenuated minimum and second minimum variable node message, respectively, to the check node j:

Lmin1(j)custom character└α·lmin1(j)+β┘,Lmin2(j)custom character└α·lmin2(j)+β┘  (11)

With the above notations, Equation (5) can be conveniently rewritten as:










L

j

i


=


S

(
j
)


·

S

i

j


·

{






L

min





1


(
j
)


,





if





i



i

min





1


(
j
)









L

min





2


(
j
)


,





if





i

=

i

min





1


(
j
)






.







(
12
)








Pseudocode 1 describes a hardware amenable min-sum decoding example along these lines. It is noted that Pseudocode 1 does not de-saturate (e.g., unlike FIG. 1 and other embodiments described herein) and so Pseudocode 1 is vulnerable to becoming trapped in an uncorrectable state if the passed messages were to saturate at the maximum fixed-precision magnitude.












Pseudocode 1: Example of Flooded Min-Sum Decoding Without De-Saturation















Initialization: Lmin1(j) = Lmin2(j) = 0, ∀j ∈ custom character


Iteration:


 1: lmin1(j) = lmin2(j) = ∞, imin1(j) = ∅, Sj = 0, ∀j ∈ custom character


 2: for ∀i ∈ custom character , do


 3:  for ∀j ∈ custom character (i), do


 4:   Read (old) {S(j), imin1(j), Lmin1(j), Lmin2(j)}





 5:   
ComputeLji{S(j)·Sij·Lmin1(j),ifiimin1(j)S(j)·Sij·Lmin2(j),ifi=imin1(j)






 6:  end for


 7:  Compute Pi ← Lich + Σj∈C(i) Lj→i


 8:  for ∀j ∈ custom character (i), do


 9:   Compute Li→j ← Pi − Lj→i


10:   Store (new) Si→j ← sign(Li→j)


11:   Compute (new) S(j) ← S(j) ⊕ Si→j


12:   Compute (new) {imin1(j), lmin1(j), lmin2(j)} ← {imin1(j), lmin1(j), lmin2(j), {i, |Li→j|}}


13:  end for


14: end for


15: Compute syndrome sign([P1, P2, ..., Pn]) · HT. If 0 then return the codeword sign([P1, P2, ..., Pn]).


16: Normalize Lmin1(j) = └α · lmin1(j) + β┘, Lmin2(j) = └α · lmin2(j) + β┘, ∀ j ∈ custom character .









When a QC-LDPC code with b×b circulants is in use, each circulant of b bits is updated independently and in parallel.


The paper “Propagation of LLR saturation and quantization error in LDPC min-sum iterative decoding” by KANISTRAS et al. (which does not describe the de-saturation approach described in FIG. 1) investigated the theoretical aspect of saturation effect but did not propose any de-saturation technique.


In “Quantized iterative message passing decoders with low error floor for LDPC codes” by ZHANG et al., a new non-uniform quantization method was proposed to extend the message quantization range by using an exponentially increased step size for large magnitudes while keeping a fixed step size for small magnitudes. However, the proposed exponentially increased step size design is difficult to implement in hardware. In contrast, the de-saturation decoding technique described in FIG. 1 is amenable to hardware implementation and does not use an exponentially increased vs. fixed step size.


In U.S. Pat. No. 9,755,666 by Yingquan Wu, CN-to-VN messages are halved (e.g., by the variable node in the middle of a decoding iteration) if a significant fraction of VN-to-CN messages are saturated. For example, this would correspond to having a new line between line 5 and line 6 in Pseudocode 1 (not shown) where is halved (i.e., set to Lj→i to Lj→i/2) if some saturation condition is flagged. However, halving the CN-to-VN messages at that stage (i.e., in the middle of a VN-to-CN message update) as opposed to at the end of a decoding layer/iteration results in longer critical (e.g., rate limiting) path and thus slower clock speed. For example, there may be a critical timing path that begins with reading or otherwise inputting some piece of data or variable (e.g., at line 4 in Pseudocode 1) and ending with computing new values for that piece of data or value (e.g., at lines 12 and/or 13 in Pseudocode 1) and the older technique of halving Lj→i (e.g., which would occur between lines 5 and 6 in Pseudocode 1 but which is not shown there) introduces additional delay into a critical timing path. Furthermore, halving is a crude way of attenuating information and it may not be necessary to apply that much attenuation even if saturation is detected in the system. By using two de-saturation attenuation parameters (e.g., αds as well as βds), sufficient attenuation can be achieved without losing as much information as halving does. In one example,







α
ds

=



3
4


α





and






β
ds


=


3
4



β
.







If iterative message-passing decoding is implemented in hardware, the decoding efficiency can be improved using a layered decoding approach. In layered decoding, check node messages are updated serially. That is, instead of sending all messages from variable nodes to check nodes, and then all messages from check nodes to variable nodes (i.e., flooding), the layered coding goes through the check nodes in sequential order such that, to each check node being updated, all messages are sent in and processed, and then sent out to neighboring variable nodes. Such scheduled serial updating of check nodes enables immediate propagation of the newly updated message, unlike the flooded scheme where the updated messages can propagate only in the next iteration.


As a result, layered decoding improves convergence speed by roughly twice compared to that of a flooded implementation. Moreover, it provides a good trade-off between speed and memory. This is achieved by iterating over dynamic CN-to-VN messages, denoted by Qcustom character[Q1, Q2, . . . , Qn]. Specifically, let variable node i∈V(j), then Qi over a layer j is defined as:











Q
i

(
j
)




=
Δ




L

j

i


=


L
i
ch

+





j





𝒞


(
i
)



\

j





L


j



i


(
last
)






,




(
13
)








where the superscript (last) denotes the latest updated. It is worth noting that, in layered decoding, the VN-to-CN message updated at the last layer (all but the last are from the current iteration) is utilized to update the CN-to-VN Qi in the current layer, whereas in the flooded decoding updating a CN-to-VN message Lj→i utilizes the VN-to-CN messages each generated at the last iteration. The Qcustom character[Q1, Q2, . . . , Qn] memory is initialized with the channel messages Lchcustom character[L1ch, L2ch, . . . , Lnch] and no dedicated memory is needed to store Lch, whereas with flooded decoding, Lch is stored but not Q. Qi, i=1, 2, . . . , n, is iteratively calculated as follows. Let j be the current layer and ji be the preceding layer associated with variable node i. A preceding layer is mathematically declared or otherwise defined as follows. Let j1<j2< . . . <jk be all check nodes directly connected to variable node i, then jl is the preceding layer of jl+1 for l=1, 2, . . . , k−1, and jk is the preceding layer of jl.


The APP (i.e., a posterior probability) message Pi at the layer j is calculated as:

Pi(j)=Qi(ji)+Lji→inew  (14)

where Lji→inew is newly updated and Qi is iteratively updated by:

Qi(j)=Pi(j)−Lj→iold,  (15)

where Lji→iold was saved during the preceding iteration. The layered decoding can be applied to all types of iterative message-passing decoding, including SAP and min-sum decoding. A hardware amenable layered min-sum decoding process is described below in Pseudocode 2. It is noted that Pseudocode 2 does not check for de-saturation and perform de-saturation on the LDPC data, if needed (see, e.g., FIG. 1) and so Pseudocode 2 is vulnerable to becoming trapped in an uncorrectable state if the passed messages were to saturate at the maximum fixed-precision magnitude.












Pseudocode 2: Example of Layered Min-Sum Decoding Without De-Saturation















Initialization: Lmin1(j) = Lmin2(j) = 0, ∀j ∈ custom character ; Qi = Lich, ∀i ∈ custom character ; custom character =0


Iteration:


 1: for custom character =0, 1, 2 ..., m − 1 do


 2:  for ∀j ∈ custom character  do


 3:   lmin1(j) = lmin2(j) = ∞, imin1(j) = ∅, Sj = 0


 4:   for ∀i ∈ custom character (j) do


 5:    Read (new) {S(ji), imin1(ji), Lmin1(ji), Lmin2(ji)} where ji is the preceding layer of VN i





 6:    
ComputeL{S(ji)·Siji·Limin1(ji),ifiimin1(ji)S(ji)·Siji·Limin2(ji),ifi=imin1(ji)






 7:    Compute Pi ← Qi + Lji→inew


 8:    Read (old) {S(j), imin1(j), Lmin1(j), Lmin2(j)}





 9:    
ComputeLjiold{S(j)·Sij·Lmin1(j),ifiimin1(j)S(j)·Sij·Lmin2(j),ifi=imin1(j)






10:    Compute Qi ← Pi − Lj→iold.


11:    Store Si→j = sign(Qi)


12:    Compute (new) S(j) ← S(j) ⊕ Si→j


13:    Compute (new) {imin1(j), lmin1(j), lmin2(j)} ← {imin1(j), lmin1(j), lmin2(j), {i, |Qi|}}


14:   end for


15:   Compute syndrome sign([P1, P2, ..., Pn]) · HT. If 0, then return the codeword sign([P1, P2, . . . , Pn]).


16:  end for


17:  Normalize Lmin1(j) = └α · lmin1(j) + β┘, Lmin2(j) = └α · lmin2(j) + β┘, j ∈ custom character


18: end for









When a QC-LDPC code with b×b circulants is in use, b quasi-cyclic rows of H are (naturally) treated as a layer. That is, a layer contains b check nodes, each being updated independently and in parallel. It is noted that convergence may occur within any layer for layered min-sum decoding (one example of which is shown in Pseudocode 2) whereas the convergence must occur at the end of an iteration for the flooded min-sum decoding (one example of which is shown in Pseudocode 1). Moreover, layered decoding enables or otherwise permits utilization of updated CN-to-VN messages within an iteration whereas it is not possible for flooded decoding. Consequently, layered decoding converges roughly twice as fast as flooded decoding.


As described above, in one alternate approach to FIG. 1, values are halved in the middle of a layered decoding iteration in a crude attempt to de-saturate. For example, in Pseudocode 2, this would include a new line (not shown) between lines 7 and 8 which would halve Pi if Ids equals one and variable node i is being visited or iterated through for the first time. There would also be a new line (not shown) in Pseudocode 2 between lines 9 and 10 which would halve Lj→iold. However, as described above, there are drawbacks to doing the de-saturation in the middle of CN-to-VN message update and halving the messages may attenuate the message to an unnecessary degree (e.g., just a little more attenuation than would otherwise or normally be applied is sufficient).


During simulations with Pseudocode 1 and 2 (or the like), it was observed that the range of messages passed between variable nodes and check nodes in the decoder has direct impact on the decoding performance in terms of both converge speed and error rate. When fixed-point magnitude was not enforced, correct messages typically grew faster than incorrect messages, with most errors due to small trapping sets correctable. However, given limited precision in practice (e.g., five bits of representation), after a certain number of iterations, messages tended to saturate to the maximum fixed-point magnitude. In such scenarios, correct messages are not able to outweigh incorrect messages, and the message in passing is gradually downgraded to bipolar messages.


From this observation, new and improved adaptive quantization methods have been developed (see, e.g., FIG. 1). In some embodiments, to expand the range of represented values by message index, the messages are scaled down after (or if) certain criterion is met. For example, but not limited to, if at the end of an iteration the number of saturated check nodes, denoted by θC, is greater than a pre-defined threshold, denoted by Θ, then all CN-to-VN messages in the decoder are more aggressively attenuated which effectively ameliorates the saturation during the next iteration. Herein, a check node j is declared saturated if Lmin1(j) reaches the maximum finite-precision magnitude.


This enables an effective increase in the quantization range without (meaningfully) increasing complexity or memory. Pseudocode 3 shows an example of flooded decoding with de-saturation.












Pseudocode 3: Example of De-saturated Flooded Min-Sum Decoding















Initialization: Lmin1(j) = Lmin2(j) = 0, ∀j ∈ custom character


Iteration:


 1: lmin1(j) = lmin2(j) = ∞, imin1(j) = ∅, Sj = 0, ∀j ∈ custom character


 2: for ∀i ∈ custom character , do


 3:  for ∀j ∈ custom character (i), do


 4:   Read (old) {S(j), imin1(j), Lmin1(j), Lmin2(j)}





 5:   
ComputeLji{S(j)·Sij·Lmin1(j),ifiimin1(j)S(j)·Sij·Lmin2(j),ifi=imin1(j)






 6:  end for


 7:  Compute Pi ← Lich + Σj∈C(i) Lj→i


 8:  for ∀j ∈ custom character (i), do


 9:   Compute Lj→i ← Pi − Lj→i


10:   Store (new) Si→j ← sign(Li→j)


11:   Compute (new) S(j) ← S(j) ⊕ Si→j


12:   Compute (new) {imin1(j), lmin1(j), lmin2(j)} ← {imin1(j), lmin1(j), lmin2(j), {i, |Li→j|}}


13:  end for


14: end for


15: Compute syndrome sign([P1, P2, ..., Pn]) · HT. If 0 then return the codeword sign([P1, P2, . . . , Pn]).


16: If the number of saturated lmin1 is greater than Θ, then set Ids = 1, otherwise 0.


17: If Ids = 0 then normalize Lmin1(j) = └α · lmin1(j) + β┘, Lmin2(j) = └α · lmin2(j) + β┘, otherwise de-saturate Lmin1(j) =


    └αds · lmin1(j) + βds┘, Lmin2(j) = └αds · lmin2(j) + βds┘, ∀j ∈ custom character .









Pseudocode 3 shows one example of how the process of FIG. 1 may be performed. Although not explicitly stated in Pseudocode 3, the process may end if the syndrome equals 0 and the codeword is returned at line 15 (e.g., because decoding has successfully completed and there is no need to continue). As described above, it may be beneficial to do the de-saturation (e.g., more aggressively attenuate a message or other data) at the end of the iteration because it does not affect the existing pipelining stages and critical (timing) path. Also, the de-saturation parameters αds and βds may be able to achieve a better de-saturation compared to crudely halving values (e.g., which may attenuate the messages or data to an unnecessary degree).


The following figure shows an example system diagram which performs the process of Pseudocode 3.



FIG. 3 is a system diagram illustrating an embodiment of a de-saturated min-sum decoding. In the example shown, the system diagram is applicable to both de-saturated flooded min-sum decoding as well as de-saturated layered min-sum decoding. In this example, a (global) saturation monitor (300) generates a (global) de-saturation signal (e.g., Ids) which is passed to all of the variable nodes in the system, including the variable node shown here (304). For example, the Ids signal referred to in lines 16 and 17 of Pseudocode 3 corresponds to this signal.


The de-saturation signal controls a plurality of multiplexers, including the multiplexer shown here (302). If the de-saturation signal equals zero, then the multiplexer (302) selects the less aggressive attenuation parameters of α and β to pass on to the attenuator block (308) which uses the selected attenuation parameters to attenuate messages (e.g., lmin1(j) and lmin2(j)). If the de-saturation signal instead equals one, then the multiplexer (302) selects the more aggressive attenuation parameters of αds and βds to output to the attenuator (308). In some embodiments, αds is within a range of







1
2


α





and






3
4


α





and βds is within a range of







1
2


β





and






3
4



β
.





The attenuated messages which are output by the attenuation block (308) are then passed from the variable node i (304) to the check node j (306) as VN-to-CN messages (e.g., Lmin1(j) and Lmin2(j)) assuming another iteration of decoding needs to be performed (e.g., because the syndrome is not all zeros which indicates that LDPC decoding has not yet successfully completed).


Returning briefly to FIG. 1, the saturation monitor (300) is one example of a device that performs steps 100 and 102 in FIG. 1 and the multiplexer (302) and attenuator (308) are examples of devices that perform steps 104, 106, and 108 in FIG. 1. It is noted that this system diagram is merely exemplary and is not intended to be limiting. For example, it is logically equivalent to have two attenuation blocks (e.g., one that outputs less aggressively attenuated messages and one that outputs more aggressively attenuated messages) and have the multiplexer follow the attenuation blocks.


As described above, unlike some other techniques (e.g., a system which alternates between an exponentially increased step size and fixed step size), adding a saturation monitor (300) and a plurality of multiplexers (e.g., the multiplexer (302) shown here) to an existing (min-sum) LDPC decoding system is relatively easy to implement in hardware.


For completeness, Pseudocode 4 shows an example of layered min-sum decoding with de-saturation.












Pseudocode 4: Example of De-saturated Layered Min-Sum Decoding















Initialization: Lmin1(j) = Lmin2(j) = 0, ∀j ∈ custom character ; Qi = Lich, ∀i ∈ custom character ; custom character =0; Ids = 0


Iteration:


 1: for custom character =0, 1, 2, ..., m − 1 do


 2:  for ∀j ∈ custom character  do


 3:   lmin1(j) = lmin2(j) = ∞, imin1(j) = ∅, Sj = 0


 4:   for ∀i ∈ custom character (j) do


 5:    Read (new) {S(ji), imin1(ji), Lmin1(ji), Lmin2(ji)} where ji is the preceding layer of VN i





 6:    
Computejiinew{S(ji)·Siji·Lmin1(ji),ifiimin1(ji)S(ji)·Siji·Lmin2(ji),ifi=imin1(ji)






 7:    Compute Pi ← Qi + Lji→inew


 8:    Read (old) {S(j), imin1(j), Lmin1(j), Lmin2(j)}.





 9:    
ComputeLjiold{S(j)·Sij·Lmin1(j),ifiimin1(j)S(j)·Sij·Lmin2(j),ifi=imin1(j)






10:    Compute Qi ← Pi − Lj→iold.


11:    Store Si→j = sign(Qi)


12:    Compute (new) S(j) ← S(j) ⊕ Si→j


13:    Compute (new) {imin1(j), lmin1(j), lmin2(j)} ← {imin1(j), lmin1(j), lmin2(j), {i, |Qi|}}


14:   end for


15:   Compute syndrome sign([P1, P2, ..., Pn]) · HT. If 0 then return the codeword sign([P1, P2, ..., Pn]).


16:  end for


17:  If Ids = 0 then normalize Lmin1(j) = └α · lmin1(j) + β┘, Lmin2(j) = └α · lmin2(j) + β┘, otherwise de-saturate


     Lmin1(j) = └αds · lmin1(j) + βds┘, Lmin2(j) = └αds · lmin2(j) + βds┘, j ∈ custom character .


18:  If custom character  = 0 and overall number of saturated lmin1 is greater than a pre-determined threshold Θ,


     then set Ids = 1, otherwise 0.


19: end for










Pseudocode 4 shows one example of how the process of FIG. 1 may be performed. Although not explicitly described in the pseudocode, the process of Pseudocode 4 may end after line 15 if the syndrome equals 0 (which indicates that LDPC decoding has successfully completed and therefore further processing is unnecessary). And as with Pseudocode 3, the attenuation in Pseudocode 4 is done at the end of the iteration (e.g., at line 17) using the parameters αds and βds which may be beneficial in at least some applications.


Although some of the examples described herein show flooded min-sum decoding and layered min-sum decoding, the techniques described herein may be extended to other variants of message-passing decoding of LDPC codes (e.g., shuffled decoding which has efficiency between that of flooded decoding and layered decoding).


In some embodiments, the decoder selects the degree of de-saturation applied (e.g., at step 106 in FIG. 1). For example, in some cases, it may be desirable to more aggressively attenuate a message with de-saturation parameters of







α
ds

=



3
4


α





and






β

d

s



=


3
4


β







at step 106 in FIG. 1 whereas in other cases it may be more desirable to use de-saturation parameters of







α
ds

=



1
2


α





and






β
ds


=


1
2


β







at step 106 because a larger degree of de-saturation and/or attenuation is called for given the state of the system.


In some embodiments, such a selection of de-saturation parameters (e.g., for use at step 106 in FIG. 1) is done by density evolution optimization. For example, density evolution is used to analyze the convergence behavior of the min-sum decoder (e.g., the code threshold) for a given LDPC code ensemble under min-sum decoding, where the code threshold is defined as the maximum channel noise level at which the decoding error probability converges to zero as the code length goes to infinity. Density evolution is used to determine the performance of min-sum decoding for a given code ensemble by tracking the probability density function (PDF) of messages passed along the edges in the corresponding Tanner graph through the iterative decoding process. Then, it is possible to test whether, for a given channel condition and a given degree distribution, the decoder can successfully decode the transmitted message (with the decoding error probability tends to zero as the iterations progress). Using density evolution as an evaluation tool, various de-saturation parameters (e.g., any of α, αds, β, and/or βds) may be tested and optimized values for any of α, αds, β, and/or βds may be selected.


Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.

Claims
  • 1. A decoder, comprising: a saturation monitor configured to: generate a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;plurality of variable nodes that is configured to: in the event the global de-saturation control signal is a first value, output a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, output a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value; andthe plurality of check nodes.
  • 2. The decoder recited in claim 1, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
  • 3. The decoder recited in claim 1, wherein the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system.
  • 4. The decoder recited in claim 1, wherein: the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system; anda node is declared saturated if an associated minimum attenuated message after attenuation reaches a maximum fixed-precision magnitude.
  • 5. The decoder recited in claim 1, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
  • 6. The decoder recited in claim 1, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin1(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein αds is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)) including by using Lmin2(j)=└αds·lmin2(j)+βds┘.
  • 7. The decoder recited in claim 1, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α; anda first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘.
  • 8. The decoder recited in claim 1, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α;a first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘; andat least one of α, αds, β, or βds is determined using density evolution.
  • 9. A method, comprising: generating a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;in the event the global de-saturation control signal is a first value, outputting a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, outputting a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value.
  • 10. The method recited in claim 9, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
  • 11. The method recited in claim 9, wherein the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system.
  • 12. The method recited in claim 9, wherein: the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system; anda node is declared saturated if an associated minimum attenuated message after attenuation reaches a maximum fixed-precision magnitude.
  • 13. The method recited in claim 9, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
  • 14. The method recited in claim 9, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin1(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein αds is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)) including by using Lmin2(j)=└αds·lmin2(j)+βds┘.
  • 15. The method recited in claim 9, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α; anda first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤α+β┘.
  • 16. The method recited in claim 9, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α;a first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘; andat least one of α, αds, β, or βds is determined using density evolution.
  • 17. A computer program product embodied in a non-transitory computer readable medium and comprising computer instructions for: generating a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;in the event the global de-saturation control signal is a first value, outputting a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, outputting a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value.
  • 18. The computer program product recited in claim 17, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
  • 19. The computer program product recited in claim 17, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
  • 20. The computer program product recited in claim 17, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin2(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)), including by using Lmin2(j)=└αds·lmin2(j)+βds┘.
CROSS REFERENCE TO OTHER APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 16/777,457 (now U.S. Pat. No. 10,778,248, issued Sep. 15, 2020) entitled LOW-DENSITY PARITY-CHECK DECODING WITH DE-SATURATION filed Jan. 30, 2020 which is incorporated herein by reference for all purposes.

US Referenced Citations (25)
Number Name Date Kind
5572350 Spanke Nov 1996 A
7383485 Tran Jun 2008 B2
7519898 Narayanan Apr 2009 B2
7583945 McCarthy Sep 2009 B2
7669106 Farjadrad Feb 2010 B1
8281210 Farjadrad Oct 2012 B1
8407556 Shen Mar 2013 B2
8433984 Khandekar Apr 2013 B2
8457194 Ali Jun 2013 B2
8601352 Anholt Dec 2013 B1
8739002 Nakamura May 2014 B2
8995863 Moroi Mar 2015 B2
9048870 Li Jun 2015 B2
9100052 Pisek Aug 2015 B2
9246717 Beidas Jan 2016 B2
9264073 Malmirchegini Feb 2016 B2
9473175 Graumann Oct 2016 B1
9571168 Moon Feb 2017 B2
9612903 Tehrani Apr 2017 B2
9716602 Beidas Jul 2017 B2
9755666 Wu Sep 2017 B2
10164663 Shin Dec 2018 B2
10236070 Barndt Mar 2019 B2
20040187129 Richardson Sep 2004 A1
20080276156 Gunnam Nov 2008 A1
Non-Patent Literature Citations (7)
Entry
Chen et al., Reduced-Complexity Decoding of LDPC Codes, IEEE Transactions on Communications, Aug. 2005, pp. 1288-1299, vol. 53, No. 8.
Kanistras et al., Propagation of LLR Saturation and Quantization Error in LDPC Min-Sum Iterative Decoding, 2012 EEE Workshop on Signal Processing Systems, pp. 276-281, 2012.
Kim et al., A Reduced-Complexity Architecture for LDPC Layered Decoding Schemes, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Jun. 2011, pp. 1099-1103, vol. 19, No. 6.
Mansour et al., High-Throughput LDPC Decoders, IEEE Transactions on Very Large Scale Integration Systems, Dec. 2003, pp. 976-996, vol. 11, No. 6.
Richardson et al., The Capacity of Low-Density Parity-Check Codes Under Message-Passing Decoding, IEEE Transactions on Information Theory, Feb. 2001, pp. 599-618, vol. 47, No. 2.
Zhang et al., Quantized Iterative Message Passing Decoders with Low Error Floor for LDPC Codes, IEEE Transactions on Communications, Jan. 2014, pp. 1-14, vol. 62, No. 1.
Zhang et al., Shuffled Iterative Decoding, IEEE Transactions on Communications, Feb. 2005, pp. 209-213, vol. 53, No. 2.
Related Publications (1)
Number Date Country
20210242884 A1 Aug 2021 US
Continuations (1)
Number Date Country
Parent 16777457 Jan 2020 US
Child 16988429 US