Low-density parity-check decoding with desaturation

Description

BACKGROUND OF THE INVENTION

Low-density parity-check (LDPC) codes are a popular choice for stored data, such as data stored on solid state storage. As storage density increases, the number of errors included in the LDPC-encoded data likewise increases. To compensate for this, new and more powerful LDPC decoding techniques would be desirable. Furthermore, it would be desirable if these new LDPC decoding techniques could be easily implemented in existing storage systems, do not consume substantial amounts of resources (e.g., additional processing, memory, power, etc.), and/or do not introduce noticeable (additional) processing delays.

BRIEF DESCRIPTION OF THE DRAWINGS

Various embodiments of the invention are disclosed in the following detailed description and the accompanying drawings.

FIG. 1 is a flowchart illustrating an embodiment of min-sum decoding of low-density, parity-check (LDPC) codes with de-saturation.

FIG. 2A is a diagram illustrating an embodiment of a check node-to-variable node (CN-to-VN) message update.

FIG. 2B is a diagram illustrating an embodiment of a variable node-to-check node (VN-to-CN) message update.

FIG. 3 is a system diagram illustrating an embodiment of a de-saturated min-sum decoding.

DETAILED DESCRIPTION

The invention can be implemented in numerous ways, including as a process; an apparatus; a system; a composition of matter; a computer program product embodied on a computer readable storage medium; and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, these implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.

A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.

Various embodiments of a technique to perform LDPC decoding (e.g., by message passing or more specifically min-sum decoding) with de-saturation are described herein. For example, without de-saturation, LDPC systems that use fixed-point number representation can become trapped in some cases if the messages become saturated relative to the fixed-point number representation. In some embodiments, a saturation metric that represents a degree of saturation in a low-density parity-check (LDPC) decoding system that uses fixed-point number representation is determined. The saturation metric is compared against a saturation threshold and it is determined if the saturation metric exceeds the saturation threshold. If so, LDPC decoding data (e.g., one or more messages in the system) is more aggressively attenuated compared to when the saturation metric does not exceed the saturation threshold at the end of a decoding iteration. If not, the LDPC decoding data is less aggressively attenuated at the end of the decoding iteration. The attenuated message may then be passed from a first type of node to a second type of node in a Tanner Graph, assuming more decoding iterations are required to properly decode the LDPC data.

FIG. 1 is a flowchart illustrating an embodiment of min-sum decoding of low-density, parity-check (LDPC) codes with de-saturation. For example, the process may be performed by an LDPC decoder which processes data which has been encoded using an LDPC code. In various applications, the LDPC-encoded data may have been sent over a (e.g., wireless) communications channel or may have been stored on storage media (e.g., a hard disk drive or solid state storage) before being decoded. As such, the exemplary LDPC decoder may be included in a wireless receiver or a storage read-back module in various embodiments.

At 100 a saturation metric that represents a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation is determined. In one example of step 100, the saturation metric is based on the number or percentage of messages (e.g., prior to any message attenuation) from check nodes to variable nodes that are saturated to the maximum fixed-precision magnitude. Alternatively, messages passed in the other direction (e.g., from variable nodes to check nodes) are used to determine the saturation metric at step 100. For example, l_min1^(j)is the minimum variable node message to check node j and in some embodiments the saturation metric is based at least in part on the number of messages (e.g., across all check nodes j in the system) that are saturated (e.g., to the maximum fixed precision magnitude) at the end of a decoding iteration.

In another example of step 100, the saturation metric is based on a number of saturated nodes in the system. For example, the saturation metric may be the number of saturated check nodes at the end of a decoding iteration. For example, check node j may be declared saturated if L_min1^(j)(i.e., the minimum attenuated variable node message to check node j where l_min1^(j)is the message before attenuation and L_min1^(j)is the message after attenuation) reaches the maximum fixed-precision magnitude.

At 102, the saturation metric is compared against a saturation threshold. In one example, the saturation metric is a percentage and the saturation threshold is within a range of 3/4 (75%)-15/16 (93.75%).

At 104, it is determined if the saturation metric exceeds the saturation threshold. If so, at the end of a decoding iteration, a message is more aggressively attenuated compared to when the saturation metric does not exceed the saturation threshold in order to produce an attenuated message at 106. If not, at the end of a decoding iteration, the message is less aggressively attenuated compared to when the saturation metric does exceed the saturation threshold in order to produce the attenuated message at 108.

In one example of step 106, the messages that are attenuated are the l_min1^(j)and l_min2^(j)messages, which are the minimum and second minimum variable node messages to check node j, respectively, before any attenuation or normalization. In this example of step 106, they are more aggressively attenuated using de-saturation attenuation factor α_dsand de-saturation attenuation rounding β_ds(e.g., l_min1^(j)=└α_ds·l_min1^(j)+β_ds┘ and L_min2^(j)=└α_ds·l_min2^(j)+β_ds┘).

In a corresponding example of step 108, the same messages (i.e., l_min1^(j)and l_min2^(j)) are less aggressively attenuated using attenuation factor α and attenuation rounding β (e.g., L_min1^(j)=└α·l_min1^(j)+β┘ and L_min2^(j)=└α·l_min2^(j)+β┘) wherein the desaturation attenuation parameters satisfy the following:

α_ds<α
└α_ds+β_ds┘≤└α+β┘
└α_dsL_max+β_ds┘<└αL_max+β┘

where L_maxdenotes the maximum message magnitude. Furthermore, the parameters α, β, α_ds, β_dscan be optimized through density evolution method.

By more aggressively attenuating the messages if the system is saturated (e.g., as measured or otherwise determined using the saturation threshold), the saturation of correct messages is facilitated and error correction over small trapping sets is facilitated, but only if/when needed (e.g., only when the saturation metric exceeds the saturation threshold). More generally, by de-saturating LDPC data if needed, the performance of the LDPC decoder is improved in cases where the data (e.g., the passed messages) would otherwise have saturated and the decoder would trend towards an uncorrectable state.

The attenuated message (e.g., output by step 106 or 108) may then pass from a first type of node (e.g., a variable node) to a second type of node (e.g., a check node) in a Tanner graph, assuming additional LDPC decoding is required. The second type of nodes in the Tanner graph may then send back updated messages to the first type of nodes and the process of FIG. 1 may be repeated as or if needed.

To provide more context for the decoding technique with de-saturation described herein, it may be helpful to discuss check nodes and variable nodes which perform LDPC decoding by passing messages between the nodes. The following figures describe some such examples.

FIG. 2A is a diagram illustrating an embodiment of a check node-to-variable node (CN-to-VN) message update. In the example shown here, each check node (e.g., c_j(200a)) in the graph receives log-likelihood ratio (LLR) information (e.g., L_i→j(202a)) from all of its neighboring variable nodes (e.g., v_i(204a)). Each check node generates an updated check-to-variable message (e.g., L_i→j(206a)) for a given variable node using the inputs from all other neighboring variable nodes (not shown here).

Low-density parity-check (LDPC) codes are a type of error correction codes and are so named because they have a very low density of 1s in their parity-check matrix (H). This property makes it convenient to represent an LDPC code using a bipartite graph, called a Tanner Graph. There are two types of nodes in a Tanner graph: variable nodes (VNs) and check nodes (CNs). In the example shown here, v_i(204a) is an example of a variable node and c_j(200a) is an example of a check node. Each variable node (or check node) corresponds to a column (or row) of the parity-check matrix, H. As used herein, V={v₁, . . . , v_n} is the set of variable nodes and C={c₁, . . . , c_m} is the set of check nodes. Each row of H is indexed by C=(1, 2, . . . , m) and each column of H is indexed by V=(1, 2, . . . , n). In the Tanner graph, variable node v_iis connected to check node c_jvia an edge if H_j,i=1 and the set of edges on the Tanner graph is denoted by set E.

Quasi-cyclic low-density parity-check (QC-LDPC) codes are a special class of the LDPC codes with structured H matrix which can be generated by the expansion of an m_b×n_bbase matrix. Each 1s element in the base matrix can be expanded by a circularly right-shifted b×b identity sub-matrix. QC-LDPC codes have advantages over other types of LDPC codes in terms of hardware implementations on both the encoding and decoding side. Encoding of a QC-LDPC code can be (more) efficiently implemented (e.g., in hardware) using simple shift registers. In a hardware implementation of a QC-LDPC decoder, the QC structure of the code simplifies the wire routing for message passing.

It may be helpful to describe message passing in more detail. The following figure shows an example of a variable node-to-check node message update, which is part of message passing (i.e., a type of LDPC decoding).

FIG. 2B is a diagram illustrating an embodiment of a variable node-to-check node (VN-to-CN) message update. In the example shown here, each variable node (e.g., v_i(204b)) in the graph receives LLR information (e.g., (206b)) from all of its neighboring check nodes (e.g., c_j(200b)). Each variable node (e.g., v_i(204b)) generates an updated variable-to-check message (e.g., L_i→j(202b)) to a given check node (e.g., c_j(200b)) using the inputs from all other neighboring check nodes (not shown).

Message-passing (MP) is an efficient technique to achieve near-optimal decoding of LDPC codes. For notational conciseness, a variable node is referred to subsequently simply as i (instead of v_i) and j (instead of c_j) is used to denote a check node. As shown in this example, a variable node i (204b) receives an input message L_i^ch(210) from the channel. For example, this message from the channel may be the log-likelihood ratio (LLR) of the corresponding channel output, defined as follows:

$\begin{matrix} L_{i}^{ch} = \log (\frac{\Pr (R_{i} = r_{i} | c_{i} = 0)}{\Pr (R_{i} = r_{i} | c_{i} = 1)}), & (1) \end{matrix}$

where c_i∈{0,1} is the code bit and r_iis the corresponding received symbol.

A conventional iterative message passing decoder alternates between two phases: a CN-to-VN phase (during which check nodes send messages to their adjacent variable nodes) and a VN-to-CN phase (during which variable nodes send messages to check nodes along their adjacent edges) which are depicted schematically in FIGS. 2A and 2B, respectively. In the initialization step of the decoding process, variable node i forwards the same message to all of its neighboring check nodes V(i), namely the LLR L_i^chderived from the corresponding channel output. In the CN-to-VN message update phase, check node j uses the incoming messages and check node update rule to compute and forward, to variable node i∈C(j), a new CN-to-VN message, L_j→iVariable node i then processes its incoming messages according to the variable node update rule and forwards to each adjacent check node, C(i), an updated VN-to-CN message, L_i→j. After a pre-specified number of iterations, variable node i sums all of the incoming LLR messages to produce an estimate of the corresponding code bit i. Note that all of the CN-to-VN message updates can be done in parallel, as can all of the VN-to-CN message updates. This enables efficient, high-speed software and/or hardware implementations of iterative message-passing decoding processing.

Let L_i→jand L_j→irepresent the messages sent from variable node i to check node j and from check node j to variable node i, respectively. Let C(i) be the set of check nodes directly connected to variable node i and V(J) be the set of variable nodes directly connected to check node j. Then, the message sent from variable node i to check node j in sum-product decoding is given by:

$\begin{matrix} L_{i \to j} = L_{i}^{ch} + \sum_{j^{'} \in C (i) \ j} L_{j^{'} \to i}, & (2) \end{matrix}$

and the message from check node j to variable node i is computed as:

$\begin{matrix} L_{j \to i} = 2 \tanh^{- 1} (\prod_{i^{'} \in 𝒱 (j) \ i} \tanh \frac{L_{i^{'} \to j}}{2}) . & (3) \end{matrix}$

Let P_ibe a posterior probability (APP) message of variable node i where:

$\begin{matrix} P_{i} = L_{i}^{ch} + \sum_{j^{'} \in 𝒞 (i)} L_{j^{'} \to i} . & (4) \end{matrix}$

In this example, a variable node receives the log-likelihood ratios of received information from the channel as an initial input message (i.e., L_i→j=L_i^ch) and the following equivalent check node update rule is employed:

$\begin{matrix} L_{j \to i} = [\prod_{i^{'} \in 𝒱 (j) \ i} sign (L_{i^{'} \to j})] \cdot ⌊ α \cdot \min_{i^{'} \in 𝒱 (j) \ i} \langle L_{i^{'} \to j} \rangle + β ⌋, & (5) \end{matrix}$

where 0<α<1, β>0 is the attenuation factor and attenuation rounding, respectively, which can be either pre-fixed or dynamically adjusted. Herein we newly introduced the attenuation rounding parameter β, which satisfies the following:

1≤α+β<2

which prevents a minimum CN-to-VN message of 1 from being attenuated to zero (which would erase any information contained therein).

It is noted that channel LLR inputs may be conveniently scaled for min-sum decoding but preferably are precise for the original sum-product decoding. With that in mind, the following notations are used to simplify the above calculation. Let:

S_i→j custom character sign(L_i→j). (7)

Let S^(j)be the product sign of all variable nodes i to the check node j:

$\begin{matrix} S^{(j)} \overset{Δ}{=} \prod_{i^{'} \in 𝒱 (j)} S_{i^{'} \to j} . & (8) \end{matrix}$

Let l_min1^(j)and i_min1^(j)be the minimum variable node message to check node j and its associated index, respectively:

$\begin{matrix} l_{\min 1}^{(j)} \overset{Δ}{=} \min_{i^{'} \in 𝒱 (j)} \langle L_{i^{'} \to j} \rangle, i_{\min 1}^{(j)} \overset{Δ}{=} \arg \min_{i^{'} \in 𝒱 (j)} \langle L_{i^{'} \to j} \rangle & (9) \end{matrix}$

and let l_min2^(j)be the second minimum variable node message to check node j:

$\begin{matrix} l_{\min 2}^{(j)} \overset{Δ}{=} \min_{i^{'} \in 𝒱 (j) \ l_{\min 1}^{(j)}} \langle L_{i^{'} \to j} \rangle . & (10) \end{matrix}$

Furthermore, let L_min1^(j)and L_min2^(j)be the attenuated minimum and second minimum variable node message, respectively, to the check node j:

L_min1^(j) custom character └α·l_min1^(j)+β┘,L_min2^(j)└α·l_min2^(j)+β┘ (11)

With the above notations, Equation (5) can be conveniently rewritten as:

$\begin{matrix} L_{j \to i} = S^{(j)} \cdot S_{i \to j} \cdot {\begin{matrix} L_{\min 1}^{(j)}, & if i \neq i_{\min 1}^{(j)} \\ L_{\min 2}^{(j)}, & if i = i_{\min 1}^{(j)} \end{matrix} . & (12) \end{matrix}$

Pseudocode 1 describes a hardware amenable min-sum decoding example along these lines. It is noted that Pseudocode 1 does not de-saturate (e.g., unlike FIG. 1 and other embodiments described herein) and so Pseudocode 1 is vulnerable to becoming trapped in an uncorrectable state if the passed messages were to saturate at the maximum fixed-precision magnitude.

Pseudocode 1: Example of Flooded Min-Sum Decoding Without De-Saturation

Initialization: L_min1^(j)= L_min2^(j)= 0, ∀j ∈ custom character

Iteration:

1: l_min1^(j)= l_min2^(j)= ∞, i_min1^(j)= ∅, S_j= 0, ∀j ∈ custom character

2: for ∀i ∈ custom character

, do

3: for ∀j ∈ custom character

(i), do

4: Read (old) {S^(j), i_min1^(j), L_min1^(j), L_min2^(j)}

5:

Compute L_{j \to i} \leftarrow {\begin{matrix} S^{(j)} \cdot S_{i \to j} \cdot L_{\min 1}^{(j)}, if i \neq i_{\min 1}^{(j)} \\ S^{(j)} \cdot S_{i \to j} \cdot L_{\min 2}^{(j)}, if i = i_{\min 1}^{(j)} \end{matrix}

6: end for

7: Compute P_i← L_i^ch+ Σ_j∈C(i)L_j→i

8: for ∀j ∈ custom character

(i), do

9: Compute L_i→j← P_i− L_j→i

10: Store (new) S_i→j← sign(L_i→j)

11: Compute (new) S^(j)← S^(j)⊕ S_i→j

12: Compute (new) {i_min1^(j), l_min1^(j), l_min2^(j)} ← {i_min1^(j), l_min1^(j), l_min2^(j), {i, |L_i→j|}}

13: end for

14: end for

15: Compute syndrome sign([P₁, P₂, ..., P_n]) · H^T. If 0 then return the codeword sign([P₁, P₂, ..., P_n]).

16: Normalize L_min1^(j)= └α · l_min1^(j)+ β┘, L_min2^(j)= └α · l_min2^(j)+ β┘, ∀ j ∈ custom character

When a QC-LDPC code with b×b circulants is in use, each circulant of b bits is updated independently and in parallel.

The paper “Propagation of LLR saturation and quantization error in LDPC min-sum iterative decoding” by KANISTRAS et al. (which does not describe the de-saturation approach described in FIG. 1) investigated the theoretical aspect of saturation effect but did not propose any de-saturation technique.

In “Quantized iterative message passing decoders with low error floor for LDPC codes” by ZHANG et al., a new non-uniform quantization method was proposed to extend the message quantization range by using an exponentially increased step size for large magnitudes while keeping a fixed step size for small magnitudes. However, the proposed exponentially increased step size design is difficult to implement in hardware. In contrast, the de-saturation decoding technique described in FIG. 1 is amenable to hardware implementation and does not use an exponentially increased vs. fixed step size.

In U.S. Pat. No. 9,755,666 by Yingquan Wu, CN-to-VN messages are halved (e.g., by the variable node in the middle of a decoding iteration) if a significant fraction of VN-to-CN messages are saturated. For example, this would correspond to having a new line between line 5 and line 6 in Pseudocode 1 (not shown) where is halved (i.e., set to L_j→ito L_j→i/2) if some saturation condition is flagged. However, halving the CN-to-VN messages at that stage (i.e., in the middle of a VN-to-CN message update) as opposed to at the end of a decoding layer/iteration results in longer critical (e.g., rate limiting) path and thus slower clock speed. For example, there may be a critical timing path that begins with reading or otherwise inputting some piece of data or variable (e.g., at line 4 in Pseudocode 1) and ending with computing new values for that piece of data or value (e.g., at lines 12 and/or 13 in Pseudocode 1) and the older technique of halving L_j→i(e.g., which would occur between lines 5 and 6 in Pseudocode 1 but which is not shown there) introduces additional delay into a critical timing path. Furthermore, halving is a crude way of attenuating information and it may not be necessary to apply that much attenuation even if saturation is detected in the system. By using two de-saturation attenuation parameters (e.g., α_dsas well as β_ds), sufficient attenuation can be achieved without losing as much information as halving does. In one example,

$α_{ds} = \frac{3}{4} α and β_{ds} = \frac{3}{4} β .$

If iterative message-passing decoding is implemented in hardware, the decoding efficiency can be improved using a layered decoding approach. In layered decoding, check node messages are updated serially. That is, instead of sending all messages from variable nodes to check nodes, and then all messages from check nodes to variable nodes (i.e., flooding), the layered coding goes through the check nodes in sequential order such that, to each check node being updated, all messages are sent in and processed, and then sent out to neighboring variable nodes. Such scheduled serial updating of check nodes enables immediate propagation of the newly updated message, unlike the flooded scheme where the updated messages can propagate only in the next iteration.

As a result, layered decoding improves convergence speed by roughly twice compared to that of a flooded implementation. Moreover, it provides a good trade-off between speed and memory. This is achieved by iterating over dynamic CN-to-VN messages, denoted by Q custom character [Q₁, Q₂, . . . , Q_n]. Specifically, let variable node i∈V(j), then Q_iover a layer j is defined as:

$\begin{matrix} Q_{i}^{(j)} \overset{Δ}{=} L_{j \to i} = L_{i}^{ch} + \sum_{j^{'} \in 𝒞 (i) \ j} L_{j^{'} \to i}^{(last)}, & (13) \end{matrix}$

where the superscript ^(last)denotes the latest updated. It is worth noting that, in layered decoding, the VN-to-CN message updated at the last layer (all but the last are from the current iteration) is utilized to update the CN-to-VN Q_iin the current layer, whereas in the flooded decoding updating a CN-to-VN message L_j→iutilizes the VN-to-CN messages each generated at the last iteration. The Q custom character [Q₁, Q₂, . . . , Q_n] memory is initialized with the channel messages L^ch[L₁^ch, L₂^ch, . . . , L_n^ch] and no dedicated memory is needed to store L^ch, whereas with flooded decoding, L^chis stored but not Q. Q_i, i=1, 2, . . . , n, is iteratively calculated as follows. Let j be the current layer and j_ibe the preceding layer associated with variable node i. A preceding layer is mathematically declared or otherwise defined as follows. Let j₁<j₂< . . . <j_kbe all check nodes directly connected to variable node i, then j_lis the preceding layer of j_l+1for l=1, 2, . . . , k−1, and j_kis the preceding layer of j_l.

The APP (i.e., a posterior probability) message P_iat the layer j is calculated as:

P_i^(j)=Q_i^(jⁱ⁾+L_j_i_→i^new (14)

where L_j_i_→i^newis newly updated and Q_iis iteratively updated by:

Q_i^(j)=P_i^(j)−L_j→i^old, (15)

where L_j_i_→i^oldwas saved during the preceding iteration. The layered decoding can be applied to all types of iterative message-passing decoding, including SAP and min-sum decoding. A hardware amenable layered min-sum decoding process is described below in Pseudocode 2. It is noted that Pseudocode 2 does not check for de-saturation and perform de-saturation on the LDPC data, if needed (see, e.g., FIG. 1) and so Pseudocode 2 is vulnerable to becoming trapped in an uncorrectable state if the passed messages were to saturate at the maximum fixed-precision magnitude.

Pseudocode 2: Example of Layered Min-Sum Decoding Without De-Saturation

Initialization: L_min1^(j)= L_min2^(j)= 0, ∀j ∈ custom character

; Q_i= L_i^ch, ∀i ∈ custom character

;

=0

Iteration:

1: for custom character

=0, 1, 2 ..., m − 1 do

2: for ∀j ∈ custom character

do

3: l_min1^(j)= l_min2^(j)= ∞, i_min1^(j)= ∅, S_j= 0

4: for ∀i ∈ custom character

(j) do

5: Read (new) {S^(j_i⁾, i_min1^(j_i⁾, L_min1^(j_i⁾, L_min2^(j_i⁾} where j_iis the preceding layer of VN i

6:

Compute L \leftarrow {\begin{matrix} S^{(j_{i})} \cdot S_{i \to j_{i}} \cdot {Li}_{\min 1}^{(j_{i})}, if i \neq i_{\min 1}^{(j_{i})} \\ S^{(j_{i})} \cdot S_{i \to j_{i}} \cdot {Li}_{\min 2}^{(j_{i})}, if i = i_{\min 1}^{(j_{i})} \end{matrix}

7: Compute P_i← Q_i+ L_j_i_→i^new

8: Read (old) {S^(j), i_min1^(j), L_min1^(j), L_min2^(j)}

9:

Compute L_{j \to i}^{old} \leftarrow {\begin{matrix} S^{(j)} \cdot S_{i \to j} \cdot L_{\min 1}^{(j)}, if i \neq i_{\min 1}^{(j)} \\ S^{(j)} \cdot S_{i \to j} \cdot L_{\min 2}^{(j)}, if i = i_{\min 1}^{(j)} \end{matrix}

10: Compute Q_i ← P_i− L_j→i^old.

11: Store S_i→j= sign(Q_i)

12: Compute (new) S^(j)← S^(j)⊕ S_i→j

13: Compute (new) {i_min1^(j), l_min1^(j), l_min2^(j)} ← {i_min1^(j), l_min1^(j), l_min2^(j), {i, |Q_i|}}

14: end for

15: Compute syndrome sign([P₁, P₂, ..., P_n]) · H^T. If 0, then return the codeword sign([P₁, P₂, . . . , P_n]).

16: end for

17: Normalize L_min1^(j)= └α · l_min1^(j)+ β┘, L_min2^(j)= └α · l_min2^(j)+ β┘, j ∈ custom character

18: end for

When a QC-LDPC code with b×b circulants is in use, b quasi-cyclic rows of H are (naturally) treated as a layer. That is, a layer contains b check nodes, each being updated independently and in parallel. It is noted that convergence may occur within any layer for layered min-sum decoding (one example of which is shown in Pseudocode 2) whereas the convergence must occur at the end of an iteration for the flooded min-sum decoding (one example of which is shown in Pseudocode 1). Moreover, layered decoding enables or otherwise permits utilization of updated CN-to-VN messages within an iteration whereas it is not possible for flooded decoding. Consequently, layered decoding converges roughly twice as fast as flooded decoding.

As described above, in one alternate approach to FIG. 1, values are halved in the middle of a layered decoding iteration in a crude attempt to de-saturate. For example, in Pseudocode 2, this would include a new line (not shown) between lines 7 and 8 which would halve P_iif I_dsequals one and variable node i is being visited or iterated through for the first time. There would also be a new line (not shown) in Pseudocode 2 between lines 9 and 10 which would halve L_j→i^old. However, as described above, there are drawbacks to doing the de-saturation in the middle of CN-to-VN message update and halving the messages may attenuate the message to an unnecessary degree (e.g., just a little more attenuation than would otherwise or normally be applied is sufficient).

During simulations with Pseudocode 1 and 2 (or the like), it was observed that the range of messages passed between variable nodes and check nodes in the decoder has direct impact on the decoding performance in terms of both converge speed and error rate. When fixed-point magnitude was not enforced, correct messages typically grew faster than incorrect messages, with most errors due to small trapping sets correctable. However, given limited precision in practice (e.g., five bits of representation), after a certain number of iterations, messages tended to saturate to the maximum fixed-point magnitude. In such scenarios, correct messages are not able to outweigh incorrect messages, and the message in passing is gradually downgraded to bipolar messages.

From this observation, new and improved adaptive quantization methods have been developed (see, e.g., FIG. 1). In some embodiments, to expand the range of represented values by message index, the messages are scaled down after (or if) certain criterion is met. For example, but not limited to, if at the end of an iteration the number of saturated check nodes, denoted by θ_C, is greater than a pre-defined threshold, denoted by Θ, then all CN-to-VN messages in the decoder are more aggressively attenuated which effectively ameliorates the saturation during the next iteration. Herein, a check node j is declared saturated if L_min1^(j)reaches the maximum finite-precision magnitude.

This enables an effective increase in the quantization range without (meaningfully) increasing complexity or memory. Pseudocode 3 shows an example of flooded decoding with de-saturation.

Pseudocode 3: Example of De-saturated Flooded Min-Sum Decoding

Initialization: L_min1^(j)= L_min2^(j)= 0, ∀j ∈ custom character

Iteration:

1: l_min1^(j)= l_min2^(j)= ∞, i_min1^(j)= ∅, S_j= 0, ∀j ∈ custom character

2: for ∀i ∈ custom character

, do

3: for ∀j ∈ custom character

(i), do

4: Read (old) {S^(j), i_min1^(j), L_min1^(j), L_min2^(j)}

5:

Compute L_{j \to i} \leftarrow {\begin{matrix} S^{(j)} \cdot S_{i \to j} \cdot L_{\min 1}^{(j)}, if i \neq i_{\min 1}^{(j)} \\ S^{(j)} \cdot S_{i \to j} \cdot L_{\min 2}^{(j)}, if i = i_{\min 1}^{(j)} \end{matrix}

6: end for

7: Compute P_i← L_i^ch+ Σ_j∈C(i)L_j→i

8: for ∀j ∈ custom character

(i), do

9: Compute L_j→i← P_i− L_j→i

10: Store (new) S_i→j← sign(L_i→j)

11: Compute (new) S^(j)← S^(j)⊕ S_i→j

12: Compute (new) {i_min1^(j), l_min1^(j), l_min2^(j)} ← {i_min1^(j), l_min1^(j), l_min2^(j), {i, |L_i→j|}}

13: end for

14: end for

15: Compute syndrome sign([P₁, P₂, ..., P_n]) · H^T. If 0 then return the codeword sign([P₁, P₂, . . . , P_n]).

16: If the number of saturated l_min1is greater than Θ, then set I_ds= 1, otherwise 0.

17: If I_ds= 0 then normalize L_min1^(j)= └α · l_min1^(j)+ β┘, L_min2^(j)= └α · l_min2^(j)+ β┘, otherwise de-saturate L_min1^(j)=

└α_ds· l_min1^(j)+ β_ds┘, L_min2^(j)= └α_ds· l_min2^(j)+ β_ds┘, ∀j ∈ custom character

Pseudocode 3 shows one example of how the process of FIG. 1 may be performed. Although not explicitly stated in Pseudocode 3, the process may end if the syndrome equals 0 and the codeword is returned at line 15 (e.g., because decoding has successfully completed and there is no need to continue). As described above, it may be beneficial to do the de-saturation (e.g., more aggressively attenuate a message or other data) at the end of the iteration because it does not affect the existing pipelining stages and critical (timing) path. Also, the de-saturation parameters α_dsand β_dsmay be able to achieve a better de-saturation compared to crudely halving values (e.g., which may attenuate the messages or data to an unnecessary degree).

The following figure shows an example system diagram which performs the process of Pseudocode 3.

FIG. 3 is a system diagram illustrating an embodiment of a de-saturated min-sum decoding. In the example shown, the system diagram is applicable to both de-saturated flooded min-sum decoding as well as de-saturated layered min-sum decoding. In this example, a (global) saturation monitor (300) generates a (global) de-saturation signal (e.g., I_ds) which is passed to all of the variable nodes in the system, including the variable node shown here (304). For example, the I_dssignal referred to in lines 16 and 17 of Pseudocode 3 corresponds to this signal.

The de-saturation signal controls a plurality of multiplexers, including the multiplexer shown here (302). If the de-saturation signal equals zero, then the multiplexer (302) selects the less aggressive attenuation parameters of α and β to pass on to the attenuator block (308) which uses the selected attenuation parameters to attenuate messages (e.g., l_min1^(j)and l_min2^(j)). If the de-saturation signal instead equals one, then the multiplexer (302) selects the more aggressive attenuation parameters of α_dsand β_dsto output to the attenuator (308). In some embodiments, α_dsis within a range of

$\frac{1}{2} α and \frac{3}{4} α$

and β_dsis within a range of

$\frac{1}{2} β and \frac{3}{4} β .$

The attenuated messages which are output by the attenuation block (308) are then passed from the variable node i (304) to the check node j (306) as VN-to-CN messages (e.g., L_min1^(j)and L_min2^(j)) assuming another iteration of decoding needs to be performed (e.g., because the syndrome is not all zeros which indicates that LDPC decoding has not yet successfully completed).

Returning briefly to FIG. 1, the saturation monitor (300) is one example of a device that performs steps 100 and 102 in FIG. 1 and the multiplexer (302) and attenuator (308) are examples of devices that perform steps 104, 106, and 108 in FIG. 1. It is noted that this system diagram is merely exemplary and is not intended to be limiting. For example, it is logically equivalent to have two attenuation blocks (e.g., one that outputs less aggressively attenuated messages and one that outputs more aggressively attenuated messages) and have the multiplexer follow the attenuation blocks.

As described above, unlike some other techniques (e.g., a system which alternates between an exponentially increased step size and fixed step size), adding a saturation monitor (300) and a plurality of multiplexers (e.g., the multiplexer (302) shown here) to an existing (min-sum) LDPC decoding system is relatively easy to implement in hardware.

For completeness, Pseudocode 4 shows an example of layered min-sum decoding with de-saturation.

Pseudocode 4: Example of De-saturated Layered Min-Sum Decoding

Initialization: L_min1^(j)= L_min2^(j)= 0, ∀j ∈ custom character

; Q_i= L_i^ch, ∀i ∈ custom character

;

=0; I_ds= 0

Iteration:

1: for custom character

=0, 1, 2, ..., m − 1 do

2: for ∀j ∈ custom character

do

3: l_min1^(j)= l_min2^(j)= ∞, i_min1^(j)= ∅, S_j= 0

4: for ∀i ∈ custom character

(j) do

5: Read (new) {S^(j_i⁾, i_min1^(j_i⁾, L_min1^(j_i⁾, L_min2^(j_i⁾} where j_iis the preceding layer of VN i

6:

Compute {j_{i} \to i}_{new} \leftarrow {\begin{matrix} S^{(j_{i})} \cdot S_{i \to j_{i}} \cdot L_{\min 1}^{(j_{i})}, if i \neq i_{\min 1}^{(j_{i})} \\ S^{(j_{i})} \cdot S_{i \to j_{i}} \cdot L_{\min 2}^{(j_{i})}, if i = i_{\min 1}^{(j_{i})} \end{matrix}

7: Compute P_i← Q_i+ L_j_i_→i^new

8: Read (old) {S^(j), i_min1^(j), L_min1^(j), L_min2^(j)}.

9:

Compute L_{j \to i}^{old} \leftarrow {\begin{matrix} S^{(j)} \cdot S_{i \to j} \cdot L_{\min 1}^{(j)}, if i \neq i_{\min 1}^{(j)} \\ S^{(j)} \cdot S_{i \to j} \cdot L_{\min 2}^{(j)}, if i = i_{\min 1}^{(j)} \end{matrix}

10: Compute Q_i ← P_i− L_j→i^old.

11: Store S_i→j= sign(Q_i)

12: Compute (new) S^(j)← S^(j)⊕ S_i→j

13: Compute (new) {i_min1^(j), l_min1^(j), l_min2^(j)} ← {i_min1^(j), l_min1^(j), l_min2^(j), {i, |Q_i|}}

14: end for

15: Compute syndrome sign([P₁, P₂, ..., P_n]) · H^T. If 0 then return the codeword sign([P₁, P₂, ..., P_n]).

16: end for

17: If I_ds= 0 then normalize L_min1^(j)= └α · l_min1^(j)+ β┘, L_min2^(j)= └α · l_min2^(j)+ β┘, otherwise de-saturate

L_min1^(j)= └α_ds· l_min1^(j)+ β_ds┘, L_min2^(j)= └α_ds· l_min2^(j)+ β_ds┘, j ∈ custom character

.

18: If custom character

= 0 and overall number of saturated l_min1is greater than a pre-determined threshold Θ,

then set I_ds= 1, otherwise 0.

19: end for

Pseudocode 4 shows one example of how the process of FIG. 1 may be performed. Although not explicitly described in the pseudocode, the process of Pseudocode 4 may end after line 15 if the syndrome equals 0 (which indicates that LDPC decoding has successfully completed and therefore further processing is unnecessary). And as with Pseudocode 3, the attenuation in Pseudocode 4 is done at the end of the iteration (e.g., at line 17) using the parameters α_dsand β_dswhich may be beneficial in at least some applications.

Although some of the examples described herein show flooded min-sum decoding and layered min-sum decoding, the techniques described herein may be extended to other variants of message-passing decoding of LDPC codes (e.g., shuffled decoding which has efficiency between that of flooded decoding and layered decoding).

In some embodiments, the decoder selects the degree of de-saturation applied (e.g., at step 106 in FIG. 1). For example, in some cases, it may be desirable to more aggressively attenuate a message with de-saturation parameters of

$α_{ds} = \frac{3}{4} α and β_{d s} = \frac{3}{4} β$

at step 106 in FIG. 1 whereas in other cases it may be more desirable to use de-saturation parameters of

$α_{ds} = \frac{1}{2} α and β_{ds} = \frac{1}{2} β$

at step 106 because a larger degree of de-saturation and/or attenuation is called for given the state of the system.

In some embodiments, such a selection of de-saturation parameters (e.g., for use at step 106 in FIG. 1) is done by density evolution optimization. For example, density evolution is used to analyze the convergence behavior of the min-sum decoder (e.g., the code threshold) for a given LDPC code ensemble under min-sum decoding, where the code threshold is defined as the maximum channel noise level at which the decoding error probability converges to zero as the code length goes to infinity. Density evolution is used to determine the performance of min-sum decoding for a given code ensemble by tracking the probability density function (PDF) of messages passed along the edges in the corresponding Tanner graph through the iterative decoding process. Then, it is possible to test whether, for a given channel condition and a given degree distribution, the decoder can successfully decode the transmitted message (with the decoding error probability tends to zero as the iterations progress). Using density evolution as an evaluation tool, various de-saturation parameters (e.g., any of α, α_ds, β, and/or β_ds) may be tested and optimized values for any of α, α_ds, β, and/or β_dsmay be selected.

Although the foregoing embodiments have been described in some detail for purposes of clarity of understanding, the invention is not limited to the details provided. There are many alternative ways of implementing the invention. The disclosed embodiments are illustrative and not restrictive.

Claims

1. A decoder, comprising: a saturation monitor configured to: generate a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;plurality of variable nodes that is configured to: in the event the global de-saturation control signal is a first value, output a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, output a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value; andthe plurality of check nodes.
2. The decoder recited in claim 1, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
3. The decoder recited in claim 1, wherein the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system.
4. The decoder recited in claim 1, wherein: the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system; anda node is declared saturated if an associated minimum attenuated message after attenuation reaches a maximum fixed-precision magnitude.
5. The decoder recited in claim 1, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
6. The decoder recited in claim 1, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin1(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein αds is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)) including by using Lmin2(j)=└αds·lmin2(j)+βds┘.
7. The decoder recited in claim 1, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α; anda first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘.
8. The decoder recited in claim 1, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α;a first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘; andat least one of α, αds, β, or βds is determined using density evolution.
9. A method, comprising: generating a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;in the event the global de-saturation control signal is a first value, outputting a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, outputting a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value.
10. The method recited in claim 9, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
11. The method recited in claim 9, wherein the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system.
12. The method recited in claim 9, wherein: the degree of saturation is based at least in part on a number of saturated nodes in the LDPC decoding system; anda node is declared saturated if an associated minimum attenuated message after attenuation reaches a maximum fixed-precision magnitude.
13. The method recited in claim 9, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
14. The method recited in claim 9, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin1(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein αds is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)) including by using Lmin2(j)=└αds·lmin2(j)+βds┘.
15. The method recited in claim 9, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α; anda first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤α+β┘.
16. The method recited in claim 9, wherein: a first attenuation factor (α) is used to generate the less aggressively attenuated message and a second attenuation factor (αds) is used to generate the more aggressively attenuated message, wherein αds<α;a first attenuation rounding (β) is used to generate the less aggressively attenuated message and a second attenuation rounding (βds) is used to generate the more aggressively attenuated message, wherein └αds+βds┘≤└α+β┘; andat least one of α, αds, β, or βds is determined using density evolution.
17. A computer program product embodied in a non-transitory computer readable medium and comprising computer instructions for: generating a global de-saturation control signal that is determined based at least in part on a saturation threshold and a degree of saturation in a low-density parity-check (LDPC) decoding system that uses a fixed-point number representation;in the event the global de-saturation control signal is a first value, outputting a more aggressively attenuated message to a plurality of check nodes compared to when the global de-saturation control signal is a second value; andin the event the global de-saturation control signal is the second value, outputting a less aggressively attenuated message to the plurality of check nodes compared to when the global de-saturation control signal is the first value.
18. The computer program product recited in claim 17, wherein the degree of saturation is based at least in part on a number of messages, passed between at least one of the plurality of check nodes and at least one of the plurality of variable nodes, that are saturated to a maximum fixed-precision magnitude.
19. The computer program product recited in claim 17, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message that is based at least in part on a first-most minimum variable node message; andoutputting a second more aggressively attenuated message that is based at least in part on a second-most minimum variable node message.
20. The computer program product recited in claim 17, wherein outputting the more aggressively attenuated message to the plurality of check nodes includes: outputting a first more aggressively attenuated message (Lmin2(j)) that is based at least in part on a first-most minimum variable node message (lmin1(j)), including by using Lmin1(j)=└αds·lmin1(j)+βds┘, wherein is associated with an attenuation factor and βds is associated with an attenuation rounding; andoutputting a second more aggressively attenuated message (Lmin2(j)) that is based at least in part on a second-most minimum variable node message (lmin2(j)), including by using Lmin2(j)=└αds·lmin2(j)+βds┘.

CROSS REFERENCE TO OTHER APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 16/777,457 (now U.S. Pat. No. 10,778,248, issued Sep. 15, 2020) entitled LOW-DENSITY PARITY-CHECK DECODING WITH DE-SATURATION filed Jan. 30, 2020 which is incorporated herein by reference for all purposes.

US Referenced Citations (25)

Number	Name	Date	Kind
5572350	Spanke	Nov 1996	A
7383485	Tran	Jun 2008	B2
7519898	Narayanan	Apr 2009	B2
7583945	McCarthy	Sep 2009	B2
7669106	Farjadrad	Feb 2010	B1
8281210	Farjadrad	Oct 2012	B1
8407556	Shen	Mar 2013	B2
8433984	Khandekar	Apr 2013	B2
8457194	Ali	Jun 2013	B2
8601352	Anholt	Dec 2013	B1
8739002	Nakamura	May 2014	B2
8995863	Moroi	Mar 2015	B2
9048870	Li	Jun 2015	B2
9100052	Pisek	Aug 2015	B2
9246717	Beidas	Jan 2016	B2
9264073	Malmirchegini	Feb 2016	B2
9473175	Graumann	Oct 2016	B1
9571168	Moon	Feb 2017	B2
9612903	Tehrani	Apr 2017	B2
9716602	Beidas	Jul 2017	B2
9755666	Wu	Sep 2017	B2
10164663	Shin	Dec 2018	B2
10236070	Barndt	Mar 2019	B2
20040187129	Richardson	Sep 2004	A1
20080276156	Gunnam	Nov 2008	A1

Non-Patent Literature Citations (7)

Entry
Chen et al., Reduced-Complexity Decoding of LDPC Codes, IEEE Transactions on Communications, Aug. 2005, pp. 1288-1299, vol. 53, No. 8.
Kanistras et al., Propagation of LLR Saturation and Quantization Error in LDPC Min-Sum Iterative Decoding, 2012 EEE Workshop on Signal Processing Systems, pp. 276-281, 2012.
Kim et al., A Reduced-Complexity Architecture for LDPC Layered Decoding Schemes, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Jun. 2011, pp. 1099-1103, vol. 19, No. 6.
Mansour et al., High-Throughput LDPC Decoders, IEEE Transactions on Very Large Scale Integration Systems, Dec. 2003, pp. 976-996, vol. 11, No. 6.
Richardson et al., The Capacity of Low-Density Parity-Check Codes Under Message-Passing Decoding, IEEE Transactions on Information Theory, Feb. 2001, pp. 599-618, vol. 47, No. 2.
Zhang et al., Quantized Iterative Message Passing Decoders with Low Error Floor for LDPC Codes, IEEE Transactions on Communications, Jan. 2014, pp. 1-14, vol. 62, No. 1.
Zhang et al., Shuffled Iterative Decoding, IEEE Transactions on Communications, Feb. 2005, pp. 209-213, vol. 53, No. 2.

Related Publications (1)

	Number	Date	Country
	20210242884 A1	Aug 2021	US

Continuations (1)

	Number	Date	Country
Parent	16777457	Jan 2020	US
Child	16988429		US

Low-density parity-check decoding with desaturation

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

CPC

International Classifications