Embodiments described herein relate generally to methods and apparatus for encoding and decoding using erasure codes and to methods and apparatus for generating erasure codes.
Erasure coding involves the introduction of redundancy in coding schemes which makes it possible to reconstruct data that is lost through erasures.
There are many applications of erasure codes including distributed networks in which erasure codes are used to cope with high packet loss; video streaming over lossy channels; distributed network storage and redundant disk drives.
One application of erasure codes is in distributed sensor networks, for example, the monitoring of energy usage. It is envisaged that the modernisation of the existing energy grid to form the so called ‘Smart Grid’ will allow real-time information exchange between the utility provider and consumers to achieve more efficient energy management. Wireless sensor networks are likely to be employed for monitoring consumers' energy consumption and communicating it to the utility provider.
In the following, embodiments are described, by way of example only, with reference to the accompanying drawings in which:
In an embodiment a method of generating an erasure code for a network comprising a plurality of nodes, wherein the nodes of the network store data symbols and the number of data symbols stored on a first node of the network is different from the number of data symbols stored on a second node of the network, the erasure code indicating the dependance of parity symbols stored by the nodes of the network on the data symbols such that the data symbols stored on nodes of the network can be determined from parity symbols and data symbols stored on other nodes of the network comprises determining from the distribution of data symbols among the nodes, an optimal parity symbol distribution; based on the optimal parity symbol distribution and the distribution of data symbols, selecting a code generation method; and generating the erasure code using the selected code generation method.
In an embodiment selecting a code generation method comprises selecting a code generation method from: a first method comprising forming a first array indicating the dependance of the parity symbols on the data symbols, wherein the parity symbols and data symbols form a rectangular array; a second method comprising adding at least one dummy data symbol to the data symbols and forming a second array indicating the dependance of the parity symbols on the data symbols; and a third method comprising adding at least on dummy parity symbol to the data symbols and forming a third array indicating the dependance of the parity symbols on the data symbols.
In an embodiment the first method and/or the second method comprises generating a row wise erasure code from the first or second array.
In an embodiment the first method comprises arranging parity symbols by adding the parity symbols for each node to the column of the first array containing the data symbols corresponding to that node.
In an embodiment the second method comprises deriving a generator matrix from the second array and removing at least one row from the generator matrix, the at least one row corresponding to the at least one dummy data symbol.
In an embodiment the third method further comprises forming a generator matrix from the third array and puncturing a column from the generator matrix.
In an embodiment determining an optimal parity symbol distribution comprises calculating an optimal distribution of parity symbols such that the optimal distribution of parity symbols has the minimum number of parity symbols required to correct a set number of missing nodes.
In an embodiment a data processing system comprises a processor configured to carry out the methods described above.
In an embodiment a concerntrator node of a wireless network comprises the data processing system.
In one embodiment a decoder for decoding a received set of blocks is disclosed. A first block of the received set of blocks comprises a first number of data symbols and a second block of the received set of blocks comprises a second number, different from the first number, of data symbols, and wherein blocks of the received set of blocks comprise parity symbols. The decoder comprises storage for a coding matrix which is derived from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix; and a processor operable to determine data symbols of at least one erased block from the received set of blocks using the coding matrix.
In an embodiment the coding matrix is the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment the coding matrix is a matrix derived by removing at least one row from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment the coding matrix is a matrix derived by removing at least one row and at least one column from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In one embodiment an encoder for encoding a block of a plurality of blocks of an erasure code is disclosed. A first block of the erasure code comprises a first number of data symbols and a second block of the erasure code comprises a second number, different from the first number, of data symbols. The encoder is configured to set a plurality of parity symbols of the block using combinations of data symbols of other blocks of the plurality of blocks selected according to a column of a coding matrix which is derived from the Kronecker product of a totally non-singular matrix with an antidiagonal matrix.
In an embodiment the coding matrix is the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment the coding matrix is a matrix derived by removing at least one row from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment the coding matrix is a matrix derived by removing at least one row and at least one column from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment a method of decoding a received set of blocks is disclosed. Blocks of the received set of blocks comprise a plurality of data symbols and a plurality of parity symbols, and wherein a first block of the received set of blocks comprises a first number of data symbols and a second block of the received set of blocks comprises a second number, different from the first number, of data symbols, the received set of blocks being a subset of a complete set of blocks, the complete set of blocks comprising at least one erased block not included in the received set of blocks. The the method comprises determining the data symbols of the at least one erased block using a coding matrix which is derived from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
In an embodiment a method of encoding a block of a plurality of blocks of an erasure code is disclosed. A first block of the erasure code comprises a first number of data symbols and a second block of the erasure code comprises a second number, different from the first number, of data symbols. The method comprises setting the parity symbols of the block using combinations of data symbols of other blocks of the plurality of blocks selected according to a column of an encoding matrix which is derived from the Kronecker product of a totally nonsingular matrix with an antidiagonal matrix.
Embodiments provide a computer program product comprising computer executable instructions which, when executed by a processor, cause the processor to perform methods as set out above. The computer program product may be embodied in a carrier medium, which may be a storage medium or a signal medium. A storage medium may include optical storage means, or magnetic storage means, or electronic storage means.
The described embodiments can be incorporated into a specific hardware device, a general purpose device configured by suitable software, or a combination of both. Aspects can be embodied in a software product, either as a complete software implementation, or as an add-on component for modification or enhancement of existing software (such as a plug in). Such a software product could be embodied in a carrier medium, such as a storage medium (e.g. an optical disk or a mass storage memory such as a FLASH memory) or a signal medium (such as a download). Specific hardware devices suitable for the embodiment could include an application specific device such as an ASIC, an FPGA or a DSP, or other dedicated functional hardware means. The reader will understand that none of the foregoing discussion of embodiment in software or hardware limits future implementation of the invention on yet to be discovered or defined means of execution.
The monitoring of energy usage can be very frequent, for example in the order of seconds, whereas the data gathering by the concentrator node is less frequent, for example in the order of hours or days. This means that data is accumulated and stored at the houses, that is, the nodes in the network and released to the concentrator node upon request. However if all communication is wireless, houses might only be intermittently connected to the concentrator node. At a given time, there is a chance that no reliable communication link is available. In that case, it would be desirable to obtain the data from all nodes by communicating with only a subset of them. This is possible if redundancy is introduced in the network, i.e., the data from one node is stored at one or more other nodes. With an adequate scheme, it could be possible to retrieve the data from all n nodes from only k of them. Another advantage could be that the concentrator node only needs to contact k nodes, even if communication with all nodes is possible.
This can be achieved by employing erasure coding, which makes it possible to reconstruct all the data even if some observations are erased. This is the case if some houses are not able to communicate with the concentrator node. A suitable code is a vertical array code, which is illustrated in
The way the erasure code works in the context of the sensor network is that each node in the network would generate its own data indicating a reading of, e.g., energy consumption. The node would then share the data with the other nodes in order to have copies of the data in case the node would fail.
Note that all data from one node could simply be copied in full to r other nodes; since the data would then exist in r+1 nodes, the network can cope with r of them failing as at least one copy would survive. However this is an inefficient way of being resilient to node failures; better codes can be designed that can achieve the same level of erasure correction but with much less redundancy and storage. When the concentrator node needs the data from the network, it suffices to connect to k of the n nodes; due to the erasure code the data from the n-k failed nodes can be reconstructed. This is done by using the known dependencies of the data on the parity symbols in the k surviving nodes.
If the array code is applied to, e.g., a sensor network, the data rates might be different since each node in the network act as an independent source. In this case a code must be designed that can cope with multiple irregular sources.
The meter module 310 is configured to monitor energy usage. The communications module is coupled to an antenna 325 and can transmit and receive signals over a wireless network using a wireless protocol such as IEEE802.11n. The storage 340 includes code storage 342, storage for meter data 344 and storage for data received from other nodes 346. The storage may be implemented as volatile memory, non-volatile memory or a combination of both.
In an embodiment, the code storage 342 stores an indexed set of erasure codes and the communications module 320 is operable to receive an indication of an erasure code from the set from the concentrator node.
In an embodiment, the code storage stores an erasure code or a part of a coding matrix which is received over the wireless network from the concentrator node.
The process by which the indication of the erasure code is selected is described in more detail below with reference to
The processor is operable to generate erasure code blocks encoded using the erasure codes described above which have data symbols indicating the meter data 344 and parity blocks determined from the data from other nodes 346.
In an embodiment the code storage 442 of the concentrator node 400 stores an indexed set of erasure codes and the processor is operable to select one of the indexed erasure codes during an initialisation process.
In an embodiment the processor is operable to generate an erasure code during an initialisation process.
The initialisation process is described in more detail below with reference to
In step S502 the concentrator node 400 determines the number of nodes in the network, the data rates and the connections between them.
In step S504 the processor 430 of the concentrator node 400 generates a code based on the determined number of nodes. As discussed above, the methods described herein allow array codes that have the smallest overhead and lowest possible density to be constructed for any number of nodes and that are tolerant of any number of failures. The number of failures may be selectable or may be preset.
In step S506 the generated code is sent to the nodes using the communication module 420 of the concentrator node 400. The code may be sent by sending a coding matrix A to all of the nodes. In an embodiment, the concentrator node sends just the required columns of the coding matrix to each of the nodes.
In step S602 the concentrator node 400 determines the number of nodes in the network and the connections between them.
In step S604, the processor selects a stored code based on the number of nodes determined in step S602 and determines an index value indicating the selected code.
In step S606, the concentrator node sends the index value indicating the selected code to the nodes of the network. The nodes of the network also store the indexed set of codes so the selected code can be obtained by the nodes using the index.
In step S702 the node 300 receives a code generated by the concentrator node 400. The code is generated by the concentrator node 400 according to the method described above with reference to
In step S704 the node 300 stores the received code in the code storage 342 of the node 300. As discussed above, the node may store just the required parts of the coding matrix A.
In step S802 the node 300 receives an indication of a code selected by the concentrator node 400. The indication indicates a code stored in the code storage 342 of the node 400. The indication of the code is generated by the concentrator node 400 according to the method described above with reference to
In step S804 the node 300 uses the indication to determine which code stored in the code storage 342 to use when encoding.
In step S902 the meter module 310 monitors energy usage and stores data indicating the energy usage in as meter data 344 in the storage 340.
In step S904 the communication module 320 multicasts the meter data 344 stored in the storage 340 to neighbouring nodes.
In step S906 the communication module 320 receives data from other nodes. The received data is stored as data received from other nodes 346 in the storage 340.
Periodically, or in response to a request from the concentrator node, the nodes of the network generate blocks using the stored meter data and the stored data received from other nodes.
In an embodiment the following encoding algorithm is used. The encoding operation consists of generating the parity messages for each of the n nodes. Each parity message is obtained by a linear combination of k data messages provided by other nodes. The multiplications and additions are carried out in GF(q). In particular, the p parity messages for node i are given by
The concentrator node receives blocks generated by the nodes of the network and uses the parity data from the received blocks to recreate the data.
In an embodiment the following decoding algorithm is used. The decoding operation consists in retrieving all the data messages in the event of up to r node failures. Let F={i1, . . . , ir} be the set of failing nodes and ={1, . . . , n}\F={ir+1, . . . , in} the set of surviving nodes. The data messages generated by the k surviving nodes can be retrieved directly from these nodes, let these mk messages be grouped as:
d
s=(d1,i
while the remaining mr messages, originated in the failed nodes, form
d
f=(d1,i
Accordingly the k block columns of the non-systematic part of the generator matrix corresponding to the surviving nodes can be partitioned as follows
A
s= (4)
A
f= (5)
Note that Af is square because mr=pk. The pk parity messages of the surviving nodes can be written as follows
f
s=(f1,i
such that
f
s
=d
s
A
s
+d
f
A
f (7)
Therefore, we can solve (7) for the unknowns df
d
f=(fs−dsAs)Af−1 (8)
It can be shown that with the construction described above the matrix Af is always non-singular. It is noted that because of the sparse nature of matrix Af, in practice, calculation of (8) can be done in a simplified fashion by solving a number of subsystems that require at most inverting matrices of size min{k,r}.
The decoding algorithm can be formulated in pseudo language as follows
In case less than r nodes fail, hence there are more than k surviving nodes, (8) still applies, where only k of the surviving nodes are used by the concentrator node to retrieve all the information. This reduces the amount of data and parity messages that have to be transferred to the concentrator node. Alternatively, data and parity messages can be collected from all the surviving nodes and (8) still applies with the inverse replaced by the Moore-Penrose pseudo-inverse.
The coding scheme may be adapted to work in an asymmetric network in which the number of data symbols is different for different nodes. The embodiments described in reference to
Each of the first to fifth nodes store data symbols indicating the energy usage on that node. As shown in
The first thing to do when considering an asymmetric network (different amount of data in each node) as depicted in
where mi and pi are the number of data and parity symbols per node, respectively.
For an optimal distribution pi*, there must be at least one set of failures F* (or equivalently survivors S*) that fulfils the constraint in (9) with equality. The solution is a type of water filling
where the water level is
The total number of parity symbols can be shown to be
In practice it is difficult to find the set F*, so it may be necessary to check all (rn) combinations, find the tentative water level μ as described in (11) and then compute the number of parity symbols according to (12). The combination that offers the smallest number of parity symbols is chosen; the distribution of parity symbols pi is then given by (10). An example of adding parity symbols to a network with different number of data symbols is shown in
Once the minimum number of parity symbols and their distribution have been computed, the erasure code needs to be designed. Note that there might be more than one solution to the parity symbol distribution problem.
Given the number of data and parity symbols per node, the encoded array can be written as
where di,j and fi,j are the ith data and parity symbols for node j, respectively, taken from a finite field Fq=GF(q) where q=bl is a power of a prime number b. Note that it is not necessarily a regular array since mi+pi may be different for different nodes i. The encoding operation can be written as a vector-matrix multiplication
c=dG (13)
where c is the codeword
c=(d0,0 . . . dm
d is the vector of data symbols
d=(d0,0 . . . dm
and G=(I A) is the (Σi mi)×(Σi(mi+pi)) generator matrix. The systematic part of G, of size (Σi mi)×(Σi mi), is the identity matrix I and the nonsystematic part, of size (Σi mi)×(Σi pi), is denoted by A.
For the actual design of A we can divide all possible data distributions into three cases:
Type 1 Codes: mi+pi is Constant
It can be shown that when k divides Σi mi and
the optimal parity distribution is pi=μ−mi and it is unique. Since mi+pi is constant, all nodes have the same total number of symbols and the array is a complete rectangle of size μ×n. In this case, we can arrange the data symbols in the array in the following way: the ith element of d in (14) is placed in row i mod and the next available column without exceeding the limitation on the number of data symbols per column, mi. Each row will now have
data symbols and the array can be rowwise encoded with a [n,k] erasure code where the r=n−k parity symbols are put the empty slots in each row. Since this rowwise code can correct r erasures, the array can cope with any r erased columns. The generator matrix for the array can be conveniently be expressed as
where denotes the Kronecker product. This code is optimal is the sense of overhead and density. A totally nonsingular matrix (all square submatrices are nonsingular) can be constructed with a Singleton matrix with alphabet size q≧n−1.
Decoding of the array code can be done rowwise since each row is a separate [n, k] code. If the generator matrix A′ corresponds to a Reed-Solomon code, there are several well known decoding algorithms; for a generic totally nonsingular generator matrix A′ we can simply decode by solving the system of equations which includes the r erased symbols and the surviving k symbols.
Consider the case with n=5 nodes with data rates m={3,3,2,2,2} and maximum number of erasures r=2 (and hence k=n−r=3). Then the optimal set is F*={0,1} (and consequently S*={2,3,4}) so the water level is
An optimal parity distribution is then
Hence mi+pi=4 is constant for all nodes. This is a unique solution and it is illustrated in
d=(d0,0 d1,0 d2,0 d0,1 d1,1 d2,1 d0,2 d1,2 d0,3 d1,3 d0,4 d1,4)
can then be arranged in a 4×5 array as
where all rows have k=3 data symbols. Each row can now be encoded with a systematic [5,3] erasure code which is capable of correcting r=2 erasures; hence the whole array can correct any r=2 erased columns. The parity symbols are inserted in the empty slots to complete the array
The rowwise erasure code can be designed with, e.g., a Singleton matrix
such that the parity symbols are generated as
In equation form the array is encoded as
This generator matrix can be compactly written as
Explicitly, the encoding is then
f=dA
f
0,0
=d
0,1
+d
1,2
+d
1,4
f
0,1
=d
2,0
+d
0,2
+d
0,4
f
0,2
=d
1,0
+d
2,1
+d
1,3
f
1,2
=d
0,0
+d
1,1
+d
0,3
f
0,3
=d
0,1+4d1,2+3d1,4
f
1,3
=d
2,0+4d0,2+3d0,4
f
0,4
=d
1,0+4d2,1+3d1,3
f
1,4
=d
0,0+4d1,1+3d0,3
Note that all operations described above are in GF(5).
Type 2 Codes: mi+pi is Not Constant and maxi mi≦μ
If the array is not a complete rectangle, the array code described above can't be used. However if maxi mi≦μ we can add “dummy symbols” to complete the array which is illustrated in
Consider the case with n=5 nodes with data rates m={1,1,6,1,6}. We can consider all
possible failure patterns as the optimal failure set F*. One that produces the fewest parity symbols is found to be F*={0,2} and consequently S*={1,3,4}. The water level is
and an optimal distribution is
One such distribution is p={5,5,0,2,0}, although there are three other (equally optimal) distributions since p1, p3 and p4 are not unique. This solution is illustrated in
where d′0,3, d′1,3 and d′2,3 denote the inserted dummy data symbols. Note that each row has k=3 data symbols each. The array can be completed by adding r=2 parity symbols per row and encoding it rowwise with a [5,3] code
The rowwise erasure code can be designed with, e.g., a Singleton matrix
such that the parity symbols are generated as
In equation form the array is encoded as
This generator matrix can be compactly written as
The encoding of the parity symbols is now
Since the dummy symbols don't carry any information we set them to a predetermined value, e.g., d′0,3=d′1,3=d′2,3=0. This means that the parity symbols become
f
0,0
=d
3,2
+d
5,4
f
1,0
=d
2,2
+d
4,4
f
2,0
=d
1,2
+d
3,4
f
3,0
=d
0,2
+d
0,3
+d
2,4
f
4,0
=d
0,1
+d
5,2
+d
1,4
f
0,1
=d
0,0
+d
4,2
+d
0,4
f
1,1
=d
3,2+3d5,4
f
2,1
=d
2,2+3d4,4
f
3,1
=d
1,2+3d3,4
f
4,1
=d
3,2+4d0,3+3d2,4
f
0,3
=d
3,2+4d5,2+3d1,4
f
1,3
=d
3,2+4d4,2+3d0,4
which is the same as deleting the corresponding elements (10,11,12) from the augmented data vector {tilde over (d)} and corresponding rows (10,11,12) from the generator matrix Ã
Decoding is then done for each row separately.
Type 3 Codes: mi+pi is Not Constant and maxi mi>μ
Using the same approach in this case, we would need to add dummy symbols to a level of maxi mi to make the array a complete rectangle. However then there will be too many data+dummy symbols for the code to correct r erasures. What we can do instead is to add a few extra parity symbols (and thereby increasing the overhead) and then add dummy symbols to complete the array; these dummy symbols can be removed afterwards as described above. It can be shown that the minimum number of extra parity symbols that needs to be added is r maxi mi−Σi pi; this is in fact the same number of “empty” symbols in F*. Hence these extra parity symbols can be added to the nodes in F*, dummy symbols are then added to complete the array, a rowwise code is designed for the (maxi mi)×n complete array (as in the (mi+pi)=constant case above) and then the rows corresponding to the dummy symbols are deleted from the generator matrix. Puncturing the extra data symbols, i.e. rows from the generator matrix is not enough in this case to produce an optimal code because of the extra parity symbols.
However, once the candidate parity symbols, i.e. columns of the generator matrix, have been chosen, it can be shown that it is always possible to shift the nonzero elements in these columns along their respective rows into positions made available by puncturing the extra rows. These shifted elements are then replaced by the nonzero elements from the same positions in the deleted rows. This procedure generates a number of all-zero columns equal to the number of extra parity symbols, which can then be removed. The resulting reduced generator matrix must be tested to guarantee that all the relevant submatrices have full row rank.
Therefore, puncturing the extra columns requires a search over the possible combinations of horizontal shifts of the nonzero elements in the columns to be deleted. The search can be made more efficient by identifying the combination of extra columns and rows to remove such that the number of elements to shift is minimised. Moreover, we can introduce an order between these candidate sets of columns to be removed such that the first combinations to be tested are those with the fewest nonzero elements.
A flow chart of the procedure outlined above is illustrated in
In step S1602 the parity allocations are found. Then in step S1604, the additional dummy parity symbol allocations {tilde over (p)} are found. In step S1608, the candidate column set c and corresponding row set r to remove with minimum weight w (number of nonzero values) are found.
In step S1610 the columns are sorted according to their weight. In step S1612 the rows r identified in step S1608 are removed from the generator matrix A.
In step S1614 possible sets of shifts to move the nonzero elements in the columns c into column positions made available by the deleting of rows in step S1612.
After the extra rows are removed, the generator matrix is manipulated in such a way that the columns we want to remove are all zeros. To do so, the nonzero elements in these columns are shifted horizontally to other column positions in step S1616. Now, the only candidate shifts are those provided by the locations of the nonzero elements in the removed rows. This is because of the symmetries in the construction of the generator matrix, which make it impossible to shift a nonzero elements to a column already containing k=n−r nonzero elements.
In step S1618, the nonzero columns c are removed from the generator matrix A.
The generator matrix A does not correspond to a rowwise encoding in this case and hence can't be decoded rowwise. However since this is a punctured generator matrix (rows and columns have been removed) of the form A=A′D, we can use the decoding algorithm described above.
Consider the case with n=5 nodes with data rates m={7,6,1,14}. After checking all
possible failure patterns, the one that produces the fewest parity symbols is found to be F*={0,1} and consequently S*={2,3,4}. The water level is
and an optimal distribution is
One such distribution is p={0,0,3,5,5}, although there are five other (equally optimal) distributions since p2, p3 and p4 are not unique. However since m0>μ we would need to fill up the array to the level m0=7 instead of μ=6 to make it complete; in that case there would not be enough parity symbols to correct r=2 erasures. Instead we can add r maxi mi−Σi pi=2·7−13=1 extra parity symbol to complete the (maxi mi)×r subarray. Then we can add the dummy symbols to complete the array; this procedure is illustrated in
Note that all rows have k=3 data symbols. We can now encode each row separately and add r=2 parity symbols
Note that f′0,1 is the extra parity symbol that was added.
The rowwise erasure code can be designed with, e.g., a Singleton matrix
such that the parity symbols are generated as
In equation form the array is encoded as
This generator matrix can be compactly written as
The encoding of the parity symbols is now
Since the dummy data symbols don't carry any information we set them to a predetermined value, e.g., d′0,2=d′1,2=d′2,2=d′0,3=d′0,4=0, which is the same as deleting the corresponding elements (14,15,16,18,21) from the augmented data vector {tilde over (d)} and corresponding rows (14,15,16,18,21) from the generator matrix Ã. This means that the parity symbols become
This code allows to recover any 2 node failures, however it is not MDS because there remains 1 extra parity symbol, f′0,1, corresponding to the first column of A′.
In order to puncture this column we apply the procedure outlined in
and the first viable column position, reading column wise from top left to bottom right, is column 4 with value 1. After applying this shift and removing the first column, we can test that the new reduced matrix is still a generator matrix of a code, which is now MDS and lowest density. The final parity symbols become
In step S606, the following quantities are calculated:
In step S608, it is determined if μ=ρq/r, if this is not the case, then a type 3 code is required. If μ=ρq/r, then the method moves to step S610.
In step S610, it is determined if μ=κq/k. If this is the case, then a type 1 code is required. If μ≠κq/k, then a type 2 code is required.
There are several important properties an array code should have:
The construction described herein encompasses all of the above properties.
Some additional properties of the array code described here are:
Node permutations. By applying a permutation to the nodes, i.e., relabeling of the nodes, which translates to a row and column permutation on matrix A, we can obtain a different code with the same optimal properties. In total, there are n! such permutations.
Duality. It can be shown that once an MDS code has been designed for n nodes, r=n−k maximum number of erasures, mi information and pi parity messages per node, the same construction can be used for the “dual” code if mi+pi=μ, ∀i; this is also a lowest density MDS array code. The dual code is characterised by the same number of nodes n but can correct k erasures when there are pi data symbols and mi parity symbols per node.
Extension to {mi}i=0n−1 not co-prime. If gcd{mi}i=0n−1=a, then the parity distribution has gcd {pi}i=0n−1=a. Therefore once a [n,k] irregular MDS array code has been designed for the data distribution {mi/a}i=0n−1, an extended MDS code can be easily constructed for the n-tuple {mi}i=0n−1. This extended family of codes is found by simply reusing the same mother code a times. In particular, the generator matrix of the extended code, Ga, is obtained by a Kronecker product
Ga=GIa (16)
where G is the generator matrix of the original code and Ia is the a×a identity matrix.
While the embodiments described above relate to monitoring of energy usage, those of skill in the art will appreciate that the coding schemes can be adapted for use in a variety of applications such as networks with high packet loss; video streaming over lossy channels; distributed network storage and redundant disk drives.
For example, one application is sending packets through a lossy channel. The data in the lost packets could be generated from the parity information of the received packets. In this example of the block erasure code could is used as a higher layer packet recovery mechanism whereby lost packets are recovered from the parity information of the received packets.
In this case the parameters would be determined based on some higher layer channel quality measurements, for example packet loss ratio or packet processing delay. An advantage of using the proposed block erasure code would be in the flexibility of the parameters choice.
The customisation of the coding parameters could be adapted depending on the network topology and other requirements, such as the level of protection needed and the maximum number of nodes involved in the coding operation. Before the code is generated there is a stage where the concentrator node (or a number of nodes in case of a distributed approach) acquires information on the network topology. This may be possible by using for example neighbour lists in routing protocols.
A decision is then made on the number of nodes involved in the failure protection scheme (parameter n), on the level of protection, i.e., how many block erasures the code should sustain (parameter r) and on the number of data symbols processed in parallel (parameters {mi}i=0n−1).
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel methods, systems, devices and networks described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB13/53016 | 11/15/2013 | WO | 00 |