This disclosure relates to address mapping. In particular, it relates to a redundant location address mapper.
Solid state disk (SSD) and in-memory databases are two ways of speeding up database analytic operations. A set of properties dubbed ACID (atomicity, consistency, isolation, durability) ensure that database transactions are processed reliably. Database software confirms that an entire transaction is written back to permanent (disk) storage before indicating that the transaction is complete. This prevents a partial transaction from changing the database in the event of a hardware failure.
Durability is problematic with both in-memory and SSD databases. Completed transactions must still be written to disk for an in-memory database since memory is volatile. SSD drives suffer from a problem called wear-out and only support a limited number of writes during their lifetime. It is advantageous to batch database updates to minimize the number of writes and thus extend the life of the SSD.
Keeping these pending updates in system memory requires that the memory be made fault tolerant. There is a range of fault tolerance options related to the type of faults that can occur. Error correction code in memory allows recovery from soft error and DRAM failures. Mirrored memory can tolerate entire DIMM or memory channel failures. To tolerate processor or system failures, a redundant copy of modified memory may be kept on a separate system in a computer cluster.
The present disclosure includes a method for mapping addresses for redundant data in a multi-node system that includes receiving a first address and a first node ID, determining a first address type representing an address redundancy level for the first address, determining a first node ID type representing a node ID redundancy level for the first node ID, and determining a second address and a second node ID based on the first address type and the first node ID type.
The present disclosure also includes a method for accessing addresses for redundant data in a multi-node system that includes receiving a first address and a first node ID, decoding a first address type representing an address redundancy level from the first address, decoding a first node ID type representing a node ID redundancy level from the first node ID, and determining a second address and a second node ID based on the first address type and the first node ID type.
The present disclosure includes a redundant system having a first number of nodes, each node having a processor, a memory, and a redundant address mapper. The redundant address mapper has an address type finder configured to determine a first address type representing an address redundancy level for a first address, a node ID type finder configured to determine a first node ID type representing a node ID redundancy level for a first node ID, a copy finder configured to determine a second address and a second node ID from the first address, the first address type, the first node ID, and the first node ID type, and an origin finder configured to determine a third address and a third node ID from the first address, the first address type, the first node ID, and the first node ID type.
The drawings included in the present application are incorporated into, and form part of, the specification. They illustrate embodiments of the present invention and, along with the description, serve to explain the principles of the invention. The drawings are only illustrative of typical embodiments of the invention and do not limit the invention.
To support continuous operations, computer memory systems may utilize fault tolerance to handle faults or errors. For a distributed memory system on a multiple node computer cluster, memory on a node may be made fault tolerant through memory replication and redundancy on remote nodes within the computer cluster. For example, a first node in a computer cluster may store a memory update in its own memory, a first redundant copy of the memory update on a second node, and a second redundant copy of the memory update on a third node. Similarly, the first node may store redundant copies of memory updates from the second and third nodes. If a failure occurs on the first node, the first node's memory update may be accessed by retrieving one of the two redundant copies while the failed component is repaired.
A redundant copy can be located by accessing location information for the redundant copies. When a redundant copy of a memory update is stored on a remote node, software creates location information about where the redundant copy is stored, such as a look-up table containing the address of the redundant copy. In the event of a failure of the home node, a system can find the address of the redundant copy through an object location algorithm and look-up the address of the copy on the remote node, often navigating through a memory hierarchy to access the location information. If a redundant copy is lost and needs to be restored, the original location may not be determined from the copy's location information.
According to embodiments of the disclosure, the locations of original and redundant data blocks may be determined through a location mapping algorithm. This mapping algorithm supports replicating data blocks to other nodes by mapping to a remote node and address that contain the data block. Redundant copies of data blocks may be stored according to coordinates generated by the mapping algorithm from the type of address and node ID of the original memory page. The type of address and node ID represent the redundancy level of the address and node ID, respectively. Given a node ID and address of an original data block, the redundant address location mapper calculates the node ID and address of a redundant copy of the data block. Similarly, given the address and node ID of a redundant copy, the mapper calculates the address and node ID of the original data block. When a component on a node fails, the mapper may receive the intended address and node ID of the data block for the failed node and identify a replica data block on a remote node.
The address type finder determines an address type for the address, as in 102. The address type classifies the redundancy level of the address by the origination status of the information stored at the address, such as origin or redundant copy. A node ID type finder determines a node ID type for the node ID, as in 103. The node ID type classifies the redundancy level of the node in a redundant group, such as first node or second node in a cluster. For example, in a three node system, an original piece of information may be stored on a first node at a first address designated as a first address type, a first copy of the information may be stored on a second node at a second address designated as a second address type, and a second copy of the information may be stored on a third node at a third address designated as a third address type.
The address type is evaluated as to whether the address is an origin address or a copy address. If the address is an origin address, then a copy finder may determine the copy address(es) and node ID(s), as in 105. If the address is a copy address, then an origin finder may determine the origin address and node ID, as in 106. Once the desired address and node ID are determined, the determined address and node ID may be outputted, as in 107.
When an original data block is first generated, the redundant address location mapper may store redundant copies according to the mapping algorithm described above. The redundant address location mapper may support single-copy redundancy or dual-copy redundancy. In single-copy redundancy, system software or logic may allocate every other memory page to storing an original page, leaving an extra page for hardware use to store a redundant copy; similarly, in dual-copy redundancy, system software or logic may allocate every third memory page to storing an original page, leaving two extra pages for redundant copies. The number of nodes needed for the redundancy mapping may be dependent on the type of redundancy used. For example, if the system is single-redundancy, then the system will group the nodes in groups of two for redundancy purposes; if the system is dual-redundancy, the system will group the nodes in groups of three. Once the original and redundant addresses are stored, the redundant address location mapper may later be used to find those original or redundant addresses based on an address and node ID input.
The even bits of the address or node ID are summed and the odd bits of the address or node ID are summed, as in 202. The type finder determines a difference between the sum of the even bits and the sum of the odd bits, as in 203. The difference is used to index into a type information table, as in 204. The type information table outputs the type of the address or node ID, as in 205.
The function of a redundant location address mapper may be represented as a matrix.
A type finder may determine an address or node ID type from an address or node ID, respectively. A single type finder may be used for both an address and a node ID; however, separate type finders for an address and a node ID may be associated with an address input and a node ID input, respectively, if desired.
The difference (D) calculated above indexes into a type information table 604, which receives the difference and outputs an address type according to the difference. The size of the type information table 604 may be dependent on the bit length of the address or node ID. For example, if the address is a 44 bit address, the address type information table 604 may be sized for entries from index values of 22 to −22. If the index value is 21, the type output is 00, which may correlate with an original address.
The output of the type finder may be configured according to the divisibility of the index values based on the redundancy of the system. The type finder logic may be configured differently depending on the redundancy configuration of the system. In a dual-redundancy system having three nodes in a redundancy group, the type information table may be configured to multiples of three. If an original memory page is stored every three addresses, then the type information table will repeat a certain type every three addresses. In a single redundancy system, a type finder may assign a type according to a pattern for a factor of two, such as by analyzing the last bit of an address to determine whether it is for an original or a copy.
In the event that a system fails and an original data block stored on a node on the system cannot be accessed, one or more redundant copy locations of the data block may be determined. An address and node ID of the original may be processed through the copy mapper/finder, along with an address type of the address and a node ID type of the node ID, to determine the address and node ID of the redundant copy or copies of the data block.
For example, an origin node ID 701 having an origin node ID type 702 of 01 and an origin address 703 inputs into the copy mapper/finder logic, as in the example of
In the event that a system fails and a redundant copy of a data block stored on a node in the system cannot be accessed, the original memory page location may be determined. A node ID and address of the redundant copy may be processed through the origin finder, along with an address type of the address and a node ID type of the node ID, to determine the address and node ID of the original memory page.
For example, a copy node ID 801 having a copy node ID type 802 of 10 and a copy address 803 having a copy address type 805 of 01 inputs into the origin finder logic, as in the example of
While the logic of the copy mapper/finder and origin finder have been expressed to create the example matrix of
Although the present disclosure has been described in terms of specific embodiments, it is anticipated that alterations and modifications thereof will become apparent to those skilled in the art. Therefore, it is intended that the following claims be interpreted as covering all such alterations and modifications as fall within the true spirit and scope of the disclosure.