Computing devices may include any number of internal components such as processors, memory, and persistent storage. Each of the internal components of a computing device may be used to generate data. The process of generating and storing data may utilize computing resources of the computing devices such as processing and storage.
In general, in one aspect, the invention relates to a method for storing data. The method includes obtaining data from a host, making a first determination that a data cluster comprises a plurality of non-volatile memory devices, based on the first determination: storing processed data corresponding to the data in at least one of the plurality of non-volatile memory devices, and making a second determination to de-stage at least a portion of the processed data to a persistent storage device, and based on the second determination, initiating a delayed compression operation on the processed data.
In general, in one aspect, the invention relates to a system that includes a processor and a data processor, which when executed by the processor performs a method. The method comprises obtaining data from a host, making a first determination that a data cluster comprises a plurality of non-volatile memory devices, based on the first determination: storing processed data corresponding to the data in at least one of the plurality of non-volatile memory devices, and making a second determination to de-stage at least a portion of the processed data to a persistent storage device, and based on the second determination, initiating a delayed compression operation on the processed data.
In general, in one aspect, the invention relates to a non-transitory computer readable medium which includes computer readable program code, which when executed by a computer processor enables the computer processor to perform a method. The method includes obtaining data from a host, making a first determination that a data cluster comprises a plurality of non-volatile memory devices, based on the first determination: storing processed data corresponding to the data in at least one of the plurality of non-volatile memory devices, and making a second determination to de-stage at least a portion of the processed data to a persistent storage device, and based on the second determination, initiating a delayed compression operation on the processed data.
Specific embodiments will now be described with reference to the accompanying figures. In the following description, numerous details are set forth as examples of the invention. It will be understood by those skilled in the art that one or more embodiments of the present invention may be practiced without these specific details and that numerous variations or modifications may be possible without departing from the scope of the invention. Certain details known to those of ordinary skill in the art are omitted to avoid obscuring the description.
In the following description of the figures, any component described with regard to a figure, in various embodiments of the invention, may be equivalent to one or more like-named components described with regard to any other figure. For brevity, descriptions of these components will not be repeated with regard to each figure. Thus, each and every embodiment of the components of each figure is incorporated by reference and assumed to be optionally present within every other figure having one or more like-named components. Additionally, in accordance with various embodiments of the invention, any description of the components of a figure is to be interpreted as an optional embodiment, which may be implemented in addition to, in conjunction with, or in place of the embodiments described with regard to a corresponding like-named component in any other figure.
Throughout this application, elements of figures may be labeled as A to N. As used herein, the aforementioned labeling means that the element may include any number of items and does not require that the element include the same number of elements as any other item labeled as A to N. For example, a data structure may include a first element labeled as A and a second element labeled as N. This labeling convention means that the data structure may include any number of the elements. A second data structure, also labeled as A to N, may also include any number of elements. The number of elements of the first data structure and the number of elements of the second data structure may be the same or different.
In general, embodiments of the invention relate to method and system for storing data and metadata in a data cluster. More specifically, embodiments of the invention relate to processing data obtained from a host based on protection policies and the hardware capabilities of the data cluster. Further, the data cluster may include components that have hardware accelerators. The hardware accelerators may be compute acceleration devices or non-volatile memory devices. Embodiments of the invention may include a data processor which may perform and/or initiate one or more data processing operations on the obtained data to generate processed data based on the included hardware of the system and at least one protection policy. The data processing operations may include replication, erasure coding, deduplication, and/or delayed compression.
In one or more embodiments of the invention, the host (100) utilizes the data cluster (110) to store data. The data stored may be backups of databases, files, applications, and/or other types of data without departing from the invention.
In one or more embodiments of the invention, the host (100) is implemented as a computing device (see e.g.,
In one or more embodiments of the invention, the host (100) is implemented as a logical device. The logical device may utilize the computing resources of any number of computing devices and thereby provide the functionality of the host (100) described throughout this application.
In one or more embodiments of the invention, the data cluster (110) stores data, metadata, and/or backups of data generated by the host (100). The data and/or backups may be deduplicated versions of data obtained from the host (100) obtained via data deduplication. The data cluster (110) may, via an erasure coding operation, store portions of the deduplicated data across data nodes operating in the data cluster (110). Additionally, the data cluster (110) may generate and store replicas of data obtained from the host (100) via data replication. Finally, the data cluster (110) may compress the data obtained from the host (100) via data compression.
As used herein, deduplication refers to methods of storing only portions of files (also referred to as file segments or segments) that are not already stored in persistent storage. For example, when multiple versions of a large file, having only minimal differences between each of the versions, are stored without deduplication, storing each version will require approximately the same amount of storage space of a persistent storage. In contrast, when the multiple versions of the large file are stored with deduplication, only the first version of the multiple versions stored will require a substantial amount of storage. Once the first version is stored in the persistent storage, the subsequent versions of the large file subsequently stored will be de-duplicated before being stored in the persistent storage resulting in much less storage space of the persistent storage being required to store the subsequently stored versions when compared to the amount of storage space of the persistent storage required to store the first stored version.
In one or more embodiments of the invention, an erasure coding procedure includes dividing the obtained data into portions, referred to as data chunks. Each data chunk may include any number of data segments associated with the obtained data. The individual data chunks may then be combined (or otherwise grouped) into data slices (also referred to as Redundant Array of Independent Disks (RAID) slices). One or more parity values are then calculated for each of the aforementioned slices. The number of parity values may vary based on the erasure coding algorithm that is being used as part of the erasure coding procedure. The chunks of a data slice may then be stored across different data nodes in a data cluster. Any chunk within a data slice may be reconstructed using the other chunks in the data slice. Non-limiting examples of erasure coding algorithms are RAID-3, RAID-4, RAID-5, and RAID-6. Other erasing coding algorithms may be used without departing from the invention.
In one or more embodiments of the invention, data compression refers to methods of reducing the number of bits needed to represent data. Data compression may save storage capacity and decrease costs for storage hardware and network bandwidth. Non-limiting examples of data compression algorithms are run-length encoding (RLE), Huffman coding, LZ77, and LZ78 algorithms. Other data compression algorithms may be used without departing from the invention.
In one or more embodiments of the invention, data replication refers to copying data to obtain replicas and storing the replicas in different storage devices. For example, a file may be replicated resulting in two copies (also referred to as “replicas”) of the original file. The original file and the two replicas may then be individually stored in three separate data nodes. Data replication may improve data availability.
Continuing with the discussion of
In one or more embodiments of the invention, the data processor (120) is a device (physical or logical) that includes the functionality to perform erasure coding, deduplication, replication and/or initiate compression on data obtained from a host (e.g., 100,
The data processor may utilize protection policies (124) to determine which data processing operations to perform on data obtained from the host (100,
In one or more of embodiments of the invention, the data processor (120) is implemented as computer instructions, e.g., computer code, stored on a persistent storage device of a data node (130A, 130N) that when executed by a processor of a data node (e.g., 130A, 130N) cause the data node (130A, 130N) to provide the aforementioned functionality of the data processor (120) described throughout this application and/or all, or a portion thereof, of the methods illustrated in
In one or more embodiments of the invention, the data processor (120) is implemented as a computing device (see e.g.,
In one or more embodiments of the invention, the data processor (120) is implemented as a logical device. The logical device may utilize the computing resources of any number of computing devices and thereby provide the functionality of the data processor (120) described throughout this application and/or all, or a portion thereof, of the methods illustrated in
In one or more embodiments of the invention, the data nodes (130A, 130N) store processed data (as described below). The data nodes (130A, 130N) may include persistent storage devices (e.g., 138A, 138N,
In one or more embodiments of the invention, each data node (130A, 130N) is implemented as a computing device (see e.g.,
In one or more embodiments of the invention, each of the data nodes (130A, 130N) is implemented as a logical device. The logical device may utilize the computing resources of any number of computing devices and thereby provide the functionality of the data nodes (130A, 130N) described throughout this application. For additional details regarding the data nodes (130A, 130N), see, e.g.,
In one or more embodiments of the invention, the processor (132) is a component that processes data and processes requests. The processor (132) may be, for example, a central processing unit (CPU). The processor (132) may be other types of processors without departing from the invention. The processor (132) may process a request to store data and/or metadata and process data and/or metadata using data stored in memory (134), the persistent storage devices (138A, 138N), and/or other data nodes (e.g., 130N,
In one or more embodiments of the invention, the data node (130A) includes memory (134), which stores data that is more accessible to the processor (132) than the persistent storage devices (138A, 138N). The memory (134) may be volatile storage. Volatile storage may be storage that stores data that is lost when the storage loses power. The memory (134) may be, for example, Random Access Memory (RAM). In one or more embodiments of the invention, a copy of the data and/or parity chunks required for a persistent storage device rebuilding operation are stored in the memory (134) of the data node (130A).
In one or more embodiments of the invention, the CAD (136) includes functionality to perform data compression initiated by the data processor (120,
In one or more embodiments of the invention, the CAD (148) is a physical device that includes processing hardware (not shown) and memory (not shown). The CAD may include other elements/components without departing from the invention. The processing hardware may include, but is not limited to, a field-programmable gate array, application specific integrated circuit, programmable processor, microcontroller, digital signal processor, a host bus adapter (HBA) card, other processing hardware, or any combination thereof. Depending on the implementation, the CAD may also include persistent storage that may include computer readable instructions, which may be executed by the processing hardware, to perform all or a portion of the functionality of method shown in
In one or more embodiments of the invention, the persistent storage devices (138A, 138N) store processed data. The data may be data chunks and/or parity chunks, deduplicated data, compressed data, and/or replicated data. In addition, the data may also include storage metadata. The persistent storage devices (138A, 138N) may be non-volatile storage. In other words, the data stored in the persistent storage devices (138A, 138N) is not lost or removed when the persistent storage devices (138A, 138N) lose power. Each of the persistent storage devices (138A, 138N) may be, for example, solid state drives, hard disk drives, and/or tape drives. The persistent storage devices (138A, 138N) may include other types of non-volatile or non-transitory storage mediums without departing from the invention.
In one or more embodiments of the invention, the processor (142) is a component that processes data and processes requests. The processor (142) may be, for example, a central processing unit (CPU). The processor (142) may be other types of processors without departing from the invention. The processor (142) may process a request to store data and/or metadata and process data and/or metadata using data stored in memory (e.g., 144), the persistent storage devices (146A, 146N), and/or other data nodes (e.g., 130N,
In one or more embodiments of the invention, the data node includes a NVM device (144) which stores data that is more accessible to the processor (142) than the persistent storage devices (138A, 138N) and the memory (134,
In one or more embodiments of the invention, the persistent storage devices (146A, 146N) store processed data. The processed data may be data chunks and/or parity chunks, deduplicated data, compressed data, and/or replicated data. In addition, the persistent storage device may also store storage metadata. The persistent storage devices (146A, 146N) may be non-volatile storage. In other words, the data and metadata stored in the persistent storage devices (146A, 146N) is not lost or removed when the persistent storage devices (146A, 146N) lose power. Each of the persistent storage devices (146A, 146N) may be, for example, solid state drives, hard disk drives, and/or tape drives. The persistent storage devices (146A, 146N) may include other types of non-volatile or non-transitory storage mediums without departing from the invention.
In one or more embodiments of the invention, the processor (152) is a component that processes data and processes requests. The processor (152) may be, for example, a central processing unit (CPU). The processor (152) may be other types of processors without departing from the invention. The processor (152) may process a request to store data and/or metadata and process data and/or metadata using data stored in memory (e.g., 154), the persistent storage devices (156A, 156N), and/or other data nodes (e.g., 130N,
In one or more embodiments of the invention, the data node (150) includes memory (154), which stores data that is more accessible to the processor (152) than the persistent storage devices (156A, 156N). The memory (154) may be volatile storage. Volatile storage may be storage that stores data that is lost when the storage loses power. The memory (154) may be, for example, Random Access Memory (RAM). In one or more embodiments of the invention, a copy of the data and/or parity chunks required for a persistent storage device rebuilding operation are stored in the memory (154) of the data node (150).
In one or more embodiments of the invention, the persistent storage devices (156A, 156N) store processed data. The processed data may be data chunks and/or parity chunks, deduplicated data, compressed data, and/or replicated data. In addition, the persistent storage device may also store storage metadata. The persistent storage devices (156A, 156N) may be non-volatile storage. In other words, the data and metadata stored in the persistent storage devices (156A, 156N) is not lost or removed when the persistent storage devices (156A, 156N) lose power. Each of the persistent storage devices (156A, 156N) may be, for example, solid state drives, hard disk drives, and/or tape drives. The persistent storage devices (156A, 156N) may include other types of non-volatile or non-transitory storage mediums without departing from the invention.
In one or more embodiments of the invention, slice metadata (210) includes metadata associated data and parity chunks of data slices generated during erasure coding operations. Each data slice may have an associated metadata slice entry (e.g., 212A, 212N) generated by the data processor (120,
In one or more embodiments of the invention, deduplication metadata (220) includes metadata associated with deduplicated data generated during deduplication operations. The deduplication metadata (220) may include unique identifiers (e.g., fingerprints) for unique data chunks. The deduplication metadata (220) may also include mapping information. Mapping information may, for example, include information that indicates which unique chunks stored in the data cluster (e.g., 110,
In one or more embodiments of the invention, the compression metadata (230) includes metadata associated with compressed data generated during compression and/or delayed compression operations. The compression metadata (230) may include storage locations (e.g., the data node (e.g., 130A,
In one or more embodiments of the invention, replication metadata (240) includes metadata associated with replicas generated during replication operations. Replicas may be copies of data obtained from a host (e.g., 100,
In step 300, data is obtained from a host. The data may be a file, a file segment, a collection of files, or any other type of data without departing from the invention. The obtained data may include one or more protection policies associated with the host.
In step 302, protection policies associated with the host are checked. In one or more embodiments of the invention, the protection policies associated with the host indicate whether the replication or erasure coding should be performed on the data obtained in step 300. The data processor may check the protection policies to determine whether to perform a replication operation or an erasure coding operation.
In step 304, a determination is made as to whether the hardware of the data cluster includes hardware accelerators. If the underlying hardware includes hardware accelerators, the method may proceeds to step 306; otherwise, the method proceeds to step 308. In one or more embodiments of the invention, the data nodes include hardware accelerators that enable the data processor to perform additional data processing operations.
In step 306, in response to the determination of step 304 that the hardware of the data cluster includes hardware accelerators, the appropriate storage of data obtained from the host based on the protection policies and the underlying hardware accelerators is initiated. The data processor may initiate one or more data processing operations on the obtained data using the hardware accelerators based on the protection policies prior to storing the data in the data nodes of the data cluster. For additional details regarding the appropriate storage of data obtained from the host based on the protection policies and the underlying hardware accelerators, refer to
The method may end following step 306.
In step 308, in response to the determination of step 304 that the hardware does not include hardware accelerators, the appropriate storage of data obtained from the host based on the protection policies is initiated. In one or more embodiments of the invention, the data nodes do not have hardware accelerators. The data processor may not perform or initiate the data processing operations due to the lack of hardware accelerators. For addition details regarding the initiation of the appropriate storage of data obtained from the host based on the protection policies, refer to
The method may end following step 308.
Turning to
Continuing with the discussion of
In step 314, the obtained data is replicated and the resulting replicas are sent to the data nodes. In one or more embodiments of the invention, replication includes copying the data obtained from the host to generate replicas. The data processor may perform the replication operation on the obtained data and upon completing the replication operation, may then send the replicas to the data nodes of the data cluster. The protection policies may include the number of replicas to be generated during the replication operation. There may be any number of replicas generated and sent to the data nodes of the data cluster without departing from the invention. The replicas may be sent to the CADs of the data nodes. The data processor may generate storage metadata during the replication operation. The storage metadata may include replication metadata.
In step 316, the compression of the replicas on the CADs is initiated. In one or more embodiments of the invention, the data processor initiates compression of the replicas on the CADs. The data processor may send a request to the CADs of the data nodes. After receiving the request, the CADs may then perform a compression operation on the replicas. Upon completion of the compression operation, the CADs may then store the compressed replicas in persistent storage devices. The CADs may generate storage metadata during the compression operation. The storage metadata may include compression metadata. The CADs may send the storage metadata to the data processor.
The method may end following step 316.
In step 318, in response to the determination that the protection policies are not set to replication (i.e., set to erasure coding) erasure coding is performed on the data to obtain chunks (i.e., data and parity chunks). In one or more embodiments of the invention, the erasure coding procedure includes dividing the obtained data into portions, referred to as data chunks. Each data chunk may include any number of data segments associated with the obtained data. The individual data chunks may then be combined (or otherwise grouped) into slices (also referred to as Redundant Array of Independent Disks (RAID) slices). One or more parity chunks are generated based on the erasure coding algorithm. The number of parity chunks may vary based on the erasure coding algorithm that is being used as part of the erasure coding procedure.
In one or more embodiments of the invention, the number of data chunks and parity chunks generated is determined by the erasure coding procedure, which may be specified by the host, by the data cluster, and/or by another entity.
In step 320, deduplication is performed on the chunks and the unique chunks are sent to the data nodes. The data processor performing the deduplication may generate a fingerprint for a data chunk and identify whether the fingerprint matches an existing fingerprint stored in storage metadata (i.e., deduplication metadata). If the fingerprint matches an existing fingerprint, the data chunk may be deleted, as it is already stored in the data cluster. If the fingerprint does not match any existing fingerprints, the data chunk may be stored as a deduplicated data chunk. Additionally, the fingerprint of each deduplicated data chunk is stored in a storage metadata slice entry of the storage metadata. A fingerprint (or other unique identifier) of each parity chunk is also generated and stored in the storage metadata slice entry.
In one or more embodiments of the invention, the deduplicated data chunks collectively make up the deduplicated data. In one or more embodiments of the invention, the deduplicated data chunks are the data chunks that were not deleted during deduplication.
In step 322, compression of the unique chunks is initiated on the CADs. In one or more embodiments of the invention, the data processor initiates compression of the replicas on the CADs. The data processor may send a request to the CADs of the data nodes. After receiving the request, the CADs may then perform a compression operation on the replicas. Upon completion of the compression operation, the CADs may then store the compressed replicas in persistent storage devices. The CADs may generate storage metadata during the compression operation. The storage metadata may include compression metadata. The CADs may send the storage metadata to the data processor.
The method may end following step 322.
Turning to
In step 332, replication is performed on the obtained data and the replicas are sent to the NVM device of the data nodes. In one or more embodiments of the invention, replication includes copying the data obtained from the host to generate replicas. The data processor may perform the replication operation on the obtained data and upon completing the replication operation, may then send the replicas to the data nodes in the data cluster. The protection policies may include the number of replicas to be generated during the replication operation. There may be any number of replicas generated and sent to the data nodes of the data cluster without departing from the invention. The replicas may be sent to the NVM devices of the data nodes. The data processor may generate storage metadata during the replication operation. The storage metadata may include replication metadata.
In step 334, delayed compression of the replicas when the replicas are de-staged from the NVM devices to persistent storage devices. In one or more embodiments of the invention, one or more of the replicas are de-staged from the NVM devices when the NVM devices reach capacity. When this occurs, the data processor may send a request to the data nodes to initiate delayed compression.
In one or more embodiments of the invention, delayed compression is the persistent storage device may also store performance of a compression operation when the replicas de-stage from the NVM devices to the persistent storage devices of the data nodes. The replicas may reside in the NVM devices decompressed until the capacity of the NVM devices is exceeded. Upon exceeding NVM device capacity, the data node may obtain at least one of the replicas from the NVM device (i.e., the NVM device that has reached capacity), compress the obtained replica and store the compressed replica in a persistent storage device. Once the compressed replica is stored on the persistent storage device, the decompressed replica is remove (or deleted) from the NVM device.
The method may end following step 334.
In step 336, in response to the determination of step 330, that the protection policies are not set to replication (i.e., set to erasure coding), erasure coding is performed on the data to obtain chunks and the chunks are sent to the NVM devices of the data nodes. In one or more embodiments of the invention, the erasure coding procedure includes dividing the obtained data into portions, referred to as data chunks. Each data chunk may include any number of data segments associated with the obtained data. The individual data chunks may then be combined (or otherwise grouped) into slices (also referred to as Redundant Array of Independent Disks (RAID) slices). One or more parity chunks are generated based on the erasure coding algorithm. The number of parity chunks may vary based on the erasure coding algorithm that is being used as part of the erasure coding procedure.
In one or more embodiments of the invention, the number of data chunks and parity chunks generated is determined by the erasure coding procedure, which may be specified by the host, by the data cluster, and/or by another entity.
In step 338, delayed compression of the chunks is initiated when the chunks de-stage from the NVM devices to persistent storage devices. In one or more embodiments of the invention, one or more of the chunks are de-staged from the NVM devices when the NVM devices reach capacity. The data processor may send a request to the data nodes to initiate delayed compression.
In one or more embodiments of the invention, delayed compression is the performance of a compression operation when the chunks de-stage from the NVM devices to the persistent storage devices of the data nodes. The chunks may reside in the NVM devices decompressed until the capacity of the NVM devices is exceeded. Upon exceeding NVM device capacity, the data node may obtain at least one of the chunks from the NVM device (i.e., the NVM device that has reached capacity), compress the chunk(s) and store the compressed chunk(s) in persistent storage devices. Once the compressed chunk(s) is stored on the persistent storage device, the decompressed chunk(s) is remove (or deleted) from the NVM device.
The method may end following step 338.
In step 340, a determination is made as to whether the protection policy is set to replication. In one or more embodiments of the invention, one or more of the protection policies indicate whether to perform replication or erasure coding on the data obtained from the host. If the protection policies are set to replication, then the method proceeds to step 342. If the protection policies are not set to replication (i.e., set to erasure coding), the method proceeds to step 346.
In step 342, the obtained data is replicated to generate replicas. In one or more embodiments of the invention, replication includes copying the data obtained from the host to generate replicas. The data processor may perform the replication operation on the obtained data and upon completing the replication operation, may then send the replicas to the data nodes of the data cluster. The protection policies may include the number of replicas to be generated during the replication operation. There may be any number of replicas generated and sent to the data nodes of the data cluster without departing from the invention. The replicas may be sent to the persistent storage devices of the data nodes. The data processor may generate storage metadata during the replication operation. The storage metadata may include replication metadata.
In step 344, deduplication is performed on the replicas and the unique and necessary replicas are sent to the data nodes. The data processor performing the deduplication may generate a fingerprint for a data chunk of the replicas and identify whether the fingerprint matches an existing fingerprint stored in storage metadata (i.e., deduplication metadata). If the fingerprint matches an existing fingerprint, the data chunk may be deleted, as it is already stored in the data cluster. If the fingerprint does not match any existing fingerprints, the data chunk may be stored as a deduplicated data chunk. Additionally, the fingerprint of each deduplicated data chunk is stored in a storage metadata slice entry of the storage metadata.
In one or more embodiments of the invention, the deduplicated data chunks collectively make up the deduplicated data. In one or more embodiments of the invention, the deduplicated data chunks are the data chunks that were not deleted during deduplication.
The method may end following step 344.
In step 346, erasure coding is performed on the data to obtain chunks. In one or more embodiments of the invention, the erasure coding procedure includes dividing the obtained data into portions, referred to as data chunks. Each data chunk may include any number of data segments associated with the obtained data. The individual data chunks may then be combined (or otherwise grouped) into slices (also referred to as Redundant Array of Independent Disks (RAID) slices). One or more parity chunks are generated based on the erasure coding algorithm. The number of parity chunks may vary based on the erasure coding algorithm that is being used as part of the erasure coding procedure.
In one or more embodiments of the invention, the number of data chunks and parity chunks generated is determined by the erasure coding procedure, which may be specified by the host, by the data cluster, and/or by another entity.
In step 348, deduplication is performed on the chunks and the unique chunks are sent to the data nodes. The data processor performing the deduplication may generate a fingerprint for a data chunk and identify whether the fingerprint matches an existing fingerprint stored in storage metadata (i.e., deduplication metadata). If the fingerprint matches an existing fingerprint, the data chunk may be deleted, as it is already stored in the data cluster. If the fingerprint does not match any existing fingerprints, the data chunk may be stored as a deduplicated data chunk. Additionally, the fingerprint of each deduplicated data chunk is stored in a storage metadata slice entry of the storage metadata. A fingerprint (or other unique identifier) of each parity chunk is also generated and stored in the storage metadata slice entry.
In one or more embodiments of the invention, the deduplicated data chunks collectively make up the deduplicated data. In one or more embodiments of the invention, the deduplicated data chunks are the data chunks that were not deleted during deduplication.
The method may end following step 346.
The data processor (412) then sends each of the replicas to the CADs (422A, 422B, 422C) of the data nodes (420A, 420B, 420C) and initiates compression [5]. The CADs (422A, 422B, 422C) perform compression on the replicas and store the compressed replicas [6]. At a later point in time, the host (400) sends a request to the data processor (412) to obtain the data it previously sent [7]. The data processor (412) then sends a request to CAD A (422A) to decompress the compressed replica stored within and send the decompressed replica to the data processor (412) [8]. CAD A (422A) decompresses the compressed replica [9]. CAD A (422A) then sends the decompressed replica to the data processor (412) [10]. After obtaining the decompressed replica, the data processor (412) sends the decompressed replica (or a portion thereof) to the host (400) [11].
The data processor (412) then sends each of the chunks to the CADs (422A, 422B, 422C) of the data nodes (420A, 420B, 420C) and initiates compression [5]. The CADs (422A, 422B, 422C) perform compression on the chunks and store the compressed chunks [6]. At a later point in time, the host (400) sends a request to the data processor (412) to obtain the data it previously sent [7]. The data processor (412) then sends a request to the CADs (422A, 422B, 422C) to decompress the compressed chunks stored within and send the decompressed chunks to the data processor (412) [8]. The CADs (422A, 422B, 422C) decompress the compressed chunks [9]. The CADs (422A, 422B, 422C) then send the decompressed chunks to the data processor (412) [10]. After obtaining the decompressed chunks, the data processor (412) assembles the chunks into the original file [11]. The data processor (412) then sends the file to the host (400) [12].
The data processor (412) then sends each of the replicas to the NVM devices (426A, 426B, 426C) of the data nodes (420A, 420B, 420C) and initiates delayed compression [5]. At a later point in time, the NVM devices reach capacity and the replicas are de-staged from the NVM devices and are sent to the persistent storage devices (424A, 424B, 424C) [6]. The data nodes (420A, 420B, 420C) perform delayed compression on the replicas and store the compressed replicas in the persistent storage devices (424A, 424B, 424C) and delete the uncompressed replicas from the NVM devices [7]. At a later point in time, the host (400) then sends a request to the data processor (412) to obtain the data it previously sent [8]. The data processor (412) then sends a request to data node A (420A) to decompress the compressed replica stored within and send the decompressed replica to the data processor (412) [9]. Data node A (420A) decompresses the compressed replica and stores the decompressed replica in NVM Device A (426A) [10]. Data node A (420A) then sends the decompressed chunks to the data processor (412) [11]. The data processor (412) then sends the replica (or a portion thereof) to the host (400) [12].
The data processor (412) then sends a replicas to each of persistent storage devices A and C (424A, 424C) of data nodes A and B (420A, 420B), where the replicas are subsequently stored in the persistent storage devices (424A, 424C) [5]. At a later point in time, the host (400) sends a request to the data processor (412) to obtain the data it previously sent [6]. The data processor (412) then sends a request to data node A (420A) to send the replica stored in persistent storage device A (424A) [7]. Data node A (420A) then sends the replica to the data processor (412) [8]. The data processor (412) then sends the replica to the host (400) [9].
As discussed above, embodiments of the invention may be implemented using computing devices.
In one embodiment of the invention, the computer processor(s) (502) may be an integrated circuit for processing instructions. For example, the computer processor(s) may be one or more cores or micro-cores of a processor. The computing device (500) may also include one or more input devices (510), such as a touchscreen, keyboard, mouse, microphone, touchpad, electronic pen, or any other type of input device. Further, the communication interface (512) may include an integrated circuit for connecting the computing device (500) to a network (not shown) (e.g., a local area network (LAN), a wide area network (WAN) such as the Internet, mobile network, or any other type of network) and/or to another device, such as another computing device.
In one embodiment of the invention, the computing device (500) may include one or more output devices (508), such as a screen (e.g., a liquid crystal display (LCD), a plasma display, touchscreen, cathode ray tube (CRT) monitor, projector, or other display device), a printer, external storage, or any other output device. One or more of the output devices may be the same or different from the input device(s). The input and output device(s) may be locally or remotely connected to the computer processor(s) (502), non-persistent storage (504), and persistent storage (506). Many different types of computing devices exist, and the aforementioned input and output device(s) may take other forms.
One or more embodiments of the invention may be implemented using instructions executed by one or more processors of the data management device. Further, such instructions may correspond to computer readable instructions that are stored on one or more non-transitory computer readable mediums.
One or more embodiments of the invention may improve the operation of one or more computing devices. More specifically, embodiments of the invention improve the efficiency of storing data and decrease the computational resources required to store data a data cluster. In one embodiment of the invention, the efficiency is improved and the computational resources required are decreased by offloading one or more data processing operations from a data processor to hardware accelerators included in the data cluster with minimal impact on the performance of the data storage operations of the data cluster. The hardware accelerators may be compute acceleration devices or non-volatile memory devices. The data processing operations may include replication, erasure coding, compression, and deduplication. More specifically, the data processor may initiate a compression operation on one or more compute acceleration devices, and initiate compression in a data node of a data cluster when data de-stages from a non-volatile memory device.
In traditional data clusters, a data processor may perform the compression operations along with replication, deduplication, and/or erasure coding, resulting in increased computational overhead. Embodiments of the invention improve the traditional data clusters by offloading some of the data processing from the data processor to the underlying hardware of the data cluster. As a result, the latency and computational efficiency required to store data in a data cluster are improved.
Thus, embodiments of the invention may address the problem of inefficient use of computing resources. This problem arises due to the technological nature of the environment in which data is stored.
The problems discussed above should be understood as being examples of problems solved by embodiments of the invention disclosed herein and the invention should not be limited to solving the same/similar problems. The disclosed invention is broadly applicable to address a range of problems beyond those discussed herein.
While the invention has been described with respect to a limited number of embodiments, those skilled in the art, having benefit of this disclosure, will appreciate that other embodiments can be devised which do not depart from the scope of the technology as disclosed herein. Accordingly, the scope of the invention should be limited only by the attached claims.