Various forms of data are often submitted for backup consolidation across various remote storage systems or media. Further, frequently, the data may be consolidated irrespective of their characteristics (e.g., their data deduplication compatibility).
Specific embodiments of the invention will now be described in detail with reference to the accompanying figures. In the following detailed description of the embodiments of the invention, numerous specific details are set forth in order to provide a more thorough understanding of the invention. However, it will be apparent to one of ordinary skill in the art that the invention may be practiced without these specific details, In other instances, well-known features have not been described in detail to avoid unnecessarily complicating the description.
In the following description of
Throughout the application, ordinal numbers (e.g., first, second, third, etc.) may be used as an adjective for an element (i.e., any noun in the application). The use of ordinal numbers is not to necessarily imply or create any particular ordering of the elements nor to limit any element to being only a single element unless expressly disclosed, such as by the use of the terms “before”, “after”, “single”, and other such terminology. Rather, the use of ordinal numbers is to distinguish between the elements. By way of an example, a first element is distinct from a second element, and a first element may encompass more than one element and succeed (or precede) the second element in an ordering of elements.
In general, embodiments of the invention relate to a method and system for dynamic data protection. Specifically, towards implementing dynamic data protection, one or more embodiments of the invention provide and manage tiers of licensed storage capacity on which backup data may be consolidated. Further, the particular tier of licensed storage capacity on which backup data may be consolidated may be dependent on the characteristics of the backup data. Moreover, in embodiments where there may be insufficient available capacity in a licensed storage capacity tier to consolidate the backup data, a remaining capacity lacking in the licensed storage capacity tier may be overdrawn from another licensed storage capacity tier, provided that the latter tier has enough unallocated capacity to subsume the lacking capacity.
One or more embodiments of the invention hereinafter may be described with respect to consolidating deduplicated data on either high deduplication licensed storage capacity (LSC) or low deduplication LSC (see e.g.,
In one embodiment of the invention, the aforementioned components may be directly or indirectly connected to one another through a network (e.g., a local area network (LAN), a wide area network (WAN) such as the Internet, a mobile network, or any other network) (not shown). The network may be implemented using any combination of wired and/or wireless connections, In embodiments in which the aforementioned components are indirectly connected, there may be other networking components or systems (e.g., switches, routers, gateways, etc.) that facilitate communication, information exchange, and/or resource sharing. Further, the aforementioned components may communicate with one another using any combination of wired and/or wireless communication protocols.
In one embodiment of the invention, the PCS (102) may be any computing system (see e.g.,
in one embodiment of the invention, the PSS (104) may represent one or more physical storage devices and/or media on which various forms of information, pertinent to the PCS (102), may be consolidated. The one or more physical storage devices and/or media may or may not be of the same type. Further, information consolidated in the PSS (104) may be arranged using any storage mechanism (e.g., a filesystem, a collection of tables or records, etc.). In one embodiment of the invention, the PSS (104) may be implemented using persistent (i.e., non-volatile) storage. Examples of persistent storage include, but are not limited to: optical storage, magnetic storage, NAND Flash Memory, NOR Flash Memory, Magnetic Random Access Memory (M-RAM), Spin Torque Magnetic RAM (ST-MRAM), Phase Change Memory (PCM), or any other storage defined as non-volatile Storage Class Memory (SCM).
In one embodiment of the invention, the BSS (106) may refer to a data backup, archiving, and/or disaster recovery (DR) storage system. The BSS (106) may be implemented using one or more servers (not shown). Each server may be a physical server (i.e., residing in a datacenter) or a virtual server (i.e., residing in a cloud computing environment). In one embodiment of the invention, the BSS (106) may be implemented on one or more computing systems similar to the exemplary computing system shown in FIG, 4. The BSS (106) is described in further detail below with respect to
While
In one embodiment of the invention, the API (202) may be a hardware and/or software implemented construct that employs a set of subroutine definitions, protocols, and/or tools for enabling communications between the BSS (200) and one or more external entities (e.g., the production computing system (PCS) (not shown)). The API (202) may include functionality to: receive data storage requests; delegate the data storage requests to the DPA (206); provide a portal through which analytics, generated by the SAA (208), pertaining to the LSP (210) may be accessed by the one or more external entities (e.g., the PCS (not shown), one or more user clients (not shown), etc.); receive alerts based on the processing of the received data storage requests by/frons the DDA (204), the DPA (206), and/or. the SAA (208); and transmit any alerts to the request submitter(s) (e.g., the PCS (not shown)). One of ordinary skill will appreciate that the API (202) may perform other functionalities without departing from the scope of the invention. By way of an example, the API (202) may be a web API that may be accessed through an assigned web address (e.g., a uniform resource locator (URL)) and a WAN (e.g., Internet) connection.
In one embodiment of the invention, the DDA (204) may be a computer program or process(i.e., an instance of a computer program) that executes on the underlying hardware of the BSS (200). Specifically, the DDA (204) may be a computer program or process tasked with the deduplication of data submitted for consolidation onto the BSS (200). In short, deduplication may refer to a data compression technique directed at eliminating duplicate data blocks (or chunks). Substantively, through deduplication, only the unique data blocks (or chunks) of any particular data may be consolidated. To that end, the DDA (204) may include functionality to: receive original (i.e., non-deduplicated) data from the DPA (206); apply one or more known deduplication algorithms on the original data to render deduplicated data and a deduplication ratio; and provide the deduplicated data and deduplication ratio back to the DPA (206).
In one embodiment of the invention, deduplicated data, may encompass the compressed original data obtained following the deduplication process. The deduplication ratio, on the other hand, measures the effectiveness of the deduplication process on the original data on which the process had been applied. Further, the deduplication process may be determined by dividing the capacity required to store the data before the deduplication process (i.e., the capacity of the original data) by the capacity required to store the data after the deduplication process (i.e., the capacity of the deduplicated data). For example, an original dataset (before the deduplication. process) may require a capacity of 10 Terabytes (TB). The associated deduplicated data (after the deduplication process), however, may reduce the necessary capacity to 1 TB. The resulting deduplication ratio would be 10:1 and would thus render a ninety percent (90%) savings in the required capacity to store the data.
In one embodiment of the invention, the DPA (206) may be a computer program or process (i.e., an instance of a computer program) that executes on the underlying hardware of the BSS (200). Specifically, the DPA (206) may be a computer program or process tasked with dynamically managing the licensed storage capacities of the LSP (210) (described below) in accordance with one or more embodiments of the invention (see e.g.,
In one embodiment of the invention, the SAA (208) may be a computer program or process(i.e., an instance of a computer program) that executes on the underlying hardware of the BSS (200). Specifically, the SAA (208) may be a computer program or process tasked with providing analytic information pertaining to the LSP (210). Subsequently, the SAA (208) may include functionality to: monitor the properties and performance of the LSP (210); generate statistics and/or metrics based on the monitoring; compound the statistics and/or metrics into auto-support reports; and provide these reports to the appropriate external party (e.g., the PCS (not shown), one or more user clients (not shown), etc.). The SAA (208) may additional issue alerts based on the triggering of exception events.
In one embodiment of the invention, the above-mentioned auto-support reports may disclose detailed configuration and performance information surrounding the LSP (210). These reports may be periodically generated and transmitted, thereafter, to the PCS and/or one or more user clients (not shown). Further, these reports may serve to aid any trouble-shooting or debugging processes regarding the LSP (210). Information detailed in an auto-support report may include, but is not limited to: general information (e.g., storage model type, system serial number, installed hardware, media access control (MAC) addresses, etc.), software information (e.g., operating system version, uptime, number of reboots, licenses, etc.), hardware information (e.g., power supply specifications, memory and computer processing resources, storage topology, etc.), filesystem statistics, filesystem compression information (e.g., deduplication statistics), server usage, network statistics, operating system logs, and kernel information.
In one embodiment of the invention, the LSP (210) may represent a logical or virtual aggregation of storage capacity formed from one or more physical. storage devices (212A-212N). The one or more physical storage devices (212A-212N) may be of varying capacities and types. Further, each physical storage device (212A-212N) may be implemented using persistent (i.e., non-volatile) storage. Examples of persistent storage include, but are not limited to: optical storage, magnetic storage, NAND Flash Memory, NOR Flash Memory, Magnetic Random Access Memory (M-RAM), Spin Torque Magnetic RAM (ST-MRAM), Phase Change Memory (PCM), or any other storage defined as non-volatile Storage Class Memory (SCM), In one embodiment of the invention, the LSP (210) may further include one or more physical storage expansion slots (214A-214N), thereby permitting the installation of additional physical storage devices (not shown) to augment the aggregated storage capacity, if necessary. The LSP (210) is described in further detail below with respect to
In one embodiment of the invention, the SCM (220) may be partitioned into licensed storage capacity (LSC) (222, 224) and unlicensed storage capacity (USC) (226). The LSC (222, 224) may refer to a first subtotal storage capacity implemented across a first subset of the physical storage device(s) constituting the LSP (210). Further, the LSC (222, 224) may pertain to storage capacity that may be procured through capacity licensing. Capacity licensing may refer to a licensing model whereby a license fee may be charged in return for an amount of protected storage capacity (i.e., LSC (222, 224)) that backup data may consume. On the other hand, the USC (226) may refer to a second subtotal storage capacity implemented across a second subset of the physical storage device(s) constituting the LSP (210). More specifically, the USC (226) may pertain to storage capacity that has yet to be provisioned through capacity licensing.
In one embodiment of the invention, the LSC (222, 224) may be further partitioned into high deduplication LSC (222) and low deduplication LSC (224). High deduplication LSC (222) may refer to a subset of the LSC designated for the consolidation of data that dedupes well (i.e., data associated with a high deduplication ratio following subjection to the data deduplication process). In contrast, low deduplication. LSC (224) may refer to a remainder of the LSC designated for the consolidation of data that does not dedupe well (i.e., data associated with a low deduplication ratio following subjection to the data deduplication process), Examples of data of the latter classification include, but are not limited to, pre-encrypted and/or pre-compressed data, database transaction logs, audio and/or video files, engineering drawings, closed-circuit television (CCTV) data, and streaming data.
In one embodiment of the invention, the high deduplication LSC (222) may encompass allocated high deduplication LSC (not shown) and unallocated high deduplication LSC (not shown). Allocated high deduplication LSC may refer to a subset of the high deduplication LSC (222) to which data has already been written (through the course of past backup operations). Conversely, unallocated high deduplication LSC may refer to a remaining subset of the high deduplication LSC (222) to which data has yet to be written. Similarly, the low deduplication LSC (224) may encompass allocated low deduplication LSC (not shown) and unallocated low deduplication LSC (not shown).
In one embodiment of the invention, backup data may require the consumption of more high deduplication LSC (222) or low deduplication LSC (224) than that which may be available. In such an embodiment, unallocated LSC may be overdrawn from one LSC partition to subsume (or cover) the lack of storage capacity in another LSC partition. That is, should it be determined that there is insufficient unallocated high deduplication LSC to consolidate submitted backup data, then a difference (or delta) LSC may be calculated, which may represent the amount of unallocated. LSC the high deduplication LSC may be lacking. Thereafter, the obtained difference/delta LSC may be compared with the unallocated low deduplication. LSC to determine whether there is enough of the latter to assist in the consolidation of the submitted backup data. If there indeed is sufficient unallocated low deduplication LSC, storage capacity equivalent to the difference/delta LSC may be overdrawn (i.e., transferred) from the unallocated low deduplication. LSC to be represented as unallocated high deduplication LSC. The augmented unallocated high deduplication LSC may then be used to consolidate the submitted backup data. Similarly, should it be determined that there is insufficient unallocated low deduplication LSC to consolidate submitted data, and sufficient unallocated high deduplication LSC is available to subsume (or cover) the difference/delta LSC that the unallocated low deduplication LSC may be lacking, then a difference/delta LSC equivalent of the unallocated high deduplication LSC may be overdrawn therefrom to the unallocated low deduplication LSC. In one embodiment of the invention, the difference (or delta) LSC may also be referred to as overdraft LSC (228). Furthermore, following transfer of the overdraft LSC (228), the total LSC defining each LSC partition may be adjusted to account for the addition or subtraction of overdraft LSC (228) to/from the impacted LSC partitions.
While
Turning to
In Step 302, the original data (received via the data storage request in Step 300) is subjected through the data deduplication process. In one embodiment of the invention, by way of the data deduplication process, redundant information in the original data is removed prior to consolidation, thereby compressing the original data, Further, following the data deduplication process, deduplicated data (i.e., representative of the compressed original data) may be obtained, as well as a deduplication ratio (DR) that may quantify the effectiveness of the data deduplication process on the original data, The DR relates the storage capacity required to store the original data to the storage capacity required to store the deduplicated data. For example, had the original data required 10 Terabytes (TB) of storage, whereas the deduplicated data requires 1 TB, the resulting DR would be 10:1 (granting a 90% savings in storage capacity).
In Step 304, a determination is made as to whether the DR (obtained in Step 302) is less than a deduplication ratio threshold. In one embodiment of the invention, the deduplication ratio threshold may define a criterion, which when met or not, steers the consolidation of the deduplicated data onto an appropriate tier of licensed storage capacity (LSC) (e.g., a low deduplication LSC or a high deduplication LSC). Further, the deduplication ratio threshold may be determined based on analytic information rendered by the storage auto-support agent (SAA) (see e.g.,
In Step 306, after determining (in Step 304) that the deduplication ratio (obtained in Step 302) is less than a deduplication ratio threshold, another determination is made as to whether there is sufficient unallocated low deduplication. LSC available to consolidate the deduplicated data (also obtained in Step 302). In one embodiment of the invention, unallocated low deduplication LSC may refer to free low deduplication LSC to which data may be written. Further, the low deduplication LSC may represent a logical or virtual partition of the logical storage pool (LSP) (see e.g.,
In Step 308, after determining (in Step 306) that there is sufficient unallocated low deduplication. LSC to consolidate the deduplicated data. (obtained in Step 302), the deduplicated data is stored (or written) therein. In one embodiment of the invention, the deduplicated data may be written to one physical. storage device. In another embodiment of the invention, the deduplicated data may be written across multiple physical storage devices.
Turning to
In Step 322, a determination is made as to whether there is sufficient unallocated high deduplication LSC available to subsume (or cover) the required overdraft storage capacity (identified in Step 320). In one embodiment of the invention, unallocated high deduplication LSC may refer to free high deduplication LSC to which data may be written. Though reserved for data that does dedupe well (i.e., deduplicated data associated with a high deduplication ratio), unallocated high deduplication LSC remains, generally, as unused LSC that may be consumed by any data submitted for consolidation on the BSS. Accordingly, in one embodiment of the invention, if it is determined that the unallocated high deduplication LSC is equal to or greater than the required overdraft storage capacity, then the process may proceed to Step 324. On the other hand, in another embodiment of the invention, if it is alternatively determined that the unallocated high deduplication LSC is less than the required overdraft storage capacity, then the process may alternatively proceed to Step 328.
In Step 324, after determining (in Step 322) that there is sufficient unallocated high deduplication LSC to subsume the required overdraft storage capacity (identified in Step 320), an adjustment of the high and low deduplication LSCs is performed. Specifically, in one embodiment of the invention, the high deduplication LSC may be adjusted by reducing its total LSC by the required overdraft storage capacity (identified in Step 320). Conversely, the low deduplication LSC may be adjusted by augmenting its total LSC by the required overdraft storage capacity. For example, assume the total LSC (i.e., defining allocated and unallocated LSC) for the high deduplication LSC is initially at 50 TB, whereas the total LSC for the low deduplication LSC is initially at 30 TB, Further, insufficient unallocated low deduplication LSC is available to consolidate deduplicated data. Subsequently, an overdraft storage capacity of 10 TB is found to be required, which the unallocated high deduplication LSC can cover. Accordingly, the total LSC for the high deduplication LSC is adjusted by subtracting the overdraft storage capacity, thereby obtaining an adjusted high deduplication LSC of 50 TB minus 10 TB, or 40 TB, On the other hand, the total LSC for the low deduplication LSC is adjusted by adding the overdraft storage capacity, thereby obtaining an adjusted low deduplication. LSC of 30 TB plus 10 TB, or 40 TB.
In Step 326, the deduplicated data (obtained in Step 302) is subsequently stored in the adjusted low deduplication LSC. As mentioned above, in one embodiment of the invention, the adjusted low deduplication LSC may represent a resized low deduplication LSC, which had overdrawn—from the unallocated high deduplication LSC—the difference (or delta) licensed storage capacity needed to consolidate the deduplicated data. Further, in one embodiment of the invention, the deduplicated data may be written to one physical storage device, In another embodiment of the invention, the deduplicated data may be written across multiple physical storage devices.
In Step 328, after alternatively determining (in Step 322) that there is insufficient unallocated high deduplication LSC to subsume the required overdraft storage capacity (identified in Step 320), an alert is issued. Specifically, in one embodiment of the invention, a licensed storage capacity upgrade alert may be issued, which may be directed to notifying users and/or administrators of the BSS that additional capacity licenses would need to be procured. Based on the type of additional capacity licenses that may be procured and subsequently installed, the total LSC for the high deduplication LSC, the total LSC for the low deduplication LSC, or a combination thereof, may be updated. In one embodiment of the invention, the licensed storage capacity upgrade alert may be issued before the determination performed in Step 322. That is, the alert may be issued when either allocated low deduplication LSC or allocated high deduplication LSC has reached a predetermined percentage of the total LSC for their respective LSC partitions, For example, an alert may be issued when the allocated (or consumed) low deduplication. LSC occupies eighty percent (80%) of the total storage capacity designated as low deduplication LSC.
Turning to
In Step 342, after determining Step 340) that there is sufficient unallocated high deduplication LSC to consolidate the deduplicated data (obtained in Step 302), the deduplicated data is stored (or written) therein. In one embodiment of the invention, the deduplicated data may be written to one physical. storage device. In another embodiment of the invention, the deduplicated data may be written across multiple physical storage devices.
In Step 344, after alternatively determining (in Step 340) that there is insufficient unallocated high deduplication LSC to consolidate the deduplicated data (obtained in Step 302), a required overdraft storage capacity is identified. In one embodiment of the invention, the required overdraft storage capacity may represent a difference (or delta) in licensed storage capacity needed to be overdrawn from the low deduplication LSC in order to accommodate consolidation of the deduplicated data. By way of an example, if the storage capacity required to consolidate the deduplicated data is found to be 20 Terabytes (TB), however, the remaining unallocated high deduplication LSC only measures at 5 TB, then the overdraft storage capacity needed to accommodate the deduplicated data is 20 TB minus 5 TB, or 15 TB, which would need to be overdrawn from the low deduplication LSC, if available.
In Step 346, a determination is made as to whether there is sufficient unallocated low deduplication LSC available to subsume (or cover) the required overdraft storage capacity (identified in Step 344). In one embodiment of the invention, unallocated low deduplication LSC may refer to free low deduplication LSC to which data may be written. Though reserved for data that does not dedupe well (i.e., deduplicated data associated with a low deduplication ratio), unallocated low deduplication LSC remains, generally, as unused LSC that may be consumed by any data submitted for consolidation on the BSS. Accordingly, in one embodiment of the invention, if it is determined that the unallocated low deduplication LSC is equal to or greater than the required overdraft storage capacity, then the process may proceed to Step 348. On the other hand, in another embodiment of the invention, if it is alternatively determined that the unallocated low deduplication LSC is less than the required overdraft storage capacity, then the process may alternatively proceed to Step 352.
In Step 348, after determining (in Step 346) that there is sufficient unallocated low deduplication. LSC to subsume the required overdraft storage capacity (identified in Step 344), an adjustment of the high and low deduplication LSCs is performed. Specifically, in one embodiment of the invention, the low deduplication LSC may be adjusted by reducing its total LSC by the required overdraft storage capacity (identified in Step 344). Conversely, the high deduplication LSC may be adjusted by augmenting its total LSC by the required overdraft storage capacity. For example, assume the total LSC (i.e., defining allocated and unallocated LSC) for the high deduplication LSC is initially at 50 TB, whereas the total LSC for the low deduplication. LSC is initially at 30 TB. Further, insufficient unallocated high deduplication LSC is available to consolidate deduplicated data. Subsequently, an overdraft storage capacity of 10 TB is found to be required, which the unallocated low deduplication. LSC can cover. Accordingly, the total LSC for the high deduplication LSC is adjusted by adding the overdraft storage capacity, thereby obtaining an adjusted high deduplication LSC of 50 TB plus 10 TB, or 60 TB. On the other hand, the total LSC for the low deduplication LSC is adjusted by subtracting the overdraft storage capacity, thereby obtaining an adjusted low deduplication LSC of 30 TB minus 10 TB, or 20 TB.
In Step 350, the deduplicated data (obtained in Step 302) is subsequently stored in the adjusted high deduplication LSC. As mentioned above, in one embodiment of the invention, the adjusted high deduplication LSC may represent a resized high deduplication LSC, which had overdrawn—from the unallocated low deduplication LSC—the difference delta) licensed storage capacity needed to consolidate the deduplicated data. Further, in one embodiment of the invention, the deduplicated data may be written to one physical storage device. In another embodiment of the invention, the deduplicated data may be written across multiple physical storage devices.
In Step 352, after alternatively determining (in Step 346) that there is insufficient unallocated low deduplication LSC to subsume the required overdraft storage capacity (identified in Step 344), an alert is issued. Specifically, in one embodiment of the invention, a licensed storage capacity upgrade alert may be issued, which may be directed to notifying users and/or administrators of the BSS that additional capacity licenses would need to be procured. Based on the type of additional capacity licenses that may be procured and subsequently installed, the total LSC for the high deduplication LSC, the total LSC for the low deduplication LSC, or a combination thereof, may be updated. In one embodiment of the invention, the licensed storage capacity upgrade alert may be issued before the determination performed in Step 346. That is, the alert may be issued when either allocated low deduplication LSC or allocated high deduplication LSC has reached a predetermined percentage of the total LSC for their respective LSC partitions. For example, an alert may be issued when the allocated (or consumed) low deduplication LSC occupies eighty percent (80%) of the total storage capacity designated as low deduplication LSC.
In one embodiment of the invention, the computer processor(s) (402) may be an integrated circuit for processing instructions, For example, the computer processor(s) may be one or more cores or micro-cores of a processor. The computing system (400) may also include one or more input devices (410), such as a touchscreen, keyboard, mouse, microphone, touchpad, electronic pen, or any other type of input device. Further, the communication interface (412) may include an integrated circuit for connecting the computing system (400) to a network (not shown) (e.g., a local area network (LAN), a wide area network (WAN) such as the Internet, mobile network, or any other type of network) and/or to another device, such as another computing device.
In one embodiment of the invention, the computing system (400) may include one or more output devices (408), such as a screen (e.g., a liquid crystal display (LCD), a plasma display, touchscreen, cathode ray tube (CRT) monitor, projector, or other display device), a printer, external storage, or any other output device. One or more of the output devices may be the same or different from the input device(s). The input and output device(s) may be locally or remotely connected to the computer processor(s) (402), non-persistent storage (404), and persistent storage (406). Many different types of computing systems exist, and the aforementioned input and output device(s) may take other forms.
Software instructions in the form of computer readable program code to perform embodiments of the invention may be stored, in whole or in part, temporarily or permanently, on a non-transitory computer readable medium such as a CD, DVD, storage device, a diskette, a tape, flash memory, physical memory, or any other computer readable storage medium. Specifically, the software instructions may correspond to computer readable program code that, when executed by a processor(s), is configured to perform one or more embodiments of the invention.
Turning to the example, let us consider an initial state of a logical storage pool (LSP) (500) illustrated in
Turning to FIG, 5B, assume, at this point, a first original data (508A) is submitted for consolidation on the backup storage system (BSS). The storage capacity needed to consolidate the first original data (508A), before data deduplication, is measured at 100 TB. After subjection through the data deduplication process, however, a first deduplicated data (510A) is obtained from the first original data (508A), which measures at 20 TB. Subsequently, a first deduplication ratio (DR) (512A) obtained through the compression of the first original data (508A) is calculated as 100 TB divided by 20 TB, or 5:1.
Furthermore, assume a DR threshold is identified as being set to 4:1. In one embodiment of the invention, the DR threshold may serve as a classification boundary that indicates whether the deduplicated data (510A) should be consolidated in the high deduplication LSC (502A) or the low deduplication LSC (504A) based on the associated DR (512A). Following a comparison, it is determined that the first DR (512A) exceeds the DR threshold (i.e., 5:1>4:1). Based on this determination, the first deduplicated data (510A) is identified as data that dedupes well and, accordingly, should be consolidated in the high. deduplication LSC (502A). Hereinafter, a check confirms that sufficient unallocated high deduplication LSC (i.e., 30 TB) is available to consolidate the first deduplicated data (510A) (i.e., 20 TB). Based on the confirmation, the first deduplicated data (510A) is stored in the high deduplication LSC (502A). With the consolidation of the first deduplicated data (510A), the measured allocated high deduplication LSC rises to 40 TB (i.e., the initial 20 TB plus the 20 TB representative of the first deduplicated data), whereas the unallocated high deduplication LSC lowers to 10 TB (i.e., the initial 30 TB minus the 20 TB representative of the first deduplicated data).
Turning to
Following another comparison, it is determined that the second DR (512B) exceeds the DR threshold (i.e., 10:1>4:1). Based on this determination, the second deduplicated data (510B) is also identified as data that dedupes well and, accordingly, should also be consolidated in the high deduplication LSC (502A). However, this time around, a check confirms that insufficient unallocated high deduplication LSC (i.e., 10 TB) is available to consolidate the second deduplicated. data (510B) (i.e., 20 TB). Based on this confirmation, the low deduplication LSC (504A) is assessed to determine whether there is sufficient unallocated low deduplication LSC to subsume (or cover) the remaining 10 TB (i.e., difference/delta or overdraft LSC) needed to consolidate the second deduplicated data (510B).
Based on the assessment, it is determined that there indeed is sufficient unallocated low deduplication LSC (i.e., the initial 15 TB) to subsume the overdraft LSC (i.e., 10 TB). Accordingly, an overdraft LSC equivalent of the unallocated low deduplication LSC is overdrawn into the high deduplication LSC (502A). This leads to an adjustment of the LSC defining the low deduplication LSC (504B) as well as the high deduplication LSC (502B). Specifically, the low deduplication LSC (504B), overall, is adjusted to encompass 20 TB (i.e., the initial total 30 TB minus 10 TB representative of the overdraft LSC). The unallocated low deduplication LSC is also diminished from 15 TB (initially) to 5 TB (i.e., subtracting 10 TB, representative of the overdraft LSC, from the initial capacity), Conversely, the overall high deduplication LSC (502B) is augmented from the initial 50 TB to an adjusted 60 TB, which includes the additional 10 TB overdrawn from the low deduplication LSC.
Further, in consolidating the second deduplicated data (510B) therein, the allocated high deduplication LSC rises to 60 TB (i.e., 40 TB after consolidating the first deduplicated data (510A) plus 20 TB needed to consolidate the second deduplicated data (510B)), while the unallocated high deduplication LSC drops to 0 TB (i.e., 10 TB after consolidating the first deduplicated data (510A) plus 10 TB overdrawn from the unallocated low deduplication LSC minus 20 TB needed to consolidate the second deduplicated data (510B)). With all high deduplication LSC (502B) consumed by deduplicated data written thereto, an LSC upgrade alert may be issued to notify users and/or administrators of the BSS that further capacity licenses would need to be procured. Upon procurement and installation, a portion or all of the USC (506) may be re-designated as high deduplication LSC and/or low deduplication LSC.
Embodiments of the invention relate to a method and system for dynamic data protection. Dynamic data protection may refer to a flexible framework for protecting backup workloads of varying characteristics. Prior to the invention. disclosed herein, no distinction would be made between, for example, data that may dedupe well versus data that may not dedupe well, with respect to backup management and/or licensing, That is, current data protection (or backup) systems are not aware of the characteristics of the data that which they consolidate, and as such., these systems remain unresponsive to changing workloads. The invention disclosed herein addresses these shortcomings by assessing incoming data characteristics and consolidating the data in characteristic-appropriate storage capacities.
While the invention has been described with respect to a limited number of embodiments, those skilled in the art, having benefit of this disclosure, will appreciate that other embodiments can be devised which do not depart from the scope of the invention as disclosed herein. Accordingly, the scope of the invention should be limited only by the attached claims.