The security of computing resources and associated data is of high importance in many contexts. As an example, organizations often utilize networks of computing devices to provide a robust set of services to their users. Networks often span multiple geographic boundaries and often connect with other networks. An organization, for example, may support its operations using both internal networks of computing resources and computing resources managed by others. Computers of the organization, for instance, may communicate with computers of other organizations to access and/or provide data while using services of another organization. Organizations may utilize complex data storage systems to efficiently and cost effectively store data. In many instances, organizations configure and utilize data storage systems hosted and managed by other organizations, thereby reducing infrastructure costs and achieving other advantages. With such complex use of computing resources to manage, ensuring that access to the data is authorized and generally that the data is secure can be challenging, especially as the size and complexity of such configurations grow.
Various embodiments in accordance with the present disclosure will be described with reference to the drawings, in which:
In the following description, various embodiments will be described. For purposes of explanation, specific configurations and details are set forth in order to provide a thorough understanding of the embodiments. However, it will also be apparent to one skilled in the art that the embodiments may be practiced without the specific details. Furthermore, well-known features may be omitted or simplified in order not to obscure the embodiment being described.
Techniques described and suggested herein include systems and methods for enhancing data security in a data storage system. In various embodiments, a data storage system stores data objects on behalf of users of the data storage system. In some examples, the data storage system is hosted and operated by a computing resource service provider as a service for the users, which may be devices of the data storage system. The customers may upload data objects to the data storage system for storage by the data storage system. In some embodiments, the data storage system provides a web service application programming interface (API) and the customers remotely manage their data through the API. For instance, the customers may submit web service calls to upload data to the data storage system, to retrieve data from the data storage system and to perform other operations, to obtain an inventory of data objects stored by the data storage system, and to perform other operations, some of which are described in more detail below.
In various embodiments, a data storage system is operated in a manner providing for data durability and security. For example, in various embodiments, the data storage system redundantly stores, among data storage devices of the data storage system, and also encrypts the data objects. Redundant storage of the data objects may be accomplished in a variety of ways. For instance, in some examples, a redundancy encoding scheme is used to deconstruct a data object into shards, which may be data objects that individually are insufficient for reconstructing the data object but that can be combined with other shards to construct the data object. In some examples, a proper subset of the shards generated from a data object is sufficient for constructing the data object. An example technique for redundantly encoding a data object into shards and reconstructing shards into the data object is erasure coding. The shards may be stored among a set of data storage devices such as by storing a shard in some or all of the data storage devices, with each shard stored in a different data storage device.
For the purpose of data security, the data objects stored by a data storage system may be encrypted using one or more cryptographically strong algorithms, such as algorithms that encrypt data in a manner rendering the encrypted data computationally impractical to decrypt without access to a secret key usable to decrypt the data. Example cryptographic algorithms are discussed below. In various embodiments, a new key, referred to as a content encryption key, is generated for each data object to be stored by the data storage system. The content encryption key may be generated to be unique for the corresponding data object or, generally, such that the probability of multiple data objects sharing the same key is below some threshold. Once generated, the content encryption key is used to encrypt the data object, such as by encrypting the data object before it is deconstructed to shards or by encrypting the shards individually using the key.
To enable future access to the data object, content encryption keys may be persistently stored by the data storage service. However, to protect against unauthorized access to a content encryption key enabling unauthorized decryption of the data object, the data storage service may store content encryption keys in encrypted form. The data storage service may cause the content encrypting keys to be encrypted using key encrypting keys, such as by encrypting the content encrypting keys itself or securely providing the content encrypting keys to another entity to perform the encryption. The key encrypting keys may be securely stored by the data storage service (or another entity) and a single key encrypting key may be used to encrypt multiple content encryption keys. The data storage service or other entity may maintain, in association with data objects, metadata that includes or references the encrypted content encryption keys (referred to as wrapped keys) such that the metadata can be used to access the wrapped keys. The metadata may also associate, directly or indirectly, the wrapped keys with an identifier or other information usable to identify and access the key encrypting key that was used to encrypt the content encryption key encrypted in the wrapped key. In this manner, should future access to the plaintext data of the data object, the identifier can be used to access the proper key encrypting key so that the key encrypting key can be used to decrypt the wrapped key to obtain the content encryption key and, therefore, so that the content encryption key can be used to decrypt the data object.
To further ensure the security of the data, key encrypting keys are, in various embodiments, encrypted under a disaster recovery key. A disaster recovery key may be a public key of a public-private key pair of a public key cryptosystem, whereas, in various embodiments, the content encryption key and key encrypting keys are keys of a symmetric key cryptosystem (a system that utilizes a corresponding cryptographic algorithm, which may be a cryptographic primitive). In this manner, more common operations performed by the data storage system, such as encryption and decryption of data objects and content encryption keys, can be performed computationally efficiently relative to use of public key cryptosystems. In some embodiments, however, the content encryption key and/or key encrypting key is/are public keys used in a public key cryptosystem and/or the disaster recovery key is a key of a symmetric key cryptosystem. In embodiments where the disaster recovery key is a public key of a public key cryptosystem, the private key corresponding to the disaster recovery key is stored in a highly secure manner, which may utilize specialized hardware and which may require specific security protocols to be followed in order to access (use) the private key for the performance of cryptographic operations.
A single disaster recovery key may be used to encrypt one or more key encrypting keys, which may be rotated over time for the purpose of security. Key encrypting keys may be encrypted by the disaster recovery key and (in encrypted form) stored in a file, referred to as a disaster recovery file, that is redundantly stored in the data storage system. For example, the disaster recovery file may be stored in multiple data storage devices (e.g., drives) that also store the data objects (perhaps in shard form). In some embodiments, every data storage device of the data storage system that stores a shard also stores the disaster recovery file. The disaster recovery file may also associate key encrypting keys with identifiers of the key encrypting keys. In this manner, should access to a key encrypting key be lost, the disaster recovery file can be used to decrypt the key encrypting key from the disaster recovery file.
In various embodiments, the data storage service operates in accordance with a service-level agreement (SLA) relating to durability of the data stored therein, where durability is a measurement of likelihood that a data object stored by the data storage system will not become inadvertently inaccessible due to various factors, such as drive malfunction, catastrophic events and/or other events that can cause some data to be lost. Further, in various embodiments, cryptographic keys are persisted to ensure durability of the keys that is greater than to that of the SLA. For example, content encryption keys may be stored with at least as good a durability as the data objects with which they correspond. Likewise, key encrypting keys may be stored with a durability that is greater than the durability of the SLA. Generally, the content encryption keys and key encrypting keys may be stored such that, when their durability is taken into account when calculating the durability of the data objects stored by the data storage system, the SLA is not violated. In other words, the durability of the cryptographic keys used by a data storage system is used in determining the durability of the data objects to ensure compliance with the SLA. In this manner, the SLA is not inadvertently violated due to a lack of durability of the keys.
In various embodiments of the present disclosure, content encryption keys are stored by the data storage system so that the content encryption keys are accessible for the decryption of data. However, to avoid an unauthorized access of data, the content encryption keys 104 may be stored in encrypted form. As illustrated in
As discussed in more detail below, a data storage system may encrypt content encryption keys 104 for storage within the data storage system. The same key encrypting key 106 may be used to encrypt multiple content encryption keys. However, to enhance security by, for instance, avoiding cryptographic attacks performable when a key is used repeatedly, the key encrypting key 106 may be rotated. Rotating a key encrypting key may be performed by generating or otherwise obtaining a new key encrypting key and using the new key encrypting key to perform encryption (and, if appropriate, decryption) of content encryption keys 104. When a key encrypting key is replaced by a new key encrypting key, it may be said that the key encrypting key that was replaced has been retired. The data storage system may securely store retired key encrypting keys so as to maintain the ability to access content encryption keys that were encrypted under the key encrypting keys before their retirement. Thus, for instance, if a user requests to access a data object (e.g., requests that the data object be retrieved from one or more data storage devices of the data storage system and provided), data storage system may access an appropriate retired key encrypting key, if the key encrypting key has been retired before the request has been received, and use the retired key encrypting key to decrypt a content encryption key used to encrypt the data object. The decrypted content encryption key may then be used to decrypt the data object so that the data object can be provided to the user in a form usable to the user, such as plaintext form or encrypted form decryptable by the user (e.g., over a secure socket layer (SSL) connection).
Various embodiments of the present disclosure utilize techniques to further enhance security by redundantly storing key encrypting keys in encrypted form in the data storage system. In an embodiment, a disaster recovery public key 108 is used to encrypt a key encrypting key used by the data storage system. The disaster recovery public key 108 may be a public key of a public key cryptosystem. Described briefly, the public key cryptosystem may be utilized to enable any entity with access to the public key (which may be publically available) to encrypt data using the disaster recovery public key. Decryption of the encrypted data may be performed by any holder of a disaster recovery private key corresponding to the disaster recovery public key. Sample symmetric cryptosystems include, but are not limited to, the ElGamal cryptosystem, elliptic curve cryptosystems, the Pallier cryptosystem, the Ron Rivest, Adi Shamir and Leonard Adleman (RSA) cryptosystem, the Cramer-Shoup cryptosystem, and variations thereof.
The disaster recovery private key may be maintained in a secure manner such as in a hardware security module. Further, one or more security procedures may be enforced in connection with access to the disaster recovery private key. For instance, one or more rules with respect to a quorum of operators necessary to be present for access of the disaster recovery private key, may be enforced. In this manner, if a key encrypting key in plaintext form is lost—that is, if the key encrypting key 106 in plaintext form has become accessible, the key encrypting key 106 may be recovered by accessing the key encrypting key 106 in encrypted form and utilizing a disaster recovery private key to decrypt the encrypted key encrypting key 106.
As discussed above, various embodiments of the present disclosure are utilized to enhance data security within a data storage system.
In the illustrative example of
The on-demand data storage service 208, on the other hand, may be configured to provide better performance with respect to data access. For example, the on-demand data storage service 208 may be configured to synchronously process requests to store and/or access data. To enable better performance relative to the archival data storage service 206, the on-demand data storage service 208 may operate using additional computing resources (e.g., databases) that enable the on-demand data storage service 208 to locate and retrieve data quickly relative to the archival data storage service 206. The on-demand data storage service 208 may provide synchronous data access. For example, the on-demand data storage service may maintain stored data in a manner such that, when a request for a data object is retrieved, the data object can be provided (or streaming of the data object can be initiated) in a response to the request.
As illustrated in
In various embodiments, the data storage service 300 performs one or more operations asynchronously, such as described above. For example, for some requests submitted to the web interface system 302, the data storage service 300 may provide results of fulfillment of the requests separately from responses to the requests. As an illustrative example, the data storage service 300 may receive to the web interface system 302 a request to retrieve a data storage object stored by the data storage service 300. The data storage service 300 may respond to the request with a response that does not include the requested data, but that may include other information such as information acknowledging the request had been submitted and/or information indicating that the request is to be processed by the data storage service 300. Example information, such as a job identifier, is described below. In these examples, once the data storage service 300 has made the requested data available, the data storage service 300 may provide the data such as in response to a separate request for the data.
As noted above, fulfillment of some types of requests by the data storage service 300 is performed asynchronously. In order to accomplish this, the data storage service 300 may include a job tracking system 304. The job tracking system 304 may be a computer system configured to perform various operations in connection with the tracking of fulfillment of certain types of requests to the data storage service 300 and, in particular, for requests that are fulfilled by the data storage service 300 asynchronously. As an example, when the data storage service 300 receives a request to the web interface system 302, if the request is of a type that is fulfilled asynchronously, the web interface system 302 may notify the job tracking system 304 of the request. Notification may be performed using an electronic communication over an internal network of the data storage service 300 from the web interface system 302 to the job tracking systems 304. The job tracking system 304 may generate an identifier for a job, which may be a workflow for processing the request. The identifier may be generated, for example, in a serial manner or in another manner. The job tracking system 304 may interact with one or more other components of the data storage service 300 in order to track fulfillment of requests for which the job tracking system 304 has initiated jobs.
When the job tracking system 304 detects completion of a job (e.g., detects completion of a workflow to obtain data and make the obtained data available for download), the job tracking system 304 may transmit an electronic message to a notification system 306 which may be a system configured to provide notifications of job completion to users of the data storage service. The notifications may be provided in any suitable manner such as by electronic communications to one or more specified recipients. The notification system 306 may also be configured that users of the data storage service 300 can subscribe to various notification topics and when a message is published to a topic to which a user has been subscribed the user may receive an electronic communication of the message. In this manner, for example, a customer may determine when a job has been completed and results of the job's completion, that is of a corresponding requests fulfillment, are available. In the example of a data retrieval job, for example, a user of the data storage service 300 may utilize a message from the notification system 306 to determine that data requested is available for download. The user may then submit a request to download the data that has been made available. It should be noted that one or more users that receive notifications are not necessarily, but can be, the same user that submitted a request to the web interface system 302 for which a corresponding job was created. Further, in various embodiments, a user that obtains from the data storage service 300 the results of a job may be, but is not necessarily, the same user that initially submitted to the data storage service 300 the request that caused the job to be practiced. For example, one user may request that data be made available for retrieval and another user may request and receive the data once the data has been made available.
As illustrated in
Similarly, the data storage service 300 may include a download staging system 314 which may be a computer system configured to store data objects until the data objects are retrieved by users of the data storage service 300. For example, when the data storage service 300 receives a request to the web interface system 302 to retrieve a data object that is stored in the data storage device 312, the data encryption system 308 may obtain encrypted data objects from the data storage devices 312, may decrypt the encrypted data objects and transmit the decrypted data objects to the download staging system 314. The user or another user may transmit a subsequent request to the web interface system 302 to download the data object from the download staging system 314. The download staging system 314 may then provide the data object to the web interface system 302 which then provides the data object to the user that requested the data object. It should be noted that once decrypted by the data encryption system 308 transmission of the data object does not necessarily occur with the data object in plaintext form. For example, data object may be re-encrypted for transmission over an SSL or other secure connection one or more times during the transmission of the data object to the user that requested the data object. In order to perform various encryption and decryption operations, the data encryption system 308 may operate in accordance with a key storage system 316 which is a system configured to store keys utilized by the data encryption system 308, such as described in more detail below.
In addition to the above components, the data storage service 300 may include an anti-entropy system 318. The anti-entropy system 318 may be a system configured to perform various anti-entropy operations in connection with the data storage system. Such operations may include, for example, reclaiming data storage space when the space is no longer needed to store data that was stored in the storage space. The anti-entropy system may also perform various operations to optimize storage efficiency and cost, such as rewriting data more efficiently. As discussed below, the anti-entropy system may, in various circumstances, be configured to encrypt content encryption keys stored by the data storage system. For instance, if a key encrypting key is compromised, the anti-entropy system may locate content encryption keys encrypted under the key encrypting key and encrypt the located keys using a new key encrypting key. The anti-entropy system 318 may operate in a manner that avoids interference with other operations of the data storage service 300, such as read and write operations performed as part of storing and retrieving data objects in/from the data storage service 300. For instance, the anti-entropy system 318 may begin and pause operations to avoid interference.
As noted above, various storage systems may utilize various techniques described herein in order to securely store data objects within.
Returning to
In an embodiment, the process 400 includes obtaining 406 a content encryption key. Obtaining 406 the content encryption key may be performed in any suitable manner. For example, obtaining the content encryption key may be performed by generating the content encryption key. The content encryption key may be generated in any suitable manner, such as by using the SecureRandom class of the Java security package, although other methods of generating keys may be used. For instance, any cryptographically secure key derivation function may be used. Once the content encryption key has been obtained, the process 400 may include encrypting 408 the shards using the content encryption key that was obtained 406. The shards may be encrypted in any suitable manner using any suitable cryptographic algorithm. For example, in some embodiments, the shards are encrypted using the counter mode of the Advanced Encryption Standard (AES-CTR). Generally, the shards may be encrypted by inputting the content encryption key and the shard (for each shard) into a cryptographic algorithm. It should be noted that, while the present disclosure discusses encrypting shards, variations are considered as being within the scope of the present disclosure. For example, in some embodiments, the data object is first encrypted and the shards may be generated from the encrypted data object. Other variations, such as encrypting a data object, generating shards from the encrypted data object, and then encrypting the shards, are also within the scope of the present disclosure. Regardless of how the shards are encrypted, the encrypted shards may then be stored 410 such as in the data storage devices 312 discussed above in connection with
The content encryption key used to encrypt 408 the shards may be encrypted 412. The content encryption key may, for instance, be encrypted under a key encrypting key described above. The content encryption key may be encrypted using any suitable cryptographic algorithm such as the Galois counter mode of the Advanced Encryption Standard (AES-GCM). The encrypted content encryption key (i.e., wrapped key) may then be stored 414 so as to be assessable to be decrypted and used to decrypt the encrypted shards. The encrypted content encryption key may, for instance, be stored in metadata stored for the data object, such as described below. It should be noted that, while various embodiments of the present disclosure use AES-CTR and AES-GCM, other cryptographic ciphers may be used. Examples of include, for example, other modes of the Advanced Encryption Standard (AES), Blowfish, Data Encryption Standard (DES), Triple DES, Serpent or Twofish, and depending on the specific implementation selected, may be either asymmetric or symmetric key systems.
Data storage systems typically provide the ability to retrieve data stored therein.
In an embodiment, the process 500 includes receiving 502 a data object from a user. The user may, for instance, submit an appropriately configured API call to the web interface system 302 of
Upon transmission of the data object to the upload staging system, the process 500 of
In various embodiments, notification of the successful upload to the user may comprise an electronic message that includes an identifier of the data object used by the data storage service, where the identifier may be useable by the user to later obtain the data object from the data storage service. The user may, for instance, request retrieval of a data object identified by the identifier specified by the user. In some instances, the data storage service opaquely encodes information in the identifier that enables the data storage service to locate, in the data storage system, the data of the data object without having to maintain a separate synchronously accessible database for this purpose.
The identifier of the data object may encode storage location information that may be used to locate a data object stored in an archival data storage system. For example, the storage location information may encode a reference to a hierarchical data structure in which the data object is stored. Such an embodiment may reduce or eliminate the cost to store a namespace map or similar data structure to map data object identifiers to storage locations of the corresponding data objects. The identifier of the data object may also include other information, such as payload validation information which may include a data size, a timestamp (e.g., data creation timestamp), one or more digests of the payload and the like. A data object identifier may also encode metadata validation information to ensure the integrity of information associated with a data object identifier. In various embodiments, metadata validation information may include error-detection checks such as a parity check, cyclic redundancy check (CRC), error-correction checks and the like of some or all information encoded in the data object identifier. Data object identifiers may be encoded (e.g., encrypted) in entirety or partially to encapsulate information described above. Such encoding may provide data security and transparency to implementation details. In some embodiments, data object identifiers may include information described above without encryption or otherwise obscuring the information.
For retrieval or deletion of data object stored in an archival data storage system, the data object identifier associated the data object may be validated using at least the metadata validation information. The storage location information encoded in the data object identifier may be used to locate the stored data object in a non-transient data store. In some embodiments, the identifier is provided while the data object is stored in a transient data store, although the storage location information in the data object identifier is usable to locate the stored data object in a non-transient data store. Finally, payload integrity validation information may be used to validate the integrity of the retrieved or to-be-deleted data object. Subsequently, the requested access (retrieval/deletion) may be validated based at least in part on the policy information encoded in the data object identifier. Other information which may be encoded in an identifier may also be used, such as policy information used to enforce policy on requests to perform operations (e.g., delete operations) submitted using the identifier.
In an embodiment, the identifier opaquely encodes the information so that the information is in a form that is not human readable. In some examples, for instance, the identifier comprises encrypted information so that, without access to a key usable to decrypt the identifier, the encrypted information is not accessible in human readable form or, generally, in a usable form. Other features of data object identifiers that may be used in accordance with the various embodiments are described in U.S. patent application Ser. No. 13/569,994, entitled ARCHIVAL DATA IDENTIFICATION, filed on Aug. 8, 2012, which is incorporated herein by reference for all purposes.
As discussed, in some embodiments, a data storage service stores data objects in an asynchronous manner. That is, upon acknowledgement of a successful upload of a data object from the customer, the data storage service may perform additional operations at a later time in order to efficiently and cost-effectively store the data object. The data storage service may, for instance, process received data objects in batches in order to take advantage of parallelism and various optimizations available to the data storage service.
When the data object has been obtained 604, the process 600 may include applying 606 a redundancy encoding scheme to the data object to generate shards. A system performing the process 600, for example, may apply an erasure coding scheme to the data object to generate the shards. The shards may then be encrypted 608 using an appropriate cryptographic algorithm such as described above. Metadata for the data object may also be generated 610. An example of information included in data object metadata is discussed below in connection with
As noted, data objects stored by a data storage service may be deconstructed into shards and various metadata may be generated and stored for the data objects.
The footer data as illustrated in
Various operations performed by a data storage service, as noted above, may be performed asynchronously. A request may be submitted to the data storage service and the data storage service may fulfill the request in an asynchronous manner. For example, requests to retrieve data objects may be batched, and data objects may be retrieved from data storage devices and provided to a download staging system in batches. When a batch containing a customer's data object has been transmitted to a download staging system, the user of the data storage service may request the data object.
The job tracking system, as noted above, may perform various operations in connection with the tracking of jobs performed by the data storage service. One of these operations may be to initiate jobs, which may include determining an identifier for the job. The job tracking system may, for instance, provide a serial number or other identifier for the job. Accordingly, the process 800 includes receiving 806 a job identifier from the job tracking system. Once the job identifier has been received 806 from the job tracking system, a system performing the process 800 may provide 808 a response to the user with the job identifier. In this manner, the user may submit requests to a data storage system with the job identifier in order to perform various operations in connection with the job, such as checking the status of the job, canceling the job and/or downloading data retrieved as part of processing of the job.
Once a job has been initiated by a data storage service, the data storage service may process the job.
When object metadata has been obtained, the process 900 may include decrypting 908 a wrapped key from the data object metadata to obtain the content encryption key that was used to encrypt the shards. Decrypting 908 the wrapped key may include various appropriate operations such as accessing the key encrypting key that was used to encrypt the content encryption key, thereby generating the wrapped key, and using the key encrypting key to decrypt to the wrapped key and thereby obtain the content encryption key. Further, decrypting 908 the key encrypting key may include determining the serial number of the key encrypting key from the data object metadata and using the serial number to access the corresponding key encrypting key.
Once the content encryption key has been decrypted 908 using the appropriate key encrypting key, the process 900 may include using 910 the content encryption key to decrypt the shards. Once the shards have been decrypted 910, the process 900 may include reconstructing 912 the data object. For example, the data object may be reconstructed 912 in accordance with the scheme (e.g., erasure encoding scheme) that was used to generate the shards. Once the data object has been reconstructed 912 the process 900 may include transmitting the data object to a download staging system such as described above in connection with
Once the data object has been transmitted 914 to the download staging system, the process 900 may include causing 916 a notification of data object availability to be issued. For example, referring to the environment comprising the data storage service 300 in
As with all processes described herein, variations are considered as being within the scope of the present disclosure. For example, as noted above, data objects may be encrypted and then deconstructed into shards. In such embodiments, the process 900 may be adapted such that the encrypted data object is reconstructed using a set of shards and then the reconstructed encrypted data object can be decrypted. Other variations are also considered as being within the scope of the present disclosure.
In various embodiments, once the data object is available for retrieval, a user may, through a web interface system obtain the data object.
As noted above, various embodiments of the present disclosure allow for rotation of a key encrypting key for various purposes related to security enhancement.
As illustrated in
Upon updating 1106 the disaster recovery file, the process 1100 may include replacing 1108 the previous key encrypting key with the new key encrypting key. Replacing 1108 the previous key encrypting key with the new key encrypting key may be performed in any suitable manner, such as by marking the new key encrypting key electronically as available for use in performing cryptographic operations. Generally, one or more systems that utilize the key encrypting key may be updated so that the new key encrypting key is used to perform subsequent encryption operations. It should be noted that the previous key encrypting key may remain available to enable decryption of wrapped keys encrypted under the previous key encrypting key.
It should be noted that the disaster recovery file 1200 illustrated diagrammatically in
As noted above, a disaster recovery may be used in instances where key encrypting keys are lost. For example, if one or more systems having access to a key encrypting key through malfunction or malfeasance destroy access to a key encrypting key, a disaster recovery file may be used to regain access to the key encrypting key.
In an embodiment the process 1300 includes detecting 1302, the loss of a key encrypting key. Detecting 1302, the loss of the key encrypting key, may be performed in various ways in accordance with various embodiments. For example, detection of a key encrypting key loss may be performed upon detection of a failure to perform one or more operations using the key encrypting key. As another example, one or more forensic processes may detect loss of the key encrypting key, such as in response to a detected security breach of a data storage system. Upon detection 1302 of loss of the key encrypting key, the process 1300 may include accessing 1304 a disaster recovery file. For example, in various embodiments of the present disclosure, a disaster recovery file is stored redundantly in a data storage device 312, discussed above in connection with
In some embodiments, each data storage device designated for storage of data objects (e.g., by storing shards for data objects) may include a disaster recovery file, although it should be noted that not all drives of the data storage system need to store a disaster recovery file. For instance, a designated set of data storage devices may store a disaster recovery file. The set may be selected to attain sufficient redundancy. For instance, enough data storage devices may be selected to ensure a calculated likelihood (e.g., 99.99999999999%, referred to as a durability measure) that the disaster recovery file is accessible and incorrupt. In addition, the disaster recovery files may be stored using other data storage systems and/or generally data storage file may be stored in any place where it can be accessed. In some embodiments, the disaster recovery file is stored in a manner calculated to ensure a measure of durability for the disaster recovery file that is greater than a durability ensured for data objects according to a service-level agreement of the data storage system. Ensuring the durability of the disaster recovery file may include storing the disaster recovery file in multiple storage devices, on multiple server racks, in multiple data centers, in multiple geographic regions and/or in other ways that decrease any likelihood of loss of the disaster recovery file. Further, one or more disaster recovery files may be deconstructed into shards as discussed above.
Accordingly, accessing 1304 the disaster recovery file may include reading the disaster file from a data storage location in which the disaster recovery file is stored (or, if the disaster recovery is stored as distributed shards, accessing enough shards to reconstruct the disaster recovery file). Once the disaster recovery file has been accessed 1304, the process 1300 may include decrypting 1306, the lost key encrypting key, using the disaster recovery private key. It should be noted that due to the sensitive nature of the disaster recovery private key, one or more security protocols may be required to be followed in order to decrypt the lost key encrypting key using the disaster recovery private key. For example, such security protocols may require a predefined quorum of human operators present in order to utilize the disaster recovery private key. In one example, the disaster recovery private key is stored in a secure computing device accessible to one or more operators. The one or more operators may be unable to access the secure computing device without security credentials from one or more other human operators. Generally any manner of ensuring that the disaster recovery private key is used for legitimate (e.g., authorized) purposes may be used.
Once the lost key encrypting key has been decrypted 1306 using the disaster recovery private key, the process 1300 may include restoring 1308 the key encrypting key. Restoring the key encrypting key may be performed by providing the key encrypting key to one or more devices that utilize the key encrypting key, such as one or more devices comprising the data encryption system 308 described above in connection with
Providing the key encrypting key to the devices may be performed in various ways. In some examples, one or more security protocols are followed in order to securely transfer the key encrypting key to one or more appropriate devices. For example, a device may utilize a public private key pair for a public key cryptographic algorithm. The public key of the public private key pair may be used to encrypt the key encrypting key. The encrypted key encrypting key may be transferred to the device, which may then use its private key from the public private key pair to decrypt the key encrypting key. As an alternative or in addition, the key encrypting key can be transmitted over a secure (e.g., SSL) channel.
Despite best efforts, data storage systems may experience security breaches. For example, an employee of an organization with access to certain systems of a data storage system may through error or malicious intent compromise a key encrypting key. Various techniques of the present disclosure allow a data storage system to recover from such security breaches.
Upon detection 1402 of compromise of the key encrypting key, a key rotation process may be performed 1404, such as described above. However, because of the key encrypting key being compromised the process 1400 may include one or more additional operations in order to prevent unauthorized use of the compromised key encrypting key to access encrypted data. In an embodiment, the process 1400 includes updating 1406, an anti-entropy system (or, generally, an anti-entropy process that is to be performed), to update wrapped keys that were generated using the compromised key encrypting key. The anti-entropy process may be updated to search for and re-wrap the content encryption key wrapped by the wrapped key. In one example, the compromised key encrypting key may be decrypted using the old key encrypting key and re-encrypted using the new key encrypting key. As another example, the wrapped key may simply be encrypted using the new key encrypting key. In this latter example, metadata may be maintained to indicate that both the compromised key encrypting key and the new key encrypting key are necessary for accessing the content encryption key from the wrapped key. For example, the wrapped key may first be decrypted with the new key encrypting key and the result may then be decrypted using the compromised key encrypting key to obtain the content encryption key.
As noted above, numerous variations are considered as being within the scope of the present disclosure. For example, in some embodiments, each shard of a data object is encrypted using a different content encryption key. As another example, iterations of the above techniques may be performed, such as by deconstructing shards to sub-shards using a redundancy encoding scheme. In this example, shards may themselves be considered to be data objects. Also, additional layers of encryption may be applied to data and, generally, other variations are considered as being within the scope of the present disclosure.
In addition to the foregoing, the techniques described herein are adaptable to variations in how data objects are processed for storage by a data storage system. For instance, intermediate operations may be performed to deconstruct a data object into shards. For example, some data objects, such as data objects exceeding some size threshold, may be deconstructed into smaller sub-objects. Data objects exceeding a certain size may be divided into smaller objects of a smaller size, where all of the smaller objects have equal size except for perhaps one of the objects (a remainder object, when the data object has a size that is not an integer multiple of the smaller size). As an illustrative example, a 31.4 megabyte (MB) data object may be divided into thirty-two sub-objects, the first thirty-one objects having a size of 1 MB and the last sub-object having a size of 0.4 MB. The first sub-object may consist of the first MB of bits of the object, the second sub-object may consist of the next MB of bits, and the like. To track the storage of data in the data storage system, the data storage system may utilize manifest files, where a manifest file is a file that enables the location of the sub-objects of a data object. As an example, an identifier for a data object may encode a location for a corresponding manifest file. The manifest file may encode locations for the sub-objects of the data object or, if the data object is particularly large, the manifest file may encode locations for other manifest files, which in turn, encode locations for other manifest files or sub-objects, and so on.
When a data storage system utilizes manifest files in this manner, the techniques described herein are applicable with slight variation. For example, the data object may be encrypted with a content encryption key and the encrypted data object can be decomposed into sub-objects, each of which can be decomposed into shards that are stored among the storage devices of the data storage system. As another example, the data object can be decomposed into sub-objects, which can then be encrypted and then decomposed into shards which are stored among the storage devices of the data storage system. In this example, each sub-object may be encrypted with the same or a different content encrypting key (or, generally, N sub-objects can be encrypted using K content encrypting keys, where N and K are integers and N is greater than or equal to K). As yet another example, a data object can be decomposed into sub-objects, which are then decomposed into shards that are then encrypted using one or more content encrypting keys (generally, if N shards can be encrypted using K content encrypting keys, where N and K are integers where N is greater than or equal to K). In each of these examples, the content encrypting key(s) may be then encrypted using a key encrypting key, such as described above. Other variations are also considered as being within the scope of the present disclosure.
As illustrated in
The illustrative environment includes at least one application server 1508 and a data store 1510. It should be understood that there can be several application servers, layers or other elements, processes or components, which may be chained or otherwise configured, which can interact to perform tasks such as obtaining data from an appropriate data store. Servers, as used herein, may be implemented in various ways, such as hardware devices or virtual computer systems. In some contexts, servers may refer to a programming module being executed on a computer system. As used herein the term “data store” refers to any device or combination of devices capable of storing, accessing and retrieving data, which may include any combination and number of data servers, databases, data storage devices and data storage media, in any standard, distributed or clustered environment. The application server can include any appropriate hardware and software for integrating with the data store as needed to execute aspects of one or more applications for the client device, handling some (even a majority) of the data access and business logic for an application. The application server may provide access control services in cooperation with the data store and is able to generate content such as text, graphics, audio and/or video to be transferred to the user, which may be served to the user by the web server in the form of HyperText Markup Language (“HTML”), Extensible Markup Language (“XML”) or another appropriate structured language in this example. The handling of all requests and responses, as well as the delivery of content between the client device 1502 and the application server 1508, can be handled by the web server. It should be understood that the web and application servers are not required and are merely example components, as structured code discussed herein can be executed on any appropriate device or host machine as discussed elsewhere herein. Further, operations described herein as being performed by a single device may, unless otherwise clear from context, be performed collectively by multiple devices, which may form a distributed system.
The data store 1510 can include several separate data tables, databases or other data storage mechanisms and media for storing data relating to a particular aspect of the present disclosure. For example, the data store illustrated may include mechanisms for storing production data 1512 and user information 1516, which can be used to serve content for the production side. The data store also is shown to include a mechanism for storing log data 1514, which can be used for reporting, analysis or other such purposes. It should be understood that there can be many other aspects that may need to be stored in the data store, such as for page image information and to access right information, which can be stored in any of the above listed mechanisms as appropriate or in additional mechanisms in the data store 1510. The data store 1510 is operable, through logic associated therewith, to receive instructions from the application server 1508 and obtain, update or otherwise process data in response thereto. In one example, a user, through a device operated by the user, might submit a search request for a certain type of item. In this case, the data store might access the user information to verify the identity of the user and can access the catalog detail information to obtain information about items of that type. The information then can be returned to the user, such as in a results listing on a web page that the user is able to view via a browser on the user device 1502. Information for a particular item of interest can be viewed in a dedicated page or window of the browser. It should be noted, however, that embodiments of the present disclosure are not necessarily limited to the context of web pages, but may be more generally applicable to processing requests in general, where the requests are not necessarily requests for content.
Each server typically will include an operating system that provides executable program instructions for the general administration and operation of that server and typically will include a computer-readable storage medium (e.g., a hard disk, random access memory, read only memory, etc.) storing instructions that, when executed by a processor of the server, allow the server to perform its intended functions. Suitable implementations for the operating system and general functionality of the servers are known or commercially available and are readily implemented by persons having ordinary skill in the art, particularly in light of the disclosure herein.
The environment in one embodiment is a distributed computing environment utilizing several computer systems and components that are interconnected via communication links, using one or more computer networks or direct connections. However, it will be appreciated by those of ordinary skill in the art that such a system could operate equally well in a system having fewer or a greater number of components than are illustrated in
The various embodiments further can be implemented in a wide variety of operating environments, which in some cases can include one or more user computers, computing devices or processing devices which can be used to operate any of a number of applications. User or client devices can include any of a number of general purpose personal computers, such as desktop, laptop or tablet computers running a standard operating system, as well as cellular, wireless and handheld devices running mobile software and capable of supporting a number of networking and messaging protocols. Such a system also can include a number of workstations running any of a variety of commercially-available operating systems and other known applications for purposes such as development and database management. These devices also can include other electronic devices, such as dummy terminals, thin-clients, gaming systems and other devices capable of communicating via a network.
Various embodiments of the present disclosure utilize at least one network that would be familiar to those skilled in the art for supporting communications using any of a variety of commercially-available protocols, such as Transmission Control Protocol/Internet Protocol (“TCP/IP”), protocols operating in various layers of the Open System Interconnection (“OSI”) model, File Transfer Protocol (“FTP”), Universal Plug and Play (“UpnP”), Network File System (“NFS”), Common Internet File System (“CIFS”) and AppleTalk. The network can be, for example, a local area network, a wide-area network, a virtual private network, the Internet, an intranet, an extranet, a public switched telephone network, an infrared network, a wireless network and any combination thereof.
In embodiments utilizing a web server, the web server can run any of a variety of server or mid-tier applications, including Hypertext Transfer Protocol (“HTTP”) servers, FTP servers, Common Gateway Interface (“CGI”) servers, data servers, Java servers and business application servers. The server(s) also may be capable of executing programs or scripts in response requests from user devices, such as by executing one or more web applications that may be implemented as one or more scripts or programs written in any programming language, such as Java®, C, C# or C++, or any scripting language, such as Perl, Python or TCL, as well as combinations thereof. The server(s) may also include database servers, including without limitation those commercially available from Oracle®, Microsoft®, Sybase® and IBM®.
The environment can include a variety of data stores and other memory and storage media as discussed above. These can reside in a variety of locations, such as on a storage medium local to (and/or resident in) one or more of the computers or remote from any or all of the computers across the network. In a particular set of embodiments, the information may reside in a storage-area network (“SAN”) familiar to those skilled in the art. Similarly, any necessary files for performing the functions attributed to the computers, servers or other network devices may be stored locally and/or remotely, as appropriate. Where a system includes computerized devices, each such device can include hardware elements that may be electrically coupled via a bus, the elements including, for example, at least one central processing unit (“CPU” or “processor”), at least one input device (e.g., a mouse, keyboard, controller, touch screen or keypad) and at least one output device (e.g., a display device, printer or speaker). Such a system may also include one or more storage devices, such as disk drives, optical storage devices and solid-state storage devices such as random access memory (“RAM”) or read-only memory (“ROM”), as well as removable media devices, memory cards, flash cards, etc.
Such devices also can include a computer-readable storage media reader, a communications device (e.g., a modem, a network card (wireless or wired), an infrared communication device, etc.) and working memory as described above. The computer-readable storage media reader can be connected with, or configured to receive, a computer-readable storage medium, representing remote, local, fixed and/or removable storage devices as well as storage media for temporarily and/or more permanently containing, storing, transmitting and retrieving computer-readable information. The system and various devices also typically will include a number of software applications, modules, services or other elements located within at least one working memory device, including an operating system and application programs, such as a client application or web browser. It should be appreciated that alternate embodiments may have numerous variations from that described above. For example, customized hardware might also be used and/or particular elements might be implemented in hardware, software (including portable software, such as applets) or both. Further, connection to other computing devices such as network input/output devices may be employed.
Storage media and computer readable media for containing code, or portions of code, can include any appropriate media known or used in the art, including storage media and communication media, such as, but not limited to, volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage and/or transmission of information, such as computer readable instructions, data structures, program modules or other data, including RAM, ROM, Electrically Erasable Programmable Read-Only Memory (“EEPROM”), flash memory or other memory technology, Compact Disc Read-Only Memory (“CD-ROM”), digital versatile disk (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices or any other medium which can be used to store the desired information and which can be accessed by the system device. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the various embodiments.
The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. It will, however, be evident that various modifications and changes may be made thereunto without departing from the broader spirit and scope of the invention as set forth in the claims.
Other variations are within the spirit of the present disclosure. Thus, while the disclosed techniques are susceptible to various modifications and alternative constructions, certain illustrated embodiments thereof are shown in the drawings and have been described above in detail. It should be understood, however, that there is no intention to limit the invention to the specific form or forms disclosed, but on the contrary, the intention is to cover all modifications, alternative constructions and equivalents falling within the spirit and scope of the invention, as defined in the appended claims.
The use of the terms “a” and “an” and “the” and similar referents in the context of describing the disclosed embodiments (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The terms “comprising,” “having,” “including” and “containing” are to be construed as open-ended terms (i.e., meaning “including, but not limited to,”) unless otherwise noted. The term “connected,” when unmodified and referring to physical connections, is to be construed as partly or wholly contained within, attached to or joined together, even if there is something intervening. Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein and each separate value is incorporated into the specification as if it were individually recited herein. The use of the term “set” (e.g., “a set of items”) or “subset” unless otherwise noted or contradicted by context, is to be construed as a nonempty collection comprising one or more members. Further, unless otherwise noted or contradicted by context, the term “subset” of a corresponding set does not necessarily denote a proper subset of the corresponding set, but the subset and the corresponding set may be equal.
Conjunctive language, such as phrases of the form “at least one of A, B, and C,” or “at least one of A, B and C,” unless specifically stated otherwise or otherwise clearly contradicted by context, is otherwise understood with the context as used in general to present that an item, term, etc., may be either A or B or C, or any nonempty subset of the set of A and B and C. For instance, in the illustrative example of a set having three members used in the above conjunctive phrase, “at least one of A, B, and C” and “at least one of A, B and C” refers to any of the following sets: {A}, {B}, {C}, {A, B}, {A, C}, {B, C}, {A, B, C}. Thus, such conjunctive language is not generally intended to imply that certain embodiments require at least one of A, at least one of B and at least one of C to each be present.
Operations of processes described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. Processes described herein (or variations and/or combinations thereof) may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs or one or more applications) executing collectively on one or more processors, by hardware or combinations thereof. The code may be stored on a computer-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The computer-readable storage medium may be non-transitory.
The use of any and all examples, or exemplary language (e.g., “such as”) provided herein, is intended merely to better illuminate embodiments of the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.
Preferred embodiments of this disclosure are described herein, including the best mode known to the inventors for carrying out the invention. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate and the inventors intend for embodiments of the present disclosure to be practiced otherwise than as specifically described herein. Accordingly, the scope of the present disclosure includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the scope of the present disclosure unless otherwise indicated herein or otherwise clearly contradicted by context.
All references, including publications, patent applications and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.
This application is a divisional of U.S. patent application Ser. No. 13/919,701, filed on Jun. 17, 2013, entitled “REDUNDANT KEY MANAGEMENT,” which incorporates by reference for all purposes the full disclosure of U.S. patent application Ser. No. 13/569,994, filed on Aug. 8, 2012, entitled “ARCHIVAL DATA IDENTIFICATION,” U.S. patent application Ser. No. 13/570,088, filed on Aug. 8, 2012, entitled “ARCHIVAL DATA STORAGE SYSTEM,” U.S. patent application Ser. No. 13/570,092, filed on Aug. 8, 2012, entitled “ARCHIVAL DATA FLOW MANAGEMENT,” and U.S. patent application Ser. No. 13/069,065, filed on Mar. 22, 2011 entitled “MODULAR MASS STORAGE SYSTEM.”
Number | Name | Date | Kind |
---|---|---|---|
5239640 | Froemke et al. | Aug 1993 | A |
5282099 | Kawagoe et al. | Jan 1994 | A |
5506809 | Csoppenszky et al. | Apr 1996 | A |
5586291 | Lasker et al. | Dec 1996 | A |
5701407 | Matsumoto et al. | Dec 1997 | A |
5737745 | Matsumoto et al. | Apr 1998 | A |
5751997 | Kullick et al. | May 1998 | A |
5757617 | Sherry | May 1998 | A |
5900007 | Nunnelley et al. | May 1999 | A |
6023710 | Steiner et al. | Feb 2000 | A |
6138126 | Hitz | Oct 2000 | A |
6208999 | Spilo et al. | Mar 2001 | B1 |
6234591 | Driscoll et al. | May 2001 | B1 |
6374264 | Bohannon et al. | Apr 2002 | B1 |
6543029 | Sandorfi | Apr 2003 | B1 |
6578127 | Sinclair | Jun 2003 | B1 |
6604224 | Armstrong et al. | Aug 2003 | B1 |
6606629 | DeKoning et al. | Aug 2003 | B1 |
6747825 | Ng et al. | Jun 2004 | B1 |
6768863 | Ando et al. | Jul 2004 | B2 |
6950967 | Brunnett et al. | Sep 2005 | B1 |
6959326 | Day et al. | Oct 2005 | B1 |
7057981 | Kano et al. | Jun 2006 | B2 |
7076604 | Thelin | Jul 2006 | B1 |
7120737 | Thelin | Oct 2006 | B1 |
7200008 | Bhugra | Apr 2007 | B1 |
7269733 | O'Toole | Sep 2007 | B1 |
7340490 | Teloh et al. | Mar 2008 | B2 |
7359186 | Honda et al. | Apr 2008 | B2 |
7409495 | Kekre et al. | Aug 2008 | B1 |
7487316 | Hall et al. | Feb 2009 | B1 |
7487385 | Rodrigues et al. | Feb 2009 | B2 |
7577689 | Masinter et al. | Aug 2009 | B1 |
7587749 | Leser | Sep 2009 | B2 |
7644061 | Fallis et al. | Jan 2010 | B1 |
7685309 | Caronni et al. | Mar 2010 | B2 |
7730071 | Iwasaki et al. | Jun 2010 | B2 |
7774466 | Coates et al. | Aug 2010 | B2 |
7814078 | Forman et al. | Oct 2010 | B1 |
7827201 | Gordon et al. | Nov 2010 | B1 |
7840878 | Tang et al. | Nov 2010 | B1 |
7929551 | Dietrich et al. | Apr 2011 | B2 |
7937369 | Dings et al. | May 2011 | B1 |
8006125 | Meng et al. | Aug 2011 | B1 |
8015158 | Mankovsky et al. | Sep 2011 | B1 |
8019925 | Vogan et al. | Sep 2011 | B1 |
8060473 | Dhumale et al. | Nov 2011 | B1 |
8090104 | Wajs | Jan 2012 | B2 |
8091137 | Karp | Jan 2012 | B2 |
8130554 | Linnell | Mar 2012 | B1 |
8156381 | Tamura et al. | Apr 2012 | B2 |
8191841 | Jeffrey et al. | Jun 2012 | B2 |
8266433 | Przykucki | Sep 2012 | B1 |
8291170 | Zhang et al. | Oct 2012 | B1 |
8331095 | Hu et al. | Dec 2012 | B2 |
8352439 | Lee et al. | Jan 2013 | B1 |
8370315 | Efstathopoulos et al. | Feb 2013 | B1 |
8464133 | Grube et al. | Jun 2013 | B2 |
8473816 | Zvibel | Jun 2013 | B2 |
8554918 | Douglis | Oct 2013 | B1 |
8595596 | Grube et al. | Nov 2013 | B2 |
8620870 | Dwarampudi | Dec 2013 | B2 |
8646061 | Oh | Feb 2014 | B2 |
8671076 | Price et al. | Mar 2014 | B2 |
8699159 | Malina | Apr 2014 | B1 |
8726406 | Catrein | May 2014 | B2 |
8781969 | Khandelwal | Jul 2014 | B2 |
8806502 | Gargash et al. | Aug 2014 | B2 |
8826036 | Snodgrass | Sep 2014 | B1 |
8838911 | Hubin et al. | Sep 2014 | B1 |
8898114 | Feathergill et al. | Nov 2014 | B1 |
8935221 | Lazier | Jan 2015 | B1 |
8959067 | Patiejunas et al. | Feb 2015 | B1 |
8972677 | Jones | Mar 2015 | B1 |
8990215 | Reztlaff, II et al. | Mar 2015 | B1 |
9047306 | Frolund et al. | Jun 2015 | B1 |
9053212 | Beckey | Jun 2015 | B2 |
9372854 | Gold et al. | Jun 2016 | B2 |
20020055942 | Reynolds | May 2002 | A1 |
20020091903 | Mizuno | Jul 2002 | A1 |
20020103815 | Duvillier et al. | Aug 2002 | A1 |
20020122203 | Matsuda | Sep 2002 | A1 |
20020161972 | Talagala et al. | Oct 2002 | A1 |
20020186844 | Levy et al. | Dec 2002 | A1 |
20030033308 | Patel et al. | Feb 2003 | A1 |
20030149717 | Heinzman | Aug 2003 | A1 |
20040003272 | Bantz et al. | Jan 2004 | A1 |
20040057203 | Rabinovitz | Mar 2004 | A1 |
20040098565 | Rohlman et al. | May 2004 | A1 |
20040243737 | Beardsley et al. | Dec 2004 | A1 |
20050057898 | El-Batal et al. | Mar 2005 | A1 |
20050114338 | Borthakur et al. | May 2005 | A1 |
20050117462 | Kano et al. | Jun 2005 | A1 |
20050160427 | Ustaris | Jul 2005 | A1 |
20050187897 | Pawar et al. | Aug 2005 | A1 |
20050203976 | Hyun et al. | Sep 2005 | A1 |
20050262378 | Sleeman et al. | Nov 2005 | A1 |
20050267935 | Gandhi et al. | Dec 2005 | A1 |
20050270681 | Suzuki et al. | Dec 2005 | A1 |
20060005074 | Yanai et al. | Jan 2006 | A1 |
20060015529 | Yagawa | Jan 2006 | A1 |
20060020594 | Garg et al. | Jan 2006 | A1 |
20060087760 | Forrer et al. | Apr 2006 | A1 |
20060095741 | Asher et al. | May 2006 | A1 |
20060107266 | Martin et al. | May 2006 | A1 |
20060187634 | Tanaka et al. | Aug 2006 | A1 |
20060190510 | Gabryjelski et al. | Aug 2006 | A1 |
20060272023 | Schmeidler et al. | Nov 2006 | A1 |
20070011472 | Cheng | Jan 2007 | A1 |
20070050479 | Yoneda | Mar 2007 | A1 |
20070053169 | Carlson et al. | Mar 2007 | A1 |
20070079087 | Wang et al. | Apr 2007 | A1 |
20070091559 | Malone | Apr 2007 | A1 |
20070101095 | Gorobets | May 2007 | A1 |
20070156842 | Vermeulen et al. | Jul 2007 | A1 |
20070174362 | Pham et al. | Jul 2007 | A1 |
20070198789 | Clark et al. | Aug 2007 | A1 |
20070233781 | Starr et al. | Oct 2007 | A1 |
20070250674 | Fineberg et al. | Oct 2007 | A1 |
20070266037 | Terry et al. | Nov 2007 | A1 |
20070282969 | Dietrich et al. | Dec 2007 | A1 |
20070283046 | Dietrich et al. | Dec 2007 | A1 |
20080059483 | Williams et al. | Mar 2008 | A1 |
20080068899 | Ogihara et al. | Mar 2008 | A1 |
20080099235 | Hiramoto et al. | May 2008 | A1 |
20080109478 | Wada et al. | May 2008 | A1 |
20080120164 | Hassler | May 2008 | A1 |
20080168108 | Molaro et al. | Jul 2008 | A1 |
20080177697 | Barsness et al. | Jul 2008 | A1 |
20080212225 | Ito et al. | Sep 2008 | A1 |
20080235485 | Haertel et al. | Sep 2008 | A1 |
20080263363 | Jueneman | Oct 2008 | A1 |
20080285366 | Fujiwara | Nov 2008 | A1 |
20080294764 | Wakako | Nov 2008 | A1 |
20090013123 | Hsieh | Jan 2009 | A1 |
20090070537 | Cho | Mar 2009 | A1 |
20090083476 | Pua et al. | Mar 2009 | A1 |
20090113167 | Gamble et al. | Apr 2009 | A1 |
20090132676 | Tu et al. | May 2009 | A1 |
20090144568 | Fung | Jun 2009 | A1 |
20090150641 | Flynn et al. | Jun 2009 | A1 |
20090157700 | Van Vugt | Jun 2009 | A1 |
20090164506 | Barley et al. | Jun 2009 | A1 |
20090198736 | Shen et al. | Aug 2009 | A1 |
20090198889 | Ito et al. | Aug 2009 | A1 |
20090213487 | Luan et al. | Aug 2009 | A1 |
20090234883 | Hurst et al. | Sep 2009 | A1 |
20090240750 | Seo | Sep 2009 | A1 |
20090254572 | Redlich | Oct 2009 | A1 |
20090265568 | Jackson | Oct 2009 | A1 |
20090300403 | Little | Dec 2009 | A1 |
20100017446 | Choi et al. | Jan 2010 | A1 |
20100037056 | Follis et al. | Feb 2010 | A1 |
20100094819 | Bornhoevd et al. | Apr 2010 | A1 |
20100169544 | Eom et al. | Jul 2010 | A1 |
20100217927 | Song et al. | Aug 2010 | A1 |
20100223259 | Mizrahi | Sep 2010 | A1 |
20100228711 | Li et al. | Sep 2010 | A1 |
20100235409 | Roy et al. | Sep 2010 | A1 |
20100241619 | Snider | Sep 2010 | A1 |
20100242096 | Varadharajan et al. | Sep 2010 | A1 |
20110026942 | Naito | Feb 2011 | A1 |
20110035757 | Comer | Feb 2011 | A1 |
20110055559 | Li | Mar 2011 | A1 |
20110058277 | de la Fuente et al. | Mar 2011 | A1 |
20110060775 | Fitzgerald | Mar 2011 | A1 |
20110078407 | Lewis | Mar 2011 | A1 |
20110099324 | Yeh | Apr 2011 | A1 |
20110161679 | Grube et al. | Jun 2011 | A1 |
20110185099 | Stuhlsatz et al. | Jul 2011 | A1 |
20110225417 | Maharajh et al. | Sep 2011 | A1 |
20110231597 | Lai et al. | Sep 2011 | A1 |
20110246716 | Frame et al. | Oct 2011 | A1 |
20110247074 | Manring et al. | Oct 2011 | A1 |
20110258630 | Fee et al. | Oct 2011 | A1 |
20110264717 | Grube et al. | Oct 2011 | A1 |
20110265143 | Grube et al. | Oct 2011 | A1 |
20110276656 | Knapp et al. | Nov 2011 | A1 |
20110282839 | Paksoy et al. | Nov 2011 | A1 |
20110289383 | Dhuse et al. | Nov 2011 | A1 |
20110307657 | Timashev et al. | Dec 2011 | A1 |
20120030411 | Wang et al. | Feb 2012 | A1 |
20120079562 | Anttila et al. | Mar 2012 | A1 |
20120137062 | Arges et al. | May 2012 | A1 |
20120143830 | Cormode et al. | Jun 2012 | A1 |
20120150528 | Upadhyaya et al. | Jun 2012 | A1 |
20120166576 | Orsini et al. | Jun 2012 | A1 |
20120173392 | Kirby et al. | Jul 2012 | A1 |
20120210092 | Feldman | Aug 2012 | A1 |
20120233432 | Feldman et al. | Sep 2012 | A1 |
20120284719 | Phan et al. | Nov 2012 | A1 |
20120306912 | Blanco et al. | Dec 2012 | A1 |
20120311260 | Yamagiwa et al. | Dec 2012 | A1 |
20130046974 | Kamara et al. | Feb 2013 | A1 |
20130145371 | Brunswig et al. | Jun 2013 | A1 |
20130177157 | Li | Jul 2013 | A1 |
20130254166 | Kottomtharayil | Sep 2013 | A1 |
20130290263 | Beaverson et al. | Oct 2013 | A1 |
20140052706 | Misra et al. | Feb 2014 | A1 |
20140064478 | Ramaswamy | Mar 2014 | A1 |
20140068208 | Feldman | Mar 2014 | A1 |
20140149794 | Shetty et al. | May 2014 | A1 |
20140161123 | Starks et al. | Jun 2014 | A1 |
20150067351 | Wang | Mar 2015 | A1 |
20150082458 | Cooper et al. | Mar 2015 | A1 |
Number | Date | Country |
---|---|---|
1487451 | Apr 2004 | CN |
1799051 | Jul 2006 | CN |
101043372 | Sep 2007 | CN |
101496005 | Jul 2009 | CN |
H05113968 | May 1993 | JP |
H06149739 | May 1994 | JP |
H11259321 | Sep 1999 | JP |
2000023075 | Jan 2000 | JP |
2002278844 | Sep 2002 | JP |
2005122311 | May 2005 | JP |
2006526837 | Nov 2006 | JP |
2007299308 | Nov 2007 | JP |
2008299396 | Dec 2008 | JP |
2011043968 | Mar 2011 | JP |
20020088574 | Nov 2002 | KR |
20070058281 | Feb 2016 | KR |
0227489 | Apr 2002 | WO |
Entry |
---|
U.S. Appl. No. 13/069,065, filed Mar. 22, 2011. |
U.S. Appl. No. No. 13/569,994, filed Aug. 8, 2012. |
U.S. Appl. No. 13/570,088, filed Aug. 8, 2012. |
U.S. Appl. No. 13/570,092, filed Aug. 8, 2012. |
Advanced Computer & Network Corporation, “RAID Level 5: Independent Data Disks With Distributed Parity Blocks”, May 12, 2011, retrieved from https://web.archive.org/web/20110512213916/http://www.acnc.com/raidedu/5, 2 pages. |
Advanced Computer & Network Corporation, “RAID level 6: Independent Data Disks With Two Independent Parity Schemes”, May 7, 2011, retrieved from https://web.archive.org/web/20110507215950/http://www.acnc.com/raidedu/6, 2 pages. |
Amazon Web Services, “Amazon Elastic MapReduce Developer Guide,” API Version Nov. 30, 2009, dated Jun. 12, 2012, retrieved on Jun. 22, 2015, from https://web.archive.org/web/20120612043953/http://s3.amazonaws.com/awsdocs/ElasticMapReduce/latest/emr-dg.pdf, 318 pages. |
Amazon Web Services, Amazon Glacier Developer Guide, API Version Jun. 1, 2012, dated Aug. 20, 2012, retrieved Jun. 22, 2015, from https://web.archive.org/web/20120908043705/http://awsdocs.s3.amazonaws.com/glacier/latest/glacier-dg.pdf, 209 pages. |
Amazon Web Services, “AWS Import/Export Developer Guide,” API Version Jun. 3, 2010, dated Jun. 12, 2012, retrieved Jun. 22, 2015, from https://web.archive.org/web/20120612051330/http://s3.amazonaws.com/awsdocs/ImportExpert/latest/AWSImportExport-dg.pdf, 104 pages. |
Amer et al., “Design Issues for a Shingled Write Disk System,” 26th IEEE Symposium on Massive Storage Systems and Technologies: Research Track (MSST 2010): May 1-12, 2010. |
Chen et al., “RAID: High-Performance, Reliable Secondary Storage,” ACM Computing Surveys 1994, 26:145-185, retrieved on Jan. 11, 2016, from internet https://web.archive.org/web/20040721062927/http://meseec.ce.rit.edu/eecc722-fall2002/papers/io/3/chen94raid.pdf, 69 pages. |
Cisco, “Cisco Standalone HDD Firmware Update Version 3.0—IBM Servers,” Nov. 16, 2010, 5 pages. |
Duan, “Research and Application of Distributed Parallel Search Hadoop Algorithm,” 2012 International Conference on Systems and Informatics (ICSAI 2012), IEEE, May 19, 2012, pp. 2462-2465. |
Gibson et al., “Directions for Shingled-Write and Two-Dimensional Magnetic Recording System Architectures: Synergies with Solid-State Disks (CMU-PDL-09-104),” Carnegie Mellon University Research Showcase, Parallel Data Laboratory, Research Centers and Institutes, pp. 1-3, May 1, 2009. |
IEEE, “The Authoritative Dictionary of IEEE Standards Terms,” Seventh Edition; 2000, p. 836. |
International Search Report and Written Opinion dated Feb. 14, 2014, in International Patent Application No. PCT/US2013/053828, filed Aug. 6, 2013. |
International Search Report and Written Opinion dated Feb. 14, 2014, in International Patent Application No. PCT/US2013/053853, filed Aug. 6, 2013. |
International Search Report and Written Opinion dated Jun. 13, 2012, in International Patent Application No. PCT/US2012/029828, filed Mar. 20, 2012. |
International Search Report and Written Opinion dated Mar. 6, 2014, in International Patent Application No. PCT/US2013/053852, filed Aug. 6, 2013. |
Jacobs et al., “Memory Systems, Cache, DRAM, Disk,” Copyright 2007, Morgan Kaufman, 9 pages. |
Massiglia, “The RAID Book: The Storage System Technology Handbook”, 6th Edition, 1997, pp. 26-27, 84-91, 136-143, and 270-271. |
Merriam-Webster, “Predetermine,” Current Edition of Dictionary, www.merriam-webster.com/dictionary, retrieved on Dec. 15, 2014. |
Micheloni et al., “Inside NAND Flash Memories,” Springer First Edition (ISBN 978-90-481-9430-8):40-42, Aug. 2010. |
Roos, “How to Leverage an API for Conferencing,” Dave Roos, published Jan. 2012, at http://money.howstuffworks.com/businesscommunications/how-to-leverage-an-api-for-conferencing1.htm. |
Rosenblum et al., “The Design and Implementation of a Log-Structured File System,” University of California at Berkley, ACM Transactions on Computer Systems Volume/Issue 10(1):26-52, Feb. 1992. |
Seagate, “Firmware Updates for Seagate Products,” Author Unknown, published Feb. 2012 at http://knowledge.seagate.com/articles/enUS/FAQ/207931en. |
Wikipedia, “Checksum,” from Wayback/Wikipedia at en.wikipedia.org/wiki/checksum, retrieved Mar. 2011, 5 pages. |
Wikipedia, “Error Correction,” from Wayback/Wikipedia.org at en.wikipedia.org/wiki/Error-correcting—code, retrieved Sep. 2010, 7 pages. |
Wikipedia, “Hash Tree,” from Wikipedia.org at http://en.wikipedia.org/wiki/Hash—tree, retrieved Jul. 12, 2012, 1 page. |
Wikipedia, “Process identifier,” dated Sep. 3, 2010, retrieved Jul. 9, 2015, from https://en.wikipedia.org/w/index.php?title=Process—identifier&oldid=382695536, 2 pages. |
Yu et al., “Exploiting sequential access when declustering data over disks and MEMS-based storage,” Distributed and Parallel Databases Volume/Issue 19(23):147-168, May 2006. |
Kozierok, “File Allocation Tables,” The PC Guide, Apr. 17, 2001, retrieved Nov. 28, 2016, from http://www.pcguide.com/ref/hdd/file/fatFATs-c.html,2 pages. |
“Decision of Patent Grant, dated Nov. 1, 2017,” Korean Patent Application No. 10-2017-7021593, filed Aug. 5, 2013, 3 pages. |
“Notice on Grant of Patent Right for Invention, dated Nov. 17, 2017,” Chinese Patent Application No. 201380042169.7, filed Aug. 6, 2013, 2 pages. |
“Office Action dated Nov. 20, 2017,” Canadian Patent Application No. 2881567, filed Aug. 6, 2013, 7 pages. |
Number | Date | Country | |
---|---|---|---|
20160154963 A1 | Jun 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13919701 | Jun 2013 | US |
Child | 15004592 | US |