The present disclosure relates generally to systems, apparatus, and methods for distributed data storage, and more particularly to systems, apparatus, and methods for distributed data storage using an information dispersal algorithm so that no one location will store an entire copy of stored data, and more particularly still to systems, apparatus, and methods for ensuring data integrity on a dispersed data storage network.
Although the characteristic features of this disclosure will be particularly pointed out in the claims, the disclosure itself, and the manner in which it may be made and used, may be better understood by referring to the following description taken in connection with the accompanying drawings forming a part hereof, wherein like reference numerals refer to like parts throughout the several views and in which:
Storing data in digital form is a well-known problem associated with all computer systems, and numerous solutions to this problem are known in the art. The simplest solution involves merely storing digital data in a single location, such as a punch film, hard drive, or FLASH memory device. However, storage of data in a single location is inherently unreliable. The device storing the data can malfunction or be destroyed through natural disasters, such as a flood, or through a malicious act, such as arson. In addition, digital data is generally stored in a usable file, such as a document that can be opened with the appropriate word processing software, or a financial ledger that can be opened with the appropriate spreadsheet software. Storing an entire usable file in a single location is also inherently insecure as a malicious hacker only need compromise that one location to obtain access to the usable file.
To address reliability concerns, digital data is often “backed-up,” i.e., an additional copy of the digital data is made and maintained in a separate physical location. For example, a backup tape of all network drives may be made by a small office and maintained at the home of a trusted employee. When a backup of digital data exists, the destruction of either the original device holding the digital data or the backup will not compromise the digital data. However, the existence of the backup exacerbates the security problem, as a malicious hacker can choose between two locations from which to obtain the digital data. Further, the site where the backup is stored may be far less secure than the original location of the digital data, such as in the case when an employee stores the tape in her home.
Another method used to address reliability and performance concerns is the use of a Redundant Array of Independent Drives (“RAID”). RAID refers to a collection of data storage schemes that divide and replicate data among multiple storage units. Different configurations of RAID provide increased performance, improved reliability, or both increased performance and improved reliability. In certain configurations of RAID, when digital data is stored, it is split into multiple units, referred to as “stripes,” each of which is stored on a separate drive. Data striping is performed in an algorithmically certain way so that the data can be reconstructed. While certain RAID configurations can improve reliability, RAID does nothing to address security concerns associated with digital data storage.
Encrypted data is mathematically coded so that only users with access to a certain key can decrypt and use the data. Common forms of encryption include DES, AES, RSA, and others. While modern encryption methods are difficult to break, numerous instances of successful attacks are known, some of which have resulted in valuable data being compromised.
Digitally stored data is subject to degradation over time, although such degradation tends to be extremely minor and the time periods involved tend to be much longer than for analog data storage. Nonetheless, if a single bit within a file comprised of millions of bits changes from a zero to a one or vice versa, the integrity of the file has been compromised, and its usability becomes suspect. Further, errors occur more frequently when digital data is transmitted due to noise in the transmission medium. Various prior art techniques have been devised to detect when a digital data segment has been compromised. One early form of error detection is known as parity, wherein a single bit is appended to each transmitted byte or word of data. The parity bit is set so that the total number of one bits in the transmitted byte or word is either even or odd. The receiving processor then checks the received byte or word for the appropriate parity, and, if it is incorrect, asks that the byte or word be resent.
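By way of illustration only, the parity scheme described above can be sketched as follows. The function names and the choice of even parity over an 8-bit byte are assumptions made for this example and are not drawn from the disclosure.

```python
def add_even_parity(byte: int) -> int:
    """Return a 9-bit word: the 8 data bits followed by one even-parity bit."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("byte must be in the range 0..255")
    # The parity bit is 1 only when the data holds an odd number of one bits,
    # so the total number of one bits in the word is always even.
    parity = bin(byte).count("1") % 2
    return (byte << 1) | parity


def check_even_parity(word: int) -> bool:
    """Return True if the received word holds an even number of one bits."""
    return bin(word).count("1") % 2 == 0


word = add_even_parity(0b10110010)   # four one bits, so the parity bit is 0
intact = check_even_parity(word)     # True: parity matches
damaged = word ^ 0b000010000         # a single bit flipped in transit
detected = not check_even_parity(damaged)  # True: receiver would ask for a resend
```

A single parity bit detects any odd number of flipped bits but misses an even number, which is one reason the stronger checksums discussed next are used.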
Another form of error detection is the use of a checksum. There are many different types of checksums including classic checksums, cryptographic hash functions, digital signatures, cyclic redundancy checks, and the use of human readable “check digits” by the postal service and libraries. All of these techniques involve performing a mathematical calculation over an entire data segment to arrive at a checksum, which is appended to the data segment. For stored data, the checksum for the data segment can be recalculated periodically, and checked against the previously calculated checksum appended to the data segment. For transmitted data, the checksum is calculated by the transmitter and appended to the data segment. The receiver then recalculates the checksum for the received data segment, and if it does not match the checksum appended to the data segment, requests that it be retransmitted.
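As a minimal sketch of the append-and-verify pattern described above, the following uses the CRC-32 function from the Python standard library. The 4-byte big-endian trailer layout and the function names are assumptions chosen for this example only.

```python
import zlib

CRC_LEN = 4  # CRC-32 is stored as a 4-byte trailer (an assumed layout)


def append_checksum(segment: bytes) -> bytes:
    """Return the data segment with its CRC-32 appended."""
    return segment + zlib.crc32(segment).to_bytes(CRC_LEN, "big")


def verify_checksum(framed: bytes) -> bool:
    """Recalculate the CRC-32 and compare it with the appended value."""
    if len(framed) < CRC_LEN:
        return False
    payload, stored = framed[:-CRC_LEN], framed[-CRC_LEN:]
    return zlib.crc32(payload).to_bytes(CRC_LEN, "big") == stored


framed = append_checksum(b"data segment")
ok = verify_checksum(framed)                      # checksum matches
tampered = bytes([framed[0] ^ 0x01]) + framed[1:]
needs_resend = not verify_checksum(tampered)      # mismatch: request retransmission
```

The same two functions serve both cases in the paragraph above: a periodic recheck of stored data and a receiver-side check of transmitted data.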
In 1979, two researchers independently developed a method for splitting data among multiple recipients called “secret sharing.” One of the characteristics of secret sharing is that a piece of data may be split among n recipients, but cannot be known unless at least t recipients share their data, where n ≥ t. For example, a trivial form of secret sharing can be implemented by assigning a single random byte to every recipient but one, who would receive the actual data byte after it had been bitwise exclusive-ORed with the random bytes. In other words, for a group of four recipients, three of the recipients would be given random bytes, and the fourth would be given a byte calculated by the following formula:
s′ = s ⊕ ra ⊕ rb ⊕ rc
where s is the original source data, ra, rb, and rc are random bytes given to three of the four recipients, ⊕ denotes the bitwise exclusive-OR operation, and s′ is the encoded byte given to the fourth recipient. The original byte s can be recovered by bitwise exclusive-ORing all four bytes together.
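The trivial exclusive-OR scheme described above can be illustrated with the following sketch, in which every recipient but the last receives a random byte and the last receives the source byte exclusive-ORed with all of the random bytes. The function names and the use of the standard-library secrets module for randomness are assumptions of this example.

```python
import secrets
from functools import reduce
from typing import List


def split_byte(s: int, n: int = 4) -> List[int]:
    """Split one source byte into n shares; all n are needed to recover it."""
    if n < 2:
        raise ValueError("at least two recipients are required")
    if not 0 <= s <= 0xFF:
        raise ValueError("s must be a single byte")
    randoms = [secrets.randbelow(256) for _ in range(n - 1)]
    # Encoded share: the source byte XORed with every random byte.
    encoded = reduce(lambda a, b: a ^ b, randoms, s)
    return randoms + [encoded]


def recover_byte(shares: List[int]) -> int:
    """XOR all shares together; the random bytes cancel, leaving the source byte."""
    return reduce(lambda a, b: a ^ b, shares, 0)


shares = split_byte(0x5A)          # four shares, as in the four-recipient example
recovered = recover_byte(shares)   # equals 0x5A
```

In this form t equals n: any subset of fewer than all the shares is statistically independent of the source byte, so nothing is learned until every recipient contributes.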
The problem of reconstructing data stored on a digital medium that is subject to damage has also been addressed in the prior art. In particular, Reed-Solomon and Cauchy Reed-Solomon coding are two well-known methods of dividing encoded information into multiple slices so that the original information can be reassembled even if all of the slices are not available. Reed-Solomon coding, Cauchy Reed-Solomon coding, and other data coding techniques are described in “Erasure Codes for Storage Applications,” by Dr. James S. Plank.
While dispersed data storage networks (DDSNs) can theoretically be implemented to provide any desired level of reliability, practical considerations tend to make this impossible in prior art solutions. For example, DDSNs rely on storage media to store data slices. These storage media, like all storage media, will degrade over time. Furthermore, DDSNs rely on numerous transmissions to physically disparate slice servers, and data slices may become corrupted during transmissions. While TCP carries a 16-bit checksum in every transmitted segment, the reliability provided by this checksum is not sufficient for critical data storage.
The present disclosure achieves its objectives by providing an improved method for ensuring the integrity of data stored on a dispersed data storage network. A checksum is calculated for a data segment to be written to a DDSN. The checksum is appended to the data segment, which is sliced into a plurality of data slices. A second set of checksums is calculated for and appended to the different data slices, which are then transmitted to different slice servers. For each receiving slice server, a checksum is calculated for the received data slice, and compared to the checksum appended to the received data slice. If the checksums vary, the receiving slice server marks the data slice as corrupted, and requests that the corrupted data slice be resent.
In another aspect of the present disclosure, a distributed computer system implements a dispersed data storage network. In this system, a rebuilder application periodically recalculates checksums for data slices stored on a plurality of slice servers. Where the calculated checksum does not match the checksum appended to a stored data slice, the data slice is marked as corrupted. The rebuilder application then identifies the stored data segment associated with the corrupted data slice, and issues read requests to other slice servers holding data slices corresponding to the identified data segment. The data segment is rebuilt and re-sliced, and any slice servers containing corrupted data are sent new data slices to replace the corrupted data slices.
Turning to the Figures, and to
As explained herein, the present disclosure works to ensure the integrity of data stored in a DDSN not only by using checksums on each stored data segment and on its constituent data slices, but also by reconstructing corrupted data slices. In accordance with the present disclosure, grid access computers 120, 122 will calculate a checksum for each data segment to be stored, and append the checksum to the data segment prior to slicing. The data segment is then sliced in accordance with an information dispersal algorithm, and checksums are calculated and appended to each of the data slices. The data slices are then forwarded to slice servers 150-162, where the data slices are stored.
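The two levels of checksums described above can be sketched as follows. This is an illustrative sketch only: CRC-32 stands in for whichever checksum is used, and the round-robin splitter named naive_slice is a placeholder that provides no redundancy and is not an information dispersal algorithm. All function names are hypothetical.

```python
import zlib
from typing import List

CRC_LEN = 4  # assumed 4-byte CRC-32 trailer


def with_crc(data: bytes) -> bytes:
    """Append a CRC-32 to the data."""
    return data + zlib.crc32(data).to_bytes(CRC_LEN, "big")


def crc_ok(framed: bytes) -> bool:
    """Check the appended CRC-32 against a freshly calculated one."""
    if len(framed) < CRC_LEN:
        return False
    return zlib.crc32(framed[:-CRC_LEN]).to_bytes(CRC_LEN, "big") == framed[-CRC_LEN:]


def naive_slice(data: bytes, n: int) -> List[bytes]:
    """Placeholder splitter (round-robin striping); NOT a dispersal algorithm."""
    return [data[i::n] for i in range(n)]


def prepare_slices(segment: bytes, n: int) -> List[bytes]:
    """Grid access computer side: checksum the segment, slice it, checksum each slice."""
    framed_segment = with_crc(segment)                 # segment-level checksum, before slicing
    return [with_crc(s) for s in naive_slice(framed_segment, n)]  # slice-level checksums


def slice_server_receive(framed_slice: bytes) -> bool:
    """Slice server side: True means store; False means mark corrupted and request a resend."""
    return crc_ok(framed_slice)


slices = prepare_slices(b"example data segment", 4)
accepted = [slice_server_receive(s) for s in slices]   # every slice passes its check
```

The slice-level checksum lets each slice server validate what it received without seeing any other slice, while the segment-level checksum can only be verified after the segment is reassembled.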
In addition, grid access computers 120, 122 also recreate data slices that have become corrupted, or were destroyed. If during operation of the DDSN 100, it is detected that a particular data slice has been corrupted or destroyed, a different data slice will be requested from a different slice server 150-162. Assuming that sufficient non-corrupted data slices exist to successfully reconstruct the original data segment, the reconstructed data segment will be re-sliced, and the corrupted data slice will be replaced with a non-corrupted version. Further, a rebuilder application operating within the DDSN periodically walks through all data slices stored on the DDSN. When a corrupted data slice is found, the rebuilder application identifies the data segment corresponding to the corrupted data slice, rebuilds the identified data segment, and rewrites the corrupted slice.
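A rebuilder pass of the kind described above can be sketched with a deliberately simple erasure code. This sketch assumes a single exclusive-OR parity slice (k data slices plus one parity slice) as a stand-in for a dispersal algorithm such as Cauchy Reed-Solomon, so it can repair only one corrupted slice per segment, and it regenerates the damaged slice directly rather than rebuilding and re-slicing the whole segment. The list named servers models one stored slice per slice server; all names are hypothetical.

```python
import zlib
from functools import reduce
from typing import List

CRC_LEN = 4  # assumed 4-byte CRC-32 trailer


def with_crc(data: bytes) -> bytes:
    return data + zlib.crc32(data).to_bytes(CRC_LEN, "big")


def crc_ok(framed: bytes) -> bool:
    if len(framed) < CRC_LEN:
        return False
    return zlib.crc32(framed[:-CRC_LEN]).to_bytes(CRC_LEN, "big") == framed[-CRC_LEN:]


def xor_bytes(blocks: List[bytes]) -> bytes:
    """XOR equal-length byte strings together, column by column."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def encode(segment: bytes, k: int) -> List[bytes]:
    """Split into k equal data slices plus one XOR parity slice, each with a CRC."""
    size = -(-len(segment) // k)                 # ceiling division
    padded = segment.ljust(size * k, b"\0")      # zero-pad so all slices match in length
    data = [padded[i * size:(i + 1) * size] for i in range(k)]
    return [with_crc(s) for s in data + [xor_bytes(data)]]


def rebuild_pass(servers: List[bytes]) -> List[int]:
    """Walk every stored slice, repair a corrupted one, and return repaired indices."""
    bad = [i for i, s in enumerate(servers) if not crc_ok(s)]
    if len(bad) > 1:
        raise RuntimeError("a single-parity code cannot repair more than one slice")
    for i in bad:
        good = [s[:-CRC_LEN] for j, s in enumerate(servers) if j != i]
        servers[i] = with_crc(xor_bytes(good))   # XOR of the others regenerates the lost slice
    return bad
```

With a code that tolerates more losses, the same walk would request any m good slices, rebuild the segment, re-slice it, and overwrite each corrupted slice, as the paragraph above describes.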
In step 406, a list of slice servers each holding a required data slice that has yet to be received is assembled, and in step 408, the list is ordered by any applicable criteria. Further information on criteria by which the list may be ordered is contained in U.S. patent application Ser. No. 11/973,622, titled “SMART ACCESS TO A DISPERSED DATA STORAGE NETWORK,” filed on Oct. 9, 2007 and assigned to Pure Storage, Inc. In step 410, read requests are issued to the first k slice servers on the assembled list, where k is at least equal to m, the minimum number of data slices needed to reconstruct the requested data segment, but could be as large as n, the number of data slices that have data relevant to the requested data segment. In step 412, r data slices are received, and in step 414 the number of received data slices r is subtracted from the variable m. In step 416, m is compared to zero, and if m is greater than zero, execution returns to step 406 and proceeds as normal from there. However, if m is equal to zero, a collection of data transformations may optionally be applied to the received slices in step 418. The applied data transformations can include decryption, decompression, and integrity checking. In accordance with the present disclosure, each data slice includes a cyclic redundancy check (“CRC”), or other form of checksum, appended to the data contained in the slice. This checksum will be compared against a checksum calculated by the receiving slice server over the received data to ensure that the data was not corrupted during the transmission process.
In step 420, it is determined if the applied data transformations were successful for all of the received data slices. If the applied data transformations were not successful for some of the received slices, m is incremented by this number in step 422, and execution is resumed at step 406. The data transformations could fail, for example, if an integrity check revealed that a received data slice was corrupted. However, if the applied data transformations were successful for all received data slices, the received slices are assembled into the requested block of data in step 424. The same or different data transformations may optionally be applied to the assembled data block in step 426, which completes the read process. In accordance with the present disclosure, a checksum for the data segment will be calculated and compared to a checksum appended to the assembled data segment.
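The counting logic of the read process can be sketched as a simplified loop in the spirit of steps 406 through 422. The sketch makes several assumptions: slice servers are modeled as a dictionary mapping a hypothetical server name to a stored slice (or None when the server does not answer), exactly as many requests are issued per round as slices are still needed, the integrity check is run on each batch as it arrives, and assembling the slices into a segment is left out because it depends on the dispersal algorithm.

```python
import zlib
from typing import Dict, List, Optional

CRC_LEN = 4  # assumed 4-byte CRC-32 trailer on every slice


def crc_ok(framed: bytes) -> bool:
    if len(framed) < CRC_LEN:
        return False
    return zlib.crc32(framed[:-CRC_LEN]).to_bytes(CRC_LEN, "big") == framed[-CRC_LEN:]


def read_slices(servers: Dict[str, Optional[bytes]], m: int) -> List[bytes]:
    """Collect m valid slice payloads, asking further servers when slices are missing or bad."""
    pending = list(servers)          # step 406: servers that have not yet been asked
    valid: List[bytes] = []
    needed = m
    while needed > 0:                # step 416: keep going while slices are still required
        if len(pending) < needed:
            raise RuntimeError("not enough slice servers left to satisfy the read")
        batch, pending = pending[:needed], pending[needed:]      # step 410: issue requests
        received = [servers[name] for name in batch
                    if servers[name] is not None]                # step 412: r slices arrive
        needed -= len(received)                                  # step 414: m = m - r
        good = [s for s in received if crc_ok(s)]                # step 418: integrity check
        needed += len(received) - len(good)                      # step 422: re-request failures
        valid.extend(s[:-CRC_LEN] for s in good)                 # keep payloads, drop trailers
    return valid
```

Because the dictionary preserves insertion order, the order in which entries are added plays the role of the ordered list of step 408.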
In
A number of data transformations may optionally be applied to each block in step 506, and an information dispersal algorithm is applied in step 508. In particular, the Cauchy Reed-Solomon dispersal algorithm could be applied to the data segment, resulting in a predetermined number of data slices. In step 510, a number of data transformations are optionally applied to each data slice.
In the disclosed system, writes are performed transactionally, meaning that a minimum number of data slices t must be successfully written before a write is deemed complete. Normally, the number of data slices that must be successfully written will be set to n, i.e., the number of slices that the data segment was originally divided into. However, this number can be configured by the user to a lesser number, down to the minimum number of slices required to reconstruct the data. This would allow the user to continue using the DDSN during a minor network outage where one or more slice servers were unavailable. Slices that could not be immediately transmitted and stored could be queued and transmitted when the network outage cleared. In step 512, a write transaction is initiated to the data storage grid. As discussed herein, all slice servers are simultaneously contacted, and in step 514, a confirmation that at least t receiving slice servers are prepared to begin the write transaction, i.e., to store each slice, must be received, or the transaction is rolled back in step 516.
In step 520, data slices are transmitted to the slice servers that indicated their ability to receive and store slices. The number of slice servers that successfully received and stored their assigned data slices is checked in step 522, and if fewer than t slices are successfully stored, the transaction is rolled back in step 516. In step 524, a commit transaction is begun on all servers with successful writes. If the commit transaction fails, an error is logged in step 528. Otherwise, the write transaction was successful.
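The transactional write of steps 512 through 524 can be sketched as a prepare, store, and commit sequence gated on the threshold t. The in-memory SliceServer class and its method names are hypothetical and exist only for this example; commit failure handling and error logging (step 528) are omitted.

```python
from typing import List, Optional


class SliceServer:
    """Hypothetical in-memory slice server used only for this sketch."""

    def __init__(self, available: bool = True) -> None:
        self.available = available
        self.staged: Optional[bytes] = None
        self.committed: Optional[bytes] = None

    def prepare(self) -> bool:
        """Report whether this server can take part in the write transaction."""
        return self.available

    def store(self, data: bytes) -> bool:
        """Stage a slice; it is not visible until commit."""
        if not self.available:
            return False
        self.staged = data
        return True

    def commit(self) -> None:
        self.committed, self.staged = self.staged, None

    def rollback(self) -> None:
        self.staged = None


def transactional_write(servers: List[SliceServer], slices: List[bytes], t: int) -> bool:
    """Write slices[i] to servers[i]; succeed only if at least t slices are stored."""
    ready = [i for i, s in enumerate(servers) if s.prepare()]     # steps 512-514
    if len(ready) < t:
        for i in ready:
            servers[i].rollback()                                 # step 516
        return False
    stored = [i for i in ready if servers[i].store(slices[i])]    # steps 520-522
    if len(stored) < t:
        for i in ready:
            servers[i].rollback()                                 # step 516
        return False
    for i in stored:
        servers[i].commit()                                       # step 524
    return True
```

Setting t to the number of slices n reproduces the default behavior described above, while a smaller t allows a write to complete during a minor outage.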
The foregoing description of the disclosure has been presented for purposes of illustration and description, and is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. The description was selected to best explain the principles of the disclosure and practical application of these principles to enable others skilled in the art to best utilize the disclosure in various examples and various modifications as are suited to the particular use contemplated. It is intended that the scope of the disclosure not be limited by the specification, but be defined by the claims set forth below.
It is noted that terminologies as may be used herein such as bit stream, stream, signal sequence, etc. (or their equivalents) have been used interchangeably to describe digital information whose content corresponds to any of a number of desired types (e.g., data, video, speech, text, graphics, audio, etc. any of which may generally be referred to as ‘data’).
As may be used herein, the terms “substantially” and “approximately” provide an industry-accepted tolerance for their corresponding terms and/or relativity between items. For some industries, an industry-accepted tolerance is less than one percent and, for other industries, the industry-accepted tolerance is 10 percent or more. Other examples of industry-accepted tolerance range from less than one percent to fifty percent. Industry-accepted tolerances correspond to, but are not limited to, component values, integrated circuit process variations, temperature variations, rise and fall times, thermal noise, dimensions, signaling errors, dropped packets, temperatures, pressures, material compositions, and/or performance metrics. Within an industry, tolerance variances of accepted tolerances may be more or less than a percentage level (e.g., dimension tolerance of less than +/−1%). Some relativity between items may range from a difference of less than a percentage level to a few percent. Other relativity between items may range from a difference of a few percent to magnitude of differences.
As may also be used herein, the term(s) “configured to”, “operably coupled to”, “coupled to”, and/or “coupling” includes direct coupling between items and/or indirect coupling between items via an intervening item (e.g., an item includes, but is not limited to, a component, an element, a circuit, and/or a module) where, for an example of indirect coupling, the intervening item does not modify the information of a signal but may adjust its current level, voltage level, and/or power level. As may further be used herein, inferred coupling (i.e., where one element is coupled to another element by inference) includes direct and indirect coupling between two items in the same manner as “coupled to”.
As may even further be used herein, the term “configured to”, “operable to”, “coupled to”, or “operably coupled to” indicates that an item includes one or more of power connections, input(s), output(s), etc., to perform, when activated, one or more of its corresponding functions and may further include inferred coupling to one or more other items. As may still further be used herein, the term “associated with” includes direct and/or indirect coupling of separate items and/or one item being embedded within another item.
As may be used herein, the term “compares favorably”, indicates that a comparison between two or more items, signals, etc., provides a desired relationship. For example, when the desired relationship is that signal 1 has a greater magnitude than signal 2, a favorable comparison may be achieved when the magnitude of signal 1 is greater than that of signal 2 or when the magnitude of signal 2 is less than that of signal 1. As may be used herein, the term “compares unfavorably”, indicates that a comparison between two or more items, signals, etc., fails to provide the desired relationship.
As may be used herein, one or more claims may include, in a specific form of this generic form, the phrase “at least one of a, b, and c” or of this generic form “at least one of a, b, or c”, with more or fewer elements than “a”, “b”, and “c”. In either phrasing, the phrases are to be interpreted identically. In particular, “at least one of a, b, and c” is equivalent to “at least one of a, b, or c” and shall mean a, b, and/or c. As an example, it means: “a” only, “b” only, “c” only, “a” and “b”, “a” and “c”, “b” and “c”, and/or “a”, “b”, and “c”.
As may also be used herein, the terms “processing module”, “processing circuit”, “processor”, “processing circuitry”, and/or “processing unit” may be a single processing device or a plurality of processing devices. Such a processing device may be a microprocessor, micro-controller, digital signal processor, microcomputer, central processing unit, field programmable gate array, programmable logic device, state machine, logic circuitry, analog circuitry, digital circuitry, and/or any device that manipulates signals (analog and/or digital) based on hard coding of the circuitry and/or operational instructions. The processing module, module, processing circuit, processing circuitry, and/or processing unit may be, or further include, memory and/or an integrated memory element, which may be a single memory device, a plurality of memory devices, and/or embedded circuitry of another processing module, module, processing circuit, processing circuitry, and/or processing unit. Such a memory device may be a read-only memory, random access memory, volatile memory, non-volatile memory, static memory, dynamic memory, flash memory, cache memory, and/or any device that stores digital information. Note that if the processing module, module, processing circuit, processing circuitry, and/or processing unit includes more than one processing device, the processing devices may be centrally located (e.g., directly coupled together via a wired and/or wireless bus structure) or may be distributedly located (e.g., cloud computing via indirect coupling via a local area network and/or a wide area network). 
Further note that if the processing module, module, processing circuit, processing circuitry and/or processing unit implements one or more of its functions via a state machine, analog circuitry, digital circuitry, and/or logic circuitry, the memory and/or memory element storing the corresponding operational instructions may be embedded within, or external to, the circuitry comprising the state machine, analog circuitry, digital circuitry, and/or logic circuitry. Still further note that, the memory element may store, and the processing module, module, processing circuit, processing circuitry and/or processing unit executes, hard coded and/or operational instructions corresponding to at least some of the steps and/or functions illustrated in one or more of the Figures. Such a memory device or memory element can be included in an article of manufacture.
One or more embodiments have been described above with the aid of method steps illustrating the performance of specified functions and relationships thereof. The boundaries and sequence of these functional building blocks and method steps have been arbitrarily defined herein for convenience of description. Alternate boundaries and sequences can be defined so long as the specified functions and relationships are appropriately performed. Any such alternate boundaries or sequences are thus within the scope and spirit of the claims. Further, the boundaries of these functional building blocks have been arbitrarily defined for convenience of description. Alternate boundaries could be defined as long as the certain significant functions are appropriately performed. Similarly, flow diagram blocks may also have been arbitrarily defined herein to illustrate certain significant functionality.
To the extent used, the flow diagram block boundaries and sequence could have been defined otherwise and still perform the certain significant functionality. Such alternate definitions of both functional building blocks and flow diagram blocks and sequences are thus within the scope and spirit of the claims. One of average skill in the art will also recognize that the functional building blocks, and other illustrative blocks, modules and components herein, can be implemented as illustrated or by discrete components, application specific integrated circuits, processors executing appropriate software and the like or any combination thereof.
In addition, a flow diagram may include a “start” and/or “continue” indication. The “start” and “continue” indications reflect that the steps presented can optionally be incorporated in or otherwise used in conjunction with one or more other routines. In addition, a flow diagram may include an “end” and/or “continue” indication. The “end” and/or “continue” indications reflect that the steps presented can end as described and shown or optionally be incorporated in or otherwise used in conjunction with one or more other routines. In this context, “start” indicates the beginning of the first step presented and may be preceded by other activities not specifically shown. Further, the “continue” indication reflects that the steps presented may be performed multiple times and/or may be succeeded by other activities not specifically shown. Further, while a flow diagram indicates a particular ordering of steps, other orderings are likewise possible provided that the principles of causality are maintained.
The one or more embodiments are used herein to illustrate one or more aspects, one or more features, one or more concepts, and/or one or more examples. A physical embodiment of an apparatus, an article of manufacture, a machine, and/or of a process may include one or more of the aspects, features, concepts, examples, etc. described with reference to one or more of the embodiments discussed herein. Further, from figure to figure, the embodiments may incorporate the same or similarly named functions, steps, modules, etc. that may use the same or different reference numbers and, as such, the functions, steps, modules, etc. may be the same or similar functions, steps, modules, etc. or different ones.
Unless specifically stated to the contrary, signals to, from, and/or between elements in a figure of any of the figures presented herein may be analog or digital, continuous time or discrete time, and single-ended or differential. For instance, if a signal path is shown as a single-ended path, it also represents a differential signal path. Similarly, if a signal path is shown as a differential path, it also represents a single-ended signal path. While one or more particular architectures are described herein, other architectures can likewise be implemented that use one or more data buses not expressly shown, direct connectivity between elements, and/or indirect coupling between other elements as recognized by one of average skill in the art.
The term “module” is used in the description of one or more of the embodiments. A module implements one or more functions via a device such as a processor or other processing device or other hardware that may include or operate in association with a memory that stores operational instructions. A module may operate independently and/or in conjunction with software and/or firmware. As also used herein, a module may contain one or more sub-modules, each of which may be one or more modules.
As may further be used herein, a computer readable memory includes one or more memory elements. A memory element may be a separate memory device, multiple memory devices, or a set of memory locations within a memory device. Such a memory device may be a read-only memory, random access memory, volatile memory, non-volatile memory, static memory, dynamic memory, flash memory, cache memory, a quantum register or other quantum memory and/or any other device that stores data in a non-transitory manner. Furthermore, the memory device may be in a form of a solid-state memory, a hard drive memory or other disk storage, cloud memory, thumb drive, server memory, computing device memory, and/or other non-transitory medium for storing data. The storage of data includes temporary storage (i.e., data is lost when power is removed from the memory element) and/or persistent storage (i.e., data is retained when power is removed from the memory element). As used herein, a transitory medium shall mean one or more of: (a) a wired or wireless medium for the transportation of data as a signal from one computing device to another computing device for temporary storage or persistent storage; (b) a wired or wireless medium for the transportation of data as a signal within a computing device from one element of the computing device to another element of the computing device for temporary storage or persistent storage; (c) a wired or wireless medium for the transportation of data as a signal from one computing device to another computing device for processing the data by the other computing device; and (d) a wired or wireless medium for the transportation of data as a signal within a computing device from one element of the computing device to another element of the computing device for processing the data by the other element of the computing device. As may be used herein, a non-transitory computer readable memory is substantially equivalent to a computer readable memory. 
A non-transitory computer readable memory can also be referred to as a non-transitory computer readable storage medium.
One or more functions associated with the methods and/or processes described herein can be implemented via a processing module that operates via the non-human “artificial” intelligence (AI) of a machine. Examples of such AI include machines that operate via anomaly detection techniques, decision trees, association rules, expert systems and other knowledge-based systems, computer vision models, artificial neural networks, convolutional neural networks, support vector machines (SVMs), Bayesian networks, genetic algorithms, feature learning, sparse dictionary learning, preference learning, deep learning and other machine learning techniques that are trained using training data via unsupervised, semi-supervised, supervised and/or reinforcement learning, and/or other AI. The human mind is not equipped to perform such AI techniques, not only due to the complexity of these techniques, but also because artificial intelligence, by its very definition, requires “artificial” intelligence, i.e., machine/non-human intelligence.
One or more functions associated with the methods and/or processes described herein can be implemented as a large-scale system that is operable to receive, transmit and/or process data on a large-scale. As used herein, a large-scale refers to a large number of data, such as one or more kilobytes, megabytes, gigabytes, terabytes or more of data that are received, transmitted and/or processed. Such receiving, transmitting and/or processing of data cannot practically be performed by the human mind on a large-scale within a reasonable period of time, such as within a second, a millisecond, microsecond, a real-time basis or other high speed required by the machines that generate the data, receive the data, convey the data, store the data and/or use the data.
One or more functions associated with the methods and/or processes described herein can require data to be manipulated in different ways within overlapping time spans. The human mind is not equipped to perform such different data manipulations independently, contemporaneously, in parallel, and/or on a coordinated basis within a reasonable period of time, such as within a second, a millisecond, microsecond, a real-time basis or other high speed required by the machines that generate the data, receive the data, convey the data, store the data and/or use the data.
One or more functions associated with the methods and/or processes described herein can be implemented in a system that is operable to electronically receive digital data via a wired or wireless communication network and/or to electronically transmit digital data via a wired or wireless communication network. Such receiving and transmitting cannot practically be performed by the human mind because the human mind is not equipped to electronically transmit or receive digital data, let alone to transmit and receive digital data via a wired or wireless communication network.
One or more functions associated with the methods and/or processes described herein can be implemented in a system that is operable to electronically store digital data in a memory device. Such storage cannot practically be performed by the human mind because the human mind is not equipped to electronically store digital data.
One or more functions associated with the methods and/or processes described herein may operate to cause an action by a processing module directly in response to a triggering event—without any intervening human interaction between the triggering event and the action. Any such actions may be identified as being performed “automatically”, “automatically based on” and/or “automatically in response to” such a triggering event. Furthermore, any such actions identified in such a fashion specifically preclude the operation of human activity with respect to these actions—even if the triggering event itself may be causally connected to a human activity of some kind.
While particular combinations of various functions and features of the one or more embodiments have been expressly described herein, other combinations of these features and functions are likewise possible. The present disclosure is not limited by the particular examples disclosed herein and expressly incorporates these other combinations.
The present U.S. Utility Patent Application claims priority pursuant to 35 U.S.C. § 120 as a continuation-in-part of U.S. utility application Ser. No. 16/149,667, entitled “UTILIZING CONCENTRIC STORAGE POOLS IN A DISPERSED STORAGE NETWORK,” filed Oct. 2, 2018, which is a continuation-in-part of U.S. utility application Ser. No. 15/819,810, entitled “STORAGE VAULT TIERING AND DATA MIGRATION IN A DISTRIBUTED STORAGE NETWORK,” filed Nov. 21, 2017, which is a continuation-in-part of U.S. utility application Ser. No. 13/869,655, entitled “UPDATING ACCESS CONTROL INFORMATION WITHIN A DISPERSED STORAGE UNIT,” filed Apr. 24, 2013, issued as U.S. Pat. No. 10,178,083 on Jan. 8, 2019, which claims priority pursuant to 35 U.S.C. § 119(e) to U.S. Provisional Application No. 61/655,736, entitled “STORING DATA IN A LAYERED DISTRIBUTED STORAGE AND TASK NETWORK”, filed Jun. 5, 2012, all of which are hereby incorporated herein by reference in their entirety and made part of the present U.S. Utility Patent Application for all purposes.
The present U.S. Utility Patent Application also claims priority pursuant to 35 U.S.C. § 120 as a continuation-in-part of U.S. utility application Ser. No. 16/988,135, entitled “Validating Requests Based On Stored Vault Information”, filed Aug. 7, 2020, which is a continuation of U.S. utility application Ser. No. 16/390,530, entitled “Digest Listing Decomposition”, filed Apr. 22, 2019, issued as U.S. Pat. No. 11,194,662 on Dec. 7, 2021, which is a continuation of U.S. utility application Ser. No. 14/447,890, entitled “Digest Listing Decomposition”, filed Jul. 31, 2014, issued as U.S. Pat. No. 10,360,180 on Jul. 23, 2019, which is a continuation of U.S. utility application Ser. No. 13/154,725, entitled “Metadata Access In A Dispersed Storage Network”, filed Jun. 7, 2011, issued as U.S. Pat. No. 10,289,688 on May 14, 2019, which claims priority pursuant to 35 U.S.C. § 119(e) to U.S. Provisional Application No. 61/357,430, entitled “Dispersal Method In A Dispersed Storage System”, filed Jun. 22, 2010, all of which are hereby incorporated herein by reference in their entirety and made part of the present U.S. Utility Patent Application for all purposes.
U.S. utility application Ser. No. 14/447,890 also claims priority pursuant to 35 U.S.C. § 120 as a continuation-in-part of U.S. utility application Ser. No. 12/749,592, entitled “Dispersed Storage Processing Unit And Methods With Data Aggregation For Use In A Dispersed Storage System”, filed Mar. 30, 2010, issued as U.S. Pat. No. 8,938,591 on Jan. 20, 2015, which claims priority pursuant to 35 U.S.C. § 119(e) to U.S. Provisional Application No. 61/237,624, entitled “Dispersed Storage Unit And Methods With Metadata Separation For Use In A Dispersed Storage System”, filed Aug. 27, 2009, all of which are hereby incorporated herein by reference in their entirety and made part of the present U.S. Utility Patent Application for all purposes.
U.S. utility application Ser. No. 12/749,592 also claims priority pursuant to 35 U.S.C. § 120 as a continuation-in-part of U.S. utility application Ser. No. 12/218,594, entitled “Streaming Media Software Interface To A Dispersed Data Storage Network”, filed Jul. 16, 2008, issued as U.S. Pat. No. 7,962,641 on Jun. 14, 2011, which claims priority pursuant to 35 U.S.C. § 120 as a continuation-in-part of:
1. U.S. utility application Ser. No. 11/973,613, entitled “Block Based Access To A Dispersed Data Storage Network”, filed Oct. 9, 2007, issued as U.S. Pat. No. 8,285,878 on Oct. 9, 2012;
2. U.S. utility application Ser. No. 11/973,622, entitled “Smart Access To A Dispersed Data Storage Network”, filed Oct. 9, 2007, issued as U.S. Pat. No. 8,171,101 on May 1, 2012;
3. U.S. utility application Ser. No. 11/973,542, entitled “Ensuring Data Integrity On A Dispersed Storage Network”, filed Oct. 9, 2007, issued as U.S. Pat. No. 9,996,413 on Jun. 12, 2018;
4. U.S. utility application Ser. No. 11/973,621, entitled “Virtualized Storage Vaults On A Dispersed Data Storage Network”, filed Oct. 9, 2007, issued as U.S. Pat. No. 7,904,475 on Mar. 8, 2011;
5. U.S. utility application Ser. No. 11/241,555, entitled “System, Methods, And Apparatus For Subdividing Data For Storage In A Dispersed Data Storage Grid”, filed Sep. 30, 2005, issued as U.S. Pat. No. 7,953,937 on May 31, 2011;
6. U.S. utility application Ser. No. 11/403,684, entitled “Billing System For Information Dispersal System”, filed Apr. 13, 2006, issued as U.S. Pat. No. 7,574,570 on Aug. 11, 2009;
7. U.S. utility application Ser. No. 11/404,071, entitled “Metadata Management System For An Information Dispersed Storage System”, filed Apr. 13, 2006, issued as U.S. Pat. No. 7,574,579 on Aug. 11, 2009;
8. U.S. utility application Ser. No. 11/403,391, entitled “System For Rebuilding Dispersed Data”, filed Apr. 13, 2006, issued as U.S. Pat. No. 7,546,427 on Jun. 9, 2009;
9. U.S. utility application Ser. No. 12/080,042, entitled “Rebuilding Data On A Dispersed Storage Network”, filed Mar. 31, 2008, issued as U.S. Pat. No. 8,880,799 on Nov. 4, 2014; and
10. U.S. utility application Ser. No. 12/218,200, entitled “File System Adapted For Use With A Dispersed Data Storage Network”, filed Jul. 14, 2008, issued as U.S. Pat. No. 8,209,363 on Jun. 26, 2012.
All of the above are hereby incorporated herein by reference in their entirety and made part of the present U.S. Utility Patent Application for all purposes.
In addition, U.S. utility application Ser. No. 16/988,135 is related to the following U.S. patent applications that are commonly owned:
1. “Dispersed Storage Unit And Methods With Metadata Separation For Use In A Dispersed Storage System”, application Ser. No. 12/749,583, filed on Mar. 30, 2010, issued as U.S. Pat. No. 9,235,350 on Jan. 12, 2016.
2. “Dispersed Storage Processing Unit And Methods With Operating System Diversity For Use In A Dispersed Storage System”, application Ser. No. 12/749,606, filed on Mar. 30, 2010, issued as U.S. Pat. No. 9,690,513 on Jun. 27, 2017.
3. “Dispersed Storage Processing Unit And Methods With Geographical Diversity For Use In A Dispersed Storage System”, application Ser. No. 12/749,625, filed on Mar. 30, 2010, issued as U.S. Pat. No. 9,772,791 on Sep. 26, 2017.
All of the above are hereby incorporated herein by reference in their entirety and made part of the present U.S. Utility Patent Application for all purposes.
Number | Name | Date | Kind |
---|---|---|---|
4092732 | Ouchi | May 1978 | A |
5454101 | Mackay | Sep 1995 | A |
5485474 | Rabin | Jan 1996 | A |
5584008 | Shimada | Dec 1996 | A |
5774643 | Lubbers | Jun 1998 | A |
5802364 | Senator | Sep 1998 | A |
5809285 | Hilland | Sep 1998 | A |
5890156 | Rekieta | Mar 1999 | A |
5987622 | Lo Verso | Nov 1999 | A |
5991414 | Garay | Nov 1999 | A |
6012159 | Fischer | Jan 2000 | A |
6058454 | Gerlach | May 2000 | A |
6128277 | Bruck | Oct 2000 | A |
6175571 | Haddock | Jan 2001 | B1 |
6192472 | Garay | Feb 2001 | B1 |
6256688 | Suetaka | Jul 2001 | B1 |
6272658 | Steele | Aug 2001 | B1 |
6301604 | Nojima | Oct 2001 | B1 |
6356949 | Katsandres | Mar 2002 | B1 |
6366995 | Vilkov | Apr 2002 | B1 |
6374336 | Peters | Apr 2002 | B1 |
6415373 | Peters | Jul 2002 | B1 |
6418539 | Walker | Jul 2002 | B1 |
6449688 | Peters | Sep 2002 | B1 |
6567948 | Steele | May 2003 | B2 |
6571282 | Bowman-Amuah | May 2003 | B1 |
6609223 | Wolfgang | Aug 2003 | B1 |
6718361 | Basani | Apr 2004 | B1 |
6760808 | Peters | Jul 2004 | B2 |
6785768 | Peters | Aug 2004 | B2 |
6785783 | Buckland | Aug 2004 | B2 |
6826711 | Moulton | Nov 2004 | B2 |
6836432 | Parker | Dec 2004 | B1 |
6879596 | Dooply | Apr 2005 | B1 |
6898667 | Umberger | May 2005 | B2 |
6978366 | Ignatchenko | Dec 2005 | B1 |
7000143 | Moulton | Feb 2006 | B2 |
7003688 | Pittelkow | Feb 2006 | B1 |
7024451 | Jorgenson | Apr 2006 | B2 |
7024609 | Wolfgang | Apr 2006 | B2 |
7080101 | Watson | Jul 2006 | B1 |
7103824 | Halford | Sep 2006 | B2 |
7103915 | Redlich | Sep 2006 | B2 |
7111115 | Peters | Sep 2006 | B2 |
7140044 | Redlich | Nov 2006 | B2 |
7146644 | Redlich | Dec 2006 | B2 |
7171493 | Shu | Jan 2007 | B2 |
7222133 | Raipurkar | May 2007 | B1 |
7225263 | Clymer | May 2007 | B1 |
7240236 | Cutts | Jul 2007 | B2 |
7272613 | Sim | Sep 2007 | B2 |
7418649 | Li | Aug 2008 | B2 |
7457835 | Toebes | Nov 2008 | B2 |
7533133 | Lanzatella | May 2009 | B1 |
7574570 | Gladwin et al. | Aug 2009 | B2 |
7581156 | Manasse | Aug 2009 | B2 |
7607063 | Kikuchi | Oct 2009 | B2 |
7636724 | de la Torre | Dec 2009 | B2 |
7680822 | Vyas | Mar 2010 | B1 |
7743275 | Tormasov | Jun 2010 | B1 |
7831793 | Chakravarty | Nov 2010 | B2 |
7865673 | Moore | Jan 2011 | B2 |
7904475 | Gladwin | Mar 2011 | B2 |
7925666 | Johnson | Apr 2011 | B1 |
7945639 | Gavrilov | May 2011 | B2 |
7962641 | Dhuse | Jun 2011 | B1 |
8051362 | Li | Nov 2011 | B2 |
8145818 | Murayama | Mar 2012 | B2 |
8171101 | Gladwin | May 2012 | B2 |
8209363 | Palthepu | Jun 2012 | B2 |
8214590 | Ulrich | Jul 2012 | B2 |
8281181 | Resch | Oct 2012 | B2 |
8281404 | Frey | Oct 2012 | B2 |
8285878 | Gladwin | Oct 2012 | B2 |
8335904 | Kitchen | Dec 2012 | B1 |
8386840 | Stougie | Feb 2013 | B2 |
8406421 | Kaymen | Mar 2013 | B2 |
8429514 | Goel | Apr 2013 | B1 |
8433849 | De Schrijver | Apr 2013 | B2 |
8464133 | Grube | Jun 2013 | B2 |
8620879 | Cairns | Dec 2013 | B2 |
8694467 | Sun | Apr 2014 | B2 |
8713405 | Healey | Apr 2014 | B2 |
8856530 | Patti | Oct 2014 | B2 |
8862837 | Marshak | Oct 2014 | B1 |
8868508 | Drobychev | Oct 2014 | B2 |
8880799 | Foster | Nov 2014 | B2 |
8914632 | Shankar | Dec 2014 | B1 |
8918478 | Ozzie | Dec 2014 | B2 |
8935493 | Dolan | Jan 2015 | B1 |
8938591 | Mark | Jan 2015 | B2 |
8972694 | Dolan | Mar 2015 | B1 |
9098519 | Pavlov | Aug 2015 | B2 |
9235350 | Mark | Jan 2016 | B2 |
9305069 | Zunger | Apr 2016 | B2 |
9332422 | Bai | May 2016 | B2 |
9372809 | Testardi | Jun 2016 | B2 |
9792295 | Rus | Oct 2017 | B1 |
9811262 | Rus | Nov 2017 | B1 |
20020062422 | Butterworth | May 2002 | A1 |
20020166079 | Ulrich | Nov 2002 | A1 |
20030018927 | Gadir | Jan 2003 | A1 |
20030037261 | Meffert | Feb 2003 | A1 |
20030065617 | Watkins | Apr 2003 | A1 |
20030065656 | de la Torre | Apr 2003 | A1 |
20030084020 | Shu | May 2003 | A1 |
20040024963 | Talagala | Feb 2004 | A1 |
20040122917 | Menon | Jun 2004 | A1 |
20040215998 | Buxton | Oct 2004 | A1 |
20040228493 | Ma | Nov 2004 | A1 |
20050055603 | Soran | Mar 2005 | A1 |
20050100022 | Ramprashad | May 2005 | A1 |
20050114594 | Corbett | May 2005 | A1 |
20050125593 | Karpoff | Jun 2005 | A1 |
20050131993 | Fatula, Jr. | Jun 2005 | A1 |
20050132070 | Redlich | Jun 2005 | A1 |
20050144382 | Schmisseur | Jun 2005 | A1 |
20050160329 | Briggs | Jul 2005 | A1 |
20050210270 | Rohatgi | Sep 2005 | A1 |
20050229069 | Hassner | Oct 2005 | A1 |
20060041719 | Chui | Feb 2006 | A1 |
20060047907 | Shiga | Mar 2006 | A1 |
20060136448 | Cialini | Jun 2006 | A1 |
20060156059 | Kitamura | Jul 2006 | A1 |
20060224603 | Correll, Jr. | Oct 2006 | A1 |
20070030734 | Sinclair | Feb 2007 | A1 |
20070079081 | Gladwin | Apr 2007 | A1 |
20070079082 | Gladwin | Apr 2007 | A1 |
20070079083 | Gladwin | Apr 2007 | A1 |
20070088970 | Buxton | Apr 2007 | A1 |
20070113032 | Kameyama | May 2007 | A1 |
20070174192 | Gladwin | Jul 2007 | A1 |
20070214285 | Au | Sep 2007 | A1 |
20070234110 | Soran | Oct 2007 | A1 |
20070283167 | Venters, III | Dec 2007 | A1 |
20080235234 | Beedubail | Sep 2008 | A1 |
20090037500 | Kirshenbaum | Feb 2009 | A1 |
20090094251 | Gladwin | Apr 2009 | A1 |
20090094318 | Gladwin | Apr 2009 | A1 |
20100023524 | Gladwin | Jan 2010 | A1 |
20100088464 | Yang | Apr 2010 | A1 |
20100138604 | Noguchi | Jun 2010 | A1 |
20100218037 | Swartz | Aug 2010 | A1 |
20100268692 | Resch | Oct 2010 | A1 |
20100299313 | Orsini | Nov 2010 | A1 |
20110029840 | Ozzie | Feb 2011 | A1 |
20110087948 | Murakami | Apr 2011 | A1 |
20110126060 | Grube | May 2011 | A1 |
20110225202 | Man | Sep 2011 | A1 |
20110289122 | Grube | Nov 2011 | A1 |
20120060072 | Simitci | Mar 2012 | A1 |
20120131683 | Nassar | May 2012 | A1 |
20130246470 | Price | Sep 2013 | A1 |
20150074216 | Park | Mar 2015 | A1 |
20150355979 | Volvovski | Dec 2015 | A1 |
Entry |
---|
Chung; An Automatic Data Segmentation Method for 3D Measured Data Points; National Taiwan University; pp. 1-8; 1998. |
Harrison; Lightweight Directory Access Protocol (LDAP): Authentication Methods and Security Mechanisms; IETF Network Working Group; RFC 4513; Jun. 2006; pp. 1-32. |
Kubiatowicz, et al.; OceanStore: An Architecture for Global-Scale Persistent Storage; Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2000); Nov. 2000; pp. 1-12. |
Legg; Lightweight Directory Access Protocol (LDAP): Syntaxes and Matching Rules; IETF Network Working Group; RFC 4517; Jun. 2006; pp. 1-50. |
Plank; T1: Erasure Codes for Storage Applications; FAST 2005, 4th USENIX Conference on File and Storage Technologies; Dec. 13-16, 2005; pp. 1-74. |
Rabin; Efficient Dispersal of Information for Security, Load Balancing, and Fault Tolerance; Journal of the Association for Computing Machinery; vol. 36, No. 2; Apr. 1989; pp. 335-348. |
Satran, et al.; Internet Small Computer Systems Interface (iSCSI); IETF Network Working Group; RFC 3720; Apr. 2004; pp. 1-257. |
Sciberras; Lightweight Directory Access Protocol (LDAP): Schema for User Applications; IETF Network Working Group; RFC 4519; Jun. 2006; pp. 1-33. |
Sermersheim; Lightweight Directory Access Protocol (LDAP): The Protocol; IETF Network Working Group; RFC 4511; Jun. 2006; pp. 1-68. |
Shamir; How to Share a Secret; Communications of the ACM; vol. 22, No. 11; Nov. 1979; pp. 612-613. |
Smith; Lightweight Directory Access Protocol (LDAP): Uniform Resource Locator; IETF Network Working Group; RFC 4516; Jun. 2006; pp. 1-15. |
Smith; Lightweight Directory Access Protocol (LDAP): String Representation of Search Filters; IETF Network Working Group; RFC 4515; Jun. 2006; pp. 1-12. |
Wildi; Java iSCSI Initiator; Master Thesis; Department of Computer and Information Science, University of Konstanz; Feb. 2007; 60 pgs. |
Xin, et al.; Evaluation of Distributed Recovery in Large-Scale Storage Systems; 13th IEEE International Symposium on High Performance Distributed Computing; Jun. 2004; pp. 172-181. |
Zeilenga; Lightweight Directory Access Protocol (LDAP): Directory Information Models; IETF Network Working Group; RFC 4512; Jun. 2006; pp. 1-49. |
Zeilenga; Lightweight Directory Access Protocol (LDAP): Internationalized String Preparation; IETF Network Working Group; RFC 4518; Jun. 2006; pp. 1-14. |
Zeilenga; Lightweight Directory Access Protocol (LDAP): String Representation of Distinguished Names; IETF Network Working Group; RFC 4514; Jun. 2006; pp. 1-15. |
Zeilenga; Lightweight Directory Access Protocol (LDAP): Technical Specification Road Map; IETF Network Working Group; RFC 4510; Jun. 2006; pp. 1-8. |
Number | Date | Country | |
---|---|---|---|
20220114053 A1 | Apr 2022 | US |
Number | Date | Country | |
---|---|---|---|
61655736 | Jun 2012 | US | |
61357430 | Jun 2010 | US | |
61237624 | Aug 2009 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 16390530 | Apr 2019 | US |
Child | 16988135 | | US |
Parent | 14447890 | Jul 2014 | US |
Child | 16390530 | | US |
Parent | 13154725 | Jun 2011 | US |
Child | 14447890 | | US |
Parent | 12749592 | Mar 2010 | US |
Child | 14447890 | | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 16988135 | Aug 2020 | US |
Child | 17645563 | | US |
Parent | 16149667 | Oct 2018 | US |
Child | 17645563 | | US |
Parent | 15819810 | Nov 2017 | US |
Child | 16149667 | | US |
Parent | 13869655 | Apr 2013 | US |
Child | 15819810 | | US |
Parent | 12218594 | Jul 2008 | US |
Child | 13154725 | | US |
Parent | 12218200 | Jul 2008 | US |
Child | 12218594 | | US |
Parent | 12080042 | Mar 2008 | US |
Child | 12218594 | | US |
Parent | 11973613 | Oct 2007 | US |
Child | 12218594 | | US |
Parent | 11973621 | Oct 2007 | US |
Child | 12218594 | | US |
Parent | 11973622 | Oct 2007 | US |
Child | 12218594 | | US |
Parent | 11973542 | Oct 2007 | US |
Child | 12218594 | | US |
Parent | 11403684 | Apr 2006 | US |
Child | 12218594 | | US |
Parent | 11403391 | Apr 2006 | US |
Child | 12218594 | | US |
Parent | 11404071 | Apr 2006 | US |
Child | 12218594 | | US |
Parent | 11241555 | Sep 2005 | US |
Child | 12218594 | | US |