Data matching using flow based packet data storage

Information

  • Patent Grant
  • 8755381
  • Patent Number
    8,755,381
  • Date Filed
    Wednesday, August 2, 2006
    18 years ago
  • Date Issued
    Tuesday, June 17, 2014
    10 years ago
Abstract
A system for processing packets includes a communications interface and a processor. A communications interface receives a packet between a source and a destination. The processor identifies a flow between the source and the destination based on the packet. The processor determines whether some of packet data of the packet matches to storage data in storage using hashes. The processor then stores the packet data in a block of memory in the storage based on the flow and if the packet data does not match the storage data.
Description
BACKGROUND

1. Technical Field


The present invention relates generally to data storage and communications and more particularly to data matching using flow based packet data storage.


2. Description of Related Art


For transmission over communications networks, data has been organized into packets, frames, and cells. Bundling data into these packets, frames, and cells assists routers and switches in the communication network to handle a manageable amount of data. Large amounts of data such as files may be separated into hundreds of packets. Thus, messages such as data requests or data responses for these large amounts of data may also span over hundreds of packets. When packets get misordered or retransmitted, reassembling the packets into the messages can be difficult and time consuming.


In some situations, incoming data needs to be matched with data stored in memory. In one example, in order to prevent redundancy and reduce communication bandwidth, a request for data is matched with data stored in local memory and retrieved from the memory without transmitting the request for data to retrieve the data over the communications network. However, matching the data may be problematic when the packets for related data are stored in different locations in the memory.


A Transmission Control Protocol (TCP) stack does reassemble and sort out packets. The TCP stack does handle packets destined to a socket and then sequentially stores the packets from the socket in pages of memory. The TCP stack also strips the header of the packet and transfers the payload data of the packet to the applications in the application layer. Ethernet switches and IP routers also perform flow based queuing, where packets for a flow are stored in queues. The packets for a flow may be organized on the same page of memory linked lists.


These protocols, switches, and routers may have stored packets in memory based on a flow to solve different problems such as buffering and queuing. However, none of the prior art systems have stored the packets in memory based on flow to match data.


SUMMARY OF THE INVENTION

The invention addresses the above problems by providing data matching by using flow based packet data storage. A system for processing packets includes a communications interface and a processor. The communications interface receives a packet between a source and a destination. The processor identifies a flow between the source and the destination based on the packet. The processor determines whether some of packet data of the packet matches to storage data in storage using hashes. If the packet data does not match the storage data, the processor then stores the packet data in a block of memory in the storage based on the flow.


The storage may comprise short term memory and long term memory. The flow may comprise a session between a source and a destination. The block of memory may comprise a page of the contiguous memory. In some embodiments, the processor computes the hashes for the flow. The processor may also determine the hashes for the flow and store the hashes for the flow. The hashes can be rolling.


A method for processing packets includes the steps of receiving a packet between a source and a destination and identifying a flow between the source and the destination based on the packet. The method also includes the step of determining whether packet data of the packet matches to storage data in storage using hashes. The method also includes storing the packet data in a block of memory in the storage based on the flow and if the packet data does not match the storage data.


The system advantageously provides an organization of data in memory by flow for easier matching of data through hashes and retrieval of that data. By storing the packets for a flow in the same block of memory, accessing the data for the flow is easier than having different flows interleaved with each other in memory. Also, data can be matched to data in the flow regardless of how data may have been separated into packets. Thus, any issues with packet boundaries for data are eliminated.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is an illustration of a communications system in an exemplary implementation of the invention.



FIG. 2 is an illustration of communications in various forms in an exemplary implementation of the invention.



FIG. 3 is an illustration of messages, packets, and a block of memory that contain data in an exemplary implementation of the invention.



FIG. 4 is a flow chart for the network device that hashes data in an exemplary implementation of the invention.



FIG. 5 is a flow chart for the network device that matches data in an exemplary implementation of the invention.



FIG. 6 is a flow chart for the network device that writes data in an exemplary implementation of the invention.



FIG. 7 is a flow chart for the network device that retrieves data in an exemplary implementation of the invention.



FIG. 8 is a block diagram of the network device in an exemplary implementation of the invention.





DETAILED DESCRIPTION OF THE INVENTION

The embodiments discussed herein are illustrative of one example of the present invention. As these embodiments of the present invention are described with reference to illustrations, various modifications or adaptations of the methods and/or specific structures described may become apparent to those skilled in the art. All such modifications, adaptations, or variations that rely upon the teachings of the present invention, and through which these teachings have advanced the art, are considered to be within the scope of the present invention. Hence, these descriptions and drawings should not be considered in a limiting sense, as it is understood that the present invention is in no way limited to only the embodiments illustrated.


A system for processing packets includes a communications interface and a processor. A communications interface receives a packet between a source and a destination. The processor identifies a flow between the source and the destination based on the packet. The processor determines whether some of packet data of the packet matches to storage data in storage using hashes. The processor then stores the packet data in a block of memory in the storage based on the flow and if the packet data does not match the storage data.


The system advantageously provides an organization of data in memory by flow for easier matching of data through hashes and retrieval of that data. By storing the packets for a flow in the same block of memory, accessing the data for the flow is easier than having different flows interleaved with each other in memory. Also, data can be matched to data in the flow regardless of how data may have been separated into packets. Thus, any issues with packet boundaries for data are eliminated. The system can handle different types of data requests and responses and still match data within the packets.


The system minimizes delay by storing packets in a flow on a packet-by-packet basis. Thus, higher throughput is realized because the system does not have to wait for the next packet in the flow. As soon as packets are received, the system determines where to store the packet and whether to store the packet.



FIG. 1 depicts an illustration of a communications system 100 in an exemplary implementation of the invention. The communications system 100 includes a source 110, a network device 120, a communication network 130, a network device 140, and a destination 150. The source 110 is coupled to the network device 120, which includes a memory 122. The network device 120 is coupled to the communication network 130. The communication network 130 is coupled to the network device 140, which includes a memory 142. The network device 140 is coupled to the destination 150.


The embodiments in FIGS. 1-8 depict an example of packets being transmitted from the source 110 to the destination 150 through the network device 120, the communication network 130, and the network device 140. Other embodiments may include packets being transmitted from the destination 150 to the source 110. The network device 120 and the network device 140 may generate their own packets and be the source 110 and the destination 150. While there are numerous variations in where the packet is generated and transmitted to, the figures below describe one example of packet transmissions from the source 110 to the destination 150 for the sake of simplicity.


The network device 120 and the network device 140 are any device or system configured to process packets and exchange packets over the communication network 130. The configuration of the network device 120 is described in further detail below in FIG. 8. One example of the network device 120 is an appliance in a network memory architecture, which is described in U.S. patent application Ser. No. 11/202,697 entitled “Network Memory Architecture for Providing Data Based on Local Accessibility” filed on Aug. 12, 2005, which is hereby incorporated by reference.



FIG. 2 depicts an illustration of communications in various forms in an exemplary implementation of the invention. The communications 210 include messages 212, messages 214, and messages 216. Some example of the messages 212, messages 214, and messages 216 are data requests and responses.


The communications 220 organized by flow include the flow A 222, flow B 224, and flow C 226. A flow is a communications session between two devices over a communications network. One example of a flow is an Internet Protocol (IP) flow. In one example, the flow is uniquely identified by the source IP address, the destination IP address, the IP protocol, the source port, and the destination port. The packets for a flow are advantageously stored in a same block of memory for easy retrieval and matching.


The memory 122 includes memory blocks 240 and flow state information 250. The memory 122 may comprise Random Access Memory (RAM), disk drives, hard disks, optical disks, and any of a combination of these elements. The memory 122 may comprise two levels of memory. One level is a more temporary or short term memory such as RAM, and the other level is a more permanent or long term memory such as disk. Typically, the temporary memory has faster read and write access than the slower read and write access of permanent memory.


The memory blocks 240 include block 1242, block 2244, and block 3246. The memory blocks 240 are blocks of contiguous memory. A block of memory is a contiguous section or portion of memory.



FIG. 2 depicts how flow A 222 is stored into block 1242 of the memory blocks 240. Also, flow B 224 is stored into block 2244 of the memory blocks 240, and flow C 226 is stored into block 3246 of the memory blocks 240. FIG. 2 shows how communications 210 are organized into flows, and how the flows are organized into blocks of memory. The figures below describe how the packets in the flows are stored into a specific block of memory blocks 240 based on flow.


The flow state information 250 is any data or information that relate to the characteristics, status, or state of the flow.



FIG. 3 depicts an illustration of messages, packets, and a block of memory that contain data in an exemplary implementation of the invention. A message A 310 contains data 312. The message A 310 is then separated into packets for communication. The message A 310 is delineated into packet A 320 and packet B 325. Only two packets for message A 310 are shown for the sake of simplicity. However, messages may be separated into numerous packets. The packet A 320 comprises a payload 321 and the header 323. The packet B 325 comprises a payload 326 and the header 328. The data 312 in message A 310 is separated into data 322 in the payload 321 of packet A 320 and data 327 in the payload 326 of packet B 325. As discussed above in FIG. 2, the data 322 and the data 327 are stored in a memory block 330 as data 332 without the headers 323 and 328. The data 332 is stored in the memory block 330 without any packet delineations so data with different packet boundaries can be matched to the data in the memory block 330.


Message B 340 includes data 342. The message B 340 also is then separated into packets for communication. The message B 340 is delineated into packet C 350 and packet D 355. The packet C 350 comprises a payload 351 and the header 353. The packet D 355 comprises a payload 356 and the header 358. The data 342 in message B 340 is separated into data 352 in the payload 351 of packet C 350 and data 357 in the payload 356 of packet D 355. Using hashes and storing states of hashes across packet boundaries as discussed in further detail below in FIGS. 4-6, the data 352 and the data 357 can then be compared with the data 332 for matching. Even though the data 312 is separated into different packet boundaries than data 342, the data 312 can still be matched with data 342 if identical.



FIG. 4 depicts a flow chart for the network device 120 that hashes data in an exemplary implementation of the invention. FIG. 4 begins in step 400. In step 402, the network device 120 receives the next packet. In step 404, the network device 120 processes the packet to identify the flow and the start of the payload. In step 406, the network device 120 restores the state information for the flow including information from the previous packet in the flow. The information from the previous packet in the flow may be hashes, partial hash states, or actual data from the previous packet.


In step 408, the network device 120 determines whether there is unhashed data remaining. If there is no unhashed data remaining, the process ends in step 410. If there is unhashed data remaining, the network device 120 computes byte-wise rolling hashes over packets in the same flow in step 412. A hash is a scheme for providing rapid access to data items which are distinguished by some key. These hashes provide a type of “fingerprinting” over the data to make matching the data easier. One example of a rolling hash is a checksum where the oldest byte in the window is subtracted and new bytes are added. The rolling hash may be over at least one packet. Thus, the rolling hash can run over packet boundaries. When the hash rolls over the end of a packet, the hash state or the remaining portion of the packet (e.g. the last 100 bytes of a packet) that has not been hashed can be saved and retrieved when the next sequential packet for the flow has been received. This hash or the remaining portion of the packet can be saved per flow and can be saved with the state information for the flow. The network device 120 may calculate the hash to the end of the packet.


In one example, cyclic redundancy check (CRC) is used for hashing, where one byte is added and one byte is taken off. CRC provides a stronger fingerprint (e.g. more collision resistant) than checksums. In another example, the hashes are based on the following equation:

new hash=old hash×2×P+new data×Q;

    • where P and Q are constants.


In step 414, the network device 120 checks whether the filter criteria have been passed. The network device 120 may use fine and coarse hashes as filters. Fine and coarse hashes are described in further detail in U.S. patent application Ser. No. 11/202,697 entitled “Network Memory Architecture for Providing Data Based on Local Accessibility” filed on Aug. 12, 2005. If the filter criteria have not been passed, the process returns to step 408. If the filter criteria have been passed, the network device 120 adds the index to the list of possible matches in step 416 and returns to step 408.



FIG. 5 depicts a flow chart for the network device 120 that matches data in an exemplary implementation of the invention. FIG. 5 begins in step 500. In step 502, the network device 120 reads the next entry from the list of possible matches. In step 504, the network device 120 uses the computed hash index to index a hash table. In step 506, the network device 120 checks whether the entry of a possible match is one of the two most likely matches. In determining the most likely match, the network device 120 may check a block of memory that some of the matches are pointing to. In step 508, the network device 120 checks whether the next entry is the end of the list of possible matches. If the next entry is not the end of the list of possible matches, the process returns to step 502.


If the next entry is at the end of the list of possible matches, the network device 120 verifies the match sizes of the most likely and the second most likely matches in step 510. The network device 120 then determines whether both match sizes are small in step 512. The network device 120 advantageously can match data using hashes regardless of how the data was delineated into different packet boundaries.


If both match sizes are small, the network device 120 stores the data in blocks of memory based on the flow in step 514. Thus, as packets are received, the network device 120 sorts and stores the packets by flow, which provides an organization of data by flow making matching the data by hashes easier. For example, writing and retrieving the data for a flow in one portion of memory is easier and faster than retrieving data for a flow that is scattered in various locations in the memory. In some embodiments that have two levels of memory such as RAM and disk, moving data in one portion of disk to RAM can especially save access time to the data. Step 514 is described in more detail below in FIG. 6.


The network device 120 also transmits a store instruction with the data in step 516. The store instruction causes the network device 140 to store the data in the memory 142 of the network device 140. In step 518, the network device 120 then updates the hash table before proceeding to step 522.


If at least one match size is not small, the network device 120 transmits a retrieve instruction with the index of the data for the match with the larger match size in step 520 before proceeding to step 522. The retrieve instruction causes the network device 140 to retrieve the data from its own memory 142. Thus, the data does not have to be transmitted over the communication network 130 but still can be accessed by the network device 140. The network device 120 saves the state information for the flow in step 522. FIG. 5 ends in step 524.


Once data is matched, the network device 120 may match subsequent data by requesting the next packet for the flow. The network device 120 can then iteratively (i.e., going on a “run”) match data and request the next packet for the flow. For example, the network device 120 can match data in a big file until the end of the file. Since the data is stored sequentially by flow in the memory, the network device 120 can just read the next portion of data in the memory. Thus, the data can be coalesced or reconstituted and the one instruction such as a retrieve instruction or store instruction per one packet may be avoided. Store and retrieve instructions are described in further detail in U.S. patent application Ser. No. 11/202,697 entitled “Network Memory Architecture for Providing Data Based on Local Accessibility” filed on Aug. 12, 2005.



FIG. 6 depicts a flow chart for the network device that writes data in an exemplary implementation of the invention. FIG. 6 depicts a process for storing the data in memory as in step 512 of FIG. 5. FIG. 6 begins in step 600. In step 602, the network device 120 identifies the block of memory for the flow. In one example, the network device 120 checks a table of flows and locations of blocks of memory associated with the flow. In step 604, the network device 120 checks whether the block of memory is identified for the flow. If the block has been identified, the network device 120 proceeds to step 608. If the block has not been identified, the network device 120 allocates a new block of memory for the flow or allocates a block of memory from a list of partially filled blocks of memory in step 606. Initially, one flow can be assigned per block of memory. Once the number of flows exceeds the number of blocks of memory, two flows can be assigned per block of memory. Thus, multiple flows can be assigned per block of memory. In some embodiments, the partially filled blocks are reallocated on a Least Recently Used (LRU) basis.


In step 608, the network device 120 writes the packet data to the block of memory excluding the packet header. In step 610, the network device 120 updates the hash table. In step 612, the network device 120 checks whether the block of memory is full. If the block of memory is not full, the network device 120 proceeds to step 616. If the block of memory is full, the network device 120 allocates a new block of memory for the flow or allocates a block of memory from a list of partially filled blocks of memory in step 614. The block of memory does not have to be completely full. A threshold such as 90% full may be set up to indicate that the block of memory is full.


In step 616, the network device 120 checks whether all the packet data has been written. When the network device 120 determines that the flow finishes, the network device 120 can relinquish the block of memory for the flow, so another flow can use the block of memory. In some embodiments that have two levels of memory such as RAM and disk, when the RAM is first used, the block in the RAM is saved to disk. The network device 120 can relinquish the block of memory by adding the block of memory to the list of partially full blocks of memory. The network device 120 may determine that the flow has ended by a TCP reset or finish (FIN) indicator or another termination message that indicates an explicit ending for the flow. The network device 120 may also determine that the flow has ended for this matching technique when after a period of time (e.g., 10 seconds) there is no activity on the flow.


If all the packet data has not been written, the process returns to step 608 to write the remaining packet data. If all the packet data has been written, FIG. 6 ends in step 618.



FIG. 7 depicts a flow chart for the network device that retrieves data in an exemplary implementation of the invention. FIG. 7 begins in step 700. In step 702, the network device 140 receives and processes a message. In step 704, the network device 140 checks whether a retrieve instruction with an index has been received. If a retrieve instruction with an index has not been received, the process continues to step 708. If a retrieve instruction has been received, the network device 140 retrieves the data from the blocks of memory based on the index in step 706.


In step 708, the network device 140 checks whether a store instruction with data has been received. If the store instruction with data has not been received, the process continues to step 712. If the store instruction with data has been received, the network device 140 stores the data in the blocks of memory based on the flow in step 710. In step 712, the network device 140 forwards the data to the destination. FIG. 7 ends in step 714.



FIG. 8 illustrates a block diagram of the network device 120 in an exemplary implementation of the invention. The network device 140 may have a similar configuration as the network device 120. The network device 120 includes a processor 810, a communications interface 820, and a memory 830, which are all coupled to a system bus 850. The processor 810 is configured to execute executable instructions.


The communications interface 820 is coupled to the communications network 130 and the source 110 via the link 825. The communications interface 820 is configured to exchange communications between the source 110, the communications network 130, and the other elements in the network device 120. In some embodiments, the communications interface 820 may comprise a Local Area Network interface for the source 110 and a Wide Area Network interface for the communications network 130. The memory 830 is any memory configured to store data. Some examples of the memory 830 are storage devices, such as RAM, hard drives, optical drives, flash memory, and magnetic tape. As discussed above, the memory 830 may comprise two levels of memory. One level is a more temporary or short term memory such as RAM, and the other level is a more permanent or long term memory such as disk.


The above-described functions can be comprised of executable instructions that are stored on storage media. The executable instructions can be retrieved and executed by a processor. Some examples of executable instructions are software, program code, and firmware. Some examples of storage media are memory devices, tape, disks, integrated circuits, and servers. The executable instructions are operational when executed by the processor to direct the processor to operate in accord with the invention. Those skilled in the art are familiar with executable instructions, processor(s), and storage media.


The above description is illustrative and not restrictive. Many variations of the invention will become apparent to those of skill in the art upon review of this disclosure. The scope of the invention should, therefore, be determined not with reference to the above description, but instead should be determined with reference to the appended claims along with their full scope of equivalents.

Claims
  • 1. A system for processing packets, the system comprising: a communications interface configured to receive a plurality of packets at a network device between a source and a destination, each of the packets including data and flow information; anda processor configured to identify a flow between the source and the destination based on the flow information in each of the plurality of packets to determine whether at least a portion of the data from the received plurality of packets indicates a potential match to data in storage using hashes, to remove a header from each of the plurality of packets if no indication is found of a potential match to data in storage, to store the data from each of the plurality of packets with the removed header in a block of memory allocated for the flow, wherein the allocated block of memory comprises a page of contiguous memory and the storage includes both fast memory and slow memory used to process the flow, to retrieve a list of possible data matches, to use a computed hash index to index a hash table for the data, to determine match sizes of the most likely data match and second most likely data match, to store the data from the most likely data match and second most likely data match in the block of memory allocated for the flow if both match sizes are small, and to generate a retrieve instruction with the computed hash index of the data for the match with the larger match size if at least one match size is not small.
  • 2. The system of claim 1 wherein the processor is configured to move the storage data between the fast memory and the slow memory.
  • 3. The system of claim 1 wherein the flow comprises a session between the source and the destination.
  • 4. The system of claim 1 wherein the processor is configured to retrieve a hash state of the hashes for the flow.
  • 5. The system of claim 1 wherein the processor is configured to determine the hashes for the flow.
  • 6. The system of claim 1 wherein the processor is configured to store a hash state of the hashes for the flow.
  • 7. The system of claim 1 wherein the processor is configured to update the hash table with the hashes.
  • 8. The system of claim 1 wherein the hashes are rolling.
  • 9. The system of claim 1 wherein the processor is configured to allocate the block of the memory for the identified flow.
  • 10. The system of claim 1 wherein the processor is configured to transmit the packet data.
  • 11. The system of claim 1 wherein the block of memory allocated for the flow is allocated from a list of partially filled blocks of memory.
  • 12. A method for processing packets, the method comprising: receiving a plurality of packets at a network device between a source and a destination, each of the packets including data and flow information;identifying a flow between the source and the destination based on the flow information in each of the plurality of packets;determining whether at least a portion of the data from the received plurality of packets indicates a potential match to data in storage using hashes;retrieving a list of possible data matches;using a computed hash index to index a hash table for the data;determining match sizes of the most likely data match and second most likely data match;storing the data from the most likely data match and second most likely data match in a block of memory designated for the flow, if both match sizes are small;generating a retrieve instruction with the computed hash index of the data for the match with the larger match size, if at least one match size is not small;removing a header from each of the plurality of packets if no indication is found of a potential match to data in storage; andstoring the data from each of the plurality of packets with the removed header in a block of memory allocated for the flow, wherein the allocated block of memory comprises a page of contiguous memory and the storage includes both fast memory and slow memory used to process the flow.
  • 13. The method of claim 12 further comprising moving the storage data between the fast memory and the slow memory.
  • 14. The method of claim 12 wherein the flow comprises a session between the source and the destination.
  • 15. The method of claim 12 further comprising retrieving a hash state of the hashes for the flow.
  • 16. The method of claim 12 further comprising determining the hashes for the flow.
  • 17. The method of claim 12 further comprising storing a hash state of the hashes for the flow.
  • 18. The method of claim 12 further comprising updating the hash table with the hashes.
  • 19. The method of claim 12 wherein the hashes are rolling.
  • 20. The method of claim 12 further comprising allocating the block of the memory for the identified flow.
  • 21. The method of claim 12 further comprising transmitting the packet data.
  • 22. The method of claim 12 further comprising: allocating the block of memory for the identified flow from a list of partially filled blocks of memory.
  • 23. A software product for processing packets, the software product comprising: software operational when executed by a processor to direct the processor to receive a plurality of packets at a network device between a source and a destination, each of the packets including data and flow information, to identify a flow between the source and the destination based on the flow information in each of the plurality of packets, to determine whether at least a portion of the data from the received plurality of packets indicates a potential match to data in storage using hashes, to retrieve a list of possible data matches, using a computed hash index to index a hash table for the data, to determine match sizes of the most likely data match and second most likely data match, to store the data from the most likely data match and second most likely data match in the block of memory allocated for the flow if both match sizes are small, to generate a retrieve instruction with the computed hash index of the data for the match with the larger match size if at least one match size is not small, to remove a header from each of the plurality of packets if no indication is found of a potential match to data in storage, and to store the data from each of the plurality of packets with the removed header in a block of memory allocated for the flow, wherein the allocated block of memory comprises a page of contiguous memory and the storage includes both fast memory and slow memory used to process the flow; anda non-transitory computer-readable storage medium configured to store the software.
  • 24. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to move the storage data between the fast memory and the slow memory.
  • 25. The software product of claim 23 wherein the flow comprises a session between the source and the destination.
  • 26. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to retrieve a hash state of the hashes for the flow.
  • 27. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to determine the hashes for the flow.
  • 28. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to store a hash state of the hashes for the flow.
  • 29. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to update the hash table with the hashes.
  • 30. The software product of claim 23 wherein the hashes are rolling.
  • 31. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to allocate the block of the memory for the identified flow.
  • 32. The software product of claim 23 wherein the software is operational when executed by the processor to direct the processor to transmit the packet data.
  • 33. The software product of claim 23 wherein the block of memory allocated for the flow is allocated from a list of partially filled blocks of memory.
US Referenced Citations (191)
Number Name Date Kind
5307413 Denzer Apr 1994 A
5359720 Tamura et al. Oct 1994 A
5483556 Pillan et al. Jan 1996 A
5592613 Miyazawa et al. Jan 1997 A
5611049 Pitts Mar 1997 A
5675587 Okuyama et al. Oct 1997 A
5754774 Bittinger et al. May 1998 A
5802106 Packer Sep 1998 A
5883891 Williams et al. Mar 1999 A
6000053 Levine et al. Dec 1999 A
6003087 Housel et al. Dec 1999 A
6081883 Popelka et al. Jun 2000 A
6295541 Bodnar et al. Sep 2001 B1
6308148 Bruins et al. Oct 2001 B1
6311260 Stone et al. Oct 2001 B1
6374266 Shnelvar Apr 2002 B1
6434662 Greene et al. Aug 2002 B1
6438664 McGrath et al. Aug 2002 B1
6452915 Jorgensen Sep 2002 B1
6587985 Fukushima et al. Jul 2003 B1
6618397 Huang Sep 2003 B1
6633953 Stark Oct 2003 B2
6643259 Borella et al. Nov 2003 B1
6650644 Colley et al. Nov 2003 B1
6674769 Viswanath Jan 2004 B1
6718361 Basani et al. Apr 2004 B1
6738379 Balazinski et al. May 2004 B1
6769048 Goldberg et al. Jul 2004 B2
6791945 Levenson et al. Sep 2004 B1
6856651 Singh Feb 2005 B2
6859842 Nakamichi et al. Feb 2005 B1
6910106 Sechrest et al. Jun 2005 B2
6968374 Lemieux et al. Nov 2005 B2
6978384 Milliken Dec 2005 B1
7007044 Rafert et al. Feb 2006 B1
7020750 Thiyagaranjan et al. Mar 2006 B2
7035214 Seddigh et al. Apr 2006 B1
7069342 Biederman Jun 2006 B1
7113962 Kee et al. Sep 2006 B1
7120666 McCanne et al. Oct 2006 B2
7145889 Zhang et al. Dec 2006 B1
7197597 Scheid et al. Mar 2007 B1
7200847 Straube et al. Apr 2007 B2
7215667 Davis May 2007 B1
7242681 Van Bokkelen et al. Jul 2007 B1
7243094 Tabellion et al. Jul 2007 B2
7266645 Garg et al. Sep 2007 B2
7318100 Demmer et al. Jan 2008 B2
7366829 Luttrell et al. Apr 2008 B1
7380006 Srinivas et al. May 2008 B2
7383329 Erickson Jun 2008 B2
7383348 Seki et al. Jun 2008 B2
7388844 Brown et al. Jun 2008 B1
7389357 Duffie et al. Jun 2008 B2
7389393 Karr et al. Jun 2008 B1
7417991 Crawford et al. Aug 2008 B1
7420992 Fang et al. Sep 2008 B1
7428573 McCanne et al. Sep 2008 B2
7451237 Takekawa et al. Nov 2008 B2
7453379 Plamondon Nov 2008 B2
7457315 Smith Nov 2008 B1
7471629 Melpignano Dec 2008 B2
7532134 Samuels et al. May 2009 B2
7555484 Kulkarni et al. Jun 2009 B2
7571343 Xiang et al. Aug 2009 B1
7619545 Samuels et al. Nov 2009 B2
7620870 Srinivasan et al. Nov 2009 B2
7639700 Nabhan et al. Dec 2009 B1
7676554 Malmskog et al. Mar 2010 B1
7714747 Fallon May 2010 B2
7746781 Xiang Jun 2010 B1
7764606 Ferguson et al. Jul 2010 B1
7849134 McCanne et al. Dec 2010 B2
7853699 Wu et al. Dec 2010 B2
7873786 Singh et al. Jan 2011 B1
7941606 Pullela et al. May 2011 B1
7953869 Demmer et al. May 2011 B2
8140757 Singh et al. Mar 2012 B1
8595314 Hughes Nov 2013 B1
20010054084 Kosmynin Dec 2001 A1
20020007413 Garcia-Luna-Aceves et al. Jan 2002 A1
20020040475 Yap et al. Apr 2002 A1
20020061027 Abiru et al. May 2002 A1
20020065998 Buckland May 2002 A1
20020078242 Viswanath Jun 2002 A1
20020101822 Ayyagari et al. Aug 2002 A1
20020107988 Jordan Aug 2002 A1
20020116424 Radermacher et al. Aug 2002 A1
20020131434 Vukovic et al. Sep 2002 A1
20020150041 Reinshmidt et al. Oct 2002 A1
20020163911 Wee et al. Nov 2002 A1
20020169818 Stewart et al. Nov 2002 A1
20020181494 Rhee Dec 2002 A1
20020188871 Noehring et al. Dec 2002 A1
20020194324 Guha Dec 2002 A1
20030002664 Anand Jan 2003 A1
20030009558 Ben-Yehezkel Jan 2003 A1
20030123481 Neale et al. Jul 2003 A1
20030133568 Stein et al. Jul 2003 A1
20030142658 Ofuji et al. Jul 2003 A1
20030149661 Mitchell et al. Aug 2003 A1
20030149869 Gleichauf Aug 2003 A1
20030233431 Reddy et al. Dec 2003 A1
20040008711 Lahti et al. Jan 2004 A1
20040047308 Kavanagh et al. Mar 2004 A1
20040083299 Dietz et al. Apr 2004 A1
20040086114 Rarick May 2004 A1
20040114569 Naden et al. Jun 2004 A1
20040117571 Chang et al. Jun 2004 A1
20040123139 Aiello et al. Jun 2004 A1
20040179542 Murakami et al. Sep 2004 A1
20040199771 Morten et al. Oct 2004 A1
20040202110 Kim Oct 2004 A1
20040203820 Billhartz Oct 2004 A1
20040205332 Bouchard et al. Oct 2004 A1
20040243571 Judd Dec 2004 A1
20040255048 Lev Ran et al. Dec 2004 A1
20050010653 McCanne Jan 2005 A1
20050044270 Grove et al. Feb 2005 A1
20050053094 Cain et al. Mar 2005 A1
20050091234 Hsu et al. Apr 2005 A1
20050111460 Sahita May 2005 A1
20050131939 Douglis et al. Jun 2005 A1
20050132252 Fifer et al. Jun 2005 A1
20050141425 Foulds Jun 2005 A1
20050171937 Hughes et al. Aug 2005 A1
20050177603 Shavit Aug 2005 A1
20050190694 Ben-Nun et al. Sep 2005 A1
20050210151 Abdo et al. Sep 2005 A1
20050220019 Melpignano Oct 2005 A1
20050235119 Sechrest et al. Oct 2005 A1
20050243743 Kimura Nov 2005 A1
20050243835 Sharma et al. Nov 2005 A1
20050256972 Cochran et al. Nov 2005 A1
20050278459 Boucher et al. Dec 2005 A1
20050286526 Sood et al. Dec 2005 A1
20060013210 Bordogna et al. Jan 2006 A1
20060026425 Douceur et al. Feb 2006 A1
20060031936 Nelson et al. Feb 2006 A1
20060036901 Yang et al. Feb 2006 A1
20060039354 Rao et al. Feb 2006 A1
20060059171 Borthakur et al. Mar 2006 A1
20060059173 Hirsch et al. Mar 2006 A1
20060117385 Mester et al. Jun 2006 A1
20060143497 Zohar et al. Jun 2006 A1
20060195547 Sundarrajan et al. Aug 2006 A1
20060195840 Sundarrajan et al. Aug 2006 A1
20060212426 Shakara et al. Sep 2006 A1
20060218390 Loughran et al. Sep 2006 A1
20060227717 van den Berg et al. Oct 2006 A1
20060250965 Irwin Nov 2006 A1
20060268932 Singh et al. Nov 2006 A1
20060280205 Cho Dec 2006 A1
20070002804 Xiong et al. Jan 2007 A1
20070011424 Sharma et al. Jan 2007 A1
20070076693 Krishnaswamy Apr 2007 A1
20070110046 Farrell et al. May 2007 A1
20070115812 Hughes May 2007 A1
20070127372 Khan et al. Jun 2007 A1
20070130114 Li et al. Jun 2007 A1
20070140129 Bauer et al. Jun 2007 A1
20070174428 Lev Ran et al. Jul 2007 A1
20070195702 Yuen et al. Aug 2007 A1
20070198523 Hayim Aug 2007 A1
20070226320 Hager et al. Sep 2007 A1
20070244987 Pedersen et al. Oct 2007 A1
20070245079 Bhattacharjee et al. Oct 2007 A1
20070248084 Whitehead Oct 2007 A1
20070258468 Bennett Nov 2007 A1
20070263554 Finn Nov 2007 A1
20070276983 Zohar et al. Nov 2007 A1
20080005156 Edwards et al. Jan 2008 A1
20080013532 Garner et al. Jan 2008 A1
20080016301 Chen Jan 2008 A1
20080028467 Kommareddy et al. Jan 2008 A1
20080095060 Yao Apr 2008 A1
20080133536 Bjorner et al. Jun 2008 A1
20080184081 Hama et al. Jul 2008 A1
20080229137 Samuels et al. Sep 2008 A1
20080243992 Jardetzky et al. Oct 2008 A1
20080313318 Vermeulen et al. Dec 2008 A1
20080320151 McCanne et al. Dec 2008 A1
20090060198 Little Mar 2009 A1
20090100483 McDowell Apr 2009 A1
20090158417 Khanna et al. Jun 2009 A1
20090175172 Prytz et al. Jul 2009 A1
20090234966 Samuels et al. Sep 2009 A1
20100011125 Yang et al. Jan 2010 A1
20100020693 Thakur Jan 2010 A1
20100115137 Kim et al. May 2010 A1
20100290364 Black Nov 2010 A1
Foreign Referenced Citations (1)
Number Date Country
1507353 Feb 2005 EP
Non-Patent Literature Citations (12)
Entry
Muthitacharoen, Athicha et al., “A Low-bandwidth Network File System,” 2001, in Proc. of the 18th ACM Symposium on Operating Systems Principles, Banff, Canada, pp. 174-187.
Zhao et al.; “Analysis and Improvement on IPSEC Anti-Replay Window Protocol”; 2003; IEEE' pp. 553-558.
Singh et al. ;“Future of Internet Security-IPSEC”; 2005; pp. 1-8.
Shared LAN Cache Datasheet, 1996, http://www.lancache.com/slcdata.htm.
““A protocol-independent technique for eliminating redundant network traffic””, ACM SIGCOMM Computer CommunicationReview, vol. 30, Issue 4 (Oct. 2000) pp. 87-95, Year of Publication: 2000.
“B. Hong, D. Plantenberg, D. D. E. Long, and M. Sivan-Zimet. ““Duplicate data elimination in a SAN file system””, InProceedings of the 21 st Symposium on Mass Storage Systems (MSS '04), Goddard, MD, Apr. 2004. IEEE.”
“You, L. L. and Karamanolis, C. 2004. ““Evaluation of efficient archival storage techniques””, In Proceedings of the 21 st IEEESymposium on Mass Storage Systems and Technologies (MSST).”
“Fred Douglis and Arun Iyengar, Application specific Delta-encoding via Resemblance Detection”, Published in the 2003 USENIXAnnual Technical Conference.
“You L. L. et al., ““Deep Store an Archival Storage System Architecture”” Data Engineering, 2005. ICDE 2005. Proceedings. 21st.inti Conf on Tokyo, Japan, Apr. 5-8, 2005, pp. 12”.
“Udi Manber. ““Finding Similar Files in a Large File System””, TR 93-33 Oct. 1994, Department of Computer Science,University of Arizona. http://webglimpse.net/pubs/TR93-33.pdf. Also appears in the 1994 winter USENIX Technical Conference.”
Knutsson, Bjorn et al., “Transparent Proxy Signalling”, Journal of Communications and Networks, vol. 3, No. 2, Jun. 2001.
Newton, “Newton's Telecom Dictionary”, 17th Ed., 2001, pp. 38, 201, and 714.
Related Publications (1)
Number Date Country
20080031240 A1 Feb 2008 US