The field of invention relates to networking, generally; and, more specifically, to a method and apparatus for calculating TCP and UDP checksums while preserving CPU resources.
a. TCP and UDP Checksums
A number of different packet based protocols have been defined to enable interconnected systems to communicate with each other. For example, the Internet Protocol (IP) defines a networking layer protocol that allows packets to be routed, switched, or otherwise passed from network node to network node as they progress from a source point to a destination point within an IP network. At the transport layer, the Transmission Control Protocol (TCP) or the User Datagram Protocol (UDP) may be used to control the flow of packets from a source point to a destination point. TCP is a connection oriented protocol while UDP is a datagram oriented protocol. Either may be viewed as a transport layer protocol that can be configured to “run on top of” the IP protocol.
a examines this in more detail.
Referring to
The UDP header embodiment 102a of
A single IP node (e.g., a source IP node or a destination IP node) may be configured to support multiple ports. Thus, for example, a group of packets that are directed to a particular IP destination node may be ultimately delivered to different agents that each view the destination IP node as their access point to the IP network.
Commonly, multiple ports are associated with the same machine (e.g., a computing system) wherein the machine has a specific IP address. For example, a server may have a specific IP address and the application programs running on the server may each be identified with a different port. This allows the different application programs to establish different flow arrangements with the IP network (via the TCP or UDP headers) while sharing the same access point to the IP network (via the network resources of the server and its corresponding IP address).
The TCP header embodiment 102b of
Referring back to
A checksum is a number whose value represents the particular sequence of bits found within a block of data. As such, two identical blocks of data have the same checksum; and, two different blocks of data statistically have two different checksum values. In typical embodiments, the UDP/TCP pseudo header 112a, 112b, the UDP/TCP header 102a, 102b and the application data 101a are together viewed as the “block” of data over which the checksum is calculated. This “block” of data is effectively viewed as a succession of 16 bit integers that are summed using one's complement logic.
The end result of this addition is the checksum value 110b, 110c that is stored in the UDP/TCP header 102a, 102b of
As seen in
The IP packet 120a is then presented to the particular physical media network type (e.g., Ethernet) that interconnects the sending node to the “next” node in the network. Note that in
Once received at its destination, the packet is deconstructed. Moving upward in
In at least one approach, the checksum is then re-calculated over the course of the pseudo header 112a, 112b, UDP/TCP header 102a, 102b, and application data 101a. The checksum calculated at the destination is then compared with the checksum that was extracted from the UDP/TCP header 102a, 102b. If the checksums match, there is a high probability that the data was not corrupted during transmission and the packet is “accepted”; otherwise, it is discarded.
In alternate approaches, the property of 1's complement addition (where X+X′=0) is taken advantage of such that the checksum is calculated over the data and the checksum. If the final answer is “0”, the checksum is deemed “good.” Other types of checksum approaches that are known or yet to be developed may also be used.
b. Software Calculation of the TCP and UDP Checksums
In order to implement the software methodologies that execute on a CPU 200, the processor(s) 201 typically execute a plurality of instructions and manipulate a plurality of data units. The instructions and data units are found in either (or both) the system memory unit 202a and the cache unit 206a. Generally, frequently used (and/or imminently used) instructions and data units are stored in the cache unit 206a. As a result, the instruction execution logic of the processor(s) 201a has this information “nearby” so as to avoid the delay associated with retrieving it from system memory 202a. System memory 202a is typically implemented with Dynamic Random Access Memory (DRAM) cells (which are more dense but slower) while the cache unit 206a is typically implemented with Static Random Access Memory (SRAM) cells (which are less dense but faster).
In the prior art the TCP and/or UDP functionality, which includes the TCP and UDP checksum calculations described above, is implemented in software. As such, the checksum calculation process is executed via the execution of instructions and the manipulation of data units that reside in system memory 202 and/or cache 206. The intensive operations associated with the calculation of a checksum tend to hinder the performance of the CPU 200. That is, the repeated additions of fixed length sections of the application data, UDP/TCP header and pseudo header consume the resources of the system memory 202, and execution instruction logic and “pollute” the cache 206 within the processors 201 such that resources the CPU 200 can devote to other functions (e.g., application programs) is noticeably reduced.
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements, and in which:
a shows the formation and deconstruction of an IP packet.
b shows a UDP header and its associated fields.
c shows a TCP header and its associated fields.
d shows a UDP or TCP pseudo header and its associated fields.
Embodiments of the present invention (as described below as well as others) may be realized in accordance with the following teachings and it should be evident that various modifications and changes may be made in the following teachings without departing from the broader spirit and scope of the invention. The specification and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense and the invention measured only in terms of the claims.
For simplicity, most of the following discussion refers to embodiments wherein checksums are calculated within the network processing offload unit 310; and, the remainder of the IP and TCP or UDP processing burden is executed by the CPU 300 with software routines. As such, the term “packet” as used below may be construed as referring to a body of data that includes an IP layer header, a UDP or TCP layer header and application data. The term UDP packet or TCP packet refers to a packet without the IP layer header.
Referring to the
A bridge device such as the system bridge device 307 of
The memory controller 303 embodiment of
The memory controller 303 and system bridge 307 may be implemented in various ways. For example, according to at least a few different approaches, the memory controller 303 is implemented as one or more semiconductor chips from the “chipset” product offering(s) of Intel Corporation of Santa Clara, Calif. (e.g., '440BX, '820, etc.).
In alternate embodiments, the memory controller 303 is implemented as a commercial product offering from another semiconductor chip supplier or as a proprietary solution. Note also that the system bridge 307 and the memory controller 303 can be effectively merged in various other embodiments (e.g., such that a system bus 308 emerges from the CPU 300).
The system memory interface 305 may be embodied with various current technologies such as Synchronous DRAM (SDRAM); Double Date Rate DRAM (DDR); and Rambus DRAM (RDRAM) to name a just a few. Proprietary interfaces may be used; and, with regard to future embodiments, standard interfaces yet to be defined may also be used.
In the particular system architecture embodiment of
Note that a pseudo header may be built with information that is, where appropriate, extracted from the packet itself (e.g., the source IP address 150 of
In an embodiment, the inbound checksum engine 402 performs checksum calculation on an inbound TCP or UDP packet (i.e., a TCP packet or UDP packet that has been received). Referring to
After or during calculation 502 of the checksum, the inbound packet is stored 503 within the network offload memory 409. For example, earlier bytes of the application data that have already been processed by the checksum calculation process may be stored into the network offload memory 409 while later bytes of the application data from the same packet are incorporated into the checksum calculation process or wait to be incorporated. Alternatively, an inbound packet can have its checksum calculated before any of its bytes are written into the network offload memory 409.
In an embodiment, once a checksum value is calculated, the CPU is notified 503. The calculated checksum value may be appended to the inbound packet within the network offload memory 409. Alternatively or in combination, the calculated checksum may be stored within a register located within the network processing offload unit 410 (e.g., within the inbound checksum engine 402). In the former case, the CPU receives the checksum by reading the inbound packet out of the network offload memory 409 (because the checksum, being appended to the packet, travels along with the packet). In the latter case, the CPU receives the checksum by invoking a read from the register where the checksum is stored.
The CPU may be notified that a packet has arrived in various ways such as a status line 424 that goes active upon the completion of the checksum calculation process (which can be polled by the CPU processor(s) or, alternatively, directed to an interrupt input to the CPU processor(s)). The status line 424 can be separate from (or integrated into) the interface 408 to the system bus. Other ways of signaling an event to a CPU may be used instead of those described just above as well. Note that the notification 503 may occur before, in parallel with, or after the packet has been written into the network offload memory 409.
Once the CPU becomes aware that a packet has arrived and decides to move the packet from the network offload memory 409, the packet is transferred 504 from the network offload memory 309, 409 to the system memory 302 (referring to
A preliminary methodology of the software can be to compare 505 the checksum value of the packet (which can be found in the system memory within the transport layer header of the packet) with the checksum value that was calculated by the inbound checksum engine 402. If they match, the packet may be deemed “accepted”; or, if they do not match, the packet may be deemed “not accepted” (and discarded from system memory 302 as a result).
In order to compare the packet's checksum value with the calculated checksum value: 1) the checksum value of the packet can be retrieved from the system memory 302 (e.g., by reading the packet's header information); and, 2) if the calculated checksum value is appended to the packet, the calculated checksum value can also be retrieved from system memory 302 (e.g., along with the packet's checksum value), or alternatively, the calculated checksum value can be retrieved from appropriate register space (e.g., within the network processing offload unit 410). Regardless, note that the packet's application data payload does not need to be read from the system memory 302 and processed by the CPU 300 for checksum calculation purposes. As such, CPU 300 resources are conserved and can be devoted to other tasks. In addition, the cache memory on the CPU is not polluted by the data used to calculate the checksum for the packet.
Register 604 stores the checksum value calculated by the inbound checksum engine 602. The checksum processing and control logic 610 can be designed to alternatively store the calculated checksum value into the register 604 or append the calculated checksum value to the inbound packet (e.g., as indicated by preference of the CPU); or may be designed do both (i.e., store the calculated checksum value into the register 604 and append the calculated checksum value to the inbound packet) at the request of the CPU or as an automatic function.
Referring to
Note that the embodiment of
In alternate embodiments, a bridge unit (or other type of bus-to-bus translation unit) may be inserted between the networking processing offload bus 422 and the interface to the system bus 408 (e.g., along signaling leg 423) so that the networking processing offload unit bus 422 may operate in relative isolation with respect to the system bus 408. In a further related embodiment, the bridge unit (or other translation device) is integrated into the system interface checksum engine 403.
In alternate embodiments, the network processing offload unit 410 may be designed without a central bus 422 (e.g., by wiring point-to-point communication between the functional unit 401 through 404 of the network offload processing unit 410). Further still, other architectures are possible as well (e.g., by giving the system interface checksum engine 403 its own unique interface to the system bus 408).
The above discussion applied to inbound packets.
Referring to
The packet is then read 702 from the network offload memory 409 and presented to an outbound checksum engine 401 which calculates 703 the checksum for the outgoing packet. Note that the outbound packet may be read 702 from the network offload memory 409 before or during the calculation 703 of the checksum. For example, later bytes of the application data that have not been incorporated into the checksum calculation process may be read from the network offload memory while other earlier bytes of the application data from the same packet are being (or have been) incorporated into the checksum calculation process.
Alternatively, a packet can have its checksum calculated after it has been read in its entirety from the network offload memory 409. After the outbound checksum engine 401 engine calculates the checksum 703 on the outgoing packet, the outbound packet is forwarded to the outbound network interface 421. The packet is then transmitted 709.
A register 804 may be coupled to the checksum processing and control unit 803 so that the calculated checksum can be calculated and held before it is inserted into the appropriate location in the outbound packet header (e.g., header field 110b of
A “no-transmit” mode of operation, referring back to
According to the methodology of
In an embodiment, an outbound checksum processing engine is used to perform a “no-transmit” checksum. For example, referring back to the embodiment of
Referring back to
Because UDP packets can be large as compared to supported IP Packet sizes, UDP packets are frequently fragmented (i.e., broken up into pieces) at the IP layer. As such, if the outbound checksum engine 402 is designed to calculate a checksum on an entire packet (as opposed to a series of pieces of a packet), the size of a worst case designed for UDP packet may be too large to efficiently construct an outbound queue 802 and outbound checksum processing and control unit 803. In addition, since a single UDP packet can span multiple IP packets, transmission of the first IP packet typically could not occur until all IP packets encapsulating this UDP packet are transferred to the outbound checksum engine 401. A design point may be conceived that efficiently calculates checksums on entire TCP packets and on pieces of UDP packets. Correspondingly, an embodiment may exist wherein the outbound checksum engine 401 is designed to calculate checksums on entire packets (and is therefore used to calculate the checksum for outgoing TCP packets) and the system interface engine is designed to calculate checksums on pieces of packets (and is therefore used to calculate the checksum for outgoing UDP packets).
As checksums substantially involve the repeated addition of new data to a previous calculated sum, the checksum is continuously calculated as the packet pieces flow through the system interface checksum engine 403. The packet pieces then flow out of the system interface checksum engine 403 (after having been used to update the checksum value) and are subsequently written into the network offload memory 409. With the network offload memory 409 stockpiling the packet pieces, the network offload memory will have a complete packet once all of the packet pieces have flowed through the system interface checksum engine.
Referring to
Referring back to
Note that, referring to
In this approach, the offload processor 1250 uses the network offload memory 1209 to store: 1) the data units that it operates upon; and, 2) the instruction units that collectively represent the software routine(s) that are executed by the offload processor 1250. The amount of network processing that the offload processor 1250 performs can affect the form of the data passed between the CPU 1200 and the networking offload processing unit 1210.
For example, if the offload processor 1250 performs all (or substantially all) of the networking protocol functionality (e.g., all IP and TCP/UDP related protocols), then (in one embodiment) only application data is passed between the CPU 1200 and the networking processing offload unit 1210 via system bus 1208. In another approach, the software of CPU 1200 is designed to execute TCP/UDP related functionality while the software of offload processor 1250 is designed to perform IP related functionality. In this case, UDP/TCP packets (i.e., application data plus a TCP/UDP header without an IP header) are passed between the CPU 1200 and the networking processing offload unit 1210 via system bus 1208.
In even further alternative approaches, the functional responsibility of the network processing offload unit 1210 and the CPU 1200 may be split in a less clean manner (i.e., in a manner other than a rigid, or intuitive split between the application and transport layers, or networking and transport layers) such that various responsibilities of a common networking layer are executed by both the network processing offload unit 1210 and the CPU 1200. If such approaches are undertaken those of ordinary skill can determine, on a case-by-case basis, the appropriate data format to be passed between the CPU 1200 and network processing offload unit 1210.
Regardless, the term “section of data” may be used to refer to those data pieces that are: 1) passed between the network offload processing unit and the CPU; and, 2) contain information over which the checksum value is calculated. In some cases the entire data region over which the checksum is calculated (i.e., the block of data) may be larger than the section of data. For example, if only application data is passed between the network offload processing unit and the CPU, the term “section of data” refers to the application data even though a UDP or TCP checksum calculation includes the UDP or TCP header (which would be constructed/analyzed within the network offload processor).
In other cases, the block of data over which the checksum is calculated is almost identical to the section of data. For example, in cases where the TCP/UDP header and the application data (i.e., a TCP packet or a UDP packet) are passed between the network offload processor and the CPU, the term “section of data” refers to a TCP packet or a UDP packet. Here, the only difference between the section of data and the block of data over which the checksum is calculated is the pseudo header. Other embodiments are possible as well.
Number | Name | Date | Kind |
---|---|---|---|
4991133 | Davis et al. | Feb 1991 | A |
5430842 | Thompson et al. | Jul 1995 | A |
5500864 | Gonia et al. | Mar 1996 | A |
5638370 | Seconi et al. | Jun 1997 | A |
5790786 | Wakeman et al. | Aug 1998 | A |
5826032 | Finn et al. | Oct 1998 | A |
5898713 | Melzer et al. | Apr 1999 | A |
5937169 | Connery et al. | Aug 1999 | A |
6032253 | Cashman et al. | Feb 2000 | A |
6173333 | Jolitz et al. | Jan 2001 | B1 |
6226680 | Boucher et al. | May 2001 | B1 |
6226771 | Hilla et al. | May 2001 | B1 |
6247060 | Boucher et al. | Jun 2001 | B1 |
6279140 | Slane | Aug 2001 | B1 |
6289023 | Dowling et al. | Sep 2001 | B1 |
6310884 | Odenwald, Jr. | Oct 2001 | B1 |
6345302 | Bennett et al. | Feb 2002 | B1 |
6389479 | Boucher et al. | May 2002 | B1 |
6427171 | Craft et al. | Jul 2002 | B1 |
6427173 | Boucher et al. | Jul 2002 | B1 |
6434620 | Boucher et al. | Aug 2002 | B1 |
6470415 | Starr et al. | Oct 2002 | B1 |
6530061 | Labatte | Mar 2003 | B1 |
6629288 | Bernath et al. | Sep 2003 | B1 |
6631488 | Stambaugh et al. | Oct 2003 | B1 |
6647528 | Collette et al. | Nov 2003 | B1 |