This application claims the priority of Korean Patent Application No. 2002-74352, filed on Nov. 27, 2002, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
1. Field of the Invention
The present invention relates to a routing technique for transmitting packets to a desired destination over the Internet, and more particularly, to an Internet protocol (IP) address lookup system and method for forwarding a packet on a data plane of a router.
2. Description of the Related Art
As the number of Internet users has rapidly increased and Internet services have been remarkably diversified, Internet traffic has exponentially grown. Accordingly, a technique of searching for a desired destination to which a packet is to be transmitted as fast as possible and then forwarding the packet over the Internet has been suggested.
One of the main functions of a router is to forward a packet so that the packet can be successfully transmitted to a final destination. To forward a packet over the Internet, the router searches for an address and output port of a next hop router to which the packet is to be transmitted.
In the case of an early version of router, the time taken to search for and process a destination of a packet is shorter than the time taken to transmit the packet to the destination along a transmission path connected to the router. Thus, the early version of router can connect sub-networks to one another over the Internet without difficulty. These days, the time taken for a router to transmit a packet has been relatively reduced, as compared with the time taken for the router to search for and process the packet, and Internet traffic has exponentially increased. However, the performance of the router processing packets has not been enhanced in accordance with the speed of transmitting data. A backbone router capable of dealing with a few hundred giga bytes or a number of tera bytes, in particular, may undesirably serve as a bottleneck section on the Internet.
The present invention provides an IP address lookup system and method, which are capable of overcoming limits of a multi-bit trie and enhancing lookup speed by using a forwarding table having a multi-bit trie structure and comprised of three sub-tables in an IP address lookup.
According to an aspect of the present invention, there is provided an IP address lookup system. The IP address lookup system includes a forwarding table which has a three-layer table architecture so that it can search for each address group that constitutes an IP destination address of an input packet and a forwarding engine which obtains packet processing information and next hop information for the input packet by searching for the forwarding table using the IP destination address as a search key.
According to another aspect of the present invention, there is provided an IP address lookup method. The IP address lookup method involves (a) extracting an IP destination address from an input IP data packet; (b) extracting entries of a first table corresponding to eight most significant bits of the IP destination address; (c) extracting entries of a second table corresponding to a result of shifting fourteen bits following the eight most significant bits of the IP destination address by as much as a first shift bit; (d) extracting entries of a third table corresponding to a result of shifting twelve most significant bits of the IP destination address by as much as a second shift bit; and (e) extracting next hop information and packet processing information from a searched entry of the third table. Here, the first through third tables constitutes a forwarding table which enables each address group of the IP destination address to be searched for.
The above features and advantages of the present invention will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings in which:
The input interface unit 10, which includes a plurality of input link interfaces 100a through 100m, carries out an IP address lookup using a destination address of an IP packet input into the router 1 via the input link interfaces 100a through 100m. Thereafter, the input interface unit 10 transmits the input packet to an input port of a switch included in the switch fabric unit 80 according to a result of the IP address lookup.
The switch fabric unit 80 performs traffic transmission between input and output ports of the router 1, and the output interface unit 20 transmits a packet to an output port in response to the result of the IP address lookup carried out by the input interface unit 10.
The routing information collection and forwarding information generation unit 90 collects routing information using a routing protocol, processes the collected routing information into forwarding information, and transmits the forwarding information to the input interface unit 10. The input interface unit 10 creates forwarding tables 30a through 30m, each comprised of three sub-tables, based on the forwarding information transmitted from the routing information collection and forwarding information generation unit 90.
An IP address lookup function according to the present invention can be applied to the input interface unit 10 and/or the output interface unit 20 of the router 1. Therefore, the input interface unit 10 may have a different structure than the one shown in
The routing information collection and forwarding information generation unit 90 connected to the input interface unit 10 collects routing information using a routing protocol, processes the collected routing information into forwarding information, and transmits the forwarding information to the forwarding table management unit 12 through interprocessor communication (IPC).
The forwarding table management unit 12 downloads the forwarding information from the routing information collection and forwarding information generation unit 90 and creates and manages the forwarding table 30a using the downloaded forwarding information. The forwarding table 30a is comprised of three sub-tables, which will be described in greater detail later with reference to
When a packet is received from the at least one input link interface 100a, the input interface unit 10 carries out physical packet processing and other basic packet processing on the packet and then transmits the processed packet to the forwarding engine 40a.
The packet forwarding unit 50a of the forwarding engine 40a searches for a forwarding entry corresponding to an IP destination address of the packet received from the network interface unit 14 by referring to the forwarding table 30a, which is comprised of three sub-tables. If there exists a forwarding entry corresponding to the IP destination address of the packet, the packet forwarding unit 50a processes the packet using next hop information and basic packet processing information provided by the forwarding entry and then transmits the processed packet to the switch interface unit 16. The protocol packet processing unit 60a processes an internal routing protocol, and the error processing unit 70a determines the packet as an error packet and abandons the packet if forwarding information is not set in the IP destination address of the packet.
The network interface unit 14 receives the packet forwarded from the forwarding engine 40a and then transmits the received packet, also receives the packet from the input link and transmit the packets to the forwarding engine 40a. The switch interface unit 16 transmits the packet which has been through the above IP address lookup process to an input port of a switch included in the switch fabric unit 80, also transmits the packets received from the switch output port of a switch to the forwarding engine 40a.
The first table 310a is a lookup table for 8 most significant bits (MSBs) of an IP address, the second table 330a is a lookup table for ninth through twentieth bits of the IP address, and the third table 350a is a lookup table for 12 least significant bits (LSBs) of the IP address. The forwarding table 30a has multi-bit trie structure. The stride of each level of the multi-bit trie structure of the forwarding table 30a may be differently determined in consideration of the optimization and updating of memory.
Referring to
Each of the entries of the first table 310a includes four fields, i.e., a valid bit field 3101, which accounts for one bit, a flag bit field 3102, which accounts for one bit, a first shift bits field (ShiftBits1) 3103, which accounts for four bits, and a second table offset bit field 3104.
The valid bit field 3101 indicates whether or not forwarding information corresponding to each of the entries exists in a packet. A packet that attempts to look up an entry for which a valid bit 3101 has not been set is transmitted to a host processor or to the error processing unit 70a so that it can be abandoned.
The flag bit field 3102 indicates whether or not an entry designated by the second table offset bit field 3104 has next hop information. Since an 8-bit prefix is rare, the flag bit field 3102 may not be in use, in which case each of the entries of the first table 310a is comprised of only three fields, i.e., the valid bit field 3101, the first shift bits field 3103, and the second table offset bit field 3104.
The first shift bits field 3103 indicates the number of bits of an IP address that are necessary for looking up an entry in the second table 330a. The first shift bits field 3103 indicates by how many bits an IP address is to be shifted. In other words, the first shift bits field 3103 is used for shifting all bits of an IP address that do not need to be compared with entries in the second table 330a during IP lookup process, so as to express the rest bits to be compared with entries in the second table 330a except for 8 most significant bits of the IP address that have already been compared in the first table 310a. Since the first shift bits field 3103 only indicates the number of shift bits of twelve bits except for the 8 most significant bits of the IP address, four bits are enough to constitute the first shift bits field 3103.
The second table offset bit field 3104 indicates an offset of each sub-table (331 through 33j) of the second table 330a from a start address. In the case of using the second table offset bit field 3104 as a pointer for memory, it is possible to provide at most 64 MB memory addresses ranging from 0 to 226. Therefore, in the present invention, a base address, which serves as a bridge between an operating system and an input/output device, is used for expressing an offset_so that the second table 330a can be located within a 64 MB range from the base address.
The first table 310 further includes a final bit field (not shown) which indicates a table next to the first table 310 is the last one. If the final bit field is detected in the first table 310a, an IP address lookup operation is carried out by the third table 350a rather than the second table 330a, a process which will be described in greater detail later with reference to
Referring to
Table 1 shows three different border gateway protocol (BGP), i.e., AS286, AS1221, AS4637 published on Oct. 18, 2002, and the number of entries of each of the BGP tables whose most significant 8 bits are different from one another's.
Supposing that the first and second tables 310a and 330a each has 110 entries, the memory consumption of each of the first and second tables 310a and 330a is a maximum of 1.7 MB. However, each of the 110 entries of the first table 310a does not always have 212 entries in the second table 330a. Therefore, the memory consumption of the first and second tables 310a and 330a may vary depending on the number of second table entries provided for each of the 110 entries of the first table 310a, which is shown in Tables 2 through 4 below.
Tables 2 through 4 show different memory consumptions of IP address lookup systems, respectively, adopting BGP data obtained from AS286, AS1221, and AS4637, respectively. In Table 2 through 4, the stride of the first table 310a is fixed at 8, and the stride of the second table 330a is varied between 8 and 16.
As shown in Tables 2 through 4, the IP address lookup system according to the present invention has less memory consumption than a conventional IP address lookup system having a two-layer table architecture does (Tables 5 through 7.
Referring to
The valid bit field bit field 3301 of each entry of the second table 330a, like the valid bit field of each of the entries of the first table 310a, indicates whether or not each entry is valid. A packet which attempts to look up an invalid entry is transmitted to a host processor or to the error processing unit 70a so that it can be abandoned. The second shift bits field 3302, which indicates by how many bits an IP destination address is to be shifted to determine which bits of the IP destination address are to be compared with each entry of the third table 350a. The third table offset bit field 3303, which indicates an offset from the first table 310a, is used for indicating a start address of a sub-table of the third table 350a.
Referring to
Each entry constituting the third table 350a includes three fields, i.e., a valid bit field 3501, an ours bit field 3502, and a fabric head field 3503.
The valid bit field 3501, like its counterparts 3101 and 3301, indicates whether or not each entry is valid. A packet which attempts to look up an invalid entry is transmitted to a host processor or to the error processing unit 70a so that it can be abandoned. The ours bit field 3502 is used as an interface indicating bit which indicates interface information for each entry of the third table 350a. The fabric header field 3503 indicates a 6-byte next hop media access control (MAC) address and additional header information to be used by the switch fabric unit 80 of
Referring to
After extracting the destination address from the received packet in step 3140, an entry of the first table 310a is extracted in step 3150 by using eight most significant bits of the destination address as an index. For example, supposing that the destination address is “A.B.C.D”, the eight most significant bits of the destination address (i.e., A) are used as an index for the first table 310a. The extracted entry of the first table 310a includes a valid bit field, a second table offset bit field for indicating the location of the second table 330a, and first shift bits field, as shown in
Thereafter, it is checked whether or not the valid bit field included in the extracted entry of the first table 310a is valid in step 3160. If the valid bit field is determined as being invalid, the received packet is transmitted to the error processing unit 70a in step 3170 so that it can be abandoned. Otherwise, it is checked in step 3180 whether or not a final bit of the first table 310a has been detected.
If the final bit of the first table 310a has been detected, the third table 350a is searched. Otherwise, fourteen bits following the most significant 8 bits of the destination address are extracted and then shifted by as much as ShiftBits1 of the first table 310a. A result of the shifting is used as an index for the second table 330a in step 3200. Thereafter, the second table 330a is searched using the index in step 3210.
Referring to
If the valid bit of the extracted entry is determined as being invalid, the received packet is transmitted to the error processing unit 70a in step 3340 so that it can be abandoned. Otherwise, least significant 12 bits of the destination address are extracted and then shifted by as much as ShiftBits2 of the second table 330a. A result of the shifting is used as an index for the third table 350a in step 3350. In step 3360, the third table 350a is searched using the index.
In other words, supposing that the destination address of the input IP data packet is “A.B.C.D”, all bits of the destination address except for the eight most significant bits that have already been compared with the entries of the first table 310a, i.e., ‘A’, and the four least significant bits of ‘C’, and ‘D’ that are to be compared with entries of the third table 350a, are shifted by as much as ShiftBits1. Then, a result of the shifting is used as an index for a sub-table of the second table 330a. An entry designated by the index includes a second shift bits field (ShiftBits2) and a third table offset bit field for indicating the location of the third table 350a (see
For example, when the length of a prefix is 24 bits, the index for the first table 310a is 8 bits, and the index for the second table 330a is 12 bits, four bits need to perform comparison in the third table 350a. In order to compare the four bits with entries of the third table 350a, 4 is subtracted from 12, which is a maximum number of comparison bits of the third table 350a. Then, the number of bits that need to be shifted is 8.
Referring to
In step 3530, it is checked whether or not a valid bit of the extracted entry is valid. If the valid bit of the extracted entry is determined as being invalid, the input IP data packet is transmitted to the error processing unit 70a in step 3450 so that it can be abandoned. Otherwise, next hop information and packet processing information are extracted from the extracted entry of the third table 350a in step 3550. In step 3560, the input IP data packet is transmitted to the switch interface unit 16 according to the extracted packet processing information.
As described above, in the IP address lookup method according to the present invention, next hop information and packet processing information are obtained by searching for a forwarding table based on a three-layer table architecture using a destination address of an input IP data packet.
When it comes to an IP address lookup, there is a need to consider three factors, i.e., the speed of searching for an IP address, the amount of memory consumed in management of a forwarding table, and updating of the forwarding table if forwarding entries are varied. In the present invention, a forwarding table is comprised of three sub-tables and an IP address lookup is carried out using the forwarding table so as to prevent such problems as low search speed, considerable memory consumption, and difficulty in updating the forwarding table. The memory consumption of the present invention is compared with that of the prior art in which a forwarding table having a two-layer table architecture is adopted.
Tables 5 through 7 show different memory consumptions of conventional IP address lookup systems, respectively, adopting BGP data obtained from AS286, AS1221, and AS4637, respectively. Here, the conventional IP address lookup systems adopt a two-layer table architecture.
As shown in Tables 2 through 7, the IP address lookup method using a fowarding table having a three-layer table architecture according to the present invention can reduce memory consumption more considerably than the prior art using a fowarding table having a two-layer table architecture. In addition, the IP address lookup method of the present invention provides more advantages in terms of updating of the forwarding table than the prior art does.
For example, in the case of applying BGP data obtained from AS286 to an IP lookup, as shown in Tables 2 and 5, the total memory consumption of the present invention ranges from a minimum of 2,045,698 bytes to a maximum of 21,879,488 bytes, while total memory consumption of the prior art ranges from a minimum of 7,269,376 bytes to a maximum of 68,572,048 bytes. In addition, in the case of applying BGP data obtained from AS1221 to an IP address lookup, as shown in Tables 3 and 6, the total memory consumption of the present invention ranges from a minimum of 8,114,158 bytes to a maximum of 74,931,040 bytes, while the total memory consumption of the prior art ranges from a minimum of 8,114,158 bytes to a maximum of 74,931,040 bytes. Moreover, in the case of applying BGP data obtained from AS4637 to an IP address lookup, the total memory consumption of the present invention ranges from a minimum of 2,064,626 bytes to a maximum of 22,351,548 bytes, while the total memory consumption of the prior art ranges from a minimum of 7,508,368 bytes to a maximum of 68,549,248 bytes.
Even if a forwarding table used in the prior art is optimized by adjusting the stride of a primary table of the forwarding table, the minimum amount of memory required for an IP address lookup in the prior art is still larger than the amount of memory required for an IP address lookup in the present invention.
In addition, a two-layer table architecture adopted in the prior art requires a table used by a forwarding engine for an IP address lookup and another table for updating a forwarding table. Therefore, the amount of memory consumed in a real setting is twice as much as the calculated memory consumption shown above. On the other hand, in the present invention, additional memory is only necessary for the size of each sub-table constituting a second table and the size of each sub-table constituting a third table require additional memory. In other words, in the present invention, a total of 32 KB, including 16 KB memory for 212 entries of the second table and 16 KB memory for 212 entries of the third table, is additionally required. Here, 212 indicates a maximum size of each sub-table of the second table or the third table.
In the prior art, it takes a while to update a forwarding table because an entire table architecture needs to be reconstructed even when only one forwarding entry is varied. However, in the present invention, it is relatively easy to update a forwarding table because only a sub-table that includes a varied forwarding entry needs to be changed.
So far, only a case where an IP address lookup system having a triple data structure is installed in an input interface unit of a router has been described in greater detail as an embodiment of the present invention. However, the IP address lookup system of the present invention can be installed in an output interface unit of the router or can be installed in both the input and output interface units of the router.
The present invention can be realized as computer-readable codes stored on a computer-readable recording medium. The computer-readable recording medium includes all kinds of recording devices on which data can be stored in a computer-readable manner. For example, the computer-readable recording medium includes ROM, RAM, CD-ROM, a magnetic tape, a floppy disk, an optical data storage, and a carrier wave (such as data transmission through the Internet). In addition, the computer-readable recording medium can be distributed over a plurality of computer systems connected to a network, and computer-readable codes can be stored on and executed from the computer-readable recording medium in a decentralized manner.
Number | Date | Country | Kind |
---|---|---|---|
10-2002-0074352 | Nov 2002 | KR | national |
Number | Name | Date | Kind |
---|---|---|---|
6141738 | Munter et al. | Oct 2000 | A |
6157644 | Bernstein et al. | Dec 2000 | A |
6993031 | Murase | Jan 2006 | B2 |
7065084 | Seo | Jun 2006 | B2 |
7154892 | Oh et al. | Dec 2006 | B2 |
Number | Date | Country | |
---|---|---|---|
20040105442 A1 | Jun 2004 | US |