This invention relates to advertising information in a multi-domain network.
In a multi-domain network each domain, also called an Autonomous System (AS), is under the control of a different Administrative Authority. This complicates the problem of routing traffic across the network. The Border Gateway Protocol (BGP), described in the Internet Engineering Task Force (IETF) Request For Comments (RFC) RFC4271, is the most widely used routing protocol for multi-domain networks. BGP advertises reachability information between domains. BGP is used to scale to full Internet-wide use and, accordingly, a domain is typically only permitted to advertise a single route between a pair of domains. This restricts, or prevents, the possibility of performing traffic engineering (TE) techniques such as load-balancing and providing protection paths.
The IETF has proposed a Path Computation Element (PCE)-based architecture in RFC4655 to provide constraint-based path computations both in single and multi-domain networks. In a PCE-based architecture, a domain has a Path Computation Element (PCE) which is capable of computing a network path, or route, based on a network graph and applying computational constraints. The PCE calculates a route on behalf of Path Computation Clients (PCC) in the domain. A PCC submits a request for a route calculation to the PCE and receives a route in reply. The PCE-based architecture reduces computation load on nodes in the network, and effectively separates the tasks of packet forwarding (which remains at the node) and route calculation (now performed at the PCE).
In multi-domain networks, the PCE-based path computation across domains is complicated by the limited visibility of Traffic Engineering (TE) information which is usually restricted to a single domain. Two procedures called Per-Domain and Backward Recursive PCE-based Computation (BRPC) have been proposed to overcome this limitation. The BRPC procedure has been designed to compute optimal multi-domain paths. It uses the PCE communication Protocol (PCEP) to allow the PCE controlling the destination domain to initiate in a reverse fashion the recursive path computation along the sequence of domains to be traversed, towards the PCE controlling the source domain. The PCEP protocol allows the Path Computation Client (PCC), e.g. the Network Management System (NMS) or the source PCE, to specify the sequence of domains to be traversed. Such sequence is included within the PCEP Include Route Object (IRO) carried in the PCEP PCReq message.
However, the present inventors have appreciated that the limited amount of resource information typically exchanged among domains through the Border Gateway Protocol (BGP), and the acquisition of multi-domain resource information from BGP databases, has the consequence that a PCE will typically only consider one sequence of domains per network prefix. The Per-Domain and BRPC procedures may then be applied along a non-optimal sequence of domains, thus potentially affecting the overall network performance. In addition, such limitation may completely prevent the path computation subject to domain diversity (e.g. for protection purposes). It is possible to run BRPC over additional routes not advertised by BGP by, for example, setting policies at domains which force a PCE to consider a pre-defined route. However, depending on network conditions, the pre-defined route may offer poor performance.
An aspect of the invention provides a method for use in a multi-domain network, wherein each domain has a route calculation entity which is responsible for computing paths between domains on behalf of clients, the method comprising at a route calculation entity in a first of the domains:
The advertisement messages sent between route calculation entities allow route calculation entities in different domains, or Autonomous Systems, to advertise resource information typically not advertised by existing inter-domain routing protocols such as the Border Gateway Protocol (BGP). The advertisement messages enable effective path computations by the route calculation entity and helps to preserve network stability, scalability and intra-domain confidentiality. The inter-domain resource information comprises at least one of: inter-domain route information indicating a possible route between domains (which can also be called reachability information); and inter-domain Traffic Engineering (TE) information. The Traffic Engineering information is typically used for traffic engineering purposes and can comprise a metric which represents any of: path length, bandwidth, delay, packet loss, jitter. Conventionally, TE information is not advertised outside of a domain by existing protocols. Accordingly, the present method provides a way of advertising this resource/TE information between domains.
An advantage of the method is that it does not add a significant additional message or processing load on the network because the advertisement messages are exchanged by the route calculation entities, and information carried within the advertisement messages is only inspected, and stored, at the route calculation entities. This contrasts with routing protocols in which the content of advertisement messages may be inspected, and stored, at all routers along the path of the advertisement message.
Advantageously, the route calculation entity is a Path Computation Element (PCE), as defined by RFC4655, or a similar entity. The advertisement message can be a Path Computation Element communication Protocol (PCEP) message.
Another aspect of the invention provides a method for use in a multi-domain network, wherein each domain has a route calculation entity which is responsible for computing paths between domains on behalf of clients, the method comprising at a route calculation entity in one of the domains:
Further aspects of the invention provide a route calculation entity which is responsible for computing paths between domains of a multi-domain network on behalf of clients, the route calculation entity configured to perform any of the method steps.
The functionality described here can be implemented as hardware, software, or a combination of these. Accordingly, a further aspect of the present invention provides machine-readable instructions (software) for causing a processor to perform the method. The machine-readable instructions may be stored on an electronic memory device, hard disk, optical disk or other machine-readable storage medium. The machine-readable instructions can be downloaded to a processor via a network connection.
Embodiments of the invention will be described, by way of example only, with reference to the accompanying drawings in which:
Computation Element (PCE) in each AS;
A Path Computation Element (PCE), according to RFC4655, is located in each
Autonomous System AS. The PCE is responsible for path computation, and receives requests for path computations from clients, called Path Computation Clients (PCC) located within the AS. A router within AS B is shown as an example of a PCC. Typically, there is one PCE per AS and one advantageous configuration is to co-locate the PCE with a BGP Route Reflector (RR) for that AS. The PCE collects multi-domain resource information from the RR BGP database(s). PCEs in different Autonomous Systems communicate with each other to establish a route between Autonomous Systems.
In order to understand embodiments of the invention, conventional operation of the network 10 will be described. In the example network of
In embodiments of the present invention, PCEs advertise additional information between Autonomous Systems.
In
PCE 30 comprises a PCE-protocol (PCEP) module 31, a Routing Controller Module 32, a Path Computation module 33 and a database 35. Traffic Engineering
Database (TED) 35 stores traffic engineering information which can be used to compute routes within the AS, and routes between Autonomous Systems. Database 35 can be populated by acquiring information, via an interface 39, from database 45. Database 35 can store information such as topology, bandwidth information (e.g. total bandwidth, available bandwidth), QoS constraints. The database 35 contains both intra and inter-domain routing information. The database 35 is shown schematically in
PCEP module 31 performs the PCE-Protocol. PCEP messages 24 include path computation requests (PCReq) and replies (PCRep) sent via interface 36. These messages 24 are exchanged with PCCs or PCEs in the AS or in other ASs. Requests can be originated either by elements belonging to AS A (e.g. a network node) or by elements belonging to other Autonomous Systems (e.g. a remote PCE). Resource advertisement messages 25 are exchanged with PCEs in other Autonomous Systems via interface 37. Advantageously, PCEP module 31 uses policy information 34 to determine which other Autonomous Systems the PCE is authorised to communicate with.
The Path Computation Module 33 is responsible for path/route computations, i.e. it runs algorithms and heuristics that perform route computation in response to requests 24 received by the PCEP module 31, the path computations using the information stored in the TED 35. The computed path/route is then returned to the PCEP module 31 for returning to the requesting entity.
The Routing Controller Module (RCM) 32 elaborates and stores within the TED 35 the inter-domain routing information received from the PCEP module through advertisement messages 25. In addition, RCM 32 extracts the intra-domain information to be advertised to other domains from the TED 35 and sends it to the PCEP module 31, where it is packaged into a suitable form for transmission. It will be understood that the functions performed by the modules 31, 32, 33 can be implemented by a single processor, or a plurality of processors.
In accordance with embodiments of the invention, the PCE advertises resource information in the form of at least one of: inter-domain resource information comprising, for example, available/reservable bandwidth information for an inter-domain route; inter-domain route/reachability information comprising information about a possible route between domains; aggregated intra-domain resource information.
Additional PCEP messages 25, which will be called PCEP Resource Advertisement (PCRA) messages, carry the additional resource information typically not exchanged between Autonomous Systems through BGP advertisements. Two main categories of resource information can be enclosed within PCRA messages:
Two types of inter-domain resource information will be considered: (i) inter-domain route/reachability information typically not advertised by BGP for scalability reasons, and (ii) traffic engineering information, such as bandwidth information for links to other Autonomous Systems.
As described above, BGP usually propagates a single route between Autonomous Systems so as not to overload border routers with a large amount of reachability information. The PCE advertises reachability information about other routes which are normally discarded by the External BGP protocol. This alternative route information can be advertised in new messages which will be called Route-PCRA (R-PCRA) messages. In particular, the available and stable route information provided by adjacent BGP peers, referred to prefixes belonging to a limited set of authorised domains, and discarded by tiebreaking rules, are exchanged through R-PCRA messages. This does not violate confidentiality requirements since such routes are removed by BGP only for scalability reasons. Two sets of route information can be announced. The first set of route information considers all inter-domain routes with the same BGP AS_PATH length. Considering again the example network of
A second type of inter-domain information that can be advertised by the PCEs is the presence of inter-domain links together with a traffic engineering (TE) parameter, such as their available/reservable bandwidth. This information can be carried in messages which will be called Bandwidth-PCRA (B-PCRA) messages. Bandwidth information is not advertised by the routing protocol BGP. A possible way for the PCE to collect bandwidth information is to listen to OSPF-TE flooding, when the OSPF-TE signalling includes the extensions described in RFC5392. Such extensions allow the advertisement within an AS A of the TE info (including bandwidth) of an inter-domain link between AS A and another domain, say AS B. Within AS A only the bandwidth information of the link from A to B will be advertised. Thus PCE A, by simply listening to the OSPF flooding, will be aware of the bandwidth information of the link from A to B, which will be called XAB. Similarly, PCE B will be aware of just the bandwidth information of the link from B to A, which will be called XBA. Accordingly, through a combination of the OSPF-TE messaging described in RFC5392 and the new B-PCRA messages, AS A and AS B can exchange the bandwidth information that they have obtained from their own AS and thereby become aware of the bandwidth in both inter-AS directions XAB. and XBA. Conventionally, routing protocols such as OSPF-TE advertise bandwidth information only within an AS. The advertised bandwidth information can refer to the amount of reservable bandwidth on inter-domain links that adjacent peer Autonomous Systems agree to make available for remote (and authorised) Autonomous Systems. The advertisement of this kind of information should not affect confidentiality requirements, since it does not disclose intra-domain information or private inter-domain agreements. One option for the B-PCRA messages is to use the same path vector routing structure as for BGP and R-PCRA. For example, considering an R-PCRA message with: Prefix: e; AS_PATH: B, D, E and BW: XB,D,E. The term “BW: XB,C,E” represents the total reservable bandwidth for the end-to-end route between ASB and AS E.
Another option for the B-PCRA messages is to use a link-state routing structure. Each PCE receives, from the PCEs in the adjacent Autonomous Systems, advertisements of remote inter-domain links.
Advantageously, the B-PCRA message carries the following minimum set of information: source AS, destination AS and reservable bandwidth. For scalability reasons, multiple inter-domain links between the same pair of adjacent domains can be advertised as a single link with an available bandwidth equal to the sum of all available link bandwidths. Policies and Time-To-Live (TTL) mechanisms can also be implemented in order to limit the inter-domain link advertisement to a restricted set of domains (e.g., B-PCRA information with TTL=5 will be forwarded to reach all authorised domains in the range of five domains). Mechanisms can be implemented to minimise the number of exchanged B-PCRA messages, such as sending a new message when bandwidth passes (or changes by) certain threshold values and granularities.
Inter-domain bandwidth and, more particularly, reservable bandwidth on inter-domain links that adjacent peer Autonomous Systems agree to make available for remote (and authorised) Autonomous Systems, is considered to be the most useful TE metric that can be advertised. However, it is possible to advertise any other TE metric in addition to, or instead of, bandwidth in the same manner as described above for advertising bandwidth values.
Intra-domain information can be advertised to other domains. Due to confidentiality and scalability reasons, the advertisement of detailed intra-domain resource information is not realistic. However, several topology aggregation methods (e.g., Simple Node, Full Mesh, Star) have been proposed to be effectively applied in multi-domain networks. Example ways of aggregating topology information are described in G. Maier et al, “Multi-Domain Routing Techniques with Topology Aggregation in ASON Networks”, ONDM '08, March 2008.
Another category of advertisement messages advertise the aggregated intra-domain topology information to other domains. The aggregated intra-domain resource information can indicate connections between border nodes of the domain. The aggregated topologies are then utilized to compute both the sequence of domains to be traverse and the border nodes of each traversed domain. This solution could be particularly beneficial to compute optimal multi-domain paths without sending BRPC requests along each possible sequence of domains. It is also useful in cases where the BRPC procedure is not supported. As an example, consider that: in AS C the intra-domain path between the two BRs is 1000 km and comprises 10 nodes; and in AS D the intra-domain path between the two BRs is 20 km and comprises 2 nodes. Thus, if such information is available it is possible to obtain a more precise TE solution, such as selecting the path with lowest end-to-end delay.
The advertised intra-domain information can include a traffic engineering metric such as bandwidth, delay, packet loss or jitter. Multiple metrics can be advertised. Advantageously, the advertised metric is a cumulative value of the measured quantity (bandwidth, delay etc.) within the domain.
For scalability, it is advantageous that the advertisement messages that have been described are used within a restricted set of domains, and are not used throughout the entire Internet. For example, the advertisements can be exchanged between a set of domains with known relationships, like the peering relationships (e.g. a set of 20 domains described in the PCE RFC4657). In terms of message reliability, all PCEP messages exploit TCP protocol, i.e. they do not require additional specific acknowledgment or refresh mechanisms. Finally, in terms of network stability, both R-PCRA and B-PCRA updates only influence new path computations (in particular those referring to high quality traffic that require constraint-based multi-domain path computation) and do not affect the network operations, the forwarding of Best Effort traffic or already established high quality Label Switched Paths (LSP). This is particular important if compared to alternative solutions to provide multi-domain TE capabilities, for example those based on TE extensions to the BGP protocol that potentially affect the overall network stability and scalability.
Finally, the proposed communication between PCEs could be beneficial also in case of network failures. In particular, a PCE notified of network failures affecting resources belonging to the controlled domain, could immediately notify remote and trusted PCEs about the failure. In this way, remote PCEs could become aware of network failures before receiving the related BGP message Update (typically slow). This allows remote PCEs to speed up the re-computation of failed multi-domain paths and avoids that new multi-domain path computations consider the failed resources.
The functional modules and method steps described above may be implemented as electronic hardware, as software modules executed by a processor, or as combinations of both. They may be implemented by, or performed by, a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the described functions.
The invention is not limited to the embodiments described herein, which may be modified or varied without departing from the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
091552182 | Mar 2009 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2009/055111 | 4/28/2009 | WO | 00 | 12/30/2011 |