The present disclosure relates generally to communication networks, and more particularly, to detection of host status in a communications network.
The increasing use of virtualization in networks has enabled a great amount of flexibility in managing servers and workloads. One important aspect of this flexibility is mobility. Detection of host moves and status in conventional systems have a number of drawbacks, including time for detection, limitations of detection based on type of traffic, and high amount of processing resources needed.
Corresponding reference characters indicate corresponding parts throughout the several views of the drawings.
Overview
In one embodiment, a method generally comprises receiving a packet from a host at a first hop router in a network site, the first hop router in communication with a core network and operable to encapsulate packets received from the host for transmission to a remote network site, setting a status for the host in a table at the first hop router as active, starting a timer for the host at the first hop router, transmitting a probe message from the first hop router to the host if a packet is not received at the first hop router from the host before the timer expires, updating the status of the host at the table based on whether a response message is received from the host, and using the host status to detect host migration.
In another embodiment, an apparatus generally comprises a processor for processing a packet received from a host at a first hop router in a network site, setting a status for the host in a table at the first hop router as active, starting a timer for the host at the first hop router, transmitting a probe message from the first hop router to the host if a packet is not received at the first hop router from the host before the timer expires, and updating the status of the host at the table based on whether a response message is received from the host. The apparatus further comprises memory for storing the table. The first hop router is operable to encapsulate packets received from the host for transmission to a remote network site and use said host status to detect host migration.
The following description is presented to enable one of ordinary skill in the art to make and use the embodiments. Descriptions of specific embodiments and applications are provided only as examples, and various modifications will be readily apparent to those skilled in the art. The general principles described herein may be applied to other applications without departing from the scope of the embodiments. Thus, the embodiments are not to be limited to those shown, but are to be accorded the widest scope consistent with the principles and features described herein. For purpose of clarity, details relating to technical material that is known in the technical fields related to the embodiments have not been described in detail.
In a network environment such as data center interconnect (DCI), a first hop router (FHR) should be able to detect host moves, provide a consistent first hop presence, and monitor host status (e.g., aliveness). If the detection is based on ARP (Address Resolution Protocol)/ping, there may be a problem if the host is silent (e.g., no ARP, only traffic transmitted from host).
Locator Identifier Separation Protocol (LISP) is an example of a protocol that uses routing locators and endpoint identifiers to improve the scalability of a routing system. The LISP architecture provides a mechanism to separate out identification and location semantics from the current definition of an IP address. IP address semantics are extended to incorporate a distinction between routing locators (RLOCs) for routing through core networks and endpoint identifiers (EIDs) for identifying network sessions between devices.
A first hop router in a LISP architecture (e.g., LISP xTR (ingress tunnel router (ITR)/egress tunnel router (ETR)), may detect hosts from an IP (Internet Protocol) packet transmitted to an xTR's gateway MAC (Media Access Control) address or from an ARP request. The first hop router may support periodic aliveness checks in order to track the status of hosts, however, there are a number of drawbacks with this approach. For example, the first hop router may be unable to detect multicast traffic and broadcast traffic. Also, the periodic aliveness check may consume high CPU (Central Processing Unit) resources and it may take time for the aliveness check to detect the status of the host.
The embodiments described herein provide a proactive approach to detect host status (e.g., host failure, host location, host migration). In certain embodiments, a table (e.g., binding table, status table) is created on the first hop router for collecting information about active hosts. A probe mechanism may run based on information in the binding table. If a host is active, the first hop router may just monitor and maintain the table, without sending out any probe messages. If a host is idle (e.g., no packet/ARP for a specified interval), the first hop router may send a probe message to the host to check if the host is alive. The embodiments may be suitable for large scale deployments and the flexibility of the proactive probe mechanism may provide an improvement in convergence time in the case of host failure or roaming. Also, the embodiments do not rely on ARP messages to detect a host move and may detect host status in the presence of multicast and broadcast traffic.
Referring now to the drawings, and first to
The network shown in the example of
It is to be understood that LISP is used herein as an example and that other protocols that provide a locator/identifier split may be used, without departing from the scope of the embodiments. Thus, the term “locator identifier separation protocol” as used herein may refer to any protocol that provides a separation between an object identifier and its location.
Network sites 10C and 10D may include any number of endpoints (stations, user devices, clients, client devices) 25. The endpoint 25 may comprise, for example, a personal computer, set-top box, telepresence device, television, cellular phone, tablet, laptop, personal digital assistant, portable computing device, multimedia device, and the like.
Non-LISP network site 10D includes an edge device 16 in communication with routers 27. The network site 10D is not configured as a LISP site, and therefore does not include an xTR. Edge device 16 may be configured to perform proxy xTR (PxTR) functions. The PxTR 16 allows a non-LISP site (e.g., site 10D) to interoperate with a LISP site (e.g., sites 10A, 10B, and 10C).
The example shown in
The edge devices 18 at data centers 10A, 10B may be, for example, a gateway, switch, router, or other network device. The edge devices 18 are referred to herein as first hop routers (FHRs) since they are a first hop between the network site 10A, 10B (LAN/server 20) and the core network 12. The term ‘first hop router’ as used herein may refer to a router, switch, router/switch, gateway, or other network device operable to perform routing or forwarding functions. The edge device 18 may also be referred to as a LISP-VM xTR (router). The LISP-VM router's IP address may be used as the locator (RLOC) for encapsulation of traffic to and from the dynamic EID. For example, the edge device 18 may implement ingress tunnel router and egress tunnel router functions (e.g., operate as xTR). The edge devices 18 are operable to receive packets from site-facing interfaces (e.g., hosts 22) and encapsulate them for transmission to remote LISP sites (e.g., network sites 10A, 10B, 10C) or natively forward the packets to non-LISP sites (e.g., network site 10D). The edge devices 18 are also operable to receive packets from core-facing interfaces (e.g., network 12), decapsulate LISP packets, and deliver them to local EIDs at their network site 10A, 10B.
The network further includes a mapping system comprising a plurality of mapping databases (servers) 24. In one embodiment, the mapping system comprises a map server/map resolver (MS/MR). The mapping system may include any number of map servers, map resolvers, or map databases distributed throughout the network. For example, the mapping system may comprise any number of physical or virtual devices located in one or more networks and may include one or more databases stored on one or more network devices. In one example, the map server (MS) implements the mapping database distribution by accepting registration requests from its client ETRs, aggregating the EID prefixes, and advertising the aggregated prefixes. The map resolver (MR) accepts encapsulated map-request messages sent by ITRs, decapsulates them, and then forwards them toward the ETRs responsible for the EIDs being requested. Each ITR maintains a cache of the mapping database entries that it needs at a particular time. It is to be understood that the mapping system described herein is only an example and that other mapping systems and databases 24 may be used without departing from the scope of the embodiments.
As described in detail below, one or more of the first hop routers 18 may comprise a host detection module 26 operable to maintain a status (binding) table 28. The table 28 is used to collect information about active hosts 22. In certain embodiments, the host detection module 26 comprises a proactive probe mechanism that runs based on the binding table 28. If the host 22 is active, the first hop router 18 monitors and maintains the table 28 and no probe messages are transmitted. If the host 22 is idle, the first hop router 18 may send a probe message to the host 22 to check whether the host is alive.
The host detection module 26 may be used to detect host moves and monitor host aliveness. For example, any IP addressable device (e.g., VM 22) may move (roam) from its subnet to a different subnet or to an extension of its subnet in a different location (e.g., remote data center), while keeping its original IP address. The embodiments described herein allow for detection of host migration by the first hop router (e.g., xTR) 18. When a move is detected, the mappings between EIDs and RLOCs are updated by the new xTR. By updating the RLOC-to-EID mappings, traffic is redirected to the new location.
It is to be understood that the network shown in
Memory 34 may be a volatile memory or non-volatile storage, which stores various applications, operating systems, modules, and data for execution and use by the processor 32. Memory 34 may include, for example, the status (binding) table 28, or other suitable data structure for maintaining status information for active hosts. Host detection module 26 (e.g., code, logic, software, etc.) may also be stored in memory 34. The network device 30 may include any number of memory components.
Logic may be encoded in one or more tangible media for execution by the processor 32. For example, the processor 32 may execute codes stored in a computer-readable medium such as memory 34. The computer-readable medium may be, for example, electronic (e.g., RAM (random access memory), ROM (read-only memory), EPROM (erasable programmable read-only memory)), magnetic, optical (e.g., CD, DVD), electromagnetic, semiconductor technology, or any other suitable medium. In one example, the computer-readable medium comprises a non-transitory computer-readable medium. The network device 30 may include any number of processors 32.
The network interfaces 36 may comprise any number of interfaces (linecards, ports) for receiving data or transmitting data to other devices. The network interface may include, for example, an Ethernet interface for connection to a computer or network.
It is to be understood that the network device 30 shown in
It is to be understood that the process shown in
Table I illustrates an example of the status table 28 stored at the first hop router 18:
In the example shown in Table I, the binding table includes a host identifier (e.g., VM1, VM2, VM3, . . . ), IP address for the host, interface at which packets are received from the host, status of the host, up time, and last packet time. The status of the host may be, for example, active (packet or response to probe message received), down (no packet or response to probe message received), or probe (no packet received within timer interval and probe message transmitted). The up time indicates how long the host has been active. The last packet time indicates when a last packet was received from the host.
It is to be understood that Table I shown and described above is only an example and that other data structures comprising additional, less, or different entries may be used without departing from the scope of the embodiments.
Although the method and apparatus have been described in accordance with the embodiments shown, one of ordinary skill in the art will readily recognize that there could be variations made without departing from the scope of the embodiments. Accordingly, it is intended that all matter contained in the above description and shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.