1. Field of the Invention
The invention generally relates to security and traffic management in enterprise networks. More particularly, aspects of the invention are directed to managing access control lists and traffic flow in computer networks.
2. Description of Related Art
An Access Control List (“ACL”) is a rule-based packet classifier. It plays an essential role in network devices for supporting various services such as firewalls, Virtual Private Networks (“VPNs”) and Quality of Service (“QoS”). ACLs are de facto order-dependent and multi-dimensional. Such properties have many adverse effects. For instance, conflicts may arise that impede security compliance analysis. And these properties make ACLs highly sensitive to change.
ACLs are one of the most important security features in managing access control policies in large scale enterprise networks. An ACL contains a list of entries that define matching criteria inside the packet header. Under the first-matching rule, each ACL is typically assumed to be a linear list of entries executed from the top down. In conventional operation, the action corresponding to the first entry that matches the condition takes effect. If no match is found, the router drops the packet (implicit deny). When two or more entries match a given criterion, the permit or deny classification of the first matching entry takes effect, whereas subsequent entries become irrelevant (redundant).
The presence of no-effect entries further muddles the ability to comprehend the true semantic meaning of long ACLs, making ACL maintenance extremely difficult. Constant improvement in hardware/software capacity allows routers to handle more traffic flows, giving rise to an increased ACL size. In addition, fine-grained control of traffic demands increased expressiveness of the ACL language. This, in turn, further complicates the ability to comprehend the meaning of an ACL in total scope.
Depending on ACL information maintained by the firewall 18, traffic flow may be permitted or denied. As shown, traffic may be permitted between the user computer 12 and the computer 24c coupled to second interface 26 as shown by arrow 28. In contrast, traffic from the user computer 12 to the computer 20a may be blocked by the firewall 18, as shown by the dashed arrow 30.
Resembling an if-then statement in the C programming language, the generic syntax of an ACL entry is typically expressed in the form “if condition then action”. The condition may specify source, destination IP address range, protocol and port ranges. The action is binary, either permit or deny. While seemingly straightforward, in practice ACLs can be long, complex and error-prone. Furthermore, there may be hundreds or thousands of ACL entries implemented by multiple routers in a given network.
The complexity of ACLs is reflected in the growing demand for fine granular control of network traffic in the context of network security management and QoS requirements. Due to the order dependency, the intended meaning of every individual ACL entry can be altered or erased with removal of existing entries or addition of new entries. Such an excessive sensitivity to semantics of an ACL due to changes makes it extremely hard to comprehend the meaning of the ACL in total scope.
One area of particular interest is priority-based ACL implementations. In such implementations, each entry in a priority-based ACL will be assigned a priority. The priority value will be used to break a tie if a conflict among entries occurs. Namely, among entries that match an incoming packet, the entry with the highest priority takes effect. A priority-based ACL is a generalization of a commonly-used ACL. It is flexible and adaptive in handling various QoS and security requirements.
Due to the practical significance in a large-scale network security management, the impact of ACLs has been an extensive research topic for many years. One type of method to address the ACL problem is to exploit fruitful theoretical results from the well-known Klee's measure problem. This is a computational geometry problem that is concerned with the efficiency of computing the measure of a union of multidimensional rectangular ranges. Klee provided an algorithm for computing the length of a union of intervals in one dimensional space and showed that time complexity of this algorithm is O(n log n).
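By way of illustration only, the one-dimensional measure problem described above can be sketched with a sort-and-sweep routine. This is a minimal sketch, not a reproduction of Klee's original presentation; the function name union_length and the representation of an interval as a (lo, hi) pair of real endpoints are assumptions made for this example. The sort dominates the running time, which gives the O(n log n) bound.

```python
def union_length(intervals):
    """Total length covered by a list of intervals given as (lo, hi) pairs.

    Hypothetical helper for illustration: sorts by left endpoint, then
    sweeps once, merging intervals that touch or overlap.
    """
    total = 0
    cur_lo = cur_hi = None
    for lo, hi in sorted(intervals):          # O(n log n) sort dominates
        if cur_hi is None or lo > cur_hi:     # gap: close the current run
            if cur_hi is not None:
                total += cur_hi - cur_lo
            cur_lo, cur_hi = lo, hi
        else:                                 # overlap: extend the current run
            cur_hi = max(cur_hi, hi)
    if cur_hi is not None:
        total += cur_hi - cur_lo
    return total
```

For instance, the intervals (1, 5), (3, 8) and (10, 12) cover a total length of 9, since the first two merge into (1, 8).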
It was subsequently shown by Fredman and Weide that Ω(n log n) is optimal in the linear decision tree model. Bentley considered the natural extension to d-dimensional cases, and showed that O(n log n) is also optimal for two dimensions (i.e., d=2). For d>2, the complexity generalizes to an upper bound of O(n^(d-1) log n). Overmars & Yap exploited the notion of trellis rectangles and used a generalization of the k-d tree to partition the plane into a collection of trellises. They proved that the upper bound of time complexity for computing the Klee's measure of n rectangles in the d-dimensional space is O(n^(d/2) log n).
Built on theoretical results from Overmars & Yap, Eppstein & Muthukrishnan proposed an algorithm based on the k-d tree for detecting conflicts in two-dimensional priority-based packet filters. A priority-based conflict refers to the presence of two filters with the same priority level and different actions on the same packet. The computational complexity of the Eppstein & Muthukrishnan algorithm for determining whether a rule set contains any conflicts is O(n^(3/2)), where n is the size of the rule set. This, however, is restricted to two-dimensional packet classification and filter conflict detection problems.
Other work relates to routing performance in handling traffic, focusing primarily on designing data structures that support efficient packet classification while minimizing computational resource utilization in dynamic and static environments. For instance, a scheme has been proposed that performs a binary search on a prefix-length structured hash table. Others have given a detailed review of data structures for one-dimensional packet classification in routing tables, focused on longest-prefix matching and most-specific range matching tie breaker data structures.
A refined tie-breaker data structure has been proposed to support two-dimensional packet classification. A memory-efficient B-tree for one-dimensional packet classification has also been proposed. A variant of red-black tree data structures has been proposed for supporting three operations of longest-matching prefix-tables in O(n) where n is the number of (one-dimensional) entries (rules). Another approach only detects conflicts in ACL entries using a framework limited to two dimensional space.
Due to the dimension-induced complexity in ACLs, such approaches are rudimentary solutions and are often ineffective in addressing fundamental issues in ACLs.
Systems and methods which analyze and manage access control list information are provided. Aspects of the invention focus on partial redundancy qualification as well as redundancy detection and resolution.
In one embodiment, a method of handling redundant entries in an access control list usable in a computer network is provided. The method comprises obtaining an access control list and storing it in memory, the access control list including a plurality of entries; converting the plurality of entries from an order-dependent format into one or more order-free equivalents; storing the order-free equivalents in the memory; performing redundancy detection on the order-free equivalents to identify any redundant entries in the access control list; removing the redundant entries to create a modified access control list for use by a firewall of the computer network; and quantifying an extent of partial order dependence of each entry in the access control list.
In one alternative, performing redundancy detection includes: determining a volume of a given one of the plurality of entries in the access control list; identifying a spinoff corresponding to the given entry; determining a volume of a selected order-free equivalent corresponding to the spinoff; and updating the volume of the corresponding given entry in the access control list using the volume of the selected order-free equivalent. In this case, performing redundancy detection desirably includes determining a degree of alteration for each access control list entry. In one example, determining the degree of alteration includes determining whether the spinoff is equal to zero, and if so, identifying the given entry as redundant. And in another example, if the spinoff is not equal to zero, then updating the volume of the selected order-free equivalent corresponding to the spinoff.
In another alternative, performing redundancy detection includes determining a scope contraction ratio for each access control list entry.
In a further alternative, performing redundancy detection on the order-free equivalents to identify any redundant entries comprises determining whether a selected entry produces any spinoff entries after translation to a corresponding order-free equivalent; and if no spinoff entries are produced, removing the selected entry from the access control list to form the modified access control list.
In yet another alternative, the method further comprises the firewall executing the modified access control list to permit or deny access to network resources in accordance with the entries of the modified access control list.
In another embodiment, a computer-readable medium is provided. The medium has instructions stored thereon. The instructions, when executed by a processor, cause the processor to perform a method of handling redundant entries in an access control list usable in a computer network. The method comprises obtaining an access control list and storing it in memory, the access control list including a plurality of entries; converting the plurality of entries from an order-dependent format into one or more order-free equivalents; storing the order-free equivalents in the memory; performing redundancy detection on the order-free equivalents to identify any redundant entries in the access control list; removing the redundant entries to create a modified access control list for use by a firewall of the computer network; and quantifying an extent of partial order dependence of each entry in the access control list.
In one alternative, performing the redundancy detection includes determining a volume of a given one of the plurality of entries in the access control list; identifying a spinoff corresponding to the given entry; determining a volume of a selected order-free equivalent corresponding to the spinoff; and updating the volume of the corresponding given entry in the access control list using the volume of the selected order-free equivalent.
In a further embodiment, an apparatus is provided for handling redundant entries in an access control list usable in a computer network. The apparatus comprises memory for storing information regarding an access control list and a processor means. The access control list includes a plurality of entries. The processor means is configured for converting the plurality of entries from an order-dependent format into one or more order-free equivalents, storing the order-free equivalents in the memory, performing redundancy detection on the order-free equivalents to identify any redundant entries in the access control list, removing the redundant entries to create a modified access control list for use by a firewall of the computer network, and quantifying an extent of partial order dependence of each entry in the access control list.
In one alternative, the processor means performs the redundancy detection by determining a volume of a given one of the plurality of entries in the access control list, identifying a spinoff corresponding to the given entry, determining a volume of a selected order-free equivalent corresponding to the spinoff, and updating the volume of the corresponding given entry in the access control list using the volume of the selected order-free equivalent. In this case, the processor means may perform the redundancy detection by further determining a degree of alteration for each access control list entry. Optionally, the processor means determines the degree of alteration by determining whether the spinoff is equal to zero, and if so, identifying the given entry as redundant. In another example, if the spinoff is not equal to zero, then the processor means updates the volume of the selected order-free equivalent corresponding to the spinoff.
In another alternative, the processor means performs the redundancy detection by further determining a scope contraction ratio for each access control list entry.
And in yet another alternative, the processor means performs the redundancy detection on the order-free equivalents to identify any redundant entries by determining whether a selected entry produces any spinoff entries after translation to a corresponding order-free equivalent; and if no spinoff entries are produced, removing the selected entry from the access control list to form the modified access control list.
a)-(d) illustrate possible dependency situations in accordance with aspects of the invention.
a)-(b) illustrate a one-dimensional scenario for order-dependent and order-free ACLs in accordance with aspects of the invention.
a)-(f) illustrate d-box two-dimensional examples in accordance with aspects of the invention.
a)-(f) illustrate 2-d box partitions in accordance with aspects of the present invention.
a)-(b) illustrate the scope of entries in an ACL in accordance with aspects of the present invention.
a)-(b) illustrate ACL volume distribution in accordance with aspects of the present invention.
a)-(b) illustrate spinoff entry distributions in accordance with aspects of the invention.
a)-(b) illustrate hash difference measures in accordance with aspects of the present invention.
Aspects, features and advantages of the invention will be appreciated when considered with reference to the following description of preferred embodiments and accompanying figures. The same reference numbers in different drawings may identify the same or similar elements. Furthermore, the following description is not limiting; the scope of the invention is defined by the appended claims and equivalents.
One aspect of the invention identifies an order-free equivalent for an order-dependent ACL. As used herein, the term “ordering” is generic, and is applicable to both the first-matching rule in commonly-used ACLs as well as priority-based ACLs. A theoretical framework has been developed that allows one to construct an order-free equivalent by recursively gluing together the projected results on each involved dimension, thereby overcoming inherent dimension-induced difficulty in ACL problems. This framework lays a basis for solving some fundamental problems in ACLs, including automatic detection/resolution of redundant entries in an ACL, determining whether given ACLs are semantic equivalents, quantitative evaluation of the impact of order-dependency on each entry in an ACL, and quantitative metrics for quantifying the partial redundancy of each rule in an ACL. In addition, the framework according to aspects of the invention can handle not only commonly-used ACLs (e.g., first-matching ACLs) but also priority-based ACLs.
In this section some related notions and definitions are introduced that will be used later on. The initial discussion focuses on standard ACLs (e.g., having a one dimensional range). While the focus in this section is on one dimensional range issues stemming from standard ACLs, it serves as an important step for understanding and handling the multidimensional problem in extended ACLs. The terms “order-independent” and “order-free” are used interchangeably herein. The terms “entry” and “rule” are also used interchangeably herein.
The notion of a “d-box” is first considered for simplified problem formulation. Definition 1: Let I1, . . . , Id be the intervals in the 1st, . . . , dth dimensions. A d-box, denoted by Bd, is defined as the Cartesian product of I1, . . . , Id, denoted as I1× . . . ×Id or [I1, . . . , Id]. Let Ii(Bd)=Ii denote the ith interval of Bd.
A d-box is also referred to as a d-dimensional rectangle. It can be seen that a 1-box is an interval (range) in one-dimensional space, and a 2-box is a rectangle in two-dimensional space that is formed by the Cartesian product of two 1-boxes from two orthogonal dimensions. Standard ACL syntax is employed to see how to map an ACL entry into one-dimensional range. It includes an action (permit or deny) and other traffic-related information. Thus: access-list list-number {permit|deny}{host|source wildcard|any}
A standard ACL allows one to permit or deny traffic from source IP addresses specified by a pair of source IP address and source wildcard. Note that the access list number of a standard ACL ranges from 1 to 99, and is unique for a given device/router. A mapping between ACL terminology and range dimension ordering is given in the table below. For instance, the source address range is defined as I1, the source port is defined as I2 etc.
A standard ACL entry can be formulated as I1→S, where I1=[aL,aR] is a closed interval denoting the source address range and S denotes a classification action on the source address range (S=1/0 denotes the classification permit/deny action). Here, aL=aR means there is a single IP address.
A dotted decimal format IP address represented as d1.d2.d3.d4 can be uniquely converted to an integer form as
d1·2^24 + d2·2^16 + d3·2^8 + d4
and vice versa. Let ai be a standard ACL entry written as ai=(I1,S)i, where the subscript i denotes the ith entry in the original order in an ACL. Its source address range and traffic classification are denoted by I(ai) and S(ai), respectively. The intersection of ai and aj is defined as the one-dimensional range intersection I1(ai)∩I1(aj). Using the foregoing terminology, several important concepts are introduced as follows.
Definition 2: A standard ACL is said to be order independent if and only if (“iff”) the intersection of any two of its entries ai, aj, i≠j, is empty, i.e., I1(ai)∩I1(aj)=∅. A standard ACL is said to be order dependent iff there exists a non-empty intersection of ACL entries ai, aj, i≠j, i.e., I1(ai)∩I1(aj)≠∅.
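As a non-limiting illustration, the mapping of a source address and wildcard to a closed integer interval, together with the pairwise test of Definition 2, may be sketched as follows. The function names ip_to_int, entry_interval and is_order_independent are hypothetical, and the sketch assumes a contiguous wildcard mask (e.g., 0.0.0.255) so that each entry maps to a single interval.

```python
def ip_to_int(dotted):
    """Convert a dotted decimal address d1.d2.d3.d4 to its integer form."""
    d1, d2, d3, d4 = (int(part) for part in dotted.split("."))
    return (d1 << 24) | (d2 << 16) | (d3 << 8) | d4


def entry_interval(source, wildcard):
    """Closed interval (aL, aR) for a source/wildcard pair.

    Assumption: the wildcard is contiguous, so the match set is one range.
    """
    w = ip_to_int(wildcard)
    lo = ip_to_int(source) & ~w & 0xFFFFFFFF   # clear the wildcarded bits
    return (lo, lo | w)


def is_order_independent(intervals):
    """Definition 2: True iff no two intervals intersect.

    After sorting by left endpoint it suffices to compare neighbours.
    """
    ordered = sorted(intervals)
    return all(prev[1] < nxt[0] for prev, nxt in zip(ordered, ordered[1:]))
```

Under these assumptions, a host entry yields an interval with aL=aR, and two entries sharing even a single address make the list order dependent.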
Definition 3: The interval difference of I(ai) and I(aj), denoted by I(ai)⊖I(aj), is an interval set such that an interval x⊂I(ai)⊖I(aj) means x∩I(ai)=x and x∩I(aj)=∅. The interval union of I(ai) and I(aj) is denoted by I(ai)⊕I(aj). For example, [1,10]⊖[4,8]={[1,3],[9,10]}, [1,10]⊖[4,15]={[1,3]}, and [1,10]⊕[14,15]={[1,10],[14,15]}.
Definition 4: Two standard ACLs A and B are said to be equivalent iff A⊆B and B⊆A. According to definition 4, any given traffic from an arbitrary source address range that is denied or permitted by A will likewise be denied or permitted by B, and vice versa.
Thus, an ACL with n entries may be viewed as an ordered n-tuple as (a1,a2, . . . , an), where ai refers to the ith entry in the original order. Let ajπai denote that ai precedes aj in this ordered n-tuple, i.e., i<j. Due to the order dependency in ACLs, if ajπai, then the scope of aj will be altered by ai when I(ai) and I(aj) intersect.
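The interval difference of Definition 3 can be sketched, purely for illustration, over closed integer intervals represented as (left, right) pairs. The name interval_diff is hypothetical, and the integer-endpoint convention (so that [1,10]⊖[4,8] leaves [1,3] and [9,10]) is taken from the examples in the definition.

```python
def interval_diff(a, b):
    """I(a) ⊖ I(b) for closed integer intervals; returns 0, 1 or 2 pieces."""
    (al, ar), (bl, br) = a, b
    if br < al or bl > ar:          # disjoint: a survives intact
        return [a]
    pieces = []
    if al < bl:                     # part of a to the left of b
        pieces.append((al, bl - 1))
    if br < ar:                     # part of a to the right of b
        pieces.append((br + 1, ar))
    return pieces
```

The sketch reproduces the two difference examples given in Definition 3 and returns an empty list when the first interval is entirely covered by the second.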
a)-(d) show a four-fold intersection classification 100 of entry pair ai and ai+1.
(I(ai)∩I(ai+1)≠I(ai+1)) ∧ (I(ai)∩I(ai+1)=I(ai)) (1)
I(ai)∩I(ai+1)=I(ai) (2)
(I(ai)∩I(aj)≠∅) ∧ (I(ai)∩I(aj)≠I(ai)) ∧ (I(ai)≠I(aj)) (3)
I(ai)∩I(ai+1)=∅ (4)
The contain relation in
A similar analysis also can be applied to the case in which S(ai)=S(ai+1). For instance, the containment relation, as shown in
For easy algebraic manipulation, one may define ∩I(ai,ai+1)≡I(ai) ∩I(ai+1). The relations between ai and ai+1 (ai+1πai) in
The notion of a “spinoff interval” is defined as follows. Definition 5: Let (I1,I2) be an ordered pair of intervals, V1(I1,I2)≡I2⊖∩(I1,I2) be a spinoff interval set of I′2, where the subscript refers to one-dimensional space. Let |V1(I1,I2)| be the number of spinoff intervals (1-boxes) from interval I2. With the different range assignments of ai=(I,S)i and ai+1=(I,S)i+1, the four-fold intersection classification of ai and ai+1, along with the spinoff interval(s) V1(I(ai),I(ai+1)) of I(ai+1), is illustrated below:
Corollary 1. For any given pair of ai and aj with ajπai, V1(I(ai),I(aj)) is unique, and max |V1(I(ai),I(aj))|=2.
Spin-off entries of ai+1 with respect to ai are defined as follows:
(V1(I(ai),I(ai+1)),S(ai+1))≡{(I,S(ai+1))|I∈V1(I(ai),I(ai+1))} (5)
The notion of spinoff interval of an ordered pair of intervals can be extended immediately. Definition 6: Let (I1, . . . , In) be an ordered n-tuple of intervals, then a spinoff interval set of Ii with respect to the ordered n-tuple of intervals, denoted by I′i(I1, . . . , Ii), is:
This equation can in turn be computed recursively as follows:
ν1=V1(I1,Ii),ν2=V1(I2,ν1), . . . , νi−1=V1(Ii−1,νi−2) (7)
where 2≦i≦n, I′i(I1, . . . , Ii)=νi−1.
It follows from equation (7) that a step-by-step computation is required to determine spinoff interval sets I′i with respect to an ordered n-tuple of intervals. Spin-off entries of aj may be further defined with respect to a1, . . . , aj-1 as follows:
(I′j(I(a1), . . . , I(aj-1),I(aj)),S(aj))≡{(I,S(aj))|I∈I′j(I(a1), . . . , I(aj-1),I(aj))} (8)
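The step-by-step computation of equation (7) may be sketched as follows, by way of example only. The names v1 and spinoff_set are hypothetical, indices are zero-based, and the sketch assumes that applying V1 to an intermediate interval set means applying it to each interval in that set in turn.

```python
def v1(first, second):
    """Spinoff intervals V1(first, second): the part of `second` not covered by `first`."""
    (fl, fr), (sl, sr) = first, second
    if fr < sl or fl > sr:          # disjoint: `second` is unchanged
        return [second]
    out = []
    if sl < fl:
        out.append((sl, fl - 1))
    if fr < sr:
        out.append((fr + 1, sr))
    return out


def spinoff_set(intervals, i):
    """Spinoff interval set of intervals[i] with respect to intervals[0..i-1]."""
    nu = [intervals[i]]
    for earlier in intervals[:i]:   # nu_1, nu_2, ..., nu_(i-1) in turn
        nu = [piece for cur in nu for piece in v1(earlier, cur)]
    return nu
```

For the hypothetical ordered intervals [4,8], [1,5], [1,10], the third interval is first cut to {[1,3],[9,10]} by [4,8], and then [1,3] is removed by [1,5], leaving {[9,10]}.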
An example of an ACL with five entries, together with spinoff entries, is given in the table below.
Thus, for a given ai in ACL A, it can be seen that its spinoff interval set I′(ai) is a subset of its original interval I(ai). The extent of scope contraction reflects the impact of order dependency (or partial redundancy) of ai on a1, . . . , ai−1. Such a dependency impact is defined as follows.
Definition 7: Let ai be the ith entry in ACL A. The extent of order dependency of ai on a1, . . . , ai−1, denoted by D(A,ai), is defined as the ratio of ai's spinoff scope length to its original scope length. This is represented as:
D(A,ai) = (total length of I′i(I(a1), . . . , I(ai))) / (length of I(ai))
This equation is a measure of scope contraction of ai due to its order dependency on a1, . . . , ai−1. D(A,ai)=1 means that ai has no overlap with a1, . . . , ai−1, and D(A,ai)=0 indicates that the scope of ai is masked by a1, . . . , ai−1, and thus ai is redundant. Between these two extreme scenarios, D(A,ai) lies in the open interval (0,1). In view of this, a definition of ACL redundancy may be expressed as follows.
Definition 8: Let ak be the kth entry in an ACL A. Then ak is said to be redundant iff D(A,ak)=0. It should be noted that a d-box in the context of standard ACL is a one-dimensional interval, and in extended ACLs it is a 5-dimensional rectangle. This definition goes beyond the pairwise redundant scenario and reflects the most likely scenario that ak is redundant because it could be jointly masked by more than one aj (akπaj). For example, a4 in the “ACL Entries” table is jointly masked by a triple of a1,a2,a3. This makes the ACL redundancy detection and resolution issue both challenging and interesting.
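Definitions 7 and 8 can be illustrated with the following sketch. The names dependency and redundant_entries are hypothetical, entries are represented as ((left, right), action) pairs with zero-based indices, and scope length is assumed to be the number of integer addresses in an interval. The sample ACL is invented for this example and is not the table referenced above.

```python
from fractions import Fraction


def _minus(piece, mask):
    """Parts of closed integer interval `piece` not covered by `mask`."""
    (lo, hi), (mlo, mhi) = piece, mask
    if mhi < lo or mlo > hi:
        return [piece]
    out = []
    if lo < mlo:
        out.append((lo, mlo - 1))
    if mhi < hi:
        out.append((mhi + 1, hi))
    return out


def dependency(entries, i):
    """D(A, a_i): spinoff scope length divided by original scope length."""
    lo, hi = entries[i][0]
    pieces = [(lo, hi)]
    for mask, _action in entries[:i]:          # every preceding entry masks a_i
        pieces = [p for cur in pieces for p in _minus(cur, mask)]
    remaining = sum(r - l + 1 for l, r in pieces)
    return Fraction(remaining, hi - lo + 1)


def redundant_entries(entries):
    """Indices of entries with D(A, a_i) == 0 (Definition 8)."""
    return [i for i in range(len(entries)) if dependency(entries, i) == 0]


# Hypothetical ACL: the fourth entry is jointly masked by the first three.
acl = [((1, 4), 1), ((5, 8), 0), ((9, 12), 1), ((2, 11), 0), ((1, 20), 1)]
```

In this invented example no single preceding entry covers [2,11], yet the first three together do, so only the fourth entry is reported as redundant, while the last entry keeps 8 of its 20 addresses.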
Lemma 1. Let ai and ai+1 be a pair of order dependent standard ACL entries. Then ai and the spinoff entries from ai+1, (V1(I(ai),I(ai+1)),S(ai+1)), are order-free, and their union is equivalent to the order-dependent pair (ai,ai+1).
Thus, in accordance with one aspect of the invention, for an order-dependent standard ACL, there is a unique order-free equivalent ACL. This may be proven as follows. Let A be an order-dependent ACL (a1,a2, . . . , an), and B its order-free equivalent, which is initially set to empty. Construction begins with removing an from A and putting it as b1 into B. Then, for each entry ai removed from A, one may substitute every entry bk εB with bk's spinoff entries (V1(I(ai),I(bk)),S(bk)), and then put ai into B. This process is continued until A is empty. Lemma 1 and Corollary 1 set forth above ensure that B contains a unique order-free equivalent. A process 200 for converting an order dependent ACL into an order free equivalent is set forth in
According to process 200, an entry higher in an ACL takes precedence over an entry which is lower. To reflect such a precedence ordering, a stack/queue (e.g., a LIFO queue) is created in which all the entries are pushed in sequentially with the highest one first. Then one entry is popped at a time. Because the latest popped entry has higher precedence ordering over all entries that have been popped so far, it is put in the order-free ACL being constructed as it is. All the other entries in the temporary order-free ACL constructed so far are checked for any overlap with the latest one. If there is any overlap, the order-free entries constructed in previous steps are modified so that the spinoff entries have no overlap with the latest one, while at the same time maintaining the semantic equivalence.
Process 200 is explained as follows. The process is initialized at block 202, where a set of standard ACL entries (a1, a2, . . . , an) are obtained, e.g., from a router's ACL list. A pair of local stacks or queues, e.g., a first queue “F” and a second queue “T” are initialized as shown at block 204. At block 206, the first queue F is populated with ACL entries ai. This is repeated for all n entries.
As shown at block 208, the topmost entry a is obtained from the first queue F. Then, at block 210, a's relationship is checked with a first entry b in memory Q. In one example, memory Q is a LIFO stack. All rules in Q are order-free with respect to the original rules processed so far. All rules in F are intact and in the original order. Each (original) rule in F (popped out in FILO fashion) needs to be compared with each rule in Q. If a rule popped out from F overlaps with a rule in Q, then the scope of the rule in Q is modified so that it no longer overlaps with the rule from F, and the modified rule is reinserted into Q. Since rules in F precede rules in Q, when a rule is popped out from F, it is checked against all rules in Q, and the scope of those rules is modified wherever overlap occurs. After this check is completed, the popped rule is inserted into Q. The process continues until F becomes empty, at which point Q contains the order-free rules (equivalents).
As shown in block 212, the process evaluates whether a overlaps b, contains b, is disjoint from b, or encloses b. For instance, does ai enclose ai+1 such as is shown in FIG. 2C? If a encloses b, this signifies that b is redundant. In this case, the process proceeds to block 214 where b is flagged as redundant. If not, meaning that a overlaps, contains or is disjoint from b, then the process proceeds to block 216. Here, one or more spinoffs of b are generated. For the case where the queue T is a LIFO queue, the spinoff may be created and put into T as follows: T.put((V1(I(a),I(b)),S(b))). Then at block 218 these spinoffs are added to the second queue T.
The process then proceeds to block 220. Here, if the memory Q is not empty, e.g., one or more entries remain in a LIFO stack, the process returns to block 210, where a is evaluated against the next entry b. Otherwise, the process proceeds to block 222.
Here, if the first queue F is not empty, e.g., one or more a entries remain in a LIFO stack, then the process returns to block 208, where the next most recent entry a in the first queue F is obtained. Otherwise, the process proceeds to block 224. Here, any intermediate entries that are in the second queue T are transferred into memory Q. For instance, if second queue T is implemented as a stack-type storage memory, each entry is popped from the stack and placed in the memory Q, which may also be a stack-type memory. This is done until the second queue T is empty.
Then, as shown in block 226, entry a is added from first queue F into memory Q. Each entry preferably represents a single rule of an ACL. At block 228, optimization is performed to minimize the number of order-free entries. In one example, all entries may be sorted by the left endpoint of the interval in Q (O(n log n) in running time). Adjacent entries having the same classification status may be merged as part of the minimization process. For instance, two entries ai=(I1,S)i and aj=(I1,S)j are said to be adjacent iff (aL)i=(aR)j+1 or (aL)j=(aR)i+1. The complexity of the merging process is known to be linear since Lemma 1 ensures that all (either intermediate or final) entries in Q are disjoint. The overall translation process can be carried out in O(n^2), where n is the ACL size. Then, as shown in block 230, the results from Q (the order-free equivalents) may be provided, e.g., to a user via a graphical user interface or stored electronically for later analysis. Then the process ends as shown at block 232.
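The translation described above may be sketched, for illustration only, using plain lists rather than the queues F and T and the memory Q of process 200. The names to_order_free, merge_adjacent and first_match are hypothetical; entries are ((left, right), action) pairs over integer addresses, and an unmatched address is assumed to fall through to the implicit deny (action 0).

```python
def _minus(piece, mask):
    """Parts of closed integer interval `piece` not covered by `mask`."""
    (lo, hi), (mlo, mhi) = piece, mask
    if mhi < lo or mlo > hi:
        return [piece]
    out = []
    if lo < mlo:
        out.append((lo, mlo - 1))
    if mhi < hi:
        out.append((mhi + 1, hi))
    return out


def merge_adjacent(entries):
    """Sort by left endpoint and merge adjacent entries with the same action."""
    merged = []
    for (lo, hi), action in sorted(entries):
        if merged and merged[-1][1] == action and merged[-1][0][1] + 1 == lo:
            merged[-1] = ((merged[-1][0][0], hi), action)
        else:
            merged.append(((lo, hi), action))
    return merged


def to_order_free(acl):
    """Convert a first-match ACL into disjoint (order-free) entries."""
    order_free = []
    for interval, action in reversed(acl):      # lowest-precedence entry first
        trimmed = [(piece, b_action)
                   for b_interval, b_action in order_free
                   for piece in _minus(b_interval, interval)]
        trimmed.append((interval, action))      # the higher entry is kept as is
        order_free = trimmed
    return merge_adjacent(order_free)


def first_match(entries, address):
    """Action of the first entry containing `address`; implicit deny otherwise."""
    for (lo, hi), action in entries:
        if lo <= address <= hi:
            return action
    return 0


acl = [((1, 5), 1), ((3, 9), 1), ((1, 20), 0)]  # hypothetical sample ACL
flat = to_order_free(acl)
```

Because the resulting entries are disjoint, they classify every address identically regardless of the order in which they are listed, which is the property the conversion is meant to establish.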
In view of the above, one question of interest is for an order-dependent ACL with size of n, what is the size of the corresponding order-free ACL? The following lemma gives an upper bound on the size of order-free equivalent with respect to the size of order-dependent one.
Lemma 2. Let n be the size of order-dependent standard ACL, and k(n) be the size of its order-free equivalent. In this case, k(n)≦2n.
a)-(b) illustrate a worst-case scenario that maximizes the size of the order-free ACL.
Definition 9. Let n be the size of a non-redundant ACL A, and m be the size of its order-free ACL equivalent. Then the complexity of this ACL is m/n, denoted as ψ(A).
The rationale behind this definition may be explained as follows. The presence of redundant entries is entirely irrelevant to the execution outcome of an ACL, and does not contribute to the complexity of the ACL. Therefore, such a redundancy should be excluded from complexity consideration. An order-free equivalent may be considered as a factored representation of its original ACL. Each order-free equivalent entry is a basic building block. Thus the number of basic building blocks is an attribute of the ACL. The function ψ(A) captures the essence of ACL complexity. It follows from lemma 2 and definition 9 that complexity of a standard ACL is bounded by a constant independent of the ACL's size, ranging between 1 and 2.
The next section focuses on extended ACLs, which can be formulated as a 5-dimensional range problem. Rather than dealing with this geometric problem in an ad hoc fashion, a general framework is provided that is applicable to an arbitrary number of dimensions.
An extended ACL entry according to aspects of the invention may be expressed as I1×I2×I3×I4×I5→S, where S is the image of the Cartesian product of its intervals in 5 orthogonal dimensions, denoted by (I1, I2, I3, I4, I5, S), and S represents a binary action of permit or deny. The ith entry in an extended ACL can be written as ai=(I1, I2, I3, I4, I5, S)i, and Ij(ai), 1≦j≦5, refers to the jth interval of ai.
The following table shows an example of a protocol range and port range number assignments based on an IP protocol specification. Notice that both tcp and udp are a point in the protocol dimension, and the IP protocol is in the full range of the protocol dimension. The port number range is given as [0, 65535].
The following example illustrates how to translate an extended ACL entry to its range representation:
The next table presents a mapping between the different IP addresses, ports and traffic type in the ACL entry and their range representations.
The port number is in the range of [0, 65535]. Thus, the neq 40 on the IP destination port is equivalent to two destination port ranges [0, 39] and [41, 65535]. In a similar manner, the neq 120 on the source IP port is identical to two source port ranges [0,119] and [121, 65535]. Hence, this entry corresponds to four range representations specified below:
(I1, [0,119],I3,I4, [0,39],0)
(I1, [0,119],I3,I4, [41,65535],0)
(I1, [121,65535],I3,I4, [0,39],0)
(I1, [121,65535],I3,I4, [41,65535],0)
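The expansion of port operators into range representations can be sketched as follows. The names port_ranges and expand are hypothetical, only the operators any, eq and neq are handled, and the sketch returns just the (source port range, destination port range) pairs rather than full five-dimensional entries.

```python
from itertools import product

PORT_MIN, PORT_MAX = 0, 65535


def port_ranges(op, value=None):
    """Closed port ranges matched by a port operator ('any', 'eq' or 'neq')."""
    if op == "any":
        return [(PORT_MIN, PORT_MAX)]
    if op == "eq":
        return [(value, value)]
    if op == "neq":                 # everything except `value`: up to two ranges
        ranges = []
        if value > PORT_MIN:
            ranges.append((PORT_MIN, value - 1))
        if value < PORT_MAX:
            ranges.append((value + 1, PORT_MAX))
        return ranges
    raise ValueError(f"unsupported operator: {op}")


def expand(src_op, src_val, dst_op, dst_val):
    """All (source range, destination range) pairs for one extended entry."""
    return list(product(port_ranges(src_op, src_val),
                        port_ranges(dst_op, dst_val)))
```

With neq on both ports, each operator contributes at most two ranges, so a single entry expands to at most four range representations, as in the listing above.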
An extended ACL entry might yield different range representation entries, depending on the port operator and operand applied to the source port and destination port. For an extended ACL with size of n, the size of its range representation is bounded by 4n. The notion of order dependency can be extended to an extended ACL as follows.
Definition 10. An extended ACL A is said to be order independent iff for all ai, aj∈A with i≠j, there exists at least one k, 1≦k≦d, such that the intersection of the kth intervals of ai and aj is empty, i.e., Ik(ai)∩Ik(aj)=∅.
A is order dependent iff there exist entries ai and aj, i≠j, for which the intersections of all k-intervals are non-empty, i.e., Ik(ai)∩Ik(aj)≠∅ for every 1≦k≦d.
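As a rough sketch of Definition 10, the following Python checks whether a set of entries is order independent. It assumes each entry is given as a tuple of closed integer intervals (lo, hi), one per dimension, and ignores the permit/deny action; the function names are illustrative only.

```python
from itertools import combinations


def intervals_overlap(p, q):
    """True when two closed intervals share at least one point."""
    return max(p[0], q[0]) <= min(p[1], q[1])


def entries_overlap(a, b):
    """True when the intervals of a and b intersect in every dimension."""
    return all(intervals_overlap(p, q) for p, q in zip(a, b))


def is_order_independent(entries):
    """Order independent iff no pair of entries overlaps in all dimensions."""
    return not any(entries_overlap(a, b) for a, b in combinations(entries, 2))


disjoint = [((0, 3), (0, 3)), ((4, 7), (0, 3))]
nested = [((4, 7), (4, 7)), ((2, 9), (2, 9))]
print(is_order_independent(disjoint), is_order_independent(nested))
```

The pairwise test takes O(n^2 d) interval comparisons for n entries in d dimensions.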
Given this, a d-box partition may be defined as follows.
Definition 11. A d-box partition of O⊂Rd is a set of nonempty d-boxes denoted {B1d, . . . , Bkd} such that the union of Bi's is equal to O and the intersection of Bid and Bjd is empty for any distinct Bid and Bjd.
Consider the following example with regard to
Entry a1 precedes entry a2, and as a result, the scope of entry a2 is altered (contracted) accordingly. Consequently, this is shown by a multiplicity of partitions. The altered/contracted areas are called spinoffs. The order-dependent effect on entry a2 is the ratio of the sum volume of spinoffs to the original volume. In the case shown in
In one example, a1=([4,7],[4,7],0) (shaded rectangle in
However, there are several difficulties in extended ACLs. For instance, translation of an order-dependent ACL to an order-free equivalent is not unique for a multi-dimensional case because the number of possibilities grows exponentially with the dimensionality. And order independency does not necessarily mean semantic equivalency, as shown by the incomplete partition case of
In order to develop a proper framework and procedure for determining an order-free equivalent partition, the Vk( ) function is introduced as follows. Let Ad=(I1, . . . , Id)a and Bd=(I1, . . . , Id)b be two d-boxes, let Ii(Ad) be the ith closed interval of the Ad box, and let ×(i=1..k) Ii(Ad)≡I1(Ad)× . . . ×Ik(Ad), where 1≦k≦d, be the k-dimensional projection of Ad. It can be seen that ×(i=1..d) Ii(Ad)≡Ad. Let ∩Ii(Ad,Bd)=Ii(Ad)∩Ii(Bd) be the ith intersecting interval of Ad and Bd.
Definition 12. Let Ad and Bd be two d-boxes. The function Vk(Ad,Bd) is defined as:
Vk(Ad,Bd)=(×(i=1..k) Ii(Bd))⊖(×(i=1..k) ∩Ii(Ad,Bd))
Vk(Ad,Bd) is referred to as a k-dimension projection of Bd minus a k-dimension projection of the intersection of Ad and Bd. Vk( ) is clearly not symmetrical. Let |Vk(Ad,Bd)| denote the volume of Vk(Ad,Bd), which can be expressed as follows.
|Vk(Ad,Bd)|=Π(i=1..k)|Ii(Bd)|−Π(i=1..k)|∩Ii(Ad,Bd)|
This equation states that the volume of Vk(Ad,Bd) equals the k-dimensional projection volume of Bd subtracted by the k-dimensional projection volume of the intersection of Ad and Bd. It should be noted that the Vk function is a multidimensional extension to the V1 function set forth above.
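The volume relation can be checked numerically with a short Python sketch. It assumes closed integer intervals, so that the length of [lo, hi] is hi−lo+1, and it takes the projection of the intersection interval by interval; the two boxes are made-up values chosen for illustration.

```python
from math import prod


def length(interval):
    """Number of integer points in a closed interval (0 if empty)."""
    lo, hi = interval
    return max(0, hi - lo + 1)


def vk_volume(a, b, k):
    """|Vk(A, B)|: k-projection volume of B minus that of the intersection."""
    inter = [(max(al, bl), min(ar, br)) for (al, ar), (bl, br) in zip(a, b)]
    proj_b = prod(length(i) for i in b[:k])
    proj_inter = prod(length(i) for i in inter[:k])
    return proj_b - proj_inter


a = ((4, 7), (4, 7))   # hypothetical earlier entry
b = ((2, 9), (2, 9))   # hypothetical later entry that contains a
print(vk_volume(a, b, 1), vk_volume(a, b, 2), vk_volume(b, a, 2))
```

Swapping the arguments changes the result, which illustrates that Vk( ) is not symmetrical: here B loses only part of its volume to A, while A lies entirely inside B.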
As an illustration,
Let Ad=(I1, . . . , Id)a and Bd=(I1, . . . , Id)b be two d-boxes. Let ×(i=1..k) ∩Ii(Ad,Bd) be denoted by ∩Vk(Ad,Bd). For all positive integers 2≦k≦d,
Vk(Ad,Bd)=(Vk−1(Ad,Bd)×Ik(Bd))⊕(∩Vk−1(Ad,Bd)×(Ik(Bd)⊖∩Ik(Ad,Bd))) (12)
with the initial value V1(Ad,Bd)=I1(Bd)⊖∩I1(Ad,Bd). This results in the following equation:
Observe that V1(Ad,Bd)=I1 (Bd)⊖∩I1(Ad,Bd) is the seed value for the recurrence relation (2≦k≦d). To establish the recurrence relation, consider
A simplified version of this equation yields:
Combining equations 13 and 15 yields:
Lemma 3. For all positive integers 1≦k≦d, Vk(Ad,Bd) and ∩Vk(Ad,Bd) are disjoint. This may be proven by:
Furthermore, a d-box partition of Vd (Ad,Bd) is expressed as follows:
This may be proven as follows. Let V1(Id(Ad),Id(Bd))≡Id(Bd)⊖∩Id(Ad,Bd). Vd(Ad,Bd) is partitioned into two disjoint parts:
This implies that ∩Vd-1×V1(Id(Ad),Id(Bd)) and Vd-1×Id(Bd) are disjoint, since ∩Vd-1 ∩ Vd-1=∅ based on Lemma 3. Turn next to Vd-1×Id(Bd). Based on the recurrence relation set forth above, this term can be further partitioned into two disjoint parts:
Furthermore, it can be inferred that there is no intersection between:
1) ∩Vd-1×V1(Id(Ad),Id(Bd)) (21)
2) ∩Vd-2×V1(Id-1(Ad),Id-1(Bd))×Id(Bd) (22)
because (∩Vd-1×V1(Id(Ad),Id(Bd))) ∩ (Vd-1×Id(Bd))=∅. This process is repeated until the initial value for the recurrence relation is reached.
Summing up all results gives rise to the following.
Note that V1(I1(Ad),I1(Bd))≡V1(Ad,Bd). It is shown that the d-boxes in equation (24) are disjoint, and hence form a d-box partition of Vd(Ad,Bd). This has important implications. For instance, it provides a recursive method of obtaining a d-box partition of Vd(Ad,Bd). It also offers an efficient piecemeal means for computing the d-box partition, thereby overcoming a significant dimensionality-induced complexity. For instance, identifying a d-box partition of equations (21) and (22) is reduced to identifying a 1-box partition of V1(Id(Ad),Id(Bd)) and V1(Id-1(Ad),Id-1(Bd)).
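The piecemeal construction can be sketched in Python as follows. This is an illustrative reading of the recurrence, assuming closed integer intervals: for each dimension k, the intersecting intervals of the earlier dimensions are combined with a 1-box piece of V1 in dimension k and with the full intervals of B in the later dimensions. The function names and the two example boxes are assumptions made for the sketch.

```python
def v1(ia, ib):
    """1-box partition of ib minus (ia intersect ib): at most two intervals."""
    lo, hi = max(ia[0], ib[0]), min(ia[1], ib[1])
    if lo > hi:                      # no overlap: ib is untouched
        return [ib]
    pieces = []
    if ib[0] < lo:
        pieces.append((ib[0], lo - 1))
    if hi < ib[1]:
        pieces.append((hi + 1, ib[1]))
    return pieces


def vd_partition(a, b):
    """Disjoint d-boxes that together cover b minus (a intersect b)."""
    inter = [(max(al, bl), min(ar, br)) for (al, ar), (bl, br) in zip(a, b)]
    if any(lo > hi for lo, hi in inter):
        return [tuple(b)]            # a and b do not intersect at all
    boxes = []
    for k in range(len(b)):
        for piece in v1(a[k], b[k]):
            # intersections before k, one V1 piece at k, full b after k
            boxes.append(tuple(inter[:k]) + (piece,) + tuple(b[k + 1:]))
    return boxes


a = ((4, 7), (4, 7))   # hypothetical earlier entry
b = ((2, 9), (2, 9))   # hypothetical later entry
for box in vd_partition(a, b):
    print(box)
```

Each dimension contributes at most two boxes, so a pair of d-boxes yields at most 2d spinoff boxes. Iterating over the dimensions in a different order would give a different but equivalent partition.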
Lemma 4. Let Ad=(I1, . . . , Id)a and Bd=(I1, . . . , Id)b be two d-dimensional boxes. Let C1d, . . . , Cmd be a set of d-boxes obtained via equation (18). Then there is no intersection among Ad, C1d, . . . , Cmd. This may be proven as follows:
This equation implies that Ad∩Ckd=∅. Since C1d, . . . , Cmd form a d-box partition of Vd(Ad,Bd), then Cid∩Cjd=∅ with i≠j. The lemma is thus proved. This leads to the following aspect of the invention. For a given pair of two extended ACL entries ai=(B5,S)i, aj=(B5,S)j (ajπai), the union of ai and ({V5(Bi5,Bj5)},S(aj)) is an order-free equivalent to entry pair ai and aj. This may be established by noting that it follows from Lemma 4 that Bi5 and V5(Bi5,Bj5) are disjoint.
As an illustration, an example from
For a given extended ACL (a1, . . . , an), one can extend the ACL procedure of
A worst-case analysis of the procedure is provided as follows. First an upper bound on the size of the order-free equivalent based on the concept of trellis is given. Then a discussion of the worst-case complexity analysis of the algorithm is provided. The following definition and lemma are provided to facilitate the proof.
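Definition 13. For x=(x1, . . . , xn), y=(y1, . . . , yn)∈Rn++, where R++=(0, ∞), let x[1]≧ . . . ≧x[n] and y[1]≧ . . . ≧y[n] denote the components of x and y in decreasing order. If Σ(i=1..k) x[i]≦Σ(i=1..k) y[i] for k=1, . . . , n−1 and Σ(i=1..n) x[i]=Σ(i=1..n) y[i], then y majorizes x. And according to lemma 5, if y “majorizes” x, then Π(i=1..n) yi≦Π(i=1..n) xi. An immediate consequence of the lemma is that Π(i=1..n) xi≦((Σ(i=1..n) xi)/n)^n, since x majorizes the vector whose n components all equal the mean (Σ(i=1..n) xi)/n.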
Theorem: Let n be the size of an order-dependent extended ACL. Then the maximum size of its order-free equivalent is bounded by O((n/d)^d), where d is the number of dimensions (e.g., d=5 for an extended ACL). This can be proven based on the notion of trellises. One can construct a d-dimensional trellis overlapped pattern among ACL entries and prove that this overlapped pattern yields the maximum size of the order-independent equivalent.
For instance, let ni be the number of disjoint intervals on the i dimensional space and
An order dependent extended ACL can be constructed as follows. For the last n1 entries, construct the n1 disjoint intervals on I1, while assigning the full ranges to I2, . . . , Id. For the entries from n−(n1+1) to n−(n1+n2), construct the n2 disjoint intervals on I2, while assigning the full ranges to I1,I3, . . . , Id, and so forth. Under this setting, the cross-section area of I1×I2 is partitioned into n1 piecewise disjoint rectangles, called “slabs,” by vertical lines parallel to the y-axis. Next, each vertical rectangle is partitioned into n2 rectangles, called “cells,” by lines parallel to the x-axis (see
By the ACL's hereditary property, each long horizontal shaded rectangle in
A corollary (corollary 2) to this is that the complexity of an extended ACL is bounded by O((n/d)^(d−1)). This corollary states that when more than one dimension is involved, the complexity of an ACL depends not only on the number of dimensions d but also on the size n of the ACL. In contrast, in the one-dimensional case of standard ACLs, the complexity of an ACL is bounded by O((n/d)^0)=O(1). This shows that the complexity analysis of standard ACLs in lemma 2 is a special case of this corollary where only one dimension is involved.
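The bound can be illustrated with a small brute-force check in Python. The sketch assumes, as a simplification of the trellis construction, that the n entries are split into n1, . . . , nd disjoint intervals with n1+ . . . +nd=n and that the resulting number of cells is the product of the ni; the function name max_cells is hypothetical.

```python
from itertools import product
from math import prod


def max_cells(n, d):
    """Largest product n1*...*nd over positive integers summing to n."""
    best = 0
    for parts in product(range(1, n), repeat=d - 1):
        last = n - sum(parts)        # the remaining entries go to dimension d
        if last >= 1:
            best = max(best, prod(parts) * last)
    return best


for n, d in [(12, 3), (10, 3), (10, 2)]:
    print(n, d, max_cells(n, d), (n / d) ** d)
```

In each case the maximum is reached when the ni are as equal as possible and never exceeds (n/d)^d, in line with the majorization argument.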
Next, it will be shown how to employ the framework to address some fundamental ACL problems, including efficient determination of semantic differences among ACLs, accurate identification of redundant entries, and quantitative evaluation of the impact of order-dependency on each entry in an ACL.
In one scenario, a comprehensive experimental study was conducted based on hundreds of extended ACLs gathered from a large enterprise network.
Definition 14. Let m be the size of an order-free ACL equivalent A, with entries ai=(Bd,S)i, 1≦i≦m. Let |Ik(Bd)|i, 1≦k≦d, be the kth interval length of ai, and let S(ai) be its classification status. The positive/negative volumes of that ACL are as follows:
where the subscripts p and n denote the positive/negative volumes, respectively. Furthermore, let I be an interval, and IL and IR be the left end and right end points of the interval I. Then the volume based hash function is defined as:
In accordance with an aspect of the invention, one step is to choose easily computable metrics. Then one may use these metrics to measure ACLs and compare the semantic difference in ACLs. The notion of the positive and negative volume of an ACL was introduced in definition 14 above.
A prerequisite of volume calculation is the equivalency between order-dependent ACL and order-free equivalent illustrated in the diagram of
The volume determination process herein exploits the fact that d-boxes are disjoint. Hence the total volume of an ACL may be obtained by summing up the volume of each individual d-box. Thus, according to one aspect of the invention, the computational complexity is reduced to O(nd).
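A minimal Python sketch of this summation is given below. It assumes the order-free ACL is supplied as a list of (box, action) pairs with closed integer intervals, and that permit and deny are encoded by the illustrative constants PERMIT and DENY; the example ACL is made up for the sketch.

```python
from math import prod

PERMIT, DENY = 1, 0  # assumed encoding of the binary action S


def box_volume(box):
    """Volume of a d-box given as closed integer intervals."""
    return prod(hi - lo + 1 for lo, hi in box)


def acl_volumes(order_free_acl):
    """Return (positive volume, negative volume) of an order-free ACL."""
    positive = sum(box_volume(b) for b, s in order_free_acl if s == PERMIT)
    negative = sum(box_volume(b) for b, s in order_free_acl if s == DENY)
    return positive, negative


# Hypothetical one-dimensional (standard) ACL over a 32-bit address space:
# nine permitted addresses, everything else explicitly denied.
acl = [
    (((10, 18),), PERMIT),
    (((0, 9),), DENY),
    (((19, 2**32 - 1),), DENY),
]
print(acl_volumes(acl))
```

Since the boxes are disjoint, each one is visited once and costs d multiplications, which gives the O(nd) running time noted above.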
Piecemeal construction as set forth above allows one to build up a d-box partition recursively by gluing together the projected results on each dimension according to a given dimension ordering. Different dimension ordering for piecemeal construction may result in a different d-box partition. For example,
The volume of an ACL defined in accordance with equations 27 and 28 has a particular geometric interpretation. In particular, it denotes the amount of d-dimensional space the ACL occupies and is invariant with the dimension ordering in the d-box partition. A small positive volume of an ACL implies a tighter control on traffic flow. The ACL volume therefore appears to be an ideal quantitative security metric for enterprise networks. However, equal ACL volume constitutes a necessary but not sufficient condition for the semantic equivalence or relatedness of two ACLs. Thus, ACL volume alone may not be an ideal metric for discerning the semantic difference among ACLs, because the semantic meaning of an ACL relies not only on its volume but also on its position in d-dimensional space.
In view of this, the volume-based hash function introduced above in equations (29)-(30) may be employed as an index for efficiently discerning whether ACLs are semantically equivalent. Hp(A) takes an order-free ACL as the input parameter and generates the hashed value that reflects the volume and position of the ACL. Another aspect of the invention focuses on minimizing the chance of collision, rather than on dictionary operations of a hash function such as INSERT, SEARCH, and DELETE. To make the hash function sensitive to both the volume and position, equations (29)-(30) are constructed as follows:
Each element in equations (29)-(30) is a product of two conceptually different sub-elements, namely volume and position as shown in equation (31). Although the hash-based function may not guarantee 100% accuracy in discerning the semantic equivalence among ACLs in theory, in practice it is more than accurate enough to distinguish the subtle semantic differences among ACLs. One very attractive feature of this approach is its computational efficiency. It takes O(nd) time to compute the hash value for an order-free equivalent of size of n, and takes O(1) for comparing whether two ACLs are semantically identical.
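The idea of combining volume and position can be sketched in Python. The exact form of equations (29)-(31) is not reproduced in this text, so the per-interval term below, interval length multiplied by the sum of the end points IL+IR, is an assumption chosen purely for illustration, as are the function names. The sketch takes only the boxes of one classification (for example, the permit boxes) as input.

```python
from math import prod


def interval_term(interval):
    """Assumed per-interval element: length (volume) times IL + IR (position)."""
    lo, hi = interval
    return (hi - lo + 1) * (lo + hi)


def volume_position_hash(boxes):
    """Sum over disjoint d-boxes of the product of their interval terms."""
    return sum(prod(interval_term(i) for i in box) for box in boxes)


whole = [((2, 9), (2, 9))]
split = [((2, 5), (2, 9)), ((6, 9), (2, 9))]   # same region, two boxes
shifted = [((3, 10), (2, 9))]                  # same volume, moved by one

print(volume_position_hash(whole),
      volume_position_hash(split),
      volume_position_hash(shifted))
```

With this particular choice the value does not depend on how a region is cut into boxes, while moving the region changes the value even though its volume is unchanged. Collisions remain possible, consistent with the caveat that a hash cannot guarantee 100% accuracy.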
Based on a multidimensional interval tree structure, the optimal run-time overhead of a deterministic algorithm for determining whether two sets of d-boxes are identical is reported to be O(n^2 log^d n). By building on top of such a framework, it can be shown how to support redundancy detection and quantitative evaluation of the impact of order-dependency on each entry in a given ACL.
As shown in block 408, an order-free operation is performed. This function uses the queue F as its input and a queue Q as its output. Q will contain the corresponding order-free equivalent a. The order-free operation may follow the process set forth in
At block 410, the order-free entry a is removed from the output queue Q (e.g., a LIFO queue). Then at block 412, the ordering information (e.g., sequence number) of the order-free entry is obtained. At block 414, the volume of this order-free entry is determined (e.g., “spinoff volume”), and at block 416 the Volume Array is updated in the corresponding entry. Then, at block 418, the number of order-free entries with respect to a given ACL entry is counted. If the output queue Q is not empty when checked at block 420, then the process returns to block 412 where the next order-free entry is processed.
Next, the degree of alteration for each ACL entry is computed. For instance, the scope contraction ratio D(A,ai) for each ACL entry may be computed. At block 422, it is checked whether an entry is redundant. For instance, for a given entry i, the process may evaluate whether the spinoff volume of that entry is zero (e.g., spinoff[i]==0). If so, then that entry is marked as redundant at block 424. If it is not redundant, then the spinoff volume of that entry (i) is updated (e.g., spinoffvolume[i]=spinoffvolume[i]/volume[i]). If there are any other entries remaining, the process returns to block 422 as shown by the dashed lines. Redundant entries may be removed as shown in block 428, resulting in a modified ACL. The process terminates at block 430.
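The overall flow can be approximated by the following Python sketch. It is a simplification made under stated assumptions: entries are tuples of closed integer intervals listed in first-match order, the permit/deny action is ignored because it does not affect the spinoff volume, queues are replaced by plain lists, and only entries fully covered by earlier entries are flagged as redundant. The names subtract and analyze are hypothetical.

```python
from math import prod


def volume(box):
    return prod(hi - lo + 1 for lo, hi in box)


def subtract(box, earlier):
    """Disjoint boxes covering box minus its overlap with earlier."""
    inter = [(max(bl, el), min(br, er))
             for (bl, br), (el, er) in zip(box, earlier)]
    if any(lo > hi for lo, hi in inter):
        return [box]                          # no overlap, nothing removed
    out = []
    for k, ((bl, br), (il, ir)) in enumerate(zip(box, inter)):
        prefix, suffix = tuple(inter[:k]), tuple(box[k + 1:])
        if bl < il:
            out.append(prefix + ((bl, il - 1),) + suffix)
        if ir < br:
            out.append(prefix + ((ir + 1, br),) + suffix)
    return out


def analyze(acl):
    """Return (scope contraction ratios, indices of redundant entries)."""
    ratios, redundant = [], []
    for i, box in enumerate(acl):
        spinoffs = [box]
        for earlier in acl[:i]:               # carve out every earlier entry
            spinoffs = [p for s in spinoffs for p in subtract(s, earlier)]
        spinoff_volume = sum(volume(s) for s in spinoffs)
        if spinoff_volume == 0:
            redundant.append(i)
        ratios.append(spinoff_volume / volume(box))
    return ratios, redundant


acl = [((4, 7), (4, 7)), ((2, 9), (2, 9)), ((5, 6), (5, 6))]
print(analyze(acl))
```

In this made-up ACL the second entry keeps 48 of its 64 points, and the third entry is completely shadowed by the first, so it is reported as redundant.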
In one scenario in accordance with aspects of the invention, a set of standard ACLs was collected from an enterprise network and evaluated to uncover abnormalities hidden in the ACLs and to discern semantic differences among ACLs. Information concerning the set of standard ACLs is provided in the table below.
The above table shows that among the total of 373 standard ACLs analyzed, 5 contained redundant entries, accounting for roughly 1.34% of the total examined ACLs. The mean complexity ψ(A) of the standard ACLs was 1.625. Complexity distribution can be seen in
The processes discussed above may be employed not only to translate an order-dependent ACL into an order-free ACL, but also to identify redundant entries during the translation process. For instance,
Next, an example of translating an existing order dependent ACL into its order-free equivalent is considered.
It can be seen from the graph in
To better visualize, in the graph in
Both explicit and implicit denies have the same effect on packet classification, but they differ in their semantic meaning. Such a subtle difference between an implicit deny and an explicit deny can be captured by the volume-based analysis set forth above. It follows from equations (27)-(28) that the positive volume of the ACL in the present example is 9 while the negative volume is 4,294,967,287. However, removing the explicit deny entry at the end of the ACL list implies that the ACL list is terminated by an implicit deny entry, which corresponds to a zero negative volume.
a) and (b) illustrate examples of ACL volume distribution. They are plotted with a log-scaled x-axis and show the positive/negative ACL volume distribution constructed from the 373 standard ACLs gathered from a large set of enterprise networks in the present example.
In another example, 314 extended ACLs were analyzed. Analysis showed that 33 of these ACLs contained redundant entries, which accounts for 10.5% of the total examined ACLs, in sharp contrast to the 1.34% of standard ACLs with redundant entries discussed above. Among extended ACLs with redundant entries, on average 7 out of 100 entries were redundant, as shown in the following table.
The table above presenting statistics for extended ACLs shows that the average and maximum complexity of extended ACLs calculated over 314 examined ACLs is 2.367 and 92.88, respectively. In contrast, the average and maximum complexity of standard ACLs is 1.628 and 1.917. This is in line with the obtained theoretical results. When an ACL has more than one dimension, its complexity depends not only on its dimensionality but also on its size.
The dimension-induced complexity can be illustrated as follows. One extended ACL had a maximum complexity (92.88) out of the 314 extended ACLs. Aspects of the invention were used to determine that this ACL contained 230 entries, in which 118 entries were redundant. It yielded 10,403 order-free equivalent entries (basic building blocks).
The spinoff entry distribution for this is constructed and plotted in
An experimental study was performed to evaluate the ability of the hash-based function to discern semantic change due to modification of ACLs, and to quantify the sensitivity to ACL change in terms of the complexity metric. Complexity and volume-based hash values for the original ACL were used as the baseline for comparison. In each run, an individual entry in the ACL was deleted and the extent of its impact in comparison to the baselines was quantified.
As with the complexity variation analysis, the sensitivity of the volume-based hash to semantic changes in an ACL due to the deletion of an entry can be verified. Equation 32 below gives the difference in the positive volume based hash between the ACL with a deleted entry and the original ACL. Similarly, equation 33 gives the difference in the negative volume based hash.
HDp(A,A′)=(Hp(A′)−Hp(A)) (32)
HDn(A,A′)=(Hn(A′)−Hn(A)) (33)
FIGS. 18(a) and 18(b) show that the values do not change when a redundant entry is deleted. However, the values do change for all non-redundant entry deletions. This validates the sensitivity of the volume based hash to the semantics of an ACL. And
To see the effects of modification on ACLs, an extensive experimental study was conducted from three different perspectives, namely reordering, deletion, and insertion. To quantify the effect of entry reordering, a random permutation was generated among entries and then the complexity of the permuted ACL was calculated in comparison to the baseline value. The experiment for random deletion was conducted in a similar fashion.
The experimental study of the insertion effect was performed via ACL synthesis. ACL A, which was used as the base ACL, contains 230 entries with 118 redundant entries. The complexity of A was measured as 92.88. ACL B, which was used as the additive ACL, contains 256 entries with 97 redundant entries. The complexity of B was measured as 25.96.
In the experimental study, a certain percentage of entries was randomly taken from B and then randomly added into A. Complexity statistics under different permutation, deletion, and insertion rates are presented in the table below.
Each row in the above table was obtained via 200 independent runs. It is shown that generally an increased deletion rate results in a decreased ACL complexity (decreased mean complexity), while an increased insertion rate causes an increased ACL complexity. The complexity of ACLs does not appear very sensitive to the permutation rate. Complexity variation (standard deviation) is shown to be highly correlated with the deletion and insertion rates. This indicates that the extent of complexity fluctuation grows as the deletion and insertion rates increase.
The experimental study based on ACLs from an enterprise network indicates the presence of a relatively high percentage of redundant entries. It also suggests that ACLs may be highly sensitive to change. Thus, deleting one entry at a particular position in an ACL may significantly alter its semantic meaning, as evidenced by a relatively big change in complexity (see
Aspects of the invention may be implemented using a computer network such as shown in
The client device 602 may couple to a server 606 via router 608. The server 606 is desirably associated with database 610, which may provide content to the client device 602 if access control list criteria are satisfied. The router 608 may include a firewall (not shown) and maintain an ACL therein.
Each device may include, for example, one or more hardware-based processing devices and may have user inputs such as a keyboard 612 and mouse 614 and/or various other types of input devices such as pen-inputs, joysticks, buttons, touch screens, etc. Display 616 may include, for instance, a CRT, LCD, plasma screen monitor, TV, projector, etc.
The user device 602, server 606 and router 608 may contain at least one processor, memory and other components typically present in a computer. As shown, the router 608 includes a processor 618 and memory 620. Components such as a transceiver, power supply and the like are not shown in any of the devices of
Memory 620 stores information accessible by the processor 618, including instructions 622 that may be executed by the processor 618 and data 624 that may be retrieved, manipulated or stored by the processor. The firewall may be implemented by the router 608, where the ACL(s) is stored in memory 620. The memory 620 may be of any type capable of storing information accessible by the processor, such as a hard-drive, ROM, RAM, CD-ROM, flash memories, write-capable or read-only memories.
The processor 618 may comprise any number of well known processors, such as processors from Intel Corporation. Alternatively, the processor may be a dedicated controller for executing operations, such as an ASIC.
The instructions 622 may comprise any set of instructions to be executed directly (such as machine code) or indirectly (such as scripts) by the processor. In that regard, the terms “instructions,” “steps” and “programs” may be used interchangeably herein. The instructions may be stored in any computer language or format, such as in object code or modules of source code. The functions, methods and routines of instructions in accordance with the present invention are explained in more detail below.
Data 624 may be retrieved, stored or modified by processor 618 in accordance with the instructions 622. The data may be stored as a collection of data. For instance, although the invention is not limited by any particular data structure, the data may be stored in computer registers, in a relational database as a table having a plurality of different fields and records. In one example, the memory 620 may include one or more stacks or queues for storing the data. In one example, the stacks/queues are configured as LIFOs.
The data may also be formatted in any computer readable format. Moreover, the data may include any information sufficient to identify the relevant information, such as descriptive text, proprietary codes, pointers, references to data stored in other memories (including other network locations) or information which is used by a function to calculate the relevant data.
Although the processor 618 and memory 620 are functionally illustrated in
Although aspects of the invention herein have been described with reference to particular embodiments, it is to be understood that these embodiments are merely illustrative of the principles and applications of the present invention. It is therefore to be understood that numerous modifications may be made to the illustrative embodiments and that other arrangements may be devised without departing from the spirit and scope of the invention as defined by the appended claims.
While certain processes and operations have been shown in certain orders, it should be understood that they may be performed in different orders and/or in parallel with other operations unless expressly stated to the contrary.
The instant application claims the benefit of U.S. Provisional Patent Application No. 61/149,101, entitled “System and Method for Determining Semantic Equivalence Between Access Control Lists (ACL),” filed Feb. 2, 2009, the entire disclosure of which is hereby expressly incorporated by reference herein.