At least one embodiment of the present invention pertains to data storage systems, and more particularly, to a policy engine to control the servicing of requests received by a storage server.
Modern computing systems often employ various types of storage servers. Storage servers are used for a variety of purposes, such as to provide multiple users with access to shared data and to back up mission critical data. A file server, for example, is a type of storage server which operates on behalf of one or more clients to store and manage shared files in a set of mass storage devices, such as magnetic or optical storage based disks or tapes. The mass storage devices are typically organized into one or more volumes of Redundant Array of Independent (or Inexpensive) Disks (RAID).
One configuration in which a file server can be used is a network attached storage (NAS) configuration. In a NAS configuration, a file server can be implemented in the form of an appliance, called a filer, that attaches to a network, such as a local area network (LAN) or a corporate intranet. An example of such an appliance is any of the Filer products made by Network Appliance, Inc. in Sunnyvale, Calif.
A storage server can also be employed in a storage area network (SAN) environment. A SAN is a highly efficient network of interconnected, shared storage devices. One difference between NAS and SAN is that in a SAN, the storage server (which may be an appliance) provides a remote host with block-level access to stored data, whereas in a NAS configuration, the storage server provides clients with file-level access to stored data. Some storage servers, such as certain Filers from Network Appliance, are capable of operating in either a NAS mode or a SAN mode or even both modes at the same time. Such dual-use devices are sometimes referred to as “unified storage” devices.
It is desirable to allow third parties software developers to develop applications that can access and/or control processes that run on a filer. Doing so can enhance and add value to such devices. However, development of such applications by third parties can be costly and complicated and risk having an unintended negative impact on other processes of the storage server. As a result, relatively few third party applications are available at present. It is desirable, therefore, to facilitate the development of such third party applications, without requiring that all such applications be developed by one company, while reducing the risk of a negative impact on the storage server.
The present invention includes a method of operating a storage server. The storage server receives from a client a request to perform a storage-related operation relating to a set of data. If the first request satisfies a defined criterion, the storage server invokes a policy engine configured to determine a disposition of the request. The storage server then receives a response from the policy engine indicating a disposition of the request and responds to the request in accordance with the response from the policy engine.
Another aspect of the invention is a method of operating a policy engine. The policy engine receives a first request and information relating to a set of data from a storage server. The first request is in response to a storage-related client request received by the storage server from a client and relating to the set of data. The policy engine applies a defined policy using the information relating to a set of data and then sends a first response to the storage server to indicate a result of applying the defined policy. The first response is to cause the storage server to send a second response to the client in accordance with the first response.
Other aspects of the invention will be apparent from the accompanying figures and from the detailed description which follows.
One or more embodiments of the present invention are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:
A method and apparatus for controlling the servicing of requests received by a storage server are described. Note that in this description, references to “one embodiment” or “an embodiment” mean that the feature being referred to is included in at least one embodiment of the present invention. Further, separate references to “one embodiment” or “an embodiment” in this description do not necessarily refer to the same embodiment; however, such embodiments are also not mutually exclusive unless so stated, and except as will be readily apparent to those skilled in the art from the description. For example, a feature, structure, act, etc. described in one embodiment may also be included in other embodiments. Thus, the present invention can include a variety of combinations and/or integrations of the embodiments described herein.
It is desirable to increase the number and capability of applications that enhance the functionality of a filer, without requiring that all such applications be developed by one company. To accomplish this, an interface that allows independent software developers to create applications that access and control processes that run on a filer is needed. Such an interface allows isolation of development and features to reduce any negative impact on the filer, while still allowing control of significant features in the filer, thus allowing third parties to add value to the filer.
In this regard, note that Muhlestein (referenced in full above) describes a technique for “outsourcing” from a filer various tasks, such as virus scanning, data compression/decompression, encryption/detection, database compaction, data translation, or any of various other CPU intensive tasks. In particular, Muhlestein describes the use of a “cluster device” to perform such tasks on behalf of a filer. As described further below, the technique described in Muhlestein can be extended, modified and/or generalized advantageously to achieve the above-noted goal.
Described herein is a technique for outsourcing certain tasks from a filer to a policy engine (which can be a device similar to a cluster device such as described in Muhlestein). The policy engine is, in certain embodiments, a dedicated application server used to implement various defined policies to control a filer's servicing of client requests. For example, the policy engine can be used to implement policies to control various storage-related operations relating to data managed by the filer. The controlled operations may include, for example, creation, deletion, modification, opening, closing, renaming and/or changing attributes of files or directories managed by the filer.
A simple example of a networked storage system is illustrated in
The filer 2 is capable of operating in either a NAS mode or a SAN mode, or both; that is, the filer to can provide either file-level access or block-level access to stored data. The filer 2 implements a number of different storage protocols, such as the common Internet file system (CIFS), network file system (NFS), hypertext transport protocol (HTTP), simple network management protocol (SNMP), etc., and selectively uses these protocols as needed.
In a NAS implementation, the interconnect 3 may be essentially any type of computer network, such as a local area network (LAN), a wide area network (WAN), metropolitan area network (MAN) or the Internet, and may implement the Internet Protocol (IP). In a SAN implementation, the interconnect 3 may be, for example, a Fibre Channel switching fabric which implements the Fibre Channel Protocol (FCP).
The storage system also includes a remote policy engine 6, which is connected to the filer through an interconnect 7 (which can be the same interconnect as interconnect 3). Interconnect 7 may be, for example, a conventional computer network, such as a LAN or a corporate intranet. The term “remote”, in this context, means that the policy engine 6 is external to the filer 2 and is connected to the filer 2 only indirectly, such as through a network. However, the term “remote” does not imply that the actual physical separation between the filer 2 and the policy engine 6 must be large; in fact, the physical separation between these devices can be quite small. Furthermore, it should be noted that the operations described herein as being performed by the policy engine 6 could instead be implemented internally to the filer.
In general terms, the role of the policy engine 6 is to control the servicing of certain requests received by the filer 2 from clients 1, by implementing various predefined policies. The operating system in the filer 2 includes an application programming interface (API) that allows the filer 2 to interface with the policy engine 6, as described further below. The policy engine 6 and the associated API in the filer 2 allow third party control of various processes of the filer 2. For example, the policy engine 6 can be used to implement policies to control various storage-related operations on files by the filer 2, such as creating, opening, closing, deleting, modifying, renaming and/or changing attributes of files.
Each policy can be implemented in the policy engine 6 in the form of a simple software algorithm. A few examples of policies that can be implemented in a policy engine 6 are:
block creation of files of a particular file type
deny a particular user access to data stored in a particular disk or volume
limit certain users' access to a particular file, disk or volume to a specified number of times per day
apply a load balancing procedure to any data accessed more than a specified number of times in a specified period of time.
The policies may be defined/installed or enabled within the policy engine 6 by a network administrator during a configuration process, via a separate management console (not shown). Similarly, the filer 2 can also be configured/managed via a separate management console.
It might be desirable to block the creation of certain types of files, at least under certain conditions. For example, MP3 files tend to consume large amounts of storage space, such that it may be desirable not to give users unlimited freedom to store such files on the storage system. Consequently, a policy can be defined and implemented in the policy engine 6 to cause the policy engine 6 to block creation of MP3 files (or any other specified type of file), either unconditionally or under specified conditions. A policy may be applied only to certain users or to certain client machines or groups of user/clients, for example, or only during certain times of today, etc.
Thus, the policy engine 6 can be used to implement policies to control essentially any storage-related operation of the filer 2, based on any of various criteria, such as filename, file type (e.g., file extension), the user or client which initiated the request, disk usage, specified quotas, etc., or any combination thereof. Further, the policy engine 6 can be used to collect and maintain summaries of attributes of data, such as keeping and running track of directory usage. The summarized data can be used by heuristics which implement the policies within the policy engine 6.
At startup, the policy engine 6 gets the name of the filer(s) 2 it will be serving from a command line interface or some other invocation interface. The policy engine 6 communicates with the filer 2 using a named pipe and registers itself with the filer 2 as a policy engine 6. The policy engine 6 then tells the filer 2 which communication method it will be using and which file operation(s) to screen. Separate named pipes are created for communications from the filer 2 to the policy engine 6 (“screen requests”) and for communications from the policy engine 6 to the filer 2 (“screen responses”).
As part of the registration process, the policy engine 6 causes the filer 2 to store a set of criteria for each policy the policy engine 6 will implement. The criteria, when satisfied by a client request, will trigger the filer 2 to invoke the policy engine 6 with a “screen request”. When a client-initiated request received by the filer 2 meets any such set of criteria, the filer 2 defers to the policy engine 6 for disposition of the request, rather than immediately attempting to satisfy the request. For example, a set of criteria which the policy engine 6 registers with the filer 2 might essentially tell the filer 2, “Notify me (the policy engine 6) before servicing any request to create an MP3 file.” When the criteria are met, the filer 2 sends no response to the requesting client until the policy engine 6 has provided a response to the filer 2; hence, the policy engine's servicing of screen requests from the filer 2 is synchronous with the filer's servicing of client requests. If a negative response is received from the policy engine 6 or if the response time exceeds a predefined limit, the request from the client 1 will be denied.
In one embodiment, the criteria examined for purposes of generating screen requests are implemented as filters. In this embodiment, the filer 2 sends a screen request only if a certain feature of a file is matched (or not matched). The filer 2 in this embodiment acts to filter out request that are not relevant to the policy on the policy engine 6, reducing the communication required between the filer 2 and policy engine 6. For example, the filer 2 can send screen requests based on filters on file extensions, metadata attributes (e.g., an “offline” bit), or file attributes (file size, file age, etc.).
With the screen request, the filer 2 also sends to the policy engine 6 information relating to the file, such as metadata of the file, e.g., canonical names for the file, the file extension of the file, the access type, user name of the requesting user, etc. The policy engine 6 may also set or modify such metadata of files managed by the filer 2, for later use. The information sent to the policy engine 6 with a particular screen request will depend upon which set of criteria the client request matched, and the criteria depend upon the policy with which they are associated. For example, if the matched criteria are based on the underlying policy of blocking creation of MP3 files, the filer 2 may include the MP3 file extension in the screen request.
In another example, the policy engine 6 is used to provide functionality that dynamically modifies information that is displayed to a client application based on a user credential or process ID. The filer 2 transfers to the policy engine 6, the identity of the client, process ID, or other user-centric attribute when a file is accessed. The file transformation is then carried out by the policy engine 6 or systems called by it. This functionality takes advantage of the synchronous nature of filer-to-policy-engine communication.
The policy engine 6 receives the screen request and the information relating to the file from the filer 2. At 204 the policy engine 6 applies the appropriate one or more of its defined policies to the received information. The result of applying the policy may be a decision by the policy engine 6 to approve or deny the request. To simplify description, it is generally assumed herein that the result of applying the policy is either approval or denial of the client request. Note, however, that the result of applying the policy can be something other than approval or denial of the request; in other words, the policy engine 6 can perform, or can cause the filer 2 or some other device to perform, some other predetermined action.
After applying the policy (or multiple policies), the policy engine 6 sends a screen response by RPC to the filer 2 at 205 to indicate the result. At 206 the filer 2 sends a response in accordance with the screen response to the client 1 which made the original request. For example, if the screen response was to approve the client request, the filer 2 satisfies the client request (in the current example, the filer 2 would allow creation of the file per the client request). If the screen response was to deny the request, the filer 2 sends the client a message indicating that the request was denied.
A policy engine 6 such as described herein can be used to control operations of multiple storage servers (e.g., multiple filers 2), as illustrated in
A policy engine 6 such as described herein can be implemented as a cluster device within a cluster, as described in Muhlestein, or the policy engine 6 can be implemented as a cluster. The cluster device or a policy engine 6 within a cluster can be selected by the filer 2 in a round robin manner, for example. Also, as illustrated
In certain embodiments, multiple policies are implemented in a logic AND fashion to implement an overall policy. In certain embodiments, policies can be linked in a more complex policy expression, for example by ORing or XORing two or more policies to create a more complex expression.
The policy engine 6 can be distributed amongst multiple nodes (e.g., multiple physical machines/platforms). Accordingly, a particular policy may be distributed amongst multiple nodes. Each node may implement different policies and/or each node may implement a portion of a total policy. Additionally, information from intermediate policy engines can be used as inputs to systems for evaluation and enforcement
In the case of a multiple-node policy engine, for example, one part of the policy engine can reside in the filer 2, while the other part or parts reside externally to the filer 2. In that case the portion of the policy engine 6 residing in the filer 2 may still be external to all other processes of the filer, in that such portion of the policy engine is not linked into any executables of the filer 2, either statically or dynamically.
The distribution of policies is advantageous in that it can reduce necessary bandwidth between nodes and reduce the complexity of each node. In certain embodiments, a two-node policy engine is used to distribute implementation of a policy asymmetrically. The portion of the policy engine 6 which resides on the filer 2 can be relatively simple, providing high-performance decisions regarding the file system and/or filtering “uninteresting” events, while the off-filer part of the policy engine 6 would deal with more complex decisions and/or multiple-filer views.
The above-described communications between a filer 2 and a policy engine 6 can be implemented with the help of interface definition language (IDL) files, which are platform-independent representations of data as exchanged between two machines. Certain embodiments of the invention employ two IDLs, referred to as fprequest.idl and fpcompletion.idl. Fprequest.idl describes the RPC calls made by the filer 2 to the policy engine 6, i.e., the screen request RPCs.
Screen Request RPCs
In certain embodiments of the invention, the following calls are implemented by the policy engine 6; they are initiated from a filer 2 at the time screening of the file operation is requested:
Screen Completion RPCs
The following APIs are implemented on the filer, and are called by the policy engine.
In certain situations it may be desirable to store some files (or other data) managed by a filer 2 remotely from the filer 2, such as in a separate nearline storage device, instead of in the filer's local storage. Where a file is stored may be determined by a separate data backup application. A policy engine 6 such as described above can be advantageous in situations where files managed by a filer 2 are stored remotely from the filer 2, as will now be described.
In certain embodiments, remotely stored files are replaced in the filer 2 by a “stub” (e.g., a header), and the inode of such a file includes a flag (e.g., an “offline” bit) indicating that the file is actually stored remotely. When the filer 2 receives a request relating to such a file, the filer 2 detects this flag in the inode of the file and responds by sending a corresponding notification to the policy engine 6. In response to this notification, the policy engine 6 obtains the file from the remote storage (if appropriate after applying any applicable policies), and provides the file to the filer 2. The filer 2 then uses the file as appropriate to satisfy the client request.
The processor 41 is the central processing unit (CPU) of the processing system 40 and, thus, controls the overall operation of the processing system 40. In certain embodiments, the processor 41 accomplishes this by executing software stored in memory 42. The processor 41 may be, or may include, one or more programmable general-purpose or special-purpose microprocessors, digital signal processors (DSPs), programmable controllers, application specific integrated circuits (ASICs), programmable logic devices (PLDs), or the like, or a combination of such devices.
The memory 42 represents any form of random access memory (RAM), read-only memory (ROM), flash memory, or a combination thereof. Memory 42 stores, among other things, the operating system of the processing system 40, in which techniques of the present invention can be implemented.
Also connected to the processor 41 through the bus system 43 are (optionally) a mass storage device 44, a network adapter 45 and (at least in the case of a filer) a storage adapter 46. Mass storage device 44 may be or include any conventional medium for storing large volumes of data in a non-volatile manner, such as one or more disks. The network adapter 45 provides the processing system 40 with the ability to communicate with remote devices, such as clients and/or a filer, over a network and may be, for example, an Ethernet adapter. The storage adapter 46 allows a filer to access external mass storage devices and may be, for example, a Fibre Channel adapter or SCSI adapter.
Thus, a method and apparatus for controlling the servicing of requests received by a storage server have been described. Although the present invention has been described with reference to specific exemplary embodiments, it will be recognized that the invention is not limited to the embodiments described, but can be practiced with modification and alteration within the spirit and scope of the appended claims. Accordingly, the specification and drawings are to be regarded in an illustrative sense rather than a restrictive sense.
This is a continuation-in-part of U.S. patent application Ser. No. 10/010,959 of M. Muhlestein, filed on Nov. 30, 2001 and entitled, “Decentralized Virus Scanning for Stored Data” (hereinafter “Muhlestein”), which is a continuation-in-part of U.S. patent application Ser. No. 09/728,701 filed on Dec. 1, 2000 and entitled, “Decentralized Appliance Virus Scanning,” both of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4104718 | Poublan et al. | Aug 1978 | A |
4937763 | Mott | Jun 1990 | A |
5067099 | McCown et al. | Nov 1991 | A |
5138712 | Corbin | Aug 1992 | A |
5261051 | Masden et al. | Nov 1993 | A |
5392446 | Tower et al. | Feb 1995 | A |
5396609 | Schmidt et al. | Mar 1995 | A |
5604862 | Midgely et al. | Feb 1997 | A |
5623600 | Ji et al. | Apr 1997 | A |
5630049 | Cardoza et al. | May 1997 | A |
5649099 | Theimer et al. | Jul 1997 | A |
5649152 | Ohran et al. | Jul 1997 | A |
5682535 | Knudsen | Oct 1997 | A |
5771354 | Crawford | Jun 1998 | A |
5787409 | Seiffert et al. | Jul 1998 | A |
5819047 | Bauer et al. | Oct 1998 | A |
5819292 | Hitz et al. | Oct 1998 | A |
5835953 | Ohran | Nov 1998 | A |
5918008 | Togawa et al. | Jun 1999 | A |
5925126 | Hsieh | Jul 1999 | A |
5933594 | La Joie et al. | Aug 1999 | A |
5946690 | Pitts | Aug 1999 | A |
5963962 | Hitz et al. | Oct 1999 | A |
5968176 | Nessett et al. | Oct 1999 | A |
6014700 | Bainbridge et al. | Jan 2000 | A |
6061504 | Tzelnic et al. | May 2000 | A |
6076105 | Wolff et al. | Jun 2000 | A |
6088803 | Tso et al. | Jul 2000 | A |
6101558 | Utsunomiya et al. | Aug 2000 | A |
6108785 | Poisner | Aug 2000 | A |
6115741 | Dominekos et al. | Sep 2000 | A |
6138126 | Hitz et al. | Oct 2000 | A |
6148349 | Chow et al. | Nov 2000 | A |
6185598 | Farber et al. | Feb 2001 | B1 |
6189114 | Orr | Feb 2001 | B1 |
6226752 | Gupta et al. | May 2001 | B1 |
6230200 | Forecast et al. | May 2001 | B1 |
6237114 | Wookey et al. | May 2001 | B1 |
6253217 | Dourish et al. | Jun 2001 | B1 |
6256773 | Bowman-Amuah | Jul 2001 | B1 |
6266774 | Sampath et al. | Jul 2001 | B1 |
6275393 | Baudelot et al. | Aug 2001 | B1 |
6275939 | Garrison | Aug 2001 | B1 |
6311213 | Dawson et al. | Oct 2001 | B2 |
6324581 | Xu et al. | Nov 2001 | B1 |
6327594 | Van Huben et al. | Dec 2001 | B1 |
6327658 | Susaki et al. | Dec 2001 | B1 |
6327677 | Garg et al. | Dec 2001 | B1 |
6338141 | Wells | Jan 2002 | B1 |
6401126 | Douceur et al. | Jun 2002 | B1 |
6405327 | Sipple et al. | Jun 2002 | B1 |
6490666 | Cabrera et al. | Dec 2002 | B1 |
6502102 | Haswell et al. | Dec 2002 | B1 |
6519679 | Devireddy et al. | Feb 2003 | B2 |
6523027 | Underwood | Feb 2003 | B1 |
6542967 | Major | Apr 2003 | B1 |
6560632 | Chess et al. | May 2003 | B1 |
6577636 | Sang et al. | Jun 2003 | B1 |
6606744 | Mikurak | Aug 2003 | B1 |
6697846 | Soltis | Feb 2004 | B1 |
6721721 | Bates et al. | Apr 2004 | B1 |
6721862 | Grant et al. | Apr 2004 | B2 |
6728766 | Cox et al. | Apr 2004 | B2 |
6757753 | DeKoning et al. | Jun 2004 | B1 |
6757794 | Cabrera et al. | Jun 2004 | B2 |
6785732 | Bates et al. | Aug 2004 | B1 |
6801949 | Bruck et al. | Oct 2004 | B1 |
6802012 | Smithson et al. | Oct 2004 | B1 |
6832313 | Parker | Dec 2004 | B1 |
6859841 | Narad et al. | Feb 2005 | B2 |
6918113 | Patel et al. | Jul 2005 | B2 |
6931540 | Edwards et al. | Aug 2005 | B1 |
6970939 | Sim | Nov 2005 | B2 |
6981070 | Luk et al. | Dec 2005 | B1 |
7020697 | Goodman et al. | Mar 2006 | B1 |
7032022 | Shanumgam et al. | Apr 2006 | B1 |
7089293 | Grosner et al. | Aug 2006 | B2 |
7124180 | Ranous | Oct 2006 | B1 |
7127388 | Yates et al. | Oct 2006 | B2 |
7146377 | Nowicki et al. | Dec 2006 | B2 |
7237027 | Raccah et al. | Jun 2007 | B1 |
7293083 | Ranous et al. | Nov 2007 | B1 |
7349960 | Pothier et al. | Mar 2008 | B1 |
7555482 | Korkus | Jun 2009 | B2 |
20010013059 | Dawson et al. | Aug 2001 | A1 |
20010013064 | Cox et al. | Aug 2001 | A1 |
20020040405 | Gold | Apr 2002 | A1 |
20020042866 | Grant et al. | Apr 2002 | A1 |
20020065946 | Narayan | May 2002 | A1 |
20020087479 | Malcolm | Jul 2002 | A1 |
20020103783 | Muhlestein | Aug 2002 | A1 |
20020103907 | Petersen | Aug 2002 | A1 |
20020120741 | Webb et al. | Aug 2002 | A1 |
20020124090 | Poier et al. | Sep 2002 | A1 |
20020133491 | Sim et al. | Sep 2002 | A1 |
20020133561 | O'Brien et al. | Sep 2002 | A1 |
20020161855 | Manczak et al. | Oct 2002 | A1 |
20020194251 | Richter et al. | Dec 2002 | A1 |
20030031176 | Sim | Feb 2003 | A1 |
20030046396 | Richter et al. | Mar 2003 | A1 |
20030056069 | Cabrera et al. | Mar 2003 | A1 |
20030120476 | Yates et al. | Jun 2003 | A1 |
20030191957 | Hypponen et al. | Oct 2003 | A1 |
20030195895 | Nowicki et al. | Oct 2003 | A1 |
20030236745 | Hartsell et al. | Dec 2003 | A1 |
20040024688 | Bi et al. | Feb 2004 | A1 |
20040044744 | Grosner et al. | Mar 2004 | A1 |
20040078419 | Ferrari et al. | Apr 2004 | A1 |
20040148382 | Narad et al. | Jul 2004 | A1 |
20040186844 | Muhlestein | Aug 2004 | A1 |
20040226010 | Suorsa | Nov 2004 | A1 |
20040230795 | Armitano et al. | Nov 2004 | A1 |
20050138204 | Iyer et al. | Jun 2005 | A1 |
20050198238 | Sim et al. | Sep 2005 | A1 |
20050203881 | Sakamoto et al. | Sep 2005 | A1 |
20050226153 | Scifres et al. | Oct 2005 | A1 |
20050251500 | Vahalia et al. | Nov 2005 | A1 |
20060195616 | Petersen | Aug 2006 | A1 |
20080027746 | Exall et al. | Jan 2008 | A1 |
20080066151 | Thomsen et al. | Mar 2008 | A1 |
Number | Date | Country |
---|---|---|
0 903 901 | Mar 1999 | EP |
1100001 | May 2001 | EP |
2004-523820 | Jun 2002 | JP |
2004523820 | Aug 2006 | JP |
WO 9739399 | Oct 1997 | WO |
WO 9749252 | Dec 1997 | WO |
PCTUS0146688 | Nov 2001 | WO |
WO 0244862 | Jun 2002 | WO |
WO 02095588 | Nov 2002 | WO |
Number | Date | Country | |
---|---|---|---|
20040230795 A1 | Nov 2004 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10010959 | Nov 2001 | US |
Child | 10777418 | US | |
Parent | 09728701 | Dec 2000 | US |
Child | 10010959 | US |