The invention relates to managing data processing system configurations. More specifically, the invention relates to managing system configurations in a network storage system.
Large-scale data processing systems frequently rely on banks of similar or identical machines to provide infrastructure services, rather than requiring each computer in the system to operate autonomously by including all of its own infrastructure. For example, data storage functionality is often consolidated into a server or data server array; this can facilitate efficient backup strategies and permit an arbitrary number of data processing systems to access and operate on a common data set. (An alternative to this arrangement might involve providing some of the overall data storage space required at each processing system, but this approach may complicate application design by requiring software to distinguish between data stored locally and data stored on other peer systems. Successful backup strategies may be more difficult to implement, and the failure of an individual system may impact the work of other systems that needed access to the data stored on the failed system.)
Of course, shifting an infrastructure function such as data storage onto a bank of special-purpose machines gives rise to a different set of challenges, even as it alleviates some difficulties of managing a large-scale system. Important among these challenges is the task of configuring the bank of machines so that each operates as intended to provide the infrastructure function. Even special-purpose machines that only provide a limited range of services may have complex configuration requirements, particularly when the machines are installed at diverse geographical locations to provide redundancy and/or to take advantage of services or facilities with locality-dependent aspects.
Groups of servers that are to work together to provide a basic data processing service with redundancy and high availability may require consistent and coordinated, but not identical, configurations. As a simple example, consider two data storage servers (e.g., “fileservers”) that are to store data for client systems. These servers may need identical configurations to control access from remote clients, but they may need different network communication configurations because they are connected to a distributed data network through two different circuits.
Current system management procedures generally take an ad hoc approach to managing similar systems. For example, an administrator may keep copies of various generic configurations in a library, and prepare a configuration for a new system based on the closest generic configuration. However, once a generic configuration is customized and deployed, there is often no way to update a common parameter in all the configurations of servers in the farm without connecting to each machine in turn and making the modification—a time-consuming and error-prone task.
Therefore, a better method of configuring and managing many similar (but not necessarily identical) data processing systems efficiently may be of value in this field.
Some embodiments of the invention obtain data processing system configurations, each configuration to include an element to control access from a remote client to a storage facility of a data processing system, and compare the configurations. Configuration differences detected during the comparison may be reported to a system administrator or management center for further investigation and possible correction. Other embodiments obtain a base configuration for a data processing system and override some elements of the base configuration to produce a locally-adapted configuration that may be used to control interactions between the data processing system and a data storage client. Still other embodiments permit attributes to be attached to elements of a data processing system configuration, the attributes to control the subsequent use of the configuration.
Embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment in this disclosure are not necessarily to the same embodiment, and such references mean “at least one.”
As described in greater detail below, the solution introduced here provides methods and apparatus to improve the accuracy and efficiency of configuring a large number of similar systems. For simplicity, these techniques are often described in scenarios limited to a small number of systems, but it will be clear to those of skill in the art that extending the ideas to an arbitrarily large number of systems is straightforward and proportionately beneficial.
If the client's request is to retrieve information, the data flow proceeds in the reverse direction: data fragments 358 may be retrieved from the array of mass storage devices 113 and put together 348 to prepare a client reply 338. The reply may be encapsulated within a protocol-compliant packet 328, then placed in an “envelope” 318 for transmission over the distributed data network to client 160.
The subsystems shown in
A basic configuration management tool might include the elements shown in
The configuration management tool, which may run on a management workstation 180, presents a consolidated view of configuration elements for many different subsystems and interact with a system administrator to select appropriate configuration values. When the administrator is satisfied with the configuration, export logic 470 writes the configuration elements back to the appropriate locations (i.e. the same configuration files, startup scripts, and registry entries that the elements came from in the first place) and in the appropriate formats. For example, a text-based configuration file may have lines of “keys” and “values,” or may be formatted as a hierarchical extensible Markup Language (“XML”) file. The configuration management tool may provide a button or control to reset the newly-reconfigured system or otherwise make the new configuration effective.
Some embodiments may use XML as a common or intermediate format (i.e. the import logic may produce an XML document containing configuration elements drawn from various sources, and the export logic may convert an XML document back into native formats such as text files or database records). XML is a widely-adopted text-based format that permits hierarchical data to be represented effectively for convenient machine analysis and processing.
The tool described with reference to
A second data processing system configuration is also retrieved (520). The second configuration may come from a second remote system or from a data repository containing a previously-stored copy of the first system's configuration.
Next, the two configurations are compared, element by element (530) to detect differences between the configurations. Some differences between the configurations may be expected (for example, when two similar but not identical servers' configurations are compared, their hostnames and Internet Protocol (“IP”) addresses will probably differ). An embodiment may maintain a list of identifiers of configuration elements that are expected to be different, and remove expected differences from the list of all differences found (540), leaving only unexpected differences. Finally, the differences detected (except for those that may have been filtered out in 540) are reported to a user (550). Reports may list all differences or only unexpected differences in a plain text format, in XML or HTML, or in a database-friendly format such as comma-separated values (“CSV”) or records for the Excel™ database by Microsoft Corporation of Redmond, Wash. (“XLS records”).
An embodiment of the invention may repeat this procedure on a configurable schedule to produce a periodic report of changes between a system's actual configuration and its expected configuration. Differences may be reported by printing them as a list, sending them as electronic mail, or generating a notification event at a network management station. Periodic configuration monitoring may help guard against inadvertent or malicious tampering with the configurations of a plurality of servers that may be detected as unexpected configuration element differences. For example, an unexpected change in the identities of users who are authorized to administer the server, or an unexpected disk partition allocation, may indicate a security breach.
Another embodiment of the invention can assist in deploying similar (but not necessarily identical) configurations to a number of data processing systems. As shown in
The “override” logic described above with reference to
Now, the administrator is asked to enable a new feature on one of the servers so that it can be evaluated before being enabled for all ten servers. A configuration change to enable the feature can be entered and stored at the management workstation, and the change can be marked to override the generic configuration (which leaves the new feature disabled) for only one of the ten servers. Subsequently, only that server will receive the non-standard configuration, and only that server will activate the feature. Later, if the evaluation period is successful, the change can be set to override the generic configuration for all ten servers, or the generic configuration itself can be modified to enable the feature.
Note that the preceding example involves various actions by a system administrator only to explain a possible usage scenario and the motivations behind it. Embodiments of the invention can operate without human interaction.
Configuration elements in a generic configuration may contain a flag or permission indicator to control whether they are allowed to be changed or overridden by override logic on a server system. In some embodiments, several layers of permissions may be implemented, so that a generic configuration may be distributed to a regional data center containing a plurality of server systems (e.g., storage servers), and a first group of generic configuration elements may be overridden to adapt the configuration to the common needs of the servers at the regional data center. Then, the region-appropriate configuration may be forwarded to individual servers, which may apply server-specific overrides to a different (though possibly overlapping) set of configuration elements.
In a manner similar to the periodic configuration retrieval and check described with reference to
Once the manager is satisfied with the configuration displayed by the software program, he may activate control 793 to cause the configuration to be pushed to one or more remote systems, or control 796 to cause the configuration to be saved for later reference. A scheduling control or menu selection (not shown) may lead to an interface screen such as that shown in
An embodiment of the invention may be a machine-readable medium having stored thereon instructions which cause a programmable processor to perform operations as described above. In other embodiments, the operations might be performed by specific hardware components that contain hardwired logic. Those operations might alternatively be performed by any combination of programmed computer components and custom hardware components.
A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer), including but not limited to Compact Disc Read-Only Memory (CD-ROMs), Read-Only Memory (ROMs), Random Access Memory (RAM) and Erasable Programmable Read-Only Memory (EPROM).
The term “logic,” as used herein, can include, for example, hardwired circuitry, programmable circuitry (e.g. Field-Programmable Gate Arrays (“FPGAs”)), software instructions executed by a programmable processor, or any combination thereof.
The applications of the present invention have been described largely by reference to specific examples and in terms of particular allocations of functionality to certain hardware and/or software components. However, those of skill in the art will recognize that flexible fileserver configuration management can also be achieved by software and hardware that distribute the functions of embodiments of this invention differently than herein described. Such variations and implementations are understood to be captured according to the following claims.
Number | Name | Date | Kind |
---|---|---|---|
6098098 | Sandahl et al. | Aug 2000 | A |
6151674 | Takatani | Nov 2000 | A |
6513115 | Nock et al. | Jan 2003 | B2 |
6543004 | Cagle et al. | Apr 2003 | B1 |
6601066 | Davis-Hall | Jul 2003 | B1 |
6792461 | Hericourt | Sep 2004 | B1 |
7415509 | Kaltenmark et al. | Aug 2008 | B1 |
7516218 | Besson | Apr 2009 | B2 |
7783800 | Staats et al. | Aug 2010 | B2 |
7912916 | Rakowski et al. | Mar 2011 | B2 |
20030069946 | Nair et al. | Apr 2003 | A1 |
20030105881 | Symons et al. | Jun 2003 | A1 |
20040003266 | Moshir et al. | Jan 2004 | A1 |
20040044763 | Besson | Mar 2004 | A1 |
20040117358 | von Kaenel et al. | Jun 2004 | A1 |
20050256871 | Semerdzhiev | Nov 2005 | A1 |
20060104220 | Yamazaki et al. | May 2006 | A1 |
20060112135 | Warshawsky | May 2006 | A1 |
20060179116 | Speeter et al. | Aug 2006 | A1 |
20060195575 | Delany et al. | Aug 2006 | A1 |
20060282479 | Johnson et al. | Dec 2006 | A1 |
20070168721 | Luiro et al. | Jul 2007 | A1 |
20070198609 | Black et al. | Aug 2007 | A1 |
20080028048 | Shekar CS et al. | Jan 2008 | A1 |
20080052391 | Rogers et al. | Feb 2008 | A1 |
Number | Date | Country |
---|---|---|
0 952 521 | Oct 1999 | EP |
0 957 617 | Nov 1999 | EP |
WO 0171979 | Sep 2001 | WO |
WO 2005088473 | Sep 2005 | WO |
Number | Date | Country | |
---|---|---|---|
20080028048 A1 | Jan 2008 | US |