The invention relates to supervising a data storage system.
Computer systems may include different resources used by one or more host processors. Resources and host processors in a computer system may be interconnected by one or more communication connections. These resources may include, for example, data storage devices such as those included in the data storage systems manufactured by EMC Corporation. These data storage systems may be coupled to one or more servers or host processors and provide storage services to each host processor. Multiple data storage systems from one or more different vendors may be connected and may provide common data storage for one or more host processors in a computer system.
A host processor may perform a variety of data processing tasks and operations using the data storage system. For example, a host processor may perform basic system I/O operations in connection with data requests, such as data read and write operations.
Host processor systems may store and retrieve data using a storage device containing a plurality of host interface units, disk drives, and disk interface units. The host systems access the storage device through a plurality of channels provided therewith. Host systems provide data and access control information through the channels to the storage device and the storage device provides data to the host systems also through the channels. The host systems do not address the disk drives of the storage device directly, but rather, access what appears to the host systems as a plurality of logical disk units. The logical disk units may or may not correspond to the actual disk drives. Allowing multiple host systems to access the single storage device unit allows the host systems to share data in the device. In order to facilitate sharing of the data on the device, additional software on the data storage systems may also be used.
Computers, computer networks, and other computer-based systems are becoming increasingly important as part of the infrastructure of everyday life. Networks are used for sharing peripherals and files. In such systems, complex components are the most common sources of failure or instability. The proliferation of multiple interacting components leads to problems that are difficult or impossible to predict or prevent. The problems are compounded by the use of networks, which introduce the added complexity of multiple machines interacting in obscure and unforeseen ways.
Additionally, the need for high performance, high capacity IT systems is driven by several factors. In many industries, critical IT applications require outstanding levels of service. At the same time, the world is experiencing an information explosion as more and more users demand timely access to a huge and steadily growing mass of data including high quality multimedia content. The users also demand that information technology solutions protect data and perform under harsh conditions with minimal data loss and minimum data unavailability. Computing systems of all types are not only accommodating more data but are also becoming more and more interconnected, raising the amounts of data exchanged at a geometric rate.
To address this demand, modem data storage systems (“storage systems”) are put to a variety of commercial uses. For example, they are coupled with host systems to store data for purposes of product development, and large storage systems are used by financial institutions to store critical data in large databases. For many uses to which such storage systems are put, it is highly important that they be highly reliable so that critical data is not lost or unavailable.
It will be appreciated that different tasks may be performed in connection with data storage systems. For example, software may be executed on the data storage systems in connection with performing data storage administration tasks such as for data storage configuration, management, and/or monitoring. The monitoring of data storage systems may include analyzing the health of the data storage system and investigating the cause of a failure in the data storage system when the data storage system fails to function successfully. The monitoring of the data storage system may be performed by services such as a data collection service, a configuration service and a performance analyzer service. For example, the data collection service may gather logs of the storage system (“system logs”), snapshots of the data storage system's memory, configuration and status information of the data storage system. It will be appreciated in case of a failure in such storage systems, it is useful that the information is investigated in a timely manner for preventing further failures in such storage systems.
However, it is also useful that potential problems in connection with data storage systems are identified before a failure or crisis situation arises. If a crisis situation is left to develop it may be too late to rectify the problem in the data storage system without causing disruption and inconvenience to the users of the data storage system.
A method and system for use in supervising a data storage system is disclosed. The status of an operational feature in the data storage system is monitored. A human perceptible signal is outputted in response to monitoring the status of the operational feature in the data storage system. The data storage system outputs the human perceptible signal locally thereto indicating the current status of the operational feature in the data storage system.
Features and advantages of the present invention will become more apparent from the following detailed description of exemplary embodiments thereof taken in conjunction with the accompanying drawings in which:
The invention can be implemented in numerous ways, including as a process, an apparatus, a system, a computer program embodied on a computer readable storage medium, and/or a processor, such as a processor configured to execute instructions stored on and/or provided by a memory coupled to the processor. In this specification, the implementations, or any other form that the invention may take, may be referred to as techniques. In general, the order of the steps of disclosed processes may be altered within the scope of the invention. Unless stated otherwise, a component such as a processor or a memory described as being configured to perform a task may be implemented as a general component that is temporarily configured to perform the task at a given time or a specific component that is manufactured to perform the task. As used herein, the term ‘processor’ refers to one or more devices, circuits, and/or processing cores configured to process data, such as computer program instructions.
A detailed description of one or more embodiments of the invention is provided below along with accompanying figures that illustrate the principles of the invention. The invention is described in connection with such embodiments, but the invention is not limited to any embodiment. The scope of the invention is limited only by the claims and the invention encompasses numerous alternatives, modifications and equivalents. Numerous specific details are set forth in the following description in order to provide a thorough understanding of the invention. These details are provided for the purpose of example and the invention may be practiced according to the claims without some or all of these specific details. For the purpose of clarity, technical material that is known in the technical fields related to the invention has not been described in detail so that the invention is not unnecessarily obscured.
Described below are techniques for use in supervising a data storage system. The techniques can perform service actions and system supervision without the necessity of having to connect to a data storage system via a common user interface such as a command-line interface (hereinafter “CLI”) or a graphical user interface (hereinafter “GUI”). The techniques can allow an administrator or user to supervise the data storage system by quickly looking at the front or back panel of the storage processor or disk rack. The techniques may be used not only by IT specialists but other personnel with access to the data storage system. Advantageously, the techniques can allow for early prevention of problems or issues.
The techniques may comprise use of a panel of visual indicators, which might be either built-in, located on front or back side of the storage processor, or may be implemented as removable module using one of standard interfaces, for example, USB. The panel of visual indicators may be implemented as a small LCD, a set of LED colored lamps, semi-segment indicators, or a combination of the above. The panel of visual indicators may indicate, for example, the amount/percentage of disk space used, CPU load, network load, temperature or other important internal system-specific values, warnings, alarms. A visual representation may be augmented with an audio or sound signal or notification, for example, a simple sound alarm, or voice for a case where specific attention is needed. The meaning of the indicators may be assigned programmatically by system developer/IT/user via some additional interface or settings file. As a result, the potential problems or issues may be detected at an early stage since the indicators can attract attention more easily than conventional approaches.
For example, conventional system alarms may not be triggered to notify a user when the amount of disk space moves closer to the “red zone” or danger zone. However, the techniques as disclosed herein may attract the attention of users or service personnel. If the techniques as described herein can identify a potential problem or issue it can be easily noted by a non-expert and may be reported for analysis to proper service personnel. Accordingly, the techniques can present an ability to prevent possible future failures of the data storage system significantly conserving resources and time.
It will be known by those skilled in the art that the control and supervision of computational centers and labs is very critical but traditional approaches and machinery can sometimes be not enough. On a working data storage system, the current status may be viewed remotely or via special system control panel connected to the rack. This may require the usage of specific software for status displaying, control and automated issue triggering/reporting. The conventional systems may have no indication of the status of, for example, the disc volume allocated, current CPU load, current network load, CPU temperature, system alarms etc. The status could not be viewed by service personnel without logging-in to the system. It will be appreciated that widely used system event alarm notifications like SMS/e-mail or other triggers may not be enough for early issue diagnostics. The amount of free disk space in storage and its current productivity can remain out of sight to the experts up to the moment of a “crisis” situation.
It will be appreciated that the techniques as described herein may only require a quick glance on the indicator panel or interface to ascertain the status. For small range systems, the techniques as described herein can operate with minimum supervision of IT specialists. As a result, the potential problems or issues may be addressed and prevented at an early stage.
Referring to
Each of the host systems 14a-14n and the data storage system 12 included in the system 10 may be connected to the communication medium 18 by any one of a variety of connections as may be provided and supported in accordance with the type of communication medium 18. Similarly, the management system 16 may be connected to the communication medium 20 by any one of variety of connections in accordance with the type of communication medium 20. The processors included in the host computer systems 14a-14n and management system 16 may be any one of a variety of proprietary or commercially available single or multi-processor system, such as an Intel-based processor, or other type of commercially available processor able to support traffic in accordance with each particular embodiment and application.
It should be noted that the particular examples of the hardware and software that may be included in the data storage system 12 are described herein in more detail, and may vary with each particular embodiment. Each of the host computers 14a-14n, the management system 16 and data storage system may all be located at the same physical site, or, alternatively, may also be located in different physical locations. In connection with communication mediums 18 and 20, a variety of different communication protocols may be used such as SCSI, Fibre Channel, iSCSI, and the like. Some or all of the connections by which the hosts, management system, and data storage system may be connected to their respective communication medium may pass through other communication devices, such as a Connectrix or other switching equipment that may exist such as a phone line, a repeater, a multiplexer or even a satellite. In one embodiment, the hosts may communicate with the data storage system over an iSCSI or a Fibre Channel connection and the management system may communicate with the data storage systems over a separate network connection using TCP/IP. It should be noted that although
Each of the host systems may perform different types of data operations in accordance with different types of tasks. In the embodiment of
The management system 16 may be used in connection with management of the data storage system 12. The management system 16 may include hardware and/or software components. The management system 16 may include one or more computer processors connected to one or more I/O devices such as, for example, a display or other output device, and an input device such as, for example, a keyboard, mouse, and the like. A data storage system manager may, for example, view information about a current storage volume configuration on a display device of the management system 16, provision data storage system resources, and the like.
In one embodiment, the data storage system 12 of
In connection with an embodiment in which the data storage system 12 is an appliance including hardware and software, the appliance may also include other software for performing different data services. For example, the appliance may include a data collection service which interacts with software on the hosts 14a-14n when performing a data collection operation on the data storage system 12.
In another embodiment, the data storage system 12 may include one or more data storage systems such as one or more of the data storage systems offered by EMC Corporation of Hopkinton, Mass. Each of the data storage systems may include one or more data storage devices, such as disks. One or more data storage systems may be manufactured by one or more different vendors. Each of the data storage systems included in 12 may be inter-connected (not shown). Additionally, the data storage systems may also be connected to the host systems through any one or more communication connections that may vary with each particular embodiment and device in accordance with the different protocols used in a particular embodiment. The type of communication connection used may vary with certain system parameters and requirements, such as those related to bandwidth and throughput required in accordance with a rate of I/O requests as may be issued by the host computer systems, for example, to the data storage systems 12. It should be noted that each of the data storage systems may operate stand-alone, or may also be included as part of a storage area network (SAN) that includes, for example, other components such as other data storage systems.
In such an embodiment in which the data storage system 12 of
Servers or host systems, such as 14a-14n, provide data and access control information through channels to the one or more storage systems, and the storage systems may also provide data to the host systems also through the channels. The host systems may not address the disk drives of the storage systems directly, but rather access to data may be provided to one or more host systems from what the host systems view as a plurality of logical devices or logical volumes (LVs). The LVs may or may not correspond to the actual disk drives. For example, one or more LVs may reside on a single physical disk drive. Data in a single storage system may be accessed by multiple hosts allowing the hosts to share the data residing therein. An LV or LUN (logical unit number) may be used to refer to the foregoing logically defined devices or volumes.
Referring to
Referring to
Each link control card 21 includes a primary port 22 and an expansion port 24. These ports are used to link the storage enclosures together on a single communication loop FC-AL. A cable 26 may come from a host or from another storage system, and plugs into the primary port 22. The communication loop FC-AL extends from the primary port 22, is coupled to the disk drives 17, and continues out the expansion port 24. A cable 28 couples the expansion port 24 of a first storage enclosure 15 to the primary port 22 of a second storage enclosure 15. All the storage enclosures 15 are interconnected in this manner in a daisy chain to form the communication loop FC-AL. Thus, all the disk drives 17 are interconnected on the same communication loop FC-AL. Each link control card 21 is capable of controlling all the disks 17 in a given enclosure. The data storage system 12 may also include a peripheral device connector 29 that allows stand-alone peripheral devices to be connected thereto.
Referring to
Referring to
Referring to
While the above techniques have been described with respect to outputting a human perceptible signal such as a visual signal on an interface the signal may be outputted on an LED. The techniques may also be configured to output the signal on a LED with different colors depending on the importance or severity of the potential problem associated with the operational feature in the data storage system. For example, if the problem is particularly serious the signal may be outputted using a different color. As an example, if the data usage or allocation in connection with a data storage device was approaching maximum usage or allocation then the color may be, for example, red identifying a serious problem.
While the visual signals have been described as been outputted in numerical and/or graphical form, it will be appreciated that it may be outputted in many other forms. For example, the visual signals may be outputted as bars in the same manner as a thermometer.
While the techniques have been described in some examples as outputting the human perceptible signal indicating the amount of storage usage or storage allocation in connection with a data storage device, it will be appreciated by those skilled in the art that the techniques may output the amount of data storage usage or allocation in connection with all the data storage devices in the data storage system. The above techniques describe examples only.
While it has been described above with respect to
While the invention has been disclosed in connection with preferred embodiments shown and described in detail, their modifications and improvements thereon will become readily apparent to those skilled in the art. Accordingly, the spirit and scope of the present invention should be limited only by the following claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/RU2011/000953 | 12/2/2011 | WO | 00 | 3/5/2012 |