The present disclosure relates to data storage in a Virtual Storage Area Network (VSAN). More particularly, but not exclusively, the present disclosure relates to effective utilization of data storage units for storing data in the VSAN.
A storage area network (SAN) is a dedicated high-speed network for interconnecting and presenting a shared pool of storage devices to multiple servers. SAN is primarily used to enhance performance of the storage devices as they appear as a single locally attached device to the servers, where the servers are centrally managed. Virtual Storage Area Network (VSAN) is a logical partition in the SAN. The VSAN allows data traffic of the servers to be isolated within specific portions of SAN so that system of storage devices is scaled out and is easy to configure. The main objective of the VSAN is the easy management of subscribers which can be added or relocated without the need for changing the physical layout of storage. Another objective of the VSAN is to provide data redundancy which minimizes the risk of data loss.
A conventional VSAN storage array includes a brick as a basic building block. The brick of the VSAN architecture comprises two storage controllers for receiving Input/Output (I/O) request from host machines connected to the bricks. The brick further comprises a Disk Array Enclosure (DAE) for holding multiple Solid State Drives (SSD) for storing data, and backup units. Each storage controller has multi-core Central Processing Units (CPUs) for processing I/O tasks. In the conventional architecture of VSAN, the bricks may be clustered together to increase performance and capacity of the system. A cluster in VSAN may include two or more bricks. When storage space in a brick of the cluster is utilized up to a threshold, an alert is sent to a user to scale out the system by including additional bricks in the system. For example, the threshold may be 90% of total storage available in the brick. Thus, the user has to manually instruct the system for including additional bricks in the system. Also, inclusion of additional bricks in the system may cost substantial amount of money. Additionally, if the threshold is reached, a new I/O request has to wait in a process queue to be processed, hence increasing latency to process the request.
Furthermore, the existing VSAN architecture only alerts the user, and does not provide automatic inclusion of bricks to the VSAN network, i.e. in the existing architecture, there does not exist a mechanism to route the data to different a secondary data storage unit when primary data storage unit is overloaded. Therefore, the bricks of the VSAN network is not fully utilized. Also, in the conventional VSAN architecture, an I/O request cannot be passed to different bricks of the VSAN. Thus, processing load of a brick is not efficiently managed.
The information disclosed in this background of the disclosure section is only for enhancement of understanding of the general background of the invention and should not be taken as an acknowledgement or any form of suggestion that this information forms the prior art already known to a person skilled in the art.
In an embodiment, the present disclosure presents a method for storing data in a data storage unit of a Virtual Storage Area Network (VSAN). The method comprises receiving, by a server management system, a request message for storing data, from an array controller of a first node among a plurality of nodes of a Virtual Storage Area Network (VSAN) cluster, where each of the plurality of nodes comprises a pair of array controllers and a data storage unit, identifying a second node from the plurality of nodes, for storing the data in a data storage unit of the second node and routing the data to an array controller of the second node, for storing the data in the data storage unit, for effectively utilizing the data storage unit of the VSAN network.
In one embodiment, the present disclosure presents a system for storing data in a data storage unit of a Virtual Storage Area Network (VSAN). The system comprises a processor and a memory. The processor is configured to receive a request message for storing data, from an array controller of a first node among a plurality of nodes of a Virtual Storage Area Network (VSAN) cluster, wherein each of the plurality of nodes comprises a pair of array controllers and a data storage unit, identify a second node from the plurality of nodes, for storing the data in a data storage unit of the second node and route the data to an array controller of the second node, for storing the data in the data storage unit, for effectively utilizing the data storage unit of the VSAN network.
The foregoing summary is illustrative only and is not intended to be in any way limiting. In addition to the illustrative aspects, embodiments, and features described above, further aspects, embodiments, and features will become apparent by reference to the drawings and the following detailed description.
The novel features and characteristic of the disclosure are set forth in the appended claims. The disclosure itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying figures. One or more embodiments are now described, by way of example only, with reference to the accompanying figures wherein like reference numerals represent like elements and in which:
It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative systems embodying the principles of the present subject matter. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudo code, and the like represent various processes which may be substantially represented in computer readable medium and executed by a computer or processor, whether or not such computer or processor is explicitly shown.
In the present document, the word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment or implementation of the present subject matter described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments.
While the disclosure is susceptible to various modifications and alternative forms, specific embodiment thereof has been shown by way of example in the drawings and will be described in detail below. It should be understood, however that it is not intended to limit the disclosure to the particular forms disclosed, but on the contrary, the disclosure is to cover all modifications, equivalents, and alternative falling within the scope of the disclosure.
The terms “comprises”, “comprising”, or any other variations thereof, are intended to cover a non-exclusive inclusion, such that a setup, device or method that comprises a list of components or steps does not include only those components or steps but may include other components or steps not expressly listed or inherent to such setup or device or method. In other words, one or more elements in a system or apparatus proceeded by “comprises . . . a” does not, without more constraints, preclude the existence of other elements or additional elements in the system or apparatus.
In an embodiment, the present disclosure discloses a server management system for storing data in a data storage unit of a Virtual Storage Area Network (VSAN). The server management system receives a request message from a first node of a VSAN, for storing data, when a data storage unit of the first node cannot accommodate the data. Then, the server management system identifies a second node among plurality of nodes of the VSAN for storing the data in data storage unit of the second node. Thus, the present disclosure provides a system for utilizing data storage unit of the VSAN.
In an embodiment, the host device 104 may be a Personal Digital Assistant (PDA), a laptop, a Personal Computer (PC), a smartphone or any other computing unit.
In an embodiment, for processing an I/O request, either the first array controller 2011 or the second array controller 2012, takes control of the data storage unit 208. Either of the first array controller 201, or the second array controller 2012 help in storing the data in the data storage unit 208. For illustration, let us consider that the first array controller 2011 receives the I/O request from the host device 104 through the switch 103, for storing the data in the data storage unit 208. The SCSI driver 2021 facilitates data communication, within the first array controller 2011, with the switch 103 and with the server management system 102. The SCSI driver 2021 receives the one or more I/O request messages from the switch 103. Further, one of, the routing modules 2031 and the routing module 2032 receives the one or more I/O request messages from the SCSI driver 2021. Let us consider that the routing module 2031 receives the one or more request messages from the SCSI driver 2021. The routing module 2031 then routes the data and the address present in the one or more I/O request messages to corresponding control module 2041. The control module 2041 converts the address to an internal address. Then, the control module 2041 forwards the data to corresponding data module 2051 based on the internal address. The data module 2051 comprises a mapping of the internal address with corresponding physical address of the data storage unit 208 where the data is intended to be stored. The data module 2051 stores the data in the physical address of the data storage unit 208 based on the mapping.
In an embodiment, the data storage unit 208 of each of the plurality of nodes 101 is associated with a corresponding predefined threshold memory. The predefined threshold memory is set on total memory of respective data storage unit 208.
In an embodiment, the data storage unit 208 comprises a counter. The value of the counter increases based on amount of data stored in the data storage unit 208. The kernel 2061 monitors the data storage unit 208 for available memory based on the predefined threshold. The kernel 2061 configures the SCSI driver 2021 to forward the one or more I/O requests to the server management system 102 when the available memory in the data storage unit 208 is less than the predefined threshold memory. Also, the kernel 2061 configures the SCSI driver 2021 to transmit available memory in the data storage unit 208 to the server management system 102 at predefined intervals of time.
In an embodiment, data 404 may be stored within the memory 402. The data 404 may include, for example, I/O request data 405, first node data 406, second node data 407 and other data 408.
In an embodiment, I/O request data 404 may include data to be stored and a request message for storing the data. In an embodiment, the first node data 406 and the second node data 407 may include I/O request message type, available memory in data storage unit 208 of the first node 101A and the second node 101B. In an embodiment, the I/O request message type may include a read request, a write request, request for editing, etc.
In an embodiment, the data 404 in the memory 402 is processed by modules 409 of the server management system 102. As used herein, the term module refers to an application specific integrated circuit (ASIC), an electronic circuit, a field-programmable gate arrays (FPGA), Programmable System-on-Chip (PSoC), a combinational logic circuit, and/or other suitable components that provide the described functionality. The said modules 409 when configured with the functionality defined in the present disclosure will result in a novel hardware.
In one implementation, the modules 409 may include, for example, receiving module 410, node identification module 411, generation module 412, mapping module 413, routing module 414 and other modules 415. It will be appreciated that such aforementioned modules 409 may be represented as a single module or a combination of different modules.
In an embodiment, the receiving module 410 receives the one or more I/O request messages from the SCSI driver 2021 of the first node when available memory in the data storage unit 208 of the first node is less than the predefined threshold memory. Let the data storage unit 208 of the first node 101A be represented as first data storage unit for illustration of this embodiment. The receiving module 410 may store the one or more I/O request messages in the CRU 301. The receiving module 410 may communicate with the SCSI driver 2021 through wired interface or wireless interface.
In an embodiment, the node identification module 411 identifies a second node 101B among the plurality of nodes 101 for storing the data. Here, the second node is identified based on size of the data and available memory in the data storage unit 208 of the second node 101B. Let the data storage unit 208 of the second node be represented as second data storage unit for illustration of this embodiment. In an embodiment, the node identification module 411 receives data regarding available memory in data storage unit 208 from each of the plurality of nodes 101. In an embodiment, the node identification module 411 receives the data regarding available memory in data storage unit 208 from each of the plurality of nodes 101 at predefined intervals of time. Further, the node identification module 411 identifies a second node 101B among the plurality of nodes 101 such that available memory in the second data storage unit is greater than the corresponding predefined threshold memory. Further, the second node 101B is identified when the size of the data can be accommodated in the available memory in the second data storage unit 208.
In an embodiment, the generation module 412 generates an ID and address for the first node 101A and the second node 101B, when the server management system 102 receives the one or more I/O request messages from the first node 101A. The generation module 412 generates the ID based on cluster of the first node 101A and second node 101B, the array controller 201 used for processing the one or more I/O request messages and the routing module 203 of the array controller 201. Further, the generation module 412 generates an address for the first node 101A and the second node 101B. The address for the first node 101A is generated based on the I/O request message type and ID of the first node 101A. Likewise, the generation module generates address for the second node 101B based on the I/O request message type and ID of the second node 101B. In an embodiment, the ID and address for each of the plurality of nodes 101 is unique. In an embodiment, the mapping module 413 forms a part of the MRU 302.
In an embodiment, a cluster indicates a group of nodes. In an embodiment, the first node 101A and the second node 101B may be a part of a same cluster or may be a part of different clusters. The VSAN 100 comprises plurality of clusters, each cluster comprising plurality of nodes 101.
In an embodiment, the mapping module 413 receives the address of the first node 101A and the second node 101B and maps the address of the first node 101A with the address of the second node 101B. The mapping indicates a source node and a corresponding destination node. Thus, the mapping module 413 generates a map of all source nodes requesting the server management system 102 for storing the data, and corresponding destination nodes. In an embodiment, the mapping module 413 forms a part of the MRU 302.
In an embodiment, the routing module 414 routes the data present in the CRU 301 to the second node 101B based on the map generated by the mapping module 413. Here, the routing module 414 routes the data to SCSI driver 202B of the second node 101B for storing the data in the data storage unit 208B.
In an embodiment, the other modules 415 may include notification module, node monitoring module, etc. In an embodiment, the notification module may notify administrator of the VSAN 100 regarding available memory of data storage unit 208 of each of the plurality of nodes 101. In an embodiment, the server management system 102 may include a monitoring unit for monitoring available memory in the data storage unit 208 of each of the plurality of nodes 101.
From Table 1, the source ID indicates that the first node is from cluster 2, the one or more I/O request messages are processed by the first array controller 201A1 and the routing module 203A1 has received the one or more I/O request messages. Further, the source address indicates the one or more I/O request messages type, the array controller 201 used to process the one or more I/O request messages. Likewise, the ID and address of the second node 101B indicates that the second node 101B lies in cluster 1. Further, the array controller 201B and the routing module 203B1 should be used for storing the data in the data storage unit 208B. Then, the routing module 414 routes the data stored in the CRU 301 to the second node 101B based on the mapping. The SCSI driver 202B1 of the second node 101B receives the data and forwards the data to the routing module 203B1 of the first array controller 201B. The routing module 203B1 routes the data to corresponding control module 204B1 and in turn the control module 204B1 routes the data to corresponding data module 205B1. Then, the data module 205B1 stores the data in the data storage unit 208B.
As illustrated in
The order in which the method 600 and 603 is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement the method. Additionally, individual blocks may be deleted from the methods without departing from the spirit and scope of the subject matter described herein. Furthermore, the method can be implemented in any suitable hardware, software, firmware, or combination thereof.
At step 601, the receiving module 410 receives one or more I/O request messages from the first node 101A of the VSAN 100. Here, the receiving module 410 receives the one or more I/O request messages when the available memory in the data storage unit 208A is less than the predefined threshold memory. Further, the receiving module stores the data present in the one or more I/O request messages in the CRU 301.
At step 602, the node identification module 411 identifies the second node 101B from the plurality of nodes 101. In an embodiment, the node identification module 411 receives the data regarding available memory in data storage unit 208 from each of the plurality of nodes 101. The second node 101B is identified such that the available memory in the data storage unit 208B is greater than the corresponding threshold memory. Also, the available memory can accommodate the size of the data.
At step 603, the routing module 414 routes the data to the second node 101B for storing the data in the data storage unit 208B. The step 603 can be further divided into 603A, 603B and 603C.
At step 603A, the generation module 412 generates an ID and address for the first node 101A. Here, the ID of the first node 101A indicates the cluster of the first node 101A, the array controller 201 used for processing the one or more I/O request messages and the routing module 203 used to route the one or more I/O request messages. Further, the address is generated based on the ID of the first node 101A. The address comprises one or more I/O request messages type and the array controller 201 which has processed the one or more I/O request messages.
At step 603B, the generation module 412 generates an ID and address for the second node 101B. Here, the ID of the second node 101B indicates the cluster of the second node 101B, the array controller 201 to be used for processing the one or more I/O request messages and the routing module 203 to be used to route the one or more I/O request messages for storing the data on the data storage unit 208B. Further, the address is generated based on the ID of the second node 101B. The address comprises one or more I/O request messages type and the array controller 201 which has to be used to process the one or more I/O request messages.
At step 603C, the mapping module 413 maps the address of the first node 101A with the address of the second node 101. The map generated provides a link between the first node 101A and the second node 101B. Thus, the link can be used to track the data which was intended to be stored in the first node 101A, but is stored in the second node 101B.
Referring back to step 603 of
The processor 702 may be disposed in communication with one or more input/output (I/O) devices (not shown) via I/O interface 701. The I/O interface 701 may employ communication protocols/methods such as, without limitation, audio, analog, digital, monoaural, RCA, stereo, IEEE-1394, serial bus, universal serial bus (USB), infrared, PS/2, BNC, coaxial, component, composite, digital visual interface (DVI), high-definition multimedia interface (HDMI), RF antennas, S-Video, VGA, IEEE 802.n/b/g/n/x, Bluetooth, cellular (e.g., code-division multiple access (CDMA), high-speed packet access (HSPA+), global system for mobile communications (GSM), long-term evolution (LTE), WiMax, or the like), etc.
Using the I/O interface 701, the computer system 700 may communicate with one or more I/O devices. For example, the input device 710 may be an antenna, keyboard, mouse, joystick, (infrared) remote control, camera, card reader, fax machine, dongle, biometric reader, microphone, touch screen, touchpad, trackball, stylus, scanner, storage device, transceiver, video device/source, etc. The output device 711 may be a printer, fax machine, video display (e.g., cathode ray tube (CRT), liquid crystal display (LCD), light-emitting diode (LED), plasma, Plasma display panel (PDP), Organic light-emitting diode display (OLED) or the like), audio speaker, etc.
In some embodiments, the computer system 700 is connected to the service operator through a communication network 709. The processor 702 may be disposed in communication with the communication network 709 via a network interface 703. The network interface 703 may communicate with the communication network 709. The network interface 703 may employ connection protocols including, without limitation, direct connect, Ethernet (e.g., twisted pair 10/100/1000 Base T), transmission control protocol/Internet protocol (TCP/IP), token ring, IEEE 802.11a/b/g/n/x, etc. The communication network 709 may include, without limitation, a direct interconnection, e-commerce network, a peer to peer (P2P) network, local area network (LAN), wide area network (WAN), wireless network (e.g., using Wireless Application Protocol), the Internet, Wi-Fi, etc. Using the network interface 703 and the communication network 709, the computer system 700 may communicate with the one or more service operators.
In some embodiments, the processor 702 may be disposed in communication with a memory 705 (e.g., RAM, ROM, etc. not shown in
The memory 705 may store a collection of program or database components, including, without limitation, user interface 706, an operating system 707, web server 708 etc. In some embodiments, computer system 700 may store user/application data 706, such as the data, variables, records, etc. as described in this disclosure. Such databases may be implemented as fault-tolerant, relational, scalable, secure databases such as Oracle or Sybase.
The operating system 707 may facilitate resource management and operation of the computer system 700. Examples of operating systems include, without limitation, Apple Macintosh OS X, Unix, Unix-like system distributions (e.g., Berkeley Software Distribution (BSD), FreeBSD, NetBSD, OpenBSD, etc.), Linux distributions (e.g., Red Hat, Ubuntu, Kubuntu, etc.), IBM OS/2, Microsoft Windows (XP, Vista/7/8, 10 etc.), Apple iOS, Google Android, Blackberry OS, or the like.
In some embodiments, the computer system 700 may implement a web browser 708 stored program component. The web browser 708 may be a hypertext viewing application, such as Microsoft Internet Explorer, Google Chrome, Mozilla Firefox, Apple Safari, etc. Secure web browsing may be provided using Secure Hypertext Transport Protocol (HTTPS), Secure Sockets Layer (SSL), Transport Layer Security (TLS), etc. Web browsers 708 may utilize facilities such as AJAX, DHTML, Adobe Flash, JavaScript, Java, Application Programming Interfaces (APIs), etc. In some embodiments, the computer system 700 may implement a mail server stored program component. The mail server may be an Internet mail server such as Microsoft Exchange, or the like. The mail server may utilize facilities such as ASP, ActiveX, ANSI C++/C #, Microsoft .NET, CGI scripts, Java, JavaScript, PERL, PHP, Python, WebObjects, etc. The mail server may utilize communication protocols such as Internet Message Access Protocol (IMAP), Messaging Application Programming Interface (MAPI), Microsoft Exchange, Post Office Protocol (POP), Simple Mail Transfer Protocol (SMTP), or the like. In some embodiments, the computer system 700 may implement a mail client stored program component. The mail client may be a mail viewing application, such as Apple Mail, Microsoft Entourage, Microsoft Outlook, Mozilla Thunderbird, etc.
In one embodiment, the computer system 700 is associated with plurality of nodes 712. Each of the plurality of nodes 712 provide the computer system 500 with data related to available memory in the corresponding data storage unit 208.
In the present document, the word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment or implementation of the present subject matter described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments.
While the disclosure is susceptible to various modifications and alternative forms, specific embodiment thereof has been shown by way of example in the drawings and will be described in detail below. It should be understood, however that it is not intended to limit the disclosure to the particular forms disclosed, but on the contrary, the disclosure is to cover all modifications, equivalents, and alternative falling within the scope of the disclosure.
The terms “comprises”, “comprising”, or any other variations thereof, are intended to cover a non-exclusive inclusion, such that a setup, device or method that comprises a list of components or steps does not include only those components or steps but may include other components or steps not expressly listed or inherent to such setup or device or method. In other words, one or more elements in a system or apparatus proceeded by “comprises . . . a” does not, without more constraints, preclude the existence of other elements or additional elements in the system or apparatus.
The terms “an embodiment”, “embodiment”, “embodiments”, “the embodiment”, “the embodiments”, “one or more embodiments”, “some embodiments”, and “one embodiment” mean “one or more (but not all) embodiments of the invention(s)” unless expressly specified otherwise.
The terms “including”, “comprising”, “having” and variations thereof mean “including but not limited to”, unless expressly specified otherwise.
The enumerated listing of items does not imply that any or all of the items are mutually exclusive, unless expressly specified otherwise. The terms “a”, “an” and “the” mean “one or more”, unless expressly specified otherwise.
A description of an embodiment with several components in communication with each other does not imply that all such components are required. On the contrary a variety of optional components are described to illustrate the wide variety of possible embodiments of the invention.
When a single device or article is described herein, it will be readily apparent that more than one device/article (whether or not they cooperate) may be used in place of a single device/article. Similarly, where more than one device or article is described herein (whether or not they cooperate), it will be readily apparent that a single device/article may be used in place of the more than one device or article or a different number of devices/articles may be used instead of the shown number of devices or programs. The functionality and/or the features of a device may be alternatively embodied by one or more other devices which are not explicitly described as having such functionality/features. Thus, other embodiments of the invention need not include the device itself.
The illustrated operations of
In an embodiment, the present disclosure discloses a method and a system for distributing load to different data storage system units in a VSAN. Thus, processing efficiency of individual cluster increases.
In an embodiment, the cluster utilization is increased by storing data in data storage units of different nodes.
In an embodiment, the disclosed system and method is compatible with existing VSAN architecture. Hence, cost of implementing the disclosed system is low.
In an embodiment, the data can be stored in data storage unit of different cluster associated with the server management system, therefore the system managing memory efficiently in the VSAN.
Finally, the language used in the specification has been principally selected for readability and instructional purposes, and it may not have been selected to delineate or circumscribe the inventive subject matter. It is therefore intended that the scope of the invention be limited not by this detailed description, but rather by any claims that issue on an application based here on. Accordingly, the disclosure of the embodiments of the invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims.
While various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those skilled in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
201741000193 | Jan 2017 | IN | national |
Number | Name | Date | Kind |
---|---|---|---|
5463381 | Ryu | Oct 1995 | A |
6092170 | Kori | Jul 2000 | A |
9710165 | Lherault et al. | Jul 2017 | B1 |
20020023161 | Ito | Feb 2002 | A1 |
20020114326 | Mahalingaiah | Aug 2002 | A1 |
20020156887 | Hashimoto | Oct 2002 | A1 |
20060129779 | Cannon | Jun 2006 | A1 |
20060277383 | Hayden | Dec 2006 | A1 |
20070260822 | Adams | Nov 2007 | A1 |
20120079096 | Cowan | Mar 2012 | A1 |
20130086351 | Coronado et al. | Apr 2013 | A1 |
20150074222 | Liang | Mar 2015 | A1 |
20150117458 | Gurkan | Apr 2015 | A1 |
20160077966 | Stabrawa | Mar 2016 | A1 |
20160092121 | Nazari | Mar 2016 | A1 |
20160337426 | Shribman | Nov 2016 | A1 |
20160364484 | Dong | Dec 2016 | A1 |
20170109184 | Ramani | Apr 2017 | A1 |
20170109247 | Nakajima | Apr 2017 | A1 |
20170315752 | Colgrove | Nov 2017 | A1 |
20170324813 | Jain | Nov 2017 | A1 |
Entry |
---|
Cormac Hogan; VSAN and Storage Controllers; VMWare, VSphere Blog; Sep. 26, 2013; pp. 1-6 (Year: 2013). |
White Paper, “High Availability, Data Protection and Data Integrity in the XtremlO Architecture”, 2015, EMC2, pp. 1-28. |
Extended European Search Report issued in the European Patent Office in counterpart European Application No. 17163901.6, dated Oct. 25, 2017, 9 pages. |
Number | Date | Country | |
---|---|---|---|
20180191826 A1 | Jul 2018 | US |