Systems that utilize a single server for storing information that is accessed by one or more clients generally do not have issues with generating unique identifiers for objects, e.g., files. However, when systems expand to include, for example, a server cluster with a number of servers, there must be some mechanism in place for ensuring that files or other objects have identifiers that are unique within the server cluster. As can be appreciated, if one server in the server cluster fails, the other servers must be able to carry the load of the failed server. Having unique identifiers ensures that a client, which previously accessed the failed server, can send the identifier to any server in the server cluster and access the same object or file. Also, it is not practical from a performance perspective to implement a system that checks every identifier among all the nodes in a cluster before generating the identifier.
In addition, the cluster, and the servers within the cluster, may be configured to communicate with clients using a predetermined set of protocols. Thus, in addition to having to generate identifiers that are unique across a server cluster, the identifiers have to meet the requirements of the protocols such as specific structure, size, format, etc. as dictated by the protocols.
It is with respect to these and other considerations that embodiments have been made. Also, although relatively specific problems have been discussed, it should be understood that the embodiments should not be limited to solving the specific problems identified in the background.
This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description section. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
Described are embodiments for providing unique identifiers for files or objects across servers in a server cluster. Embodiments include generating a 64-bit identifier that includes at least three portions. The first portion includes a node identifier which identifies the particular server in a server cluster which created the unique identifier. The second portion includes a major sequence number that is incremented when a server is rebooted or otherwise taken off-line and then brought back online. Additionally, the major sequence number is incremented when all of the minor sequence numbers, which are included in a third portion of the unique identifier, have been used. The minor sequence numbers in the third portion are incremented for every unique file or object requested. The minor sequence numbers fall within a particular range. When the minor sequence numbers within the range are all used, the major sequence number is incremented and the minor sequence numbers within the range are reused.
Embodiments may be implemented as a computer process, a computing system, or as an article of manufacture such as a computer program product or computer readable media. The computer program product may be a computer storage medium readable by a computer system and encoding a computer program of instructions for executing a computer process. The computer program product may also be a propagated signal on a carrier readable by a computing system and encoding a computer program of instructions for executing a computer process.
Non-limiting and non-exhaustive embodiments are described with reference to the following figures.
Various embodiments are described more fully below with reference to the accompanying drawings, which form a part hereof, and which show specific exemplary embodiments. However, embodiments may be implemented in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the embodiments to those skilled in the art. Embodiments may be practiced as methods, systems or devices. Accordingly, embodiments may take the form of a hardware implementation, an entirely software implementation or an implementation combining software and hardware aspects. The following detailed description is, therefore, not to be taken in a limiting sense.
As shown in
In accordance with one embodiment, cluster 106 generates unique identifiers 110 for files or other objects that are accessed by clients 102 and 104. The unique identifiers 110 are passed from cluster 106 to clients 102 and 104. As described in greater detail below, the unique identifiers are unique across servers 106A, 106B, and 106C. In embodiments, clients 102 and 104 establish a session for accessing information, such as files or objects, stored on server cluster 106. The session is established with one of the servers 106A, 106B, and 106C. As part of establishing the session, the server will send a session identification to the client.
As one example, client 102 may request to establish a session with server cluster 106 to access file information stored on server cluster 106. In this example, server 106A receives the request and negotiates a session with client 102. In one embodiment, the session is negotiated using a file access protocol such as a version of the server message block (SMB) protocol or a version of the network file system (NFS) protocol.
After the session has been established, client 102 sends a file access request to server 106A. When server 106A receives the request, it assigns a unique identifier to the file requested by the client 102 and provides the identifier to the client 102. The client 102 will include the unique identifier in any subsequent request to access the file. The identifier provided by the server 106A is unique across servers 106A, 106B, and 106C.
Server 106A is configured to generate the unique identifier. One embodiment of a unique identifier 200 is shown in
The unique identifier 200 also includes a second portion 204 with a major sequence number and a third portion 206 with a minor sequence number. For each major sequence number there is a range of minor sequence numbers that may be used. As can be appreciated, the range of numbers that can be used for the minor sequence number is dependent upon the particular structure of the unique identifier 200.
In one embodiment, the unique identifier 200 is 64 bits in length. This length may be determined by, for example, the particular protocol being used to access a file. In one example, if the SMB protocol is used to access files, the file identifiers generated by a server must be 64 bits long. Thus, in order to comply with the requirements of SMB, unique identifier 200 will be 64 bits long. In this embodiment, the first portion 202 of the unique identifier 200 is 8 bits long, the second portion 204 is 24 bits long, and the third portion 206 is 32 bits long. Embodiments are not limited to 64 bits in length. In other embodiments, the size may be different with the bit division changed to reflect the different size.
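By way of illustration only, the 8/24/32-bit division described above may be sketched in Python as follows. The function names `pack_identifier` and `unpack_identifier` are hypothetical, and the placement of the node identifier in the most significant bits is an assumption made for this sketch; the description states only the sizes of the three portions.

```python
from typing import Tuple

NODE_BITS = 8    # first portion 202: node identifier
MAJOR_BITS = 24  # second portion 204: major sequence number
MINOR_BITS = 32  # third portion 206: minor sequence number


def pack_identifier(node_id: int, major: int, minor: int) -> int:
    """Combine the three portions into a single 64-bit integer.

    Assumption: the node identifier occupies the most significant bits.
    """
    if not 0 <= node_id < (1 << NODE_BITS):
        raise ValueError("node_id does not fit in 8 bits")
    if not 0 <= major < (1 << MAJOR_BITS):
        raise ValueError("major does not fit in 24 bits")
    if not 0 <= minor < (1 << MINOR_BITS):
        raise ValueError("minor does not fit in 32 bits")
    return (node_id << (MAJOR_BITS + MINOR_BITS)) | (major << MINOR_BITS) | minor


def unpack_identifier(identifier: int) -> Tuple[int, int, int]:
    """Split a 64-bit identifier back into (node_id, major, minor)."""
    minor = identifier & ((1 << MINOR_BITS) - 1)
    major = (identifier >> MINOR_BITS) & ((1 << MAJOR_BITS) - 1)
    node_id = (identifier >> (MAJOR_BITS + MINOR_BITS)) & ((1 << NODE_BITS) - 1)
    return node_id, major, minor
```

With this layout, each major sequence number covers 2^32 minor sequence numbers, and a different bit division would only require changing the three constants.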
Also shown in
In some embodiments, the major sequence number will be incremented each time there is a server reboot, or a server is otherwise taken off-line and brought back online. In these embodiments, the major sequence number can be used to count the number of times that the cluster service restarts.
As noted above, clients 102 and 104 may use a version of the SMB protocol to connect to the server cluster 106. Versions of the SMB protocol use file identifiers that contain a persistent portion and a volatile portion; the former remains the same across a disconnect/reconnect, while the latter changes each time the handle is re-established. The persistent portion is generated on the first open. In embodiments, the structure of identifier 200 (
In embodiments, the file identifier used with versions of the SMB protocol can be useful, for example, to support an administrative API with which an administrator can close a file: the server can look at the portion of the file identifier that holds the node identifier and know exactly which server needs to be contacted to handle the close. This can be used in administrative APIs, e.g., NetSessionClose and NetFileClose, to allow a group of servers to be managed as a single server, because the server can identify which server needs to process a given request based on the node identifier.
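As a non-limiting sketch of the routing described above, the following Python fragment reads the node identifier out of a file identifier and looks up the server that must handle a close request. The function names, the server table, and the assumption that the node identifier occupies the top 8 bits of the 64-bit value are hypothetical and are introduced only for illustration.

```python
from typing import Dict

NODE_SHIFT = 56  # assumed: node identifier stored in the top 8 bits
NODE_MASK = 0xFF


def owning_node(identifier: int) -> int:
    """Return the node identifier portion of a 64-bit file identifier."""
    return (identifier >> NODE_SHIFT) & NODE_MASK


def route_close(identifier: int, servers: Dict[int, str]) -> str:
    """Pick the server that created the identifier, so it can process the close.

    `servers` is a hypothetical map from node identifier to server name.
    """
    node_id = owning_node(identifier)
    if node_id not in servers:
        raise LookupError("no server registered for node %d" % node_id)
    return servers[node_id]


# Usage: an identifier created by node 2 is routed to the matching server.
servers = {1: "server-A", 2: "server-B", 3: "server-C"}
file_id = (2 << 56) | (9 << 32) | 4
target = route_close(file_id, servers)
```

Because the node identifier is carried in the identifier itself, no cluster-wide lookup of open files is needed to find the owning server.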
The particular embodiments described above, such as identifier 200 with the first portion being 8 bits long, the second portion being 24 bits long, and the third portion being 32 bits long, are provided merely for illustrative purposes. Embodiments are not limited to this particular segmenting of a 64-bit identifier. In other embodiments, the first portion, the second portion, and the third portion may have different bit lengths depending upon the particular embodiment. Also, it can be appreciated that identifier 200 may be of any length and is not limited to 64 bits, but may in embodiments be shorter or longer than 64 bits.
As one example, if the client and server communicate using a version of the SMB protocol, they may exchange a “PreviousSessionId” parameter in a SESSION_SETUP request to indicate that the client was previously connected, was later disconnected, and is now connecting again. The server must disconnect the previous session, if it exists, as part of establishing the new session. Since a portion of the session ID 300 contains the node identifier of the server that created the session, the server knows exactly which server must be contacted to disconnect the session.
Turning now to
As shown in
In the embodiment shown in
Each of servers 406A, 406B, and 406C also includes an identifier generation component. The identifier generation component generates unique identifiers associated with files or file handles that are provided to clients that request access to files. The unique identifiers are unique across all of the servers in cluster 406. That is, no two identifiers will be the same, which prevents collisions among the identifiers. The identifier generation components include the logic necessary for determining the particular major sequence number and minor sequence number to include in a unique file identifier. Additionally, the identifier generation component on each of servers 406A, 406B, and 406C communicates with the local cluster service component to reserve additional major sequence numbers in a cluster registry 408.
In the embodiment shown in
As one example, assume that server 406A receives a request from a client to access file information stored on cluster 406. The client access component on server 406A will communicate with the client and establish a session that allows the client to access file information. After establishment of the session, the client will send a request to access a file. In response to receiving the request, the client access component on server 406A will request an identifier from the identifier generation component on server 406A. The identifier generation component will determine the appropriate node identifier, major sequence number, and minor sequence number to include in the unique identifier. After generating the unique identifier, the identifier generation component will pass the unique identifier to the client access component, which will in turn provide the unique identifier to the client.
After a series of requests from the client, the identifier generation component will determine that all of the minor sequence numbers have been used in previous unique identifiers and will then ask the cluster service component on server 406A to reserve the next major sequence number. The cluster service component on server 406A will then communicate with the cluster service component on server 406C to request reservation of the next major sequence number within registry 408. The cluster service component on server 406C will then provide a response to the cluster service component on server 406A indicating that the next major sequence number has been reserved in the cluster registry 408. The cluster service component within server 406A will provide an indication to the identifier generation component that the next major sequence number has been reserved in cluster registry 408. The identifier generation component on server 406A can then begin to use the next major sequence number and start from the beginning of the range of minor sequence numbers.
In some embodiments, one of servers 406A or 406B may fail or otherwise be taken off-line and then later brought back online. When the server is brought back online, and the client access component on that server receives a request for a file, the identifier generation component on that server will request the next major sequence number from the local cluster service component. The local cluster service component will then follow the process previously described for reserving the next major sequence number from the cluster registry 408. The cluster registry 408 can therefore be used in these embodiments to keep track of the number of reboots that a server has undergone.
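The two situations described above in which a new major sequence number is reserved, namely exhaustion of the minor sequence numbers and a server coming back online, may be sketched as follows. This is a simplified illustration and not the described implementation: `InMemoryRegistry` is a hypothetical in-process stand-in for cluster registry 408, the registry is assumed to track major sequence numbers per node, the bit layout is assumed, and resetting of major sequence numbers is omitted.

```python
from typing import Dict, Optional

NODE_SHIFT = 56        # assumed: node identifier in the top 8 bits
MAJOR_SHIFT = 32       # assumed: major sequence number in the next 24 bits
MAJOR_LIMIT = 1 << 24
MINOR_LIMIT = 1 << 32


class InMemoryRegistry:
    """Hypothetical stand-in for cluster registry 408."""

    def __init__(self) -> None:
        self._last_major: Dict[int, int] = {}

    def reserve_next_major(self, node_id: int) -> int:
        next_major = self._last_major.get(node_id, -1) + 1
        if next_major >= MAJOR_LIMIT:
            raise OverflowError("major sequence numbers exhausted")
        self._last_major[node_id] = next_major
        return next_major


class IdentifierGenerator:
    """Per-server generator; a new instance models a server brought back online."""

    def __init__(self, node_id: int, registry: InMemoryRegistry,
                 minor_limit: int = MINOR_LIMIT) -> None:
        if not 0 <= node_id < 256:
            raise ValueError("node_id does not fit in 8 bits")
        if not 0 < minor_limit <= MINOR_LIMIT:
            raise ValueError("minor_limit out of range")
        self.node_id = node_id
        self.registry = registry
        self.minor_limit = minor_limit
        self.major: Optional[int] = None  # nothing reserved since coming online
        self.next_minor = 0

    def next_identifier(self) -> int:
        # Reserve a major number on the first request after start-up,
        # or when every minor number under the current major is used.
        if self.major is None or self.next_minor >= self.minor_limit:
            self.major = self.registry.reserve_next_major(self.node_id)
            self.next_minor = 0
        identifier = ((self.node_id << NODE_SHIFT)
                      | (self.major << MAJOR_SHIFT)
                      | self.next_minor)
        self.next_minor += 1
        return identifier
```

Because a restarted server never reuses a major sequence number, identifiers issued after a restart cannot collide with those issued before it, even though the minor sequence numbers start again from the beginning of the range.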
The above description is provided merely to illustrate embodiments. It should be understood that although the server cluster 406 in
Furthermore, although operational flows 500 and 600 are illustrated and described sequentially in a particular order, in other embodiments, the operations may be performed in different orders, multiple times, and/or in parallel. Further, one or more operations may be omitted or combined in some embodiments.
Operational flow 500 is illustrated to show the various steps that may be performed in determining the appropriate major sequence number and minor sequence number for including in a unique identifier. In embodiments, the identifier generation components of servers 406A, 406B, and 406C (
Flow 500 begins at operation 502 where a request to access a file is received. Operation 502 is in embodiments preceded by a number of other steps such as negotiating a session. The session may be negotiated using a file access protocol such as a version of the SMB protocol. In other embodiments, operation 502 is preceded by a number of previous requests to access a number of different files. In response to the request received at operation 502, a unique identifier must be generated to identify the file that is being accessed.
As shown in the embodiment in
To avoid being in a situation where generating a unique identifier must be paused until there is an approval for reserving the next major sequence number, decision 504 determines whether the next minor sequence number is at a threshold position from the end of the range of minor sequence numbers. This allows the reservation request to be sent early and asynchronously.
If at decision 504 it is determined that the next minor sequence number that is to be included in the unique identifier is greater than a first threshold position, flow passes to operation 506 where a request to reserve the next major sequence number is sent. At operation 508 an indication of the reservation is received. The indication received at operation 508 may be an approval indication which would allow the major sequence number to be incremented when necessary, i.e., when the minor sequence numbers have all been used. In other embodiments, the indication received at operation 508 may not be an approval of the next major sequence number. This may occur, for example, if the server that includes the cluster registry is unavailable. Flow 500 passes from operation 508 to operation 510 where a unique identifier is generated. Flow ends at 512.
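A minimal sketch of the early reservation of decision 504 and operations 506 and 508 is given below. The class name `EarlyReserver`, the `request_reservation` callback, and the rule of not repeating a request once a reservation is held are assumptions introduced for this sketch. The callback is invoked synchronously here for simplicity, whereas the description contemplates sending the request asynchronously.

```python
from typing import Callable, Optional


class EarlyReserver:
    """Requests the next major sequence number before the minor range runs out."""

    def __init__(self, first_threshold: int,
                 request_reservation: Callable[[], Optional[int]]) -> None:
        self.first_threshold = first_threshold
        # Hypothetical callback: returns the reserved major number, or None
        # when the indication received is not an approval.
        self.request_reservation = request_reservation
        self.reserved_major: Optional[int] = None

    def before_generate(self, next_minor: int) -> None:
        """Decision 504: past the first threshold, try to reserve (506/508)."""
        if next_minor > self.first_threshold and self.reserved_major is None:
            self.reserved_major = self.request_reservation()
```

If the request is not approved, `reserved_major` stays `None` and the request is repeated on the next call, which leaves the remaining minor sequence numbers as headroom for retries.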
If at decision 504 a determination is made that the minor sequence number is not greater than a first threshold position, flow 500 passes to decision 514 where a determination is made whether the next minor sequence number to be included in a unique identifier is greater than a second threshold position and the last major sequence number is already in use. The second threshold position is closer to the end of the minor sequence number range than the first threshold position which is used in decision 504. Being closer to the end of the range is a more urgent situation for securing a reservation of a next major sequence number. If the minor sequence numbers run out, and no major sequence number has been reserved, no more unique identifiers can be generated. This may result in an error where there is a delay in generating identifiers and thus in responding to requests from a client to access files.
Decision 514 also takes into account whether the last major sequence number is currently in use. As shown in
After operation 516, flow passes to operation 518 where an indication is received as to whether or not the major sequence numbers have been reset. Flow passes from operation 518 to decision 520 where a determination is made as to whether the indication received at operation 518 indicates that the major sequence numbers have been successfully reset. If at decision 520 a determination is made that the major sequence numbers have been successfully reset, flow passes to operation 510 where the unique identifier is generated. Flow then ends at 512.
If, however, a determination is made at decision 520 that the indication from operation 518 indicates that the major sequence numbers have not been reset, flow passes to operation 522 where a local reservation is provided. Operation 522 is meant to deal with situations in which, for example, there is a problem with accessing a cluster registry and with the ability to reserve a major sequence number or to reset the major sequence numbers. When faced with this situation, embodiments will still be able to generate some type of identifier by reserving or resetting major sequence numbers locally. This avoids situations in which no unique identifiers can be generated and the system stops generating identifiers for access requests.
On servers that are not part of a cluster, unique file identifiers are, in embodiments, generated in the same way as on servers that are part of a cluster. However, the node identifier equals zero and the major sequence number is stored in the regular registry. On a server that is part of a cluster, if the cluster registry fails to update after the first threshold is reached and the cluster service is not running, then the local registry key (not the cluster registry key) is read to get the major sequence number that is used to generate the next unique file identifiers. When the second threshold is reached, a unique identifier, using the node identifier zero, will be generated. If the cluster service was running when the first request was sent, another request can be sent to update the cluster registry, but if the cluster registry fails to update this time, no new unique file identifiers will be generated. Instead, an event will be fired so that an administrator can resolve the issue by restarting the cluster service or the machine, because such a situation indicates a serious error.
Referring once again to
If at decision 526 a determination is made that a reset or a reservation of the major sequence number was not successful, flow passes to operation 528 where an error indication occurs. This may result, for example, in a notification to an administrator or some other action or event to correct the error. Flow then ends at 512.
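The fallback from a cluster reservation to a local reservation, ending in an error indication when both fail, may be sketched as follows. The function `reserve_with_fallback` and its two callbacks are hypothetical, and modeling an unsuccessful reservation as a `None` return value and the error indication of operation 528 as a raised exception are assumptions made for illustration.

```python
from typing import Callable, Optional, Tuple


def reserve_with_fallback(
    cluster_reserve: Callable[[], Optional[int]],
    local_reserve: Callable[[], Optional[int]],
) -> Tuple[int, str]:
    """Try the cluster registry first, then a local reservation (operation 522).

    Each callback returns a major sequence number, or None when the
    reservation (or reset) did not succeed.
    """
    major = cluster_reserve()
    if major is not None:
        return major, "cluster"
    major = local_reserve()
    if major is not None:
        return major, "local"
    # Neither attempt succeeded: corresponds to the error indication of 528.
    raise RuntimeError("unable to reserve a major sequence number")
```

A caller could catch the exception to notify an administrator, while continuing to serve identifiers whenever either reservation path succeeds.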
Referring back to decision 514, if a determination is made that the next minor sequence number to be used in generating a unique identifier is not greater than the second threshold or the major sequence number that is currently in use is not the last major sequence number, flow 500 passes to decision 530. At decision 530 a determination is made as to whether the next minor sequence number to be included in the unique identifier is greater than the second threshold position and the next major sequence number has not been reserved. As noted before, being at a second threshold which is closer to the end of the range of minor sequence numbers is a more urgent situation. If previous attempts have been made to reserve the next major sequence number but have been unsuccessful, then a situation can potentially occur that would prevent the generation of any identifier and thus possibly result in not providing responses to file access requests. As can be seen, if a determination is made at decision 530 that the next minor sequence number is greater than the second threshold position and the next major sequence number has not been reserved, flow will pass to operation 522, operation 524, decision 526 and operation 528 as described above.
However, if at decision 530 a determination is made that the next minor sequence number to be used is not greater than the second threshold, or that the next major sequence number has been reserved, flow 500 passes to operation 510 where the unique identifier is generated. Flow then ends at operation 512.
One feature of the embodiment shown in
As noted above, in embodiments, flow 500 is performed by an identifier generation component that is included on a number of servers in a cluster. It should be understood that in some embodiments, flow 500 may be implemented in other types of environments than the one shown in
Operational flow 600 illustrates steps for sending or providing the unique identifier to a client. Operations of flow 600 may be combined in embodiments with one or more operations of flow 500 shown in
Flow 600 begins at operation 602 where a session is established with a client to allow the client to access file information. The session may be established using, for example, a version of the SMB protocol. The session established at operation 602 can be identified with a session identifier. The session identifier can be included in communications from the client so that a server, even a different server of the cluster, can identify the particular session that has been established with the client.
After operation 602, flow 600 passes to operation 604 where a request for file information is received. The request may be formatted according to the same file access protocol, e.g., a version of the SMB protocol. As noted above, a unique identifier is generated in embodiments to identify the file that is requested. Operation 606 generates the unique identifier.
As shown in
In other embodiments, the three optional sub-operations (608-612) are not performed. Rather, the first time a unique identifier is generated, the node identifier is set, and the remaining portions, such as the sequence number, are incremented every time a new unique file identifier is generated. There may be logic to check thresholds and boundaries, which may result in creating a completely new unique file identifier. For example, if a decision is taken to change the node identifier portion, a new 64-bit identifier may be created and used to create the next requested unique file identifier by simply incrementing it.
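The increment-based variant described above may be illustrated as follows, treating the identifier as a single 64-bit integer. The function name `increment_identifier` is hypothetical, the node identifier is assumed to occupy the top 8 bits, and the boundary check shown is only one possible form of the threshold and boundary logic mentioned above.

```python
MASK64 = (1 << 64) - 1
NODE_SHIFT = 56  # assumed: node identifier stored in the top 8 bits


def increment_identifier(identifier: int) -> int:
    """Produce the next identifier by adding one to the 64-bit value.

    A carry out of the minor sequence number increments the major sequence
    number. A carry that would alter the node identifier is rejected, since
    a new base identifier would be needed in that case.
    """
    candidate = (identifier + 1) & MASK64
    if (candidate >> NODE_SHIFT) != (identifier >> NODE_SHIFT):
        raise OverflowError("sequence portions exhausted for this node")
    return candidate


# Usage: the last minor number under major 5 rolls over to major 6, minor 0.
last_minor = (1 << 56) | (5 << 32) | 0xFFFFFFFF
rolled = increment_identifier(last_minor)
```

In this sketch the rollover from one major sequence number to the next falls out of ordinary integer addition; whether that next major sequence number has been reserved would still have to be checked separately.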
As shown in the embodiment in
In its most basic configuration, system 700 typically includes at least one processing unit 702 and memory 704. Depending on the exact configuration and type of computing device, memory 704 may be volatile (such as RAM), non-volatile (such as ROM, flash memory, etc.) or some combination of the two. This most basic configuration is illustrated in
The term computer readable media as used herein may include computer storage media. Computer storage media may include volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. System memory 704, removable storage, and non-removable storage 708 are all computer storage media examples (i.e., memory storage). Computer storage media may include, but is not limited to, RAM, ROM, electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store information and which can be accessed by computing device 700. Any such computer storage media may be part of device 700. Computing device 700 may also have input device(s) 714 such as a keyboard, a mouse, a pen, a sound input device, a touch input device, etc. Output device(s) 716 such as a display, speakers, a printer, etc. may also be included. The aforementioned devices are examples and others may be used.
The term computer readable media as used herein may also include communication media. Communication media may be embodied by computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and includes any information delivery media. The term “modulated data signal” may describe a signal that has one or more characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media may include wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, radio frequency (RF), infrared, and other wireless media.
Reference has been made throughout this specification to “one embodiment” or “an embodiment,” meaning that a particular described feature, structure, or characteristic is included in at least one embodiment. Thus, usage of such phrases may refer to more than just one embodiment. Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
One skilled in the relevant art may recognize, however, that the embodiments may be practiced without one or more of the specific details, or with other methods, resources, materials, etc. In other instances, well known structures, resources, or operations have not been shown or described in detail merely to avoid obscuring aspects of the embodiments.
While example embodiments and applications have been illustrated and described, it is to be understood that the embodiments are not limited to the precise configuration and resources described above. Various modifications, changes, and variations apparent to those skilled in the art may be made in the arrangement, operation, and details of the methods and systems disclosed herein without departing from the scope of the claimed embodiments.
Number | Date | Country | |
---|---|---|---|
20120259912 A1 | Oct 2012 | US |