The present invention is related to a file operation management device for performing file system operations.
File systems are currently mainly software-implemented and are usually an integrated part of an operating system. A software program which intends to access a file for reading or writing data therefore invokes a subsidiary software routine which is executed on the same processor as the application program.
Because the processor is typically optimized for user applications running as software on the processor, the resources of the processor are not optimally used when executing file system operations. For instance, a floating point unit or extensions for multimedia processing are included in the processor which are not used for file system operations. On the other hand, functions like encryption or decryption or checksum calculation are not very well supported by processors designed predominantly for application processing.
Furthermore, interactions with hardware such as a hard disk drive are time-consuming in an operating system which provides security mechanisms, hardware abstraction and other features.
In EP 1074 916 a system and method for managing data stored in a cache block in a cache memory is described. The cache block is located at a cache block address in the cache memory, and the data in the cache block corresponds to a storage location in a storage array identified by a storage location identifier. A storage processor accesses the cache block in the cache memory and provides a cache management command to a command processor. A processor memory coupled to the storage processor stores a search key based on the storage location identifier corresponding to the cache block. A command processor coupled to the storage processor receives a cache management command specified by the storage processor and transfers the storage location identifier from the processor memory. A cache management memory stores a cache management structure including the cache block address and the search key. A cache management processor is coupled to the cache management memory by a second link to manipulate the cache management structure in a hash table with linked lists at each hash queue element within the cache management memory in accordance with the cache management command and the search key.
In US 2002/0065924 an apparatus and method for hardware implementation or acceleration of operating system functions is described. The apparatus in one embodiment handles service requests over a network, wherein the network utilizes a protocol. In this aspect, the apparatus includes: a network subsystem for receiving and transmitting network service requests using the network protocol; and a service subsystem, coupled to the network subsystem, for satisfying the network service requests. Also in this aspect, at least one of the network subsystem and the service subsystem is hardware-implemented; the other of the network subsystem and the service subsystem may optionally be hardware-accelerated. A variety of related embodiments are also provided, including file servers and web servers.
Therefore, the present invention provides file operation management devices, methods and processing systems which allow better utilization of the application-processing unit. It also provides a file system operation functionality which requires less resources of the application-processing unit and provides a performance increase of the application.
According to a first aspect of the present invention, a file operation management device is provided to perform file system operations. The file operation management device includes a storage interface for coupling to a storage device. Furthermore, a data interface for coupling to a processing unit in which application programs are processed, also referred to as application-processing unit, is included to receive user and/or command data from the application-processing unit and to transmit status and user data to the coupled application-processing unit. The file operation management device further includes a file system unit for receiving the user and/or command data via the data interface and for performing a file system operation indicated by the command data on the data storage device depending on the user and/or command data received.
According to an embodiment of the present invention, the file operation management device receives command data to perform the file system operations as create, open, read, and write of files. Optionally, the file operation management device can improve the redundancy of performance including checksum checking and generation (RAID functionality), background copy for back-up and data migration services and monitoring for performance tuning, billing and service.
According to another embodiment of the present invention, a token/lock management unit is included to maintain the integrity of files stored in the coupled storage device by synchronizing accesses to the files. Thereby, concurrent uncoordinated access to a file can be avoided.
According to another embodiment of the present invention, a file operation management device is provided wherein the file system unit includes a fail-over functionality via a fail-over interface, wherein in the event of a failure in the file system unit the performing of the file system operations is handed over to a file operation-handling instance by means of the fail-over interface.
According to another aspect of the present invention, a processing system including a file operation management device is provided. Therein, the file operation management device is connected to one or more application-processing units.
These and other objects and features of the present invention will become clear from the following description taken in conjunction with the accompanying drawings, in which:
The present invention provides file operation management devices and processing systems that allow better utilization of an application-processing unit, and provides a file system operation functionality which requires less resources of the application-processing unit and provides a performance increase of the application. Thus, in the present invention, a file operation management device is provided to perform file system operations. An example of a file operation management device includes a storage interface for coupling to a storage device. Furthermore, a data interface for coupling to a processing unit in which application programs are processed, also referred to as application-processing unit, is included to receive user and/or command data from the application-processing unit and to transmit status and user data to the coupled application-processing unit. The file operation management device further includes a file system unit for receiving the user and/or command data via the data interface and for performing a file system operation indicated by the command data on the data storage device depending on the user and/or command data received.
The coupling of the file operation management device to the application-processing unit provides the file system operations that are invoked by the processed application to be outsourced to the file operation management device instead of being processed in the application-processing unit. This has the advantage that the application-processing unit no longer needs to perform the file system operations provided by software routines which are normally included in the operating system running on the application-processing unit. Furthermore, software routines for file operations are normally stored in the main memory coupled to the application-processing unit such that by outsourcing them memory space can be saved.
According to an embodiment of the present invention, the file operation management device receives command data to perform the file system operations as create, open, read, and write of files. Optionally, the file operation management device can improve the redundancy of performance including checksum checking and generation (RAID functionality), background copy for back-up and data migration services and monitoring for performance tuning, billing and service.
A memory unit can further be provided in order to store status information of the file system. Particularly, the memory unit can comprise a non-volatile memory. Furthermore, a cache memory unit can be included to cache user data.
According to another embodiment of the present invention, a token/lock management unit is included to maintain the integrity of files stored in the coupled storage device by synchronizing accesses to the files. Thereby, concurrent uncoordinated access to a file can be avoided.
Furthermore, the file system unit may provide a fail-over functionality wherein in the event of a failure in the file system unit, the performing of the file system operations is handed over to a software-implemented file operating system processed in the application-processing unit.
According to another embodiment of the present invention, a file operation management device is provided wherein the file system unit includes a fail-over functionality via a fail-over interface, wherein in the event of a failure in the file system unit the performing of the file system operations is handed over to a file operation-handling instance by means of the fail-over interface. The file operation-handling instance can be of any type able to perform file operations.
Preferably, the fail-over interface of the file operation management device is designed to hand over file operations to the application-processing unit as the file operation-handling instance which is implemented as a software implemented file system unit of the application-processing unit. Optionally the fail-over interface is designed to hand over file operations to another file system unit which is coupled to the application-processing unit to receive the user and/or command data via the data interface and to perform the file system operation on the data storage device depending on the user and/or command data received via the data interface.
In order to better utilize the resources of the file operation management device, the file system unit can share the performing of the file system operation with another file system unit coupled to the fail-over interface. Furthermore, the file operation management device can provide that the file system unit is designed to take over performing of the file system operation of another coupled file system unit when an indication about a failure of the coupled other file system unit is received via the fail-over interface. The file operation management device may further comprise an adapter unit for controlling the storage device which can be one of SCSI, S-ATA, and FibreChannel.
According to another embodiment of the present invention, a file operation management device is provided. Therein, the file operation management device is connected to one or more application-processing units. Preferably, the one or more application-processing units are coupled to the file operation management device via at least one of a processor bus, a PCI bus, a PCI-X bus, a PCI-Express bus, an SCSI bus, a Microchannel bus, a Frontside bus (FSB), a Hypertransport bus, a S-ATA bus, a Rapid-I/O bus and a Fire Wirebus.
In
For communicating with an application-processing unit 14 which performs software applications, the file operation management device 1 comprises a data interface 3 that is coupled to the application-processing unit 14 so that file operations can be invoked by the application-processing unit 14 by accessing the file operation management device 1. The file system unit 2 via the data interface 3 is prepared to receive user and/or command data from the application-processing unit 14 and to send status data and user data to the coupled application-processing unit 14. For accessing a storage device 11 a storage interface 4 is provided to allow the file system unit 2 to transmit to and receive data from the coupled storage device 11. The interfaces 3, 4 can also be integrated into one single interface.
The connection from the file operation management device 1 to the application-processing unit 14 via the data interface 3 is characterized by low latency, low-level supplied reliability and identifiability of the originating application process on the CPU 12. The connection is system-internal, i.e. the connection is arranged within the same system as the application-processing unit 14 and the operation management device 1, thereby not offering access to devices that could interfere with intended system operation nor observing the data transferred over this connection. Protection is ensured thereby without need for e.g. cryptographic security mechanisms. Devices that have not been identified to the system as trustable are untrusted devices, and hence may have no such access. This means that for instance no authentication methods like cryptographic signatures, timeout detection for the connection, or handling of overload situations need be included in every command transfer or status retrieval. The inherent security of the closed system hence allows doing without such additional security mechanisms, thereby reducing the latency of data transfer. This also allows doing without local caches within the application-processing unit 14, since the connection is fast enough to make use of the data via the operation management device 1.
Examples for this connection are the front-side-bus of a processor, hypertransport, Elastic Interface, RapidlO. These connection standards are based on addresses which allow the protected mapping of command and status address regions into the virtual address space of each application running on the application-processing unit 14. In the data interface 3 and in the application-processing unit 14, an intrinsic transaction-management mechanism is usable to automatically detect failures of the connection between the file operation management device 1 and the application-processing unit 14 or failures of the file operation management device 1. Such a failure can be handled by an exception handler in the application-processing unit 14 which will map a different file operation management device 1 into the same address space for continued reliable operation. Therefore, error handling is transparent to the application, i.e. it need not take care of error handling. The data interface 3 is hence a system-internal interface in contrast to a web interface of a file server.
The file operation management device 1 comprises a file system processor 201 that retrieves functionalities from a hardware-accelerated file-system-specific function unit 202 or a file system code 203 implemented as firmware. The hardware-accelerated file-system-specific function unit 202 and the file system code 203 are described in more detail below.
The hardware-accelerated file-system-specific function unit 202 is preferably in charge of functions that are easier implementable in hardware, those which require functionalities that are not typically provided by a general-purpose processor, such as long-integer-arithmetics, and/or those that have little data locality, e.g. in DMA, making no or negligible use of a local cache. In other words, the hardware-accelerated file-system-specific function unit 202 is handling those functions that are executed as part of most file system functions, i.e. with substantially every file access. Those functions comprise here one or more of
The frequency of those functions is hence larger than of those functions handled by the file system code 203.
The file system code 203 comprises three code fractions, namely framework software, file system software, and service software. It generally provides for the flexibility of the file system, e.g. in the event that additional filetypes should be handled, or to allow to accommodate changes in the communication protocol standard. It becomes clear that the functionality of the file system code is less frequently required for the file system functionality and/or more complex to realize in hardware than in software.
The framework software is intended to provide abstractions of hardware details, managing independent processes such as parallel file accesses, authentication and protection management, error handling, installation or de-installation of service software.
The file system software configures the hardware-accelerated file-system-specific function unit 202 to fit to the various data structures of different file systems. A data structure comprises inter-alia the way file directories are stored on a disk. Furthermore it handles those cases that are beyond the capability of the hardware-accelerated file-system-specific function unit 202. This could for instance be an extension of features in a file system that have not been implemented in the hardware-accelerated file-system-specific function unit 202 at its construction time.
The service software is not essential for the file system operations but it makes advantageous use of the fact that all file system operations are handled by the file operation management device 1. The functionalities of the service software can include creation of back-up copies, data migration, data erasing, and monitoring for performance tuning, system maintenance, or billing for file accesses. This service software can be designed by the manufacturer of the 1 to ensure secure and flawless operation.
Monitoring is a functionality to assess system performance, intensity of system use, system integrity, frequency of data accesses, storage device utilization, and similar system parameters. This monitoring can be performed per application, per user or per user group, such as a company or the like. The results of the monitoring can be made accessible to a system maintainer via the data interface 3, allowing adapting an application to the reported system status. Such adaptation can be utilized to tune the system performance. System maintenance can comprise restructuring the number and type of storage devices connected.
The monitoring result also provides a basis for billing a user of the application-processing unit 14. The file system unit 2 provides an advantageous entity for accounting the file-system-related activities independent of operating systems and user partitions. For example a user in a data center can sign up to an outsourcing service arrangement with a service provider who operates the system 19 and bills the user individually for use of the application-processor unit 19, based for instance on the utilized network bandwidth, storage capacity occupied, or storage access frequency. Another example could be a gaming application in an online environment, wherein a multitude of users are provided by the service provider who operates the system 19, with the opportunity to save game parameters in the system 19. The service provider then charges the users per duration of storage time, and access frequency. The measured parameter that is used as the basis for billing is being monitored on a per-user basis.
The file operation management device 1 can be designed with one or more of the file system processors 201 and/or one or more memory units 6, wherein the file operation management device 1 can take advantage of different types of memory regions within the same physical memory unit 6, for instance memory regions with error correction, memory regions with error detection, or memory regions without any means for error handling. Also, different memory types such as memories having a high bandwidth and high latency, blockwise accessible memory, etc. can be used.
The file operation management device 1 also comprises a fail-over interface 5, a cache memory unit 9, a memory interface 7 to a memory unit 6, and a token/lock management unit 8. The interfaces 7, 5 present in the file operation management device 1 provide advanced features. Thus, the fail-over interface 5 is usable to couple the file operation management device 1 to other file operation management devices for synchronization and for communication of fail-over states as discussed in more detail below.
The file operation management device 1 also comprises an access scheduling unit 20 connected to the file system unit 2, described in more detail below.
The application-processing unit 14 addresses the file operation management device 1 preferably like a permanent memory by transmitting commands such as for reading or writing of data wherein the file operation management device 1 handles the further operations related to addressing the connected storage device 11 in the appropriate manner by itself. The file operation management device 1 communicates via its storage interface 4 by means of an appropriate communication protocol with the storage device 11.
By providing the file operation management device 1 that comprises a file system unit 2 that is separate from the application-processing unit 14, file operations are processed separately from the application-processing unit 14, thereby saving resources of the application-processing unit 14 by outsourcing them.
For example, the file operation management device 1 can be included into a chipset for an AMD Opteron or an AMD Athlon CPU unit. Thereby, the file operation management device 1 can be integrated into personal computers and workstations. The integration can be carried out by connecting the file operation management device 1 to the chipset via an appropriate interface such as hypertransport, PCI-X, PCI-Express or CSA (Communications Streaming Architecture). Optionally, the file operation management device 1 can be integrated in the CPU itself or in interface devices coupled to the CPU, for instance, the AMD-811 I/O Hub.
For coupling the file operation management device 1 according to the present invention with servers and PCs, the file operation management device 1 can be integrated together with an adapter for the data storage device 11 such as SCSI, S-ATA (Serial ATA), FibreChannel etc. on one board to be integrated into a variety of products, from servers and other dedicated storage-related products to PCs and workstations.
In
It is preferred to operate the various file operation management devices 1 in parallel in order to achieve a higher performance. The fact that there is more than one active file operation management device 1 for the application-processing unit 14 provides an advantageous take-over option in case of a failure of one of the file operation management devices 1. In the event that one of the file operation management devices 1 fails, one of the other file operation management devices 1 can take over its functionality. Furthermore, the file operation management devices 1 can be coupled to share the file operations, thereby providing a faster access to the files.
The file system unit 2 of the file operation management devices 1 has a fail-over functionality so that a transfer of the file system operation functionality from e.g. the first file operation management device 1 to e.g. the second file operation management device 1 in the event of a failure in the file system unit 2 of the first file operation management device 1, is carried out. The file system operation functionality is then provided by the second file operation management device 1 implemented in the same server or workstation 19.
Also, if any of the file operation management devices 1 fails, its functionality can be taken over by one or more of the CPUs 12 wherein in that case an operation system is loaded into the main memory 13 to provide the file system operation functionality as it is done by conventional operating-system software. This can be particularly useful in an embodiment of the system 19 where only one file operation management device 1 is available or if the remaining file operation management device 1 does not have the capacity to handle all file operations.
Thirdly, the file system operation functionality may be provided by a remote server if the file operation management device 1 fails. This is particularly attractive if the storage device 11 is provided by means of a shared access, e.g. to a Storage Area Network. The fail-over interface 5 of any of the file operation management devices 1 is designed to continuously transfer data on the operations of its file system unit 2 to the fail-over interface of one or more of the other file operation management devices 1 to allow the completion of those operations that were outstanding in the failing file operation management device 1 at the time at which the failure occurred. The same applies to all other file operation management devices 1. The respective data can be compressed and communicated in any standardized format.
The file system unit 2 of the each file operation management device 1 may include a mechanism to detect whether a failure in a connected other file operation management device 1 has occurred so that it can be determined when a task of a failed file operation management device 1 should be transferred to one of the working file operation management devices 1. For signaling of the failure, a respective failure signal is transmitted via the fail-over interface 5. A variety of different task management and failure mechanisms that are known in the art can be used. If more than one file operation management device 1 is used in the system 19, these can be connected to one or more storage devices 11.
As depicted in
As a further interface, an authentication interface can be provided such as a smartcard interface, not shown in the figure. This provides security to the system 19, for which the installation of additional functions, the configuration, or the retrieval of monitoring results should not be permitted all the time by the host system, i.e. the processing memory unit 14. For carrying out administrative work, the administrator connects a secure data carrier, such as a smartcard to the authentication interface.
This invokes a mechanism to install a new plug-in or the retrieval of information. By making use of the secure data carrier, this method also allows the retrieved information to be in encrypted form such that the administrative work can be carried out by an operator with lower privileges who will not see the content of the unencrypted information. Moreover, in order to execute the data protocol for communication via the interfaces, a protocol state machine such as SCSI, FibreChannel, StorageTank etc. can be provided.
Another function of the file system is maintaining the integrity of the files by avoiding concurrent, uncoordinated access thereto. This function is here performed by the token/lock management unit 8. Every file access of an application is observed by the token/lock management unit 8 and in the event of concurrent accesses to the same file or data section of a file the compatibility of the access types is checked and in case of a conflict the second access is delayed or denied. This integrity maintenance function has several aspects. A first aspect is maintaining the integrity between processes running on the same system 19. As a second aspect, however, the same functionality of the token/lock management unit 8 can also be used to ensure consistency in case of accesses to common files between different CPUs 12. The lock management functionality for files can use hardware acceleration included in the file operation management device 1 such as a content-addressable memory or tree search/insert engines. The token/lock management unit 8 can also be inserted into the path between the data interface 3 and the file system unit 2, thereby providing a reduction of the utilization of the file system unit 2 by those file accesses that are denied by the token/lock management unit 8 in case of access conflict.
Furthermore, the file operation management device 1 handles the scheduling of the access sequence of the file systems handled to the storage devices 11. This can be performed by the access scheduling unit 20 that maintains a data structure in which the pending data-block-read or data-block-write requests are stored and a second data structure also stores the states of the connected storage devices 11. The access scheduling unit 20 therefrom determines a priority which of the pending requests is to be executed next. A preferred priority scheme would be to select those requests that are physically, i.e. with regard to the storage location on the storage device 11, closest to the storage location addressed by the last access, thereby increasing the performance of the file operation management device 1 by reducing the head movements of the storage device 11. Another preferred priority scheme favors the longest-pending request over others, thereby guaranteeing equal performance for the pending applications. Another priority scheme would be to group together requests that apply to physically close or identical data blocks.
Variations described for the present invention can be realized in any combination desirable for each particular application. Thus particular limitations, and/or embodiment enhancements described herein, which may have particular advantages to the particular application need not be used for all applications. Also, not all limitations need be implemented in methods, systems and/or apparatus including one or more concepts of the present invention.
It is noted that the foregoing has outlined some of the more pertinent aspects and embodiments of the present invention. This invention may be used for many applications. Thus, although the description is made for particular arrangements and methods, the intent and concept of the invention is suitable and applicable to other arrangements and applications. It will be clear to those skilled in the art that modifications to the disclosed embodiments can be effected without departing from the spirit and scope of the invention. The described embodiments ought to be construed to be merely illustrative of some of the more prominent features and applications of the invention. Other beneficial results can be realized by applying the disclosed invention in a different manner or modifying the invention in ways known to those familiar with the art.
Number | Date | Country | Kind |
---|---|---|---|
04405456.7 | Jul 2004 | EP | regional |