In a large interdisciplinary engineering project spanning multiple timezones, each participant may require read-only access to thousands of files while retaining write authority to only a few hundred. Some of the files are related in a logical hierarchy while others are related in a physical hierarchy. Many of the files evolve over time and participants may at one time want access to the most recent version of a certain file and at others desire a stable collation. Thus the files that concern one participant will be different from another and multiple versions of a certain file may be appropriate for various roles.
Moreover concentration of files into centralized servers has been observed to increase congestion and lower both accessibility and reliability. Centralized servers require heavier investment in information technology and redundancy while duplicating storage.
Referring now to
Thus it can be appreciated that what is needed is coherent management of all files committed to a project with distributed storage and improved data block accessibility.
To further clarify the above and other advantages and features of the present invention, a more particular description of the invention will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. It is appreciated that these drawings depict only typical embodiments of the invention and are therefore not to be considered limiting of its scope. The invention will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
a-e is a flow chart of method steps;
A plurality of peer workstation clients first transmit only metadata about versions of files upon request and, subsequently, file blocks only when needed to fulfill a file block read of a particular version. A file state view at each workstation client enables access to file blocks of every version of each file which the user currently has in local store. A peer file system view at each workstation client enables access to file blocks of versions previously stored by other users and may be more or less out of date. A file state sub-set satellite collects metadata from a group of neighboring workstation clients. A plurality of file state sub-set satellites may exchange metadata among one another and answer a request from a workstation client who has gotten no response from its neighbors. A coherent file state authority receives via the sub-set satellites, all the metadata from all the workstation clients in the coherent file system but only transmits on exceptional circumstances e.g. disaster recovery, restoration, or initializing new satellites. Generally files are not distributed at all and only file blocks are transmitted upon request. Metadata about file states are optimized among peers and minimized up to or down from the coherent file state authority to reduce latency due to congestion.
Reference will now be made to the drawings to describe various aspects of exemplary embodiments of the invention. It should be understood that the drawings are diagrammatic and schematic representations of such exemplary embodiments and, accordingly, are not limiting of the scope of the present invention, nor are the drawings necessarily drawn to scale.
Referring now to
The method of operating the workspace client apparatus is to present to the user a file system view of a virtual store 191. When a user commits a file in the file system view, the access is intercepted by the workspace client and in an example written to a locally attached file store 141. The workspace client further communicates the change in state to the file system server 150 which records the change in file state in the content point store 160 for this particular user's file state view and archives the new version. In an embodiment, the workspace client 131 also receives information about its peer workspace clients.
Referring now to
Referring now to
Referring now to
Referring now to
The present invention comprises a method for operating a system comprising the steps: within each workspace client apparatus, exchanging with other workspace clients file information, responding to a request for a managed file from a workstation, presenting a file state view to a user, updating a file state server with state information for each file in the file state view. In an embodiment the present invention comprises a file state server apparatus comprising a processor coupled to a plurality of stores and to a network adapter, the processor adapted to maintain the temporal state of files in workspace clients. In an embodiment, a file state server apparatus further comprises a file store. In an embodiment the method for operating file state server comprises: providing likely candidate workspaces to provide the files, providing the files from its own file store, controlling the access rights for managed files, and updating workspace clients about changes in other workspace clients.
In an embodiment the file state server provides a managed file to a workspace client. In an embodiment a workspace client stores redundant file copies which are not presented to a file state view. The present invention is a method for operating a workspace client apparatus comprising the following processes: updating managed file information comprising: receiving information about workspace clients from other workspace clients receiving information about workspace clients from the file state server presenting a file system view of files managed by the file state server to all users, presenting a file system view of pre-managed files local to the workspace client only to the owner of a pre-managed file, providing managed files to other workspace clients, upon receiving a network request for a managed file from a peer workspace client, examining local file store for requested file, transmitting change log for requested file, transmitting content point for request file, transmitting identities of workspace clients known to have the requested file, exchanging updated file state views of managed files; upon receiving a local user file write storing the file to locally attached file store, if the file is a managed file, determining a change log to a content point, transmitting to a file state server apparatus the current file state, transmitting to a file state server apparatus the file store location, updating the file system view, and archiving a new version of the file; upon receiving a local user file read retrieving the file from locally attached store, if the file is a managed file not in locally attached file store, If possible, satisfies request from within its current holdings If possible, satisfies request from other workspace client, and otherwise, identifying another workspace client.
In an embodiment, the method for operating a workspace client further comprises receiving a change log for a file, receiving a content point for a file, receiving and combining file parts from a plurality of sources. In an embodiment the method for operating a workspace client further comprises: optimizing workspace client information exchange maintaining change logs over a finite range exchanging only incremental changes if possible and, otherwise, exchanging full state. In an embodiment, the method for operating a workspace client further comprises: choosing suitable peer workspace clients based on similar file system views to request blocks from, collecting statistics about performance and reliability of other workspace clients to prioritize block requests, and recording the frequency of network link failures to determine redundant resource gaps. In an embodiment, the network is an encrypted ssl tunnel.
Referring to
Referring to
Referring to
Referring to
Referring to
Referring to
In an embodiment the method for operating a workspace client comprises: committing a file or modified file into a file system view, intercepting a block read request of a certain committed file, requesting the current revision number for a certain committed file from a file state server, determining if local store contains a current revision of a certain committed file, examining file system views of peer workspace clients to locate a current revision of a certain committed file, requesting transfer of a block of a certain committed file from a peer workstation client, recording each access to each file in file system view, and removing from local store a least recently used file.
Referring now to
An other aspect of the invention is a method for operating a plurality of peer workstation client apparatuses in a coherent file system to minimize network traffic, the method comprising: at a second peer workstation client apparatus communicatively coupled to a first peer workstation client apparatus, receiving a request for metadata about a version of a file stored at the second peer workstation client apparatus 357, retrieving from a second file state view the requested metadata about all the file blocks of all the versions of a file stored at the second peer workstation client apparatus 358, transmitting the requested metadata about the versions of the file stored at the second peer workstation apparatus 359; receiving a request for metadata about a version of a file stored at an other peer workstation client apparatus 360, retrieving from a second peer file system view the requested metadata about any file blocks of any versions of the file stored at an other peer workstation client apparatus 361, transmitting the requested metadata about versions of the file stored at an other peer workstation client apparatus 362; receiving a request for a file block stored at a second local file block store 363, reading the requested file block from the second local file block store 364, and transmitting the requested file block to the requesting first peer workstation apparatus 365; at a first workstation client apparatus, determining that a version of a file is not entirely stored at a first local file block store 366 by examining metadata in a first file state view for metadata of file blocks, determining that a version of a file is stored at a second local file block store of the communicatively coupled second peer workstation client apparatus 367 by examining metadata in a first peer file system view, intercepting file block read commands from an application configuring the first peer workstation client apparatus 368, on the condition that the requested file block is stored in local file block store, fulfilling the file block read without network traffic 371, on the condition that the requested file block is stored in a second local file block store at a second peer workstation client apparatus, transmitting a request over a network, fulfilling the file block read with the response, storing the response to the first local file block store and updating the first file state view 372, on the condition that the requested version of the file is not found in either the first file state view or the first peer file system view, transmitting a request to at least one other peer workstation client apparatus for metadata about a directory or sub-directory of files and updating the first peer file system view with the response 374.
An other aspect of the invention is a method for operating a first file sub-set satellite apparatus communicatively coupled to a second file sub-set satellite apparatus, the method comprising: receiving a request for metadata about a directory or sub-directory of files 380, reading from a plurality of stored peer file system views the metadata which identifies the location of each file block that makes up each version of the requested directory or sub-directory of files 381, transmitting the requested metadata to the second file sub-set satellite apparatus 382, requesting from the second file sub-set satellite apparatus metadata about another directory or sub-directory of files 383, and updating the peer file system views stored at the first file sub-set satellite apparatus 384.
In an embodiment, operating a first file sub-set satellite apparatus communicatively coupled to a workstation client apparatus is improved by: receiving a request for metadata about a directory or sub-directory of files 385, reading from a plurality of stored peer file system views the metadata which identifies the location of each file block that makes up each version of the requested directory or sub-directory of files which it received from another file state sub-set apparatus 386, if not found, requesting from another file sub-set satellite apparatus metadata about said directory or sub-directory of files 387, and transmitting the requested metadata and location of the requested file block to the requesting workstation client apparatus 388.
In an embodiment, the method for operating a file state sub-set satellite of further includes the steps: establishing communication through a network with a coherent file state authority 389, reading a plurality of peer file system views from storage 390, transmitting metadata about files stored at a plurality of peer workstation clients communicatively coupled to the file state sub-set satellite to a coherent file state authority 391, and uploading file blocks required to fulfill the metadata to a conventional backup server accessible by the coherent file state authority 392.
In an embodiment the method of operating a coherent file system transforms the plurality of individual workstation file state views as follows: receiving from a file state sub-set satellite metadata about a plurality file blocks which make up a version of a file at one of the plurality of workstation clients 393, transforming the metadata to integrate a coherent file state view for every version of said file at every workstation client 394; storing said coherent file state view into a coherent file system view of every version of every file at any workstation client 395, and requesting a file block referenced by the metadata to be stored in conventional backup server storage 396.
In an embodiment, the method of operating a coherent file system is extended for recovery or reinitialization: reading metadata about a directory or sub-directory of files from file system view store 398, transmitting said metadata about each file block of each version of each file to a file state sub-set satellite apparatus 399. This enables, a conventional backup server to read upon request each file block necessary to fulfill any request for a file block of the file state sub-set metadata transmitted to the file state sub-set satellite, and transmits the requested file blocks to the file state sub-set satellite.
Referring now to
A file state view circuit 441 tracks every file block stored in the local file block store 461 and provides metadata on which file blocks belong to each version and where the file blocks are located. A user has a workspace which includes every version of every file that he may read or modify. When a user operates a program product to read a file block of a version of a file, the workstation client determines from the metadata in file state view, which file block in local file block store corresponds to that version and fulfills the file block read command. When a user modifies a file and commits the modified file into the coherent file system, the metadata is changed in the file state view to create a new version which points to one or more new file blocks which contain the changes as well as to the file blocks which remain unchanged.
In an embodiment of the invention, a first peer workstation client apparatus 421 of a coherent file system comprises: a processor configured by computer executable instructions to store, retrieve, and transform data and to track versions of files which are made up of file data blocks and to intercept file block read and file block write commands issued by a program product; a file state view circuit 441 configured to create, store, and retrieve metadata about file blocks, the versions of files that each block is a component of, and the location of each block whereby file blocks that are identical in multiple versions of a file are not redundantly stored; a local file block store 461 for all file blocks that comprise versions of files whose metadata is stored in the file state view circuit 441.
Referring now to
Each workstation client apparatus maintains metadata about file versions it generates and stores in its own file state view 441-442. Files which are committed into the coherent file system by a user operating the second peer workstation client apparatus 422 have all its blocks stored into local file block store 462 and have the metadata that relates the file version to the file blocks stored in the file state view 442. A file may be stored in multiple versions on either one or a plurality of workstation client apparatuses. A version of a file may differ from another version of the same file by only one file block. A version of a file on the first peer workstation client may differ from another version of a file on the second peer workstation client entirely within one file block. Each workstation client apparatus has at least one peer file system view 481-482 which contains metadata about the files of peer workstation clients. The peer file system views may not be up-to-date while the file state views always must reflect every version of every file currently accessible in the local file block store.
Each workstation client apparatus may query its neighbors for updates of metadata concerning directories or sub-directories of files. Note that the files themselves or their blocks are not distributed to every other peer workstation client in the present invention because the files are enormous for the intended use and the individual file blocks that make up the files would consume scarce bandwidth to transfer unnecessarily. Neither the file system view metadata nor the file blocks are transmitted except upon request among the peer workstation clients.
On the condition when a file version is not found by a workstation client 421 in its own file state view 441, then the workstation client will check its peer file system view 481 for the desired version. Even then, no file blocks are transferred. Upon a file block read command of an application in the first peer workstation client, it may be determined by examining the metadata of the file version that the requested file block is already stored within the local file block store and can be fulfilled without network traffic. Applicant points out that entire versions of files are not unnecessarily transferred between peer workstations. Applicant points out that it is unnecessary to assemble and integrate all the file blocks of a version of a file from a plurality of peer workstations. Even if a workstation does not have a complete set of the file blocks of a desired version, it may still fulfill a file block read of a version which it does not have completely by providing a locally stored file block when the requested file block remained unchanged from one version to another. It is unnecessary to retrieve and integrate all the file blocks of a file when only one block is required.
On the condition that a file block read command requires a file block that is recorded in the peer file system view 481 to be located at the local file block store 482 of the second peer workstation client, a request will be transmitted and fulfilled across the network. In an embodiment, on the condition that metadata is not found for a file version in peer file system view 481 of the first peer workstation, a request will be transmitted for updated metadata to the second peer workstation client for metadata in its file state view 442 or peer file system view 482. Much as neighbors pass gossip around, the peer file systems views will converge over time but cannot constantly be as accurate as the file state view which each workstation client maintains for its own local file block store. In an embodiment, after requesting and fulfilling a file block request, the first or second or both peer workstation client apparatuses may request updated metadata on directories or sub-directories of files stored in their respective neighbor's file state view and peer file system views.
Referring now to
Referring now to
Referring now to
Referring now to
The present invention is easily distinguished from peer-to-peer distribution systems that increase rather than decrease unnecessary network traffic and congestion by broadcasting or pushing files, checksums, authentication codes, or notifications.
The present invention is easily distinguished from conventional network file systems and source code control solutions by its coherent file state authority which tracks every version of every managed file and its plurality of network attached workspace client apparatuses which respond to requests for randomly accessible blocks of files among themselves. It can be appreciated that that a conventional network file system has no concept of disjoint file views where each user may be concerned with using a plurality of versions of a file in parallel.
It is distinguishing characteristic that file blocks and files are not “pushed” to subscribers which would increase traffic and unnecessary congestion on the network. File blocks are only transmitted upon request. Only metadata about file blocks and new versions are “pushed” to a file state sub-set satellite or to a coherent file state authority when a new version is generated at each workstation client. Among workstation clients, such metadata is only transmitted upon request.
It is a distinguishing characteristic that workspace clients respond to requests for randomly accessed data blocks rather than whole file transfers and that workspace clients request metadata about the file blocks of a version of a file from a file state view before fulfilling data reads with locally stored file blocks. It is a distinguishing characteristic that the present invention does not compute a checksum or authentication code for an entire file assembled from parts provided by distributed storage. It is a distinguishing characteristic that the present invention does not integrate an entire file from file blocks. It is a distinguishing characteristic that the present invention does not disclose new ways to determine if a version of a file is newer or more current than another version or if they are the same, or if the file has been corrupted in transmission. The present invention reduces congestion and minimizes latency in enabling access to a desired file block of any version of any file committed into the coherent file system.
The techniques described herein can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The techniques can be implemented as a computer program product, i.e., a computer program tangibly embodied in an information carrier, e.g., in a machine-readable storage device or in a propagated signal, for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple computers. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.
Method steps of the techniques described herein can be performed by one or more programmable processors executing a computer program to perform functions of the invention by operating on input data and generating output. Method steps can also be performed by, and apparatus of the invention can be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit). Modules can refer to portions of the computer program and/or the processor/special circuitry that implements that functionality.
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in special purpose logic circuitry.
A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. For example, other network topologies may be used. Accordingly, other embodiments are within the scope of the following claims.
The present application is a continuation in part of 12/541,883 filed Aug. 12, 2009 by the same inventors which is incorporated in its entirety by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 12541883 | Aug 2009 | US |
Child | 13596049 | US |