1. Field of the Invention
The disclosed embodiments relate generally to techniques for building client server applications. In particular, described embodiments include systems and methods for efficient interactions between application servers and distributed datastores.
2. Description of Related Art
Traditional client/server computing architectures are typically composed of two main components: the compute and storage tiers. The compute tier is sometimes broken down into the access and logic tiers, but their purpose remains the same (compute). These tiers are often physically separated on different computers for isolation, scaling, and other purposes. As requests are served by the compute tier, data is fetched and processed from the storage tier and manipulated before being sent back to the requesting client.
Components in the access tier manage the communication from clients. The logic tier includes algorithms to service requests from the access tier. The storage tier contains data stored in a persistent mechanism. By way of example, a remote client contacts a server with a request. This server contains both the access and logic tiers. The server communicates with the storage tier, in this case a remote database, and pulls data back to itself. It then applies business logic to this data and returns a response to the remote client. As the number of clients increases, the server tier can be scaled larger by the addition of more nodes. Servers are stateless, as data itself is stored in the storage tier. The traditional storage tier, however, does not scale by adding more database servers; there is no logic for partitioning data across a set of database servers.
An application server is a piece of software code running in a service container or framework that is accessed over a network by other software. The application server provides logical ability to process data, but it stores data in a storage tier. This means that if an application server is restarted or shutdown, its processing ability is lost but data stored in the separate storage tier is available upon the application server's return.
One type of storage tier instance is a distributed datastore (
Distributed datastores often contain built-in logic to manage replicas of individual pieces of data across multiple physical machines. This replication ensures availability of individual data despite individual machine failure.
Described embodiments provide systems and methods for building client server applications. The application server code is deployed within a distributed datastore and utilizes the datastore's data locality information to fulfill requests with minimal remote procedure calls (RPC), transfer of data between servers, and shared data caches. The application server inherits the properties of the distributed datastore such as statefulness, replication and failover.
Described embodiments also enable the fetching and processing of data in a “big data” environment—a system that contains multiple servers' worth of data, with improved time and bandwidth considerations. The integration of the application server with the datastore eliminates network hops and transfer of data.
In described embodiments, an application process is embedded in the distributed datastore process and the distributed datastore's locality information is used to make the application server aware of the locality of data in the cluster. Locality refers to information describing where a given piece of data physically resides in a set of computers that act together to provide greater computing resources. This allows the application server to either proxy requests to other nodes that have the data locally or access the local data without a remote call since the datastore APIs are available internally.
A more complex use case includes a parallel processing request. Consider an implementation that aggregates the values of data points based on a user-defined set of identifiers, such as an Internet search system. The set of identifiers is sent to an embedded server 406a, which uses the distributed datastore's 408a data locality lookup API to identify the list of nodes 408b, 408c to which to send sub-requests. The sub-requests are created and sent by the embedded server 406a to the other embedded servers 406b, 406c to aggregate the data contained locally on those nodes 408b, 408c and send the response back to the original server 406a, which aggregates those results and returns the response to the requesting client 402.
As demonstrated by each of the above examples, by merging the application service 406 with the distributed datastore 408, the scalable and fault tolerant properties of the distributed datastores architecture provide a highly available application server, unlike conventional systems. Also, by using the native API of the distributed datastore the efficiency and physical awareness of an application can be greatly improved.
At runtime each distributed datastore node is running the embedded application service. This allows the application service to be accessed from any node.
Returning to
If the data required for the request lives on different datastore nodes, the application forwards 1006 the request to another distributed datastore's embedded application service layer to be processed. While the request is being processed by the other node(s), the initial node can wait for a response or service 1008 other requests. Once the responses from the other nodes are all received, extra logic can be applied if required by the application server 1012 and the final response is sent to the request sender 1014.
The present invention has been described in particular detail with respect to a limited number of embodiments. Those of skill in the art will appreciate that the invention may additionally be practiced in other embodiments.
Within this written description, the particular naming of the components, capitalization of terms, the attributes, data structures, or any other programming or structural aspect is not mandatory or significant, and the mechanisms that implement the invention or its features may have different names, formats, or protocols. Further, the system may be implemented via a combination of hardware and software, as described, or entirely in hardware elements. Also, the particular division of functionality between the various system components described herein is merely exemplary, and not mandatory; functions performed by a single system component may instead be performed by multiple components, and functions performed by multiple components may instead be performed by a single component.
Some portions of the above description present the feature of the present invention in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are the means used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art. These operations, while described functionally or logically, are understood to be implemented by computer programs. Furthermore, it has also proven convenient at times, to refer to these arrangements of operations as modules or code devices, without loss of generality.
It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the present discussion, it is appreciated that throughout the description, discussions utilizing terms such as “selecting” or “computing” or “determining” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system memories or registers or other such information storage, transmission or display devices.
Certain aspects of the present invention include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions of the present invention could be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by real time network operating systems.
The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, DVDs, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus. Furthermore, the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will appear from the description above. In addition, the present invention is not described with reference to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any references to specific languages are provided for disclosure of enablement and best mode of the present invention.
Finally, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention.
This application is a continuation of U.S. patent application Ser. No. 13/590,080, filed on Aug. 20, 2012, which claims the benefit of U.S. Provisional Application 61/525,763, filed on Aug. 20, 2011, each of which are incorporated by reference herein in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
7162467 | Eshleman et al. | Jan 2007 | B2 |
7596571 | Sifry | Sep 2009 | B2 |
8037024 | Bozkaya et al. | Oct 2011 | B1 |
8214827 | Ben-Arie et al. | Jul 2012 | B2 |
8244831 | McLennan et al. | Aug 2012 | B1 |
8631094 | Alpert | Jan 2014 | B1 |
8634947 | Kleinpeter | Jan 2014 | B1 |
8825608 | Lipstone | Sep 2014 | B2 |
8935318 | Konerding et al. | Jan 2015 | B1 |
9026493 | Weng | May 2015 | B1 |
20010039565 | Gupta | Nov 2001 | A1 |
20050193021 | Peleg | Sep 2005 | A1 |
20050278422 | Prust | Dec 2005 | A1 |
20070061543 | Uppala | Mar 2007 | A1 |
20070143369 | Uppala | Jun 2007 | A1 |
20070234334 | Araujo et al. | Oct 2007 | A1 |
20090248699 | Alvarez et al. | Oct 2009 | A1 |
20090260016 | Ramakrishnan et al. | Oct 2009 | A1 |
20090313265 | Sifry | Dec 2009 | A1 |
20100269158 | Ehler et al. | Oct 2010 | A1 |
20110196855 | Wable et al. | Aug 2011 | A1 |
20110202929 | Schleimer et al. | Aug 2011 | A1 |
20110276692 | Waldman et al. | Nov 2011 | A1 |
20110288970 | Kidron et al. | Nov 2011 | A1 |
20120110150 | Kosuru et al. | May 2012 | A1 |
20120233228 | Barton et al. | Sep 2012 | A1 |
20120233293 | Barton et al. | Sep 2012 | A1 |
20120233522 | Barton et al. | Sep 2012 | A1 |
20120290714 | Cohen | Nov 2012 | A1 |
Number | Date | Country |
---|---|---|
WO-2008027768 | Mar 2008 | WO |
WO-2010019105 | Feb 2010 | WO |
Entry |
---|
United States Office Action for U.S. Appl. No. 13/590,080, Jan. 31, 2014, 11 pages. |
Number | Date | Country | |
---|---|---|---|
61525763 | Aug 2011 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13590080 | Aug 2012 | US |
Child | 14576845 | US |