Cache region concept

Abstract
A method to associate a storage policy with a cache region is disclosed. In this method, a cache region associated with an application is created. The application runs on virtual machines, and where a first virtual machine has a local memory cache that is private to the first virtual machine. The first virtual machine additionally has a shared memory cache that is shared by the first virtual machine and a second virtual machine. Additionally, the cache region is associated with a storage policy. Here, the storage policy specifies that a first copy of an object to be stored in the cache region is to be stored in the local memory cache and that a second copy of the object to be stored in the cache region is to be stored in the shared memory cache.
Description
TECHNICAL FIELD

Embodiments of the present invention relate to memory management, and in one embodiment, a method to minimize memory footprint of different software entities and maximize performance using already constructed objects.


BACKGROUND


FIG. 1 shows a prior art computing system 100 having N virtual machines 113, 213, . . . N13. The prior art computing system 100 can be viewed as an application server that runs web applications and/or business logic applications for an enterprise (e.g., a corporation, partnership or government agency) to assist the enterprise in performing specific operations in an automated fashion (e.g., automated billing, automated sales, etc.).


The prior art computing system 100 runs are extensive amount of concurrent application threads per virtual machine. Specifically, there are X concurrent application threads (1121 through 112X) running on virtual machine 113; there are Y concurrent application threads (2121 through 212Y) running on virtual machine 213; . . . and, there are Z concurrent application threads (N121 through N12Z) running on virtual machine N13; where, each of X, Y and Z are a large number.


A virtual machine, as is well understood in the art, is an abstract machine that converts (or “interprets”) abstract code into code that is understandable to a particular type of a hardware platform. For example, if the processing core of computing system 100 included PowerPC microprocessors, each of virtual machines 113, 213 through N13 would respectively convert the abstract code of threads 1121 through 112X, 2121 through 212Y, and N121 through N12Z into instructions sequences that a PowerPC microprocessor can execute.


Because virtual machines operate at the instruction level they tend to have processor-like characteristics, and, therefore, can be viewed as having their own associated memory. The memory used by a functioning virtual machine is typically modeled as being local (or “private”) to the virtual machine. Hence, FIG. 1 shows local memory 115, 215, N15 allocated for each of virtual machines 113, 213, . . . N13 respectively.


A portion of a virtual machine's local memory may be implemented as the virtual machine's cache. As such, FIG. 1 shows respective regions 116, 216, . . . N16 of each virtual machine's local memory space 115, 215, . . . N15 being allocated as local cache for the corresponding virtual machine 113, 213, . . . N13. A cache is a region where frequently used items are kept in order to enhance operational efficiency. Traditionally, the access time associated with fetching/writing an item to/from a cache is less than the access time associated with other place(s) where the item can be kept (such as a disk file or external database (not shown in FIG. 1)).


For example, in an object-oriented environment, an object that is subjected to frequent use by a virtual machine (for whatever reason) may be stored in the virtual machine's cache. The combination of the cache's low latency and the frequent use of the particular object by the virtual machine corresponds to a disproportionate share of the virtual machine's fetches being that of the lower latency cache; which, in turn, effectively improves the overall productivity of the virtual machine.


A problem with the prior art implementation of FIG. 1, is that, a virtual machine can be under the load of a large number of concurrent application threads; and, furthermore, the “crash” of a virtual machine is not an uncommon event. If a virtual machine crashes, generally, all of the concurrent application threads that the virtual machine is actively processing will crash. Thus, if any one of virtual machines 113, 213, N13 were to crash, X, Y or Z application threads would crash along with the crashed virtual machine. With X, Y and Z each being a large number, a large number of applications would crash as a result of the virtual machine crash.


Given that the application threads running on an application server 100 typically have “mission critical” importance, the wholesale crash of scores of such threads is a significant problem for the enterprise.


SUMMARY

Methods for a treatment of cached objects are described. In one embodiment, an application is configured to cache objects according to different caching treatments, each different caching treatment associated with its own region of cache. The configuring also involves installing a different combination of storage and eviction policy plug-ins for each different region of cache, with each of the storage plug-ins including code to perform get and put operations, and each of the eviction policy plug-ins including code to perform cache eviction operations.





BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example, and not limitation, in the figures of the accompanying drawings in which:



FIG. 1 shows a portion of a prior art computing system.



FIG. 2 shows a portion of an improved computing system.



FIG. 3 shows a cache management service.



FIG. 4 illustrates one embodiment of a cache implementation with respect to local memory and shared memory.



FIG. 5 illustrates an embodiment of a first cache region flavor.



FIG. 6 illustrates an embodiment of a second cache region flavor.



FIG. 7 illustrates an embodiment of a third cache region flavor.



FIG. 8 illustrates an embodiment of a fourth cache region flavor.



FIG. 9 illustrates one embodiment of different programming models for a storage plug-in.



FIG. 10 illustrates one embodiment of an organization structure of a cache region.



FIG. 11 illustrates a block diagram of one embodiment of a “get” operation using the Group Manipulation functionality.



FIG. 12 illustrates a detailed perspective of retrieving an attribute associated with a particular object.



FIG. 13
a illustrates an embodiment of an eviction policy plug-in.



FIG. 13
b illustrates a detailed perspective of various types of queues that may be implemented by the Sorting component of an eviction policy plug-in.



FIG. 14 illustrates a detailed graph of one type of Eviction timing component functionality.



FIG. 15 illustrates a detailed graph of another type of Eviction timing component functionality.



FIG. 16 shows a depiction of a cache region definition building process.



FIG. 17 illustrates a detailed perspective of one embodiment of a distributed cache architecture.



FIG. 18 illustrates a block diagram of one method of sharing an object between different computing systems.



FIG. 19 illustrates an embodiment of a computing system.





DETAILED DESCRIPTION

In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. In other instances, well-known structures and devices are shown in block diagram form.


Note that in this detailed description, references to “one embodiment” or “an embodiment” mean that the feature being referred to is included in at least one embodiment of the invention. Moreover, separate references to “one embodiment” in this description do not necessarily refer to the same embodiment; however, neither are such embodiments mutually exclusive, unless so stated, and except as will be readily apparent to those skilled in the art. Thus, the invention can include any variety of combinations and/or integrations of the embodiments described herein.


The present invention includes various steps, which will be described below. The steps of the present invention may be performed by hardware components or may be embodied in machine-executable instructions, which may be used to cause a general-purpose or special-purpose processor programmed with the instructions to perform the steps. Alternatively, the steps may be performed by a combination of hardware and software.


The present invention may be provided as a computer program product that may include a machine-readable medium having stored thereon instructions, which may be used to program a computer (or other electronic devices) to perform a process according to the present invention. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPOMs, magnetic or optical cards, flash memory, or other type of media/machine-readable medium suitable for storing electronic instructions.


Shared Memory and Shared Closures


FIG. 2 shows a computing system 200 that is configured with less application threads per virtual machine than the prior art system of FIG. 1. Less application threads per virtual machine results in less application thread crashes per virtual machine crash; which, in turn, should result in the new system 200 of FIG. 2 exhibiting better reliability than the prior art system 100 of FIG. 1.


According to the depiction of FIG. 2, which is an extreme representation of the improved approach, only one application thread exists per virtual machine (specifically, thread 122 is being executed by virtual machine 123; thread 222 is being executed by virtual machine 223, . . . and, thread M22 is being executed by virtual machine M23). In practice, the computing system 200 of FIG. 2 may permit a limited number of threads to be concurrently processed by a single virtual machine rather than only one.


In order to concurrently execute a comparable number of application threads as the prior art system 100 of FIG. 1, the improved system 200 of FIG. 2 instantiates more virtual machines than the prior art system 100 of FIG. 1. That is, M>N.


Thus, for example, if the prior art system 100 of FIG. 1 has 10 application threads per virtual machine and 4 virtual machines (e.g., one virtual machine per CPU in a computing system having four CPUs) for a total of 4×10=40 concurrently executed application threads for the system 100 as a whole, the improved system 200 of FIG. 2 may only permit a maximum of 5 concurrent application threads per virtual machine and 6 virtual machines (e.g., 1.5 virtual machines per CPU in a four CPU system) to implement a comparable number (5×6=30) of concurrently executed threads as the prior art system 100 in FIG. 1.


Here, the prior art system 100 instantiates one virtual machine per CPU while the improved system 200 of FIG. 2 can instantiate multiple virtual machines per CPU. For example, in order to achieve 1.5 virtual machines per CPU, a first CPU will be configured to run a single virtual machine while a second CPU in the same system will be configured to run a pair of virtual machines. By repeating this pattern for every pair of CPUs, such CPU pairs will instantiate 3 virtual machines per CPU pair (which corresponds to 1.5 virtual machines per CPU).


Recall from the discussion of FIG. 1 that a virtual machine can be associated with its own local memory. Because the improved computing system of FIG. 2 instantiates more virtual machines that the prior art computing system of FIG. 1, in order to conserve memory resources, the virtual machines 123, 223, . . . M23 of the system 200 of FIG. 2 are configured with less local memory space 125, 225, . . . M25 than the local memory space 115, 215, . . . N15 of virtual machines 113, 213, . . . N13 of FIG. 1. Moreover, the virtual machines 123, 223, . . . M23 of the system 200 of FIG. 2 are configured to use a shared memory 230. Shared memory 230 is memory space that contains items that can be accessed by more than one virtual machine (and, typically, any virtual machine configured to execute “like” application threads that is coupled to the shared memory 230).


Thus, whereas the prior art computing system 100 of FIG. 1 uses fewer virtual machines with larger local memory resources containing objects that are “private” to the virtual machine; the computing system 200 of FIG. 2, by contrast, uses more virtual machines with less local memory resources. The less local memory resources allocated per virtual machine is compensated for by allowing each virtual machine to access additional memory resources. However, owing to limits in the amount of available memory space, this additional memory space 230 is made “shareable” amongst the virtual machines 123, 223, . . . M23.


According to an object oriented approach where each of virtual machines 123, 223, . . . M23 does not have visibility into the local memories of the other virtual machines, specific rules are applied that mandate whether or not information is permitted to be stored in shared memory 230. Specifically, to first order, according to an embodiment, an object residing in shared memory 230 should not contain a reference to an object located in a virtual machine's local memory because an object with a reference to an unreachable object is generally deemed “non useable”.


That is, if an object in shared memory 230 were to have a reference into the local memory of a particular virtual machine, the object is essentially non useable to all other virtual machines; and, if shared memory 230 were to contain an object that was useable to only a single virtual machine, the purpose of the shared memory 230 would essentially be defeated.


In order to uphold the above rule, and in light of the fact that objects frequently contain references to other objects (e.g., to effect a large process by stringing together the processes of individual objects; and/or, to effect relational data structures), “shareable closures” are employed. A “closure” is a group of one or more objects where every reference stemming from an object in the group that references another object does not reference an object outside the group. That is, all the object-to-object references of the group can be viewed as closing upon and/or staying within the confines of the group itself. Note that a single object without any references stemming from can be viewed as meeting the definition of a closure.


If a closure with a non shareable object were to be stored in shared memory 230, the closure itself would not be shareable with other virtual machines, which, again, defeats the purpose of the shared memory 230. Thus, in an implementation, in order to keep only shareable objects in shared memory 230 and to prevent a reference from an object in shared memory 230 to an object in a local memory, only “shareable” (or “shared”) closures are stored in shared memory 230. A “shared closure” is a closure in which each of the closure's objects are “shareable.”


A shareable object is an object that can be used by other virtual machines that store and retrieve objects from the shared memory 230. As discussed above, in an embodiment, one aspect of a shareable object is that it does not possess a reference to another object that is located in a virtual machine's local memory. Other conditions that an object must meet in order to be deemed shareable may also be effected. For example, according to a particular Java embodiment, a shareable object must also posses the following characteristics: 1) it is an instance of a class that is serializable; 2) it is an instance of a class that does not execute any custom serializing or deserializing code; 3) it is an instance of a class whose base classes are all serializable; 4) it is an instance of a class whose member fields are all serializable; 5) it is an instance of a class that does not interfere with proper operation of a garbage collection algorithm; 6) it has no transient fields; and, 7) its finalize ( ) method is not overwritten.


Exceptions to the above criteria are possible if a copy operation used to copy a closure into shared memory 230 (or from shared memory 230 into a local memory) can be shown to be semantically equivalent to serialization and deserialization of the objects in the closure. Examples include instances of the Java 2 Platform, Standard Edition 1.3 java.lang.String class and java.util.Hashtable class.


Cache Management Across Local and Shared Memory Resources

Note that the introduction of the shared memory 230 introduces the prospect of a shared cache 240. Thus, the architecture of FIG. 2 includes both local memory level caches 126, 226, . . . M26 and a shared memory cache 240. FIG. 3 shows a depiction of a cache management service 302 that can, for example, be added to the suite of services offered by a container 301 that an application thread runs in. A container is used to confine/define the operating environment for the application thread(s) that are executed within the container. In the context of J2EE, containers also provide a family of services that applications executed within the container may use (e.g., (e.g., Java Naming and Directory Interface (JNDI), Java Database Connectivity (JDSC), Java Messaging Service (JMS) among others).


Different types of containers may exist. For example, a first type of container may contain instances of pages and servlets for executing a web based “presentation” for one or more applications. A second type of container may contain granules of functionality (generically referred to as “components” and, in the context of Java, referred to as “beans”) that reference one another in sequence so that, when executed according to the sequence, a more comprehensive overall “business logic” application is realized (e.g., stringing revenue calculation, expense calculation and tax calculation components together to implement a profit calculation application).



FIG. 3 shows that more than one thread can be actively processed by the virtual machine 323 depicted therein. It should be understood that, in accordance with the discussion concerning FIG. 2, the number of threads that the virtual machine 323 can concurrently entertain should be limited (e.g., to some fixed number) to reduce the exposure to a virtual machine crash. For example, according to one implementation, the default number of concurrently executed threads is 5. In a further implementation, the number of concurrently executed threads is a configurable parameter so that, conceivably, for example, in a first system deployment there are 10 concurrent threads per virtual machine, in a second system deployment there are 5 concurrent threads per virtual machine, in a third system deployment there is 1 concurrent thread per virtual machine. It is expected that a number of practical system deployments would choose less than 10 concurrent threads per virtual machine.


The cache management service 302 is configured to have visibility into the local memory cache 325 of the virtual machine 323, the shared memory cache 340 and one or more other storage resources 350 such as a database or file system used for storing persisted objects. Here, as will be described in more detail below, different applications whose abstract code (e.g., Java byte code in the case of Java) is executed by virtual machine 323 can specially configure the cache management service 302 to treat its cached objects in accordance with specific guidelines.


According to various schemes, the cache manager 302 effectively configures regions of cache for the storage of objects in local cache memory 326 and/or in shared memory cache 340 according to different treatment policies. Multiple cache regions defining different cache treatments may be established for a single application. Cached objects placed in local memory cache 326 may be conveniently utilized by the virtual machine 323 associated with the local memory where local cache 326 resides for quick processing by the application. By contrast, cached objects placed in shared memory cache 340 may be utilized by the local virtual machine 323 as well as other virtual machines that have visibility into the shared memory in which the shared memory cache 340 is implemented.



FIG. 4 illustrates a more detailed perspective of an embodiment of the cache manager 302 of FIG. 3. Specifically, FIG. 4 illustrates the formation of multiple cache regions (cache region_1410, cache region_2412, . . . cache region_N 414) that are controlled by cache manager 402. In one embodiment, a plurality of cache regions may be controlled by cache manager 402 for a single application. The cache regions may, for example, be formed by commands executed by an application (e.g., app_1401) calling for the establishment of the cache regions themselves.


A cache region effectively determines the treatment that an object that is stored in the cache region will receive. For example, cache region_1410 determines the treatment of object 460, while cache region_2412 determines the treatment of cached object 461. By comparison, object 460 will receive different treatment than object 461 because of the different treatment imposed by the different cache regions 410, 412.


For each cache region, in an embodiment, cache manager 402 implements a storage plug-in and an eviction policy plug-in. The storage plug-in may be, in one embodiment, the actual piece of software or code that executes the “get” and “put” operations for the objects stored according to the treatment determined by the associated cache region. That is, for example, whether the object is placed in the local memory cache, the shared memory cache, or some other type of storage resource such as a database or file system for storing persisted objects. The eviction policy plug-in may be, in one embodiment, the actual piece of software or code that dictates the removal of an object from cache (e.g., when some form of cache capacity threshold is exceeded).


In continuing from the example provided above, cache region_1410 defines the treatment of object 460 with storage plug-in_1420 and eviction policy plug-in_1421. Cache region_2412 defines the treatment of object 461 with storage plug-in_2422 and eviction policy plug-in_2423. Cache region_N 414 is generally represented as having storage plug-in_N 424 and eviction policy plug-in_N 425. For simplicity of description, each cache region is described as having only a single object that is treating according to the treatment determined by the cache region, but, it should be appreciated that any number of objects may be referenced by a particular cache region. Moreover, any object stored in, copied from, written to, or removed from the shared memory cache 432 may be a single object; or, an object that is part of a shared closure where the shared closure itself is respectively stored in, copied from, written to, or removed from the shared memory cache 432.


As illustrated in FIG. 4, a storage policy plug-in of a cache region may dictate that an object stored in the local and/or shared cache memory be copied into a persisted storage space 440 (e.g., as part of the object's removal from the cache). One type of eviction process, referred to as “spooling,” initiates persistence of the object upon the object's eviction from cache. As such, the evicted object is written into deeper storage space such as a hard disk file or a remote database 440. Another or related storage policy plug-in function may be used to perform a “write-through” process, in which a “put” of an object into cache automatically results in a copy of that object being directed to storage space 440.


Until now, a cache region (e.g., cache region_1410) has been generally described as defining the treatment for a particular object, that is, for putting and/or getting an object to/from either the local memory cache and/or the shared memory cache. The following provides greater detail as to the different types of cache regions that may be implemented by cache manager 402 for the treatment of objects as defined by its storage and eviction policy plug-ins. The different types of cache management treatments are referred to as “flavors” or “cache flavors”.


Cache Management Flavors


FIG. 5 illustrates an embodiment of the first cache region flavor 500, referred to as “Local”, which has object treatment behavior defined by cache region_1511. Cache region_1511 managed by cache manager 502 includes a “local” flavor storage plug-in_1520 and eviction policy plug-in_1521 that together implement the treatment of objects that are cached within the first cache region. The local flavor is useable with non-shareable objects that have no potential for storage into shared memory. The essence of the local flavor is that an object 560 is kept in local memory cache 530 and not shared memory cache 532; and, that hard reference(s) 570, 571 are made to the object 560 so that the object 560 cannot be removed from the local memory cache 530 by a “garbage collector.” A garbage collector, which is a well known process, removes objects from local memory cache 530 (depending on the local memory usage and the type of references being made to the objects). Note that the garbage collector is a background process that is different than the eviction policy processes instituted by the eviction policy plug-in 521.


As shown in FIG. 5, according to the “local” flavor, a first “hard” reference 570 is made by application_1501 to object 560 (at least while the object 560 is actively being used by the application 501), and a second “hard” reference 571 is made to the object 560 by storage plug-in_1520. A particular type of reference to an object represents, in one embodiment, a relative difficulty level in removing an object from the memory cache. A “hard” (or “strongly reachable”) referenced object remains in the memory (i.e., the object is not removed from local memory 530 by the garbage collector). A “soft” referenced object remains in the memory until there is a danger of OutofMemoryError (e.g., threshold level is exceeded in terms of available local memory space) or some other algorithm (typically based on memory usage) used by the garbage collector. A “weak” referenced object is removed by the garbage collector regardless of the local memory's available space. A java VM implementation is allowed however, to treat soft references like weak references (i.e., softly referred to objects are removed by the garbage collector irrespective of memory usage).


Active removal of an object by the eviction policy plug-in (i.e., eviction) ignores the referenced states of the object as described above. As such, hard referenced objects may be just as easily removed as weak referenced objects according to the policies set forth by the eviction policy plug-in 521. Here, note that storage plug-in 520 may also institute “spooling” and “write through” policies to deeper storage. In an embodiment, a separate plug-in in cache region 511 (not shown) is used to interface with the deeper storage and is called upon as needed by storage plug-in 520 to institute spooling and write through policies.



FIG. 6 illustrates an embodiment of a second cache region flavor 600 referred to as “Local Soft.” The Local Soft flavor is similar to the Local flavor of FIG. 5 but is different with respect to the references made to the object 561 by the storage plug-in 522. In particular, storage plug-in_2522 does not maintain a hard reference to object 561 in local memory cache_1530. Instead, a soft reference 573 is established. With a soft reference, according to one embodiment, object 561 remains in local memory cache_1530 until the eviction policy plug-in raises some type of memory availability concern, at which time an active eviction process is invoked by eviction policy plug-in_2523.


When the active eviction process is invoked, soft reference 573 is changed to a weak reference 574. Under this condition, object 561 may be removed by a garbage collector if the application's hard reference no longer exists (e.g., because the application is no longer actively using the object 561). That is, object 561 remains protected from removal by the garbage collector as long as the application's hard reference 572 to the object 561 is present, otherwise the object will be removed. Here, note that storage plug-in 522 may also institute “spooling” and “write through” policies to deeper storage. In an embodiment, a separate plug-in in cache region 512 (not shown) is used to interface with the deeper storage and is called upon as needed by storage plug-in 522 to institute spooling and write through policies. In one embodiment, by invoking the removal of object 560 from local memory cache_1530 (either by active eviction or garbage collection), cache region_2512 may also provide for object 560 to be copied to deeper storage.


Before moving forward it is important to re-emphasize that objects stored according to either of the local flavors discussed above may be of the non shareable type so as to be incapable of membership in a shared closure and storage into shared memory. Moreover, the application is apt to configure its different local cache regions such that objects receiving local flavor treatment are apt to be more heavily used (i.e., some combination of the number of “get” and “put” accesses over time) than objects treated according to the Soft Local flavor.



FIG. 7 illustrates an embodiment of a third flavor 700 referred to as “Shared.” The “Shared” flavor is different from both the Local flavors in that an object representation resides in shared memory cache 532 as part of a shared closure. Under the “shared” flavor, when an application 501 causes an object 562a to be first “put” into local memory cache, the storage plug-in 524 also puts a copy 562b of the object 562a into the shared memory cache 520. The application 501 places a hard reference 575 to the local copy 561a.


The application 501 is then free to use the local copy 562a as a “work horse” object. For each “put” operation made to the local copy 562a, (e.g., to effectively modify the object's earlier content) the storage plug-in 524 updates/writes to the shared copy 562b to reflect the “put” into the local memory cache 530. Note that because of the presence of shared copy 562b, a virtual machine other than the virtual machine that is associated with the local memory within which local memory cache_1530 is implemented may copy the shared copy 562b into its local memory cache (e.g., local memory cache 531) so as to create a third copy 562c of the object. The third copy 562c of the object can be used as a “work horse” object for another application (not shown) that runs off of the other local memory cache 531. This other application will make a hard reference to this object 562c as well (not shown). In one embodiment, storage plug-in 524 does not place any kind of reference to shared copy 562b because any shared closure is reachable in shared memory through a key name that uniquely identifies that shared closure; and moreover, shared closures are kept in shared memory until an application explicitly calls a “delete” operation (i.e., no garbage collection process is at work in shared memory at least for cached objects). As such, there is no need for any type of reference to a shared closure residing in shared memory.


If the other application associated with local memory cache_2531 effectively modifies its local object 562c (e.g., with a “put” operation), the storage plug-in for local memory cache_2531 will create a “second version” 563 of shared object 562b in shared memory cache 532 that incorporates the modification made to local object 562c. According to an implementation, the storage plug-in 524 does not receive any affirmative indication of the existence of the new version but is instead configured to look for new versions in shared memory (e.g., upon a “put” or “get” operation) given the possibility of their existence under the shared flavor scheme. For instance, a “get” operation by application 501 will result in the reading of object 562a and object 563 by plug-in 524. Likewise, a “put” operation by application 501 can result in the fetching of object 563 by plug-in 524 so that it is possible to modify a local copy of the object 563 version. Here, note that storage plug-in 524 may also institute “spooling” and “write through” policies to deeper storage. In an embodiment, a separate plug-in in cache region 513 (not shown) is used to interface with the deeper storage and is called upon as needed by storage plug-in 524 to institute spooling and write through policies.



FIG. 8 shows another shared memory based flavor that may be referred to as “Shared Read-Only.” The essence of the Shared Read-Only approach is that local copies do not exist (i.e., only an object 564 in shared memory cache 532 exists); and, no modification is supposed to be made to the shared object under typical circumstances. The eviction policy plug-in 527 determines when the object 564 does not need to reside in shared memory cache 532 any longer.


In an extended embodiment, if a requirement to modify the object 564 arises, the storage plug-in 526 associated with the application 501 that desires to make the modification creates an entirely new object and places it into the shared memory 532 as a second version 565. Subsequently, when object 564 is requested from shared memory 532 by another application, the updated, second version 565 may also be retrieved. Here, note that storage plug-in 526 may also institute “spooling” and “write through” policies to deeper storage. In an embodiment, a separate plug-in in cache region 514 (not shown) is used to interface with the deeper storage and is called upon as needed by storage plug-in 526 to institute spooling and write through policies.


For either of the shared flavors discussed above, the storage plug-in may be configured to control the size of the shared closures that are being entered into shared memory cache 532. Specifically, smaller shared closures may be “bundled” with other shared closures to form effectively a data structure that contains multiple shared closures and that is effectively treated as a single shared closure for copying operations from shared memory cache 532 into local memory cache 530 (or vice versa). Here, a bundle may be created simply by ensuring that each shared closure in the bundle is associated through a reference to another shared closure in the bundle.


By increasing bundle size, overhead associated with copying objects back and forth between shared memory and local memory is reduced in certain circumstances, particularly, environments where many smaller shared closures are to be sent between shared memory and local memory at about the same time. Here, by bundling them, all shared closures can effectively be transported between shared memory and local memory by a single transfer process.


Storage Plug-in Programming Models

Until now, the storage plug-in for a particular cache region has been generally described as defining the cache storage treatment of one or more objects associated with the cache region. The storage plug-in may be, in one embodiment, the actual piece of software or code that executes various operations (e.g., “get” or “put”) for objects stored according to the treatment determined by the associated cache region. FIG. 9 illustrates a more detailed perspective of a possible implementation for a single cache region 602. Recall that multiple cache regions may be established for a single application. Cache region 602 is shown having storage plug-in 603 and eviction policy plug-in 610.


Storage plug-in 603, in one embodiment, is logically represented as being capable of performing several functions, including Key Object Manipulation 604, Key Attribute Manipulation 605, Group Manipulation 606, Key Set Operations 607, Key System Attribute Manipulation 608, and Object Size Information 609. Several functionalities are also associated with eviction policy plug-in 610. These functionalities include Sorting 611, Eviction Timing 612, and Object Key Attribution 613. The various functionalities of eviction policy plug-in 610, which also define a treatment of objects in local memory cache 630 and shared memory cache 632, are described in greater detail further below with respect to FIGS. 13a,b-15. One, all, or a combination of these functionalities may be associated with each object that is handled according to the treatment defined by cache region 602. Again, exemplary discussing is provided in the context of a single object. But, it should be understood that at least with respect to the treatment of objects cached in shared memory, such objects may also be in the form of a shared closure.


Key Object Manipulation 604 is a storage plug-in function that relates to the “get” and “put” operations for an object. For example, a “get” operation retrieves a particular object from local cache memory 630 and/or shared memory cache 632 depending on the “flavor” of the plug-in (consistent with the different caching flavors described in the preceding section). A “put” operation places a copy of an object into local memory cache 630 and/or shared memory cache 632 (again, consistent with the specific “flavor” of the plug-in). For each object associated with a cache region, an object name may be assigned to each object. In turn, each object name may correspond to a unique key value. One embodiment of this organizational structure is illustrated in FIG. 10.


Referring to FIG. 10, cache region 602 includes a cache group_1620 associated with N objects 670, 671, . . . 672. It is important to point out that multiple groups of objects (or “object groups”) may be established per cache region (i.e., FIG. 10 only shows one group but multiple such groups may exist in cache region 602). As will be described in more detail below, assignment of objects into a group allows for “massive” operations in which, through a single command from the application, an operation is performed with every object in the group.


Each object of cache group_1620 is associated with a unique key. That is, for example, Key_1640 is associated with object 670, key_2641 is associated with object 671, and key_N is associated with object 672. Each key is a value (e.g., alphanumeric) that, for instance, in one embodiment, is the name of the object. In an embodiment, the key for an object undergoes a hashing function in order to identify the numerical address in cache memory where the object is located.


As such, the Key Object Manipulation functionality 604 of storage plug-in 603 utilizes the key associated with an object to carry out “put” and “get” operations on that object. For simplicity, only a local memory cache 635 is considered (e.g., the storage plug-in may be a “local” or “soft local” flavor).


As an example, object 670 may have the key “Adam” in simple text form. An application (e.g., application_1601 of FIG. 9) provides the input for a “put” operation of object 670 which may take the form of [PUT, ADAM] in cache. The key, “Adam,” undergoes a hashing function by storage plug-in 603 to generate the cache address where object 670 is to be stored. The key object manipulation “put” functionality of storage plug-in 603 completes the “put” operation by'writing object 670 to local memory cache 630 at the address described provided by the hashing function.


A feature of the Key Object Manipulation 604 functionality is that an application does not need to know the exact location of a desired object. The application merely needs to reference an object by its key only and the Key Object Manipulation 604 functionality of the storage plug-in is able to actually put the object with that key into the local memory cache 630.


A “get” operation may be performed by the Key Object Manipulation 604 functionality in a similar manner. For example, object 671 may have the name “Bob.” An application (e.g., application_1601 of FIG. 9) provides the input for the “get” operation of object 671 which may take the form of, [GET BOB] from cache. The key, “Bob,” undergoes a hashing function by storage plug-in 603 to determine the numerical address where object 671 is stored in local memory cache 630. The Key Object Manipulation 604 functionality of storage plug-in 603 completes the “get” operation by copying or removing object 671 to some other location in local memory 635 outside the cache.


Key Attribute Manipulation 605 is a functionality that relates to defining or changing particular attributes associated with an object. Here, each object has its own associated set of “attributes” that, in one embodiment, are stored in cache address locations other than that of the object itself. Attributes are characteristics of the object's character and are often used for imposing appropriate treatment upon the object. Examples of attributes include shareable/non-shareable and time-to-live (an amount of time an object is allowed to stay in a cache region before being evicted). As each cached object is associated with a key, an object's attributes may also be associated with the key.


Thus, as depicted in FIG. 10, Key_1640, which is the key for object 670, is also associated with the collection of attributes_1680 for object 670. Key_2641, which is the key for object 671, is also associated with the collection of attributes_2681 for object 671. Note that the attributes 680-682 are also stored in local memory cache 630 (but are not drawn in FIG. 10 for illustrative simplicity). As will be described in more detail below, in an embodiment, the key Attribute Manipulation function 605 performs a first hashing function on the key to locate the collection of attributes for the object in cache; and, performs a second hashing function on a specific type of attribute provided by the application to identify the specific object attribute that is to be manipulated (e.g., written to or read).


The Key System Attribute Manipulation 608 allows for system attributes (i.e., system level parameters) to be keyed and manipulated, and, operates similarly to the key attribute manipulation 605.


Group Manipulation 606 is a storage plug-in functionality that allows for “put” or “get” manipulation of all the objects within a particular group. By specifying the group name for a group of objects, the application may retrieve (“get”) all the objects within that group. In an embodiment, the keys for a particular group are registered with the storage plug-in 603. As such, a group name that is supplied by the application is “effectively” converted into all the keys of the objects in the group by the storage plug-in 603. For example, application_1601 may run a “get” operation for cache group_1620. By using the name of cache group_1620 as the input, each of keys key_1640, key_2641, . . . key_N 642 are used by the storage plug in cache of keys to perform a “get” operation.



FIG. 11 illustrates a block diagram 900 of one embodiment of a “get” operation using Group Manipulation 606 functionality and is described in association with FIG. 9 and FIG. 10. This functionality is particularly useful in scenarios involving “massive” operations in which all objects from a particular group are to be affected. Application_1601 first specifies 701 the group name (e.g., the name of cache group_1620) needed for the operation and the “get” operation itself. In response, the Group Manipulation 606 functionality of the storage plug-in 603 retrieves 702 all the objects in the group by using the key for the object the group. The “get” operation ends 703 when there are no more keys to use.


The Object Size Information function 609 causes the storage plug-in 603 to calculate the size of an object (e.g., in bytes). Here, the application supplies the key of the object whose size is to be calculated and specifies the object size information function 609. Combined with a Group Manipulation function, the Object Size Information function 609 enables storage plug-in 603 to calculate the size of an entire group. The Key Set Operations function 607 is used to perform specific operations with the keys themselves (e.g., return to the application all key values in a group specified by the application).


As discussed above, each object may have a collection of attributes (e.g., shareable/non-shareable, time-to-live, etc.). In one embodiment, these attributes may be organized in local memory cache to be accessed by an application in a manner similar to the retrieving of an object with a “get” operation described above with respect to the Key Object Manipulation 604 function. In one embodiment, a series of hashing operations may be performed to retrieve one or attributes of a particular object.



FIG. 12 illustrates a more detailed perspective of an approach for accessing an attribute associated with a particular object. As an extension of the example provided above for object 670 (the get operation for the “Adam” object), FIG. 12 illustrates an attributes table 655 for object 670 organized in local memory cache 630. In one embodiment, attributes table 655 may be within a region of local memory cache in which a specific address value (e.g., address_1660, address_2662, . . . address N 664) is associated with a specific attribute value (e.g., value_1661, value_2, 663, . . . value_N 665).


A “get” operation for a particular attribute of an object may be carried out in the following manner. Application_1601 specifies: 1) the operation 658 (e.g., “get”); 2) the key for the object 668 (e.g., “ADAM”); and, 3) the applicable attribute 678 (e.g., “SHAREABLE/NON-SHAREABLE”). As discussed above with respect to FIG. 10, a collection of attributes (e.g., attributes table 655) may be associated with a particular object. In the approach of FIG. 12, the table 655 for a particular object 670 is made accessible with the object's key. For example, as illustrated in FIG. 12, the key 688 (“Adam”) for object 670 undergoes a first hashing function (i.e., hash_1650), that, in consideration of the operation pertaining to an attribute, causes the numerical address in local memory cache 630 where attributes table 655 is located to be identified.


A second hashing function (i.e., hash_2651) is performed using the desired attribute 678 (e.g., SHAREABLE/NON-SHAREABLE) as the key. The hash_2651 hashing function identifies the particular numerical address of the particular attribute of attributes table 655 that is to be accessed. For example, if the Shareable/Non-shareable attribute value corresponds to value_2663, the alphanumeric “name” of the attribute (e.g., “Shareable/Non-shareable”) would map to address_2662 of attributes table 655.


Eviction Policy Programming Models

Caches, either local or shared, have limited storage capacities. As such, a cache may require a procedure to remove lesser used objects in order, for example, to add new objects to the cache. Similar to a storage plug-in being designed to impose certain storage treatment(s) on an object, the eviction policy plug-in provides various functionalities for the active removal of an object from cache. As briefly discussed above with respect to FIG. 9, and as again provided in FIG. 13A, eviction policy plug-in 610 is logically represented as being cable of performing several functions, including object sorting 611, eviction timing 612, and object key attribution 613.


Referring to FIG. 13A, sorting component 611 is type of a queuing function that effectively sorts objects stored in a cache region so that a cached object that is most appropriate for eviction can be identified. Different sorting component types that each enforces a different sorting technique may be chosen from to instantiate a particular eviction policy with plug-in 606. That is, in one embodiment, there are different “flavors” of object sorting that may be selected from, and, one of these may be used to impose treatment on, for instance, an entire cache region. In other embodiments, multiple object sorting components (conceivably of different flavors) may be established per cache region e.g., one mutation notification per cache group).


To the extent the sorting component 611 can be viewed as a component that chooses “what” object should be removed from cache, the eviction timing component 612 is a component that determines “when” an object should be removed from cache. Different flavors for eviction timing components may also exist and be chosen from for implementation. Typically, a single eviction timing component is instantiated per cache region; but, conceivably, multiple eviction policy components may be instantiated as well (e.g., one per cache group). The object key attribution 613 component enables the involvement of certain object attributes (e.g., object size) in eviction processes.


For simplicity, the remainder of this detailed description will be written as if an eviction policy plug-in applies to an entire cache region.



FIG. 13B illustrates a more detailed perspective of various types of sorting components 611 that may be chosen for use within a particular eviction policy plug-in 603. In one embodiment, four types of queues may be implemented by sorting component 611: 1) a Least Recently Used (LRU) queue 617; 2) a Least Frequently Used (LFU) queue 618; 3) a size-based queue 619; and, 4) a First In First Out (FIFO) queue 621. In one embodiment, the different types of sorting techniques queue the keys associated with the cache region's cached objects. The identification of a key that is eligible for eviction results in the key's corresponding object being identified for eviction. In the case of shared closures, various approaches are possible. According to a first approach, sorting is performed at the object level, that is, keys are effectively sorted where each key represents an object; and, if a particular object identified for eviction happens to be a member of a shared closure that is cached in shared memory cache, the entire shared closure is evicted from shared memory cache (e.g., with a “delete” operation). According to a second approach, if an object is a member of a shared closure, a key for the shared closure as a whole is sorted amongst other keys. In either case, identifying a “key” that is eligible for eviction results in the identifying of an object for eviction (where, in the case of shared closure, an object and all its shared closure member objects are identified for eviction).


According to the design of the LRU queue 617, objects cached in a cache region that are accessed least recently (e.g., through either a “get” or “put” operation) are discarded first. LRU queue 617 is represented with a vertical ordering structure for multiple keys (e.g., key_1655, key_2656, . . . key_N 657). Essentially, the top of the queue represents keys for objects that have been used most recently, and, the bottom of the queue represents keys for objects that have been used least recently. According to one implementation of LRU queue 617, the object corresponding to the key at the very bottom would be next evicted. Removal of a key from the bottom of the queue triggers the eviction of that key's corresponding object from cache.


Here, any time an object is accessed (e.g., by way of a “get” or “put” operation), the key corresponding to that object is positioned at the very top of LRU queue 617. As illustrated by the position of key_1655, the object associated with key_1655 is the most recently accessed object. If, however, in a following operation an application (e.g., application_1601) accesses the object associated with key_2656, then, key_2656 would be repositioned above key_1655 in the LRU queue 617.


At any given instant of time, the key whose object has spent the longest amount of time in the cache region without being accessed will reside at the bottom of the queue. As such, when the moment arises to remove an object from the cache region, the object whose key resides at the bottom of the queue will be selected for removal from the cache region.


LFU queue 618 is an eviction policy in which cached objects accessed least frequently (e.g., through either a “get” or “put” operation), based on a counter, are discarded first. Each key for an object may have an associated counter that measures or keeps track of the number of times the object is accessed (e.g., the counter for the object's key is incremented each time the object is accessed). In one embodiment, the counter value may be an “attribute” for the object as described previously.


As with LRU queue 617, LFU queue 618 is represented with a vertical ordering structure for multiple keys (e.g., key_1665, key_2666, . . . key_N 667). The top of the queue represents keys for objects that have the highest counter value, and the bottom of the queue represents keys for objects with the lowest counter value. Here, over the course of time, those keys whose corresponding objects are accessed more frequently than other cached objects will be “buoyant” and reside near the top of the queue; while, those keys whose corresponding objects are accessed less frequently than the other objects in the cache region will “sink” toward the bottom of the queue.


At any instant of time, the key whose corresponding object has been used less than any other object in the cache region will be at the bottom of the queue. Thus, according to one implementation of LFU queue 618, the object corresponding to the key at the very bottom would be next evicted, because that object has the lowest counter value (i.e., lowest frequency of use). Removal of the key from the bottom of the queue triggers the eviction of that key's corresponding object from the cache region. Note that the counters for all the keys may be reset periodically or with each entry of a newly cached object in order to ensure that all the counter values can be used as a comparative measurement of use.


Size-based queue 619 is an eviction policy in which cached objects are prioritized according to size (e.g., the number of total bytes for the object). As such, object size may be another object attribute. The keys for objects in size-based queue 619 are shown arranged vertically with the smallest objects positioned near the top of the queue and keys for the largest objects positioned near the bottom of the queue. According to one implementation of size-based queue 619, the object corresponding to the key at the very bottom would be evicted first, because that object consumes the most amount of cache region space, and its subsequent removal would result in the most amount of free cache region space recovered (amongst all the objects that are cached in the cache region).


FIFO queue 621 is an eviction policy in which cached objects are removed according to the order that they are placed in the cache relative to one another. In one embodiment, when an eviction moment arises, the first cached object eligible for eviction corresponds to the object that has spend the most time in the cache, followed by the next oldest object, and so on. FIFO queue 621, illustrated in FIG. 13b, is also depicted with a vertical ordering structure for key_1655, key_2656, . . . key_N 657, with key_1655 corresponding to the oldest object (i.e., the first object placed in the cache) and key_N 677 corresponding to the newest object (i.e., the most recent object placed in the cache). When an eviction process is triggered, the object for key_1675 would be the first for removal. Unlike the other types of queues described above (assuming the size of an object can change in respect of size-based queue 619), there is no possibility for any rearrangement of the key order in FIFO queue 621. The keys are maintained in the order they are added to the cache, regardless of frequency, counter value, or size.


Referring back to FIG. 13A, the eviction timing component 612 is a functionality that determines when an object should be removed from a cache region. FIG. 14 illustrates a detailed graph of one type of eviction timing approach. The vertical axis represents the total number of objects in the cache as represented by the total number of keys in the queue associated with the applicable object sorting approach. The horizontal axis represents time (e.g., in milliseconds). Count allocation 648 represents the “targeted” maximum number of keys in the queue, which, in turn, corresponds to the targeted maximum number of allowable objects in the cache region.


In one embodiment, three threshold levels may be established for the cache region. A first threshold level, threshold_1645, corresponds to a level at which the eviction of a key from the sorting queue occurs on a timely basis. For example, when the count exceeds threshold_1645 (but not threshold_2646), a key is evicted from the sorting queue every millisecond until the total count falls below threshold_1645. In one embodiment, no active eviction occurs for count levels below threshold_1645.


A second threshold level, threshold_2646, corresponds to a level above which eviction of a key occurs on each entry into the cache of a newly cached object. That is, with each new addition of an object into cache, the key at the bottom of the applicable sorting queue is removed from the queue resulting in its corresponding object's eviction from cache. With this approach, the population of the cache region should remain constant in the absence of objects being removed from the cache region by processes other than eviction (such as deletion and/or garbage collection and/or attribute based as described below with respect to Object Key Attribution). With processes other than eviction, the cache region population may fall below threshold_2646 after the threshold has been crossed.


A third threshold level, threshold_3647, corresponds to a level equal to the targeted maximum allocation 648 for the cache region. When this level is exceeded, keys are evicted from the sorting queue until, in one embodiment, the total count of keys decreases to threshold_3647 (or just beneath threshold_3647). Note that this approach contemplates the population of the cache region exceeding its “targeted” maximum allocation for some reason.


Either of the eviction timing techniques may be used with the LRU 617 LFU 618 or FIFO 619 sorting technique. FIG. 15, by contrast, illustrates a detailed graph of another type of eviction timing technique that is to be used with the size based 619 sorting technique. In this embodiment, the vertical axis represents the total amount of consumed memory space of the cache. The horizontal axis represents time (e.g., in milliseconds). Size allocation 689 represents the maximum “targeted” allocated memory capacity of the cache region in terms of size (e.g., bytes).


In one embodiment, threshold_1685, threshold_2686, and threshold_3687 have similar properties with threshold_1645, threshold_2646, and threshold_3647, respectively. The only difference is that the memory consumption of the cache region (through the caching of its cached objects) triggers the crossing of the various thresholds.


Referring again back to FIG. 13A, Object Key Attribution 613 is a functionality that allows for the eviction of objects based on specific attributes that may be user-defined, system-defined, or otherwise customizable. For example, objects may be evicted on a Time-To-Live (TTL) basis in which case an object's key is pulled from the sorting queue (regardless of where it is located within the queue) if the object resides in the cache region for more than an amount of time set by the TTL attribute. Another attribute based eviction policy is Absolute Eviction Time (AET). In the case of AET, an actual time is set (e.g., 12:00 AM). If the object resides in the cache region after this time the object is evicted from the cache region.


Also, in the case of size based eviction policies, each objects size may be found in its attribute table.


Cache Management Library

The preceding discussions revealed that, referring to FIG. 16, a cache management library 1601 containing various plug-ins may be used to help build cache regions that impose various forms of object/shared closure treatment. Specifically, the Local, Local Soft, Shared and Shared Read Only storage plug-ins 1602_1, 1602_2, 1602_3, 1602_4 may be part of a collective storage plug in library 1602; and, the LRU, LFU, Size Based and FIFO sorting plug-in components 1603_1, 16032, 1603_3, 1603_4 may be part of a collective sorting plug-in component library 1601.


Here, definition of a specific cache region is effected by selecting 1604 a storage plug-in from the storage plug-in part 1602 of the cache management library 1601 and by selecting 1605 a sorting component plug-in from the sorting component plug-in part 1603 of the cache management library 1601. For each new cache region to be implemented, another iteration of the selection processes 1604, 1605 is performed. Thus, if a single application were to establish multiple cache regions, the configuration for the application would entail running through selection processes 1604, 1605 for each cache region to be implemented.


Distributed Cache Architecture

As discussed above with respect to FIG. 4, a storage policy plug-in of a cache region may dictate that an object stored in the local and/or shared cache memory be copied into deeper storage space 440 (e.g., a persisted database, in response to an object's removal from the cache). In one embodiment, the storage of a particular object into deeper storage allows for the “sharing” of that object on a much larger scale (e.g., between different computing systems or application servers). For example, an object commonly used by a cluster of application servers may be written to a persisted database for retrieval by any physical machine.


In another example, a first computing system having a first virtual machine may crash during the course of running operations with a number of objects. If the objects are stored in a persisted database, a second virtual machine from a second computing system may be able to restore the operations that were running on the first computing system, using the same objects retrieved from the persisted database.



FIG. 17 and block diagram 1000 of FIG. 18, taken together, illustrate one method of preserving an object's cached status between two computing systems. Application_1803, running on computing system_1801, specifies a “PUT” operation 830 for object 850a into local memory cache_1811 or shared memory cache_1805. In one embodiment, the “PUT” operation may involve the various functionalities of a storage plug-in described above for a cache region by cache manager_1804. Object 850a is generically represented but in one embodiment object 850a may be a group of objects, and in another embodiment may be objects contained within a shared closure. Object 850a is then persisted 831 in database 820 that is visible to other computing systems, including computing system_2802.


In one embodiment, a Structured Query language (SQL), or SQL-like command statement may be used to write a serialized version of object 850b into database 820. (In FIG. 17, the de-serialized object is referenced as 850a, and the serialized object is referenced as 850b). In alternate embodiments, other known database languages may be used to write object 850b into database 820. Upon successful writing of object 850b in database 820, a notification “statement of success” is sent 832 to cache manager_1804. Along with the success notification statement, the key for object 850b may also be sent to cache manager_1804, where, according to a further implementation, the key is in a de-serialized form. Object keys have been discussed in detail above with respect to FIGS. 10-12.


Upon receiving the success notification and the de-serialized key for object 850b, cache manager_1804 serializes 833 the key for object 850b and sends the serialized key 834 across a network 806 to computing system_2802. Cache manager_2808 receives the serialized key for object 850b and then de-serializes the key, 835. The de-serialized key may then be registered with a storage plug-in associated with cache manager_2808.


When application_2807 running on computing system_2802 requests 837 object 850b, the de-serialized object key that is registered with cache manager_2808 is used to retrieve 838 the serialized object 850b from database 820 at computing system_2802. The serialized object 850b may then be de-serialized 839 by cache manager_2808. The de-serialized object 850a may then be saved in local memory cache_2809 and/or shared memory cache_2810.


Closing Comments

Processes taught by the discussion above may be performed with program code such as machine-executable instructions which cause a machine (such as a “virtual machine”, a general-purpose processor disposed on a semiconductor chip or special-purpose processor disposed on a semiconductor chip) to perform certain functions. Alternatively, these functions may be performed by specific hardware components that contain hardwired logic for performing the functions, or by any combination of programmed computer components and custom hardware components.


An article of manufacture may be used to store program code. An article of manufacture that stores program code may be embodied as, but is not limited to, one or more memories (e.g., one or more flash memories, random access memories (static, dynamic or other)), optical disks, CD-ROMs, DVD ROMs, EPROMs, EEPROMs, magnetic or optical cards or other type of machine-readable media suitable for storing electronic instructions. Program code may also be downloaded from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a propagation medium (e.g., via a communication link (e.g., a network connection)).



FIG. 19 is a block diagram of a computing system 1900 that can execute program code stored by an article of manufacture. It is important to recognize that the computing system block diagram of FIG. 19 is just one of various computing system architectures. The applicable article of manufacture may include one or more fixed components (such as a hard disk drive 1902 or memory 1905) and/or various movable components such as a CD ROM 1903, a compact disc, a magnetic tape, etc operable with removable media drive 1904. In order to execute the program code, typically instructions of the program code are loaded into the Random Access Memory (RAM) 1905; and, the processing core 1905 then executes the instructions. The processing core 1906 may include one or more processors and a memory controller function. A virtual machine or “interpreter” (e.g., a Java Virtual Machine) may run on top of the processing core 1806 (architecturally speaking) in order to convert abstract code (e.g., Java byte code) into instructions that are understandable to the specific processor(s) of the processing core 1906.


It is believed that processes taught by the discussion above can be practiced within various software environments such as, for example, object-oriented and non-object-oriented programming environments, Java based environments (such as a Java 2 Enterprise Edition (J2EE) environment or environments defined by other releases of the Java standard), or other environments (e.g., a .NET environment, a Windows/NT environment each provided by Microsoft Corporation).


In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.

Claims
  • 1. A method comprising: creating a first cache region associated with an application, the first cache region being one of a plurality of cache regions associated with the application, each of the plurality of cache regions being associated with a different caching treatment policy, the application running on a plurality of virtual machines, a first one of the plurality of virtual machines having a local memory cache that is private to the first one of the plurality of virtual machines and a shared memory cache that is shared by the first one of the plurality of virtual machines and a second one of the plurality of virtual machines;associating the first cache region with a first storage policy, the first storage policy specifying that a first copy of an object to be stored in the first cache region is to be stored in the local memory cache and that a second copy of the object to be stored in the first cache region is to be stored in the shared memory cache;receiving a request from the application to retrieve the object, the object having been stored in the first cache region;detecting that a second version of the object is newer than a first version of the object, the second version of the object being stored in the shared memory cache and the first version of the object being stored in the local memory cache; andproviding the second version of the object to the application,wherein the providing of the second version of the object comprises copying the second version of the object from the shared memory cache to the local memory cache.
  • 2. The method of claim 1, wherein the providing of the second version of the object further comprises enabling the application to establish a hard reference to the second version of the object in the local memory cache, an identifying of the hard reference by a garbage collector preventing the garbage collector from removing the second version of the object from the local memory cache.
  • 3. The method of claim 1, wherein the first storage policy further specifies that an eviction of the object from the local memory cache is to result in a storage of the object in a storage resource having a higher latency than the local memory cache.
  • 4. The method of claim 3, wherein the storage resource is a shared memory associated with the shared memory cache.
  • 5. The method of claim 1, further comprising: creating a second cache region associated with the application; andassociating the second cache region with a second storage policy, the second storage policy specifying that a first version of an additional object to be stored in the second cache region is to be stored in the shared memory cache and that the first version of the additional object is not to be modified.
  • 6. The method of claim 5, further comprising: receiving a request from the application to incorporate a modification into the additional object, the additional object having been stored in the second cache region; andin response to the receiving of the request from the application to incorporate the modification, incorporating the modification into a second version of the additional object and storing the second version of the additional object in the shared memory cache.
RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 11/024,554, filed Dec. 28, 2004 now abandoned, entitled “CACHE REGION CONCEPT,” which is incorporated herein by reference.

US Referenced Citations (334)
Number Name Date Kind
5034885 Matoba et al. Jul 1991 A
5274804 Jackson et al. Dec 1993 A
5276835 Mohan et al. Jan 1994 A
5311318 Dobrovolny May 1994 A
5331318 Montgomery Jul 1994 A
5553242 Russell et al. Sep 1996 A
5566302 Khalidi et al. Oct 1996 A
5566315 Milillo et al. Oct 1996 A
5590328 Seno et al. Dec 1996 A
5594886 Smith et al. Jan 1997 A
5617570 Russell et al. Apr 1997 A
5636355 Ramakrishnan et al. Jun 1997 A
5682328 Roeber et al. Oct 1997 A
5692193 Jagannathan et al. Nov 1997 A
5710909 Brown et al. Jan 1998 A
5745778 Alfieri Apr 1998 A
5778430 Ish et al. Jul 1998 A
5781924 Zaitzeva et al. Jul 1998 A
5805790 Nota et al. Sep 1998 A
5809527 Cooper et al. Sep 1998 A
5822759 Treynor Oct 1998 A
5844781 Schlotterer Dec 1998 A
5884316 Bernstein et al. Mar 1999 A
5905868 Baghai et al. May 1999 A
5926834 Carlson et al. Jul 1999 A
5933601 Fanshier et al. Aug 1999 A
5933848 Moenne-Loccoz Aug 1999 A
5944781 Murray Aug 1999 A
5951643 Shelton et al. Sep 1999 A
5961584 Wolf Oct 1999 A
5966127 Yajima Oct 1999 A
5974566 Ault et al. Oct 1999 A
6038571 Numajiri et al. Mar 2000 A
6047295 Endicott et al. Apr 2000 A
6065006 Decarmo May 2000 A
6075938 Bugnion et al. Jun 2000 A
6092171 Relph Jul 2000 A
6115712 Islam et al. Sep 2000 A
6115721 Nagy Sep 2000 A
6141728 Simionescu et al. Oct 2000 A
6154811 Srbljic et al. Nov 2000 A
6167423 Chopra et al. Dec 2000 A
6167449 Arnold et al. Dec 2000 A
6199179 Kauffman et al. Mar 2001 B1
6216212 Challenger et al. Apr 2001 B1
6256712 Challenger et al. Jul 2001 B1
6272598 Arlitt et al. Aug 2001 B1
6292099 Tse et al. Sep 2001 B1
6295582 Spencer Sep 2001 B1
6336170 Dean et al. Jan 2002 B1
6356529 Zarom Mar 2002 B1
6356946 Clegg et al. Mar 2002 B1
6385643 Jacobs et al. May 2002 B1
6385653 Sitaraman et al. May 2002 B1
6389460 Stewart et al. May 2002 B1
6389509 Berenguel et al. May 2002 B1
6412045 DeKoning et al. Jun 2002 B1
6415364 Bauman et al. Jul 2002 B1
6425057 Cherkasova et al. Jul 2002 B1
6438654 Elko et al. Aug 2002 B1
6446088 Vaduvur et al. Sep 2002 B1
6467052 Kaler et al. Oct 2002 B1
6502148 Krum Dec 2002 B1
6519594 Li Feb 2003 B1
6523027 Underwood Feb 2003 B1
6539445 Krum Mar 2003 B1
6587937 Jensen et al. Jul 2003 B1
6591347 Tischler et al. Jul 2003 B2
6601112 O'Rourke et al. Jul 2003 B1
6601143 Lamparter Jul 2003 B1
6615253 Bowman-Amuah Sep 2003 B1
6640244 Bowman-Amuah Oct 2003 B1
6651080 Liang et al. Nov 2003 B1
6675214 Stewart et al. Jan 2004 B2
6687702 Vaitheeswaran et al. Feb 2004 B2
6728748 Mangipudi et al. Apr 2004 B1
6732237 Jacobs et al. May 2004 B1
6738977 Berry et al. May 2004 B1
6748487 Takamoto et al. Jun 2004 B1
6754662 Li Jun 2004 B1
6757708 Craig et al. Jun 2004 B1
6760815 Traversat et al. Jul 2004 B1
6760911 Ye Jul 2004 B1
6766419 Zahir et al. Jul 2004 B1
6772409 Chawla et al. Aug 2004 B1
6779088 Benveniste et al. Aug 2004 B1
6795856 Bunch Sep 2004 B1
6799202 Hankinson et al. Sep 2004 B1
6829679 DeSota et al. Dec 2004 B2
6938085 Belkin et al. Aug 2005 B1
6944711 Mogi et al. Sep 2005 B2
6970925 Springmeyer et al. Nov 2005 B1
6976090 Ben-Shaul et al. Dec 2005 B2
6990534 Mikhailov et al. Jan 2006 B2
6996679 Cargnoni et al. Feb 2006 B2
7013329 Paul et al. Mar 2006 B1
7024512 Franaszek et al. Apr 2006 B1
7035870 McGuire et al. Apr 2006 B2
7051161 Dixit et al. May 2006 B2
7069271 Fadel et al. Jun 2006 B1
7089566 Johnson Aug 2006 B1
7096319 Mogi et al. Aug 2006 B2
7096418 Singhal et al. Aug 2006 B1
7111300 Salas et al. Sep 2006 B1
7124170 Sibert Oct 2006 B1
7127472 Enokida et al. Oct 2006 B1
7127713 Davis et al. Oct 2006 B2
7139792 Mishra et al. Nov 2006 B1
7149741 Burkey et al. Dec 2006 B2
7155512 Lean et al. Dec 2006 B2
7165239 Hejlsberg et al. Jan 2007 B2
7167917 Creamer et al. Jan 2007 B2
7174363 Goldstein et al. Feb 2007 B1
7177823 Lam et al. Feb 2007 B2
7184922 Ousley et al. Feb 2007 B2
7185096 Kalyanavarathan et al. Feb 2007 B2
7191170 Ganguly et al. Mar 2007 B2
7194761 Champagne Mar 2007 B1
7231435 Ohta Jun 2007 B2
7254634 Davis et al. Aug 2007 B1
7277935 Sato Oct 2007 B2
7296267 Cota-Robles et al. Nov 2007 B2
7302423 De Bellis Nov 2007 B2
7302609 Matena et al. Nov 2007 B2
7305495 Carter Dec 2007 B2
7308501 DeLima et al. Dec 2007 B2
7373661 Smith et al. May 2008 B2
7383289 Kraft Jun 2008 B2
7409709 Smith et al. Aug 2008 B2
7418560 Wintergerst Aug 2008 B2
7421495 Yang et al. Sep 2008 B2
7444644 Slaughter et al. Oct 2008 B1
7467162 Rosenbloom et al. Dec 2008 B2
7512737 Petev Mar 2009 B2
7532571 Price et al. May 2009 B1
7539821 Petev et al. May 2009 B2
7552284 Petey et al. Jun 2009 B2
7694065 Petev et al. Apr 2010 B2
7840760 Petey et al. Nov 2010 B2
20010029520 Miyazaki Oct 2001 A1
20010054004 Powers Dec 2001 A1
20020046325 Cai et al. Apr 2002 A1
20020052914 Zalewski et al. May 2002 A1
20020073283 Lewis et al. Jun 2002 A1
20020078060 Garst et al. Jun 2002 A1
20020083118 Sim Jun 2002 A1
20020083166 Dugan et al. Jun 2002 A1
20020087700 Chae Jul 2002 A1
20020093487 Rosenberg Jul 2002 A1
20020099691 Lore et al. Jul 2002 A1
20020099753 Hardin et al. Jul 2002 A1
20020133805 Pugh et al. Sep 2002 A1
20020143958 Montero et al. Oct 2002 A1
20020147888 Trevathan Oct 2002 A1
20020156863 Peng Oct 2002 A1
20020165909 Martin et al. Nov 2002 A1
20020169926 Pinckney et al. Nov 2002 A1
20020174097 Rusch et al. Nov 2002 A1
20020181307 Fifield et al. Dec 2002 A1
20020198923 Hayes, Jr. Dec 2002 A1
20020198953 O'Rourke et al. Dec 2002 A1
20030009533 Shuster Jan 2003 A1
20030014521 Elson et al. Jan 2003 A1
20030014525 DeLima et al. Jan 2003 A1
20030014552 Vaitheeswaran et al. Jan 2003 A1
20030018707 Flocken Jan 2003 A1
20030023827 Palanca et al. Jan 2003 A1
20030028671 Mehta et al. Feb 2003 A1
20030037148 Pedersen Feb 2003 A1
20030037178 Vessey et al. Feb 2003 A1
20030065711 Acharya et al. Apr 2003 A1
20030070047 Dwyer et al. Apr 2003 A1
20030074525 Yamauchi et al. Apr 2003 A1
20030074580 Knouse et al. Apr 2003 A1
20030084248 Gaither et al. May 2003 A1
20030084251 Gaither et al. May 2003 A1
20030088604 Kuck et al. May 2003 A1
20030093420 Ramme May 2003 A1
20030093487 Czajkowski et al. May 2003 A1
20030097360 McGuire et al. May 2003 A1
20030105887 Cox et al. Jun 2003 A1
20030115190 Soderstrom et al. Jun 2003 A1
20030131010 Redpath Jul 2003 A1
20030131286 Kaler et al. Jul 2003 A1
20030135509 Davis et al. Jul 2003 A1
20030154239 Davis et al. Aug 2003 A1
20030167333 Kumar et al. Sep 2003 A1
20030177382 Ofek et al. Sep 2003 A1
20030191795 Bernardin et al. Oct 2003 A1
20030196136 Haynes et al. Oct 2003 A1
20030200526 Arcand Oct 2003 A1
20030208563 Acree et al. Nov 2003 A1
20030212654 Harper et al. Nov 2003 A1
20030212865 Hicken et al. Nov 2003 A1
20030229529 Mui et al. Dec 2003 A1
20030229760 Doyle et al. Dec 2003 A1
20030236857 Takase et al. Dec 2003 A1
20040024610 Fradkov et al. Feb 2004 A1
20040024971 Bogin et al. Feb 2004 A1
20040045014 Radhakrishnan Mar 2004 A1
20040054725 Moller et al. Mar 2004 A1
20040054860 Dixit et al. Mar 2004 A1
20040068554 Bales et al. Apr 2004 A1
20040088412 John et al. May 2004 A1
20040117441 Liu et al. Jun 2004 A1
20040128370 Kortright Jul 2004 A1
20040133759 Sekiguchi Jul 2004 A1
20040153509 Alcorn et al. Aug 2004 A1
20040167980 Doyle et al. Aug 2004 A1
20040168029 Civlin Aug 2004 A1
20040168031 Haskins Aug 2004 A1
20040181537 Chawla et al. Sep 2004 A1
20040181782 Findeisen Sep 2004 A1
20040187140 Aigner et al. Sep 2004 A1
20040205162 Parikh et al. Oct 2004 A1
20040205299 Bearden Oct 2004 A1
20040215883 Bamford et al. Oct 2004 A1
20040221261 Blevins Nov 2004 A1
20040221285 Donovan et al. Nov 2004 A1
20040221294 Kalmuk et al. Nov 2004 A1
20040243709 Kalyanavarathan et al. Dec 2004 A1
20050021917 Mathur et al. Jan 2005 A1
20050027943 Steere et al. Feb 2005 A1
20050044301 Vasilevsky et al. Feb 2005 A1
20050055686 Buban et al. Mar 2005 A1
20050060704 Bulson et al. Mar 2005 A1
20050071459 Costa-Requena et al. Mar 2005 A1
20050086237 Monnie et al. Apr 2005 A1
20050086656 Whitlock et al. Apr 2005 A1
20050086661 Monnie et al. Apr 2005 A1
20050086662 Monnie et al. Apr 2005 A1
20050091388 Kamboh et al. Apr 2005 A1
20050102670 Bretl et al. May 2005 A1
20050125503 Iyengar et al. Jun 2005 A1
20050125607 Chefalas et al. Jun 2005 A1
20050131962 Deshpande Jun 2005 A1
20050138193 Encarnacion et al. Jun 2005 A1
20050154837 Keohane et al. Jul 2005 A1
20050160396 Chadzynski Jul 2005 A1
20050180429 Ghahremani et al. Aug 2005 A1
20050182844 Johnson et al. Aug 2005 A1
20050188068 Kilian Aug 2005 A1
20050198199 Dowling Sep 2005 A1
20050216502 Kaura et al. Sep 2005 A1
20050256880 Nam Koong et al. Nov 2005 A1
20050262181 Schmidt et al. Nov 2005 A1
20050262493 Schmidt et al. Nov 2005 A1
20050262512 Schmidt et al. Nov 2005 A1
20050268294 Petev et al. Dec 2005 A1
20050278270 Carr et al. Dec 2005 A1
20050278278 Petev Dec 2005 A1
20050278341 Kostadinov et al. Dec 2005 A1
20050278346 Shang et al. Dec 2005 A1
20050283585 Sexton et al. Dec 2005 A1
20060026286 Lei et al. Feb 2006 A1
20060029054 Breh et al. Feb 2006 A1
20060036448 Haynie et al. Feb 2006 A1
20060047974 Alpern et al. Mar 2006 A1
20060053112 Chitkara et al. Mar 2006 A1
20060059453 Kuck et al. Mar 2006 A1
20060064545 Wintergerst Mar 2006 A1
20060064549 Wintergerst Mar 2006 A1
20060069712 Anders et al. Mar 2006 A1
20060070051 Kuck et al. Mar 2006 A1
20060092165 Abdalla et al. May 2006 A1
20060094351 Nowak et al. May 2006 A1
20060129512 Braun Jun 2006 A1
20060129546 Braun Jun 2006 A1
20060129981 Dostert et al. Jun 2006 A1
20060130063 Kilian et al. Jun 2006 A1
20060136530 Rossman Jun 2006 A1
20060136667 Shultz et al. Jun 2006 A1
20060143217 Stanev et al. Jun 2006 A1
20060143256 Galchev et al. Jun 2006 A1
20060143328 Fleischer et al. Jun 2006 A1
20060143360 Petev et al. Jun 2006 A1
20060143389 Kilian et al. Jun 2006 A1
20060143392 Petev et al. Jun 2006 A1
20060143393 Petev Jun 2006 A1
20060143394 Petev et al. Jun 2006 A1
20060143427 Marwinski et al. Jun 2006 A1
20060143608 Dostert et al. Jun 2006 A1
20060143609 Stanev Jun 2006 A1
20060143618 Fleischer et al. Jun 2006 A1
20060143619 Galchev et al. Jun 2006 A1
20060150197 Werner Jul 2006 A1
20060155756 Stanev Jul 2006 A1
20060155867 Kilian et al. Jul 2006 A1
20060159197 Kraut et al. Jul 2006 A1
20060167980 Werner Jul 2006 A1
20060168646 Werner Jul 2006 A1
20060168846 Juan Aug 2006 A1
20060206856 Breeden et al. Sep 2006 A1
20060212852 Hwang Sep 2006 A1
20060235810 Wen et al. Oct 2006 A1
20060236306 DeBruin et al. Oct 2006 A1
20060248036 Stanev et al. Nov 2006 A1
20060248119 Stanev et al. Nov 2006 A1
20060248131 Marwinski et al. Nov 2006 A1
20060248140 Birenheide Nov 2006 A1
20060248177 Dostert et al. Nov 2006 A1
20060248198 Galchev Nov 2006 A1
20060248199 Stanev Nov 2006 A1
20060248200 Stanev Nov 2006 A1
20060248234 Pope et al. Nov 2006 A1
20060248283 Galchev et al. Nov 2006 A1
20060248284 Petev Nov 2006 A1
20060248350 Stanev Nov 2006 A1
20060253558 Acree et al. Nov 2006 A1
20060271586 Federighi et al. Nov 2006 A1
20060282509 Kilian et al. Dec 2006 A1
20060294253 Linderman Dec 2006 A1
20070027877 Droshev et al. Feb 2007 A1
20070050768 Brown et al. Mar 2007 A1
20070055781 Fleischer et al. Mar 2007 A1
20070067469 Luik et al. Mar 2007 A1
20070118538 Ahern et al. May 2007 A1
20070150586 Killian et al. Jun 2007 A1
20070156869 Galchev et al. Jul 2007 A1
20070156907 Galchev et al. Jul 2007 A1
20070226683 Stoodley et al. Sep 2007 A1
20070245167 De La Cruz et al. Oct 2007 A1
20070250779 Wallach et al. Oct 2007 A1
20070255722 Leffert et al. Nov 2007 A1
20070261043 Ho et al. Nov 2007 A1
20070266305 Cong et al. Nov 2007 A1
20080086564 Putman et al. Apr 2008 A1
20080162547 Bonev Jul 2008 A1
20080162552 Bonev Jul 2008 A1
20080163063 Bonev Jul 2008 A1
20080163124 Bonev Jul 2008 A1
20080201417 McCain et al. Aug 2008 A1
20080222270 Gilbert Sep 2008 A1
20090282196 Petev et al. Nov 2009 A1
Foreign Referenced Citations (8)
Number Date Country
0459931 Dec 1991 EP
1027796 Aug 2000 EP
1380941 Jan 2004 EP
1387262 Feb 2004 EP
WO-0023898 Apr 2000 WO
WO-0142908 Jun 2001 WO
WO-03073204 Sep 2003 WO
WO-2004038586 May 2004 WO
Related Publications (1)
Number Date Country
20100268881 A1 Oct 2010 US
Continuations (1)
Number Date Country
Parent 11024554 Dec 2004 US
Child 12831918 US