Multiple Java® virtual machines (JVMs) may concurrently execute on the same computing device (Java is a registered trademark owned by Sun Microsystems, Inc., Mountain View, Calif.). Each such JVM may execute within a separate process (referred to herein as a “JVM process”) that is isolated from the other JVM processes that concurrently execute on the same computing device. Each of these JVM processes may consume a substantial amount of the computing device's memory. The quantity of memory that each JVM process consumes may limit the number of JVM processes that can be executed concurrently on the computing device.
A JVM process needs to load certain programming classes when the JVM process initiates. Loading these classes may require a substantial amount of time. When multiple JVM processes are executed concurrently, the amount of time required for each JVM process to load associated classes may become prohibitive.
Typically, when a JVM process is initiated, the JVM process needs to load a class that is associated with a reference. The reference may be a symbolic name such as, for example, “java.lang.String.” To locate the reference, the JVM process engages in a look-up process which involves a class loader. The class loader defines the class and is configured to load, from a file system, bytes to which the class corresponds. The JVM process processes the loaded bytes and loads the bytes into the JVM processes' internal data structures (e.g., look-up tables) for fast access.
The internal data structures associated with each JVM process are typically stored in memory that has been allocated within the heap of the JVM process. The JVM process may allocate this memory using, for example, a “malloc” call which calls a memory allocation function. The heap memory region of the JVM process is writable private memory that is not shared (referred to hereinafter as “private heap”).
In general, in one aspect, the invention relates to a computing system, comprising a first virtual machine, executing on a processor, and associated with a memory region readable by the first virtual machine, and a first private memory region, a shareable memory region defined by first data that is readable and writable by the first virtual machine and a second virtual machine, wherein a mapping is established between the first virtual machine and a particular area of the shareable memory region, and the second virtual machine, executing on the processor, and associated with a second private memory region, and a reference to the particular area of the shareable memory region, wherein the mapping enables both the first virtual machine and second virtual machine to read and write second data in the shareable memory region without creating a copy of any portion of the second data in the first and second private memory regions.
In general, in one aspect, the invention relates to a computer-readable storage medium configure to store instructions, that when executed by one or more processors, performs a method for sharing a programmatic module among isolated virtual machines, comprising creating first data in a defined shareable memory region that is readable and writable by the first virtual machine, establishing a mapping between the first virtual machine and a second virtual machine over a particular area of the shareable memory region, and passing, to the second virtual machine, a reference to a particular area of the shareable memory region, wherein the mapping enables the second virtual machine to read and write second data in the shareable memory region without creating a copy of any portion of the second data in a private memory region of one selected from a group consisting of the first virtual machine and the second virtual machine.
Other aspects of the invention will be apparent from the following description and the appended claims.
Specific embodiments of the invention will now be described in detail with reference to the accompanying figures. Like elements in the various figures are denoted by like reference numerals for consistency.
In the following detailed description of embodiments of the invention, numerous specific details are set forth in order to provide a more thorough understanding of the invention. However, it will be apparent to one of ordinary skill in the art that the invention may be practiced without these specific details. In other instances, well-known features have not been described in detail to avoid unnecessarily complicating the description.
In general, embodiments of the invention provide a method and system for sharing objects (e.g., data) among isolated virtual machines. More specifically, each virtual machine may load objects using a class loader that has its own memory heap in shared memory, which may facilitate cross-process data sharing between JVMs.
In one or more embodiments of the invention, a computing device's memory is separated into distinct regions. One such region is shareable region (102). As explained further below, the shareable region (102) is fully readable and writable by one or more JVM processes. While not illustrated in
Shareable region (102) is created by an Initiating JVM (104) as part of this process' (104) initialization. Once the Initiating JVM process (104) is initialized, other JVM processes (106) that initialize subsequent to the Initiating JVM process (104) are permitted access to the shareable region (102). Additionally, each JVM process (e.g., 104 and 106) has a separate, private ‘read/write’ region (not shown in
In one embodiment of the invention, the shareable region (102) further includes one or more shared heaps (112, 114).
During initialization, while the Initiating JVM process (104) is loading classes (typically, before any of those classes has been requested by any application), Initiating JVM process (104) digests data from a Java® JAR file (116). In one or more embodiments of the invention, the Java® JAR file (116) is a programmatic module that is shared among isolated virtual machines. Java® JAR file (116) is loaded with a “SharedClassLoader” (118) which is a variant of the standard Java® virtual machine specification “ClassLoader”.
While SharedClassLoader (118) behaves in a similar manner to the typical ClassLoader in that the SharedClassLoader has a source of classes which may be loaded on request, in contrast to the standard ClassLoader, SharedClassLoader (118) stores all its data (e.g., classes, objects, and monitors) in a separate heap located in shared memory. That is, instances of classes loaded by the SharedClassLoader are created in the shared memory heap rather than the regular Java® heap. During its initialization, the Initiating JVM process (104) places the digested data into shareable region (102) and acquires a reference (or pointer) to a first shared heap (112). Any JVM process (104) is then free to access and modify objects (132) instantiated from classes (130) residing on the first shared heap (112).
In one or more embodiments, the SharedClassLoader is serializable and may be transferred to an application running on a JVM. The serialized version of the SharedClassLoader may not include the actual data of the SharedClassLoader, but may include only sufficient information to reconstruct the SharedClassLoader in another process such that the new SharedClassLoader instance, attaches itself to the shared memory and uses the shared memory for all operations. The serialized form of the SharedClassLoader may be made up of a single reference and an identifier. Because the SharedClassLoader resides in shared memory, only a native reference to the shared memory may be required to be passed to the new SharedClassLoader. In one embodiment of the invention, another Java® process which receives a SharedClassLoader is immediately able to access classes loaded into the shared memory heap, along with any compiled code, class fields, or object instances stored in the shared memory.
More specifically, in one or more embodiments of the invention, during initialization of the first shared heap (112), the Initiating JVM process (104) acquires sharing information including data regarding shared region (102) in which the first shared heap (112) resides and an actual reference to the first shared heap (112) in the form of a pointer. The Initiating JVM process (104) may then pass this sharing information to one or more other JVM processes (106). Alternatively, a user may pass the sharing information upon invocation of the one or more JVM processes (106). In essence, this provides a serialized version of the SharedClassLoader. Thus, the one or more other JVM processes (106) need not re-create a huge serialized instance but may instead access the first shared heap (112) using the sharing information in a remote method invocation (RMI) or inter-Xlet communication (IXC). Those skilled in the art will appreciate that the serialization of the SharedClassLoader described above allows the ability to pass complex and large objects between isolated JVMs by passing the special serialized form of the SharedClassLoader, which is extremely efficient.
As classes are defined within the SharedClassLoader, any objects later instantiated may be done so on the shared heap. A class may be loaded into more than one ClassLoader. For example, in one or more embodiments, a large part of Java® core library classes are loaded into both the system ClassLoader and into a SharedClassLoader. In the case of a class that is defined in multiple boot ClassLoaders (for example, both the system ClassLoader and a SharedClassLoader that is in the boot classpath), for purposes of the instanceof operator, these are considered the same class if they have the same name. Classes are not considered identical by the VM if they are loaded in non-boot ClassLoaders. When the new operator is used to instantiate a new object in a Java heap, the location of the object class determines which heap used for the new object. An instance of a class that is defined only in the SharedClassLoader will be itself created in the shared heap. For a class that is defined in both the SharedClassLoader and a private ClassLoader, the VM determines whether or not the newly created object is to be immediately assigned to a field of an object in the shared heap. In the case that the new object is to be so assigned, the object is created in the shared heap. Otherwise the object is created in the private heap. A simple algorithm that the VM could use to determine whether an assignation takes place to a field of an object in the shared heap would be to look ahead in the current method's bytecode to determine whether or not the next operation immediately following the object creation is an assignation to such a field. Those skilled in the art will appreciate that other algorithms to determine assignation to the shared heap are possible. If required, the requested class needed for an object in the shared heap is loaded into the relevant SharedClassLoader.
In a scenario in which only one SharedClassLoader is used, a newShared( ) static method may be created that performs exactly as the operator new which allocates the object from the shared heap. The newShared( ) static method may also search for the loaded class in the SharedClassLoader and if the class is not available, then the class is loaded into the SharedClassLoader. The newShared( ) method is useful when initially creating a shared object and assigning its reference into a location in the shared memory heap (or any other location not on the shared heap). As noted above, once an object is allocated to a shared heap, any objects that it creates and assigns to its instance fields are automatically allocated from the shared heap as well.
To the user, creation of a shared heap closely resembles currently known methods to load classes at run time. For example, suppose Java® JAR file “animals” has been configured to load the class “Cat” in a sharable region. Additionally, suppose it is desired that Initiating JVM process “master” should create the shareable region defined from Java® JAR file “animals.” The file “animals.jar” includes a SharedClassLoader constructed using the methods discussed herein. Finally, suppose it is desired that second JVM process “other” have complete access to class “Cat.”
For this example, the user might proceed as follows:
In this example, the command creating second JVM process “other” would include a flag or other designator (here called “SCL”) that would alert the second JVM process to the existence of the shareable region created by the process “master” and would further permit the second JVM process to access and modify all instances of the class “Cat” without re-loading the JAR file “animals.”
In one or more embodiments of the invention, the Initiating JVM process (104) and other JVM processes (106) reside on different computing hardware and have some form of (virtually) shared memory between the computing hardware, with the hardware communicatively coupled together via at least one network connection. Accordingly, the invention allows for the group of virtual machines that access shared objects to be running on different devices, with programmatic ability to access a common shared memory.
Now that implementation of the SharedClassLoader has been described, additional mechanisms, which in one or more embodiments of the invention are involved in the SharedClassLoader's usage, are described below.
By copying the SharedClassLoader into the Initiating JVM process shared memory, a mapping is established between the Initiating JVM process and the shareable memory region (Step 155). Subsequently, a reference pointing to the shareable region of memory is passed to a JVM process (e.g., a second JVM process distinct from the Initiating JVM process) (Step 165). More specifically, the reference passed may point directly to a shared heap within the shareable memory region created by the SharedClassLoader. In one or more embodiments of the invention, a reference may be passed either through another shared object or through serialization (e.g., Inter-Xlet communication (IXC)). Any other JVM process that receives a reference to the SharedClassLoader may then use the SharedClassLoader and create shared objects in the shared heap (ST 170). In one or more embodiments of the invention, the JAVA® class object data defines the shareable region of memory that is both readable and writable by multiple isolated and distinct JVM processes.
Finally, the mapping established as part of Step 160 enables both the Initiating JVM process and the second JVM process to read and write data in the shared memory heap. Data read/written may include classes and/or objects associated with classes. Thus, multiple isolated and distinct JVM processes are able to read and write data in the shared memory heap without creating a copy of any portion of the data read/written in any private memory region of the multiple isolated JVM processes (Step 165). For example, objects that reference an object in the shared memory heap may be created in a private heap of any of the multiple isolated JVM processes. Regular objects that reference an object in a private heap of a JVM process may not be created in the shared memory heap. The only exception to this rule is the special ProcessLocalReferences object.
Addition of the SharedClassLoader should in no way constrict the ability of a JVM to operate its own private heap. As such, objects in a private heap may include references to objects in the shared memory heap. In addition, there is no limitation on references between two objects that are both on the shared memory heap. In one embodiment of the invention, objects in the shared memory heap may not reference objects in a private heap. Any attempt to do so triggers a runtime exception, unless the assignment is the direct result of a new operator. In that case, the JVM is responsible for allocating the new object from a shared heap, thereby preventing a situation that will cause an exception.
In the situation where it is necessary for an object in the shared memory heap to reference an object in a private heap, the reference is made explicitly using an indirect process-private object reference that is valid only for the assigning process. Such a mechanism, designated “ProcessLocalReferences,” may be similar to the existing ThreadLocal API that assigns values on a per-thread basis.
In this manner, the ProcessLocalReferences may maintain a data structure in shared memory that provides a mapping of JVM identifiers (such as process handles) to reference values (such as addresses in the particular JVM's private heap). When a value is assigned to a ProcessLocalReferences, the value is entered into the mapping as corresponding to the assigning JVM. Programming code in another JVM has no access to this value and will see “null” as the reference value, unless the other JVM has assigned its own value into the mapping.
In one or more embodiments of the invention, garbage collection on a shared heap differs from that on a private heap.
Once the scan is completed, the list of actual references from the ProcessLocalReferences to the JVM performing the garbage collection is constructed and added to the list of roots when garbage collection is performed on the private heap (Step 185). Garbage collection is then performed on private heaps using the list of roots (ST 190). All activity on the JVMs is subsequently resumed (ST 195). Thus, in contrast to placing multiple applications within a single JVM, an application running inside a JVM performs garbage collection without interfering with other applications running inside other JVMs and related through their use of the shared memory.
The standard conditions triggering a garbage collection on (a) the SharedClassLoader, (b) all data in the shared heap, and (c) the block of shared memory are when (1) there is no reference from the private heap or stack to data on the shared heap, and (2) there is no reference from the private heap or stack to the SharedClassLoader. These conditions mirror those triggering a garbage collection of a ClassLoader. Other implementations are possible, such as those that those that involve delaying garbage collection of objects on the shared heap until the conditions listed above are met. In one embodiment of the invention, garbage collection on the shared memory waits until one or all applications running in the set of JVMs finally terminate.
Embodiments of the invention may be implemented on virtually any type of computer device regardless of the platform being used. For example, as shown in
Further, those skilled in the art will appreciate that one or more elements of the aforementioned computer system (400) may be located at a remote location and connected to the other elements over a network (not shown). Further, embodiments of the invention may be implemented on a distributed system having a plurality of nodes, where each portion of the implementation may be located on a different node within the distributed system. In one or more embodiments, the node corresponds to a computer system. Alternatively, the node may correspond to a processor with associated physical memory. The node may alternatively correspond to a processor with shared memory and/or resources. Further, software instructions to perform embodiments of hierarchical item level entitlement may be stored on a tangible computer readable medium such as a compact disc (CD), a diskette, a tape, or any other tangible computer readable storage device.
Embodiments of the invention provide a system and method for multiple applications, each running in their own virtual machine, to share the same middleware. Dynamic system-wide data (such as a system-wide resource table) may be shared between applications in a more efficient manner. The invention allows seamless transition of applications using shared middleware from a single Java VM environment to a multiple VM environment, and also allows for memory sharing in Java without native code that must be accessed through JNI.
While the invention has been described with respect to a limited number of embodiments, those skilled in the art, having benefit of this disclosure, will appreciate that other embodiments can be devised which do not depart from the scope of the invention as disclosed herein. Accordingly, the scope of the invention should be limited only by the attached claims.
This application is related to U.S. application Ser. No. 11/824,850 entitled “Shared JAVA JAR files,” filed Jul. 2, 2007, having common assignment, and fully incorporated by reference herein.