Virtual machine hardware for RISC and CISC processors

Information

  • Patent Grant
  • 8769508
  • Patent Number
    8,769,508
  • Date Filed
    Wednesday, June 29, 2005
    20 years ago
  • Date Issued
    Tuesday, July 1, 2014
    11 years ago
Abstract
A hardware Java™ accelerator is comprised of a decode stage and a microcode stage. Separating into the decode and microcode stage allows the decode stage to implement instruction level parallelism while the microcode stage allows the conversion of a single Java™ bytecode into multiple native instructions. A reissue buffer is provided which stores the converted instructions and reissues them when the system returns from an interrupt. In this manner, the hardware accelerator need not be flushed upon an interrupt. A native PC monitor is also used. While the native PC is within a specific range, the hardware accelerator is enabled to convert the Java™ bytecodes into native instructions. When the native PC is outside the range, the hardware accelerator is disabled and the CPU operates on native instructions obtained from the memory.
Description
BACKGROUND OF THE INVENTION

Java™ is an object-orientated programming language developed by Sun Microsystems. The Java™ language is small, simple and portable across platforms and operating systems, both at the source and at the binary level. This makes the Java™ programming language very popular on the Internet.


Java™'s platform independence and code compaction are the most significant advantages of Java™ over conventional programming languages. In conventional programming languages, the source code of a program is sent to a compiler which translates the program into machine code or processor instructions. The processor instructions are native to the system's processor. If the code is compiled on an Intel-based system, the resulting program will only run on other Intel-based systems. If it is desired to run the program on another system, the user must go back to the original source code, obtain a compiler for the new processor, and recompile the program into the machine code specific to that other processor.


Java™ operates differently. The Java™ compiler takes a Java™ program and, instead of generating machine code for a particular processor, generates bytecodes. Bytecodes are instructions that look like machine code, but aren't specific to any processor. To execute a Java™ program, a bytecode interpreter takes the Java™ bytecode converts them to equivalent native processor instructions and executes the Java™ program. The Java™ bytecode interpreter is one component of the Java™ Virtual Machine.


Having the Java™ programs in bytecode form means that instead of being specific to any one system, the programs can run on any platform and any operating system as long a Java™ Virtual Machine is available. This allows a binary bytecode file to be executable across platforms.


The disadvantage of using bytecodes is execution speed. System specific programs that run directly on the hardware from which they are compiled, run significantly faster that Java™ bytecodes, which must be processed by the Java™ Virtual Machine. The processor must both convert the Java™ bytecodes into native instructions in the Java™ Virtual Machine and execute the native instructions.


One way to speed up the Java™ Virtual Machine is by techniques such as the “Just in Time” (JIT) interpreter, and even faster interpreters known as “Hot Spot JITs” interpreters. The JIT versions all result in a JIT compile overhead to generate native processor instructions. These JIT interpreters also result in additional memory overhead.


The slow execution speed of Java™ and overhead of JIT interpreters have made it difficult for consumer appliances requiring local-cost solutions with minimal memory usage and low energy consumption to run Java™ programs. The performance requirements for existing processors using the fastest JITs more than double to support running the Java™ Virtual Machine in software. The processor performance requirements could be met by employing superscalar processor architectures or by increasing the processor clock frequency. In both cases, the power requirements are dramatically increased. The memory bloat that results from JIT techniques, also goes against the consumer application requirements of low cost and low power.


It is desired to have an improved system for implementing Java™ programs that provides a low-cost solution for running Java™ programs for consumer appliances.


SUMMARY OF THE INVENTION

The present invention generally relates to a Java™ hardware accelerator which can be used to quickly translate Java™ bytecodes into native instructions for a central processing unit (CPU). The hardware accelerator speeds up the processing of the Java™ bytecodes significantly because it removes the bottleneck which previously occurred when the Java Virtual Machine is run in software on the CPU to translate Java bytecodes into native instructions.


In the present invention, at least part of the Virtual Machine is implemented in hardware as the Java hardware accelerator. The Java hardware accelerator and the CPU can be put together on a single semiconductor chip to provide an embedded system appropriate for use with commercial appliances. Such an embedded system solution is less expensive than a powerful superscalar CPU and has a relatively low power consumption.


The hardware Java accelerator can convert the stack-based Java bytecodes into a register-based native instructions on a CPU. The hardware accelerators of the present invention are not limited for use with Java language and can be used with any stack-based language that is to be converted to register-based native instructions. Also, the present invention can be used with any language that uses instructions, such as bytecodes, which run on a virtual machine.





BRIEF DESCRIPTION OF THE DRAWINGS

The present invention may be further understood from the following description in conjunction with the drawings.



FIG. 1 is a diagram of the system of the parent invention including the hardware Java™ accelerator.



FIG. 2 is a diagram illustrating the use of the hardware Java™ accelerator of the present invention.



FIG. 3 is a diagram illustrating some the details of a Java™ hardware accelerator of one embodiment of the present invention.



FIG. 4 is a diagram illustrating the details of one embodiment of a Java™ accelerator instruction translation in the system of the present invention.



FIG. 5 is a diagram illustration the instruction translation operation of one embodiment of the present invention.



FIG. 6 is a diagram illustrating the instruction translation system of one embodiment of the present invention using instruction level parallelism.



FIGS. 7A-7D are tables showing the possible lists of bytecodes which can cause exceptions in a preferred embodiment.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS


FIG. 1 is a diagram of the system 20 showing the use of a hardware Java™ accelerator 22 in conjunction with a central processing unit 26. The Java™ hardware accelerator 22 allows part of the Java™ Virtual Machine to be implemented in hardware. This hardware implementation speeds up the processing of the Java™ bytecodes. In particular, in a preferred embodiment, the translation of the Java™ bytecodes into native processor instructions is at least partially done in the hardware Java™ accelerator 22. This translation has been part of a bottleneck in the Java™ Virtual Machine when implemented in software. In FIG. 1, instructions from the instruction cache 24 or other memory is supplied to the hardware Java™ accelerator 22. If these instruction are Java™ bytecode, the hardware Java™ accelerator 22 can convert these bytecodes into native processor instruction which are supplied through the multiplexer 28 to the CPU. If a non-Java™ code is used, the hardware accelerator can be by-passed using the multiplexer 26.


The Java™ hardware accelerator can do some or all of the following tasks:

  • 1. Java™ bytecode decode;
  • 2. identifying and encoding instruction level parallelism (ILP), wherever possible;
  • 3. translating bytecodes to native instructions;
  • 4. managing the Java™ stack on a register file associated with the CPU or as a separate stack;
  • 5. generating exceptions on instructions on predetermined Java™ bytecodes;
  • 6. switching to native CPU operation when native CPU code is provided;
  • 7. performing bounds checking on array instructions; and
  • 8. managing the variables on the register file associated with the CPU.


In a preferred embodiment, the Java™ Virtual Machine functions of bytecode interpreter, Java™ register, and Java™ stack are implemented in the hardware Java™ accelerator. The garbage collection heap and constant pool area can be maintained in normal memory and accessed through normal memory referencing.


The major advantages of the Java™ hardware accelerator is to increase the speed in which the Java™ Virtual Machine operates, and allow existing native language legacy applications, software base, and development tools to be used. A dedicated microprocessor in which the Java™ bytecodes were the native instructions would not have access to those legacy applications.


Although the Java™ hardware accelerator is shown in FIG. 1 as separate from the central processing unit, the Java™ hardware accelerator can be incorporated into a central processing unit. In that case, the central processing unit has a Java™ hardware accelerator subunit to translate Java™ bytecode into the native instructions operated on by the main portion of the CPU.



FIG. 2 is a state machine diagram that shows the operation of one embodiment of the present invention. Block 32 is the power-on state. During power-on, the multiplexer 28 is set to bypass the Java™ hardware accelerator. In block 34, the native instruction boot-up sequence is run. Block 36 shows the system in the native mode executing native instructions and by-passing the Java™ hardware accelerator.


In block 38, the system switches to the Java™ hardware accelerator mode. In the Java™ hardware accelerator mode, Java™ bytecode is transferred to the Java™ hardware accelerator 22, converted into native instructions then sent to the CPU for operation.


The Java™ accelerator mode can produce exceptions at certain Java™ bytecodes. These bytecodes are not processed by the hardware accelerator 22 but are processed in the CPU 26. As shown in block 40, the system operates in the native mode but the Java™ Virtual Machine is implemented in the CPU which does the bytecode translation and handles the exception created in the Java™ accelerator mode.


The longer and more complicated bytecodes that are difficult to handle in hardware can be selected to produce the exceptions. FIG. 7 is a table showing one possible list of bytecodes which can cause exceptions in a preferred embodiment.



FIG. 3 is a diagram illustrating details of one embodiment of the Java™ hardware accelerator of the parent invention. The Java™ hardware accelerator includes Java™ accelerator instruction translation hardware 42. The instruction translation Unit 42 is used to convert Java™ bytecodes to native instructions. One embodiment of the Java™ accelerator instruction translation hardware 42 is described in more detail below with respect to FIG. 4. This instruction translation hardware 42 uses data stored in hardware Java™ registers 44. The hardware Java™ Registers store the Java™ Registers defined in the Java™ Virtual Machine. The Java™ Registers contain the state of the Java™ Virtual Machine, affect its operation, and are updated after each bytecode is executed. The Java™ registers in the Java™ a virtual machine include the PC, the program counter indicating what bytecode is being executed; Optop, a pointer to the top of the operand stack; Frame, a pointer to the execution environment of the current method; and Vars, a pointer to the first local variable available of the currently executing method. The virtual machine defines these registers to be a single 32-bit word wide. The Java™ registers are also stored in the Java™ stack which can be implemented as the hardware Java™ stack 50 or the Java™ stack can be stored into the CPU associated register file.


In a preferred embodiment, the hardware Java™ registers 44 can include additional registers for the use of the instruction translation hardware 42. These registers can include a register indicating a switch to native instructions and a register indicating the version number of the system.


The Java™ PC can be used to obtain bytecode instructions from the instruction cache 24. In one embodiment the Java™ PC is multiplexed with the normal program counter 54 of the central processing unit 26 in multiplexer 52. The normal PC 54 is not used during the operation of the Java™ hardware bytecode translation. In another embodiment, the normal program counter 54 is used as the Java™ program counter.


The Java™ registers are a part of the Java™ Virtual Machine and should not be confused with the general registers 46 or 48 which are operated upon by the central processing unit 26. In one embodiment, the system uses the traditional CPU register file 46 as well as a Java™ CPU register file 48. When native code is being operated upon the multiplexer 56 connects the conventional register file 46 to the execution logic 26c of the CPU 26. When the Java™ hardware accelerator is active, the Java™ CPU register file 48 substitutes for the conventional CPU register file 46. In another embodiment, the conventional CPU register file 46 is used.


As described below with respect to FIGS. 3 and 4, the Java™ CPU register file 48, or in an alternate embodiment the conventional CPU register file 46, can be used to store portions of the operand stack and some of the variables. In this way, the native register-based instructions from the Java™ accelerator instruction translator 42 can operate upon the operand stack and variable values stored in the Java™ CPU register file 48, or the values stored in the conventional CPU register file 46. Data can be written in and out of the Java™ CPU register file 48 from the data cache or other memory 58 through the overflow/underflow line 60 connected to the memory arbiter 62. The overflow/underflow transfer of data to and from the memory can be done concurrently with the CPU operation. Alternately, the overflow/underflow transfer can be done explicitly while the CPU is not operating. The overflow/underflow bus 60 can be implemented as a tri-state bus or as two separate buses to read data in and write data out of the register file when the Java™ stack overflows or underflows.


The register files for the CPU could alternately be implemented as a single register file with native instructions used to manipulate the loading of operand stack and variable values to and from memory. Alternately, multiple Java™ CPU register files could be used: one register file for variable values, another register file for the operand stack values, and another register file for the Java™ frame stack holding the method environment information.


The Java™ accelerator controller (co-processing unit) 64 can be used to control the hardware Java™ accelerator, read in and out from the hardware Java™ registers 44 and Java™ stack 50, and flush the Java™ accelerator instruction translation pipeline upon a “branch taken” signal from the CPU execute logic 26c.


The CPU 26 is divided into pipeline stages including the instruction fetch 26a, instruction decode 26b, execute logic 26c, memory access logic 26d, and writeback logic 26e. The execute logic 26c executes the native instructions and thus can determine whether a branch instruction is taken and issue the “branch taken” signal. FIG. 4 illustrates an embodiment of a Java™ accelerator instruction translator which can be used with the present invention. The instruction buffer 70 stores the bytecode instructions from the instruction cache. The bytecodes are sent to a parallel decode unit 72 which decodes multiple bytecodes at the same time. Multiple bytecodes are processed concurrently in order to allow for instruction level parallelism. That is, multiple bytecodes may be converted into a lesser number of native instructions.


The decoded bytecodes are sent to a state machine unit 74 and Arithmetic Logic Unit (ALU) 76. The. ALU 76 is provided to rearrange the bytecode instructions to make them easier to be operated on by the state machine 74. The state machine 74 converts the bytecodes into native instructions using the lookup table 78. Thus, the state machine 74 provides an address which indicates the location of the desired native instruction in the look-up table 78. Counters are maintained to keep a count of how many entries have been placed on the operand stack, as well as to keep track of the top of the operand stack. In a preferred embodiment, the output of the look-up table 78 is augmented with indications of the registers to be operated on at line 80. The register indications are from the counters and interpreted from bytecodes. Alternately, these register indications can be sent directly to the Java™ CPU register file 48 shown in FIG. 3.


The state machine 74 has access to the Java™ registers in 44 as well as an indication of the arrangement of the stack and variables in the Java™ CPU register file 48 or in the conventional CPU register file 46. The buffer 82 supplies the translated native instructions to the CPU.


The operation of the Java™ hardware accelerator of one embodiment of the present invention is illustrated in FIGS. 5 and 6. FIG. 5, section I shows the instruction translation of the Java™ bytecode. The Java™ bytecode corresponding to the mnemonic iadd is interpreted by the Java™ virtual machine as an integer operation taking the top two values of the operand stack, adding them together and pushing the result on top of the operand stack. The Java™ translating machine translates the Java™ bytecode into a native instruction such as the instruction ADD R1, R2. This is an instruction native to the CPU indicating the adding of value in register R1 to the value in register R2 and the storing of this result in register R2. R1 and R2 are the top two entries in the operand stack.


As shown in FIG. 5, section II, the Java™ register includes a PC value of “Value A” that is incremented to “Value A+1”. The Optop value changes from “Value B” to “Value B−1” to indicate that the top of the operand stack is at a new location. The Vars base value which points to the start of the variable list is not modified. In FIG. 5, section III, the contents of a native CPU register file such as the Java™ CPU register file 48 in FIG. 3, is shown. The Java™ CPU register file starts off with registers R0-R5 containing operand stack values and registers R6-R7 containing variable values. Before the operation of the native instruction, register R1 contains the top value of the operand stack. Register R6 contains the first variable. After the execution of the native instruction, register R2 now contains the top value of the operand stack. Register R1 no longer contains a valid operand stack value and is available to be overwritten by a operand stack value from the memory sent across the overflow/underflow line 60 or from the bytecode stream.



FIG. 5, section IV, shows the memory locations of the operand stack and variables which can be stored in the data cache 58 or in main memory. For convenience, the memory is illustrated without illustrating any virtual memory scheme. Before the native instruction executes, the address of the top of the operand stack, Optop, is “Value B”. After the native instruction executes, the address of the top of the operand stack is “Value B−1” containing the result of the native instruction. Note that the operand stack value “4427” can be written into register R1 across the overflow/underflow line 60. Upon a switch back to the native mode, the data in the Java™ CPU register file 48 should be written to the data memory.


Consistency must be maintained between the Hardware Java™ Registers 44, the Java™ CPU register file 48 and the data memory. The CPU 26 and Java™ Accelerator Instruction Translation Unit 42 are pipelined and any changes to the hardware Java™ registers 44 and changes to the control information for the Java™ CPU register file 48 must be able to be undone upon a “branch taken” signal. The system preferably uses buffers (not shown) to ensure this consistency. Additionally, the Java™ instruction translation must be done so as to avoid pipeline hazards in the instruction translation unit and CPU.



FIG. 6 is a diagram illustrating the operation of instruction level parallelism with the present invention. In FIG. 6 the Java™ bytecodes iload13 n and iadd are converted by the Java™ bytecode translator to the single native instruction ADD R6, R1. In the Java™ Virtual Machine, iload13 n pushes the top local variable indicated by the Java™ register Var onto the operand stack.


In the parent invention the Java™ hardware translator can combine the iload_n and iadd bytecode into a single native instruction. As shown in FIG. 6, section II, the Java™ Register, PC, is updated from “Value A” to “Value A+2”. The Optop value remains “value B”. The value Var remains at “value C”.


As shown in FIG. 6, section III, after the native instruction ADD R6, R1 executes the value of the first local variable stored in register R6, “1221”, is added to the value of the top of the operand stack contained in register R1 and the result stored in register R1. In FIG. 6, section IV, the Optop value does not change but the value in the top of the register contains the result of the ADD instruction, 1371.


The Java™ hardware accelerator of the parent invention is particularly well suited to a embedded solution in which the hardware accelerator is positioned on the same chip as the existing CPU design. This allows the prior existing software base and development tools for legacy applications to be used. In addition, the architecture of the present embodiment is scalable to fit a variety of applications ranging from smart cards to desktop solutions. This scalability is implemented in the Java™ accelerator instruction translation unit of FIG. 4. For example, the lookup table 78 and state machine 74 can be modified for a variety of different CPU architectures. These CPU architectures include reduced instruction set computer (RISC) architectures as well as complex instruction set computer (CISC) architectures. The present invention can also be used with superscalar CPUs or very long instruction word (VLIW) computers.


While the present invention has been described with reference to the above embodiments, this description of the preferred embodiments and methods is not meant to be construed in a limiting sense. For example, the term Java™ in the specification or claims should be construed to cover successor programming languages or other programming languages using basic Java™ (the use of generic instructions, such as bytecodes, to indicate the operation of a virtual machine). It should also be understood that all aspects of the present invention are not to be limited to the specific descriptions, or to configurations set forth herein. Some modifications in form and detail the various embodiments of the disclosed invention, as well as other variations in the present invention, will be apparent to a person skilled in the art upon reference to the present disclosure. It is therefore contemplated that the following claims will cover any such modifications or variations of the described embodiment as falling within the true spirit and scope of the present invention.

Claims
  • 1. A method for a central processing unit (CPU), comprising: selectively operating decode logic to decode Reduced Instruction Set Computer (RISC) instructions and virtual machine instructions wherein register indications are produced for the virtual machine instructions without translating to RISC instructions;a mechanism to store operands for the RISC instructions in the register file;processing the decoded instructions in a single execution unit within the CPU;said processing comprising selectively operating the single execution unit and a register file to process outputs from the decode logic corresponding to the RISC instructions or the virtual machine instructions;operating a common program counter for the RISC instructions and the virtual-machine instructions;maintaining a virtual machine operand stack in the register file with at least one of an underlow and overflow mechanism for the operand stack when selectively decoding virtual machine instructions; andconfiguring the CPU to process RISC instructions after at least one of a reset and power-on.
  • 2. The method of claim 1, wherein the selective decoding comprises decoding instructions of the RISC instruction set after at least one of a reset and power-on corresponding to a first mode of the CPU.
  • 3. The method of claim 2, wherein the first mode comprises operating a CPU pipeline to process the RISC instructions.
  • 4. The method of claim 2, wherein in the first mode no software virtual machine is operative.
  • 5. The method of claim 3, wherein the first mode comprises a second mode wherein a virtual machine is operative in software.
  • 6. The method of claim 5, further comprising a third mode wherein a CPU pipeline to process the virtual machine instructions is operative.
  • 7. The method of claim 6, wherein in the third mode at least some virtual machine instructions are executed without calls to any virtual machine running in software.
  • 8. The method of claim 1, further comprising operating a virtual machine in the CPU.
  • 9. A system, comprising: memory for storing instructions and data; and a central processing unit (CPU) coupled to the memory, comprising: a single execution unit and a register file for executing Reduced Instruction Set Computer (RISC) instructions;logic to decode RISC instructions;a mechanism to store operands for the RISC instructions in the register file;logic to operate a virtual machine and logic to produce register indications for the virtual machine;logic for processing the RISC instructions and operating the virtual machine with said register file and single execution unit, wherein said operating of the virtual machine is based on the register indications;a common program counter for processing the RISC instructions and operating the virtual machine; andlogic to maintain a virtual machine operand stack in the register file with at least one of an underflow and overflow mechanism for the operand stack, wherein the CPU is configured to decode RISC instructions after at least one of a reset and power-on.
  • 10. The system of claim 9, further comprising logic to operate a CPU pipeline to process the RISC instructions.
  • 11. The system of claim 10, comprising logic to operate the CPU pipeline without a virtual machine.
  • 12. The system of claim 9, further comprising logic to operate a CPU pipeline to operate a software virtual machine using RISC instructions.
  • 13. The system of claim 9, further comprising logic to process virtual machine instructions in a CPU pipeline.
  • 14. The system of claim 13, further comprising logic to manage an operand stack in the common register file.
  • 15. The system of claim 14 or 13, wherein the CPU comprises logic to process at least some virtual machine instructions without calls to any virtual machine running in software.
  • 16. The system of claim 14, comprising logic to decode multiple virtual machine instructions in parallel.
  • 17. The system of claim 9, wherein the CPU comprises logic to operate a virtual machine.
  • 18. A central processing unit (CPU), comprising: a common register file for processing the RISC instructions and the virtual machine instructions;logic to decode RISC instructions;a mechanism to store operands for the RISC instructions in the register file;logic to decode virtual machine instructions including producing register indications for the virtual machine instructions without translating to RISC instructions;a common program counter for processing the RISC instructions and the virtual machine instructions;logic to process first outputs from said logic to decode corresponding to RISC instructions and second outputs from said logic to decode corresponding to virtual machine instructions, said logic to process first and second outputs further comprising a single execution unit; and wherein the CPU has a mechanism to store operands for the virtual machine instructions in the common register file, including logic to maintain a virtual machine operand stack in the register file with at least one of an underlow and overflow mechanism for the operand stack; anda mechanism to configure the CPU to process RISC instructions after at least one of a reset and power-on.
  • 19. The CPU of claim 18, further comprising logic to operate a CPU pipeline to process the RISC instructions.
  • 20. The CPU of claim 18 further comprising logic to operate the CPU without a virtual machine.
  • 21. The CPU of claim 18, further comprising logic to operate a virtual machine using RISC instructions.
  • 22. The CPU of claim 21, further comprising a pipeline and logic to process virtual machine instructions in the CPU pipeline.
  • 23. The CPU of claim 18 or 22, further comprising logic to process at least some virtual machine instructions without calls to any virtual machine running in software.
  • 24. The CPU of claim 22, comprising logic to maintain an operand stack for a virtual machine in the register file.
  • 25. The CPU of claim 24, comprising logic to produce one of an overflow and underflow indication for the operand stack.
  • 26. The CPU of claim 25, further comprising logic to move operands between the common register file and a memory due to one of a overflow and underflow condition.
  • 27. The CPU of claim 24, comprising logic to produce register references for the operands in the operand stack.
  • 28. The CPU of claim 24, further comprising logic to produce an exception for some virtual machine instructions.
  • 29. The CPU of claim 21 further including logic to perform bounds checking for array instructions.
  • 30. The CPU of claim 18, further comprising logic to operate a virtual machine.
  • 31. The CPU of claim 18, comprising logic to decode multiple virtual machine instructions at the same time.
  • 32. A central processing unit (CPU), comprising: a single execution unit and associated register file, the single execution unit having logic to execute first output of decode logic corresponding to RISC instructions including a mechanism to store operands for the RISC instructions in the register file, and second output of a two stage decode logic further comprising logic to produce register indications for the register file corresponding to virtual machine instructions without translating to RISC instructions;logic to maintain at least some data for processing the first output and the second output in the register file including logic to maintain an operand stack for a virtual machine in the register file;and logic for a stack control mechanism that includes at least one of an overflow and underflow mechanism;a common program counter for the RISC instructions and the virtual-machine instructions;logic to generate an exception for at least some virtual machine instructions; and a mechanism to configure the CPU to process register-based instructions after at least one of a reset and power-on.
  • 33. The CPU of claim 32, comprising logic to move operands between the register file and a memory due to one of a overflow and underflow condition.
  • 34. The CPU of claim 32, comprising logic to produce register references for the operands in the operand stack.
  • 35. The CPU of claim 32, further comprising logic to switch from executing register-based instructions to executing virtual machine instructions.
  • 36. The CPU of claim 32 further comprising array bounds checking logic for array instructions.
  • 37. The CPU of claim 32, comprising logic to operate a common program counter for processing the RISC instructions and the virtual machine instructions.
VIRTUAL MACHINE HARDWARE FOR RISC AND CISC PROCESSORS

This application is a continuation of co-pending U.S. patent application Ser. No. 09/938,886 filed 8 Aug. 2001 and entitled “Java Virtual Machine hardware for RISC and CISC Processor.”

US Referenced Citations (126)
Number Name Date Kind
3889243 Drimak Jun 1975 A
4236204 Groves Nov 1980 A
4524416 Stanley et al. Jun 1985 A
4587612 Fisk et al. May 1986 A
4587632 Ditzel May 1986 A
4631663 Chilinski et al. Dec 1986 A
4763255 Hopkins et al. Aug 1988 A
4783738 Li et al. Nov 1988 A
4860191 Nomura et al. Aug 1989 A
4922414 Holloway et al. May 1990 A
4961141 Hopkins et al. Oct 1990 A
4969091 Muller Nov 1990 A
5077657 Cooper et al. Dec 1991 A
5113522 Dinwiddie, Jr. et al. May 1992 A
5136696 Beckwith et al. Aug 1992 A
5142681 Driscoll et al. Aug 1992 A
5163139 Haigh et al. Nov 1992 A
5193180 Hastings Mar 1993 A
5201056 Daniel et al. Apr 1993 A
5218711 Yoshida Jun 1993 A
5241636 Kohn Aug 1993 A
5265206 Shackelford et al. Nov 1993 A
5307492 Benson Apr 1994 A
5313614 Goettelmann et al. May 1994 A
5333296 Bouchard et al. Jul 1994 A
5335344 Hastings Aug 1994 A
5355460 Eickemeyer et al. Oct 1994 A
5430862 Smith et al. Jul 1995 A
5481684 Richter et al. Jan 1996 A
5490256 Mooney et al. Feb 1996 A
5535329 Hastings Jul 1996 A
5542059 Blomgren Jul 1996 A
5574927 Scantlin Nov 1996 A
5577233 Goettelmann et al. Nov 1996 A
5584026 Knudsen et al. Dec 1996 A
5619665 Emma Apr 1997 A
5634118 Blomgren May 1997 A
5638525 Hammond et al. Jun 1997 A
5650948 Gafter Jul 1997 A
5659703 Moore et al. Aug 1997 A
5668999 Gosling Sep 1997 A
5680641 Sidman Oct 1997 A
5692170 Isaman et al. Nov 1997 A
5740441 Yellin et al. Apr 1998 A
5740461 Jaggar Apr 1998 A
5748964 Gosling May 1998 A
5752035 Trimberger May 1998 A
5761477 Wahbe et al. Jun 1998 A
5764908 Shoji et al. Jun 1998 A
5768593 Walters et al. Jun 1998 A
5774868 Cragun et al. Jun 1998 A
5778178 Arunachalam Jul 1998 A
5781750 Blomgren et al. Jul 1998 A
5784584 Moore et al. Jul 1998 A
5794068 Asghar et al. Aug 1998 A
5805895 Breternitz, Jr. et al. Sep 1998 A
5809336 Moore et al. Sep 1998 A
5838165 Chatter Nov 1998 A
5838948 Bunza Nov 1998 A
5875336 Dickol et al. Feb 1999 A
5889996 Adams Mar 1999 A
5898850 Dickol et al. Apr 1999 A
5898885 Dickol et al. Apr 1999 A
5903761 Tyma May 1999 A
5905895 Halter May 1999 A
5920720 Toutonghi et al. Jul 1999 A
5923892 Levy Jul 1999 A
5925123 Tremblay et al. Jul 1999 A
5926832 Wing et al. Jul 1999 A
5937193 Evoy Aug 1999 A
5953736 O'Connor et al. Sep 1999 A
5953741 Evoy Sep 1999 A
5619666 Coon et al. Nov 1999 A
5983334 Coon et al. Nov 1999 A
5999731 Yellin et al. Dec 1999 A
6003038 Chen et al. Dec 1999 A
6009499 Koppala Dec 1999 A
6014723 Tremblay et al. Jan 2000 A
6021469 Tremblay et al. Feb 2000 A
6026485 O'Connor et al. Feb 2000 A
6031992 Cmelik et al. Feb 2000 A
6038643 Tremblay et al. Mar 2000 A
6052526 Chatt Apr 2000 A
6065108 Tremblay et al. May 2000 A
6067577 Beard May 2000 A
6071317 Nagel Jun 2000 A
6075940 Gosling Jun 2000 A
6076141 Tremblay et al. Jun 2000 A
6081665 Nilsen Jun 2000 A
6085198 Skinner et al. Jul 2000 A
6088786 Feierbach et al. Jul 2000 A
6108768 Koppala et al. Aug 2000 A
6110226 Bothner Aug 2000 A
6118940 Alexander, III et al. Sep 2000 A
6122638 Huber et al. Sep 2000 A
6125439 Tremblay et al. Sep 2000 A
6131144 Koppala Oct 2000 A
6131191 Cierniak et al. Oct 2000 A
6139199 Rodriguez Oct 2000 A
6141794 Dice et al. Oct 2000 A
6148391 Petrick Nov 2000 A
6151702 Overturf et al. Nov 2000 A
6158048 Lueh et al. Dec 2000 A
6167488 Koppala Dec 2000 A
6209077 Robertson et al. Mar 2001 B1
6233678 Bala May 2001 B1
6275903 Koppala et al. Aug 2001 B1
6275984 Morita Aug 2001 B1
6292883 Augusteijn et al. Sep 2001 B1
6298434 Lindwer Oct 2001 B1
6317872 Gee et al. Nov 2001 B1
6321323 Nugroho et al. Nov 2001 B1
6330659 Poff et al. Dec 2001 B1
6338160 Patel et al. Jan 2002 B1
6349377 Lindwer Feb 2002 B1
6374286 Gee et al. Apr 2002 B1
6397379 Yates et al. May 2002 B1
6513156 Bak et al. Jan 2003 B2
6532531 O'Conner et al. Mar 2003 B1
6606743 Raz et al. Aug 2003 B1
6826748 Hohensee et al. Nov 2004 B1
7137110 Reese et al. Nov 2006 B1
7225436 Patel May 2007 B1
7254806 Yates et al. Aug 2007 B1
20020032718 Yates et al. Mar 2002 A1
20020078115 Poff et al. Jun 2002 A1
Non-Patent Literature Citations (39)
Entry
“The Java.TM. Virtual Machine Specification”, Sun Microsystems, Inc., Sep. 1996, Chapters 1-3 and 10 (65 pages). Online retrieved at <java.sun.com/docs/books/jvms/>.
Wikipedia website, “List of instruction sets”, accessed on Jul. 30, 2012, 10 pages, <http://en.wikipedia.org/wiki/List—of—instruction—sets>.
Paez-Monzon et al., The RISC processor DMN-6: a unified data-control flow architecture, Sep. 1996, 8 pages.
Hilgendorf et al., Instruction translation for an experimental S/390 processor, Mar. 2001, 6 pages.
Andrews, et al., “Migrating a CISC computer family onto RISC via object code translation”, Proceedings of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, 1992.
Berekovic, et al., “Hardware Realization of a Java Virtual Machine for High Performance Multimedia Applications”, IEEE Workshop on Signal Processing Systems , (Jan. 1, 1997).
Debaere, et al., “Interpretation and Instruction Pathcoprocessing”, The MIT Press, (Jan. 1, 1990).
Deutsch. Peter “Efficient Implementation of the Smalltalk-80 System”, 11th ACM SIGACT-SIGPLAN Symposium on Principles of Programming Languages, 1984.
El-Kharashi, et al., “JAVA Microprocessor: Computer Architecture Implications,”, IEEE,(Aug. 20, 1997).
ERTL, “A new approach to forth native code generation”, EuroForth Conference Proceedings, 1992.
Ertl, “Implementation of stack-based languages on register machines”, dissertation, Apr. 1996.
ERTL, “Stack caching for interpreters”, SIGPLAN, 1995.
ERTL, “Stack caching for Interpreters”, EuroForth Conference Proceedings 1994.
Glossner, et al., “Delft-Java Link Translation Buffer”, Proceedings of the 24th EUROMICRO conference, Aug. 1998.
Glossner, et al., “The Delft Java Engine: An Introduction, Euro-Part '97, Parallel Processing, Third International Euro-Par Conference”, (Aug. 1, 1997).
Hsieh, at al., “Java Byte Code to Native Code Translation: The Caffeine Prototype and Preliminary Results”, IEEE, (Jan. 1, 1996).
INFOWORLD, “SGI Webforce 02 is a one-stop web authoring platform”, Infoworld Jan. 20, 1997.
Interactive Daily, “Sun Says Java Chips Will Vastly Increase Speed, Reduce Costs to Run Java Programs”, Download From Internet, (Dec. 1996).
Kieburtz, “A RISC architecture for symbolic computation”, ACM 1987.
Krall, et al., “A 64 bit Java VM just-intime compiler”, XP-002117590, 1997.
Krall, Andreas “Efficient Java VM Just-In-Time Compilation”, IEEE, (Jan. 1, 1998).
Mahlke, et al, “A Comparison of Full and Partial Predicted Execution Support for ILP Processors”, IEEE, (Jan. 1, 1995).
Maierhofer, et al., “Optimizing stack code”, Forth-Tagung, 1997.
McGhan, et al., “picoJava: A Direct Execution Engine for Java Bytecode”, IEEE, 1998.
Miyoshi, at al., “Implementation and Evaluation of Real Time Java Threads”, IEEE, (Jan. 1, 1997).
O'Conner, et al., “plcoJava-I: The Java Virtual Machine in Hardware”, IEEE, Mar. 1997.
Pang, et al., “Providing Soft Real-Time QoS Guarantees for Java Threads”, ACM (Jan. 1, 2001).
Radhakrishnan, et al., “Improving Java Performance Using Hardware Translation”, ACM, (Jan. 1, 2001).
Rose, A C., “Hardware Java Accelerator for the ARM 7”, 4th Year Undergraduate Project in Group D, (1996/97), 1-49, Appendix.
Steensgarrd, et al. “Object and Native Code Thread Mobility Among Heterogeneous Computers”, ACM, (Jan. 1, 1995).
Steinbusch, Otto , “Designing Hardware to Interpret Virtual Machine Instructions”, Dept. of Electrical Engineering, Eindhoven University of Technology, Masters Degree Thesis, Feb. 1998, (Jan, 1, 1998),59.
Sun Microsystems, “PicoJava 1 Microprocessor Core Architecture”, Oct. 1996, (Oct. 1996).
Sun Microsystems, “PicoJava I, Java Processor Core Data Sheet”, Dec. 1997.
Tomasulo, R. , “An Efficient Algorithm for Exploring Multiple Arithmetic Units”, IBM Journal of Research and Development, (Jan. 1, 1967).
Ungar, et al., “Architecture of SOAR: Smalltalk on a RISC”, 11th Symposium on Computer Architecture Jun. 1984, (Jun. 1, 1984).
Watanabe, et al., “Exploring Java Instruction/Thread Level Parallelism With Horizontal Mutithreading”, IEEE, (Jan. 1, 2001).
Kim et al, Designing a Java Microprocessor Core Using FPGA Technology, Lucent Technologies and Illinois Institute of Technology, 1998 IEEE Xplore.
Andrews et al. Migrating a CISC Computer Family onto RISC via Object Code Translation, Tandem Computers Inc, Cupertino, CA, 1992 ACM.
O' Connor et al, PicoJava-1:The Java Virtual Machine in Hardware, Sun Microelectronics,Mar./Apr. 1997 IEEE Micro.
Related Publications (1)
Number Date Country
20050240915 A1 Oct 2005 US
Continuations (1)
Number Date Country
Parent 09938886 Aug 2001 US
Child 11171681 US