This application is based on an application No. 2002-225286 filed in Japan, the content of which is hereby incorporated by reference.
(1) Field of the Invention
The present invention relates to a compiler that translates a source program into an object program, i.e. a machine code program. More particularly, the present invention relates to a compiler apparatus and a method used by a compiler apparatus to determine locations for variables in a memory area.
(2) Description of the Related Art
With the recent trend for larger-scale software, it becomes common to write software in high-level languages, such as C and C++, which are suitable for developing a larger-scale system.
Before loaded onto memory and executed, source programs written in a high-level language need to be translated into machine language codes by compilers dedicated for each source language so as to be directly executable by a CPU. A program loaded to memory includes, sets each composed of a code representing an instruction and a data item representing a variable. Each instruction represented by a code is executed by referencing a corresponding data item.
Here, there are two types of variables represented by data items. One is a global variable for which memory area is reserved at all times throughout run-time of the program, and the other is an automatic variable for which memory area (stack area) is reserved at the time when a predetermined function is called. Hereinafter, an “automatic variable” is simply referred to as a “variable”.
A compiler determines locations in a stack area for each variable that corresponds to a predetermined function.
First, the compiler reads a source program (step S1), and performs parsing of the construction of sentences constituting the read program (step S2), and generates a variable correspondence table that contains variables to be allocated to the stack area (step S3).
Here, the variable correspondence table shows, for each variable contained, its variable type, variable name, data size, and alignment. Note that a “data size” is hereinafter referred to simply as a “size”.
“Alignment” is a value predetermined according to each variable type and shows the strength of a constraint on a location in a stack area to which a corresponding variable is permitted to be allocated. For example, a variable having alignment “2” must be allocated to a location in a stack area whose address is a multiple of 2, and a variable having alignment “4” must be allocated to a location in a stack area whose address is a multiple of 4.
The compiler regards variables contained in the variable correspondence table as a target-variable set to be allocated to the stack area (step S4). The compiler selects from the target-variable set, a smallest-size variable, and determines a location in the stack area for the selected variable (step S5). The compiler then judges whether all of the variables in the target-variable set have been determined their locations in the stack area (step S6).
When judging that locations have been determined for all of the variables (step S6: Y), the compiler terminates the processing. On the other hand, when judging that any of the variables has not yet been determined its location (step S6: N), the compiler goes back to the step S5.
For example, when the variables contained in the variable correspondence table shown in
Here, each cell in
In
With this memory allocation, when a variable stored within, for example, the first 32 bytes of the stack area is accessible with a single instruction, three are three different type variables each of which is accessible with a single instruction.
As described above, when variables are determined to be allocated in a stack area in the ascending order of size, a faster processing speed is achieved especially in the case of a program in which small-sized variables are frequently referenced.
Further, there is a conventional scheme 2 which determines to allocate variables to a stack area in the descending order of alignment.
The compiler reads a source program (step S11); performs parsing of the construction of sentences constituting the read program (step S12); generates, based on the result of parsing, a variable correspondence table that contains variables to be allocated to the stack area (step S13); regards variables in the variable correspondence table as a target-variable set to be allocated to the stack area (step S14); selects, from the target-variable set, a variable having a largest alignment value and determines a location in the stack area for the selected variable (step S15); and judges whether all of the variables in the target-variable set have been determined their locations in the stack area (step S16).
When judging that locations have been determined for all of the variables (step S16: Y), the compiler terminates the processing. On the other hand, when judging that any of the variables has not yet been determined its location (step S16: N), the compiler goes back to the step S15.
For example, when the variables contained in the variable correspondence table shown in
As shown in
As described above, memory allocation in the descending order of alignment minimizes a wasted, unused memory, and thus variables are effectively stored in a smaller capacity stack area.
Unfortunately, however, both conventional schemas have the following problems. Memory allocation according to the conventional scheme 1 inevitably results in that some of the stack area remains unused and thus wasted. As a result, a greater memory capacity is required.
Memory allocation according to the conventional scheme 2 tends to allocate a large-size variable at the top of the stack area, and thus fewer variables are accessible with a single instruction. This leads to decrease processing speed especially in the case of a program in which variables having smaller alignment are frequently referenced.
In view of the above problems, an object of the present invention is to provide a compiler apparatus and a method for optimally determining locations of variables in a stack area.
To achieve the object stated above, (1) one aspect of the present invention provides a compiler apparatus for
(2) Here, the allocation data selecting unit may
(3) Further, another aspect of the present invention provides
(4) Here, the allocation data selecting step may
(5) Further, yet another aspect of the present invention provides
(6) Here, the allocation data selecting step may
(7) Further, yet another aspect of the present invention provides
(8) Here, the allocation data selecting step may
With the constructions stated above, locations for storing data items are optimally determined so that data items are allocated in a memory area of a predetermined size.
Further, in the construction (1) stated above, the first criterion may
Further, in the construction (1) stated above, the compiler apparatus may
Further, in the construction (2) stated above, the first criterion may
Further, in the construction (3) stated above, the first criterion may
Further, in the construction (4) stated above, the first criterion may
Further, in the construction (5) stated above, the first criterion may
Further, in the construction (6) stated above, the first criterion may
Further, in the construction (7) stated above, the first criterion may
Further, in the construction (8) stated above, the first criterion may
With the constructions stated above, locations for storing data items are optimally determined in a manner that data items are allocated in a memory area of a predetermined size with a minimum memory space left unused and that as many data items as possible are accessed with a small number of instructions.
Further, in the construction (2) stated above, the first criterion may
Further, in the construction (1) stated above, the first criterion may
Here, the compiler may
Further, in the construction (3) stated above, the first criterion may
Further, in the construction (4) stated above, the first criterion may
Further, in the construction (5) stated above, the first criterion may
Further, in the construction (6) stated above, the first criterion may
Further, in the construction (7) stated above, the first criterion may
Further, in the construction (8) stated above, the first criterion may
With the constructions stated above, locations for storing data items are optimally determined in a manner that a plurality of data items is allocated in a memory area of a predetermined size with a minimum memory space left unused and in consideration of how frequently each data item is referenced. As a result, processing speed of the resulting object program improves.
These and the other objects, advantages and features of the invention will become apparent from the following description thereof taken in conjunction with the accompanying drawings which illustrate a specific embodiment of the invention.
In the drawings:
<Construction>
In response to a user instruction inputted via the input unit 103, the CPU 101 executes a compiler program 204 stored in the memory unit 104 to compile a source program 205. In the compilation process, the CPU 101 determines locations in a stack area for storing variables included in the source program 205, and translates the source program 205 into an object program 305.
To be more specific, the CPU 101 reads the source program 205 from the memory unit 104, and performs parsing of the construction of sentences constituting the read source program 205, and determines locations in the stack area to allocate the variables based on the result of parsing, and then generates the object program 305.
Note that the location determination processing is described later in detail.
The display unit 102 displays the result of compilation conducted by the CPU 101.
The input unit 103 receives a user input instructing to execute the compilation.
The memory unit 104 stores therein the compiler program 204 and the source program 205. In addition, when the compilation is done, the memory unit 104 stores therein the object program 305 translated from the source program 205.
<Operations>
Next, description is given to the location determination processing performed by the CPU 101 to determine locations of the variables in the stack area.
First, the CPU 101 reads the source program 205 from the memory unit 104, and performs parsing of the construction of sentences constituting the read program. Based on the result of parsing, the CPU 101 generates a variable correspondence table (step S1002). In addition, the CPU 101 generates an empty set as an exclusion-variable set (step S1003), and a target-variable set composed of all the variables included in the variable correspondence table (step S1004) The CPU 101 then selects a variable having a largest alignment value from the target-variable set (step S1005), and determines for the selected variable a location in the stack area that (i) has not been determined as a location for any variable, (ii) satisfies the alignment constraint, and (iii) has a smallest possible address (step S1006). The CPU 101 removes from the target-variable set the variable for which location is determined (step S1007), and then adds the thus removed variable to a determined-variable set (step S1008) The CPU 101 then judges whether the address of the thus determined location is within a predetermined address range from the starting address of the stack area (step S1009).
When the address of the determined location is not within the predetermined address range (step S1009: N), the CPU 101 specifies a largest-size variable out of all the variables in the determined-variable set (when the determined-variable set includes only one variable, that variable is naturally specified) The CPU 101 removes the specified variable from the determined-variable set, and adds the thus removed variable to the exclusion-variable set (step S1010). Further, the CPU 101 puts all the variables remaining in the determined-variable set back to the target-variable set (step S1011). Thereafter, the CPU 101 repeats the steps S1005-S119 to newly determine locations for all the variables in the current target-variable set.
On the other hand, when the address of the location determined in the step S1006 is within the predetermined address range (step S1009: Y), the CPU 101 judges whether there is no more variable left in the target-variable set (step S1012).
When there is a variable left (step S1012: N), the CPU 101 returns to the step S1005.
When there is no variable left (step S1012: Y), on the other hand, the CPU 101 further judges whether there is any variable included in the exclusion-variable set (step S1013).
When there is a variable in the exclusion-variable set (step S1013: Y), the CPU 101 selects a smallest-size variable from the exclusion-variable set (step S1014). (Note that when the exclusion set includes only one variable, that variable is naturally selected.) The CPU 101 then determines for the thus selected variable a location in the stack area that (i) has not been determined as a location for any variable, (ii) satisfies the alignment constraint, and (iii) has a smallest possible address (step S1015). The CPU 101 removes, from the exclusion-variable set, the variable for which location is determined (step S1016), and goes back to the step S1013.
On the other hand, when there is no variable left in the exclusion set (step S1013: N), the CPU 101 terminates the processing.
Referring now to the example shown in
Since the location of the variable d begins at the address 0, the CPU 101 judges that the address of the determined location is within the predetermined address range (step S1009: Y), and further judges whether there is no more variable remains in the target-variable set (step S1012).
Here, there are three variables a, b and c remaining in the target-variable set, so that the CPU 101 judges accordingly (step S1012: N), and selects the variable b having a largest alignment value in the target-variable set (step S1005). The CPU 101 then determines to allocate the variable b to a location in the stack area whose address begins at 32 (step S1006). This is because the variable b with the alignment 4 must be allocated to a location in the stack area that (i) has not been determined as a location for any variable, (ii) satisfies the alignment constraint, and (iii) has a smallest possible address. As shown in
Since the location of the variable b begins at the address 32, the CPU 101 judges that the address of the determined location falls out of the predetermined address range (step S1009: N). The CPU 101 thus specifies the variable d that is a largest-size variable in the determined-variable set composed of the variables b and d, and then moves the specified variable d from the determined-variable set to the exclusion-variable set (step S1010). Next, the CPU 101 puts back to the target-variable set, all the variables remaining in the determined-variable set after removal of the specified variable d (step S1011). Thereafter, the CPU 101 repeats the steps S1005–S1009 and S1012 to newly determine locations in the stack area for the variables b, c and a sequentially (the order of the variables c and a may be reversed), and then the variables b, c and a are sequentially added to the determined-variable set.
With the above processing, the variables b, c and a are determined to be allocated to locations in the stack area so as to fall within a range that begins at the address 0 and ends at the address 23, as shown in a schematic view of
With the above operations, the variables a, b, c and d are determined to be allocated to the locations in the stack area as shown inaschematic diagram of
In the object program shown in
Similarly, in the object program shown-in
As described above, the location determination processing according to the present embodiment is effective to reduce the number of instructions required to access variables stored in the stack area, and thus to reduce the size of a required instruction code.
<Supplemental Remarks>
Up to this point, a compiler apparatus according to the present invention has been described by way of the above embodiment. However, it is naturally appreciated the present invention is not limited to the specific embodiment described above, and following modifications may be made.
(1) In the embodiment above, the step S1005 shown in
In the step S1010 shown in
In another alternative, the specification may be made based on whether each variable in the stack area is accessible with an instruction with less constraint or with an instruction having a smaller code size.
(2) In the step S1014 shown in
In another alternative, the selection may be made based on whether each variable in the stack area is accessible with an instruction with less constraint or with an instruction having a smaller code size.
(3) In the location determination processing shown in
To be more specific, for example, the above steps are performed first on the variables included in the target-variable set to determine locations for the variable within a 32-byte address range of a stack area corresponding to the address 0–32. At this stage, some of the variable may not be determined to be allocated in the above range and thus remains in the exclusion-variable set. Next, the same steps are performed on the variables remaining in the exclusion-variable set so as to determine locations for the remaining variables within another 32-byte address range of the stack area corresponding to the address 32–36. The same steps are further performed on the variables included in the exclusion-variable set so as to determine locations for the variables in a yet another 32-byte address range of the stack area corresponding to the address 64–95. In this manner, the operations of the steps S1016–1016 may be repeatedly performed for a different address range of the stack area until locations are determined for all the variables.
Note that unlike the above example, the address range subjected to each sequence of location determination processing performed for the exclusion-variable set may not be equal in size. Instead, the stack area may be divided at the offset boundaries (which are determined depending on, for example, instruction size, latency, and combinations of instructions), and subjected to location determination processing in the ascending order of offset values.
Although the present invention has been fully described by way of examples with reference to the accompanying drawings, it is to be noted that various changes and modifications will be apparent to those skilled in the art. Therefore, unless such changes and modifications depart from the scope of the present invention, they should be construed as being included therein.
Number | Date | Country | Kind |
---|---|---|---|
2002-225286 | Aug 2002 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5586323 | Koizumi et al. | Dec 1996 | A |
20030126399 | Troutman et al | Jul 2003 | A1 |
20040153589 | Nishioka | Aug 2004 | A1 |
Number | Date | Country | |
---|---|---|---|
20040172624 A1 | Sep 2004 | US |