This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2017-25400, filed, on Feb. 14, 2017, the entire contents of which are incorporated herein by reference.
The embodiments discussed herein are related of a technique for recursive processing.
To solve a certain problem in using a result of a small problem corresponding to part of the problem, a recursive process is used,
For example, in a recursive process executed in the case where a suffix array is generated, in solving a problem of a certain step, information of a problem size of a previous step is used. For this reason, in addition to a storage space that stores a problem instance of each step, a storage space that stores information of a problem size of each step is prepared,
Thus, in the case where a certain type of recursive process is executed, an additional storage space is used. In the case of a system (for example, an embedded system) in which a particularly limited storage space is usable, the occupation of the storage space may affect the performance of a process other than the recursive process.
A related technique is disclosed in Japanese Laid-open Patent Publication No. 1-142959, for example.
According to an aspect of the invention, an apparatus for recursive processing includes: a memory and a processor configured to: in a first step of a plurality of steps in which a specific process is executed, execute a determination process of whether a size of first data to be processed in the first step coincides with a first upper size limit defined based on a state of a second step of the plurality of steps, store a result of the determination process in the memory, and in a third step of the plurality of steps, identify the size of the first data with reference to the result stored in the memory, and execute the specific process of the third step based on the identified size of the first data,
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.
First Embodiment
A suffix array is an array obtained by sorting all suffixes of a character string in dictionary order. Taking, as an example, a recursive process executed in generating a suffix array, this embodiment will be described below. First, a recursive process executed in generating a suffix array will be simply described with reference to
In generating a suffix array, in each step, suffixes that are not more than half of target suffixes (that is, 1/c suffixes (c is an integer satisfying the relationship of 1<c≤N), and N represents an initial problem size) are recursively sorted. Thus, suffixes of a step (i+1) correspond to suffixes that are not more than half of suffixes of the step i, and suffixes of a step (i+2) correspond to suffixes that are not more than half of the suffixes of the step (i+1). In the following description, the value of c is 2, that is, c=2, unless otherwise specified.
In each step, target suffixes are sorted by using suffixes sorted in a subsequent step. Specifically, in the step (i+1), target suffixes are sorted by using suffixes sorted in the step (i+2). In the step i, target suffixes are sorted by using the suffixes sorted in the step (i+1). Here, in each step, a problem size (for example, the number of suffixes) of a previous step is used for sorting. Specifically, in the step (i+2), a problem size of the step (i+1) is used for sorting. In the step (i+1), a problem size of the step i is used for sorting.
A first method considered as a method for storing information of a problem size will be described with reference to
The first method enables high-speed processing, but results in an increase in the size of a space used.
A second method considered as a method for storing information of a problem size will be described with reference to
The second method enables a reduction in the size of a space used, but results in low-speed processing.
As described above, the two method described above result in low-speed processing or an increase in the size of a space used. Thus, in this embodiment, the following method to be described reduces the size of a storage space used while maintaining the performance of a recursive process.
The storage processing unit 101, the reconstruction processing unit 103, and the problem processing unit 105 are implemented by a central processing unit (CPU) 2503 executing a program loaded into memory 2501 illustrated in
The storage processing unit 101 generates, based on data stored in the input data storage unit 110, information used for reconstructing information of a problem size and stores the information in the management data storage unit 113. The reconstruction processing unit 103 reconstructs the information of the problem size by using the information stored in the management data storage unit 113 by the storage processing unit 101. The problem processing unit 105 executes a suffix array generation process (including a recursive process) by using the information of the problem size reconstructed by the reconstruction processing unit 103 and stores an execution result in the output data storage unit 114,
Next, a process of storing information used for reconstructing information of a problem size will be described with reference to
First, the storage processing unit 101 calculates in accordance with Li=Mi, mod c (step S1 in
Note that c has been stored in the input data storage unit 110. Mi is calculated in accordance with Mi=floor (Mi−1/c), and M0=N0=N holds. M0, N0, and N have been stored in the input data storage unit 110. Note that M1 is a maximum size of a continuous space usable in the step i (that is, an upper limit of a problem size).
The storage processing unit 101 determines whether Ni=Mi, holds (step S3). When Ni=Mi holds (Yes route in step S3), the storage processing unit 101 sets Pi to 1 (step S5). Then, the storage processing unit 101 stores Pi (that is, 1) in the management data storage unit 113.
On the other hand, when Ni=Mi, does not hold (No route in step S3), the storage processing unit 101 sets Pi to 0 (step S7). Then, the storage processing unit 101 stores Pi (that is, 0) in the management data storage unit 113. Furthermore, the storage processing unit 101 stores Ni in an Mi-th space of a work space (that is, a space of the working data storage unit 111) of the step i (step S9). Ni, has been identified when a call is made from a step (i-1). Then, the process ends.
The work space of the step i includes an Mi number of spaces, and thus Ni is stored in the Mi-th space of the Mi number of spaces in step S9. The Mi-th space is a space corresponding to a difference between Ni, and Mi. This will be described later using a specific example.
As described above, in this embodiment, information of a problem size is not directly stored. Minimum information (that is, Pi and Li ) used for reconstructing the information of the problem size is stored. Although, when Ni ≠Mi holds, the information of the problem size is stored, a space in which the information of the problem size is stored is part of the work space, and thus an additional space does not have to be prepared for storing the information of the problem size.
As a specific example is given later, Li and Pi are stored for each step, but Mi and Ni are stored only for a final step.
Next, a process of reconstructing information of a problem size will be described with reference to
First, the reconstruction processing unit 103 reads Mi and Li−1 from the management data storage unit 113 and calculates Mi−1 in accordance with Mi−1=ceil(c*Mi)+Li−1 (step S11 in
The reconstruction processing unit 103 reads Pi−1 from the management data storage unit 113 and determines whether Pi−1=1 (that is, Mi−1=Ni−1) holds (step S13).
When Pi−1=1 holds (Yes route in step S13), the reconstruction processing unit 103 sets Ni−1 to Mi−1 (step S17). Then, the problem processing unit 105 executes a process for a problem of the step i by using Ni−1 reconstructed by the reconstruction processing unit 103.
On the other hand, when Pi−132 1 does not hold (No route in step S13), the reconstruction processing unit 103 reads a value of an Mi−1-th space of a work space of the step (i−1) from the working data storage unit 111. Then, the reconstruction processing unit 103 sets Ni−1 to the read value (step S15). Then, the problem processing unit 105 executes a process for a problem of the step i by using Ni−1 reconstructed by the reconstruction processing unit 103. Then, the process ends.
A final execution result (that is, a suffix array) is stored in the output data storage unit 114.
As described above, in this embodiment, processing may be executed in each step without storing information itself of a problem size. Thus, in the case where a particularly limited storage space is usable, or in the case where a recursive process is executed on large amounts of data, the number of resources may be reduced, thus enabling cost reduction. Furthermore, a higher processing speed and the omission of unnecessary processing enable an improvement in real-time performance and low power consumption.
In the method according to this embodiment, information used is Mi, Pi, and Li, and thus the time it takes to acquire data is O(1) time in O notation. Also, Pi=1 bit holds, Li=log 2=1 bit holds, and thus a space is O(log N)+(1+1)*h=O(log N)+2 log N bits in O notation. In comparison with the first method described with reference to
State transitions of a work space and a management space (here, space of the management data storage unit 113) in this embodiment will be specifically described with reference to
As illustrated in
As illustrated in
As illustrated in
In this example, assume that the step 3 is a final step. In this case, as illustrated in
Then, a problem size of the step 2-is reconstructed based on a state illustrated in
Then, a problem size of the step 1 is reconstructed based on a state illustrated in
Then, a problem size of the step 0 is reconstructed based on a state illustrated in
When the above-described processing is executed, information of a problem size of a previous step may be identified. As illustrated in
Second Embodiment
The method according to the first embodiment is provided for the case where a problem size of the step (i+1) is 1/c times a problem size of the step i. On the other hand, a method according to the second embodiment is provided for the case where a problem size of the step (i±1) is smaller than a problem size of the step i by c.
In this case, the method according to the first embodiment may be changed as described below,
First, definitions are changed as follows.
(1) c≥1
(2) Mi+1=floor(Mi−c)
(3) h=o(N)
(4) Li=Mi−ceil(c+Mi+1)
Furthermore, when Mi has been identified in the step i, calculations for obtaining Mi−1 and Ni−1 are changed as follows.
(1) Mi=ceil(c+Mi)+Li−1
(2) for the case of Pi−=1, Ni−1=Mi−1
(3) for the case of Pi−1≠1, Ni−1=a value, stored in the Mi−1-th space of the work space of the step (i-1)
When changes are made as described above, with respect to the case as well where a problem size of the step (i+1) is smaller than a problem size of the step i by c, processing may be speeded up, and an increase in the size of a storage space may be inhibited.
Although the embodiments have been described above, the present, disclosure is not limited to these embodiments. For example, in some cases, the functional block configuration of the information processing apparatus 1 described above does not coincide with an actual program module configuration,
Furthermore, the data configuration described above is merely an example, and a data configuration does not have to be such a configuration.
With respect to each process flow, the order in which operations are executed may be rearranged as long as a processing result is the same. Additionally, operations may be executed in parallel.
Furthermore, although a recursive process executed in generating a suffix array has been described as an example the embodiments are applicable to another recursive process. For example, in the case where information before being subjected to a recursive process is used after the recursive process, the processes according to the embodiments are applicable, and thus processes similar to the processes according to the embodiments may be executed for information other than information of a problem size.
Furthermore, the embodiments may be applied to a process that other than a recursive process and in which a specific process is executed repeatedly.
The above-described information processing apparatus 1 is a computer apparatus. As illustrated in
The above-described embodiments are summarized as follows.
A storage method according to a first aspect of the embodiments includes (A), in a first step of a plurality of steps in which a specific process is executed, determining whether a problem size that is a size to be processed in the first step coincides with an upper size limit defined based on a state of a step previous to the first step; (B) storing a determination result in a storage space; and (C), in a second step that is a step subsequent to the first step, identifying the problem size by using the determination result stored in the storage space, and executing the specific process of the second step based on the identified problem size.
This makes it possible to inhibit an increase in the storage space that stores information of a problem size of a previous step. Thus, in the case where the specific process is executed repeatedly, an increase in the size of the storage space used during the execution is inhibited.
The storage method may further include (D), in the first step, storing information of a remainder obtained by dividing the upper size limit by a certain number in the storage space. Then, in the executing the specific process of the second step, (c1), when the determination result represents that the problem size coincides with the upper size limit, the problem size may be identified by using the information of the remainder stored in the storage space,
This makes it possible to deal with a problem even when the problem size is not able to be identified only by the determination result.
The storage method may further include (E), when the problem size does not coincide with the upper size limit, in the first step, storing information of the problem size in a space corresponding to a difference between the problem size and the upper size limit. Then, in the executing the specific process of the second step, (c2), when the determination result represents that the problem size does not coincide with the upper size limit, the problem size may be read from the space corresponding to the difference.
The information of the problem size is stored in the space not used, and thus newly used spaces do not increase in number,
In the executing the specific process of the second step, (c3) the problem size may be identified by using a value obtained by multiplying an upper size limit of a size to be processed in the second step by the certain number, and the information of the remainder.
A storage apparatus according to a second aspect of the embodiments includes (F) a storing processing unit (for example, the storage processing unit 101) configured to, in a first step of a plurality of steps in which a specific process is executed, determine whether a problem size that is a size to be processed in the first step coincides with an upper size limit defined based on a state of a step previous to the first step, and store a determination result in a storage space (for example, a space of the management data storage unit 113); and (G) an execution unit (for example, the reconstruction processing unit 103) configured to, in a second step that is a step subsequent to the first step, identify the problem size by using the determination result stored in the storage space, and execute the specific process of the second step based on the identified problem size.
A program for causing a processor to execute processing according to the above-described method may be created. The program is stored in computer-readable storage media or storage devices, such as a flexible disk, a compact disc read only memory (CD-ROM), a magnetic optical disk, a semiconductor memory, and a hard disk. An intermediate processing result is temporarily stored in a storage device, such as main memory.
All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the invention and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although the embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made, hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2017-025400 | Feb 2017 | JP | national |