1. Field of the Invention
The present invention relates to a Secure Hash Algorithm (SHA) operation method, a SHA operation circuit and a hash operation circuit for calculating the hash value of input data.
2. Description of the Background Art
In recent years, there has been a growing trend toward hardware implementation of security algorithms in order to speed up security processing. Consequently, the Secure Hash Algorithm (SHA) is in many cases implemented with digital operation circuits. SHA is used in authentication, digital signatures and tamper detection, and is implemented as one of several algorithms, such as SHA-1, SHA-224, SHA-256, SHA-384 and SHA-512.
In this situation, it has been proposed to use a pipeline for performing hash operation at a high speed in a conventional SHA high speed arithmetic circuit, for example, as described in Japanese patent laid-open publication No. 2002-287635.
However, while the conventional circuit can speed up the hash operation, it is difficult with that circuit to reduce the power consumption of the hardware used for the hash operation. It is therefore necessary to reduce the power consumption of the hardware used for the SHA operation.
It is an object of the present invention to provide a SHA operation method, a SHA operation circuit and a hash operation circuit, in which it is possible to reduce the power consumption of the hardware.
In accordance with the present invention, a SHA operation method comprises a store step of storing, in first to fifth storages, first to fifth variables for use in a hash operation on the basis of the SHA-1 algorithm, a permutation step of permuting the first to fifth variables stored as first to fifth intermediate values, respectively, a first calculation step of calculating a temporary value of the SHA-1 algorithm by the use of input data and the first to fifth intermediate values permuted, a selection step of selecting one of the first to fifth storages which is for use in storing the intermediate value calculated, a maintenance step of storing the temporary value in the selected storage, bitwise-rotating one of the variables which is stored in another of the storages to store a resultant value in the other storage, and maintaining the variables stored in the remaining storages without change, said maintenance step being performed successively in a timing in synchronization with clock cycles, and a second calculation step of calculating the hash value of the input data by the use of the variables held in the storages.
In an aspect of the invention, when at least one of the first to fifth intermediate values is permutated by bitwise rotation, in said calculation step, the temporary value is calculated by the use of the intermediate values bitwise rotated, and said maintenance step maintains, while the temporary value calculated by the use of the intermediate values is stored in the selected storage, the variable stored in the other storage without bitwise rotation as well as the variables stored in the remaining storages.
Further, in accordance with the present invention, a SHA operation method comprises a store step of storing, in first to eighth storages, first to eighth variables for use in calculating the hash value of input data on the basis of a SHA-2 algorithm, a permutation step of permuting the first to eighth variables stored as first to eighth intermediate values, a first calculation step of calculating a first and a second temporary value of the SHA-2 algorithm by the use of the input data and the first to eighth intermediate values permuted, a selection step of selecting two of the first to eighth storages for use in storing the intermediate values calculated respectively, a maintenance step of storing the temporary values in the selected storages, and maintaining the variables stored in the remaining storages without change, said maintenance step being performed successively in a timing in synchronization with clock cycles, and a second calculation step of calculating the hash value of the input data by the use of the variables held in the storages.
Furthermore, in accordance with the present invention, a SHA operation circuit comprises a permutation section permuting first to fifth variables stored in first to fifth storages, respectively, for use in a hash operation on the basis of the SHA-1 algorithm as first to fifth intermediate values, respectively, an arithmetic unit calculating a temporary value of the SHA-1 algorithm by the use of input data and the first to fifth intermediate values permuted, a selector selecting one of said first to fifth storages which is for use in storing the intermediate value calculated, said first to fifth storages being operable in synchronization with clock cycles to store the temporary value in said selected storage, to bitwise-rotate the variable stored in another of the storages to store a resultant value in said other storage, and to maintain the variables stored in remaining ones of said storages without change, and a hash holder operable to calculate the hash value of the input data by the use of the variables stored in said first to fifth storages and holding the hash value.
Further, in accordance with the present invention, a SHA operation circuit comprises a permutation section permuting first to eighth variables stored in first to eighth storages, respectively, for use in a hash operation on the basis of the SHA-2 algorithm, as first to eighth intermediate values, respectively, an arithmetic unit calculating a first and a second temporary value of the SHA-2 algorithm by the use of input data and the first to eighth intermediate values permuted, a selector selecting two of said first to eighth storages for use in storing the first and second intermediate values calculated respectively, said first to eighth storages being sequentially operable in synchronization with clock cycles to store the first and second temporary values in said selected storages respectively, and maintaining the variables stored in the remaining storages without change, and a hash holder calculating the hash value of the input data by the use of the variables stored in said first to eighth storages and holding the hash value.
Moreover, in accordance with the present invention, a hash operation circuit is provided for generating a message digest on the basis of a hash algorithm from input data partitioned into a plurality of blocks which are sequentially processed one after another, wherein each block is partitioned into a plurality of data segments and processed by sequentially processing the data segments one after another. Said hash operation circuit comprises a hash holder holding intermediate hash values, a plurality of storages connected to said hash holder and operable to store arithmetic values, a permutation section connected to said plurality of storages and operable to permute the arithmetic values, an arithmetic unit connected to said permutation section and said storages for performing an arithmetic operation on input data and the arithmetic values permuted on the basis of the hash algorithm and partially replacing the arithmetic values stored in said plurality of storages by the result of the arithmetic operation in accordance with the hash algorithm. Said plurality of storages, permutation section and arithmetic unit are synchronously driven in order to repeatedly process the arithmetic values while the data segments are input one after another as the input data. When all the data segments of each block are processed, said hash holder holds next intermediate hash values that are calculated on the basis of the arithmetic values stored in said storages and the previous intermediate hash values, and initializes the arithmetic values by the next intermediate hash values for starting processing the next block, wherein after processing all the blocks the message digest is formed from the intermediate hash values. The permutation is performed in order that, when each data segment is processed, at least one of said plurality of storages is supplied with no clock signal to thereby maintain the values stored therein without change.
The objects and features of the present invention will become more apparent from consideration of the following detailed description taken in conjunction with the accompanying drawings in which:
An illustrative embodiment of a hash operation process of the SHA-1 algorithm according to the present invention will hereinafter be described. Meanwhile, in what follows, like elements and components are designated with the same reference numerals, and no redundant description is repeated thereon.
In advance of describing the illustrative embodiment, the hash arithmetic operation according to SHA-1 will be described on the basis of the specifications prescribed by the Federal Information Processing Standards Publication (FIPS PUB) 180-2, “Secure Hash Standard”, published by the National Institute of Standards and Technology.
First, a message as target or input data of the hash operation is padded to make the total length of the padded message a multiple of 512 bits. The process of padding is specified in the above Secure Hash Standard. The message is then partitioned into 512-bit blocks, which are named blocks M(1), M(2), . . . and M(N) respectively, where N is a positive integer. The message can have any length less than 2^64 bits.
Next, the blocks M(1), M(2), . . . and M(N) are processed one after another in this order respectively in the following steps 1 and 2.
In step 1, each block M(i) (i=1 to N) is partitioned into 32-bit words W0, W1, . . . and W15 (Wt: t=0 to 15). In step 2, each block M(i) is processed to generate intermediate hash values H0(i) to H4(i) (i=1 to N) by the use of 32-bit length variables a, b, c, d and e. In other words, the step 2 is implemented by a one-way compression function serving to compress each 512-bit block M(i) into a 160-bit intermediate hash value. This intermediate hash value is used as the start hash value by the one-way compression function when compressing the next block M(i+1). The hash value or message digest can be obtained after processing the final block M(N).
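The padding and the partitioning of steps 1 and 2 may be illustrated by the following Python sketch. It assumes the padding rule of the Secure Hash Standard (a single "1" bit, then "0" bits, then the 64-bit message length in bits). The function names `pad_message`, `split_blocks` and `block_words` are hypothetical and are introduced here only for illustration.

```python
import struct

def pad_message(message: bytes) -> bytes:
    """Pad so that the total length is a multiple of 512 bits (64 bytes)."""
    bit_length = 8 * len(message)
    padded = message + b"\x80"                      # a single '1' bit, then zeros
    padded += b"\x00" * ((56 - len(padded)) % 64)   # leave room for the length field
    return padded + struct.pack(">Q", bit_length)   # 64-bit big-endian bit length

def split_blocks(padded: bytes) -> list:
    """Partition the padded message into 512-bit blocks M(1) .. M(N)."""
    return [padded[i:i + 64] for i in range(0, len(padded), 64)]

def block_words(block: bytes) -> list:
    """Step 1: partition one block M(i) into sixteen 32-bit words W0 .. W15."""
    return list(struct.unpack(">16I", block))

blocks = split_blocks(pad_message(b"abc"))
words = block_words(blocks[0])
```

For the three-byte message "abc", a single block results, in which W0 carries the message bytes followed by the "1" bit and W15 carries the message length of 24 bits.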
Next, the process in step 2 will be described with reference to
Next, in each process, i.e. the arithmetic operation performed in one clock cycle, for t=0 to 79, the variables “a” to “e” are successively calculated by the following expressions (1) in the timing on the basis of clock cycles.
e=d
d=c
c=ROTL30(b)
b=a
a=TEMP (1)
Incidentally, ROTL30(b) of the expressions (1) is defined as a 30-bit rotation of the variable “b” to the left. Also, TEMP is a temporary variable given in the following expression (2).
TEMP=ROTL5(a)+ft(b,c,d)+e+Kt+Wt (2)
ROTL5(a) of the expression (2) is defined as a 5-bit rotation of the variable “a” to the left, and each ft (b, c, d) is defined as a function having three 32-bit arguments, i.e. “b”, “c” and “d”. Each Kt is a constant given in accordance with the specifications of the above Secure Hash Standard. Wt (t=16 to 79) is successively calculated by the use of Wt (t=0 to 15).
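As a hedged illustration of expressions (1) and (2), one ordinary SHA-1 process may be sketched in Python as follows. The definitions of ft, Kt and the derivation of Wt follow the Secure Hash Standard, and all additions are taken modulo 2^32. The names `rotl`, `f_t`, `k_t`, `expand_schedule` and `sha1_round` are hypothetical.

```python
MASK32 = 0xFFFFFFFF

def rotl(value: int, bits: int) -> int:
    """Rotate a 32-bit value to the left by the given number of bits."""
    return ((value << bits) | (value >> (32 - bits))) & MASK32

def f_t(t: int, b: int, c: int, d: int) -> int:
    """Function ft(b, c, d) of the Secure Hash Standard."""
    if t < 20:
        return (b & c) | (~b & d & MASK32)
    if 40 <= t < 60:
        return (b & c) | (b & d) | (c & d)
    return b ^ c ^ d                      # t = 20..39 and t = 60..79

def k_t(t: int) -> int:
    """Constant Kt of the Secure Hash Standard."""
    return (0x5A827999, 0x6ED9EBA1, 0x8F1BBCDC, 0xCA62C1D6)[t // 20]

def expand_schedule(words16: list) -> list:
    """Derive Wt (t = 16..79) from Wt (t = 0..15)."""
    w = list(words16)
    for t in range(16, 80):
        w.append(rotl(w[t - 3] ^ w[t - 8] ^ w[t - 14] ^ w[t - 16], 1))
    return w

def sha1_round(state: tuple, t: int, w_t: int) -> tuple:
    """Apply expressions (1) and (2) once; all five variables are rewritten."""
    a, b, c, d, e = state
    temp = (rotl(a, 5) + f_t(t, b, c, d) + e + k_t(t) + w_t) & MASK32
    return (temp, a, rotl(b, 30), c, d)

initial = (0x67452301, 0xEFCDAB89, 0x98BADCFE, 0x10325476, 0xC3D2E1F0)
after_t0 = sha1_round(initial, 0, 0x61626380)   # W0 of the one-block message "abc"
schedule = expand_schedule([0x61626380] + [0] * 14 + [24])
```

It will be noted that, in this ordinary process, every one of the five variables receives a new value in each clock cycle even though four of them are merely copied or rotated.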
Then, the intermediate hash values H0(i) to H4(i) are calculated from the variables “a” to “e” and the previous intermediate hash values H0(i−1) to H4(i−1), where i=1 to N, in accordance with the Compute process of
H0(i)=H0(i−1)+a
H1(i)=H1(i−1)+b
H2(i)=H2(i−1)+c
H3(i)=H3(i−1)+d
H4(i)=H4(i−1)+e (3)
After the above process for obtaining the intermediate hash values is repeated for N times, the message digest or hash value is finally obtained by the following expression (4) where the symbol “∥” is used to denote the bitwise concatenation.
H0(N)∥H1(N)∥H2(N)∥H3(N)∥H4(N) (4)
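Expressions (3) and (4) may be sketched as follows. The sketch assumes, in accordance with the Secure Hash Standard, that each addition is performed modulo 2^32 and that each 32-bit value is concatenated most significant byte first. The names `compute` and `message_digest` are hypothetical.

```python
MASK32 = 0xFFFFFFFF

def compute(previous_hash: list, variables: list) -> list:
    """Expressions (3): add each variable to the previous intermediate hash value."""
    return [(h + v) & MASK32 for h, v in zip(previous_hash, variables)]

def message_digest(final_hash: list) -> bytes:
    """Expression (4): concatenate H0(N) .. H4(N) as big-endian 32-bit values."""
    return b"".join(h.to_bytes(4, "big") for h in final_hash)

# The addition wraps around at 32 bits.
wrapped = compute([0xFFFFFFFF, 0, 0, 0, 0], [2, 1, 1, 1, 1])

# Final values H0(1) .. H4(1) of the well-known one-block message "abc".
digest_hex = message_digest([0xA9993E36, 0x4706816A, 0xBA3E2571,
                             0x7850C26C, 0x9CD0D89D]).hex()
```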
Now, with reference to
The hash holder 11 serves to hold intermediate hash values H0, H1, H2, H3 and H4 in correspondence with the five storages 12A to 12E. More specifically, the hash holder 11 is initialized by hash start values H0(0) to H4(0) and then receives and holds the intermediate hash values H0(i) to H4(i) from the five storages 12A to 12E.
Each of the storages 12A to 12E is implemented with flip-flops which store the variables “a” to “e” of
The ordinary operation process of SHA-1 has generally been discussed above. Now, a description will be made of the feature of the operation process of SHA-1 in accordance with the illustrative embodiment.
As shown in
In each process for t=0 to 79, one of the values “a” to “e” is used to save the temporary value TEMP while another value “*” of the values “a” to “e” is used to save ROTL30 (*); as will be more accurately described in the following. The remaining three values are maintained as they are in the previous process.
For example, when t=1, the value “d” is used to save TEMP, and the value “a” is used to save ROTL30 (a). On the other hand, the remaining values “b”, “c” and “e” hold the previous values (“b”, “c” and “e”) which were loaded in the previous process when t=0. It will be understood that, by the processes shown in
The temporary value TEMP of
TEMP=ROTL5(v)+ft(w,x,y)+z+Kt+Wt (5)
Incidentally, ROTL5 (v) of the expression (5) is defined as a 5-bit rotation of the value “v” to the left. In this case, the values “a” to “e” are permuted and used as the intermediate values “v” to “z” in a one-to-one correspondence. This correspondence is cyclically changed as shown in
For example, when t=0, the values “a” to “e” after initialization are used as the variables of the expression (5) after permutation as v=a, w=b, x=c, y=d and z=e. Also, when t=1, the values “a” to “e” obtained after t=0 are used as the variables of the expression (5) after permutation as v=e, w=a, x=b, y=c and z=d, refer to
After completing the operation process through t=0 to 79, the Compute process of
This is because when t=0, 5, . . . 70, 75, the correspondence relationship between the intermediate values “v” to “z” and the values “a” to “e” is the same as v=a, w=b, x=c, y=d and z=e, refer to
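The permuted operation process described above may be modeled by the following Python sketch, which is only an illustration and not the circuit itself. The index formula (j − t) mod 5 is an assumption inferred from the correspondences stated for t=0, t=1 and t=0, 5, . . . , 75. The names `sha1_in_place`, `reg` and `writes` are hypothetical, and the result is cross-checked against the standard `hashlib` module.

```python
import hashlib
import struct

MASK32 = 0xFFFFFFFF
K = (0x5A827999, 0x6ED9EBA1, 0x8F1BBCDC, 0xCA62C1D6)

def rotl(value, bits):
    return ((value << bits) | (value >> (32 - bits))) & MASK32

def f_t(t, b, c, d):
    if t < 20:
        return (b & c) | (~b & d & MASK32)
    if 40 <= t < 60:
        return (b & c) | (b & d) | (c & d)
    return b ^ c ^ d

def sha1_in_place(message: bytes):
    """Return (digest, number of storage writes) for the permuted process."""
    padded = message + b"\x80"
    padded += b"\x00" * ((56 - len(padded)) % 64) + struct.pack(">Q", 8 * len(message))
    h = [0x67452301, 0xEFCDAB89, 0x98BADCFE, 0x10325476, 0xC3D2E1F0]
    writes = 0
    for offset in range(0, len(padded), 64):
        w = list(struct.unpack(">16I", padded[offset:offset + 64]))
        for t in range(16, 80):
            w.append(rotl(w[t - 3] ^ w[t - 8] ^ w[t - 14] ^ w[t - 16], 1))
        reg = list(h)                        # storages holding "a" .. "e"
        for t in range(80):
            # Assumed permutation: the storages read as v, w, x, y, z shift by one per step.
            iv, iw, ix, iy, iz = [(j - t) % 5 for j in range(5)]
            temp = (rotl(reg[iv], 5) + f_t(t, reg[iw], reg[ix], reg[iy])
                    + reg[iz] + K[t // 20] + w[t]) & MASK32
            reg[iz] = temp                   # TEMP replaces the value that is no longer needed
            reg[iw] = rotl(reg[iw], 30)      # one value is rotated within its own storage
            writes += 2                      # the other three storages keep their values
        # After 80 steps the correspondence is again v=a .. z=e, so expressions (3) apply.
        h = [(x + y) & MASK32 for x, y in zip(h, reg)]
    return struct.pack(">5I", *h), writes

digest, writes = sha1_in_place(b"abc")
matches_reference = digest == hashlib.sha1(b"abc").digest()
```

In this model, two storages are written per step, i.e. 160 writes per block, whereas the ordinary process of expressions (1) rewrites all five storages in each of the 80 steps.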
As has been discussed above, in accordance with the operation process of
Next, a SHA operation circuit or SHA calculation device for performing the hash operation of SHA-1 of
The selector 16 selects one of the storages in which TEMP is to be stored in accordance with the operation process of
The permutation section 17 is made, for example, of a number of 5-1 selectors which serve to output the values “a” to “e”, which are acquired from the respective storages 121A to 121E, as the intermediate values “v” to “z” after permutation in accordance with the correspondence relationship stored in
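The behavior of such a permutation section may be illustrated by the following sketch, in which each output “v” to “z” is modeled as a 5-1 selector over the values “a” to “e”. The select rule (j − t) mod 5 is an assumption inferred from the correspondences given for t=0 and t=1 and from the five-step cycle. The names `select_lines` and `permute` are hypothetical.

```python
STORAGES = ("a", "b", "c", "d", "e")
OUTPUTS = ("v", "w", "x", "y", "z")

def select_lines(t: int) -> dict:
    """For each intermediate value, name the stored value its selector passes through."""
    return {out: STORAGES[(j - t) % 5] for j, out in enumerate(OUTPUTS)}

def permute(values: dict, t: int) -> dict:
    """Route the stored values "a" .. "e" to the outputs "v" .. "z" for step t."""
    return {out: values[src] for out, src in select_lines(t).items()}

at_t0 = select_lines(0)
at_t1 = select_lines(1)
at_t5 = select_lines(5)
routed = permute({"a": 1, "b": 2, "c": 3, "d": 4, "e": 5}, 1)
```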
The arithmetic unit 131 calculates TEMP on the basis of the intermediate values “v” to “z” obtained from the permutation section 17, Wt output from the first constant setting section 14, and Kt output from the second constant setting section 15 in accordance with the process of
Each of the storages 121A to 121E is implemented with flip-flops or a 32-bit D-type latch, and stores a predetermined one of the values “a” to “e”, TEMP or the ROTL30 value in accordance with the selection signal output from the selector 16. Then, after completing the process when t=79, the respective storages 121A to 121E output the values “a” to “e” to the hash holder 11. Receiving the values “a” to “e”, the hash holder 11 performs the Compute process of
For example, when t=1, the storage 121D is used to save TEMP, and the storage 121A is used to save the result of the ROTL30 (a) operation. Then, the remaining storages 121B, 121C and 121E hold the previous values (“b”, “c” and “e”), as they are, which were loaded in the previous process. By this configuration, no value is moved along the storages 121A to 121E of
The first storage 121A will specifically be described with reference to
In
As has been discussed above, the SHA operation circuit 10 of
Particularly, in the case of cellular phones, mobile devices, intelligent transport systems (ITSs) and the like implemented with a number of battery-powered circuits, it is effective to use the SHA operation circuit 10 or the SHA operation method of
Furthermore, the clock supply to the respective storages 121A to 121E may be performed through a clock gating circuit in order to halt the clock supply to the storages which maintain the values stored therein. In this case, it is possible to further reduce the power consumption.
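The effect of such clock gating may be estimated with the following sketch, which derives a per-storage clock enable for each step and counts how many storages would be clocked over one block. The target rule is an assumption chosen to agree with the example for t=1 (storage 121D receives TEMP and storage 121A receives its ROTL30 value). The names `clock_enables` and `clocked_storage_count` are hypothetical.

```python
STORAGES = ("121A", "121B", "121C", "121D", "121E")

def clock_enables(t: int) -> dict:
    """Return, per storage, whether it would be clocked in step t (assumed mapping)."""
    temp_target = STORAGES[(4 - t) % 5]     # storage that receives TEMP
    rotl_target = STORAGES[(1 - t) % 5]     # storage rewritten with its ROTL30 value
    return {name: name in (temp_target, rotl_target) for name in STORAGES}

def clocked_storage_count(steps: int = 80) -> int:
    """Count clocked storages over the steps of one block."""
    return sum(sum(clock_enables(t).values()) for t in range(steps))

enables_t1 = clock_enables(1)
gated_total = clocked_storage_count()       # permuted process with gated clocks
ungated_total = 5 * 80                      # every storage clocked in every step
```

Under these assumptions, 160 storage clockings per block remain out of 400. The count says nothing about absolute power, which depends on the actual circuit.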
Moreover, the SHA-1 operation process in accordance with the embodiment of the present invention may be modified as described hereinafter.
As shown in
Next, in each process for t=0 to 79, one of the values “a” to “e” is used to save the temporary value TEMP while the remaining four values are maintained as they are in the previous process.
For example, when t=1, the value “d” is used to save TEMP, and the remaining values “a” to “c” and “e” are maintained as they are in the previous process when t=0. By this configuration, it will be understood that the number of times the values “a” to “e” of
In this case, TEMP of
TEMP=ROTL5(v)+ft(w,x,y)+z+Kt+Wt
x=ROTL30(xtemp)
y=ROTL30(ytemp)
z=ROTL30(ztemp) (6)
For example, when t=0, the values “a” to “e” after the Initialize process are used as the variables of the expressions (6) after permutation as v=a, w=b, xtemp=c, ytemp=d, and ztemp=e. Also, for example, when t=1, the values “a” to “e” obtained after t=0 are permuted as v=e, w=a, xtemp=b, ytemp=c, ztemp=d, refer to
Then, in the Compute process of
Incidentally, the reason for adding the ROTL30 values to the remaining three intermediate hash values H2(i) to H4(i) is as follows. For t=77 to 79 as enclosed with bold line in
Also, in the Initialize process of
As has been discussed above, since TEMP is calculated by the use of the intermediate values xtemp to ztemp introduced anew in the process operation of
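The modified process of expressions (6) may be modeled as follows. This is a sketch under explicit assumptions: the Initialize process is assumed to load “c”, “d” and “e” rotated left by 2 bits so that the ROTL30 applied on reading restores H2 to H4, and the Compute process is assumed to add ROTL30 of “c”, “d” and “e” to H2 to H4. The names `sha1_deferred_rotation`, `reg` and `writes` are hypothetical, and the result is cross-checked against the standard `hashlib` module.

```python
import hashlib
import struct

MASK32 = 0xFFFFFFFF
K = (0x5A827999, 0x6ED9EBA1, 0x8F1BBCDC, 0xCA62C1D6)

def rotl(value, bits):
    return ((value << bits) | (value >> (32 - bits))) & MASK32

def f_t(t, b, c, d):
    if t < 20:
        return (b & c) | (~b & d & MASK32)
    if 40 <= t < 60:
        return (b & c) | (b & d) | (c & d)
    return b ^ c ^ d

def sha1_deferred_rotation(message: bytes):
    """Return (digest, number of storage writes); only TEMP is ever stored."""
    padded = message + b"\x80"
    padded += b"\x00" * ((56 - len(padded)) % 64) + struct.pack(">Q", 8 * len(message))
    h = [0x67452301, 0xEFCDAB89, 0x98BADCFE, 0x10325476, 0xC3D2E1F0]
    writes = 0
    for offset in range(0, len(padded), 64):
        w = list(struct.unpack(">16I", padded[offset:offset + 64]))
        for t in range(16, 80):
            w.append(rotl(w[t - 3] ^ w[t - 8] ^ w[t - 14] ^ w[t - 16], 1))
        # Assumed Initialize: ROTL2 followed by ROTL30 is a full 32-bit turn.
        reg = [h[0], h[1], rotl(h[2], 2), rotl(h[3], 2), rotl(h[4], 2)]
        for t in range(80):
            iv, iw, ix, iy, iz = [(j - t) % 5 for j in range(5)]
            # Expressions (6): xtemp, ytemp, ztemp are rotated on the way to the adder.
            x, y, z = rotl(reg[ix], 30), rotl(reg[iy], 30), rotl(reg[iz], 30)
            temp = (rotl(reg[iv], 5) + f_t(t, reg[iw], x, y)
                    + z + K[t // 20] + w[t]) & MASK32
            reg[iz] = temp                   # the only storage written in this step
            writes += 1
        # Assumed Compute: the last three values were stored without rotation.
        h = [(h[0] + reg[0]) & MASK32, (h[1] + reg[1]) & MASK32,
             (h[2] + rotl(reg[2], 30)) & MASK32,
             (h[3] + rotl(reg[3], 30)) & MASK32,
             (h[4] + rotl(reg[4], 30)) & MASK32]
    return struct.pack(">5I", *h), writes

digest, writes = sha1_deferred_rotation(b"abc")
matches_reference = digest == hashlib.sha1(b"abc").digest()
```

In this model one storage is written per step, i.e. 80 writes per block, at the cost of three rotations on the read path. In hardware, a fixed rotation is typically a matter of wiring rather than of additional logic.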
Next, the hash operation circuit for performing the hash arithmetic operation of
The permutation section 17 outputs three of the values “a” to “e” obtained from the respective storages 121A to 121E as the intermediate values xtemp, ytemp and ztemp in accordance with the operation process of
For example, when t=0, the permutation section 17 outputs the values “c” to “e” obtained from the respective storages 121C to 121E in the Initialize process as the intermediate values xtemp, ytemp and ztemp respectively. By this process, xtemp=c, ytemp=d and ztemp=e. Furthermore, the permutation section 17 outputs the values “a” and “b” obtained from the respective storages 121A and 121B in the Initialize process as the intermediate values “v” and “w” respectively.
In addition, the permutation section 17 performs the ROTL30 operation of xtemp, ytemp and ztemp in accordance with the operation process of
The arithmetic unit 131 calculates TEMP on the basis of the intermediate values “v” to “z” obtained from the permutation section 17, Wt output from the first constant setting section 14, and Kt output from the second constant setting section 15 in accordance with the process of
Each of the storages 121A to 121E stores a predetermined one of the values “a” to “e” or TEMP in accordance with the selection signal output from the selector 16.
For example, when t=1 in
Still further, this modification example is particularly advantageous in that it does not require hardware expansion of the SHA operation circuit 10.
In accordance with the expressions (6), three rotation operations are performed for each process, whereas only one operation is performed in the case of the process shown in
Now, a description will be made on a power verification simulation of a 0.15 μm semiconductor process which was conducted in accordance with the process of
For example, under a first condition (a power supply voltage of 1.65 V, an ambient temperature of −40° C.), the power consumption of the SHA operation circuit 100 (refer to
Furthermore, under a second condition (a power supply voltage of 1.35 V, an ambient temperature of 125° C.), the power consumption of the SHA operation circuit 100 (refer to
Now, an alternative embodiment of a hash operation process of the SHA-2 algorithm according to the present invention will hereinafter be described. In the alternative embodiment, the hash arithmetic operation will be described on the basis of SHA-2 (SHA-224, 256, 384 and 512) in a simplified manner for the sake of clarity.
In advance of describing the alternative embodiment, the ordinary SHA-2 hash arithmetic operation will be described. First, a message as target data of the hash operation is padded to make the total length of the padded message a multiple of a predetermined block length (512 bits in the case of SHA-224 or 256, 1024 bits in the case of SHA-384 or 512) in accordance with the above Secure Hash Standard, and then partitioned into blocks having the predetermined bit length, which are named blocks M(1), M(2), . . . and M(N) respectively, where N is a positive integer.
Next, the blocks M(1), M(2), . . . and M(N) are processed one after another in this order respectively in the following steps 1A and 2A.
In step 1A, each block M(i) is partitioned into 16 words W0, W1, . . . and W15 (Wt: t=0 to 15). A word equals a 32-bit string in the case of SHA-224 or 256. In the case of SHA-384 or 512, however, a 64-bit string is handled as one word or data segment corresponding to the value of a variable in SHA calculation.
In step 2A, each block M(i) (i=1 to N) is processed to generate an intermediate hash value H0(i) to H7(i) (i=1 to N) by the use of variables “a”, “b”, “c”, “d”, “e”, “f”, “g” and “h” having the bit length corresponding to the word. Namely, the bit length is 32 bits in the case of SHA-224 or 256, and 64 bits in the case of SHA-384 or 512.
Next, the process in step 2A will be described with reference to
As shown in
Next, in each process for t=0 to 79, the variables “a” to “h” are successively calculated by the following expressions (7).
h=g
g=f
f=e
e=T1+d
d=c
c=b
b=a
a=T1+T2 (7)
The values of T1 and T2 of the expressions (7) are given by the following expressions (8). In this description, T1+d is denoted by T3 for the sake of clarity.
T1=Σ1(e)+Ch(e,f,g)+h+Kt+Wt
T2=Σ0(a)+Maj(a,b,c) (8)
where the values Σ0, Σ1, Ch, Maj, Kt and Wt are specified in the above Secure Hash Standard.
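By way of illustration, one ordinary process of the 32-bit variants (SHA-224 or 256) may be sketched as follows, with Σ0, Σ1, Ch and Maj as defined in the Secure Hash Standard and all additions taken modulo 2^32. SHA-384 and 512 use 64-bit words and different rotation amounts, which this sketch does not cover. The names `rotr`, `big_sigma0`, `big_sigma1`, `ch`, `maj` and `sha256_round` are hypothetical.

```python
MASK32 = 0xFFFFFFFF

def rotr(value: int, bits: int) -> int:
    """Rotate a 32-bit value to the right by the given number of bits."""
    return ((value >> bits) | (value << (32 - bits))) & MASK32

def big_sigma0(x: int) -> int:
    return rotr(x, 2) ^ rotr(x, 13) ^ rotr(x, 22)

def big_sigma1(x: int) -> int:
    return rotr(x, 6) ^ rotr(x, 11) ^ rotr(x, 25)

def ch(x: int, y: int, z: int) -> int:
    return (x & y) ^ (~x & z & MASK32)

def maj(x: int, y: int, z: int) -> int:
    return (x & y) ^ (x & z) ^ (y & z)

def sha256_round(state: tuple, k_t: int, w_t: int) -> tuple:
    """Apply the ordinary process once; all eight variables are rewritten."""
    a, b, c, d, e, f, g, h = state
    t1 = (big_sigma1(e) + ch(e, f, g) + h + k_t + w_t) & MASK32
    t2 = (big_sigma0(a) + maj(a, b, c)) & MASK32
    t3 = (t1 + d) & MASK32                  # T3 = T1 + d
    return ((t1 + t2) & MASK32, a, b, c, t3, e, f, g)

initial = (0x6A09E667, 0xBB67AE85, 0x3C6EF372, 0xA54FF53A,
           0x510E527F, 0x9B05688C, 0x1F83D9AB, 0x5BE0CD19)
# K0 and W0 for the first process of the one-block message "abc" under SHA-256.
after_t0 = sha256_round(initial, 0x428A2F98, 0x61626380)
```

Only the first and fifth variables receive newly calculated values; the other six are copies of their neighbors.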
Then, the intermediate hash values H0(i) to H7(i) where i=1 to N, are calculated by the following expressions (9) as shown in the Compute process of
H0(i)=H0(i−1)+a
H1(i)=H1(i−1)+b
H2(i)=H2(i−1)+c
H3(i)=H3(i−1)+d
H4(i)=H4(i−1)+e
H5(i)=H5(i−1)+f
H6(i)=H6(i−1)+g
H7(i)=H7(i−1)+h (9)
After the above process for obtaining the intermediate hash values is repeated for N times, the message digest is finally obtained by the following expression (10).
H0(N)∥H1(N)∥H2(N)∥H3(N)∥H4(N)∥H5(N)∥H6(N)∥H7(N) (10)
Now, a SHA-2 operation process and a SHA-2 operation circuit in accordance with the alternative embodiment of the present invention will be described with reference to
The operation process in accordance with the alternative embodiment of
As shown in
In each process for t=0 to 79, one of the values “a” to “h” is used to save T1+T2 while another value is used to save T3. The remaining six values are maintained as they are in the previous process.
For example, when t=1, “c” is used to save T1+T2, and “g” is used to save T3. The remaining values “a”, “b”, “d”, “e”, “f” and “h” are maintained as they are in the previous process when t=0. It will be understood that, by the processes shown in
The values T1, T2 and T3 in
T1=Σ1(w1)+Ch(w1,x1,y1)+z1+Kt+Wt
T2=Σ0(w0)+Maj(w0,x0,y0)
T3=T1+z0 (11)
In this case, the values “a” to “d” are permuted and used as the intermediate values w0 to z0, and the values “e” to “h” are permuted and used as the intermediate values w1 to z1. This correspondence is cyclically changed as shown in
After completing the operation process through t=0 to 79, the Compute process of
This is because when t=0, 4, . . . , and 76, the correspondence relationship between the intermediate values w0 to z0 and w1 to z1 and the values “a” to “h” is the same as w0=a, x0=b, y0=c, z0=d, w1=e, x1=f, y1=g, and z1=h, refer to
Incidentally, this is true also in the case of SHA-224 and 256 having 64 processes (t=0 to 63) because 64 processes are a multiple of the four-process cycle. After the above process for obtaining the intermediate hash values is repeated for N times, the message digest is finally obtained by the expression (10).
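The permuted SHA-2 process may be modeled for SHA-256 by the following sketch, which is an illustration only. The index rule (j − t) mod 4 within each group is an assumption inferred from the four-process cycle and from the example for t=1. To avoid a long table, the constants Kt and the start values are derived from the roots of the first primes as the Secure Hash Standard defines them. The names `sha256_in_place`, `icbrt`, `reg` and `writes` are hypothetical, and the result is cross-checked against the standard `hashlib` module.

```python
import hashlib
import math
import struct

MASK32 = 0xFFFFFFFF

def rotr(value, bits):
    return ((value >> bits) | (value << (32 - bits))) & MASK32

def icbrt(n):
    """Integer cube root by bisection."""
    lo, hi = 0, 1 << (n.bit_length() // 3 + 2)
    while lo < hi:
        mid = (lo + hi + 1) // 2
        if mid ** 3 <= n:
            lo = mid
        else:
            hi = mid - 1
    return lo

# First 64 primes; the constants are the first 32 fractional bits of their roots.
PRIMES = [p for p in range(2, 312) if all(p % q for q in range(2, int(p ** 0.5) + 1))]
H0 = [math.isqrt(p << 64) & MASK32 for p in PRIMES[:8]]
K = [icbrt(p << 96) & MASK32 for p in PRIMES]

def sha256_in_place(message: bytes):
    """Return (digest, number of storage writes) for the permuted process."""
    padded = message + b"\x80"
    padded += b"\x00" * ((56 - len(padded)) % 64) + struct.pack(">Q", 8 * len(message))
    h = list(H0)
    writes = 0
    for offset in range(0, len(padded), 64):
        w = list(struct.unpack(">16I", padded[offset:offset + 64]))
        for t in range(16, 64):
            s0 = rotr(w[t - 15], 7) ^ rotr(w[t - 15], 18) ^ (w[t - 15] >> 3)
            s1 = rotr(w[t - 2], 17) ^ rotr(w[t - 2], 19) ^ (w[t - 2] >> 10)
            w.append((w[t - 16] + s0 + w[t - 7] + s1) & MASK32)
        reg = list(h)                                   # storages holding "a" .. "h"
        for t in range(64):
            w0, x0, y0, z0 = [(j - t) % 4 for j in range(4)]       # group "a" .. "d"
            w1, x1, y1, z1 = [4 + (j - t) % 4 for j in range(4)]   # group "e" .. "h"
            a, b, c = reg[w0], reg[x0], reg[y0]
            e, f, g = reg[w1], reg[x1], reg[y1]
            t1 = ((rotr(e, 6) ^ rotr(e, 11) ^ rotr(e, 25))
                  + ((e & f) ^ (~e & g & MASK32)) + reg[z1] + K[t] + w[t])
            t2 = (rotr(a, 2) ^ rotr(a, 13) ^ rotr(a, 22)) + ((a & b) ^ (a & c) ^ (b & c))
            t3 = (t1 + reg[z0]) & MASK32                # T3 = T1 + z0, read before overwrite
            reg[z0] = (t1 + t2) & MASK32                # T1+T2 replaces the consumed value
            reg[z1] = t3                                # T3 replaces the discarded value
            writes += 2                                 # the other six storages are untouched
        h = [(p + q) & MASK32 for p, q in zip(h, reg)]  # Compute, expressions (9)
    return struct.pack(">8I", *h), writes

digest, writes = sha256_in_place(b"abc")
matches_reference = digest == hashlib.sha256(b"abc").digest()
```

In this model, two storages are written per step, i.e. 128 writes per 64-process block, whereas the ordinary process rewrites all eight storages in every step.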
As has been discussed above, in accordance with the operation process of
Next, the SHA operation circuit or SHA calculation device for performing the hash operation of SHA-2 of
In
The selector 26 selects one of the storages in which (T1+T2) or T3 is to be stored in accordance with the operation process of
For example, when t=0 in
Each of the permutation sections 27A and 27B is made, for example, of a number of 4-1 selectors which receive the values “a” to “h” acquired from the respective storages 22A to 22H and output the intermediate values w0 to z0 and w1 to z1 after permutation in accordance with the correspondence relationship stored in
For example, when t=0 in
The arithmetic unit 23 calculates (T1+T2) and T3 on the basis of the intermediate values w0 to z0 obtained from the permutation section 27A, the intermediate values w1 to z1 obtained from the permutation section 27B, Wt output from the first constant setting section 24, and Kt output from the second constant setting section 25 in accordance with the process of
The storages 22A to 22D belong to a first group, and the storages 22E to 22H belong to a second group. Each of the storages 22A to 22H is implemented with flip-flops, or a 32-bit or 64-bit D-type latch, and stores a predetermined one of the values “a” to “h”, T1+T2 or T3 in accordance with the selection signal output from the selector 26. Then, after completing the process when t=79 (or t=63), the respective storages 22A to 22H output the values “a” to “h” to the hash holder 21. Receiving the values “a” to “h”, the hash holder 21 performs the Compute process of
For example, when t=1, the storage 22C is used to save “c” (=T1+T2), and the storage 22G is used to save “g” (=T3). On the other hand, the remaining storages 22A, 22B, 22D to 22F and 22H hold the previous values (“a”, “b”, “d” to “f” and “h”), as they are in the previous process when t=0. By this configuration, no value is moved along the respective storages 22A to 22H during the operation process of
Accordingly, it is possible to decrease the number of times of driving the respective storages 22A to 22H and thereby reduce the power consumption of the SHA operation circuit 20.
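The reduction in the number of driven storages may be illustrated by the following sketch, which names the two storages selected in each step and counts the driven storages over one block. The selection rule is an assumption chosen to agree with the example for t=1 (storages 22C and 22G). The names `selected_storages` and `driven_storage_count` are hypothetical.

```python
FIRST_GROUP = ("22A", "22B", "22C", "22D")     # storages holding "a" .. "d"
SECOND_GROUP = ("22E", "22F", "22G", "22H")    # storages holding "e" .. "h"

def selected_storages(t: int) -> tuple:
    """Return the storages that receive T1+T2 and T3 in step t (assumed mapping)."""
    return FIRST_GROUP[(3 - t) % 4], SECOND_GROUP[(3 - t) % 4]

def driven_storage_count(steps: int) -> int:
    """Two storages are driven per step; the other six hold their values."""
    return sum(len(selected_storages(t)) for t in range(steps))

at_t0 = selected_storages(0)
at_t1 = selected_storages(1)
driven_80 = driven_storage_count(80)           # SHA-384 or 512
driven_64 = driven_storage_count(64)           # SHA-224 or 256
```

Under these assumptions, 160 of 640 storage updates remain for an 80-process block, and 128 of 512 for a 64-process block.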
It is to be noted that the present invention is not limited to the specific illustrative embodiments, but rather various modifications may be made. For example, while the storage circuit made of flip-flops is described by way of example, it can be formed with a memory device such as a RAM (Random Access Memory).
Also, while the storage circuits are classified into two groups in the case of the alternative embodiment, this grouping is not requisite.
Furthermore, while there are many implementations of a clock gating circuit for supplying gated clocks to the respective storages as described above, such a clock gating circuit can be incorporated in the permutation section 17 or 27A and 27B, the selector 16 or 26, or the like. For example, some internal signals of the permutation section 17 may be used to generate the gated clocks.
The entire disclosure of Japanese patent application No. 2006-302202 filed on Nov. 8, 2006, including the specification, claims, accompanying drawings and abstract of the disclosure, is incorporated herein by reference in its entirety.
While the present invention has been described with reference to the particular illustrative embodiments, it is not to be restricted by the embodiments. It is to be appreciated that those skilled in the art can change or modify the embodiments without departing from the scope and spirit of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2006-302202 | Nov 2006 | JP | national |