Method and apparatus for reducing power dissipation in finite field arithmetic circuits

Description

FIELD OF THE INVENTION

The present invention relates to finite field arithmetic circuits, and more particularly to finite field multiplication and division circuits for error correction coding and decoding.

BACKGROUND OF THE INVENTION

Reed-Solomon error-correcting codes help reduce bit error rate (BER) in applications such as data storage, digital video broadcasts, and wireless communications. Reed-Solomon codes operate on finite fields (GF(2

m

)). For high speed and low power, complex finite field arithmetic operations such as multiplication and division must be implemented efficiently.

Several different architectures that currently perform finite field arithmetic operations include standard, normal, and dual basis representations. For each representation, bit-parallel, bit-serial and digital architectures have been proposed for different application speed requirements. Most designs focus on reducing circuit area to improve speed.

Increased demand for portable DSP architectures has generated significant interest in the design of low power VLSI systems. For finite field arithmetic circuits, many approaches have been proposed for reducing power dissipation. For example, the impact of a primitive polynomial p(x) on power dissipation has been studied. The use of an XOR-tree has also been studied as a low power approach for finite field multipliers.

SUMMARY OF THE INVENTION

A method and apparatus according to the invention reduces power dissipation of a finite field arithmetic circuit having first and second circuit inputs. A first circuit transition probability of the first circuit input is calculated by applying a random input to the first circuit input and a constant input to the second circuit input. A second circuit transition probability of the second circuit input is calculated by applying a constant input to the first circuit input and a random input to the second circuit input. The first circuit transition probability is compared with the second circuit transition probability. One of the first and second circuit inputs having a lower circuit transition probability is selected.

In other features, a first time-varying rate that a first input signal to the arithmetic circuit varies is compared with a second time-varying rate that a second input signal to the arithmetic circuit varies. One of the first and second input signals having a higher time-varying rate is selected.

In yet other features, the selected one of the first and second input signals is coupled to the selected one of the first and second circuit inputs.

In other features, the circuit transition probability of the first circuit input is determined by computing node transition probabilities of circuit nodes associated with the first circuit input. The circuit transition probability of the second circuit input is determined by computing node transition probabilities of circuit nodes associated with the second circuit input.

In other features, the arithmetic circuit is a finite field multiplier. The finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier.

In still other features, the arithmetic circuit is a finite field multiplier implemented in a finite field divider. The finite field divider is implemented as an inverter that is connected to the finite field multiplier. The finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier and the inverter is implemented directly in combinatorial logic.

In still other features, the arithmetic circuit is implemented in an error correcting coding circuit. The error correction coding circuit is a Reed-Solomon error correction coding circuit.

Further areas of applicability of the present invention will become apparent from the detailed description provided hereinafter. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention will become more fully understood from the detailed description and the accompanying drawings, wherein:

FIG. 1

is an electrical schematic of a basic cell of a least significant bit (LSB)-first finite field array multiplier with a random first input A and a constant second input B;

FIG. 2

is an electrical schematic of a basic cell of a LSB-first finite field array multiplier with a constant first input A and a random second input B;

FIG. 3

is an electrical schematic of a 4×4 finite field multiplier with a random first input A and a constant second input B;

FIG. 4

is an electrical schematic of a 4×4 finite field multiplier with a constant first input A and a random second input B;

FIG. 5

is a composite field multiplier;

FIG. 6

is a first implementation of a finite field divider;

FIG. 7

is a second implementation of a finite field divider;

FIG. 8

is a composite field divider;

FIG. 9

is a transition probability calculating circuit;

FIGS. 10A and 10B

illustrate fast and slow time-varying inputs connected to a finite field arithmetic circuit.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The following description of the preferred embodiment(s) is merely exemplary in nature and is in no way intended to limit the invention, its application, or uses.

In high-speed CMOS circuits, dynamic switching activities dominate the overall power dissipation characteristics of the circuit. Reducing switching activities is an effective method for reducing power dissipation. The present invention reduces power dissipation by identifying the impact of slow and fast time-varying inputs on the power dissipation of finite field arithmetic circuits. The present invention exploits the asymmetry of the slow and fast time-varying inputs to achieve significant power savings. For finite field GF(2

m

) with large m, the present invention provides a low power implementation of composite field multipliers and dividers. The reduced power dissipation provided by the present invention does not involve increased circuit delay or area overhead. The present invention can be used in conjunction with other approaches for further reducing power dissipation.

The finite field GF(2

m

) contains 2

m

elements. GF(2

m

) is an extension of GF(2), which has elements 0 and 1. Finite fields contain a zero element and a primitive element. The finite field is constructed as a vector space over GF(2) with a primitive polynomial p(x)=x

m

+p

m−1

x

m−1

+ . . . +p

1

x

+p

0

(where p

m−1

, . . . , p

0

∈GF(2)). The primitive element α is a root of the primitive polynomial p(x). In other words,

α

m

=p

m−1

α

m−1

+ . . . +p

1

α+p

0

.

The primitive element α can be used to generate all nonzero elements in the finite field GF(2

m

). A standard basis of GF(2

m

) over GF(2) is a basis of the form {1, α, . . . , α

m−1

}. Any element A of finite field GF(2

m

) can be represented by polynomials of degree m−1 as:

GF

(2

m

)={

A|A=a

m−1

α

m−1

+ . . . +a

1

α+a

0

}

where a

m−1

, . . . , a

0

∈ GF(2). A finite field multiplication under the standard basis can be performed by polynomial multiplication followed by a modulo operation:

C=AB

mod

p

(

x

).

Various architectures have been proposed to implement a standard basis multiplication. A semi-systolic array multiplier is the most common architecture. A Mastrovito multiplier is designed for a dedicated primitive polynomial p(x). A composite field multiplier has reduced circuit complexity. In re-configurable Reed-Solomon codecs, input signals of a finite field multiplier often have different switching probabilities. For example in a Reed-Solomon codec, a random data symbol is multiplied by a coefficient of the finite field. The coefficient is set equal to a constant once the re-configurable codec is programmed.

Referring now to

FIGS. 1 and 2

, circuit switching probabilities in basic cells

10

and

10

′ of a least significant bit (LSB)-first semi-systolic array multiplier are illustrated. The LSB-first semi-systolic array multiplier has a regular structure and the computation of

C=AB

mod

p

(

x

)

is decomposed into m columns of basic cells (1≦k≦m), where each column computes:

A

(k)

=A

(k−1)

αmod

p

(

x

)

C

(k)

=A

(k−1)

b

k−1

+C

(k−1)

with W

(0)

=0 and A

(0)

=A. Finally, the product is

C

(m)

=c

m−1

(m)

α

m−1

+ . . . +c

1

(m)

α+c

0

(m)

.

The basic cell

10

includes AND-gates

14

-

1

and

14

-

2

and XOR-gates

16

-

1

and

16

-

2

. To analyze the transition probability in the basic cell

10

, random data is applied to a first input A and a string of zeros or ones are applied to a second input B. A transition probability p

Δ

of each circuit node is computed. The transition probability of all circuits nodes are at maximum value p

Δ

=0.5. In

FIG. 2

, half of a basic cell

10

′, including the shaded AND-gate

14

-

1

and the shaded XOR-gate

16

-

1

, is not switching at all. Therefore, the power dissipation in the basic cell

10

′ of the LSB-first semi-systolic array multiplier is not symmetric with respect to the transition probabilities of the first and second multiplier inputs A and B.

Referring now to

FIG. 3

, a 4×4 LSB-first semi-systolic array multiplier

50

illustrates transition probabilities with a random first input A and a constant second input B.

FIG. 4

shows a 4×4 LSB-first semi-systolic array multiplier

50

′ with a constant first input A and a random second input B. A gate

14

or

16

that is fully shaded has constant inputs. Therefore, the fully shaded gates dissipate no dynamic switching power. A gate

14

or

16

that is half-shaded has one constant input. The half-shaded gates dissipate less power than gates having both inputs that are actively switching.

FIGS. 3 and 4

show signal transitions of inputs A and B propagating through the multiplier

50

. The configuration in

FIG. 4

has lower switching activity and therefore lower power dissipation. The power dissipation in the LSB-first semi-systolic array multiplier is asymmetric with respect to the transition probabilities of first and second multiplier inputs A and B. Therefore, a slow varying input is assigned to the A-input of the multiplier for reduced power dissipation.

A Mastrovito multiplier is designed for a dedicated primitive polynomial p(x). The multiplication

C=AB

mod

p

(

x

)

is described as matrix multiplication

C=

A

B

Where matrix

A

is given by A and p(x). For GF(2

4

) with p(x)=x

4

+x+1,

C = \underline{A} B = [\begin{matrix} a_{o} & a_{3} & a_{2} & a_{1} \\ a_{1} & a_{o} + a_{3} & a_{2} + a_{3} & a_{o} + a_{2} \\ a_{2} & a_{1} & a_{o} + a_{3} & a_{2} + a_{3} \\ a_{3} & a_{2} & a_{1} & a_{o} + a_{3} \end{matrix}] [\begin{matrix} b_{o} \\ b_{1} \\ b_{2} \\ b_{3} \end{matrix}]

To compute C, the matrix

A

is evaluated and a multiplication

A

B is computed. The multiplication step can be implemented with AND-gates followed by an XOR-tree. The evaluation of matrix

A

only occurs when A switches. The multiplication occurs at every input sample. If the two input signals have different switching probabilities, a slow varying input is assigned to the A-input of the multiplier for reduced power dissipation.

Referring now to

FIG. 5

, a composite field multiplier often decomposes a GF(2

m

) multiplier into multipliers in small finite fields. For example, a GF(2

m

) multiplier

70

is implemented with multipliers

72

-

1

,

72

-

2

, and

72

-

3

and adders

74

-

1

,

74

-

2

,

74

-

3

, and

74

-

4

as shown in FIG.

5

. For the composite field multiplier

70

, the multiplier

72

-

1

will be analyzed first. If Y={Y

1

, Y

0

}∈GF(2

m

) is a slow varying signal, Y

1

+Y

0

is slow varying. Therefore, Y

1+Y

0

is assigned to the A-input of multiplier

72

-

1

for low power dissipation. Similarly, the same guidelines are used to configure the inputs of the remaining multipliers

72

-

2

and

72

-

3

in FIG.

5

.

Finite field dividers also find application in Reed-Solomon codecs. For example, in Berlekamp-Massey and Euclidean algorithms, Λ(x)/Δ is computed, where Λ(x)=Λ

t

x

t

+Λ

t−1

x

t−1

+ . . . +Λ

0

is a polynomial on the finite field GF(2

m

) and Δ∈GF(2

m

). In a circuit implementation, the divider is often shared and computes several polynomial coefficient divisions. For example, Δ is a fixed input of the divider while the dividend is changing from Λ

t

to Λ. The two divider inputs also have different switching probabilities.

Referring now to

FIG. 6

, a finite field divider

80

can be implemented using a finite field inverter

82

followed by a finite field multiplier

84

. For a finite field GF(2

m

) with a small value for m, the inverter

82

is preferably implemented directly with combinatorial logic. The output of the inverter

82

is connected to the A-input of the multiplier

84

. In the divider

80

′ of

FIG. 7

, however, the output of the inverter

82

is connected to the B-input of the multiplier

84

.

The implementation in

FIG. 6

has lower power dissipation when the divisor Y is a slow varying signal. In some Reed-Solomon codec applications, the divisor Y is a slow varying signal.

FIGS. 6 and 7

may have different circuit delays depending on the implementation of the multiplier. For example, if a Mastrovito multiplier is used, the implementation in

FIG. 6

has a longer circuit delay. The A-input of a Mastrovito multiplier has a longer delay path to the output of the multiplier.

Referring now to

FIG. 8

, a composite field divider architecture is often used as an efficient divider implementation when the value of m is large.

FIG. 8

shows a composite field divider

100

for computing X/Y for GF(2

m

). The divider is built with adders

102

-

1

,

102

-

2

,

102

-

3

and

102

-

4

, multipliers

104

-

1

,

104

-

2

, . . . , and

104

-

7

, and dividers

106

-

1

and

106

-

2

. If the dividend X is random and the divisor Y is slow varying, the shaded multipliers

104

-

1

,

104

-

5

and

104

-

6

have inputs with different switching probabilities. The shaded multipliers

104

-

1

,

104

-

5

and

104

-

6

are configured as discussed above in

FIG. 2

or as the Mastrovito multiplier described above for low power dissipation. For the two GF(2

m/2

) dividers in

FIG. 8

, the divisor is a function of Y, which is slow varying. The low power division method discussed above is applied. However, extra delay may be introduced.

As can be appreciated from the foregoing, the present invention reduces power dissipation of finite field arithmetic circuits. Several finite field multipliers and dividers were implemented to identify potential power reduction. For finite field multipliers, one input is set as a random signal and the other is a slow varying random signal. For the finite field divider, the divisor is a slow varying random signal, as in some Reed-Solomon codec applications. The results are set forth in Table 1 below:

TABLE 1

Design

Traditional

Proposed

Saving %

GF (2

4

) Mastrovito Multiplier

22.8

μW

18.6

μW

18.4

GF (2

8

) Mastrovito Multiplier

148

μW

88.8

μW

40.0

GF (2

8

) Composite Field

89.6

μW

77.8

μW

13.2

Multiplier

GF (2

8

) Composite Field Divider

181

μW

153

μW

15.0

Referring now to

FIG. 9

, a transition probability determining circuit

120

is shown. The transition probability determining circuit

120

includes a controller

122

that generates first and second input signals

123

and

124

that are input to first and second circuit inputs

125

and

126

of a finite field arithmetic circuit

128

. In a first mode, the first input signal

123

is random and the second input signal

124

is constant (e.g. ones or zeros). A transition probability calculator

130

monitors transition probabilities of the first and second circuit inputs

125

and

126

during the first mode. In a second mode, the first input signal

123

is constant and the second input signal

124

is random. A transition probability calculator

130

monitors transition probabilities of the first and second circuit inputs during the second mode. The transition probabilities of the first and second circuit inputs

125

and

126

during the first and second modes are used to select the input for a fast time-varying input signal and the input for a slow time-varying input signal as described above. As can be appreciated, the transition probability determining circuit

120

can be implemented in a software simulator.

Referring now to

FIGS. 10A and 10B

, based upon the results calculated in

FIG. 9

, a fast time-varying input

140

is coupled to one of the first and second circuit inputs

125

and

126

having a lower circuit transition probability. A slow time-varying input is coupled to the other of the first and second circuit inputs

124

and

126

having a higher circuit transition probability. As a result, the power dissipation of the finite field arithmetic circuit is reduced.

Those skilled in the art can now appreciate from the foregoing description that the broad teachings of the present invention can be implemented in a variety of forms. Therefore, while this invention has been described in connection with particular examples thereof, the true scope of the invention should not be so limited since other modifications will become apparent to the skilled practitioner upon a study of the drawings, the specification and the following claims.

Claims

1. A method for reducing power dissipation of a finite field arithmetic circuit having first and second circuit inputs, comprising:calculating a first circuit transition probability of said first circuit input by applying a random input to said first circuit input and a constant input to said second circuit input; calculating a second circuit transition probability of said second circuit input by applying a constant input to said first circuit input and a random input to said second circuit input; comparing said first circuit transition probability to said second circuit transition probability; selecting one of said first and second circuit inputs having a lower circuit transition probability; comparing a first time-varying rate that a first input signal to said arithmetic circuit varies with a second time-varying rate that a second input signal to said arithmetic circuit varies; and selecting one of said first and second input signals having a higher time-varying rate.
2. The method of claim 1 further comprising coupling said selected one of said first and second input signals to said selected one of said first and second circuit inputs.
3. The method of claim 2 further comprising coupling the other of said first and second input signals to the other of said first and second circuit inputs.
4. The method of claim 2 wherein said arithmetic circuit is a finite field multiplier.
5. The method of claim 4 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier.
6. The method of claim 2 wherein said arithmetic circuit is a finite field multiplier implemented in a finite field divider.
7. The method of claim 6 wherein said finite field divider is implemented as an inverter that is connected to said finite field multiplier.
8. The method of claim 7 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier and wherein said inverter is implemented directly in combinatorial logic.
9. The method of claim 2 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field multiplier.
10. The method of claim 2 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field divider.
11. The method of claim 2 wherein said arithmetic circuit is implemented in an error correcting coding circuit.
12. The method of claim 11 wherein said error correction coding circuit is a Reed-Solomon error correction coding circuit.
13. The method of claim 1 wherein said circuit transition probability of said first circuit input is determined by computing node transition probabilities of circuit nodes associated with said first circuit input.
14. The method of claim 1 wherein said circuit transition probability of said second circuit input is determined by computing node transition probabilities of circuit nodes associated with said second circuit input.
15. A finite field arithmetic circuit comprising:a first circuit input having a first circuit transition probability; a second circuit input having a second circuit transition probability, wherein said second circuit transition probability is lower that said first circuit transition probability; a first input that generates a first input signal to said first circuit input of said finite field arithmetic circuit that has a first time-varying rate; and a second input that generates a second input signal to said second circuit input of said finite field arithmetic circuit, wherein said second input signal has a second time-varying rate that is higher than said first time-varying rate.
16. The finite field arithmetic circuit of claim 15 wherein said arithmetic circuit is a finite field multiplier.
17. The finite field arithmetic circuit of claim 16 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier.
18. The finite field arithmetic circuit of claim 15 wherein said arithmetic circuit is a finite field multiplier implemented in a finite field divider.
19. The finite field arithmetic circuit of claim 18 wherein said finite field divider is implemented as an inverter that is connected to said finite field multiplier.
20. The finite field arithmetic circuit of claim 19 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier and wherein said inverter is implemented directly in combinatorial logic.
21. The finite field arithmetic circuit of claim 15 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field multiplier.
22. The finite field arithmetic circuit of claim 15 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field divider.
23. The finite field arithmetic circuit of claim 15 wherein said arithmetic circuit is implemented in an error correcting coding circuit.
24. The finite field arithmetic circuit of claim 23 wherein said error correction coding circuit is a Reed-Solomon error correction coding circuit.
25. A method for reducing power dissipation of a finite field arithmetic circuit having first and second circuit inputs, comprising:calculating a first circuit transition probability of said first circuit input by applying a random input to said first circuit input and a constant input to said second circuit input; calculating a second circuit transition probability of said second circuit input by applying a constant input to said first circuit input and a random input to said second circuit input; comparing said first circuit transition probability to said second circuit transition probability; selecting one of said first and second circuit inputs having a higher circuit transition probability; comparing a first time-varying rate that a first input signal to said arithmetic circuit varies with a second time-varying rate that a second input signal to said arithmetic circuit varies; and selecting one of said first and second input signals having a lower time-varying rate.
26. The method of claim 25 further comprising the step of coupling said selected one of said first and second input signals to said selected one of said first and second circuit inputs.
27. The method of claim 26 further comprising coupling the other of said first and second input signals to the other of said first and second circuit inputs.
28. The method of claim 26 wherein said arithmetic circuit is a finite field multiplier.
29. The method of claim 28 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier.
30. The method of claim 26 wherein said arithmetic circuit is a finite field multiplier implemented in a finite field divider.
31. The method of claim 30 wherein said finite field divider is implemented as an inverter that is connected to said finite field multiplier.
32. The method of claim 31 wherein said finite field multiplier is one of a semi-systolic multiplier and a Mastrovito multiplier and wherein said inverter is implemented directly in combinatorial logic.
33. The method of claim 26 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field multiplier.
34. The method of claim 26 wherein said arithmetic circuit is a finite field multiplier implemented in a composite field divider.
35. The method of claim 26 wherein said arithmetic circuit is implemented in an error correcting coding circuit.
36. The method of claim 35 wherein said error correction coding circuit is a Reed-Solomon error correction coding circuit.
37. The method of claim 25 wherein said circuit transition probability of said first circuit input is determined by computing node transition probabilities of circuit nodes associated with said first circuit input.
38. The method of claim 25 wherein said circuit transition probability of said second circuit input is determined by computing node transition probabilities of circuit nodes associated with said second circuit input.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 60/326,967, filed Oct. 3, 2001, which is hereby incorporated by reference.

US Referenced Citations (2)

Number	Name	Date	Kind
5602753	Fukui	Feb 1997	A
6286022	Kaliski, Jr. et al.	Sep 2001	B1

Foreign Referenced Citations (2)

Number	Date	Country
WO 9120028	Dec 1991	WO
WO 0120427	Mar 2001	WO

Non-Patent Literature Citations (7)

Entry
Christof Paar “Some Remarks on Efficient Inversion in Finite Fields” Worcester Polytechnic Institute, ECE Department, Worcester, MA.
Anatha P. Chandrakasan and Robert W. Brodersen, “Minimizing Power Consumption in Digital CMOS circuits” Proceedings of the IEEE, vol. 83, No. 4, Apr. 1995.
A. G. Wassal, M. A. Hassan and M. I. Elmasry, “Low-Power Design of Finite Field Multipliers for Wireless Applications,” Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, ON.
Leilei Song and, Keshab K. Parhi “Optimum Primitive Ploynomials for Low-Area Low-Power Finite Field Semi-Systolic Multipliers” IEEE, 1997 pp. 375-384.
Chris J. Nicol and Patrick Larsson, “Low Multiplier Multiplication for FIR Filters” Bell Laboratories, Lucent Technologies, Holmdel, NJ.
Surendra K. Jain, Leilei Song, “Efficient Semisystoloc Architectures for Finite-Field Arithmetic,” IEEE, 1998 pp. 101-113.
Leilei Song and Keshab K. Parhi, “Low-Energy Digit-Serial/Parallel Finite Field Multipliers,” Journal of VLSI Signal Processing 19, 1998, Kluwer Academic Publishers: Netherlands, pp. 149-166.

Provisional Applications (1)

	Number	Date	Country
	60/326967	Oct 2001	US

Method and apparatus for reducing power dissipation in finite field arithmetic circuits

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

CPC

US Classifications

Field of Search

US

International Classifications