This invention relates to the fields of computer architecture, microprocessor design, multivalued logic circuits and memory, and digital design.
The performance of current computers is reaching their limit. Almost all present day computers are built based on two-valued logic. In two-valued logic, each wire can have two states. The performance of current computer depend primarily on how quickly the states can be changed, which determines the clock speed. During the past decades, the clock speed for CPU had doubled almost every year. In recent years, the clock speed doubled every 18 months. Now, it has become progressively more difficult to increase the clock speed. The limit is approaching. Recently, CPU manufacturers have tried to circumvent the limitation of clock speed by packing ever more “cores” into a chip, which has resulted in dual-core or quad-core CPUs. However, this multi-core approach does not greatly improve the performance. This is due in part to a limit on the amount of data that can be transferred between the CPU and its connected components, which is in turn determined by the number of pins on the CPU. Using two-value logic, each pin on the CPU can have at most two states, and again the amount of data that can be transferred is determined by the clock speed. Thus, the multi-core approach does not circumvent the limitation.
Thus, there is a need for an innovative approach in order to push the speed limit of computing. Advancing from two-valued to 4-valued (also sometimes referred to as “quaternary”) logic provides an progressive approach. Four symbols {0, 1, 2, 3} are needed to distinguish the four values, as shown in Table 1.
The four values may represent any four things, such as the four bases {A, T, C, G} found in DNA, or the probability {0, ⅓, ⅔, 1}. These four values can be converted to binary numbers {00, 01, 10, 11}, or they can simply represent integers or digits {0, 1, 2, 3}. The four values could obviously be implemented with four voltage levels, e.g., {0V, 5V, 10V, 15V}.
To fully exploit the multivalued computational paradigm, it is advantageous to start from the ground up by designing components needed for constructing multivalued logic circuits. For example, each 4-valued logic gate will operate upon two bits of data at a time, and each memory cell will record two bits at once. With this design, each wire or CPU pin can have four states, which could double the amount of data that can be transferred between the CPU and its connected components without increasing the number of pins on the CPU. With eight-valued logic, each logic gate operates three bits of data and each CPU pin carries three bits of data at a time. Thus, the n-value logic described herein contemplates n values of 4, 8, 16, 32, 64, etc. The extreme case will be the infinite-valued logic.
The approach for using multivalued logic is currently being employed in building higher capacity flash memory. The industry is pushing to allow each memory cell to store not just one bit, but two bits, three bits, and even four bits. The prior arts includes U.S. Pat. Nos. 5,017,817, 5,227,993, 5,398,199, 5,438,533, 5,463,573, 5,467,298, 5,644,253, 5,773,996, 5,867,423, 5,973,960, 6,133,754, 6,218,713, 7,979,627, 8,064,253, and 8,120,384.
One aspect of this invention provides methods and apparatuses to increase the speed of computation by creating new types of computers that are capable of working on multiple values. This concept exploits the multivalued computation in hardware by using fundamental building blocks of multivalued logic circuits: multivalued logic gates, memory cells, and flip-flops. To make multivalued computation possible, the below described embodiments provide the necessary methods and tools for designing and building multivalued computers entirely within the domain of multivalued logic. The process is to design a multivalued microprocessor based entirely on multivalued circuits and then to use the microprocessors to build multivalued computers. Another aspect of this invention provides a method for designing multivalued microprocessors, by providing the overall architecture, defining the instruction set, providing designs of the processing units, the registers, and the control units. To implement these designs in hardware, the designs of multivalued memory cells, tristate buffers, and decoders are also provided. To build multivalued computers using multivalued logic gates and memory, further embodiments provide a methodology for designing any multivalued circuit to implement any multivalued function. The design of an adder for adding two four-valued numbers is provided to illustrate the methodology.
The process for designing and building multivalued computers includes a process for designing and building multivalued microprocessors. After construction of multivalued microprocessors, multivalued computers may then constructed by combining one or multiple multivalued microprocessors. Thus, the description of one embodiment will focus on constructing multivalued microprocessors. The description starts by providing the high-level design of the multivalued microprocessor, highlighting the overall architecture, and defining the instruction set. The description continues with the design of the multivalued processing units and then provides a methodology for designing any multivalued circuit. To illustrate the methodology, a multivalued circuit for adding two multivalued numbers is disclosed. The description further illustrates the design of one example of a multivalued memory cell that is used in the design of multivalued registers. The description then provides one example design of a 4-valued tristate buffer and a decoder that are used for the design of a control circuit. The description concludes with an example of certain programming aspects of the microprocessor.
The high-level architecture of one embodiment of the multivalued microprocessor 1 is shown in
The process of preforming one computation specified by one instruction is outlined as follows. The program counter (PC) register contains the address of the instruction to be executed. First, this address is passed to the instruction memory 9 through the F bus (as shown in
The microprocessor can be built to perform any number of instructions. In the illustrated embodiment, a simple instruction set is defined. For this design, the microprocessor can perform 16 instructions that are defined as shown in Table 2.
The first instruction is the Move (Rd, Ra) operation that implements Rd=Ra, where Rd is one of the 16 registers for storing the result of the operation, and Ra is anyone of the 16 registers. For example, Move (R2, R1) will results in R2=R1 (the contents of register R2 is replaced by the contents of R1). Sometimes Rd will be referred to as the “destination register” (the register where the results of the operation are stored) and Ra will be referred to as the “operand register” (the source of the number being operated upon). The Move operation has one operand register while operations like Add have two operand registers. The Not, And, Or functions are logic operations. The Add, Sub, Addi, and Subi functions are arithmetic operations. For instance, Add(R3, R2, R1) will results in R3=R2+R1, while Subi(R3, R1, 5) will results in R3=R1−5. The Set and Seth put a constant value into a register. For example, Set(R3, v4) puts the value v4 into R3. In the embodiment where each register can store eight 4-valued numbers, Set(R3, v4) puts 4 zeros on the “left” or ‘high’ part of the R3 register and the four 4-valued numbers v4 on “right” or “low” part of the R3 register. Seth(R3, v4) puts into R3 the four 4-valued numbers v4 on the high part of the register and leaves the low part unchanged. The Store and Load functions transfer data between registers and the Data Storage module. The Movez, Movex, Movep, and Moven functions are conditional move statements. For instance, Movez(R5, R7, R9) will results in R5=R7 only if R9 is zero, otherwise R5 remains unchanged.
As suggested above, an advantageous feature of this microprocessor is that anyone of these 16 instructions can write into the program counter PC and thus can function as a Jump statement. For example, Move(PC, R1) will jump to the address specified by R1, while Movz(PC, R1, R2) will jump only if R2 is zero (a conditional jump), and Addi(PC, PC, 8) will function as a relative jump for jumping forward.
Each of the instructions is encoded with 4-valued numbers (0,1,2,3). The 16 operation codes (op-code) is encoded with two 4-valued numbers as show in Table 2. The 16 registers are also referred to with two 4-valued numbers. For instance, Add(R3, R2, R1) is coded with 02 03 02 01, where the leftmost 02 is the op-code for Add, the 03 refers to register R3, the 02 to R2, and the 01 to R1. Thus, only 8 of the 4-valued digits are needed to encode one instruction.
The computational model and the instructions described above will serve as a prototype for the design and the implementation of a microprocessor using multivalued circuits. As previously suggested, one major advantage of using multivalued circuits to implement a microprocessor (or a computer) is to reduce the number of wires and components, as will be described in the following sections.
Designing Multivalued Processing Units
This section realizes the design of a microprocessor, and more specifically the processing module 2, by using multivalued circuits. The design begins by implementing the multivalued processing units 13. As shown in
For the example embodiment of the multivalued microprocessor described herein, the implementation of the processing units 13 will be done by using 4-valued logic circuits, although 16-valued logic circuits would also be well suited (or potentially even greater-valued logic circuits). Each of the processing units 13 in this example will be implemented by using 4-valued logic circuits: each (4-valued) wire can carry 2 bits of data at any given time (4 states) and each (4-valued) logic gate can operate on 2 bits of data at a time. For instance, the design of the AND processing unit 13 is shown in
The OR and NOT processing units can be implemented using the same method as outlined for the AND processing unit. An n-valued OR gate takes two or more n-valued input wires and produces one n-valued output wire that transmits the maximal value of all the input wires. Likewise, a n-valued NOT gate takes an n-valued input wire and produces an n-valued output wire that transmits the value of (n−1) minus the input value.
The MOVE, SET, SETH, STORE, and LOAD processing units are primarily composed of a set of wires for transferring specific digits from the input to the output of the processing units (see
The remaining processing units, ADD, SUB, ADDI, and SUBI, all require the function of adding two numbers. For instance, SUB (A−B) is implemented as A+(−B). These processing units may be implemented using a general methodology for designing multivalued circuits as described in the following.
A General Method for Designing any Multivalued Circuits
This disclosure further provides a general method for designing any multivalued circuit. It has been shown mathematically that any multivalued function can be decomposed into three types of basic operations. Epstein, George, “The Lattice Theory of Post Algebras”, Transactions of the American Mathematical Society, Vol. 95, No. 2, pp. 300-317, May, 1960, which is incorporated by reference herein in its entirety. For designing multivalued circuits, three types of multivalued logic gates (“Component gates,” AND gates, and OR gates) are combined to form the required circuit.
A Component gate takes one input and produce one output. For a n-valued input x, there are n component gates Ci(x) for 0≤i≤n−1. The Ci(x) gate produces an output n−1 (also sometimes referred to as “high” or “true”) whenever x is equal to i, otherwise it produces an output 0 (also sometimes referred to as “low” or “false”). For example, if a 4-valued input x is equal to 2, then the Component gates would output C0(x)=0, C1(x)=0, C2(x)=3, and C3(x)=0.
The methodology can be used to design any multivalued circuit, although the design of a 4-valued adder is provided to illustrate the method. The following outlines a four-step process for designing multivalued circuits to implement any multivalued (n-valued) function. The four steps are: (1) Creating a truth table to define the function; (2) Connecting each input x to n Component gates; (3) Creating an AND gate for each output instance having a value>0; and (4) Connecting the outputs of all the AND gates to an OR gate, which produces the final outputs of the required function. These 4 steps are described in more details in the following:
Step 1. Truth Table: Creating a Truth Table to Define the Multivalued Functions
For illustrating the method, this description provides the design for an adder which adds two 4-valued numbers A, B. First, a truth table is created to define the required functions, as shown in Table 3.
All possible input combinations are shown in column A and B. The results of the addition is encoded by two outputs K and S, where K stands for carry and S stands for sum, and the total value is 4K+S. The column K defines the function required to produce K as output, and the column S defines the function required to produce S as output.
Step 2. Component Gates: Connecting Each Input x to n Component Gates Ci(x) for 0≤i≤n−1.
Continuing the above example of designing an adder, the adder has two inputs, A and B. The input A is connected to 4 Ci(A) gates:
C0(A), C1(A), C2(A), C3(A)
Similarly, input B is connected to 4 Ci(B) gates:
C0(B), C1(B), C2(B), C3(B)
The result of these connections is shown in
Step 3. AND Gates: Creating an AND Gate for Each Output Instance (in the Output Columns of the Table from Step 1) “e” Having a Value>0.
For each input instance A0,A1, . . . Am−1=x0,x1, . . . xm−1 that produces an output e>0, an AND gate is created connecting:
Cx0(A0)·Cx1(A1)⋅ . . . ⋅Cxm−1(Am−1)·e
For e=en-1, there is no need to connect the AND gate to e, which is the results of simplification based on the postulate that is en-1·A=A, e.g., in 4-valued context, the value 3 “anded” to any value x will equal x.
Continuing the example of designing an adder, for the function that produces S as output (in the S column of the truth table), there are 12 instances that produce an output e>0. For example, referring to the truth table, when inputs A=0, B=1, the output is 5=1, thus an AND gate is created connecting: C0(A)·C1(B)·1. In other words, since A=0, the AND gate connects to the output of C0(A) gate (from step 2), and since B=1, the AND gate connects to the output of C1(B) gate (see
C0(A)·C1(B)·1, C0(A)·C2(B)·2, C0(A)·C3(B),
C1(A)·C0(B)·1, C1(A)·C1(B)·2, C1(A)·C2(B),
C2(A)·C0(B)·2, C2(A)·C1(B), C2(A)·C3(B)·1,
C3(A)·C0(B), C3(A)·C2(B)·1, C3(A)·C3(B)·2
Similarly, for the function that produces K as output (in the K column of the truth table), there are six instances that produce output e>0. Six AND gates are created as shown below and the connections are shown in
C1(A)·C3(B)·1, C2(A)·C2(B)·1, C2(A)·C3(B)·1,
C3(A)·C1(B)·1, C3(A)·C2(B)·1, C3(A)·C3(B)·1
Step 4: OR Gate: Connecting the Outputs of all the AND Gates to an OR Gate, which Produces the Outputs of the Required Function.
Finishing the example of designing an adder, for the function that produces S as output (in the S column of the truth table), the outputs of all twelve AND gates (from Step 2) are connected to an OR gate, as defined below:
S=C0(A)·C1(B)·1+C0(A)·C2(B)·2+C0(A)·C3(B)+C1(A)·C0(B)·1+C1(A)·C1(B)·2+C1(A)·C2(B)+C2(A)·C0(B)·2+C2(A)·C1(B)+C2(A)·C3(B)·1+C3(A)·C0(B)+C3(A)·C2(B)·1+C3(A)·C3(B)·2
Similarly, for the function that produces K as the output (in the K column of the truth table), the outputs of all the six AND gates (from Step 2) are connected to an OR gate, as defined below:
K=C1(A)·C3(B)·1+C2(A)·C2(B)·1+C2(A)·C3(B)·1+C3(A)·C1(B)·1+C3(A)·C2(B)·1+C3(A)·C3(B)·1
The results of all these connections are shown in
Designing Multivalued Registers and Control Circuits
To continue the design of the multivalued microprocessor, this section describes the design of the registers and the control circuits. There are 16 registers 5 (shown in
To implement multivalued registers requires multivalued memory cells. Another embodiment of this invention provides a design of a multivalued memory cell that can store any multivalued data. The design is shown in
Besides the multivalued memory cell, another component used in implementing the multivalued microprocessor is the multivalued tristate buffer 15. The tristate buffers 15 are used in
The control unit of the microprocessor comprises of many decoders. The decoders are shown in
The current embodiment of the microprocessor further comprises a control circuit. One example of a control circuit is shown in
Multivalued Logic Gates
The multivalued logic gates used to construct the multivalued microprocessor described above include the Component gates, the multivalued AND, OR and NOT gates. These multivalued logic gates can be implemented using many different kinds of technologies, including using transistors. One embodiment of the Component gates for 4-valued logic is shown in
Programming the Multivalued Computer
After completing the design of a multivalued microprocessor, a multivalued computer is constructed by combining one or multiple multivalued microprocessors to additional memory and/or input and output devices. The multivalued computer can then be used to execute programs.
One of the major advantages of the above described multivalued computers is that the programming is much like the programming of conventional binary computers. A sample program is shown in
This application is a continuation of PCT/US17/22339 filed Mar. 14, 2017 which claims the benefit of U.S. Provisional No. 62/308,289 filed Mar. 15, 2016, both of which are incorporated by reference herein in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
5017817 | Yamakawa | May 1991 | A |
5130704 | Ogawa et al. | Jul 1992 | A |
5227993 | Yamakawa | Jul 1993 | A |
5331320 | Cideciyan | Jul 1994 | A |
5398199 | Lefons | Mar 1995 | A |
5438533 | Yoshida | Aug 1995 | A |
5463573 | Yoshida | Oct 1995 | A |
5467298 | Yoshida | Nov 1995 | A |
5644253 | Takatsu | Jul 1997 | A |
5751775 | Fensch et al. | May 1998 | A |
5773996 | Takao | Jun 1998 | A |
5867423 | Kapoor et al. | Feb 1999 | A |
5969648 | Garnett | Oct 1999 | A |
5973960 | Shibata et al. | Oct 1999 | A |
6133754 | Olson | Oct 2000 | A |
6218713 | Wakayama | Apr 2001 | B1 |
6674712 | Yang | Jan 2004 | B1 |
7979627 | Arakawa | Jul 2011 | B2 |
8064253 | Paul | Nov 2011 | B2 |
8120384 | Olexenko | Feb 2012 | B2 |
9584259 | Millar | Feb 2017 | B2 |
9742431 | Babin | Aug 2017 | B1 |
20060259744 | Matthes | Nov 2006 | A1 |
20090300333 | Harper, III | Dec 2009 | A1 |
20090324247 | Kikuchi | Dec 2009 | A1 |
20110010493 | Kimura et al. | Jan 2011 | A1 |
20130205413 | Grocutt | Aug 2013 | A1 |
Entry |
---|
M. H. A. Khan, “Reversible realization of quaternary decoder, multiplexer, and demultiplexer circuits, ” Engineering Letters, vol. 15, No. 2, 2007, pp. 203-207. (Year: 2007). |
Ascia, G.; Catania, V.; Russo, M.; “VLSI hardware architecture for complex fuzzy systems” IEEE Transaction on Fuzzy systems, vol. 7, issue 5, Oct. 1999, p. 553-570. |
Catania, V.; Puliafito, A.; Russo, M.; Vita, L.; “A VLSI fuzzy inference processor based on a discrete analog approach,” IEEE Transactions on Fuzzy Systems, vol. 2, Issue 2, May 1994, p. 93-106. |
Epstein, George, “The Lattice Theory of Post Algebras”, Transactions of the American Mathematical Society, vol. 95, No. 2, pp. 300-317, May 1960. |
PCT Application No. US2017/022339; International Search Report and Written Opinion of the International Searching Authority for Applicant Louisiana Tech Research Corporation dated Jul. 13, 2017. |
Number | Date | Country | |
---|---|---|---|
20190018822 A1 | Jan 2019 | US |
Number | Date | Country | |
---|---|---|---|
62308289 | Mar 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2017/022339 | Mar 2017 | US |
Child | 15984518 | US |