Learning machine

Information

  • Patent Grant
  • 5384896
  • Patent Number
    5,384,896
  • Date Filed
    Friday, October 22, 1993
  • Date Issued
    Tuesday, January 24, 1995
  • CPC
  • US Classifications
    • 395
    Field of Search
    • US
    • 395 11
    • 395 24
    • 395 27
    • 395 10
    • 395 11
    • 395 20
    • 395 21
    • 395 23
    • 395 24
    • 395 27
    • 395 77
  • International Classifications
    • G06G712
Abstract
A learning machine with multi-input single-output circuits connected in a hierarchical structure, wherein product-sums of the output signals of input and output signal registers and the weights are obtained by the parallel processing of a plurality of product-sum computing units, with a plurality of input and output signal registers connected in cascade, thereby scaling down the circuit of the learning machine so that only one sigmoid function computing unit is required.
Description

BACKGROUND OF THE INVENTION
The present invention generally relates to a learning machine for a data processing apparatus.
A conventional learning machine is shown, for example, in "A Parallel Neurocomputer Architecture toward Billion Connection Updates Per Second", International Joint Conference on Neural Network (January 1990).
FIG. 9 shows a block diagram of the conventional learning machine, which includes input and output signal registers 51, 52, 53 and 54, product-sum and sigmoid function computing units 55, 56 and 57, weight memories 58, 59 and 60, and ring length controlling units 61, 62 and 63. FIG. 10 shows a model of a learning machine to be realized with the structure shown in FIG. 9. In FIG. 10, reference numerals 64, 65, 66 and 67 are input terminals, reference numerals 68, 69, 70, 71 and 72 are multi-input single-output circuits, reference numeral 73 is an input layer, reference numeral 74 is a hidden layer, and reference numeral 75 is an output layer. As shown in FIG. 10, a learning machine can be expressed as a model with multi-input single-output circuits connected in a hierarchical structure. Among the multi-input single-output circuits connected in the hierarchical structure, the layer composed of the circuits that output the output signals is called the output layer. Layers composed of multi-input single-output circuits other than the output layer are called hidden layers. The layer composed of input terminals is called the input layer. Generally, the hidden layer may be composed of multi-input single-output circuits constituting one layer, or of multi-input single-output circuits constituting a plurality of layers. FIG. 10 shows a case where the hidden layer is composed of multi-input single-output circuits constituting one layer. Generally, the input terminals constituting the input layer may be arbitrary in number, as may the multi-input single-output circuits constituting the hidden layer and the output layer. FIG. 10 shows a learning machine in which the input layer 73 is composed of four input terminals, the hidden layer 74 is composed of three multi-input single-output circuits, and the output layer 75 is composed of two multi-input single-output circuits.
The multi-input single-output circuits 68, 69, 70, 71 and 72 of the respective layers each output a signal having a saturation characteristic with respect to the sum of the products obtained by multiplying a plurality of input signals by their respective weights. Namely, the output signal $Y_j$ of the j-th multi-input single-output circuit is expressed by
$$Y_j = \mathrm{fnc}\left(\sum_i W_{ji} X_i\right) \qquad (1)$$
Here $X_i$ is the output signal of the i-th multi-input single-output circuit in the preceding layer, and $W_{ji}$ is the weight applied when the output signal of the i-th multi-input single-output circuit in the preceding layer is input into the j-th multi-input single-output circuit. fnc( ) is a sigmoid function having a saturation characteristic, and outputs for example ##EQU1## with respect to X.
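As an illustration of formula (1), the following minimal Python sketch evaluates one multi-input single-output circuit and one layer of such circuits. The logistic function used for `fnc` is an assumption made for this sketch only, since the example sigmoid is not legible in this text; the function names are likewise hypothetical.

```python
import math

def fnc(x):
    # Assumed logistic sigmoid; stands in for the saturating function,
    # whose exact form is not reproduced in this text.
    return 1.0 / (1.0 + math.exp(-x))

def circuit_output(weights_j, inputs):
    # Formula (1): Y_j = fnc(sum_i W_ji * X_i) for a single circuit j.
    product_sum = sum(w * x for w, x in zip(weights_j, inputs))
    return fnc(product_sum)

def layer_outputs(weights, inputs):
    # weights[j][i] holds W_ji; one output per multi-input single-output circuit.
    return [circuit_output(row, inputs) for row in weights]
```

With four inputs and a 3 x 4 weight table, `layer_outputs` returns the three hidden-layer signals of the model in FIG. 10.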
In the block diagram of the conventional learning machine of FIG. 9, the weight memories 58, 59 and 60 store the weights to be multiplied in the multi-input single-output circuits 68, 69, 70 and 71 in the model of FIG. 10. The input and output signal registers 51, 52, 53 and 54 store the signals input from the input terminals 64, 65, 66 and 67 or the output signals of the multi-input single-output circuits 68, 69, 70, 71 and 72. The signal stored in the input and output signal register 54 is transferred to the input and output signal register 53 at the next machine cycle, and the signal stored in the input and output signal register 53 is transferred to the input and output signal register 52 at the next machine cycle, so that the signals stored in the input and output signal registers 51, 52, 53 and 54 are transferred in sequence. The product-sum and sigmoid function computing units 55, 56 and 57 obtain the product-sum of the weights stored in the weight memories 58, 59 and 60 and the signals stored in the input and output signal registers 51, 52 and 53, and output a signal having a saturation characteristic with respect to the product-sum in accordance with formula (2). The output signals of the product-sum and sigmoid function computing units 55, 56 and 57 are stored in the input and output signal registers 51, 52, 53 and 54. The ring length controlling units 61, 62 and 63 adjust the number of input and output signal registers among which the stored signals are transferred, in accordance with the number of input signals and the number of multi-input single-output circuits constituting the hidden layer 74. Namely, in computing the product-sum of the multi-input single-output circuits of the hidden layer 74, the ring length is adjusted in the ring length controlling units 61, 62 and 63 so that the signals are transferred among the input and output signal registers 51, 52, 53 and 54, since the number of input signals is four. 
In computing the product-sum of the multi-input single-output circuits of the output layer 75, the ring length is adjusted in the ring length controlling units 61, 62 and 63 so that the signals are transferred among the input and output signal registers 51, 52 and 53, since the number of multi-input single-output circuits of the hidden layer 74 is three.
FIG. 11 shows a block diagram of the product-sum and sigmoid function computing units 55, 56 and 57. In FIG. 11, reference numeral 76 is a multiplier, reference numeral 77 is a product-sum register, reference numeral 78 is an addition unit, reference numeral 79 is a sigmoid function computing element, reference numeral 80 is an input signal terminal, reference numeral 81 is an output signal terminal, and reference numeral 82 is an input terminal of weights. The operation of the product-sum and sigmoid function computing units 55, 56 and 57 is as follows. The signal stored in the product-sum register 77 is initialized to zero. The multiplier 76 outputs to the addition unit 78 the product of the signal input from the input signal terminal 80 and the weight input from the input terminal of weights 82. The addition unit 78 obtains the sum of the product output by the multiplier 76 and the product-sum stored in the product-sum register 77, and outputs it to the product-sum register 77. By repeating this operation of obtaining the product and the sum, the product-sum of the signals input from the input signal terminal 80 and the weights input from the input terminal of weights 82 is stored in the product-sum register 77. When the product-sum operation is completed, the sigmoid function computing element 79 outputs a signal having the saturation characteristic given in formula (2) with respect to the signal stored in the product-sum register 77. Therefore, the signal given in formula (2) is output from the output signal terminal 81.
FIG. 12 is a diagram illustrating the parallel processing of the product-sum and sigmoid function computing units 55, 56 and 57 in obtaining the outputs of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74. In FIG. 12, $X_i$ ($1 \le i \le 4$) is an input signal, and $W_{ji}$ ($1 \le i \le 4$, $1 \le j \le 3$) is the weight to be multiplied by the input signal $X_i$ in the j-th multi-input single-output circuit of the hidden layer 74. In order to obtain the outputs of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74, the ring length is adjusted by the ring length controlling units 61, 62 and 63 so that the signals are transferred among the input and output signal registers 51, 52, 53 and 54. At first, the input signals $X_i$ ($1 \le i \le 4$) are loaded to the input and output signal registers 51, 52, 53 and 54. The product-sum registers of the product-sum and sigmoid function computing units 55, 56 and 57 are initialized to zero. The parallel processing of the product-sum and sigmoid function computing units 55, 56 and 57 at the next machine cycle is shown in (a) of FIG. 12. The product-sum and sigmoid function computing unit 55 obtains the product of the weight $W_{11}$ stored in the weight memory 58 and the input signal $X_1$ stored in the input and output signal register 51, and stores it in its product-sum register. At the same time, the product-sum and sigmoid function computing unit 56 obtains the product of $W_{22}$ and $X_2$, and the product-sum and sigmoid function computing unit 57 obtains the product of $W_{33}$ and $X_3$, and each stores its product in its own product-sum register. The parallel processing of the product-sum and sigmoid function computing units 55, 56 and 57 at the next machine cycle is shown in (b) of FIG. 12. The signals stored in the input and output signal registers 51, 52, 53 and 54 are transferred in sequence. 
The product-sum and sigmoid function computing unit 55 obtains the product $W_{12} X_2$ of the signal $X_2$ stored in the input and output signal register 51 and the weight $W_{12}$ stored in the weight memory 58, and stores in its product-sum register the sum with $W_{11} X_1$ shown in formula (3). ##EQU2## At the same time, the product-sum and sigmoid function computing units 56 and 57 respectively store in their product-sum registers the product-sums shown in formula (4). ##EQU3## Likewise, in the following machine cycles the signals stored in the input and output signal registers 51, 52, 53 and 54 are transferred in sequence. The product-sum and sigmoid function computing units 55, 56 and 57 obtain the product-sums of the weights stored in the weight memories 58, 59 and 60 and the signals stored in the input and output signal registers 51, 52, 53 and 54. Namely, the product-sum and sigmoid function computing units 55, 56 and 57 respectively obtain the product-sums of the first, second and third multi-input single-output circuits of the hidden layer. When the product-sums are obtained, the product-sum and sigmoid function computing units 55, 56 and 57 obtain, by means of their sigmoid function computing elements, signals each having the saturation characteristic given in formula (2) with respect to the product-sum, and output them into the input and output signal registers 51, 52 and 53.
The output signals of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74 are obtained in a manner described hereinabove and are stored in the input and output signal registers 51, 52 and 53. In obtaining the outputs of the multi-input single-output circuits 71 and 72 of the output layer 75, the ring length is adjusted in the ring length controlling units 61, 62 and 63 so that the signals may be transferred among the input and output signal registers 51, 52 and 53 so as to keep the ring length consistent with the number of output signals (3 in this case) of the hidden layer. In the same way as obtaining of the output signals of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74, the outputs of the multi-input single-output circuits 71 and 72 of the output layer 75 are obtained by the parallel processing of the product-sum and sigmoid function computing units 55 and 56.
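The ring-based parallel processing described above can be sketched in Python as follows. This is only an illustrative simulation under stated assumptions: the registers are modelled as a list that rotates by one position per machine cycle, unit j is assumed to read register j, and the function name `ring_product_sums` is hypothetical. The sigmoid step is left out so that the product-sums can be checked directly.

```python
def ring_product_sums(weights, inputs):
    # weights[j][i] holds W_ji (0-based); unit j reads register j of the ring.
    # Assumes there are at least as many registers (inputs) as computing units.
    n_units = len(weights)
    ring_length = len(inputs)          # set by the ring length controlling units
    ring = list(inputs)                # registers 51, 52, ... hold X_1, X_2, ...
    acc = [0.0] * n_units              # product-sum registers start at zero
    for cycle in range(ring_length):
        for j in range(n_units):
            i = (j + cycle) % ring_length      # index of the signal now in register j
            acc[j] += weights[j][i] * ring[j]  # all units work in the same cycle
        ring = ring[1:] + ring[:1]     # each register takes its neighbour's signal
    return acc
```

In the first cycle the units form $W_{11} X_1$, $W_{22} X_2$ and $W_{33} X_3$, and in the second $W_{12} X_2$, $W_{23} X_3$ and $W_{34} X_4$, matching (a) and (b) of FIG. 12. After as many cycles as there are input signals, every unit holds a complete product-sum.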
FIG. 13 is a time chart showing the temporal sequence of the computing units operating in the conventional learning machine. While the product-sums of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74 are being obtained, the product-sum and sigmoid function computing units 55, 56 and 57 are operating, and the number of product-sum and sigmoid function computing units operating at this time equals the number of multi-input single-output circuits of the hidden layer. The time required to obtain the product-sums of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74 is
$$\text{machine cycle} \times \text{number of signals in the input layer} \qquad (5)$$
Then, the sigmoid functions of the multi-input single-output circuits of the hidden layer are obtained. The number of product-sum and sigmoid function computing units operating at this time is equal to that of the multi-input single-output circuits of the hidden layer. When the product-sums of the multi-input single-output circuits 71 and 72 of the output layer 75 are obtained, the product-sum and sigmoid function computing units 55 and 56 are operating. The number of product-sum and sigmoid function computing units operating at this time equals that of the multi-input single-output circuits of the output layer. The time required to obtain the product-sums of the multi-input single-output circuits 71 and 72 of the output layer 75 is
$$\text{machine cycle} \times \text{number of signals in the hidden layer} \qquad (6)$$
Then, the sigmoid functions of the multi-input single-output circuits of the output layer are obtained. The number of product-sum and sigmoid function computing units operating at this time is equal to that of the multi-input single-output circuits of the output layer. The time required for the output signal to be obtained from the input signal by this operation is
$$\text{machine cycle} \times (\text{number of signals in the input layer} + \text{number of signals in the hidden layer}) + \text{duration of computing the sigmoid function of the hidden layer} + \text{duration of computing the sigmoid function of the output layer} \qquad (7)$$
Then, the weights of the output layer are modified. The amount of weight modification of the output layer is obtained by computation in the product-sum and sigmoid function computing units 55 and 56, and the weights are modified accordingly. Further, the back-propagating signal $\delta$ of the hidden layer is obtained. The time required to obtain the weight modification of the output layer and the back-propagating signal $\delta$ of the hidden layer is
$$\text{machine cycle} \times \text{number of signals in the hidden layer} \times 3 \qquad (8)$$
The weight modification of the hidden layer is effected by operation of the product-sum and sigmoid function computing units 55, 56 and 57. The time required for it is
$$\text{machine cycle} \times \text{number of signals in the input layer} \times 2 \qquad (9)$$
The time required to complete the weight modification, measured from the time point when the output signal of the output layer was obtained in the above manner, is
$$\text{machine cycle} \times (3 \times \text{number of signals in the hidden layer} + 2 \times \text{number of signals in the input layer}) \qquad (10)$$
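As a worked illustration of formulas (7) and (10), the short Python sketch below evaluates both timing expressions. The function and parameter names are hypothetical, and the sigmoid computing durations are left as inputs because the text does not give values for them.

```python
def forward_time(machine_cycle, n_input, n_hidden, t_sigmoid_hidden, t_sigmoid_output):
    # Formula (7): product-sum time of both layers plus the two sigmoid durations.
    return machine_cycle * (n_input + n_hidden) + t_sigmoid_hidden + t_sigmoid_output

def weight_modification_time(machine_cycle, n_input, n_hidden):
    # Formula (10): sum of formulas (8) and (9).
    return machine_cycle * (3 * n_hidden + 2 * n_input)
```

For the model of FIG. 10 (four input signals, three hidden-layer signals), the conventional machine needs 4 + 3 = 7 machine cycles of product-sum computation in the forward pass, plus the sigmoid durations, and 3 x 3 + 2 x 4 = 17 machine cycles for the weight modification.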
In this construction, the product-sums are obtained by the simultaneous operation of as many product-sum computing elements as there are multi-input single-output circuits in one layer, so the number of product-sum computing elements required equals the number of multi-input single-output circuits of the hidden layer or of the output layer. Thus one problem of the prior art is that the scale of the circuit becomes large.
Because the number of product-sum and sigmoid function computing units available for parallel processing cannot be less than the number of multi-input single-output circuits of the hidden layer or the output layer, the prior art also has the problem that it cannot constitute a learning machine in which the number of multi-input single-output circuits in one layer exceeds the number (3 in the conventional example) of product-sum and sigmoid function computing units prepared in advance.
Because the calculation of the amount of weight modification, the weight modification itself, and the calculation of the back-propagating signal $\delta$ of the hidden layer are carried out one after another by the product-sum and sigmoid function computing units, the prior art has the further problem that the time required for the weight modification is long.
SUMMARY OF THE INVENTION
Accordingly, an essential object of a first invention is to provide an improved learning machine with a smaller circuit scale, requiring only one sigmoid function computing element.
Another important object of a second invention is to provide a learning machine capable of modifying weights in a shorter time.
Still another object of a third invention is to provide a learning machine that can realize, with simple changes, an arbitrary number of input signals and an arbitrary number of multi-input single-output circuits in the hidden layer or the output layer.
The first invention is a learning machine which comprises a plurality of input and output signal registers connected in cascades, a plurality of weight memories for storing the weights to be multiplied by the input signals, a plurality of product-sum computing units for outputting product-sum of the output signals of the input and output signal registers by the weights stored in the weight memories, and a single sigmoid function computing unit for outputting signals having a saturation characteristic with respect to each product-sum to be outputted by the product-sum computing unit.
The second invention is a learning machine which comprises a plurality of input and output signal registers connected in cascade, a plurality of weight memories for storing the weights to be multiplied by the input signals, a plurality of product-sum computing units for outputting products and product-sums of the output signals of the input and output signal registers and the weights stored in the weight memories, a single sigmoid function computing unit for outputting signals each having a saturation characteristic with respect to the product-sum output by the product-sum computing unit, an output layer $\delta$ computing unit for computing the back-propagating signal $\delta$ of the output layer dependent on the output signal of the sigmoid function computing unit and the supervising signal, a hidden layer $\delta$ computing unit for computing the back-propagating signal $\delta$ of the hidden layer dependent on the products output by the product-sum computing units, and a weight modification unit for obtaining the amount of weight modification dependent on the output of the input and output signal register, the output of the output layer $\delta$ computing unit and the output of the hidden layer $\delta$ computing unit.
The third invention is a learning machine in which the single sigmoid function computing unit in the construction of the first or the second invention consists of a sigmoid function element for outputting signals each having a saturation characteristic with respect to the input signals, an input selecting element for selecting among the product-sums output by the plurality of product-sum computing units so as to input them into the sigmoid function element, and a delay element for delaying the output signal of the sigmoid function element by a proper time.
In the first invention, with this construction, input signals are transferred in sequence through the input and output signal registers connected in cascade, the weight memories output the weights to be multiplied by the input signals, and the product-sums of the signals of the input and output signal registers and the weights stored in the weight memories are obtained by the parallel processing of a plurality of product-sum computing units. A first product-sum computing unit from among the plurality of product-sum computing units completes the computation of its product-sum first, and a second product-sum computing unit completes its product-sum computation at the next machine cycle. A single sigmoid function computing unit sequentially applies a function having a saturation characteristic (a sigmoid function), calculated by one sigmoid function element, to the product-sums output from the plurality of product-sum computing units. The output signals of the multi-input single-output circuits of the hidden layer obtained in this manner are sequentially loaded to the input and output signal registers connected in cascade. The product-sums are then obtained in sequence again by the parallel processing of the product-sum computing units. In the single sigmoid function computing unit, the sigmoid functions with respect to the product-sums are obtained in sequence by the one sigmoid function element so as to obtain the output signals of the multi-input single-output circuits of the output layer.
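The staggered completion that lets one sigmoid function element serve all product-sum computing units can be sketched as follows. This Python simulation is an illustration under assumptions, not the patented circuit: the cascade is modelled as a list shifted once per machine cycle, loading and multiplying are folded into the same cycle for brevity, the logistic function stands in for the sigmoid, and the names `sigmoid` and `cascade_layer` are hypothetical.

```python
import math

def sigmoid(x):
    # Assumed logistic sigmoid (the text does not fix the exact function).
    return 1.0 / (1.0 + math.exp(-x))

def cascade_layer(weights, inputs):
    # weights[j][i] holds W_ji; unit 0 sits on the register nearest the input.
    n_units, n_in = len(weights), len(inputs)
    regs = [None] * n_units            # cascaded input and output signal registers
    acc = [0.0] * n_units              # product-sum registers, initialized to zero
    count = [0] * n_units              # products accumulated so far by each unit
    outputs = []                       # (cycle, unit index, output signal)
    for cycle in range(n_in + n_units - 1):
        # Shift the cascade by one stage and load the next input signal, if any.
        nxt = inputs[cycle] if cycle < n_in else None
        regs = [nxt] + regs[:-1]
        for j in range(n_units):
            if regs[j] is not None:
                acc[j] += weights[j][count[j]] * regs[j]
                count[j] += 1
                if count[j] == n_in:
                    # Only one unit finishes per cycle, so a single
                    # sigmoid function element is enough.
                    outputs.append((cycle, j, sigmoid(acc[j])))
    return outputs
```

With four inputs and three units, the units finish in cycles 3, 4 and 5 (counting from 0), one per cycle, which is why the sigmoid function element never has to handle two product-sums at once.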
In the second invention, with this construction, the output signals of the multi-input single-output circuits of the hidden layer and the output layer are output by an operation of the input and output signal registers, the weight memories, the product-sum computing units and the single sigmoid function computing unit similar to that of the first invention. The output layer $\delta$ computing unit obtains the back-propagating signal $\delta$ of each multi-input single-output circuit of the output layer and sequentially transfers it to the input and output signal registers connected in cascade. When the back-propagating signals $\delta$ of all the multi-input single-output circuits of the output layer are set in the input and output signal registers, the weight memories output the weights showing the strength of the connections between a first multi-input single-output circuit of the hidden layer and the multi-input single-output circuits of the output layer. The plurality of product-sum computing units output products to the hidden layer $\delta$ computing unit at the same time. Namely, the products are obtained through the simultaneous multiplication of the back-propagating signals $\delta$ of the multi-input single-output circuits of the output layer by the weights showing the strength of the connections between the first multi-input single-output circuit of the hidden layer and the multi-input single-output circuits of the output layer. In the hidden layer $\delta$ computing unit, the back-propagating signal $\delta$ of the first multi-input single-output circuit of the hidden layer is obtained by multiplying the sum of the plurality of products by the differential coefficient of the sigmoid function in the first multi-input single-output circuit of the hidden layer. 
At the same time, a plurality of weight modification units obtain the amounts of modification of the weights showing the strength of the connections between the first multi-input single-output circuit of the hidden layer and the multi-input single-output circuits of the output layer, and output them to the weight memories. In the weight memories, the amounts of weight modification obtained in the weight modification units are added to the weights so as to modify the weights. In this manner, the calculation of $\delta$ of the first multi-input single-output circuit of the hidden layer and the modification of the weights showing the strength of the connections between the first multi-input single-output circuit of the hidden layer and the multi-input single-output circuits of the output layer are carried out at the same time. Thereafter, by repetition, the calculation of the back-propagating signals $\delta$ of all the multi-input single-output circuits of the hidden layer and the modification of the weights showing the strength of the connections between all the multi-input single-output circuits of the hidden layer and the multi-input single-output circuits of the output layer are carried out. The back-propagating signals $\delta$ of the multi-input single-output circuits of the hidden layer are transferred in sequence to the input and output signal registers connected in cascade. When the back-propagating signals $\delta$ of all the multi-input single-output circuits of the hidden layer are set in the input and output signal registers, the weights showing the strength of the connections between the first input signal of the input layer and the multi-input single-output circuits of the hidden layer are changed by the plurality of weight modification units and the weight memories. 
By repetition thereof, the weights showing the strength of the connections between the input signals of the input layer and the multi-input single-output circuits of the hidden layer are modified.
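The simultaneous $\delta$ calculation and weight modification for one hidden-layer circuit can be sketched in Python as below. Several details are assumptions made for illustration and are not stated in this passage: the logistic sigmoid (so that the differential coefficient is y(1 - y)), the error term (t - y), the learning rate `eta`, and the update rule "learning rate times $\delta$ times hidden output". The function names are hypothetical.

```python
def output_delta(y, t):
    # Back-propagating signal of an output circuit from its output y and
    # supervising signal t, assuming a logistic sigmoid: derivative y * (1 - y).
    return (t - y) * y * (1.0 - y)

def hidden_step(j, h, w_out, delta_out, eta):
    # Handle hidden circuit j: compute its delta and, "at the same time",
    # modify the weights connecting it to every output circuit.
    # w_out[k][j] is the weight from hidden circuit j to output circuit k.
    n_out = len(delta_out)
    # Products formed simultaneously by the product-sum computing units,
    # using the weights as they were before modification.
    products = [delta_out[k] * w_out[k][j] for k in range(n_out)]
    delta_hidden = sum(products) * h[j] * (1.0 - h[j])
    for k in range(n_out):
        # Assumed delta-rule update; the amount is added to the stored weight.
        w_out[k][j] += eta * delta_out[k] * h[j]
    return delta_hidden
```

Calling `hidden_step` once per hidden circuit yields all hidden-layer $\delta$ values while the output-layer weights are modified in the same pass, which is the overlap the second invention relies on to shorten the weight modification time.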
In the third invention, with this construction, the plurality of product-sum computing units sequentially output a product-sum at each machine cycle, through an operation of the input and output signal registers, the weight memories and the product-sum computing units similar to that of the first invention. In the single sigmoid function computing unit, the input selecting element sequentially selects the outputs of the plurality of product-sum computing units so as to input them to the one sigmoid function element. The sigmoid function element outputs a signal having a saturation characteristic with respect to the product-sum input, and the delay element delays the output signal of the sigmoid function element by a proper time before outputting it. The delay time of the signal in the delay element is chosen so that the product-sums of all the multi-input single-output circuits of the hidden layer are computed in the product-sum computing units within the delay time and so that the waiting time of the product-sum computing units is minimized. The output signals of the multi-input single-output circuits of the hidden layer are loaded in sequence to the input and output signal registers connected in cascade after being delayed by the proper time in the delay element as described hereinabove. The output signals of the multi-input single-output circuits of the output layer are obtained by an operation similar to that for obtaining the outputs of the multi-input single-output circuits of the hidden layer. Any delay time of the signals in the delay element is acceptable in this case. Thus, a learning machine with any number of input signals and any number of multi-input single-output circuits in the hidden layer and the output layer may be constituted.
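A minimal Python sketch of such a single sigmoid function computing unit follows, assuming the delay element behaves as a fixed-length first-in first-out line clocked once per machine cycle. The class name `SingleSigmoidUnit`, the `select` argument and the logistic function are assumptions made for illustration.

```python
import math
from collections import deque

class SingleSigmoidUnit:
    def __init__(self, delay_cycles):
        # Delay element: holds each output for `delay_cycles` machine cycles.
        self.line = deque([None] * delay_cycles)

    def step(self, product_sums, select):
        # Input selecting element: `select` is the index of the product-sum
        # computing unit that finished this cycle, or None if none did.
        if select is None:
            y = None
        else:
            # Sigmoid function element (assumed logistic form).
            y = 1.0 / (1.0 + math.exp(-product_sums[select]))
        self.line.append(y)
        return self.line.popleft()     # signal leaving the delay element
```

With `delay_cycles` set to 2, a value selected in one cycle appears at the output two cycles later. Changing that one parameter is how the sketch mirrors the idea of adapting the machine to a different number of input signals or circuits per layer.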





BRIEF DESCRIPTION OF THE DRAWINGS
These and other objects and features of the present invention will become apparent from the following description taken in conjunction with the preferred embodiment thereof with reference to the accompanying drawings, in which:
FIG. 1 is a block diagram of a learning machine in an embodiment of the present invention;
FIG. 2 is a block diagram of a product-sum computing unit in the embodiment;
FIGS. 3a-3d show schematic diagrams of the parallel processing of the product-sum computing units in the embodiment;
FIGS. 4a-4c show schematic diagrams of the parallel processing of product-sum computing units 5, 6 and weight modification units 32, 33 in the embodiment;
FIGS. 5a-5b show schematic diagrams of the parallel processing of the weight modification units 31, 32 and 33 in the embodiment;
FIG. 6 is a time chart showing the temporal change of the computing units working in the learning machine of the embodiment;
FIG. 7 is a block diagram of a learning machine in another embodiment of the present invention;
FIG. 8 is a time chart showing the temporal change of the computing units working in the learning machine of the embodiment;
FIG. 9 is a block diagram of the conventional learning machine;
FIG. 10 is a model diagram of a learning machine;
FIG. 11 is a block diagram of the conventional product-sum and sigmoid function computing unit;
FIGS. 12a-12b show schematic diagrams of the parallel processing of the conventional product-sum and sigmoid function computing units; and
FIG. 13 is a time chart showing the temporal sequence of the computing units operating in the conventional learning machine.





DETAILED DESCRIPTION OF THE INVENTION
Before the description of the present invention proceeds, it is to be noted that like parts are denoted by like reference numerals throughout the accompanying drawings.
Referring now to the drawings, there is shown in FIG. 1 a block diagram of a learning machine according to one preferred embodiment of the present invention, which includes input and output signal registers 1, 2 and 3, product-sum computing units 4, 5 and 6, weight memories 7, 8 and 9, a sigmoid function computing unit 10, a signal switching unit 11, an input signal register 12, weight modification units 31, 32 and 33, a hidden layer $\delta$ calculating unit 34, and an output layer $\delta$ calculating unit 35.
As shown in FIG. 1, the learning machine in the present embodiment is composed of cascade connections of the input and output signal registers 1, 2 and 3. A model diagram of the learning machine in the present embodiment is shown in FIG. 10. It can be expressed as a model with multi-input single-output circuits connected in a hierarchical structure. The present embodiment is a learning machine composed of four input terminals in the input layer 73, three multi-input single-output circuits in the hidden layer 74, and two multi-input single-output circuits in the output layer 75. The multi-input single-output circuits 68, 69, 70, 71 and 72 of each layer output signals each having a saturation characteristic with respect to the product-sum of a plurality of input signals and the corresponding weights.
In the block diagram of the present embodiment of FIG. 1, the weight memory 7 stores the weights to be multiplied in the third multi-input single-output circuit 70 of the hidden layer, the weight memory 8 stores the weights to be multiplied in the second multi-input single-output circuit 69 of the hidden layer and the second multi-input single-output circuit 72 of the output layer, and the weight memory 9 stores the weights to be multiplied in the first multi-input single-output circuit 68 of the hidden layer and the first multi-input single-output circuit 71 of the output layer. Signals input from the input terminals 64, 65, 66 and 67 are loaded in sequence into the input signal register 12. The signal switching unit 11 is set so as to transfer the output signal of the input signal register 12 into the input and output signal register 3. The signal stored in the input and output signal register 3 is transferred into the input and output signal register 2 at the next machine cycle, and the signal stored in the input and output signal register 2 is transferred into the input and output signal register 1 at the next machine cycle. In this manner, the signals stored in the input and output signal registers 3, 2 and 1 are transferred in sequence. The product-sum computing units 4, 5 and 6 obtain the product-sums of the weights stored in the weight memories 7, 8 and 9 and the signals stored in the input and output signal registers 1, 2 and 3. The sigmoid function computing unit 10 outputs a signal having a saturation characteristic in accordance with formula (1) with respect to the product-sums that the product-sum computing units 4, 5 and 6 output. The output signals of the sigmoid function computing unit 10 are output to the signal switching unit 11. At this time, the signal switching unit 11 is set to transfer the output of the sigmoid function computing unit 10 into the input and output signal register 3.
FIG. 2 shows a block diagram of the product-sum computing units 4, 5 and 6. In FIG. 2, reference numeral 13 is a multiplier, reference numeral 14 is an adder, reference numeral 15 is a product-sum register, reference numeral 16 is a signal input terminal, reference numeral 17 is an input terminal of weights, reference numeral 18 is an output terminal of product-sum, and reference numeral 36 is an output terminal of products. The operation of the product-sum computing units 4, 5 and 6 is as follows. The signal stored in the product-sum register 15 is initialized to zero. The multiplier 13 outputs to the adder 14 the product of the signal input from the signal input terminal 16 and the weight input from the input terminal of weights 17. The adder 14 obtains the sum of the product output from the multiplier 13 and the product-sum stored in the product-sum register 15, and outputs it to the product-sum register 15. By repeating this operation of obtaining the product and the sum, the product-sum of the signals input from the signal input terminal 16 and the weights input from the input terminal of weights 17 is stored in the product-sum register 15, and the product-sum is output from the output terminal of product-sum 18.
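The behaviour of one product-sum computing unit of FIG. 2 can be sketched in Python as follows. The class name `ProductSumUnit` and its method names are hypothetical; the comments map each step to the reference numerals of FIG. 2.

```python
class ProductSumUnit:
    def __init__(self):
        self.register = 0.0            # product-sum register 15

    def reset(self):
        self.register = 0.0            # initialize the register to zero

    def step(self, signal, weight):
        # One machine cycle: multiply, then accumulate.
        product = signal * weight                 # multiplier 13 (terminals 16 and 17)
        self.register = product + self.register   # adder 14 writes back to register 15
        # The product leaves on terminal 36 and the product-sum on terminal 18.
        return product, self.register
```

Returning both values reflects the two output terminals: the running product-sum is used in the forward pass, while the individual product is the quantity the second invention forwards to the hidden layer $\delta$ calculating unit.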
FIG. 3 shows the schematic diagram of the parallel processing of product-sum computing units 4, 5 and 6 in obtaining the outputs of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74. In FIG. 3, X.sub.i (1.ltoreq.i.ltoreq.4) is an input signal, W.sub.ji (1.ltoreq.i.ltoreq.4, 1.ltoreq.j.ltoreq.3) is a weight to be multiplied at the j-th multi-input single-output circuit of the hidden layer 74 by the input signal X.sub.i. The weight memory 7 stores the weights W.sub.3i (1.ltoreq.i.ltoreq.4) to be multiplied at the third multi-input single-output circuit 70 of the hidden layer. The weight memory 8 stores the weights W.sub.2i (1.ltoreq.i.ltoreq.4) to be multiplied at the second multi-input single-output circuit 69 of the hidden layer. The weight memory 9 stores the weights W.sub.1i (1.ltoreq.i.ltoreq.4) to be multiplied at the first multi-input single-output circuit 68 of the hidden layer. An input signal X.sub.1 is loaded to the input signal register 12, and the signal switching unit 11 is set to transfer the output of the input signal register 12 into the input and output signal register 3. The product-sum registers of the product-sum computing units 4, 5 and 6 are initialized with zero. The operation of the product-sum computing unit 6 at the next machine.cycle is shown in the (a) of FIG. 3. The product-sum computing unit 6 obtains a product of the weight W.sub.11 stored in the weight memory 9 and the input signal X.sub.1 stored in the input and output signal register 3 so as to store it in the product-sum register of the product-sum computing unit 6. At this time, the input signal X.sub.2 is loaded to the input signal register 12 simultaneously. The parallel processing of the product-sum computing units 5 and 6 at the next machine.cycle is shown in the (b) of FIG. 3. 
The product-sum computing unit 6 obtains a product W.sub.12 X.sub.2 of a signal X.sub.2 stored in the input and output signal register 3 and a weight W.sub.12 stored in the weight memory 9, and adds it to W.sub.11 X.sub.1 stored in the product-sum register so as to store the sum W.sub.11 X.sub.1 +W.sub.12 X.sub.2 (11). At the same time, the product-sum computing unit 5 obtains the product
W.sub.21 X.sub.1 (12)
At this time, the input signal X.sub.3 is loaded to the input signal register 12 simultaneously. Likewise, the signals stored in the input signal register 12 and the input and output signal registers 3, 2, 1 are transferred in a sequence. The product-sum computing units 6, 5 and 4 obtain (see FIG. 3 (c), (d)) the product-sums of the weights stored in the weight memories 9, 8 and 7 and the signals stored in the input and output signal registers 3, 2 and 1. When the product-sum computing unit 6 obtains (see FIG. 3 (d)) W.sub.11 X.sub.1 +W.sub.12 X.sub.2 +W.sub.13 X.sub.3 +W.sub.14 X.sub.4 (13), the product-sum computing unit 5 obtains at the next machine.cycle W.sub.21 X.sub.1 +W.sub.22 X.sub.2 +W.sub.23 X.sub.3 +W.sub.24 X.sub.4 (14), and the product-sum computing unit 4 obtains at the next machine.cycle W.sub.31 X.sub.1 +W.sub.32 X.sub.2 +W.sub.33 X.sub.3 +W.sub.34 X.sub.4 (15). In this manner, the product-sum computing units 6, 5 and 4 output to the sigmoid function computing unit 10 the product-sums in the multi-input single-output circuits of the hidden layer, being delayed respectively by 1 machine.cycle. The sigmoid function computing unit 10 obtains signals having a saturation characteristic given in the (formula 1) with respect to the input product-sums so as to output them to the signal switching unit 11, delayed respectively by one machine.cycle. They correspond to the output values of the multi-input single-output circuits of the hidden layer expressed by ##EQU8## The output signals of the multi-input single-output circuits 68, 69 and 70 of the hidden layer 74 are obtained in such a manner as described hereinabove.
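The skewed parallel processing of FIG. 3 can be illustrated with a small cycle-by-cycle simulation. The following is a sketch under stated assumptions: the function name, the list-based register chain and the use of `None` to mark a register without an effective signal are illustrative choices, and only the hidden-layer product-sums are modeled (the sigmoid function computing unit 10 and the output layer are left out).

```python
def simulate_hidden_layer(x, W):
    """Cycle-by-cycle model of the skewed product-sum pipeline of FIG. 3.

    x    : input signals X_1..X_n, fed one per machine cycle
    W[j] : weights of the (j+1)-th hidden circuit; W[0] plays the role of
           weight memory 9 / unit 6, W[1] of memory 8 / unit 5, and so on.
    """
    n_units = len(W)
    regs = [None] * n_units    # regs[0] ~ register 3, regs[1] ~ register 2, ...
    acc = [0.0] * n_units      # product-sum registers, initialized with zero
    count = [0] * n_units      # number of terms each unit has accumulated
    done_cycle = [None] * n_units
    active_per_cycle = []
    # None marks a cycle in which no effective signal enters the chain.
    stream = list(x) + [None] * (n_units - 1)
    for cycle, incoming in enumerate(stream, start=1):
        regs = [incoming] + regs[:-1]   # shift the register chain by one stage
        active = 0
        for j in range(n_units):
            if regs[j] is not None:
                acc[j] += W[j][count[j]] * regs[j]
                count[j] += 1
                active += 1
                if count[j] == len(x):
                    done_cycle[j] = cycle
        active_per_cycle.append(active)
    return acc, done_cycle, active_per_cycle
```

With four input signals and three units, the number of working units per cycle starts as 1, 2, 3, 3, and the three product-sums become available on consecutive cycles, which is why a single sigmoid function computing unit can serve all three.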
The signal switching unit 11 is set so that the output signals of the sigmoid function computing unit 10 may be transferred to the input and output signal register 3 when the output signals of the hidden layer 74 are input from the sigmoid function computing unit 10. Thus, the output Y.sub.j (1.ltoreq.j.ltoreq.3) of the sigmoid function computing unit 10 is transferred in a sequence to the input and output signal register 3. As the multi-input single-output circuits composing the output layer are two in number as shown in FIG. 10, the product-sums in the multi-input single-output circuits of the output layer are obtained by the parallel processing of the product-sum computing units 6 and 5. The weight memory 8 stores the weights V.sub.2j (1.ltoreq.j.ltoreq.3) to be multiplied at the second multi-input single-output circuit 72 of the output layer and the weight memory 9 stores the weights V.sub.1j (1.ltoreq.j.ltoreq.3) to be multiplied at the first multi-input single-output circuit 71 of the output layer. The product-sum computing unit 6 outputs to the sigmoid function computing unit 10 the product-sum V.sub.11 Y.sub.1 +V.sub.12 Y.sub.2 +V.sub.13 Y.sub.3 (19). Being delayed by 1 machine.cycle thereafter, the product-sum computing unit 5 outputs to the sigmoid function computing unit 10 the product-sum V.sub.21 Y.sub.1 +V.sub.22 Y.sub.2 +V.sub.23 Y.sub.3 (20). The sigmoid function computing unit 10 obtains the sigmoid function given in the (formula 2) with respect to the product-sums so as to output them to the output layer .delta. calculating unit 35, delayed respectively by 1 machine.cycle. They correspond to the output values of the multi-input single-output circuits of the output layer expressed by ##EQU11## The output signals of the multi-input single-output circuits 71 and 72 of the output layer 75 are obtained as described hereinabove.
The output signals of the multi-input single-output circuits 71 and 72 of the output layer 75 to be obtained by the sigmoid function computing unit 10 are input in a sequence to the output layer .delta. calculating unit 35. The output layer .delta. calculating unit 35 obtains the back-propagating signal .delta. of the multi-input single-output circuit of the output layer in accordance with the (formula 23) dependent on the output signal Z.sub.k (1.ltoreq.k.ltoreq.2) of the multi-input single-output circuit of the output layer and a supervising signal t.sub.k (1.ltoreq.k.ltoreq.2). ##EQU12##
In the (formula 23), .delta..degree..sub.k is a back-propagating signal of the k-th multi-input single-output circuit of the output layer, Z.sub.k is an output signal of the multi-input single-output circuit, t.sub.k is a supervising signal of the multi-input single-output circuit, Z'.sub.k is a differential of the sigmoid function of the multi-input single-output circuit. In this manner, the back-propagating signal .delta. of the multi-input single-output circuit of the output layer is obtained.
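As an illustration of how a back-propagating signal of the output layer might be computed from the quantities named above, the following sketch may be helpful. It rests on two assumptions that are not confirmed by this section: that the (formula 23) has the common back-propagation form .delta..degree..sub.k =(t.sub.k -Z.sub.k)Z'.sub.k, and that the sigmoid function is the logistic function, so that Z'.sub.k =Z.sub.k (1-Z.sub.k). The function name is hypothetical.

```python
def output_layer_delta(z, t):
    """Back-propagating signals of the output layer (illustrative sketch).

    z : output signals Z_k of the output layer
    t : supervising signals t_k
    Assumption: delta_k = (t_k - Z_k) * Z'_k with a logistic sigmoid,
    so that Z'_k = Z_k * (1 - Z_k). The actual formula 23 may differ.
    """
    return [(tk - zk) * zk * (1.0 - zk) for zk, tk in zip(z, t)]


# Hypothetical sample: one output of 0.5 against a supervising signal of 1.0.
delta = output_layer_delta([0.5], [1.0])
```

If the learning machine uses a different saturation function, only the derivative term would need to change.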
FIG. 4 shows schematic diagrams of the parallel processing of the product-sum computing units 5 and 6 and the weight modification units 32 and 33 in obtaining the back-propagating signal .delta. of the multi-input single-output circuit of the hidden layer and modifying the weights equivalent to the strength of the connections between the multi-input single-output circuits of the hidden layer and those of the output layer. In FIG. 4, Y.sub.j (1.ltoreq.j.ltoreq.3) is the output signal of the j-th multi-input single-output circuit of the hidden layer, V.sub.kj (1.ltoreq.j.ltoreq.3, 1.ltoreq.k.ltoreq.2) is the weight to be multiplied at the k-th multi-input single-output circuit of the output layer 75 by the output signal Y.sub.j of the multi-input single-output circuit of the hidden layer. The .delta..degree..sub.k (1.ltoreq.k.ltoreq.2) which is the back-propagating signal of the multi-input single-output circuit of the output layer is input to the signal switching unit 11 from the output layer .delta. calculating unit 35 in the order of .delta..degree..sub.2, .delta..degree..sub.1. At this time, the signal switching unit 11 is set so that the output of the output layer .delta. calculating unit 35 is transferred to the input and output signal register 3. In the order of the .delta..degree..sub.2, .delta..degree..sub.1, they are transferred to the input and output signal registers 3 and 2. At the moment when the .delta..degree..sub.2 is loaded to the input and output signal register 2 and the .delta..degree..sub.1 is loaded to the input and output signal register 3, the transferring of the signals between the input and output signal registers is suspended. 
As described hereinabove, the weight memory 8 stores the weights V.sub.2j (1.ltoreq.j.ltoreq.3) equivalent to the strength of the connections between the second multi-input single-output circuit 72 of the output layer and the multi-input single-output circuits of the hidden layer, and the weight memory 9 stores the weights V.sub.1j (1.ltoreq.j.ltoreq.3) equivalent to the strength of the connections between the first multi-input single-output circuit 71 of the output layer and the multi-input single-output circuits of the hidden layer. FIG. 4 (a) shows a schematic diagram of the parallel processing of the product-sum computing units 5 and 6 and the weight modification units 32 and 33 at this machine.cycle. The multiplier 13 in the product-sum computing unit 6 multiplies .delta..degree..sub.1 output from the input and output signal register 3 by V.sub.11 output from the weight memory 9 so as to output the product to the hidden layer .delta. calculating unit 34. At the same time, the multiplier 13 in the product-sum computing unit 5 multiplies .delta..degree..sub.2 output from the input and output signal register 2 by V.sub.21 output from the weight memory 8 so as to output the product to the hidden layer .delta. calculating unit 34. In the hidden layer .delta. calculating unit 34, the sum of the two products V.sub.11 .delta..degree..sub.1 and V.sub.21 .delta..degree..sub.2 is obtained, and is multiplied by a differential Y'.sub.1 of the sigmoid function of the first multi-input single-output circuit of the hidden layer so as to obtain the back-propagating signal of the first multi-input single-output circuit of the hidden layer, .delta..sup.h.sub.1 =(V.sub.11 .delta..degree..sub.1 +V.sub.21 .delta..degree..sub.2)Y'.sub.1 (24). At the same time, the output value Y.sub.1 of the first multi-input single-output circuit of the hidden layer from the sigmoid function computing unit 10 is input to the weight modification units 33 and 32. In the weight modification unit 33, a learning rate .epsilon. 
is multiplied by a back-propagating signal .delta..degree..sub.1 stored in the input and output signal register 3, and is multiplied by the output value Y.sub.1 of the first multi-input single-output circuit of the hidden layer so as to obtain the modification amount of V.sub.11 equivalent to the strength of the connections between the first multi-input single-output circuit of the hidden layer and the first multi-input single-output circuit of the output layer.
.DELTA.V.sub.11 =.epsilon..delta..degree..sub.1 Y.sub.1 (25)
At the same time, the modification amount of V.sub.21
.DELTA.V.sub.21 =.epsilon..delta..degree..sub.2 Y.sub.1 (26)
is obtained in the weight modification unit 32. In the weight memories 9 and 8, the weights V.sub.11 and V.sub.21 are modified dependent on the amounts of weight modification .DELTA.V.sub.11 and .DELTA.V.sub.21 output from the weight modification units 33 and 32. At the subsequent machine.cycles, as shown in FIG. 4 (b), (c), the sigmoid function computing unit 10 outputs the output values Y.sub.2 and Y.sub.3 of the multi-input single-output circuits of the hidden layer in a sequence, with .delta..degree..sub.1 and .delta..degree..sub.2 which are the back-propagating signals of the multi-input single-output circuits of the output layer being retained in the input and output signal registers 3 and 2. The weight memories 9 and 8 output the corresponding weights in a sequence. The hidden layer .delta. calculating unit 34 obtains in a sequence .delta..sup.h.sub.j =(V.sub.1j .delta..degree..sub.1 +V.sub.2j .delta..degree..sub.2)Y'.sub.j (2.ltoreq.j.ltoreq.3) (27) by the operation similar to that of the previous machine.cycle, and the weights V.sub.kj (2.ltoreq.j.ltoreq.3, 1.ltoreq.k.ltoreq.2) in the weight memories 9 and 8 are modified in a sequence. In a manner as described hereinabove, the back-propagating signal .delta. of the multi-input single-output circuit of the hidden layer is obtained, and the weights equivalent to the strength of the connections between the multi-input single-output circuits of the hidden layer and those of the output layer are modified.
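The operation of FIG. 4, in which the hidden-layer back-propagating signals are obtained while the weights V.sub.kj are modified, can be sketched as follows. The function and argument names are illustrative. The derivatives Y'.sub.j are passed in as data so that no particular sigmoid function has to be assumed, and it is assumed that the modification amount is added to the stored weight, which the text does not state explicitly.

```python
def backprop_output_weights(V, delta_o, Y, Y_prime, eps):
    """One pass of FIG. 4 (illustrative sketch).

    V[k][j]  : weight between hidden circuit j and output circuit k
    delta_o  : back-propagating signals of the output layer (held in registers)
    Y        : output values of the hidden layer
    Y_prime  : differentials of the sigmoid function for the hidden layer
    eps      : learning rate
    Returns the hidden-layer back-propagating signals; V is modified in place.
    """
    delta_h = []
    for j in range(len(Y)):  # one machine cycle per hidden circuit
        # Multipliers of the product-sum computing units work in parallel,
        # using the weights as they are before this cycle's modification.
        products = [V[k][j] * delta_o[k] for k in range(len(delta_o))]
        # Hidden layer delta calculating unit 34: sum, then multiply by Y'_j.
        delta_h.append(sum(products) * Y_prime[j])
        # Weight modification units: Delta V_kj = eps * delta_o_k * Y_j.
        for k in range(len(delta_o)):
            V[k][j] += eps * delta_o[k] * Y[j]  # assumed additive update
    return delta_h
```

Each iteration of the outer loop corresponds to one of the panels (a), (b), (c) of FIG. 4.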
FIG. 5 shows schematic diagrams of the parallel processing of the weight modification units 31, 32 and 33 in the modification of the weights equivalent to the strength of the connections between the input terminals of the input layer and the multi-input single-output circuits of the hidden layer. In FIG. 5, X.sub.i (1.ltoreq.i.ltoreq.4) are the input signals from the input terminals and W.sub.ji (1.ltoreq.i.ltoreq.4, 1.ltoreq.j.ltoreq.3) are the weights to be multiplied at the j-th multi-input single-output circuit of the hidden layer by the input signals X.sub.i. .delta..sup.h.sub.j (1.ltoreq.j.ltoreq.3) which are the back-propagating signals of the multi-input single-output circuits of the hidden layer are input to the signal switching unit 11 from the hidden layer .delta. calculating unit 34 in the order of .delta..sup.h.sub.3, .delta..sup.h.sub.2, .delta..sup.h.sub.1. At this time, the signal switching unit 11 is set so that the output of the hidden layer .delta. calculating unit 34 may be transferred to the input and output signal register 3. They are transferred to the input and output signal registers 3, 2 and 1 in the order of the .delta..sup.h.sub.3, the .delta..sup.h.sub.2, the .delta..sup.h.sub.1. At the moment when the .delta..sup.h.sub.3 is stored in the input and output signal register 1, the .delta..sup.h.sub.2 is stored in the input and output signal register 2 and the .delta..sup.h.sub.1 is stored in the input and output signal register 3, the transferring operation of the signals between the input and output signal registers is suspended. The schematic diagram of the parallel processing of the weight modification units 31, 32 and 33 at the next machine.cycle is shown in FIG. 5 (a). The first input signal X.sub.1 is input from the input signal register 12 to the weight modification units 33, 32 and 31. 
In the weight modification unit 33, .delta..sup.h.sub.1 stored in the input and output signal register 3 is multiplied by a learning rate .epsilon., and is multiplied by the first input signal X.sub.1 so as to obtain the modification amount of W.sub.11
.DELTA.W.sub.11 =.epsilon..delta..sup.h.sub.1 X.sub.1 (28)
equivalent to the strength of the connection between the first input terminal of the input layer and the first multi-input single-output circuit of the hidden layer. At the same time, the modification amount of W.sub.j1 (2.ltoreq.j.ltoreq.3) is obtained in the weight modification units 32 and 31.
.DELTA.W.sub.j1 =.epsilon..delta..sup.h.sub.j X.sub.1 (29)
In the weight memories 9, 8 and 7, the weights W.sub.j1 (1.ltoreq.j.ltoreq.3) are modified dependent on the amounts of weight modification .DELTA.W.sub.j1 (1.ltoreq.j.ltoreq.3) output from the weight modification units 33, 32 and 31. The schematic diagram of the parallel processing of the weight modification units 31, 32 and 33 at the subsequent machine.cycle is shown in FIG. 5 (b). The i-th input signal X.sub.i (2.ltoreq.i.ltoreq.4) is input from the input signal register 12 to the weight modification units 33, 32 and 31 so as to obtain the modification amount of W.sub.ji
.DELTA.W.sub.ji =.epsilon..delta..sup.h.sub.j X.sub.i (2.ltoreq.i.ltoreq.4, 1.ltoreq.j.ltoreq.3) (30)
In the weight memories 9, 8 and 7, the weights W.sub.ji are modified in accordance with the amounts of modification .DELTA.W.sub.ji (2.ltoreq.i.ltoreq.4, 1.ltoreq.j.ltoreq.3) output from the weight modification units 33, 32 and 31. In a manner as described hereinabove, the weights equivalent to the strength of the connections between the input terminals of the input layer and the multi-input single-output circuits of the hidden layer are modified.
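The modification of the input-to-hidden weights described with FIG. 5 may be sketched as below. The function name is hypothetical, and as an assumption the modification amount is added to the stored weight.

```python
def update_hidden_weights(W, delta_h, x, eps):
    """Modify W[j][i] by Delta W_ji = eps * delta_h_j * X_i (illustrative sketch).

    The outer loop corresponds to one machine cycle per input signal X_i;
    the inner loop stands for the weight modification units working in parallel
    on the back-propagating signals held in the input and output signal registers.
    """
    for i, xi in enumerate(x):
        for j, dj in enumerate(delta_h):
            W[j][i] += eps * dj * xi  # assumed additive update
    return W


# Hypothetical sample with two hidden circuits and two input signals.
W = update_hidden_weights([[0.0, 0.0], [0.0, 0.0]], [1.0, 2.0], [1.0, 3.0], 0.5)
```

Because the back-propagating signals stay fixed in the registers, only the input signal changes from cycle to cycle.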
FIG. 6 shows a time chart showing the temporal change of the computing units working in the learning machine in the present embodiment. Because the input signals are input in a sequence from the input signal register 12, only the product-sum computing unit 6 is working (see FIG. 3 (a)) at the first machine.cycle and the product-sum computing units 6 and 5 are working at the next machine.cycle (see FIG. 3 (b)). In such a manner as described hereinabove, the number of the product-sum computing units for carrying out the parallel processing operation changes to 1, 2, 3, 3 for each machine.cycle (see FIG. 3). At this time, the product-sum computing unit 6 outputs the product-sum given in the (formula 13), and at the next machine.cycle, the sigmoid function computing unit 10 obtains in accordance with the (formula 16) the sigmoid function with respect to the product-sum expressed by the (formula 13). As no effective signal is stored in the input and output signal register 3 at this time, the product-sum computing unit 6 is not working. Therefore, the number of the product-sum computing units for carrying out the parallel processing operation at this machine.cycle is 2. In this manner, the time required for obtaining the output of the first multi-input single-output circuit of the hidden layer is
machine.cycle.times.(number of signals of the input layer+1)(31)
At the next machine.cycle, the sigmoid function computing unit 10 obtains the sigmoid function with respect to the (formula 14) in accordance with the (formula 17). The input and output signal register 3 stores the output signal of the first multi-input single-output circuit of the hidden layer. The product-sum computing unit 6 starts its computation of the product-sum of the first multi-input single-output circuit of the output layer. At this time, no effective signal is stored in the input and output signal register 2, and the product-sum computing unit 5 is not working. The product-sum computing unit 4 is computing the product-sum given in the (formula 15). Thus the number of the product-sum computing units for carrying out the parallel processing operation at this machine.cycle is 2. The output signals of the multi-input single-output circuits of the hidden layer are transferred in a sequence by the input and output signal registers 3 and 2 from the next machine.cycle. The number of the product-sum computing units for carrying out the parallel processing operation is 2, 2 for each machine.cycle, and then the product-sum computing unit 6 outputs the product-sum given in the (formula 19). At the next machine.cycle, the sigmoid function computing unit 10 obtains in accordance with the (formula 21) the sigmoid function with respect to the product-sum given in the (formula 19). At this time, no effective signal is stored in the input and output signal register 3 and the product-sum computing unit 6 is not working. Hence, the number of the product-sum computing units for carrying out the parallel processing operation at this machine.cycle is 1. Thereafter, one machine.cycle is required for obtaining, in accordance with the (formula 22), the sigmoid function with respect to the product-sum given in the (formula 20). Thus, the time required for obtaining the outputs of all the multi-input single-output circuits of the output layer is
machine.cycle.times.(number of output signals of the hidden layer+number of output signals of the output layer) (32)
Then, the back-propagating signals of the multi-input single-output circuits of the output layer are obtained in the order of .delta..degree..sub.1, .delta..degree..sub.2 in the output layer .delta. calculating unit 35. The obtained back-propagating signals are transferred to the input and output signal register in the order of .delta..degree..sub.2, .delta..degree..sub.1 with the order being reversed. At the next machine.cycle, the back-propagating signal of the first multi-input single-output circuit in the hidden layer is obtained, and at the same time, the weights equivalent to the strength of the connections between the first multi-input single-output circuit of the hidden layer and the multi-input single-output circuits of the output layer are modified (see FIG. 4 (a)). The computing units working at this time are the product-sum computing units 6, 5, the hidden layer .delta. calculating unit 34 and the weight modification units 33, 32. In this manner, the back-propagating signals of the output layer are transferred to the input and output signal registers, and the back-propagating signals of the hidden layer are calculated. The time required for obtaining the back-propagating signals of the hidden layer is
machine.cycle.times.(number of output signals of the hidden layer+number of output signals of the output layer+1) (33)
The back-propagating signals of the multi-input single-output circuits of the hidden layer obtained in this manner are transferred to the input and output signal register in the order of .delta..sup.h.sub.3, .delta..sup.h.sub.2, .delta..sup.h.sub.1. At the next machine.cycle, the weights equivalent to the strength of the connections between the first input terminal of the input layer and the multi-input single-output circuits of the hidden layer (see FIG. 5 (a)) are modified. The computing units working at this time are the product-sum computing units 6, 5, 4 and the weight modification units 33, 32, 31. In this manner, the back-propagating signals of the hidden layer are transferred to the input and output signal registers, and the weights for connecting the input layer with the hidden layer are modified. The time required for the transferring and the modifying is
machine.cycle.times.(number of signals of the input layer+number of output signals of the hidden layer) (34)
Following the above operation, the time required for obtaining the output signal of the output layer from the input signal is
machine.cycle.times.(number of signals of the input layer+number of output signals of the hidden layer+number of output signals of the output layer+1)(35)
Also, the time required for the weight modification to be completed after the moment when the output signal of the output layer is obtained is
machine.cycle.times.{number of signals of the input layer+2.times.number of output signals of the hidden layer+number of output signals of the output layer+1} (36)
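The timing expressions (31) through (36) can be evaluated for a given layer configuration with a short helper. The function name and dictionary keys below are illustrative; the values are in units of one machine.cycle.

```python
def timing(n_in, n_hid, n_out):
    """Evaluate formulas 31 to 36 in machine cycles (illustrative sketch)."""
    return {
        "first_hidden_output": n_in + 1,                  # formula 31
        "output_layer": n_hid + n_out,                    # formula 32
        "hidden_delta": n_hid + n_out + 1,                # formula 33
        "input_weight_update": n_in + n_hid,              # formula 34
        "forward_total": n_in + n_hid + n_out + 1,        # formula 35
        "learning_total": n_in + 2 * n_hid + n_out + 1,   # formula 36
    }


# Configuration of FIG. 10: 4 input signals, 3 hidden circuits, 2 output circuits.
t = timing(4, 3, 2)
```

For this configuration the forward pass takes 10 machine cycles and the subsequent weight modification takes 13. The totals are the sums of the partial times: (35) equals (31) plus (32), and (36) equals (33) plus (34).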
According to the present embodiment as described hereinabove, from the input signal register 12, the input signals are input in a sequence, the signals are transferred in a sequence by the input and output signal registers 3, 2 and 1 and the product-sum computing units 6, 5 and 4 for carrying out the parallel processing operation output the product-sum of the multi-input single-output circuits with delay of one machine.cycle respectively. Thus, the sigmoid function computing unit 10 may be reduced to one. In the present embodiment, the time given in the (formula 35) is required to obtain the output signal from the input signal. The increase in the time for calculating the output signal, due to the reduction in the number of the sigmoid function computing units, as compared with the required time given in the (formula 7) in the conventional embodiment is
machine.cycle.times.(number of output signals of the output layer+1)-(time for computing the sigmoid function of the hidden layer and the output layer) (37)
In the conventional learning machine, one machine.cycle is required for computing the sigmoid function of each of the hidden layer and the output layer, namely two machine.cycles in total. When 2 is substituted for the number of output signals of the output layer in the (formula 37), the increase in time for calculating the output signal is therefore one machine.cycle. The first effect of the present embodiment is that a learning machine smaller in scale than before may be realized with a small increase of the time for obtaining the output signals.
The second effect of the present embodiment is that the time required from when the output signal of the output layer is obtained until the weight modification is completed can be shortened by the parallel processing of the product-sum computing units 4, 5, 6, the weight modification units 31, 32, 33 and the hidden layer .delta. calculating unit 34. This time is shortened from the time given in the (formula 10) in the conventional embodiment down to the time given in the (formula 36) in the present embodiment. When 4 as the number of signals of the input layer, 3 as the number of output signals of the hidden layer and 2 as the number of output signals of the output layer are substituted into the (formula 10) and the (formula 36), the time to be shortened is
machine.cycle.times.4 (38)
In the present embodiment, although the product-sum computing unit 6 obtains the product-sum of the first multi-input single-output circuit of the output layer given in the (formula 19), it may be given as V.sub.11 Y.sub.1 +V.sub.12 Y.sub.2 +V.sub.13 Y.sub.3 +V.sub.10 (39). The V.sub.10 is a threshold value of the first multi-input single-output circuit of the output layer. When such a computing operation is carried out, 1 is transferred to the input and output signal register 3 through the signal switching unit 11 from the input signal register 12 so as to obtain the product of 1 and V.sub.10 in the product-sum computing unit 6. According to such an operation, in FIG. 6, the waiting time for computing the sigmoid function is removed.
In the present embodiment, although the back-propagating signals obtained by an output layer .delta. calculating unit 35 are transferred to the input and output signal register 3 in the order of .delta..degree..sub.2, .delta..degree..sub.1, they may be transferred to the input and output signal register 2 in the order of .delta..degree..sub.1, .delta..degree..sub.2 and they may be transferred to the input and output signal register 3 from the input and output signal register 2. According to such an operation, in FIG. 6, the waiting time for transferring .delta. is removed. According to the method, the time required before the weight modification is completed after the output signal of the output layer is obtained can be shortened to
machine.cycle.times.(number of signals of the input layer+number of output signals of the hidden layer) (40)
By the comparison of the time given in the (formula 10) required in the conventional embodiment with that given in the (formula 40), with the number of signals of the input layer being 4, the number of output signals of the hidden layer being 3 and the number of output signals of the output layer being 2, the shortened time is
machine.cycle.times.10 (41)
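The savings stated in the (formula 38) and the (formula 41) can be cross-checked against the (formula 36) and the (formula 40). The (formula 10) of the conventional embodiment is not reproduced in this part of the description, so the sketch below only infers the conventional time implied by each stated saving and confirms that the two inferences agree. The variable names are illustrative.

```python
n_in, n_hid, n_out = 4, 3, 2

t36 = n_in + 2 * n_hid + n_out + 1   # formula 36: present embodiment
t40 = n_in + n_hid                   # formula 40: with the reordered delta transfer

saving_38 = 4    # formula 38, as stated in the text
saving_41 = 10   # formula 41, as stated in the text

# Conventional time (formula 10) implied by each saving; inferred, not quoted.
implied_t10_from_38 = t36 + saving_38
implied_t10_from_41 = t40 + saving_41
```

Both inferences give the same value of 17 machine cycles, so the two stated savings are mutually consistent for the configuration of FIG. 10.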
FIG. 7 is a block diagram of the sigmoid function computing unit 10 in a learning machine in another embodiment of the present invention having the whole construction of FIG. 1. In FIG. 7, reference numeral 19 denotes an input selection unit, reference numeral 20 denotes a sigmoid function computing element, reference numeral 21 denotes a delay unit, reference numeral 22 denotes a first product-sum input terminal, reference numeral 23 denotes a second product-sum input terminal, reference numeral 24 denotes a third product-sum input terminal. In this embodiment of the present invention, the sigmoid function computing unit 10 is provided with the delay unit 21.
An effect in the present embodiment is that learning machines of various constructions, differing in the number of the input signals and in the number of the multi-input single-output circuits of the hidden layer and the output layer, may be realized by the adjustment of the delay time in the delay unit 21. When the delay time is made zero, a learning machine with 4 input signals, 3 multi-input single-output circuits of the hidden layer and 2 multi-input single-output circuits of the output layer can be realized by the same operation as the previous embodiment.
In the description of operation of the present embodiment hereafter, it is shown how the learning machine with 4 input signals, 4 multi-input single-output circuits of the hidden layer, 2 multi-input single-output circuits of the output layer may be realized by the adjustment of the delay time of the delay unit 21 without changing the construction.
FIG. 8 shows a time chart showing the temporal change of the computing units working in the learning machine in the present embodiment. As four input signals are input in a sequence from the input signal register 12, only the product-sum computing unit 6 operates at the first machine.cycle, and the product-sum computing units 6 and 5 work at the next machine.cycle. In such a manner, the product-sum computing units 6, 5 and 4 compute the product-sums in the first, second and third multi-input single-output circuits. The number of the product-sum computing units for carrying out the parallel processing operation changes to 1, 2, 3, 3 for each machine.cycle, and at this time point, the product-sum computing unit 6 outputs the product-sum W.sub.11 X.sub.1 +W.sub.12 X.sub.2 +W.sub.13 X.sub.3 +W.sub.14 X.sub.4 (42). In order to compute the product-sum of the fourth multi-input single-output circuit of the hidden layer, the input signal X.sub.1 from the input signal register 12 is transferred to the input and output signal register 3 again through the signal switching unit 11 at the next machine.cycle. At the same time, the input signal X.sub.2 is loaded to the input signal register 12. At this machine.cycle, the output (formula 42) of the product-sum computing unit 6 is selected by the input selection unit 19 in the sigmoid function computing unit 10. The sigmoid function computing element 20 computes the sigmoid function with respect to the product-sum. As the signal switching unit 11 is set so that the signal from the input signal register 12 is transferred to the input and output signal register 3 at this machine.cycle, the output of the sigmoid function computing element 20 is delayed by the delay unit 21 till the signal switching unit 11 is set so that the output of the sigmoid function computing unit 10 may be transferred to the input and output signal register 3. 
This delay time is necessary when the number (four) of the multi-input single-output circuits of the hidden layer 74 is more than the number (three) of the product-sum computing units, because in this case the product-sums of the multi-input single-output circuits of the hidden layer are obtained by more than one round of parallel processing by the product-sum computing units. In this case, the input signals are transferred from the input signal register 12 to the input and output signal register 3 more than once, and the outputs of the multi-input single-output circuits of the hidden layer obtained previously are required to be stored in the sigmoid function computing unit 10 during the transferring operation. In the sigmoid function computing unit 10 from the next machine.cycle, the input selection unit 19 selects the outputs of the product-sum computing units 5, 4 in a sequence, and the sigmoid function computing element 20 obtains in a sequence the sigmoid function with respect to these product-sums. The output signal of the sigmoid function computing element 20 (namely, the output of the multi-input single-output circuit of the hidden layer) is delayed in a sequence by the delay unit 21 till the signal switching unit 11 is set so that the output of the sigmoid function computing unit 10 is transferred to the input and output signal register 3. While the product-sum of the fourth multi-input single-output circuit of the hidden layer is computed, the product-sum computing units 5 and 4 are not operated after the outputs of the second and third multi-input single-output circuits of the hidden layer are obtained. In this manner, the time required for obtaining the product-sums of all the multi-input single-output circuits of the hidden layer from the input signals is
machine.cycle.times.number of signals in the input layer.times.2(43)
When the product-sum computing unit 6 completes the computing of the product-sum of the fourth multi-input single-output circuit of the hidden layer, the signal switching unit 11 is set to transfer the output of the sigmoid function computing unit 10 to the input and output signal register 3. At the next machine.cycle, the sigmoid function computing unit 10 computes the sigmoid function with respect to the product-sum of the fourth multi-input single-output circuit of the hidden layer, and at the same time, the output of the first multi-input single-output circuit of the hidden layer is transferred to the input and output signal register 3. Thereafter, in the input and output signal registers 3 and 2, the output signals of the multi-input single-output circuits of the hidden layer are transferred in a sequence. The product-sums of the multi-input single-output circuits of the output layer are obtained in the product-sum computing units 6 and 5. Accordingly, the delay time in the delay unit 21 is
machine cycle × 3 (44)
for the outputs of the first, second and third multi-input single-output circuits of the hidden layer. It is
machine cycle × 2 (45)
for the output of the fourth multi-input single-output circuit of the hidden layer. In the sigmoid function computing unit 10, the sigmoid functions given in (formula 2) are computed sequentially for the product-sums of the multi-input single-output circuits of the output layer, which yields the outputs of those circuits. In this manner, the time required for the outputs of all the multi-input single-output circuits of the output layer to be output after the product-sums of all the multi-input single-output circuits of the hidden layer have been calculated is
machine cycle × (number of output signals of the hidden layer + number of output signals of the output layer) (46)
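As a rough illustration, formulas (43) and (46) can be written as two small functions that count machine cycles. This is a sketch under assumptions: the function and parameter names are hypothetical, and reading the factor 2 in formula (43) as the number of processing rounds, that is ceil(hidden circuits / product-sum units), is an interpretation of the four-circuit, three-unit case described above rather than something the text states. The input count of 4 used in the example call is likewise an assumed value.

```python
import math


def hidden_product_sum_cycles(n_inputs, n_hidden, n_units):
    """Machine cycles to obtain all hidden-layer product-sums (cf. formula 43).

    Assumption: the inputs are streamed once per round, and each round
    serves at most n_units hidden circuits in parallel.
    """
    rounds = math.ceil(n_hidden / n_units)
    return n_inputs * rounds


def output_stage_cycles(n_hidden, n_outputs):
    """Machine cycles from the end of the hidden-layer product-sums until
    all output-layer outputs are available (cf. formula 46)."""
    return n_hidden + n_outputs


# Four hidden circuits on three product-sum units need two rounds.
print(hidden_product_sum_cycles(n_inputs=4, n_hidden=4, n_units=3))
print(output_stage_cycles(n_hidden=4, n_outputs=2))
```

With four hidden circuits and three units the first function reduces to "number of input signals × 2", which matches formula (43) for this embodiment.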
The calculation of the back-propagating signal δ of the multi-input single-output circuits of the hidden layer and the modification of the weights equivalent to the strength of the connections between the multi-input single-output circuits of the hidden layer and those of the output layer are carried out by the same operation as in the previous embodiment of the present invention shown in FIG. 4. The back-propagating signals of the multi-input single-output circuits of the hidden layer are transferred to the input and output signal register in the order δh₃, δh₂, δh₁. The modification of the weights equivalent to the strength of the connections between the first through third multi-input single-output circuits of the hidden layer and the input terminals of the input layer is carried out by the same operation as in the previous embodiment of the present invention shown in FIG. 5. Thereafter, the back-propagating signal δ of the fourth multi-input single-output circuit of the hidden layer is transferred to the input and output signal register. The modification of the weights equivalent to the strength of the connections between the fourth multi-input single-output circuit of the hidden layer and the input terminals of the input layer is carried out by the same operation. The time required from the completion of all the weight modifications of the output layer to the completion of all the weight modifications of the hidden layer is
machine cycle × (2 × number of signals in the input layer + number of output signals of the hidden layer) (47)
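The formula above can be transcribed directly as a cycle count. The sketch below is illustrative only: the function name and the example layer sizes are assumptions, and the factor 2 is copied from the formula as given for this embodiment rather than derived.

```python
def hidden_weight_update_cycles(n_inputs, n_hidden):
    """Machine cycles between the end of the output-layer weight
    modifications and the end of the hidden-layer weight modifications.

    Literal transcription of the formula; the factor 2 is taken as given.
    """
    return 2 * n_inputs + n_hidden


# Example with assumed sizes: four input signals, four hidden circuits.
print(hidden_weight_update_cycles(n_inputs=4, n_hidden=4))
```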
According to the present embodiment as described hereinabove, the input signals are input in sequence from the input signal register 12 and are transferred in sequence through the input and output signal registers 3, 2 and 1, so that the product-sum computing units 6, 5 and 4, which carry out the parallel processing, each output the product-sum of a multi-input single-output circuit one machine cycle after the preceding unit. Thus, a single sigmoid function computing unit 10 suffices. Therefore, the scale of the circuit of the learning machine may be made smaller than that of the conventional learning machine. Also, in the sigmoid function computing unit 10, the delay unit 21 adjusts the delay time for the output of the sigmoid function computing element 20, so that learning machines which vary in the number of input signals and in the number of multi-input single-output circuits of the hidden layer and the output layer may be constructed.
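The staggered timing summarized above can be illustrated with a small software simulation. This is a minimal sketch, not the patented circuit: the names `cascade_forward`, `registers`, `sums` and `seen` are hypothetical, unit index 0 stands for the product-sum unit attached to the first register in the cascade, and the logistic function is assumed as the saturating sigmoid.

```python
import math


def sigmoid(x):
    # Logistic function, assumed here as the saturating nonlinearity.
    return 1.0 / (1.0 + math.exp(-x))


def cascade_forward(inputs, weights):
    """Simulate cascaded registers feeding parallel product-sum units.

    weights[k][i] is the weight from input i to the circuit handled by
    product-sum unit k. Returns a list of (cycle, unit, output) tuples.
    """
    n_in, n_units = len(inputs), len(weights)
    registers = [None] * n_units   # cascaded input/output signal registers
    sums = [0.0] * n_units         # running product-sum per unit
    seen = [0] * n_units           # inputs accumulated so far per unit
    results = []
    for cycle in range(n_in + n_units - 1):
        # One machine cycle: shift every value one register down the cascade.
        incoming = inputs[cycle] if cycle < n_in else None
        registers = [incoming] + registers[:-1]
        finished = []
        for k, value in enumerate(registers):
            if value is None:
                continue
            sums[k] += weights[k][seen[k]] * value
            seen[k] += 1
            if seen[k] == n_in:
                finished.append(k)
        # At most one unit completes per cycle, so one sigmoid unit is enough.
        assert len(finished) <= 1
        for k in finished:
            results.append((cycle, k, sigmoid(sums[k])))
    return results


x = [0.5, -1.0, 2.0, 0.25]
w = [[0.1, 0.2, 0.3, 0.4], [-0.5, 0.5, 0.0, 1.0], [1.0, -1.0, 0.5, 0.0]]
for cycle, unit, out in cascade_forward(x, w):
    print(cycle, unit, round(out, 4))
```

Because unit k only starts receiving inputs k cycles after unit 0, the completed product-sums arrive on consecutive cycles and never collide, which is why the sketch can route all of them through one `sigmoid` call per cycle.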
As is clear from the foregoing description, according to the arrangement of the present invention, a single sigmoid function computing unit suffices, and the scale of the circuit of the learning machine may be made smaller. Also, according to the present invention, the weights may be modified in a short time. Also, according to the present invention, learning machines which differ in the number of input signals and in the number of multi-input single-output circuits of the hidden layer and the output layer may be realized with a simple change of the setting.
Although the present invention has been fully described by way of example with reference to the accompanying drawings, it is to be noted here that various changes and modifications will be apparent to those skilled in the art. Therefore, unless such changes and modifications otherwise depart from the scope of the present invention, they should be construed as included therein.
Claims
  • 1. A learning machine comprising:
  • a plurality of input-output signal registers connected in cascade,
  • a plurality of weight memories for simultaneously outputting a plurality of weights, each of said weights to be multiplied by an output of a respective one of said plurality of input-output registers,
  • a plurality of product-sum computing units each of which receives at different machine cycles a) an output signal of a respective one of said plurality of input-output signal registers and b) the weight stored in a respective one of said plurality of weight memories, each of said plurality of product-sum computing units for generating a product-sum,
  • a single sigmoid function computing unit receiving product-sums at different machine cycles from all of said plurality of product-sum computing units, said sigmoid function computing unit outputting output signals having a saturation characteristic with respect to each product-sum which is outputted by every one of said plurality of product-sum computing units.
  • 2. A learning machine comprising:
  • a plurality of input-output signal registers connected in cascade and generating a plurality of output signals,
  • a plurality of weight memories for simultaneously outputting a plurality of weights, each of said weights to be multiplied by an output of a respective one of said plurality of input-output registers,
  • a plurality of product-sum computing units each of which receives at different machine cycles a) an output signal of a respective one of said plurality of input-output signal registers and b) the weight stored in a respective one of said plurality of weight memories, each of said plurality of product-sum computing units for generating a product sum,
  • a single sigmoid function computing unit receiving product-sums at different machine cycles from all of said plurality of product-sum computing units, said sigmoid function computing unit outputting output signals having a saturation characteristic with respect to each product-sum which is outputted by every one of said plurality of product-sum computing units,
  • an output layer delta computing unit for computing a back-propagating signal delta of an output layer which is dependent on a) an output signal of the sigmoid function computing unit and b) a supervising signal of the sigmoid function computing unit,
  • a hidden layer delta computing unit for computing a back-propagating signal delta of a hidden layer in accordance with the product which is generated by the product-sum computing unit,
  • a weight modification unit for determining an amount of weight modification dependent on said plurality of output signals generated by said plurality of input-output signal registers, the back-propagating signal delta of the output layer delta computing unit and the back-propagating signal delta of the hidden layer delta computing unit,
  • the amount of weight being modifiable.
  • 3. A learning machine described in accordance with claim 1 or claim 2, wherein said single sigmoid function computing unit comprises:
  • a single sigmoid function element for outputting signals having a saturation characteristic with respect to each input signal received from said input selecting unit,
  • an input selecting unit for selecting one output signal of said plurality of product-sum computing units so as to input said one output signal into the sigmoid function element,
  • a delay element for delaying the output signals of the sigmoid function element,
  • wherein a change in structure of the learning machine is realized by a change of delay time in the delay element.
Priority Claims (2)
Number Date Country Kind
2-335092 Nov 1990 JPX
3-112992 May 1991 JPX
Parent Case Info

This application is a continuation of application Ser. No. 07/800,592, filed Nov. 27, 1991, now abandoned.

US Referenced Citations (17)
Number Name Date Kind
4874963 Alspector Oct 1989
5010512 Hartstein et al. Apr 1991
5039871 Engeler Aug 1991
5056037 Eberhardt Oct 1991
5058179 Denker et al. Oct 1991
5063601 Hayduk Nov 1991
5067095 Peterson et al. Nov 1991
5073867 Murphy et al. Dec 1991
5095443 Watanabe Mar 1992
5109351 Simar, Jr. Apr 1992
5142666 Yoshizawa et al. Aug 1992
5146542 Engeler Sep 1992
5148514 Arima et al. Sep 1992
5187680 Engeler Feb 1993
5216746 Yoshizawa et al. Jun 1993
5220559 Tsuzuki et al. Jun 1993
5293457 Arima et al. Mar 1994
Foreign Referenced Citations (1)
Number Date Country
0349819 Jan 1990 EPX
Non-Patent Literature Citations (3)
Entry
Moors et al., "Cascading Content-Addressable Memories", IEEE Micro, vol. 12, iss. 3, pp. 56-66 (Jun. 1992).
M. Yasunaga et al., "Design, Fabrication and Evaluation of a 5-Inch Wafer Scale Neural Network LSI Composed of 576 Digital Neurons", IJCNN International Joint Conference on Neural Networks, vol. 2, pp. 527-535 (Jun. 1990).
W. Wike et al., "The VLSI Implementation of STONN", IJCNN International Joint Conference on Neural Networks, vol. 2, pp. 593-598 (Jun. 1990).
Continuations (1)
Number Date Country
Parent 800592 Nov 1991