The present invention relates generally to neuromorphic systems, and more specifically to neuromorphic systems based on phase change memory (PCM) synapses.
Biological systems impose order on the information provided by their sensory input. This information typically comes in the form of spatiotemporal patterns comprising localized events with a distinctive spatial and temporal structure. These events occur on a wide variety of spatial and temporal scales, and yet a biological system such as the brain is still able to integrate them and extract relevant pieces of information. Such biological systems can rapidly extract signals from noisy spatiotemporal inputs.
In biological systems, the point of contact between an axon of a neuron and a dendrite on another neuron is called a synapse, and with respect to the synapse, the two neurons are respectively called pre-synaptic and post-synaptic. The essence of our individual experiences is stored in the conductance of the synapses. The synaptic conductance can change with time as a function of the relative spike times of pre-synaptic and post-synaptic neurons, as per spike-timing dependent plasticity (STDP). The STDP rule increases the conductance of a synapse if its post-synaptic neuron fires after its pre-synaptic neuron fires, and decreases the conductance of a synapse if the order of the two firings is reversed.
Neuromorphic systems, also referred to as artificial neural networks, are computational systems that permit electronic systems to essentially function in a manner analogous to that of biological brains. Neuromorphic systems do not generally utilize the traditional digital model of manipulating 0s and 1s. Instead, neuromorphic systems create connections between processing elements that are roughly functionally equivalent to neurons of a biological brain. Neuromorphic systems may comprise various electronic circuits that are modeled on biological neurons.
Embodiments of the invention provide an architecture and method to realize an electronic implementation of spiking neurons interacting with each other via programmable, plastic synapses for computation, and pattern matching tasks such as association and recall. An aspect of the invention includes a method for producing spike-timing dependent plasticity using electronic neurons. In response to an electronic neuron spiking, a spiking signal is sent from the electronic neuron to each driver circuit connected to the axon and dendrite wires (called axon driver and each dendrite driver) connected to the spiking electronic neuron. Each axon driver receiving the spiking signal sends an axonal signal from the axon driver to a variable state resistor. Each dendrite driver receiving the spiking signal sends a dendritic signal from the dendrite driver to the variable state resistor, wherein the variable state resistor couples the axon driver and the dendrite driver. The combination of the axonal and dendritic signals is capable of increasing or decreasing conductance of the variable state resistor.
Another aspect of the invention includes a system for producing spike-timing dependent plasticity. The system comprises a plurality of electronic neurons and a cross-bar array coupled to the plurality of electronic neurons and configured to interconnect the plurality of electronic neurons. The cross-bar array comprises a plurality of axons and a plurality of dendrites such that the axons and dendrites are orthogonal to one another. The cross-bar array further comprises plural variable state resistors, such that each variable state resistor is at a cross-point junction of the cross-bar array coupled between a dendrite and an axon. The cross-bar array further comprises a plurality of dendrite drivers corresponding to the plurality of dendrites, each dendrite driver coupled to a dendrite at a first side of the cross-bar array. The cross-bar array further comprises a plurality of axon drivers corresponding to the plurality of axons, each axon driver coupled to an axon at a second side of the cross-bar array. Wherein an axon driver and a dendrite driver coupled by a variable state resistor at a cross-point junction are configured to generate signals which in combination are capable of changing the state of the variable state resistors as a function of time since a last spiking of an electronic neuron firing a spiking signal into the axon driver and the dendrite driver.
Another aspect of the invention includes a neuromorphic system comprising a plurality of electronic neurons having a layered relationship with directional connectivity. The system further comprises a first excitatory spiking electronic neuron layer comprising first excitatory spiking electronic neurons, and a second excitatory spiking electronic neuron layer comprising second excitatory spiking electronic neurons. The system further comprises a first inhibitory spiking electronic neuron layer comprising one or more first inhibitory spiking electronic neurons. Wherein the first excitatory spiking electronic neuron layer is configured to receive input, and wherein the first and second excitatory spiking electronic neuron layers and the first inhibitory spiking electronic neuron layer, are configured to process the received input based on learning rules. Each of the first excitatory spiking electronic neuron layer and first excitatory spiking electronic neuron layer comprises a system for producing spike-timing dependent plasticity including a cross-bar array mentioned above.
These and other features, aspects and advantages of the present invention will become understood with reference to the following description, appended claims and accompanying figures.
Embodiments of the invention provide neuromorphic systems, including Phase Change Memory (PCM) synaptronic circuits for spiking computation, association and recall. In one embodiment, the present invention provides a synaptronic circuit architecture and operating method.
In one embodiment, the synaptronic circuit comprises a synapse cross-bar array which implements spike-timing dependent plasticity (STDP) using PCM synapse devices. Embodiments include analog variable state resistor which implement amplitude modulated STDP versions and binary variable state resistor which implement probability modulated STDP versions. Disclosed embodiments include systems with access devices and systems without access devices. Referring now to
The system 100 further comprises synapse devices 22 including variable state resistors at the cross-point junctions of the cross-bar array 12, wherein the synapse devices 22 are connected between axons 24 and dendrites 26 such that the axons 24 and dendrites 26 are orthogonal to one another. The term variable state resistor refers to a class of devices in which the application of an electrical pulse (either a voltage or a current) will change the electrical conductance characteristics of the device. For a general discussion of cross-bar array neuromorphic systems as well as to variable state resistors as used in such cross-bar arrays, reference is made to K. Likharev, “Hybrid CMOS/Nanoelectronic Circuits: Opportunities and Challenges”, J. Nanoelectronics and Optoelectronics, 2008, Vol. 3, p. 203-230, which is hereby incorporated by reference. In one embodiment of the invention, the variable state resistor may comprise a PCM synapse device. Besides PCM devices, other variable state resistor devices that may be used in embodiments of the invention include devices made using metal oxides, sulphides, silicon oxide and amorphous silicon, magnetic tunnel junctions, floating gate FET transistors, and organic thin film layer devices, as described in more detail in the above-referenced article by K. Likharev. The variable state resistor may also be constructed using a static random access memory device.
Each electronic neuron comprises a pair of RC circuits 15. In general, in accordance with an embodiment of the invention, neurons “fire” (transmit a pulse) in response to the integrated inputs they receive from dendritic input connections 26 exceeding a threshold. When neurons fire, they maintain an anti-STDP (A-STDP) variable that decays with a relatively long, predetermined, time constant determined by the values of the resistor and capacitor in one of its RC circuits. For example, in one embodiment, this time constant may be about 50 ms. The A-STDP variable may be sampled by determining the voltage across the capacitor using a current mirror, or equivalent circuit. This variable is used to achieve axonal STDP, by encoding the time since the last firing of the associated neuron. Axonal STDP is used to control “potentiation”, which in this context is defined as increasing synaptic conductance. When neurons fire, they also maintain a D-STDP variable that decays with a relatively long, predetermined, time constant based on the values of the resistor and capacitor in one of its RC circuits 15. As used herein, the phrase “in response to” or the term “when” can mean that a signal is sent instantaneously after a neuron fires, or some period of time after the neuron fires.
As shown in
The cross-bar array 12 further includes driver devices X2, X3 and X4 as shown in
The sense amplifier devices X4 feed into excitatory spiking electronic neurons (Ne) 14, 16 and 18, which in turn connect into the axon driver devices X3 and dendrite driver devices X2. The neuron 20 is an inhibitory spiking electronic neuron (Ni). Generally, an excitatory spiking electronic neuron makes its target neurons more likely to fire, while an inhibitory spiking electronic neuron makes its targets less likely to fire. A variety of implementations of spiking electronic neurons can be utilized. Generally, such neurons comprise a counter that increases when inputs from source excitatory neurons are received and decreases when inputs from source inhibitory neurons are received. The amount of the increase or decrease is dependent on the strength of the connection from a source neuron to a target neuron. If the counter reaches a certain threshold, the neuron then generates its own spike (i.e., fires) and the counter undergoes a reset to a baseline value. The term spiking electronic neuron is referred to as “electronic neuron” herein.
In this example, each of the excitatory neurons 14, 16, 18 (Ne) is configured to provide integration and firing. Each inhibitory neuron 20 (Ni) is configured to regulate the activity of the excitatory neurons depending on overall network activity. As those skilled in the art will recognize, the exact number of excitatory neurons and inhibitory neurons can vary depending on the nature of the problem to solve using the disclosed architecture herein.
A read spike of a short duration (e.g., about 0.05 ms to 0.15 ms and preferably about 0.1 ms long) may be applied to an axon driver device X3 for communication. An elongated pulse (e.g., about 150 ms to 250 ms and preferably about 200 ms long) may be applied to the axon driver device X3 and a short negative pulse may be applied to the dendrite driver device X2 midway through the axon driver pulse (e.g., about 45 ns to 55 ns and preferably about 45 ns long) for programming. As such, the axon driver device X3 provides a long programming pulse and communication spikes. A dendrite driver device X2 provides a programming pulse with a delay. In one embodiment of the invention where a neuron circuit is implemented using analog logic circuits, a corresponding sense amplifier X4 translates PCM current levels to neuron current levels for integration. In another embodiment of the invention where a neuron circuit is implemented using digital logic circuits, a corresponding sense amplifier X4 translates PCM current levels to binary digital signals for integration.
In
In general, the combined action of the signals from drivers X2 and X3 in response to spiking signals from the firing neurons in the cross-bar array 12, causes the corresponding resistors 23 in synapses 22 at the cross-bar array junctions thereof, to change value based on the spiking timing action of the firing neurons. This provides programming of the resistors 23. Referring to
In an analog implementation of a neuron, each level translator device X4 comprises a circuit configured to translate the amount of current from each corresponding synapse 22 for integration by the corresponding neuron. As shown by example in
The timing in delivering signals from the neurons in the cross-bar array 12 to the devices X2, X3, X4, and the timing of the devices X2, X3, X4 in generating signals, allows programming of the synapses. One implementation comprises changing the state of a resistor 23 by increasing or decreasing conductance of the resistor 23 as a function of time since a last spiking of an electronic neuron firing a spiking signal into the axon driver and the dendrite driver coupled by the resistor 23. In general, neurons generate spike signals and the devices X2, X3, X4 interpret the spikes signals, and in response generate signals described above for programming the synapses 22. The synapses and neurons can be analog or digital. The example signals in
The system 100 serves as the basic building block to generate any spiking network of integrate-and-fire neurons interacting through plastic synapses. Other schemes to achieve STDP can also be used with the architecture of system 100.
One embodiment of the architecture 35 comprises a spiking electronic neuron microcircuit implementing an unsupervised pattern recognition system of the associative recall type. A pattern recognition system comprises an assembly of interacting spiking electronic neurons in a memory microcircuit configured to store and associatively recall spatiotemporal patterns. Learning rules provide the strengths (i.e., level of conductance) of synaptic interconnections between the electronic neurons as a function of the patterns to be stored.
According to an embodiment of the invention, given an input data stream that contains spatiotemporal patterns, the architecture 35 learns to detect the presence of the patterns, and to extract and store the patterns without requiring that any information about the patterns to be detected be provided ahead of time. The system stores the patterns in such a way that when presented with a fragmentary and/or noisy version of the stored pattern, the system is able to retrieve a proper matching pattern from memory.
In addition to the spatiotemporal patterns, the input data stream may in general contain a level of noise. The pattern recognition system carries out pattern recognition in a real-time or online fashion, and does not require separate stages for processing the incoming information. The system processes the incoming information in real-time as the data stream is fed in to the system. In an embodiment of the invention, the system architecture is modular and scalable, suitable for problems of a combinatorial nature on multiple spatial and temporal scales while using a single, streamlined architecture.
The architecture 35 comprising two layers E1 and E2 of excitatory electronic neurons. The system 100 further comprises two layers I1 and I2 of inhibitory electronic neurons. The architecture 35 provides directional connectivity between the neurons (feedforward and feedback), implementing interplay of a winner-take-all (WTA) process via lateral inhibition and spike driven learning rules which serve to select causal associations between events. The E1 layer receives spatio-temporal inputs (e.g., images of circles or squares with temporal variations in appearance). Patterns presented to the E1 layer lead to compressed representations on the E2 layer. Partial or corrupted versions of previously encountered patterns lead to error-free retrieval of complete versions.
In one example, a random distribution of weights is utilized, such that each E2 neuron needs input from about 10% of E1 neurons, in order to spike. The feed forward (FF) connections exhibit STDP, wherein inputs leading to significant spatiotemporal correlations in E1 layer neuronal activity cause certain E2 layer neurons to fire. The 12 layer ensures that the activity in the E2 layer is limited to a very small number. If E1 layer neurons fire before E2 layer neurons, this leads to strengthening synapses to form associations. If E2 layer neurons fire before E1 layer neurons, this leads to weakening synapses to wash out noise.
The feedback path (FB) connections exhibit anit-STDP (i.e., aSTDP). If a corrupt or incomplete input appears at the E1 layer, the correct E2 layer neurons should fire. Based on that E2 neuron firing, the full E1 input can be reconstructed. If E1 layer neurons fire before E2 layer neurons, synapses are weakened to remove spurious activity. If E2 layer neurons fire before E1 layer neurons, synapses are strengthened for pattern completion by enhancing connections from inputs seen earlier. The architecture 35 provides a Feed-Forward path with STDP and a Feed-Back path with anti-STDP.
In one example, the WTA process generally models a neuromorphic net of excitatory neurons and an inhibitory neuron. Active excitatory neurons excite the inhibitory neuron. The inhibitory neuron inhibits the excitatory neurons. Activity of the inhibitory neuron increases until most excitatory neurons are inhibited.
The electronic neurons are interconnected as follows. Each electronic neuron makes a fixed number, M, of outgoing connections with other electronic neurons. Each E1 layer electronic neuron connects to I1 layer and E2 layer electronic neurons. Each I1 layer electronic neuron connects exclusively to E1 layer electronic neurons. Similarly, each E2 layer electronic neuron connects to 12 layer and E1 layer electronic neurons. Each 12 layer electronic neuron connects exclusively to E2 layer electronic neurons. Each pathway connecting any pair of neurons is also assigned a conduction delay. The connections and the delays can be assigned either randomly (e.g., drawn from a distribution) or in a predetermined topographic fashion depending on the intended application. No population is allowed to connect back to itself.
A neuromorphic method and system according to an embodiment of the invention implements features of cortical networks, including spiking neurons, spike-time driven learning rules and recurrent connections between neurons. The system requires only a single spike per neuron to perform input pattern identification and subsequent pattern recall, following an unsupervised training session. The system may serve as an intermediate processing stage of a larger system, wherein a functionally significant amount of processing can occur under time constraints similar to those suggested by neurobiological experiments.
According to an embodiment of the invention, the neuromorphic system implements an architecture configured to accommodate spiking neurons with competitive dynamics and unsupervised learning. The neuromorphic system implements transient neuron assemblies with extinguishing activities as soon as a successful retrieval has been carried out and once the pattern is deactivated. Such transient assemblies allow for an efficient rapid successive activation and retrieval of memories. The neuromorphic system comprises a dedicated neuromorphic circuit for pattern completion with the ability to pinpoint events that require attention and subsequent analysis. The neuromorphic system is readily amenable to incorporation into a modular framework, with each module having the basic two-layer electronic neuron implementation disclosed herein.
As an example, a modular framework construction comprises stacking the basic two-layer modules in a hierarchical fashion, with each level in the hierarchy representing features at varying degrees of abstraction. Additional neuron layers (e.g., E2a, E2b, etc.) may be added to the basic two-layer system, with each neuron layer responding, in parallel, to different pattern features in the same input stream. This can be achieved by using different receptive field profiles for each neuron layer sheet. Alternatively, the system may comprise multiple E1 layers with distinct input streams all feeding into a single E2 layer. The system can consolidate previously-learned patterns into complex composites by taking various permutations and combinations of these alternatives.
The embodiments of the invention can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements. An example architecture of a canonical spiking neuron system according to the invention as described above includes neurons in layers E1, E2, I1 and I2 as well as their connections and learning rules between them. Such a system may be implemented in different ways, such as implementation through simulations on a traditional computer system or through a variety of different hardware schemes, one of which comprises an ultra-dense synapse cross-bar array providing spike-timing dependent plasticity.
The term electronic neuron as used herein represents an architecture configured to simulate a biological neuron. An electronic neuron creates connections between processing elements that are roughly functionally equivalent to neurons of a biological brain. As such, a neuromorphic system comprising electronic neurons according to embodiments of the invention may include various electronic circuits that are modeled on biological neurons. Further, a neuromorphic system comprising electronic neurons according to embodiments of the invention may include various processing elements (including computer simulations) that are modeled on biological neurons. Although certain illustrative embodiments of the invention are described herein using electronic neurons comprising electronic circuits, the present invention is not limited to electronic circuits. A neuromorphic system according to embodiments of the invention can be implemented as a neuromorphic architecture comprising analog or digital circuitry, and additionally as a computer simulation. Indeed, the embodiments of the invention can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements.
Embodiments of the invention can take the form of a computer simulation or program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer, processing device, or any instruction execution system. As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing. Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The computer system can include a display interface 106 that forwards graphics, text, and other data from the communication infrastructure 104 (or from a frame buffer not shown) for display on a display unit 108. The computer system also includes a main memory 110, preferably random access memory (RAM), and may also include a secondary memory 112. The secondary memory 112 may include, for example, a hard disk drive 114 and/or a removable storage drive 116, representing, for example, a floppy disk drive, a magnetic tape drive, or an optical disk drive. The removable storage drive 116 reads from and/or writes to a removable storage unit 118 in a manner well known to those having ordinary skill in the art. Removable storage unit 118 represents, for example, a floppy disk, a compact disc, a magnetic tape, or an optical disk, etc., which is read by and written to by removable storage drive 116. As will be appreciated, the removable storage unit 118 includes a computer readable medium having stored therein computer software and/or data.
In alternative embodiments, the secondary memory 112 may include other similar means for allowing computer programs or other instructions to be loaded into the computer system. Such means may include, for example, a removable storage unit 120 and an interface 122. Examples of such means may include a program package and package interface (such as that found in video game devices), a removable memory chip (such as an EPROM, or PROM) and associated socket, and other removable storage units 120 and interfaces 122 which allow software and data to be transferred from the removable storage unit 120 to the computer system.
The computer system may also include a communications interface 124. Communications interface 124 allows software and data to be transferred between the computer system and external devices. Examples of communications interface 124 may include a modem, a network interface (such as an Ethernet card), a communications port, or a PCMCIA slot and card, etc. Software and data transferred via communications interface 124 are in the form of signals which may be, for example, electronic, electromagnetic, optical, or other signals capable of being received by communications interface 124. These signals are provided to communications interface 124 via a communications path (i.e., channel) 126. This communications path 126 carries signals and may be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, an radio frequency (RF) link, and/or other communication channels.
In this document, the terms “computer program medium,” “computer usable medium,” and “computer readable medium” are used to generally refer to media such as main memory 110 and secondary memory 112, removable storage drive 116, and a hard disk installed in hard disk drive 114.
Computer programs (also called computer control logic) are stored in main memory 110 and/or secondary memory 112. Computer programs may also be received via a communication interface 124. Such computer programs, when run, enable the computer system to perform the features of the present invention as discussed herein. In particular, the computer programs, when run, enable the processor 102 to perform the features of the computer system. Accordingly, such computer programs represent controllers of the computer system.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
This invention was made with United States Government support under Agreement No. HR0011-09-C-0002 awarded by Defense Advanced Research Projects Agency (DARPA). The Government has certain rights in the invention.