A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
1. Field of the Invention
The present innovation relates generally to artificial neural networks, and more particularly in one exemplary aspect to computer apparatus and methods for efficient operation of spiking neural networks.
2. Description of Related Art
Artificial spiking neural networks are frequently used to gain an understanding of biological neural networks, and for solving artificial intelligence problems. These networks typically employ a pulse-coded mechanism, which relies on encoding information using timing of the pulses. Such pulses (also referred to as “spikes” or ‘impulses’) are short-lasting (typically on the order of 1-2 ms) discrete temporal events and are used, inter alia, to encode information. Several exemplary embodiments of such encoding are described in a commonly owned and co-pending U.S. patent application Ser. No. 13/152,084 entitled APPARATUS AND METHODS FOR PULSE-CODE INVARIANT OBJECT RECOGNITION”, and U.S. patent application Ser. No. 13/152,119 entitled “SENSORY INPUT PROCESSING APPARATUS AND METHODS”, each incorporated herein by reference in its entirety.
A typical artificial spiking neural network includes a plurality of units (or nodes), which correspond to neurons in a biological neural network. A single unit may be connected to many other units via connections, also referred to as communications channels, or synaptic connections. Those units providing inputs to any given unit are commonly referred to as the pre-synaptic units, while the units receiving the inputs from the synaptic connections are referred to as the post-synaptic units.
Each of the unit-to-unit connections is assigned, inter alia, a connection strength (also referred to as the synaptic weight). During operation of the pulse-coded network, synaptic weights are dynamically adjusted using what is referred to as the spike-timing dependent plasticity (STDP) in order to implement, among other things, network learning. Typically, each unit may receive inputs from a large number (up to 10,000) of pre-synaptic units having associated pre-synaptic weights, and provides outputs to a similar number of downstream units via post-synaptic connections (having associated post-synaptic weights). Such network topography therefore comprises several millions of connections (channels), hence requiring access, modification, and storing of a large number of synaptic variables for each unit in order to process each of the incoming and outgoing pulse through the unit.
Various techniques for accessing the synaptic variables from the synaptic memory exist. The synaptic weights are typically stored in the synaptic memory using two approaches: (i) post-synaptically indexed: that is, based on the identification (ID) of the destination unit, e.g., the post-synaptic unit; and (ii) pre-synaptically indexed: that is based on the source unit ID, e.g., the pre-synaptic unit.
When the synaptic data are stored according to the pre-synaptic index, then access based on the post-synaptic index is inefficient. That is, a unit receiving input from in pre-synaptic units and providing n outputs via n post-synaptic channels, requires n reads and n writes of a one-weight block (scattered access) to process the pre-synaptic inputs, and one read, one write of a m-weight block to process the post-synaptic outputs. Similarly, the post-synaptic index based storage scheme results in one read, one write of an in-weight block to process the pre-synaptic inputs, and n reads and n writes of a one-weight block to process the post-synaptic outputs, because one or the other lookup would require a scattered traverse of non-contiguous areas of synaptic memory.
One approach to implement efficient memory access of both pre-synaptic and post-synaptic weights is proposed by Jin et al, and is referred to as the “pre-synaptic sensitive scheme with an associated deferred event-driven model”. In the model of Jin, synaptic variable modification is triggered during a pre-synaptic spike event (no synaptic variables access during post-synaptic spike event), and hence the synaptic information is stored based only on the pre-synaptic index (see Jin, X., Rast, A., F. Galluppi, F., S. Davies., S., and Furber, S. (2010) “Implementing Spike-Timing-Dependent Plasticity on SpiNNaker Neuromorphic Hardware”, WCCI 2010, IEEE World Congress on Computational Intelligence), incorporated herein by reference in its entirety. In addition, the actual update of synaptic variables is deferred until a certain time window expires.
However, this approach has several limitations. For a typical STDP window of 100 ins, the corresponding firing rate of the pre-synaptic neuron needs to be greater than 10 Hz for the scheme of Jin et al. (2010) to work properly. Furthermore, the deferred approach of Jin et al. (2010) does not provide immediate update for the synaptic weights, because the approach waits for the time window to expire before modifying the synaptic weight, thereby adversely affecting the accuracy of post-synaptic pulse generation by the unit.
Existing synaptic update approaches do not provide synaptic memory access mechanisms that are efficient for a large category of spiking neural networks. Such approaches also do not provide up-to-date synaptic variables for different kind of learning rules, and are limited by the firing rate of the pre-synaptic and post-synaptic units.
Furthermore, existing synaptic weight update schemes are not applicable to different plasticity models, such as the nearest-neighbor, all-to-all etc. See Izhikevich and Desai 2003, entitled “Relating STDP to BCM”, Neural Computation 15, 1511-1523, incorporated herein by reference in its entirety, relating to various plasticity rules such as STDP, inverse STDP, and “bump” STDP. See also Abbott L. F. and Nelson S. B. (2000), “Synaptic plasticity: taming the beast”, Nature Neuroscience, 3, 1178-1183, also incorporated herein by reference in its entirety.
Accordingly, there is a salient need for a more efficient, timely, and scalable synaptic variable update mechanism that is applicable to many different types of plasticity models and different plasticity rules.
In a first aspect of the invention, a computerized spiking network apparatus is disclosed. In one embodiment, the apparatus includes: a storage apparatus; and a pre-synaptic unit in communication with a post-synaptic unit by a communication channel. In one variant, the apparatus is configured to operate by: storing information related to a plurality of pulses in the storage apparatus, the plurality of output pulses being generated by the post-synaptic unit at a plurality of generation times between a triggering pulse and a system event; and evaluating a plurality of updates based on a plurality of intervals between the triggering pulse and each of the plurality of generation times. The system event enables removal of at least a portion of the information related to the plurality of pulses from the storage apparatus.
In another variant, the triggering pulse is generated by the pre-synaptic unit, and is communicated through the channel.
In a second aspect of the invention, a method of operating a communications channel coupled to a post-synaptic unit in a computerized spiking neuronal network is disclosed. In one embodiment, the method includes: modifying the channel based on a first group of pulses associated with the post-synaptic unit, the first group of pulses occurring between a preceding trigger communicated via the channel and a system event; and maintaining the channel substantially unmodified between the system event and the preceding trigger.
In one variant, the method further includes storing, in a shared memory block of a storage apparatus, information related to the first group of pulses a second group of pulses, the second group of pulses being associated with one other post-synaptic unit. The system event is generated by a network entity in data communication with the post-synaptic unit and is based at least in part on a portion of the information being stored in the shared memory block, the portion being related to at least one pulse of the first group of pulses.
In another variant, the method further includes storing information related to the first group of pulses in a shift register.
In a third aspect of the invention, a method of operating a node of a spiking network is disclosed. In one embodiment, the method includes: responsive to a system event, storing in a storage device a plurality of intervals between a first trigger and a plurality of pulses generated by the node. The first trigger is communicated to the node via a channel prior to the system event.
In one variant, each of the plurality of pulses is being generated prior to the system event; and the plurality of intervals is configured based on information related to the plurality of pulses.
In another variant, the method further includes performing a first update of the channel based at least in part on the plurality of intervals.
In yet another variant, the method further includes: performing a first update of the channel based on a first interval of the plurality of intervals; and subsequent to performing the first update, performing a second update of the channel based on a second interval between a second trigger and a latest pulse of the plurality of pulses generated by the node. The first update and the second update cooperate to determine an updated channel weight; and the second trigger is communicated via the channel subsequent to the system event.
In a fourth aspect of the invention, a method of optimizing operation of a shared storage of computerized network apparatus is disclosed. In one embodiment, the apparatus includes at least one node coupled to a channel, and the method includes: storing, in the shared storage, information related to a plurality of pulses associated with the at least one node; and updating the channel in response to a system event by at least a plurality of updates based on a plurality of intervals between a trigger being communicated through the channel, and the plurality of pulses. The updating of the channel enables removal of at least a portion of the information from the shared storage.
In one variant, the method further includes storing, in the shared storage, information related to a group of pulses associated with one other node; wherein: the at least one node is characterized by a first output pulse rate; the one other node is characterized by a second output pulse rate, and the second output pulse rate is lower than the first output pulse rate. In another variant, the shared storage includes: a first memory portion configured to stored data related to the plurality of pulses associated with the at least one node; and at least a second memory portion configured to stored data related to the group of pulses associated with the one other node. The second portion is smaller than the first portion.
In yet another aspect of the invention, a computerized neuronal system is disclosed. In one embodiment, the system includes a spiking neuronal network, and an apparatus controlled at least in part by the neuronal network.
In a further aspect of the invention, a shared architecture is disclosed.
In still a further aspect of the invention, a computer readable apparatus is disclosed. In one embodiment, the apparatus includes a storage medium having at least one computer program stored thereon, the at least one program being configured to, when executed, implement shared storage architecture operation.
All Figures disclosed herein are © Copyright 2011 Brain Corporation. All rights reserved.
Embodiments of the present invention will now be described in detail with reference to the drawings, which are provided as illustrative examples so as to enable those skilled in the art to practice the invention. Notably, the figures and examples below are not meant to limit the scope of the present invention to a single embodiment, but other embodiments are possible by way of interchange of or combination with some or all of the described or illustrated elements. Wherever convenient, the same reference numbers will be used throughout the drawings to refer to same or like parts.
Where certain elements of these embodiments can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present invention will be described, and detailed descriptions of other portions of such known components will be omitted so as not to obscure the invention.
In the present specification, an embodiment showing a singular component should not be considered limiting; rather, the invention is intended to encompass other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein.
Further, the present invention encompasses present and future known equivalents to the components referred to herein by way of illustration.
As used herein, the term “bus” is meant generally to denote all types of interconnection or communication architecture that is used to access the synaptic and neuron memory. The “bus” could be optical, wireless, infrared or another type of communication medium. The exact topology of the bus could be for example standard “bus”, hierarchical bus, network-on-chip, address-event-representation (AER) connection, or other type of communication topology used for accessing, e.g., different memories in pulse-based system.
As used herein, the terms “computer”, “computing device”, and “computerized device”, include, but are not limited to, personal computers (PCs) and minicomputers, whether desktop, laptop, or otherwise, mainframe computers, workstations, servers, personal digital assistants (PDAs), handheld computers, embedded computers, programmable logic device, personal communicators, tablet computers, portable navigation aids, J2ME equipped devices, cellular telephones, smart phones, personal integrated communication or entertainment devices, or literally any other device capable of executing a set of instructions and processing an incoming data signal.
As used herein, the term “computer program” or “software” is meant to include any sequence or human or machine cognizable steps which perform a function. Such program may be rendered in virtually any programming language or environment including, for example, C/C++, C#, Fortran, COBOL, MATLAB™, PASCAL, Python, assembly language, markup languages (e.g., HTML, SGML, XML, VoXML), and the like, as well as object-oriented environments such as the Common Object Request Broker Architecture (CORBA), Java™ (including J2ME, Java Beans, etc.), Binary Runtime Environment (e.g., BREW), Java Bytecode, Low-level Virtual Machine (LLVM), and the like.
As used herein, the term “memory” includes any type of integrated circuit or other storage device adapted for storing digital data including, without limitation, ROM. PROM, EEPROM, DRAM, SDRAM, DDR/2 SDRAM, EDO/FPMS, RLDRAM, SRAM, “flash” memory (e.g., NAND/NOR), memristor memory, and PSRAM.
As used herein, the terms “microprocessor” and “digital processor” are meant generally to include all types of digital processing devices including, without limitation, digital signal processors (DSPs), reduced instruction set computers (RISC), general-purpose (CISC) processors, microprocessors, gate arrays (e.g., FPGAs), PLDs, reconfigurable computer fabrics (RCFs), array processors, stream processors (e.g., GPU), secure microprocessors, and application-specific integrated circuits (ASICs). Such digital processors may be contained on a single unitary IC die, or distributed across multiple components.
As used herein, the terms “pulse”, “spike”, “burst of spikes”, and “pulse train” are meant generally to refer to, without limitation, any type of a pulsed signal, e.g., a rapid change in some characteristic of a signal such as amplitude, intensity, phase or frequency, from a baseline value to a higher or lower value, followed by a rapid return to the baseline or other value, and may refer to any of a single spike, a burst of spikes, an electronic pulse, a pulse in voltage, a pulse in electrical current, a software representation of a pulse and/or burst of pulses, a software representation of a latency or timing of the pulse, and any other pulse or pulse type associated with a pulsed transmission system or mechanism.
As used herein, the term “pulse-code” is meant generally to denote, without limitation, information encoding into a patterns of pulses (or pulse latencies) along a single pulsed channel or relative pulse latencies along multiple channels.
As used herein, the terms “pulse delivery”, “spike delivery”, and “pulse application” is meant generally to denote, without limitation, transfer of connection information related to the connection (e.g., synaptic channel) to a destination unit in response to a pulse from a sending unit via the connection.
As used herein, the terms “receiving pulse” and “arrival of the pulse” are meant generally to denote, without limitation, a receipt of a physical signal (either voltage, lights, or current) or a logical trigger (memory value) indicating a trigger event associated with the transmission of information from one entity to another.
As used herein, the term “synaptic channel”, “connection”, “link”, “transmission channel”, “delay line”, and “communications channel” are meant generally to denote, without limitation, a link between any two or more entities (whether physical (wired or wireless), or logical/virtual) which enables information exchange between the entities, and is characterized by a one or more variables affecting the information exchange.
As used herein, the term “spike-timing dependent plasticity” or STDP is meant generally to denote, without limitation, an activity-dependent learning rule where the precise timing of inputs and output activity (spikes) determines the rate of change of connection weights.
Overview
The present invention provides, in one salient aspect, apparatus and methods for efficient memory access during synaptic variable updates in a spiking neural network for implementing synaptic plasticity and learning.
In one embodiment, a computerized network apparatus is disclosed which includes multiple pre-synaptic units (or nodes) connected to post-synaptic units (or nodes) via communications links (synaptic connections), and a storage device configured to store information related to the connections. In order to implement synaptic plasticity and learning, one or more parameters associated with the synaptic connections are updated based on (i) a pre-synaptic pulse generated by the pre-synaptic node and received by the post-synaptic node (a pre-synaptic update), and (ii) a post synaptic pulse generated by the post-synaptic node subsequent to the pre-synaptic pulse (a post-synaptic update). In one embodiment, the post-synaptic updates are delayed until receipt of the next subsequent pre-synaptic pulse by the post-synaptic node. The pre-synaptic update is performed first, followed by the post-synaptic update, thus ensuring that synaptic connection status is up-to-date.
In another embodiment, the connection updates are only preformed whenever a pre-synaptic pulse is received, while leaving the connection state unchanged in between adjacent pre-synaptic pulses.
The delay update mechanism is used in conjunction with system “flush” events (i.e., events which are configured to cause removal (flushing) of a portion of the data related to some of the post-synaptic pulses) in order to ensure network accurate operation, and prevent loss of information under a variety of pre-synaptic and post-synaptic unit firing rates. A large network partition mechanism is used in one embodiment with network processing apparatus in order to enable processing of network signals in a limited functionality embedded hardware environment.
The use of delayed connection updates advantageously reduces memory access fragmentation and improves memory bandwidth utilization. These improvements may be traded for processing of additional pulses (increased pulse rate), additional nodes (higher network density), or use of simpler and less costly computerized hardware for operating the network.
Detailed Description of Exemplary Embodiments
Detailed descriptions of the various embodiments and variants of the apparatus and methods of the invention are now provided. Embodiments of the invention may be, for example, deployed in a hardware and/or software implementation of a computer-vision system, provided in one or more of a prosthetic device, robotic device and a specialized visual system. In one such implementation, an image processing system may include a processor embodied in an application specific integrated circuit (“ASIC”), a central processing unit (CPU), a graphics processing unit (GPU), a digital signal processor (DSP) or an application specific processor (ASIP) or other general purpose multiprocessor, which can be adapted or configured for use in an embedded application such as a prosthetic device.
Exemplary Network Architecture
A typical pulse-coded artificial spiking neural network (such as the network 100 shown in
Each synaptic connection is characterized by one or more synaptic variables, comprising one or more synaptic (channel) weight, channel delay, and post-synaptic unit identification, i.e. target unit ID. The synaptic weight describes the strength or amplitude of the connection between two units (affecting, inter alfa, amplitude of pulses transmitted by that connection), corresponding in biology to the amount of influence the firing of one neuron has on another neuron. The synaptic variables (also referred to as the synaptic nodes), denoted by circles 116 in
The network 100 shown in
Units providing inputs to any given unit (such as the unit 122_3 in
Similarly, connections that deliver inputs to a unit are referred to as the input channel (or pre-synaptic) connections for that unit (e.g., the channels 108 for the unit 122_3), while connections that deliver outputs from the unit (such as the channels 114) are referred to as output channel (or post-synaptic) connections for that unit 122_3. As seen from
Any given unit (such as for example the unit 122_3) may receives inputs from a number m of pre-synaptic units, and it provides outputs to a number n of downstream units. During operation of the spiking neural network 100, whenever a unit (for example the unit 122_3) processes a synaptic event (e.g., generates an output pulse), synaptic variables of the pre-synaptic and post-synaptic connections are dynamically adjusted based, inter alia, on the timing difference between input and output pulses processed by the unit 122_3 using a variety of mechanisms described below.
Typically, a given network topography 100 comprises several millions or billions of connections, each characterized by a synaptic variable (e.g., weight). As a result, such pulse-coded network requires access, modification, and storing of a large number of synaptic variables (typically many millions to billions for n, m ˜1000) in order to implement learning mechanisms when processing the incoming and outgoing signals at each unit of the network 100.
The synaptic variables of a spiking network may be stored and addressed in using a pre-synaptic indexing (as illustrated by the network embodiment of
In a post-synaptically indexed network 101 such as that of
As described above, the synaptic nodes, denoted by circles 116 in
In another embodiment, the node entity 121 comprises the synaptic node 116 and the post-synaptic unit 122, as illustrated by the configuration 162 in
Various concepts associated with spike propagation from a pre-synaptic unit to a post-synaptic unit are described with respect to
The pulse arriving at the synaptic node 116 (or the entity 122) at the time 144 is referred to as the pre-synaptic pulse. After the pre-synaptic pulse reaches the synaptic node 116, the synaptic variables associated with the synaptic node 116 are loaded (delivered) to the post-synaptic unit 122 at time Tpre1+Tdelivery. In one variant, the delivery is instantaneous (Tdelivery=0). The post-synaptic unit 122 operates according to a node dynamical model, such as for example that described in U.S. patent application Ser. No. 13/152,105 entitled “APPARATUS AND METHODS FOR PULSE-CODE TEMPORALLY PROXIMATE OBJECT RECOGNITION” incorporated supra. Upon receiving the pre-synaptic pulse 144, the unit 121 may generate (subject to the node model state) a post-synaptic pulse 140 at time Tpost. In one variant, the post-synaptic pulse generation time Tpost (also referred to as the post-synaptic unit pulse history) is stored internally (in the unit 121) as a unit variable. In another embodiment, the post-synaptic unit pulse history is stored in a dedicated memory array external to the unit.
Similarly, at time Tpre2 the unit 122 receives another input, the pre-synaptic pulse 146, that is processed in a manner that is similar to the delivery of the pre-synaptic pulse 144, described supra. The arrival times Tpre1, Tpre2 of the pre-synaptic pulses 144, 146, respectively, and the generation time Tpost of the post-synaptic pulse 140 are used in updating (adjusting) synaptic variables or state of node 116 using any one or more of a variety of spike plasticity mechanisms. An embodiment of one such mechanism, useful for modeling learning in a pulse coded network 100, is shown and described with respect to
When the unit subsequently receives another pre-synaptic pulse 146 (generated by the same unit 102_1), the post-pre window, corresponding to the time interval 142 between Tpost and Tpre2 is computed. In the embodiment of
In one variant, the pre-post rule potentiates synaptic connections when the pre-synaptic pulse (such as the pulse 144) is received by the post-synaptic unit before the pulse 140 is fired. Conversely, post-pre STDP rule depresses synaptic connections when the pre-synaptic pulse (such as the pulse 146) is received by to the post-synaptic unit after the pulse 140 is generated. Such rules are typically referred to as the long-term potentiation (LTP) rule and long-term depression (LTD) rule, respectively. Various potentiating and depression implementations exist, such as for example, an exponential rule defined as:
where:
A1, A2 are the maximum adjustment amplitudes of the pre-synaptic and post-synaptic modifications, respectively;
t1, t2 are the time-windows for the pre-synaptic and post-synaptic modifications, respectively;
Δt=Tpre−Tpost; and
Tpre, Tpost are the pre-synaptic and the post-synaptic pulse time stamps, respectively.
As a result, in a typical realization of the STDP rule, the following steps are performed a network unit (for example, the unit 122_3 in
Pre-synaptic Pulse Rule: For every pre-synaptic pulse received by a group of post-synaptic units (pulse from 102_1 received by 122_1, 122_3, 122—k in
Post-synaptic Pulse Rule: For every post-synaptic pulse generated by a unit (e.g. 122_3 in
The above LTP and LTD updates are performed, for example, according to Eqns. 1-2 above, and are shown in
Various other STDP implementations can be used with the invention, such as, for example, the bump-STDP rule, illustrated in
Exemplary Implementation of Spiking Network Architecture
In one aspect of the invention, and the calculation of spike-timing dependent plasticity rules is based on the relative time difference between the pre-synaptic pulse and the post-synaptic pulse. A computerized network apparatus, implementing e.g., the spiking neural network of
In one embodiment, the computerized network apparatus comprises a synchronous implementation, where operation of the network is controlled by a centralized entity (within the network apparatus) that provides the time (clock) step, and facilitates data exchange between units. The arrival time of pre-synaptic pulses is derived from the synchronized time step that is available to all units and synapses within the network. Spike transmission between different units in the network can be carried out using for example direct point-to-point connection, shared memory or distributed memory communication, or yet other communication mechanisms which will be recognized by those of ordinary skill in the neurological modeling sciences given the present disclosure.
In another embodiment, the computerized network apparatus is implemented as an asynchronous network of units, where units are independent from one another and comprise their own internal clocking mechanism. In one variant of the asynchronous network, the pre-synaptic pulse timing is obtained using a time stamp, associated with the receipt of each pulse. The time stamp is derived from a local clock of the post-synaptic unit that has received the pre-synaptic pulse. In another variant of the asynchronous network, the pre-synaptic pulse timing is obtained using information related to the occurrence of the pre-synaptic pulse (such as, for example, a time stamp of the pre-synaptic unit, the channel delay and the clock offset) that may be required to obtain the pulse firing time if it is needed. One useful technique is to include the reference clock of the sending (pre-synaptic) unit with each spike. The receiving unit can accordingly adjust the timing difference based this additional timing information.
Exemplary Update Methods
Referring now to
The spiking neural network processing architecture further comprises a neuronal computation block 312 (either on the same IC as block 302, or on a separate IC) communicating with the synaptic computation block over a neuronal bus 306. The neuronal computation block implements various computations that describe the dynamic behavior the units within the network 100. Different neuronal dynamic models exist, such as described, for example, in Izhikevich, E. (2003), entitled “Simple Model of Spiking Neurons”, IEEE Transactions on Neural Networks, 14, 1569-1572, which is incorporated herein by reference in its entirety. In one variant, the neuronal computation block 312 comprises a memory for storing the unit information, such as recent history of firing, and unit internal states. The unit also stores the firing time of the most recent pulse. In another embodiment, the neuronal memory comprising of the neuronal state is a separate memory block 313 interconnected to the neuronal computation block 312.
In order to increase synaptic data access efficiency and to maximize performance of the pulse-based network, both the size of the synaptic memory 310 and the bandwidth of the bus 308 should be efficiently utilized. As described above, synaptic variables may be stored in the synaptic memory 310 using two approaches: (i) post-synaptically indexed—that is, based on the destination unit ID; or (ii) pre-synaptically indexed—that is, based on the source unit ID. When the data is stored using one of the above indexing method (e.g., the post-synaptically indexed), memory access using the other indexing method (e.g., the pre-synaptically indexed) is inefficient, and vice versa.
The synaptic variables for the channel group 118 in
As a result, in order to implement the pre-synaptic pulse based synaptic updates of synaptic variables of the group 120 in response to a pre-synaptic pulse generated by the unit 102_1, the exemplary embodiment of the synaptic computational block 302 is required to perform the following operations:
Similarly, in order to implement the pre-post STDP update updates of synaptic variables of the group 118 for a post-synaptic pulse generated by unit 122_3, the exemplary synaptic computational block is required to perform the following operations:
In one embodiment (shown in
In another embodiment (shown in
One embodiment of pre-synaptically indexed synaptic memory implementation associated with the pre and post synaptic updates is illustrated in
Similarly, when another pre-synaptic pulse 404 (generated by unit 102_1) is received by various units, the synaptic variables of the channel group (such as the group 120 in
The detailed structure of the pre-synaptic bus transactions 406 and the post-synaptic bus transactions 418 is shown in
Contrast the transaction 406 with the post-synaptic update transactions 418, 428 shown in
Such fragmented access of the post-synaptic memory block becomes even less efficient when multiple post-synaptic pulses are generated by different post-synaptic units (such as, for example, the units 122_1, 122_2, 122—k in
Typically, the memory bus (308 in
Memory access during post-synaptic updates described with respect to
One embodiment of memory access architecture according to the invention, referred to as the “lazy synaptic update”, for use in pulse coded artificial spiking neural networks, is illustrated in
At the time the first pre-synaptic-based update transaction 506_1 is executed, the post-synaptic timing information for the pulses 510, 512, 516 in
In order to enable delayed post-synaptic update, generation time for all post-synaptic pulses is recorded. In one variant, the exact timing of every post-synaptic pulse is stored in a memory buffer of the respective post-synaptic unit (e.g., the unit 122_3 of
In one variant, the unit firing timing information is stored using the absolute time for each of the pulses. In another variant, the timing information is stored using an offset relative to a reference event in order to reduce memory required to store the pulse firing timing information. In yet another variant (particularly useful with a synchronous iterative network processing implementation), a circular bit-vector is used to store the recent firing history, where each bit corresponds to a processing iteration of network computations (a step), and the bit value indicates the unit firing status (‘0’ unit did not fire, and ‘1’ unit fired).
Memory access of pre-synaptic transaction 506 is structured similarly to the bus transaction 406, described in detail with respect to
Comparing the bus transaction 308 activity shown in
I=(S+O)×M/(S×M+O). (Eqn. 3)
For S=10 cycles, BO=10 cycles, and M=100 nodes, the improvement is on the order of two. Such substantial improvement advantageously allows for processing of additional pulses, additional connections for with the same hardware when compared to prior art. Alternatively, the bus usage improvement of the present allows the use of less complex (and less costly) hardware to implement the same functionality as the solutions of prior art.
An underlying reason which allows the delayed implementation of the post-synaptic updates as illustrated in
While the postponement of post-synaptic updates according to the exemplary embodiment of the invention requires additional memory for storing the post-synaptic pulse generation times, the amount of additional storage is determined by the maximum number of expected post synaptic pulses and can be easily accommodated by the neuronal memory block which stores the remaining neuronal state variables. The postponement of post-synaptic updates advantageously obviates synaptic memory bus transaction (associated with the post-synaptic pulse generation) as the unit no longer requires reading and writing of the synaptic variables.
In another embodiment, shown in
Synaptic Update Methods Based on System Event
The previous embodiments of memory access during synaptic updates described with respect to
In one such implementation of the unit, the post-synaptic pulse history is stored locally at each unit using a circular buffer of size 606, as illustrated in
Referring now to
The standard plasticity rule shown in
In certain applications, it is required that a synaptic update is performed for every post-synaptic pulse. Such mechanism is particularly useful for synaptic plasticity rules that continue to adapt synaptic variables even for long plasticity time scales, such as the bump-STDP rule shown in
At step 652 of the method 650, the unit determines if it has fired or not. If the unit has fired, then a pre-synaptic pulse is generated by the unit and the method 650 proceeds via the pathway 654 and causes the pre-spike to invoke the necessary post-pre STDP update rule 656. In one variant, the synaptic memory update comprises synaptic bus transaction 506, described with respect to
If the check step 652 determines that no pulse has been generated by the unit, then the method 650 decrements the event counter at step 660. At step 662, a check is performed in order to determine if the event counter is greater than zero. If it is, then the unit operation continues to the step 652. If the pulse counter is equal to zero (indicating that the Nfire time-steps has elapsed since the last update), then the flush system event 664 is generated and unit/network operation continues to the step 656, as shown in
As described with respect to
When subsequent post-synaptic pulses 710, 712-1, 712-k, 716 are generated by one or more units 122_2, . . . 122—k, no synaptic updates are performed (as indicated by the absence of activity on the bus 308). Instead, the post-synaptic pulse times are recorded in the pulse buffer (such as the buffer 618 of
In the embodiment of
In one variant, each unit comprises a fixed-size buffer configured to store post-synaptic pulse history (firing bits), thereby enabling individual units to generate system events independently from one another.
In another embodiment, the synaptic update 726, initiated by the system event 754 in
The timing of the system event 754 in embodiment of
Whenever a flush system event 754 is generated, then the pre-post synaptic updates (corresponding to the time windows) 748 are applied for all post-synaptic pulses 710, 712, 716 that are generated within the time window 747 (that is computed as Tflush−Tpre1) in
When the next pre-synaptic pulse 704 is received, synaptic variables update only needs to account for the post-synaptic pulses generated within the time window 746 since the last flush event 754. Hence, the pre-post STDP is evaluated for the post-spikes 710_2, 712_2 using the time differences 750_1, 750_2 with respect to the pre-pulse 702 occurring at Tpre. The post-pre STDP rule is applied for the pulses occurring at 710_2, 712_2 using the time differences 742_5, 742_6 with respect to the current pre-pulse 704 occurring at Tpre2. This approach is applicable to nearest-neighbor based STDP update rule. Thus, each post-synaptic pulse (e.g. 710_1, 710_2, 712_1, 712_2) will not cause any memory transaction in the synaptic bus for updating the incoming synaptic variables. Only the spike history is updated for every post-synaptic pulse as illustrated in the flowchart 6C. For other types of STDP rules, a trace-based mechanism described in the next para is necessary to account for the post-pre STDP rule due to the post-synaptic pulses 712_1, 716_1 and the current pre-pulse 704.
For other kinds of plasticity rules where every post-synaptic pulse needs to be accounted for in the STDP calculations, a post-synaptic trace-based mechanism is used. In spiking neural networks, each post-synaptic node can contain an internal trace variable that is updated with each postsynaptic spike by certain amount, and decays between spikes with a fixed time constant based on the synaptic plasticity rule. This internal trace variable stored in the post-synaptic unit can be used by each synapses to calculate the overall change in the synaptic variable before actual delivery.
One exemplary embodiment of the trace-based post-synaptic mechanism, which accounts for the post-synaptic pulses flushed based on a system event, is illustrated in
In another embodiment of the system event-based synaptic update method (not shown), only the time difference (Δt=Tpost−Tpre) between the last pre-synaptic pulse (e.g., the pulse 702 in
In another embodiment, successive flush-events are generated for every Nfire post-synaptic pulses. Such update mechanism is especially useful with synaptic plasticity rules that adjust synaptic variables for every post-synaptic pulse. One specific example of such plasticity rule is shown in
In another approach, generation of flush system events is stopped after a certain number Nstop of post-synaptic pulses, when additional post synaptic pulses do not significantly affect data propagation accuracy within the network. For example, the plasticity rules, such as illustrated in
In a different approach, the actual mechanism of flush system event generation is determined at run-time of the network apparatus (such as the apparatus 300) based on various parameters, which are determined by the application by the application developer. In one variant, these parameters comprise the width of the plasticity window, and/or network error tolerance. In another variant, the flush events are generated using a stochastic model, where some loss of accuracy of the network performance is traded for simplicity of the network apparatus. These mechanisms form a category of techniques that reduces the overall number and frequency of flush system events without deteriorating the accuracy or performance of the simulation.
Referring now to
The shared memory block is accessible and shared by a number of post-synaptic units (such as the units 122 in
The embodiment of
Partitioned Network Apparatus
Typically, the following synaptic computations are performed for each post-synaptic unit receiving a pre-synaptic pulse:
The lazy synaptic update mechanism, described supra, results in efficient access of the synaptic memory block 310, and improves the steps (a), (d) and (e) above. A network comprising a large number of units and connections, requires a large number of post-synaptic neuron updates for every pre-synaptic pulse (steps (b) and (c) above). The update approach of the invention described below, advantageously improves performance of steps (b) and (c) by providing an efficient access mechanism for the neuronal state information (post-synaptic neuron timing and post-synaptic neuronal variables).
In an exemplary non-partitioned network, every unit stores a single connectivity table that describes all of the unit connections within the network (e.g., connections 114 in
Thus, at any point of execution, the on-chip memory that stores the neuronal state information, needs to store only a small subset (N/P) of the entire network neuronal state, where N is the total number of units, and P is the total number of partitions.
One particular embodiment of the network processing apparatus 910 is shown and described with respect to
The synaptic block comprises multiple synaptic computations instances 922 that evaluate the synaptic computation for many synapses in parallel. Although only three instances 922 are shown in
The synaptic computation block comprises a partition memory cache 924 is shared by multiple instances 922 as shown in
The synaptic computation block is coupled to the synaptic memory 918 via the synaptic memory bus 912, and to the neuronal block via the bus 916. The neuronal block 914 comprises a neuronal processing unit 926 and neuronal memory 928, which stores information related to the units (within the network 900), such as the spike timing, unit internal state, etc.
In the embodiment of
In another embodiment (shown in
In a different embodiment shown in
It will be appreciated that the embodiments shown in
During operation of the exemplary network 900, each partition data (comprising the neuronal data for that partition) is stored in the shared memory cache 924 directly or by caching mechanism, and updated one after another. The entire state resides in the off-chip global state memory 300. The connection table is also broken into P connection sub-tables, where each sub-table stores all the incoming connections for one particular partition. The network synaptic update computations are performed one partition at a time in a predetermined partition sequence. During synaptic update phase, the synaptic variables are streamed via the bus 912 to/from the synaptic memory 918, and various post-synaptic updates are concurrently applied to the data within the partition buffer or cache 924. That is, each synaptic computation block 922 reads the synaptic variables associated with a given pre-synaptic pulse from the synaptic memory 918, examines the pulse timing of the post-synaptic neuronal state stored in the local partition cache 924, calculates new synaptic variables (including the synaptic weights), updates the post-synaptic neuronal state using the updated synaptic variables, and stores the modified synaptic variables (including the synaptic weight) back in the synaptic memory 918.
Having smaller partition size (e.g., fewer units within each partition 902) reduces the on-chip memory 924 requirements but increases the number of partitions. Furthermore, if the number of post-synaptic units within a partition small, than each pre-synaptic pulse will require an update of only a small subset of the post-synaptic neuronal states for the partition. As a result, the amount of data streamed through the memory bus 912 is reduced when smaller partitions are used, resulting in a less efficient usage of the memory bus 912 due to increased overhead associated with the multiple memory transactions (such as the overhead block 436 in
Larger partitions, comprising more units, require larger on-chip memory 924 in order to store the synaptic connection data for the units. Hence, a trade-off exists between the number of partitions, efficient usage of the streaming synaptic memory bandwidth, and the size of the simulated network.
When a pre-synaptic neuron fires, the generated pre-synaptic pulse may affect a large number (depending on a specific network topology) of post-synaptic neurons. As discussed above with respect to synaptic variables updates, in a pre-synaptically indexed memory model, access to post-synaptically indexed units is inefficient. Thus each pre-pulse will result in multiple accesses of the neuronal memory while updating the post-synaptic neuronal states. Such fragmented access results result in an inefficient utilization of memory bus bandwidth. By way of example, consider one variant of network processing apparatus (such as the apparatus 910) which implements neuronal bus 916 having the minimum transaction size of 16 words. That is, 16 sequential neuron unit data items (comprising, for example, the unit state, recent firing time, and firing history) are retrieved/stored from/to a given memory address range in a single transaction. Consider that the neuronal updates are applied to memory locations at <40>, <4000>, . . . , <52>, <4010>, <5000>, and so on. By ordering (sorted) the memory requests as {<40>, <52>, <4000>, <4010>, <5000>} the total number of memory transactions on the neuronal bus 916 is reduced, because multiple neuronal states can be simultaneously read or stored within one transaction. In the above example, the data at addresses <40> and <52>, <4000> and <4010> are accessed within a single bus-transaction, thereby reducing the number of bus 916 transactions (and hence the bus overhead) and improving bus utilization. Note that the above grouping of memory transactions increases bus use efficiency, provided that the adjacent addresses are within the minimum transaction size address range (16 words in the above example).
For reordering the memory transaction, the synaptic connections for the given pre-synaptic neuron can be rearranged based on the memory-addresses of the post-synaptic neuronal address (as indicated, for example, by the target unit ID 326 in
Exemplary Uses and Applications of Certain Aspects of the Invention
Apparatus and methods for implementing lazy up-to-date synaptic update in a pulse-coded network offer mechanisms that substantially improve synaptic memory access efficiency compared to the previously used un-coalesced memory transactions. This improved memory access can advantageously be used to process a larger number of synaptic connections (for the same bus throughput) or to realize pulse coded networks using a less costly memory bus implementations (i.e., a lower speed and/or a smaller bus width).
Furthermore, the synaptic memory update mechanism that is based on the pre-synaptic pulse generation/receipt provides an up-to-date synaptic connection information and, therefore, improves network accuracy.
The mechanism described in this invention can be utilized to implement many different types of synaptic plasticity models described in literature (see Izhikevich E. M. and Desai N. S. (2003), incorporated herein supra.
The approach and mechanism described in this invention is applicable to various hardware platform including Graphics Processors, Field Programmable Gate Arrays, and dedicated ASICs.
Moreover, the use of system events further improves timeliness of synaptic updates and allows for a simpler network implementation with reduce unit memory size.
As previously noted, methods for efficient synaptic variable update that implement lazy update scheme, described with respect to
Advantageously, exemplary embodiments of the present invention can be built into any type of spiking neural network model that are useful in a variety of devices including without limitation prosthetic devices, autonomous and robotic apparatus, and other electromechanical devices requiring objet recognition functionality. Examples of such robotic devises are manufacturing robots (e.g., automotive), military, medical (e.g. processing of microscopy, x-ray, ultrasonography, tomography). Examples of autonomous vehicles include rovers, unmanned air vehicles, underwater vehicles, smart appliances (e.g. ROOMBA®), etc.
Embodiments of the present invention are further applicable to a wide assortment of applications including computer human interaction (e.g., recognition of gestures, voice, posture, face, etc.), controlling processes (e.g., an industrial robot, autonomous and other vehicles), augmented reality applications, organization of information (e.g., for indexing databases of images and image sequences), access control (e.g., opening a door based on a gesture, opening an access way based on detection of an authorized person), detecting events (e.g., for visual surveillance or people or animal counting, tracking), data input, financial transactions (payment processing based on recognition of a person or a special payment symbol) and many others.
It will be recognized that while certain aspects of the invention are described in terms of a specific sequence of steps of a method, these descriptions are only illustrative of the broader methods of the invention, and may be modified as required by the particular application. Certain steps may be rendered unnecessary or optional under certain circumstances. Additionally, certain steps or functionality may be added to the disclosed embodiments, or the order of performance of two or more steps permuted. All such variations are considered to be encompassed within the invention disclosed and claimed herein.
While the above detailed description has shown, described, and pointed out novel features of the invention as applied to various embodiments, it will be understood that various omissions, substitutions, and changes in the form and details of the device or process illustrated may be made by those skilled in the art without departing from the invention. The foregoing description is of the best mode presently contemplated of carrying out the invention. This description is in no way meant to be limiting, but rather should be taken as illustrative of the general principles of the invention. The scope of the invention should be determined with reference to the claims.
This application is a continuation of and claims priority to co-owned, U.S. patent application Ser. No. 13/239,259, entitled “APPARATUS AND METHOD FOR PARTIAL EVALUATION OF SYNAPTIC UPDATES BASED ON SYSTEM EVENTS”, filed Sep. 21, 2011, now U.S. Pat. No. 8,725,662, which is incorporated by reference herein in its entirety. This application is related to co-owned U.S. patent application Ser. No. 13/239,255, and entitled “APPARATUS AND METHODS FOR SYNAPTIC UPDATE IN A PULSE-CODED NETWORK”, filed Sep. 21, 2011, U.S. patent application Ser. No. 13/239,123, entitled “ELEMENTARY NETWORK DESCRIPTION FOR NEUROMORPHIC SYSTEMS”, filed Sep. 21, 2011, U.S. patent application Ser. No. 13/239,148, entitled “ELEMENTARY NETWORK DESCRIPTION FOR EFFICIENT LINK BETWEEN NEURONAL MODELS AND NEUROMORPHIC SYSTEMS”, filed Sep. 21, 2011, now U.S. Pat. No. 8,712,941, U.S. patent application Ser. No. 13/239,155, entitled “ELEMENTARY NETWORK DESCRIPTION FOR EFFICIENT MEMORY MANAGEMENT IN NEUROMORPHIC SYSTEMS”, filed Sep. 21, 2011, now U.S. Pat. No. 8,725,658, and U.S. patent application Ser. No. 13/239,163, entitled “ELEMENTARY NETWORK DESCRIPTION FOR EFFICIENT IMPLEMENTATION OF EVENT-TRIGGERED PLASTICITY RULES IN NEUROMORPHIC SYSTEMS”, filed Sep. 21, 2011, now U.S. Pat. No. 8,719,199, each of the foregoing incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5063603 | Burt | Nov 1991 | A |
5355435 | DeYong et al. | Oct 1994 | A |
5638359 | Peltola et al. | Jun 1997 | A |
5673367 | Buckley | Sep 1997 | A |
5875108 | Hoffberg et al. | Feb 1999 | A |
5980096 | Thalhammer-Reyero | Nov 1999 | A |
6009418 | Cooper | Dec 1999 | A |
6014653 | Thaler | Jan 2000 | A |
6458157 | Suaning | Oct 2002 | B1 |
6545705 | Sigel et al. | Apr 2003 | B1 |
6545708 | Tamayama et al. | Apr 2003 | B1 |
6546291 | Merfeld et al. | Apr 2003 | B2 |
6581046 | Ahissar | Jun 2003 | B1 |
7536374 | Au | May 2009 | B2 |
7849030 | Ellingsworth | Dec 2010 | B2 |
8015130 | Matsugu et al. | Sep 2011 | B2 |
8103602 | Izhikevich | Jan 2012 | B2 |
8315305 | Petre et al. | Nov 2012 | B2 |
8467623 | Izhikevich et al. | Jun 2013 | B2 |
8712939 | Szatmary et al. | Apr 2014 | B2 |
8712941 | Izhikevich et al. | Apr 2014 | B2 |
8719199 | Izhikevich et al. | May 2014 | B2 |
8725658 | Izhikevich et al. | May 2014 | B2 |
8725662 | Izhikevich et al. | May 2014 | B2 |
20020038294 | Matsugu | Mar 2002 | A1 |
20030050903 | Liaw et al. | Mar 2003 | A1 |
20040193670 | Langan et al. | Sep 2004 | A1 |
20050015351 | Nugent | Jan 2005 | A1 |
20050036649 | Yokono et al. | Feb 2005 | A1 |
20050283450 | Matsugu et al. | Dec 2005 | A1 |
20060161218 | Danilov | Jul 2006 | A1 |
20060224533 | Thaler | Oct 2006 | A1 |
20070176643 | Nugent | Aug 2007 | A1 |
20070208678 | Matsugu | Sep 2007 | A1 |
20090043722 | Nugent | Feb 2009 | A1 |
20090287624 | Rouat et al. | Nov 2009 | A1 |
20100086171 | Lapstun | Apr 2010 | A1 |
20100119214 | Shimazaki et al. | May 2010 | A1 |
20100166320 | Paquier | Jul 2010 | A1 |
20110016071 | Guillen et al. | Jan 2011 | A1 |
20110106741 | Denneau et al. | May 2011 | A1 |
20110119214 | Breitwisch et al. | May 2011 | A1 |
20110119215 | Elmegreen et al. | May 2011 | A1 |
20110160741 | Asano et al. | Jun 2011 | A1 |
20120011090 | Tang et al. | Jan 2012 | A1 |
20120109866 | Modha | May 2012 | A1 |
20120303091 | Izhikevich | Nov 2012 | A1 |
20120308076 | Piekniewski et al. | Dec 2012 | A1 |
20120308136 | Izhikevich | Dec 2012 | A1 |
20130073491 | Izhikevich et al. | Mar 2013 | A1 |
20130073495 | Izhikevich et al. | Mar 2013 | A1 |
20130073500 | Szatmary et al. | Mar 2013 | A1 |
20130151448 | Ponulak | Jun 2013 | A1 |
20130151450 | Ponulak | Jun 2013 | A1 |
20130218821 | Szatmary et al. | Aug 2013 | A1 |
20130251278 | Izhikevich et al. | Sep 2013 | A1 |
20130297539 | Piekniewski et al. | Nov 2013 | A1 |
20130297541 | Piekniewski et al. | Nov 2013 | A1 |
20130297542 | Piekniewski et al. | Nov 2013 | A1 |
20130325768 | Sinyavskiy et al. | Dec 2013 | A1 |
20130325773 | Sinyavskiy et al. | Dec 2013 | A1 |
20130325774 | Sinyavskiy et al. | Dec 2013 | A1 |
20130325775 | Sinyavskiy et al. | Dec 2013 | A1 |
20130325776 | Ponulak et al. | Dec 2013 | A1 |
20130325777 | Petre et al. | Dec 2013 | A1 |
20140012788 | Piekniewski | Jan 2014 | A1 |
20140032458 | Sinyavskiy et al. | Jan 2014 | A1 |
20140032459 | Sinyavskiy et al. | Jan 2014 | A1 |
20140052679 | Sinyavskiy et al. | Feb 2014 | A1 |
20140064609 | Petre et al. | Mar 2014 | A1 |
20140081895 | Coenen et al. | Mar 2014 | A1 |
20140122397 | Richert et al. | May 2014 | A1 |
20140122398 | Richert | May 2014 | A1 |
20140122399 | Szatmary et al. | May 2014 | A1 |
20140156574 | Piekniewski et al. | Jun 2014 | A1 |
20140219497 | Richert | Aug 2014 | A1 |
20140222739 | Ponulak | Aug 2014 | A1 |
20140229411 | Richert et al. | Aug 2014 | A1 |
20140244557 | Piekniewski et al. | Aug 2014 | A1 |
20140250036 | Izhikevich et al. | Sep 2014 | A1 |
20140250037 | Izhikevich et al. | Sep 2014 | A1 |
Number | Date | Country |
---|---|---|
102226740 | Oct 2011 | CN |
4087423 | May 2008 | JP |
2108612 | Apr 1998 | RU |
2406105 | Dec 2010 | RU |
2424561 | Jul 2011 | RU |
201110040 | Mar 2011 | TW |
2008083335 | Jul 2008 | WO |
2008132066 | Nov 2008 | WO |
Entry |
---|
Abbott L.F., et al., “Synaptic plasticity: taming the beast,” Nature Neuroscience, Nov. 2008, vol. 3, 1178-1183. |
Aleksandrov, Stochastic optimization, Engineering Cybernetics, 5 (1968), 1116. |
Amari S., et al., “Why natural gradient?,” Acoustics, Speech and Signal Processing, Proceedings of the 1998 IEEE International conference, 1998, vol. 2, pp. 1213-1216. |
Baras D., et al., Reinforcement learning, spike-time-dependent plasticity, and the BCM rule, Neural Computation, Aug. 2007, vol. 19 Issue 8, pp. 2245-2279. |
Bartlett et al., A biologically plausible and locally optimal learning algorithm for spiking neurons, 2000, pp. 1-7, retrieved from http://arp.anu.edu.au/ftp/papers/jon/brains.pdf. |
Baxter J., et al., Direct gradient-based reinforcement learning, In Proceedings of the International Symposium on Circuits and Systems, 2000, vol. 3, pp. 271-274. |
Bohte et al., “Spike Prop: backpropagation for networks of spiking neurons,” In Proceedings of ESANN'2000, pp. 419-424. |
Bohte., “Spiking Neural Networks”, Doctorate at the University of Leiden, Holland, URL:http://homepages.cwi.nl/˜sbohte/publication/phdthesis.pdf, Mar. 5, 2003, pp. 1-133. |
Bohte S.M., A computational theory of spike-timing dependent plasticity: achieving robust neural responses via conditional entropy minimization, 2005,SEN-E0505. |
Booij., “A Gradient Descent Rule for Spiking Neurons Emitting Multiple Spikes”, Information Processing Letters, N. 6, v.95, pp. 552-558. |
Brette et al., “Brian: a simple and flexible simulator for spiking neural networks,” The Neuromorphic Engineer, Jul. 1, 2009, pp. 1-4, doi: 10.2417/1200906.1659. |
Brette., “On the design of script languages for neural simulation”, Laboratoire Psychologie de la Perception, CNRS Universite Paris Descartes, Paris, France, 7 pp. |
Brette., “Vectorised algorithms for spiking neural network simulation” Oct. 2010, 23pp. |
Cuntz, et al., “One Rule to Grow Them All: A General Theory of Neuronal Branching and Its Paractical Application”, PLOS Computational Biology, 6 (8), Published Aug. 5, 2010. |
Davison A.P., et al., “PyNN: a common interface for neuronal network simulators”, Frontiers in Neuroinformatics, Jan. 2009, pp. 1-10, vol. 2, Article 11. |
Djurfeldt M., “The Connection-set Algebra: a formalism for the representation of connectivity structure in neuronal network models, implementations in Python and C++, and their use in simulators”, BMC Neuroscience , Jul. 18, 2011, p. 1, 12(Suppl 1):P80. |
El-Laithy K., “A reinforcement learning framework for spiking networks with dynamic synapses,” Computational Intelligence and Neuroscience, Jan. 2011 , vol. 2011. |
Farabet C., et al., “NeuFlow: A Runtime Reconfigurable Dataflow Processor for Vision”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2011, pp. 109-116. |
Fidjeland et al., “Accelerated Simulation of Spiking Neural Networks Using GPUs” WCCI 2010 IEEE World Congress on Compulational Intelligence, Jul. 18-23, 2010—CCIB, Barcelona, Spain, pp. 536-543, [retrieved on Nov. 14, 2012]. Retrieved from the Internet: <URL: http://www.doc.ic.ac.uk/-mpsha/IJCNN10b.pdf>. |
Fletcher., “Practical methods of optimization”, New York, NY: Wiley-Interscience, 1987. |
Floreano D., et al., “Neuroevolution: from architectures to learning”, Evolutionary Intelligence, Jan. 2008, vol. 1, pp. 47-62. |
Florian R.V., “A reinforcement learning algorithm for spiking neural networks”, SYNASC '05 Proceedings of the Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, 2005. |
Froemke, et al.,. “Temporal modulation of spike-timing-dependent plasticity,” Frontiers in Synaptic Neuroscience, vol. 2, Article 19, Jun. 2010, 16 pgs. |
Fu M., “Stochastic Gradient Estimation,” 2005, pp. 32. |
Fu M.C., What You Should Know About Simulation and Derivatives Naval Research Logistics, 2008, vol. 55, No. 8, pp. 723-736. |
Fyfe C., et al., “Reinforcement Learning Reward Functions for Unsupervised Learning”, ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks, 2007, vol. 4491, pp. 397-402. |
Gerstner, Spiking neuron models: single neurons, populations, plasticity, Cambridge, U.K: Cambridge University Press, 2002. |
Gewaltig, et al., “NEST by example: an introduction to the neural simulation tool NEST”, Computational Systems Neurobiology Springer, Dordrecht, 2012, 27 pages. |
Gewaltig et al., “NEST (NEural Simulation Tool)”, Scholarpedia, 2007, pp. 1-15, 2(4):1430, Doi:10.4249/scholarpedia.1430. |
Gleeson et al., “NeuroML: A Language for Describing Data Driven Models of Neurons and Networks with a High Degree of Biological Detail”, PLoS Computational Biology, Jun. 2010, pp. 1-19 vol. 6 Issue 6. |
Gluck, “Stimulus Generalization and Representation in Adaptive Network Models of Category Learning” Psychological Science, vol. 2, No. 1, Jan. 1991, pp. 50-55. |
Glynn P.W., et al., “Likelihood ratio gradient estimation for regenerative stochastic recursion”, Advances in Applied Probability, 1995, 27, 4, 1019-1053. |
Goodman., “Code Generation: A Strategy for Neural Network Simulators”, Neuroinform, Springer Science + Business Media, LLC , Human Press, Sep. 2010, 14 pp. |
Goodman et al., “Brian: a simulator for spiking neural networks in Python”, Frontiers in Neuroinformatics, Nov. 2008, pp. 1-10, vol. 2, Article 5. |
Goodman, et al., “The Brian Simulator”, Frontiers in Neuroscience, Focused Review, Sep. 15, 2009, pp. 192-197. |
Gorchetchnikov et al., “NineML: declarative, mathematically-explicit descriptions of spiking neuronal networks”, Frontiers in Neuroinformatics, Conference Abstract: 4th INCF Congress of Neuroinformatics, doi: 10.3389/conf.fninf.2011.08.00098. |
Graham L., “The Surf Hippo User Manual Version 3.0 B”, Unite de Neurosiences Integratives et Computationnelles Institut Federatif de Neurobiologie Alfred Fessard, CNRS. France. Mar. 2002 [retrieved Jan. 16, 2014]. [retrieved biomedicale.univ-paris5.fr]. |
Graham L., “The Surf-Hippo Reference Manual”, Mar. 2002, pp. 1-128, http://www.neurophys.biomedicale.univparis5.fr/-graham/surf-hippo-files/Surf-Hippo%20Reference%20Reference%20Manual.pdf. |
Izhikevich E. M. “Simple model of spiking neurons”, IEEE Transactions on Neural Networks, Nov. 1, 2003, vol. 14, No. 6, pp. 1569-1572, IEEE Service Center, Piscataway, NJ, US, XP011105173, ISSN: 1045-9227, DOI: 10.1109/TNN.2003.820440. |
Izhikevich, E. M., “Solving the Distal Reward Problem Through Linkage of STDP and Dopamine Signaling”, In Cerebral Cortex, pp. 2443-2452, Oct. 2007. |
Izhikevich et al., “Relating STDP to BCM”, Neural Computation (2003) 15, 1511-1523. |
Izhikevich, “Polychronization: Computation with Spikes”, Neural Computation, 25, 2006, 18, 245-282. |
Jin X. et al., “Implementing Spike-Timing-Dependent Plasticity on SpiNNaker Neuromorphic Hardware”, Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN '10), (Jul. 18, 2010), XP031771405, DOI: 10.1109/IJCNN.2010.5596372, sections III and v. |
Karbowski et al., “Multispikes and Synchronization in a Large Neural Network with Temporal Delays”, Neural Computation, 2000, 12, 1573-1606. |
Khotanzad, “Classification of invariant image representations using a neural network” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 38, No. 6, Jun. 1990, pp. 1028-1038. |
Kiefer, “Stochastic Estimation of the Maximum of a Regression Function,” Annals of Mathematical Statistics 23, #3, 462-466, 1952. |
Klampfl S., et al., “Spiking neurons can learn to solve information bottleneck problems and extract independent components”, Neural Computation, (2009), 21(4), pp. 911-959. |
Kleijnen et al., “Optimization and sensitivity analysis of computer simulation models by the score function method”, Invited Review European Journal of Operational Research, Mar. 1995. |
Larochelle H., et al., “Exploring Strategies for Training Deep Neural Networks”, Journal of Machine Learning Research, v. 10, pp. 1-40. |
Laurent, “Issue 1—nnql—Refactor Nucleus into its own file—Neural Network Query Language” [retrieved on Nov. 12, 2012]. Retrieved from the Internet: <URL:https://code.google.com/p/nnql/issues/detail?id=1>. |
Laurent, “The Neural Network Query Language (NNQL) Reference” [retrieved on Nov. 12, 2013]. Retrieved from the Internet: <URL:http://nnql.org/nnql.org>. |
Morrison et al., “Advancing the Boundaries of High-Connectivity Network Simulation with Distributed Computing”, Neural Compuation 17, 2005, pp. 1775-1801. |
Nageswaran J.M., et al., “Computing Spike-based Convolutions on GPUs”, IEEE International Symposium on Circuits and Systems, 2009, ISCAS 2009, May, pp. 1917-1920. |
Neuflow., “A Data Flow Processor”, www.neuflow.org/category/neuflow-2/, Dec. 2010, 3 pp. |
Nichols., “A Reconfigurable Computing Architecture for Implementing Artificial Neural Networks on FPGA”, Masters Thesis, The University of Guelph, 2003, pp. 1-235. |
Paugam-Moisy H., et al., “Computing with spiking neuron networks”, Handbook of Natural Computing, Springer-Verlag, 2010, pp. 1-47. |
Pavlidis N.G., et al., “Spiking neural network training using evolutionary algorithms”, In Proceedings 2005 IEEE International Joint Conference on Neural Networks, 2005, IJCNN'05, vol. 4, pp. 2190-2194. |
Pecevski., et al., PCSIM: a parallel simulation environment for neural circuits fully integrated with Python [online], 2009 [retrieved on Jan. 12, 2015]. Retrieved from the Internet<URL:http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2698777/pdf/fninf-03-011.pdf>. |
Pfister J., et al., “Optimal Spike-Timing-Dependent Plasticity for Precise Action Potential Firing in Supervised Learning, ” Neural Computing, vol. 18, No. 6, Apr. 28, 2006, pp. 1318-1348. [retrieved Oct. 23, 2013]. [retrieved online from ebscohost.com]. |
Pfister, “Optimal Hebbian Learning: A Probabilistic Point of View”, In ICANN Proceedings. Springer, (2003). |
Plesser et al., “Efficient Parallel Simulation of Large-Scale Neuronal Networks on Clusters of Multiprocessor Computers”, Springer-Verlag Berlin Heidelberg, 2007, 10pp. |
Reiman M.I., et al., “Sensitivity analysis for simulations via likelihood ratios”, Oper Res 37, 1989, 830-844. |
Robbins H., et al., “A Stochastic Approximation Method,” Annals of Mathematical Statistics 22, #3, 1951, pp. 400-407. |
Rosenstein M T et al., “Supervised learning combined with an actor-critic architecture,” Technical Report 02-41, Department of Computer Science, University of Massachusetts, Amherst, Oct. 18, 2002, pp. 1-10. |
Rumelhart et al., “Learning representations by back-propagating errors,” Nature 323 (6088) , 1986, pp. 533-536. |
Rumelhart., “Learning internal representations by error propagation,” Parallel distributed processing, 1986, vol. 1 (pp. 318-362), Cambridge, MA: MIT Press. |
Schemmel J. et al., “Implementing Synaptic Plasticity in a VLSI Spiking Neural Network Model”, International Joint Conference on Neural Networks, 2006. IJCNN '06, Piscataway, NJ: IEEE Operations Center, Piscataway, NJ, USA, Jan. 1, 2006, pp. 1-6, XP002557202, ISBN: 978-0-7803-9490-2 Retrieved from the Internet: URL:http://www.kip.uni-heidelberg.de/Veroeffentiichungen/download.cgi/4620/ps/1774.pdf [retrieved on Nov. 23, 2009]. |
Simulink.RTM. model [online], [Retrieved on Dec. 10, 2013] Retrieved from the Internet;URL: http://www.mathworks.com/ products/simulink/index.html>. |
Sinyavskiy et al., “Generalized Stochatic Spiking Neuron Model and Extended Spike Response Model in Spatial-Temporal Impulse Pattern Detection Task”, Optical Memory and Neural Networks (Information Optics), 2010, vol. 19, No. 4, pp. 300-309. |
Sinyavskiy O.Y., et al., “Reinforcement learning of a spiking neural network in the task of control of an agent in a virtual discrete environment,” Rus. J. Nonlin. Dyn., 2011, vol. 7, No. 4 (Mobile Robots), pp. 859-875, chapters 1-8 (Russian Article with English Abstract). |
Sjostrom et al., “Spike-Timing Dependent Plasticity”, Scholarpedia, 5(2):1362 (2010), pp. 1-18. |
Szatmary et al., “Spike-timing Theory of Working Memory”, PLoS Computational Biology, vol. 6, Issue 8, Aug. 19, 2010 [retrieved on Dec. 30, 2013]. Retrieved from the Internet: <URL: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000879#>. |
Tishby et al., “The information bottleneck method”, In Proceedings of the 37th Annual Allerton Conference on Communication, 1999, Control and Computing, B Hajek & RS Sreenivas, eds., pp. 368-377, University of Illinois. |
Toyoizumi et al., “Generalized Bienenstock-Cooper-Munro rule for spiking neurons that maximizes information transmission,” Proc. Natl. Acad. Sci. USA, 2005, 102, pp. 5239-5244. |
Toyoizumi, T., et al., “Optimality Model of Unsupervised Spike-Timing-Dependent Plasticity: Synaptic Memory and Weight Distribution,” In Neural Computation, (19) 3: 639-671, Year 2007. |
Weaver L., et al., “The Optimal Reward Baseline for Gradient-Based Reinforcement Learning”, UAI 01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001, pp. 538-545, Morgan Kaufman Publishers. |
Weber C., et al., “Goal-Directed Feature Learning,” In: Proc, International Joint Conference on Neural Networks, 2009, pp. 3319-3326. |
Williams R.J., et al., “Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning,” Machine Learning, 1992, vol. 8, Issue. 3-4, pp. 229-256. |
Yi S., et al., “Stochastic search using the natural gradient,” ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning, 2009, New York, NY, USA. |
Number | Date | Country | |
---|---|---|---|
20140372355 A1 | Dec 2014 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13239259 | Sep 2011 | US |
Child | 14275663 | US |