This application is related to co-pending and co-owned U.S. patent application Ser. No. 14/070,239 entitled “REDUCED DEGREE OF FREEDOM ROBOTIC CONTROLLER APPARATUS AND METHODS”, filed herewith, and U.S. patent application Ser. No. 14/070,114 entitled “APPARATUS AND METHODS FOR ONLINE TRAINING OF ROBOTS”, filed herewith, each of the foregoing being incorporated herein by reference in its entirety.
This application is also related to commonly owned, and co-pending U.S. patent application Ser. No. 13/866,975, entitled “APPARATUS AND METHODS FOR REINFORCEMENT-GUIDED SUPERVISED LEARNING”, filed Apr. 19, 2013, Ser. No. 13/918,338 entitled “ROBOTIC TRAINING APPARATUS AND METHODS”, filed Jun. 14, 2013, 13/918,298, entitled “HIERARCHICAL ROBOTIC CONTROLLER APPARATUS AND METHODS”, filed Jun. 14, 2013, Ser. No. 13/907,734, entitled “ADAPTIVE ROBOTIC INTERFACE APPARATUS AND METHODS”, filed May 31, 2013, Ser. No. 13/842,530, entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS”, filed Mar. 15, 2013, Ser. No. 13/842,562, entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS FOR ROBOTIC CONTROL”, filed Mar. 15, 2013, Ser. No. 13/842,616, entitled “ROBOTIC APPARATUS AND METHODS FOR DEVELOPING A HIERARCHY OF MOTOR PRIMITIVES”, filed Mar. 15, 2013, Ser. No. 13/842,647, entitled “MULTICHANNEL ROBOTIC CONTROLLER APPARATUS AND METHODS”, filed Mar. 15, 2013, Ser. No. 13/842,583, entitled “APPARATUS AND METHODS FOR TRAINING OF ROBOTIC DEVICES”, filed Mar. 15, 2013, Ser. No. 13/152,084, filed Jun. 2, 2011, entitled “APPARATUS AND METHODS FOR PULSE-CODE INVARIANT OBJECT RECOGNITION”, Ser. No. 13/757,607, filed Feb. 1, 2013, entitled “TEMPORAL WINNER TAKES ALL SPIKING NEURON NETWORK SENSORY PROCESSING APPARATUS AND METHODS”, Ser. No. 13/623,820, filed Sep. 20, 2012, entitled “APPARATUS AND METHODS FOR ENCODING OF SENSORY DATA USING ARTIFICIAL SPIKING NEURONS”, Ser. No. 13/623,842, entitled “SPIKING NEURON NETWORK ADAPTIVE CONTROL APPARATUS AND METHODS”, filed Sep. 20, 2012, Ser. No. 13/487,499, entitled “STOCHASTIC APPARATUS AND METHODS FOR IMPLEMENTING GENERALIZED LEARNING RULES”, filed Jun. 4, 2012, Ser. No. 13/465,903 entitled “SENSORY INPUT PROCESSING APPARATUS IN A SPIKING NEURAL NETWORK”, filed May 7, 2012, Ser. No. 13/488,106, entitled “SPIKING NEURON NETWORK APPARATUS AND METHODS”, filed Jun. 4, 2012, Ser. No. 13/541,531, entitled “CONDITIONAL PLASTICITY SPIKING NEURON NETWORK APPARATUS AND METHODS”, filed Jul. 3, 2012, Ser. No. 13/691,554, entitled “RATE STABILIZATION THROUGH PLASTICITY IN SPIKING NEURON NETWORK”, filed Nov. 30, 2012, Ser. No. 13/660,967, entitled “APPARATUS AND METHODS FOR ACTIVITY-BASED PLASTICITY IN A SPIKING NEURON NETWORK”, filed Oct. 25, 2012, Ser. No. 13/660,945, entitled “MODULATED PLASTICITY APPARATUS AND METHODS FOR SPIKING NEURON NETWORKS”, filed Oct. 25, 2012, Ser. No. 13/774,934, entitled “APPARATUS AND METHODS FOR RATE-MODULATED PLASTICITY IN A SPIKING NEURON NETWORK”, filed Feb. 22, 2013, Ser. No. 13/763,005, entitled “SPIKING NETWORK APPARATUS AND METHOD WITH BIMODAL SPIKE-TIMING DEPENDENT PLASTICITY”, filed Feb. 8, 2013, Ser. No. 13/660,923, entitled “ADAPTIVE PLASTICITY APPARATUS AND METHODS FOR SPIKING NEURON NETWORK”, filed Oct. 25, 2012, Ser. No. 13/239,255 filed Sep. 21, 2011, entitled “APPARATUS AND METHODS FOR SYNAPTIC UPDATE IN A PULSE-CODED NETWORK”, Ser. No. 13/588,774, entitled “APPARATUS AND METHODS FOR IMPLEMENTING EVENT-BASED UPDATES IN SPIKING NEURON NETWORKS”, filed Aug. 17, 2012, Ser. No. 13/560,891 entitled “APPARATUS AND METHODS FOR EFFICIENT UPDATES IN SPIKING NEURON NETWORK”, filed Jul. 27, 2012, Ser. No. 13/560,902, entitled “APPARATUS AND METHODS FOR STATE-DEPENDENT LEARNING IN SPIKING NEURON NETWORKS”, filed Jul. 27, 2012, Ser. No. 13/722,769 filed Dec. 20, 2012, and entitled “APPARATUS AND METHODS FOR STATE-DEPENDENT LEARNING IN SPIKING NEURON NETWORKS”, Ser. No. 13/842,530 entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS”, filed Mar. 15, 2013, Ser. No. 13/239,255 filed Sep. 21, 2011, entitled “APPARATUS AND METHODS FOR SYNAPTIC UPDATE IN A PULSE-CODED NETWORK”, Ser. No. 13/487,576entitled “DYNAMICALLY RECONFIGURABLE STOCHASTIC LEARNING APPARATUS AND METHODS”, filed Jun. 4, 2012; Ser. No. 13/953,595 entitled “APPARATUS AND METHODS FOR TRAINING AND CONTROL OF ROBOTIC DEVICES”, filed Jul. 29, 2013; Ser. No. 13/918,620 entitled “PREDICTIVE ROBOTIC CONTROLLER APPARATUS AND METHODS”, filed Jun. 14, 2013; and commonly owned U.S. Pat. No. 8,315,305, issued Nov. 20, 2012, entitled “SYSTEMS AND METHODS FOR INVARIANT PULSE LATENCY CODING”; each of the foregoing incorporated herein by reference in its entirety.
A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
Technological Field
The present disclosure relates to adaptive control and training, such as control and training of robotic devices.
Background
Robotic devices are used in a variety of industries, such as manufacturing, medical, safety, military, exploration, and/or other. Robotic “autonomy”, i.e., the degree of human control, varies significantly according to application. Some existing robotic devices (e.g., manufacturing assembly and/or packaging) may be programmed in order to perform desired functionality without further supervision. Some robotic devices (e.g., surgical robots) may be controlled by humans.
Robotic devices may comprise hardware components that enable the robot to perform actions in 1-dimension (e.g., a single range of movement), 2-dimensions (e.g., a plane of movement), and/or 3-dimensions (e.g., a space of movement). Typically, movement is characterized according to so-called “degrees of freedom”. A degree of freedom is an independent range of movement; a mechanism with a number of possible independent relative movements (N) is said to have N degrees of freedom. Some robotic devices may operate with multiple degrees of freedom (e.g., a turret and/or a crane arm configured to rotate around vertical and/or horizontal axes). Other robotic devices may be configured to follow one or more trajectories characterized by one or more state parameters (e.g., position, velocity, acceleration, orientation, and/or other). It is further appreciated that some robotic devices may simultaneously control multiple actuators (degrees of freedom) resulting in very complex movements.
One aspect of the disclosure relates to a non-transitory computer readable medium having instructions embodied thereon. The instructions, when executed, are configured to control a robotic platform.
In another aspect, a method of operating a robotic controller apparatus is disclosed. In one implementation, the method includes: determining a current controller performance associated with performing a target task; determining a “difficult” portion of a target trajectory associated with the target task, the difficult portion characterized by an extent of a state space; and providing a training input for navigating the difficult portion, the training input configured to transition the current performance towards the target trajectory.
In one variant, the difficult portion of the target trajectory is determined based at least on the current performance being outside a range from the target trajectory; the state space is associated with performing of the target task by the controller, and performing by the controller of a portion of the target task outside the extent is configured based on autonomous controller operation.
In another variant, the controller is operable in accordance with a supervised learning process configured based on the teaching input, the learning process being adapted based on the current performance; and the navigating of the difficult portion is based at least in part on a combination of the teaching input and an output of the controller learning process.
In a further variant, the extent is characterized by a first dimension having a first value, and the state space is characterized by a second dimension having a second value; and the first value is less than one-half (½) of the second value.
In yet another variant, the controller is operable in accordance with a supervised learning process configured based on the teaching input and a plurality of training trials, the learning process being adapted based on the current performance; and the difficult trajectory portion determination is based at least on a number of trials within the plurality of trials required to attain the target performance.
In another aspect, an adaptive controller apparatus is disclosed. In one implementation, the apparatus includes a plurality of computer readable instructions configured to, when executed, cause performing of a target task by at least: during a first training trial, determining a predicted signal configured in accordance with a sensory input, the predicted signal configured to cause execution of an action associated with the target task, the action execution being characterized by a first performance; during a second training trial, based on a teaching input and the predicted signal, determining a combined signal configured to cause execution of the action, the action execution during the second training trial being characterized by a second performance; and adjusting a learning parameter of the controller based on the first performance and the second performance.
In one variant of the apparatus, the execution of the target task comprises execution of the action and at least one other action; the adjusting of the learning parameter is configured to enable the controller to determine, during a third training trial, another predicted signal configured in accordance with the sensory input; and the execution, based on the another predicted signal, of the action during the third training trial is characterized by a third performance that is closer to the target task compared to the first performance.
In another variant, execution of the target task the target task is characterized by a target trajectory in a state space; execution of the action is characterized by a portion of the target trajectory having a state space extent associated therewith; and the state space extent occupies a minority fraction of the state space.
In a further aspect, a robotic apparatus is disclosed. In one implementation, the apparatus includes a platform characterized by first and second degrees of freedom; a sensor module configured to provide information related to the platform's environment; and an adaptive controller apparatus configured to determine first and second control signals to facilitate operation of the first and the second degrees of freedom, respectively.
In one variant, the first and the second control signals are configured to cause the platform to perform a target action; the first control signal is determined in accordance with the information and a teaching input; the second control signal is determined in an absence of the teaching input and in accordance with the information and a configuration of the controller; and the configuration is determined based at least on an outcome of training of the controller to operate the second degree of freedom.
In another variant, the determination of the first control signal is effectuated based at least on a supervised learning process characterized by multiple iterations; and performance of the target action in accordance with the first control signal at a given iteration is characterized by a first performance.
In a further aspect, a method of optimizing the operation of a robotic controller apparatus is disclosed. In one implementation, the method includes: determining a current controller performance associated with performing a target task, the current performance being non-optimal for accomplishing the task; and for at least a selected first portion of a target trajectory associated with the target task, the first portion characterized by an extent of a state space, providing a training input that facilitates navigation of the first portion, the training input configured to transition the current performance towards the target trajectory.
In one variant, the first portion of the target trajectory is selected based at least on the current performance not meeting at least one prescribed criterion with respect to the target trajectory. The at least one prescribed criterion comprises for instance the current performance exceeding a disparity from, or range associated with, an acceptable performance.
In another variant, a performance by the controller of a portion of the target task outside the extent is effectuated in the absence of the training input.
In yet another variant, the controller is configured to be trained to perform the target task using multiple iterations; and for a given iteration of the multiple iterations, the selected first portion comprises a portion with a higher rate of non-optimal performance determined based on one or more prior iterations of the multiple iterations.
These and other features, and characteristics of the present disclosure, as well as the methods of operation and functions of the related elements of structure and the combination of parts and economies of manufacture, will become more apparent upon consideration of the following description and the appended claims with reference to the accompanying drawings, all of which form a part of this specification, wherein like reference numerals designate corresponding parts in the various figures. It is to be expressly understood, however, that the drawings are for the purpose of illustration and description only and are not intended as a definition of the limits of the disclosure. As used in the specification and in the claims, the singular form of “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise.
All Figures disclosed herein are ©Copyright 2013 Brain Corporation. All rights reserved.
Implementations of the present technology will now be described in detail with reference to the drawings, which are provided as illustrative examples so as to enable those skilled in the art to practice the technology. Notably, the figures and examples below are not meant to limit the scope of the present disclosure to a single implementation, but other implementations are possible by way of interchange of, or combination with, some or all of the described or illustrated elements. Wherever convenient, the same reference numbers will be used throughout the drawings to refer to same or like parts.
Where certain elements of these implementations can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present technology will be described, and detailed descriptions of other portions of such known components will be omitted so as not to obscure the disclosure.
In the present specification, an implementation showing a singular component should not be considered limiting; rather, the disclosure is intended to encompass other implementations including a plurality of the same components, and vice-versa, unless explicitly stated otherwise herein.
Further, the present disclosure encompasses present and future known equivalents to the components referred to herein by way of illustration.
As used herein, the term “bus” is meant generally to denote all types of interconnection or communication architecture that are used to access the synaptic and neuron memory. The “bus” may be electrical, optical, wireless, infrared, and/or any type of communication medium. The exact topology of the bus could be, for example: a standard “bus”, a hierarchical bus, a network-on-chip, an address-event-representation (AER) connection, and/or any other type of communication topology configured to access e.g., different memories in a pulse-based system.
As used herein, the terms “computer”, “computing device”, and “computerized device” may include one or more of personal computers (PCs) and/or minicomputers (e.g., desktop, laptop, and/or other PCs), mainframe computers, workstations, servers, personal digital assistants (PDAs), handheld computers, embedded computers, programmable logic devices, personal communicators, tablet computers, portable navigation aids, J2ME equipped devices, cellular telephones, smart phones, personal integrated communication and/or entertainment devices, and/or any other device capable of executing a set of instructions and processing an incoming data signal.
As used herein, the term “computer program” or “software” may include any sequence of human and/or machine cognizable steps which perform a function. Such program may be rendered in a programming language and/or environment including one or more of C/C++, C#, Fortran, COBOL, MATLAB™. PASCAL, Python, assembly language, markup languages (e.g., HTML, SGML, XML, VoXML), object-oriented environments (e.g., Common Object Request Broker Architecture (CORBA)), Java™ (e.g., J2ME, Java Beans), Binary Runtime Environment (e.g., BREW), and/or other programming languages and/or environments.
As used herein, the terms “synaptic channel”, “connection”, “link”, “transmission channel”. “delay line”, and “communications channel” include a link between any two or more entities (whether physical (wired or wireless), or logical/virtual) which enables information exchange between the entities, and may be characterized by a one or more variables affecting the information exchange.
As used herein, the term “memory” may include an integrated circuit and/or other storage device adapted for storing digital data. By way of non-limiting example, memory may include one or more of ROM, PROM, EEPROM, DRAM, Mobile DRAM, SDRAM, DDR/2 SDRAM, EDO/FPMS, RLDRAM, SRAM, “flash” memory (e.g., NAND/NOR), memristor memory, PSRAM, and/or other types of memory.
As used herein, the terms “integrated circuit (IC)”, and “chip” are meant to refer without limitation to an electronic circuit manufactured by the patterned diffusion of elements in or on to the surface of a thin substrate. By way of non-limiting example, integrated circuits may include field programmable gate arrays (e.g., FPGAs), programmable logic devices (PLD), reconfigurable computer fabrics (RCFs), application-specific integrated circuits (ASICs), printed circuits, organic circuits, and/or other types of computational circuits.
As used herein, the terms “microprocessor” and “digital processor” are meant generally to include digital processing devices. By way of non-limiting example, digital processing devices may include one or more of digital signal processors (DSPs), reduced instruction set computers (RISC), general-purpose (CISC) processors, microprocessors, gate arrays (e.g., field programmable gate arrays (FPGAs)), PLDs, reconfigurable computer fabrics (RCFs), array processors, secure microprocessors, application-specific integrated circuits (ASICs), and/or other digital processing devices. Such digital processors may be contained on a single unitary IC die, or distributed across multiple components.
As used herein, the term “network interface” refers to any signal, data, and/or software interface with a component, network, and/or process. By way of non-limiting example, a network interface may include one or more of FireWire (e.g., FW400, FW800, etc.), USB (e.g., USB2), Ethernet (e.g., 10/100, 10/100/1000 (Gigabit Ethernet), 10-Gig-E, etc.), MoCA, Coaxsys (e.g., TVnet™), radio frequency tuner (e.g., in-band or OOB, cable modem, and/or other.), Wi-Fi (802.11), WiMAX (802.16), PAN (e.g., 802.15), cellular (e.g., 3G, LTE/LTE-A/TD-LTE, GSM, etc.), IrDA families, and/or other network interfaces.
As used herein, the term “Wi-Fi” includes one or more of IEEE-Std. 802.11, variants of IEEE-Std. 802.11, standards related to IEEE-Std. 802.11 (e.g., 802.11 a/b/g/n/s/v), and/or other wireless standards.
As used herein, the term “wireless” means any wireless signal, data, communication, and/or other wireless interface. By way of non-limiting example, a wireless interface may include one or more of Wi-Fi, Bluetooth, 3G (3GPP/3GPP2), HSDPA/HSUPA, TDMA, CDMA (e.g., IS-95A, WCDMA, etc.), FHSS, DSSS, GSM, PAN/802.15, WiMAX (802.16), 802.20, narrowband/FDMA, OFDM, PCS/DCS, LTE/LTE-A/TD-LTE, analog cellular, CDPD, satellite systems, millimeter wave or microwave systems, acoustic, infrared (i.e., IrDA), and/or other wireless interfaces.
Overview and Description of Exemplary Implementations
Apparatus and methods for training and controlling of robotic devices are disclosed. In one implementation, a robot or other entity may be utilized to perform a target task characterized by e.g., a target trajectory. The target trajectory may be, e.g., a race circuit, a surveillance route, a manipulator trajectory between a bin of widgets and a conveyor, and/or other. The robot may be trained by a user, such as by using an online supervised learning approach. The user may interface to the robot via a control apparatus, configured to provide teaching signals to the robot. In one variant, the robot may comprise an adaptive controller comprising a neuron network, and configured to generate actuator control commands based on the user input and output of the learning process. During one or more learning trials, the controller may be trained to navigate a portion of the target trajectory. Individual trajectory portions may be trained during separate training trials. Some trajectory portions may be associated with the robot executing complex actions that may require more training trials and/or more dense training input compared to simpler trajectory actions. A complex trajectory portion may be characterized by e.g., a selected range of state space parameters associated with the task and/or operation by the robot.
By way of illustration and example only, a robotic controller of a race car may be trained to navigate a trajectory (e.g., a race track), comprising one or more sharp turns (e.g., greater than, or equal to, 90° in some implementations). During training, the track may be partitioned into one or more segments comprised of e.g., straightaway portions and turn portions. The controller may be trained on one or more straightaway portions during a first plurality of trials (e.g., between 1 and 10 in some implementations depending on the car characteristics, trainer experience, and target performance). During a second number of trials, the controller may be trained on one or more turn portions (e.g., a 180° turn) using a second plurality of trials. The number of trials in the second plurality of trials may be greater than number of first plurality of trials (e.g., between 10 and 1000 in some implementations), and may depend on factors such as the car characteristics, trainer experience, and/or target performance. Training may be executed in one or more training sessions, e.g., every week to improve a particular performance for a given turn.
In the exemplary context of the above race car, individual ones of the one or more turn portions may be characterized by corresponding ranges (subsets) of the state space associated with the full trajectory of navigation. The range of state parameters associated with each of the one or more turn portions may be referred as a selected subset of the state space. The added training associated with the state space subset may be referred to as selective state space sampling (SSSS). Selection of a trajectory portion for SSSS added training may be configured based on one or more state parameters associated of the robotic device navigation of the target trajectory. In one or more implementations, the selection may be based on location (a range of coordinates), velocity, acceleration, jerk, operational performance (e.g., lap time), the rate of performance change over multiple trials, and/or other parameters.
In some implementations of devices characterized by multiple controllable degrees of freedom (CDOF), the trajectory portion selection may correspond to training a subset of CDOF of the device, and operating one or more remaining CDOF based on prior training and/or pre-configured operational instructions.
An exemplary implementation of the robot may comprise an adaptive controller implemented using e.g., a neuron network. Training the adaptive controller may comprise for instance a partial set training during so-called “trials”. The user may train the adaptive controller to separately train a first actuator subset, and a second actuator subset of the robot. During a first set of trials, the control apparatus may be configured to select and operate a first subset of the robot's complement of actuators e.g., operate a shoulder joint of a manipulator arm. The adaptive controller network may be configured to generate control commands for the shoulder joint actuator based on the user input and output of the learning process. However, since a single actuator (e.g., the shoulder joint) may be inadequate for achieving a target task (e.g., reaching a target object), subsequently thereafter the adaptive controller may be trained to operate the second subset (e.g., an elbow joint) during a second set of trials. During individual trials of the second set of trials, the user may provide control input for the second actuator, while the previously trained network may provide control signaling for the first actuator (the shoulder). Subsequent to performing the second set of trials, the adaptive controller may be capable of controlling the first and the second actuators in absence of user input by e.g., combining the training of the first and second trials.
In some implementations, the arm 100 may be controlled using an adaptive controller (e.g., comprising a neuron network described below with respect to
During controller training, the supervised learning process may receive supervisory input (training) from a trainer. In one or more implementations, the trainer may comprise a computerized agent and/or a human user. In some implementations of controller training by a human user, the training input may be provided by the user via a remote control apparatus e.g., such as illustrated in
In the implementation illustrated in
In one exemplary embodiment, the one or more control signals represent a fewer number of CDOF than the robot can support. For instance, with respect to
Referring now to
In the exemplary robotic devices of
It is further appreciated that the illustrated examples are readily understood to translate the value from the actuator displacement value to a linear displacement, angular displacement, rotational displacement, and/or other. Translation may be proportional, non-proportional, linear, non-linear, and/or other. For example, in some variable translation schemes, the actuator displacement value may be “fine” over some ranges (e.g., allowing small precision manipulations), and much more “coarse” over other ranges (e.g., enabling large movements). While the present examples use an actuator displacement value, it is appreciated that e.g., velocity values may also be used. For example, an actuator velocity value may indicate the velocity of movement which may be useful for movement which is not bounded within a range per se. For example, with respect to
Those of ordinary skill will appreciate that actuator mechanisms vary widely based on application. Actuators may use hydraulic, pneumatic, electrical, mechanical, and/or other. mechanisms to generate e.g., linear force, rotational force, linear displacement, angular displacement, and/or other. Common examples include: pistons, comb drives, worm drives, motors, rack and pinion, chain drives, and/or other.
In some implementations of supervised learning by neuron networks, the training signal may comprise a supervisory signal (e.g., a spike) that triggers neuron response. Referring now to
As shown in
The multilayer network 500 of neurons is depicted within
In some instances, a network layer may provide an error feedback signal to a preceding layer. For example, as shown by arrows 530, 520 in
The exemplary network 500 may comprise a network of spiking neurons configured to communicate with one another by means of “spikes” or electrical pulses. Additionally, as used herein, the terms “pre-synaptic” and “post-synaptic” are used to describe a neuron's relation to a connection. For example, with respect to the connection 512, the units 502 and 522 are referred to as the pre-synaptic and the post-synaptic unit, respectively. It is noteworthy, that the same unit is referred to differently with respect to different connections. For instance, unit 522 is referred to as the pre-synaptic unit with respect to the connection 532, and the post-synaptic unit with respect to the connection 512. In one or more implementations of spiking networks, the error signal 520, 530 may be propagated using spikes, e.g., as described in U.S. patent application Ser. No. 14/054,366, entitled “APPARATUS AND METHODS FOR BACKWARD PROPAGATION OF ERRORS IN A SPIKING NEURON NETWORK”, filed Oct. 15, 2013, the foregoing being incorporated herein by reference in its entirety.
The input 508 may comprise data used for solving a particular control task. For example, the signal 508 may comprise a stream of raw sensor data and/or preprocessed data. Raw sensor data may include data conveying information associated with one or more of proximity, inertial, terrain imaging, and/or other information. Preprocessed data may include data conveying information associated with one or more of velocity, information extracted from accelerometers, distance to obstacle, positions, and/or other information. In some implementations, such as those involving object recognition, the signal 508 may comprise an array of pixel values in the input image, or preprocessed data. Preprocessed data may include data conveying information associated with one or more of levels of activations of Gabor filters for face recognition, contours, and/or other information. In one or more implementations, the input signal 508 may comprise a target motion trajectory. The motion trajectory may be used to predict a future state of the robot on the basis of a current state and the target state. In one or more implementations, the signal 508 in
In one or more implementations, such as object recognition and/or obstacle avoidance, the input 508 may comprise a stream of pixel values associated with one or more digital images. In one or more implementations (e.g., video, radar, sonography, x-ray, magnetic resonance imaging, and/or other types of sensing), the input may comprise electromagnetic waves (e.g., visible light, IR, UV, and/or other types of electromagnetic waves) entering an imaging sensor array. In some implementations, the imaging sensor array may comprise one or more of RGCs, a charge coupled device (CCD), an active-pixel sensor (APS), and/or other sensors. The input signal may comprise a sequence of images and/or image frames. The sequence of images and/or image frame may be received from a CCD camera via a receiver apparatus and/or downloaded from a file. The image may comprise a two-dimensional matrix of RGB values refreshed at a 25 Hz frame rate. It will be appreciated by those skilled in the arts that the above image parameters are merely exemplary, and many other image representations (e.g., bitmap, CMYK, HSV, HSL, grayscale, and/or other representations) and/or frame rates are equally useful with the present technology. Pixels and/or groups of pixels associated with objects and/or features in the input frames may be encoded using, for example, latency encoding described in commonly owned and co-pending U.S. patent application Ser. No. 12/869,583, filed Aug. 26, 2010 and entitled “INVARIANT PULSE LATENCY CODING SYSTEMS AND METHODS”; U.S. Pat. No. 8,315,305, issued Nov. 20, 2012, entitled “SYSTEMS AND METHODS FOR INVARIANT PULSE LATENCY CODING”; Ser. No. 13/152,084, filed Jun. 2, 2011, entitled “APPARATUS AND METHODS FOR PULSE-CODE INVARIANT OBJECT RECOGNITION”; and/or latency encoding comprising a temporal winner take all mechanism described U.S. patent application Ser. No. 13/757,607, filed Feb. 1, 2013 and entitled “TEMPORAL WINNER TAKES ALL SPIKING NEURON NETWORK SENSORY PROCESSING APPARATUS AND METHODS”, each of the foregoing being incorporated herein by reference in its entirety.
In one or more implementations, encoding may comprise adaptive adjustment of neuron parameters, such neuron excitability described in commonly owned and co-pending U.S. patent application Ser. No. 13/623,820 entitled “APPARATUS AND METHODS FOR ENCODING OF SENSORY DATA USING ARTIFICIAL SPIKING NEURONS”, filed Sep. 20, 2012, the foregoing being incorporated herein by reference in its entirety.
Individual connections (e.g., 512, 532) may be assigned, inter alia, a connection efficacy, which in general may refer to a magnitude and/or probability of input into a neuron affecting neuron output. The efficacy may comprise, for example a parameter (e.g., synaptic weight) used for adaptation of one or more state variables of post-synaptic units (e.g., 530). The efficacy may comprise a latency parameter by characterizing propagation delay from a pre-synaptic unit to a post-synaptic unit. In some implementations, greater efficacy may correspond to a shorter latency. In some other implementations, the efficacy may comprise probability parameter by characterizing propagation probability from pre-synaptic unit to a post-synaptic unit; and/or a parameter characterizing an impact of a pre-synaptic spike on the state of the post-synaptic unit.
Individual neurons of the network 500 may be characterized by a neuron state. The neuron state may, for example, comprise a membrane voltage of the neuron, conductance of the membrane, and/or other parameters. The learning process of the network 500 may be characterized by one or more learning parameters, which may comprise input connection efficacy, output connection efficacy, training input connection efficacy, response generating (firing) threshold, resting potential of the neuron, and/or other parameters. In one or more implementations, some learning parameters may comprise probabilities of signal transmission between the units (e.g., neurons) of the network 500.
Referring back to
During normal operation (e.g., subsequent to learning), data 508 arriving to neurons of the network may cause changes in the neuron state (e.g., increase neuron membrane potential and/or other parameters). Changes in the neuron state may cause the neuron to generate a response (e.g., output a spike). However, during normal operation, The training input 540 is absent; the input data 508 is required for the neuron to generate output.
In some implementations, one of the outputs (e.g., generated by neuron 542) may be configured to actuate the first CDOF of the robotic arm 100 (e.g., joint 102); another output (e.g., generated by neuron 542) may be configured to actuate the second CDOF of the robotic arm 100 (e.g., the joint 106).
While
The network 550 of
In sensory data processing and/or object recognition implementations, the first neuron layer (e.g., 552, 554, 556) may be referred to as non-adaptive feature extraction layer configured to respond to occurrence of one or more features/objects (e.g., edges, shapes, color, and or other) in the input 558. The second layer neurons (572, 574) generate control output 576, 570 based on one or more inputs received from the first neuron layer (e.g., 562, 564, 566) to a respective actuator (e.g., the joints 102, 106 in
The network 500 and/or 550 of
In one or more implementations, neuron operation may be configured based on one or more inhibitory connections providing input configured to delay and/or depress response generation by the neuron, as described in commonly owned and co-pending U.S. patent application Ser. No. 13/660,923, entitled “ADAPTIVE PLASTICITY APPARATUS AND METHODS FOR SPIKING NEURON NETWORK”, filed Oct. 25, 2012, the foregoing being incorporated herein by reference in its entirety. Connection efficacy updated may be effectuated using a variety of applicable methodologies such as, for example, event-based updates described in detail in commonly owned and co-pending U.S. patent application No. 13/239,255 filed Sep. 21, 2011, entitled “APPARATUS AND METHODS FOR SYNAPTIC UPDATE IN A PULSE-CODED NETWORK”, Ser. No. 13/588,774, entitled “APPARATUS AND METHODS FOR IMPLEMENTING EVENT-BASED UPDATES IN SPIKING NEURON NETWORKS”, filed Aug. 17, 2012; and Ser. No. 13/560,891 entitled “APPARATUS AND METHODS FOR EFFICIENT UPDATES IN SPIKING NEURON NETWORK”, each of the foregoing being incorporated herein by reference in its entirety.
A neuron process may comprise one or more learning rules configured to adjust neuron state and/or generate neuron output in accordance with neuron inputs. In some implementations, the one or more learning rules may comprise state dependent learning rules described, for example, in commonly owned and co-pending U.S. patent application Ser. No. 13/560,902, entitled “APPARATUS AND METHODS FOR GENERALIZED STATE-DEPENDENT LEARNING IN SPIKING NEURON NETWORKS”, filed Jul. 27, 2012 and/or U.S. patent application Ser. No. 13/722,769 filed Dec. 20, 2012, and entitled “APPARATUS AND METHODS FOR STATE-DEPENDENT LEARNING IN SPIKING NEURON NETWORKS”, each of the foregoing being incorporated herein by reference in its entirety.
In some implementations, the single-layer network 550 of
Similarly, during a second plurality of trials, the network 550 may trained to operate a second subset of the robot's available CDOF (e.g., the joint 106 in
By employing time multiplexed learning of multiple CDOF operations, learning speed and/or accuracy may be improved, compared to a combined learning approach wherein the entire complement of the robot's CDOF are being trained contemporaneously. It is noteworthy, that the two-layer network architecture (e.g., of the network 550 in
In some implementations, the multi-layer network 500 of
During a second plurality of trials, the network 500 may trained to operate a second subset of the robot's available CDOF (e.g., the joint 106 in
A robotic device may be configured to execute a target task associated with a target trajectory. A controller of the robotic device may be trained to navigate the target trajectory comprising multiple portions. Some trajectory portions may be associated with the robot executing complex actions (e.g., that may require more training trials and/or more dense training input compared to simpler trajectory actions). A complex trajectory portion may be characterized by, e.g., a selected range of state space parameters associated with the task operation by the robot. In one or more implementations, the complex action may be characterized by a high rate of change of one or more motion parameters (e.g., acceleration), higher position tolerance (e.g., tight corners, precise positioning of components during manufacturing, fragile items for grasping by a manipulator, high target performance (e.g., lap time of less than N seconds), actions engaging multiple CDOF of a manipulator arm, and/or other parameters).
The range of state parameters associated with the complex trajectory portion may be referred as a selected subset of the state space. The added training associated with the state space subset may be referred to as selective state space sampling. The selection of a trajectory portion for selective state space sampling added training may be configured based on one or more state parameters associated with the robotic device navigation of the target trajectory in the state space.
The target trajectory navigation may be characterized by a performance measure determined based on one or more state parameters. In some implementations, the selection of the trajectory portion (e.g., complex trajectory portion, and/or other.) may be determined based on an increased level of target performance. By way of illustration, consider one exemplary autonomous rover implementation: the rover performance may be determined based on a deviation of the actual rover position from a nominal or expected position (e.g., position on a road). The rover trajectory may comprise unrestricted straightaway portions and one or more portions disposed in a constricted terrain e.g., with a drop on one side and a wall on the other side. The rover target position deviation range may be reduced for the trajectory portions in the constricted environment, compared to the rover target position deviation range for the unrestricted straightaway portions.
In some implementations, the amount of time associated with traversing the complex trajectory portion may comprise less than a half the time used for traversing the whole trajectory. In one or more implementations, state space extent associated with the complex trajectory portion may comprise less than a half of the state space extent associated with the whole trajectory.
Individual trajectory portions may be trained during respective training trials. In one or more implementations, a selective CDOF methodology, such as that described herein, may be employed when training one or more portions associated with multiple CDOF operations.
A robotic platform 1010 may be configured to perform a target task comprising navigation of the target trajectory 1000. One or more portions 1002, 1004, 1012 of the trajectory 1000 in
Training of the robotic platform 1000 controller navigating the trajectory portion 1004 may be configured on one or more trials. During individual trials, the controller of the platform 1000 may receive teaching input, indicated by symbols ‘X’ in
The adaptive controller may be configured to produce control output based on the teaching input and output of the learning process. Output of the controller may comprise a combination of an adaptive predictor output and the teaching input. Various realizations of adaptive predictors may be utilized with the methodology described including, e.g. those described in U.S. patent application Ser. No. 13/842,562 entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS FOR ROBOTIC CONTROL”, filed Mar. 15, 2013, incorporated supra.
Training may be executed in one or more training sessions, e.g., every week or according to a prescribed periodicity, in an event-driven manner, aperiodically, and/or other, to improve a particular performance for a given trajectory portion. By way of illustration, subsequent to an initial group of training trials, a particularly difficult operation (e.g., associated with the portion 1004) may continue to be trained in order to improve performance, while the remaining trajectory is based on the training information determined during the initial group of training trials.
Actions associated with navigating the portion 1004 of the trajectory may be characterized by a corresponding range (subset) of the state space associated with the full trajectory 1000 navigation. In one or more implementations, the selection may be based on location (a range of coordinates), velocity, acceleration, jerk, operational performance (e.g., lap time), the rate of performance change over multiple trials, and/or other parameters.
The partial trajectory training methodology (e.g., using the selective state space sampling) may enable the trainer to focus on more difficult sections of a trajectory compared to other relatively simple trajectory portions (e.g., 1004 compared to 1002 in
In some implementations of robotic devices characterized by multiple controllable degrees of freedom (CDOF), the trajectory portion selection may correspond to training a subset of CDOF and operating one or more remaining CDOF based on prior training and/or pre-configured operational instructions.
As shown, the trajectory 1060 is characterized by portions 1066, 1070, 1068. The portion 1070 may be more difficult to train compared to the portions 1066, 1068. In one or more implementations, the training difficulty may be characterized by one or more of lower performance, longer training time, a larger number of training trials, frequency of training input, and/or variability of other parameters associated with operating the portion 1070 as compared to the portions 1066, 1068. The trajectory portions 1066, 1068, and 1070 may be characterized by state space extent 1076, 1074, and 1078, respectively. As illustrated in
The trajectory portion 1070 may correspond to execution of an action (or multiple actions) that may be more difficult to learn compared to other action. The learning difficulty may arise from one or more of the following (i) the action is more complex (e.g. a sharp turn characterized by an increased rate of change of speed, direction, and or other state parameter of a vehicle, and/or increased target precision of a manipulator), (ii) the associated with the action is difficult to identify (e.g., another portion of the trajectory may be associated with a similar context but may require a different set of motor commands), or (iii) there are multiple and contradictory ways to solve this part of the trajectory (e.g., a wider turn with faster speed, and/or a sharp turn with low speed) and the teacher is not consistent in the way he solves the problem; or a combination thereof).
In one or more implementations, the state space configuration of
The selective state space sampling may reduce training duration and/or amount of training data associated with the trajectory portions 1066, 1068. Reducing the amount of training data and/or training trials for simpler tasks (e.g., the portions 1066, 1068 in
In some implementations, methods 600, 700, 800, 900 may be implemented in one or more processing devices (e.g., a digital processor, an analog processor, a digital circuit designed to process information, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information). The one or more processing devices may include one or more devices executing some or all of the operations of methods 600, 700, 800, 900 in response to instructions stored electronically on an electronic storage medium. The one or more processing devices may include one or more devices configured through hardware, firmware, and/or software to be specifically designed for execution of one or more of the operations of methods 600, 700, 800, 900. Operations of methods 600, 700, 800, 900 may be utilized with a robotic apparatus (see e.g., the robotic arm 100 of
At operation 602 of method 600, a first actuator associated with a first CDOF operation of a robotic device is selected. In some implementations, the CDOF selection may be effectuated by issuing an instruction to the robotic control apparatus (e.g., pressing a button, issuing a voice command, an audible signal (e.g., a click), an initialization after power-on/reset sequence, a pre-defined programming sequence, and/or other.). In one or more implementations, the CDOF selection may be effectuated based on a timer event, and/or training performance reaching a target level, e.g., determined based on ability of the trainer to position of one of the joints within a range from a target position. For example, in the context of
At operation 604, the adaptive controller is trained to actuate movement in the first CDOF of the robot to accomplish a target action. In some implementations, the nature of the task is too complex to be handled with a single CDOF and thus require multiple CDOF.
Operation 604 may comprise training a neuron network (such as e.g., 500, 550 of
At operation 606, a second actuator associated with a second CDOF operation of the robotic device is selected. The CDOF selection may be effectuated by issuing an instruction to the robotic control apparatus (e.g., pressing the button 210, issuing a voice command, and/or using another communication method). For example, in the context of
At operation 608, the adaptive controller may be trained to operate the second CDOF of the robot in order to accomplish the target action. In some implementations, the operation 608 may comprise training a neuron network (such as e.g., 500, 550 of
It is appreciated that the method 600 may be used with any number of degrees of freedom, additional degrees being iteratively implemented. For example, for a device with six (6) degrees of freedom, training may be performed with six independent iterations, where individual iteration may be configured to train one (1) degree of freedom. Moreover, more complex controllers may further reduce iterations by training multiple simultaneous degrees of freedom; e.g., three (3) iterations of a controller with two (2) degrees of freedom, two (2) iterations of a controller with three (3) degrees of freedom, and/or other.
Still further it is appreciated that the robotic apparatus may support a number of degrees of freedom which is not evenly divisible by the degrees of freedom of the controller. For example, a robotic mechanism that supports five (5) degrees of freedom can be trained in two (2) iterations with a controller that supports three (3) degrees of freedom.
At operation 702 of method 700, a context is determined. In some implementations, the context may be determined based on one or more sensory input and/or feedback that may be provided by the robotic apparatus to the controller. In some implementations, the sensory aspects may include an object being detected in the input, a location of the object, an object characteristic (color/shape), a sequence of movements (e.g., a turn), a characteristic of an environment (e.g., an apparent motion of a wall and/or other surroundings turning and/or approaching) responsive to the movement. In some implementations, the sensory input may be received during one or more training trials of the robotic apparatus.
At operation 704, a first or a second actuator associated with a first or second CDOF of the robotic apparatus is selected for operation. For example, the first and the second CDOF may correspond to operation of the motorized joints 102, 106, respectively, of the manipulator arm 100 in
Responsive to selecting the first actuator of the robotic apparatus, the method may proceed to operation 706, wherein the neuron network of the adaptive controller may be operated in accordance with the learning process to generate the first CDOF control output based on the context (e.g., learn a behavior associated with the context). In some implementations, the teaching signal for the first CDOF may comprise (i) a signal provided by the user via a remote controller, (ii) a signal provided by the adaptive system for the controlled CDOF, and/or (iii) a weighted combination of the above (e.g., using constant and/or adjustable weights).
Responsive to selecting the second actuator of the robotic apparatus, the method may proceed to operation 710 wherein the neuron network of the adaptive controller is operated in accordance with the learning process configured to generate the second CDOF control output based on the context (e.g., learn a behavior associated with the context).
At operation 708, network configuration associated with the learned behavior at operation 704 and/or 710 may be stored. In one or more implementations, the network configuration may comprise efficacy of one or more connections of the network (e.g., weights) that may have been adapted during training.
At operation 822 of method 800, an actuator associated with a CDOF is selected for training. In one or more implementations, the CDOF selection may be effectuated by issuing an instruction to the robotic control apparatus (e.g., pressing a button, issuing an audible signal (e.g., a click, and/or a voice command), and/or using another communication method). In one or more implementations, the CDOF selection may be effectuated based on a timer event, and/or training performance reaching a target level. For example, upon learning to position/move one joint to a target location, the controller may automatically switch to training of another joint.
Responsive to selection of a first actuator associated with a first CDOF of the robotic apparatus, the method proceeds to operation 824, where training input for the first CDOF (CDOF1) is provided. For example, in the context of the robotic arm 100 of
At operation 828, the control output may be determined in accordance with the learning process and context. In some implementations, the context may comprise the input into the adaptive controller e.g., as described above with respect to operation 702 of method 700.
The control output determined at operation 828 may comprise the first CDOF control instructions 830 and/or the second CDOF control instructions 844. The learning process may be implemented using an iterative approach wherein control of one CDOF may be learned partly before switching to learning another CDOF. Such back and forth switching may be employed until the target performance is attained.
Referring now to operation 826, the control CDOF 1 output 830 may be combined with the first CDOF training input provided at operation 824. The combination of operation 826 may be configured based on a transfer function. In one or more implementations, the transfer function may comprise addition, union, a logical ‘AND’ operation, and/or other operations e.g., as described in commonly owned and co-pending U.S. patent application Ser. No. 13/842,530 entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS”, filed Mar. 15, 2013, incorporated supra.
At operation 832, the first actuator associated with the first CDOF (CDOF1) of the robotic device is operated in accordance with the control output determined at operation 826. Within the context of the robotic arm 100 of
Responsive to selection of a second actuator associated with a second CDOF of the robotic apparatus, the method proceeds to operation 840, where training input for the second CDOF (CDOF2) is provided. For example, in the context of the robotic arm 100 of
Referring now to operation 842, the control CDOF 2 output 844 may be combined with the second CDOF training input provided at operation 840. The combination of operation 842 may be configured based on a transfer function. In one or more implementations, the transfer function may comprise addition, union, a logical ‘AND’ operation, and/or other operations e.g., as described in commonly owned and co-pending U.S. patent application Ser. No. 13/842,530 entitled “ADAPTIVE PREDICTOR APPARATUS AND METHODS”, filed Mar. 15, 2013, incorporated supra.
At operation 846, the second actuator associated with the second CDOF (CDOF2) of the robotic device is operated in accordance with the control output determined at operation 842. Within the context of the robotic arm 100 of
At operation 902 of method 900 illustrated in
At operation 904 a determination may be made as to whether a teaching input may be expedient for navigating the trajectory portion selected at operation 902. In some implementations exemplary embodiment, the determination of expediency is based on complexity of the task (e.g., required precision, speed of operation, desired success rate, minimum failure rate, and/or other.)
Responsive to a determination at operation 904 that the teaching input is not expedient (and will not be provided), the method may proceed to operation 910 wherein the trajectory portion determined at operation 902 may be navigated based on a previously learned controller configuration. In one or more implementations of a controller comprising a neuron network, the previously learned controller configuration may comprise an array of connection efficacies (e.g., 578 in
Responsive to a determination at operation 904 that teaching input may be expedient, the method may proceed to operation 906, wherein training input may be determined. In some implementations of multiple controllable CDOF robots (e.g., the arm 100 in
At operation 908 the trajectory portion may be navigated based on a previously learned controller configuration and the teaching input determined at operation 906. In some implementations, the trajectory portion may be navigation may be effectuated over one or more training trials configured in accordance with an online supervised learning methodology, e.g., such as described in co-owned U.S. patent application Ser. No. 14/070,114 entitled “APPARATUS AND METHODS FOR ONLINE TRAINING OF ROBOTS”, filed Nov. 1, 2013, incorporated supra. During individual trials, the controller may be provided with the supervisor input (e.g., the input 1008, 1028 in
At operation 912 a determination may be made as to whether the target task has been accomplished. In one or more implementations, task completion may be based on an evaluation of a performance measure associated with the learning process of the controller. Responsive to a determination at operation that the target task is has not been completed the method may proceed to operation 902, wherein additional trajectory portion(s) may be determined.
Various exemplary computerized apparatus configured to implement learning methodology set forth herein are now described with respect to
A computerized neuromorphic processing system, consistent with one or more implementations, for use with an adaptive robotic controller described, supra, is illustrated in
The system 1100 further may comprise a random access memory (RAM) 1108, configured to store neuronal states and connection parameters and to facilitate synaptic updates. In some implementations, synaptic updates may be performed according to the description provided in, for example, in commonly owned and co-pending U.S. patent application Ser. No. 13/239,255 filed Sep. 21, 2011, entitled “APPARATUS AND METHODS FOR SYNAPTIC UPDATE IN A PULSE-CODED NETWORK”, incorporated by reference, supra.
In some implementations, the memory 1108 may be coupled to the processor 1102 via a direct connection 1116 (e.g., memory bus). The memory 1108 may also be coupled to the processor 1102 via a high-speed processor bus 1112.
The system 1100 may comprise a nonvolatile storage device 1106. The nonvolatile storage device 1106 may comprise, inter alia, computer readable instructions configured to implement various aspects of spiking neuronal network operation. Examples of various aspects of spiking neuronal network operation may include one or more of sensory input encoding, connection plasticity, operation model of neurons, learning rule evaluation, other operations, and/or other aspects. In one or more implementations, the nonvolatile storage 1106 may be used to store state information of the neurons and connections for later use and loading previously stored network configuration. The nonvolatile storage 1106 may be used to store state information of the neurons and connections when, for example, saving and/or loading network state snapshot, implementing context switching, saving current network configuration, and/or performing other operations. The current network configuration may include one or more of connection weights, update rules, neuronal states, learning rules, and/or other parameters.
In some implementations, the computerized apparatus 1100 may be coupled to one or more of an external processing device, a storage device, an input device, and/or other devices via an I/O interface 1120. The I/O interface 1120 may include one or more of a computer I/O bus (PCI-E), wired (e.g., Ethernet) or wireless (e.g., Wi-Fi) network connection, and/or other I/O interfaces.
In some implementations, the input/output (I/O) interface may comprise a speech input (e.g., a microphone) and a speech recognition module configured to receive and recognize user commands.
It will be appreciated by those skilled in the arts that various processing devices may be used with computerized system 1100, including but not limited to, a single core/multicore CPU, DSP, FPGA, GPU, ASIC, combinations thereof, and/or other processing entities (e.g., computing clusters and/or cloud computing services). Various user input/output interfaces may be similarly applicable to implementations of the disclosure including, for example, an LCD/LED monitor, touch-screen input and display device, speech input device, stylus, light pen, trackball, and/or other devices.
Referring now to
The micro-blocks 1140 may be interconnected with one another using connections 1138 and routers 1136. As it is appreciated by those skilled in the arts, the connection layout in
The neuromorphic apparatus 1130 may be configured to receive input (e.g., visual input) via the interface 1142. In one or more implementations, applicable for example to interfacing with computerized spiking retina, or image array, the apparatus 1130 may provide feedback information via the interface 1142 to facilitate encoding of the input signal.
The neuromorphic apparatus 1130 may be configured to provide output via the interface 1144. Examples of such output may include one or more of an indication of recognized object or a feature, a motor command (e.g., to zoom/pan the image array), and/or other outputs.
The apparatus 1130, in one or more implementations, may interface to external fast response memory (e.g., RAM) via high bandwidth memory interface 1148, thereby enabling storage of intermediate network operational parameters. Examples of intermediate network operational parameters may include one or more of spike timing, neuron state, and/or other parameters. The apparatus 1130 may interface to external memory via lower bandwidth memory interface 1146 to facilitate one or more of program loading, operational mode changes, retargeting, and/or other operations. Network node and connection information for a current task may be saved for future use and flushed. Previously stored network configuration may be loaded in place of the network node and connection information for the current task, as described for example in commonly owned and co-pending U.S. patent application Ser. No. 13/487,576 entitled “DYNAMICALLY RECONFIGURABLE STOCHASTIC LEARNING APPARATUS AND METHODS”, filed Jun. 4, 2012, incorporated herein by reference in its entirety. External memory may include one or more of a Flash drive, a magnetic drive, and/or other external memory.
Different cell levels (e.g., L1, L2, L3) of the apparatus 1150 may be configured to perform functionality various levels of complexity. In some implementations, individual L1 cells may process in parallel different portions of the visual input (e.g., encode individual pixel blocks, and/or encode motion signal), with the L2, L3 cells performing progressively higher level functionality (e.g., object detection). Individual ones of L2, L3, cells may perform different aspects of operating a robot with one or more L2/L3 cells processing visual data from a camera, and other L2/L3 cells operating motor control block for implementing lens motion what tracking an object or performing lens stabilization functions.
The neuromorphic apparatus 1150 may receive input (e.g., visual input) via the interface 1160. In one or more implementations, applicable for example to interfacing with computerized spiking retina, or image array, the apparatus 1150 may provide feedback information via the interface 1160 to facilitate encoding of the input signal.
The neuromorphic apparatus 1150 may provide output via the interface 1170. The output may include one or more of an indication of recognized object or a feature, a motor command, a command to zoom/pan the image array, and/or other outputs. In some implementations, the apparatus 1150 may perform all of the I/O functionality using single I/O block (not shown).
The apparatus 1150, in one or more implementations, may interface to external fast response memory (e.g., RAM) via a high bandwidth memory interface (not shown), thereby enabling storage of intermediate network operational parameters (e.g., spike timing, neuron state, and/or other parameters). In one or more implementations, the apparatus 1150 may interface to external memory via a lower bandwidth memory interface (not shown) to facilitate program loading, operational mode changes, retargeting, and/or other operations. Network node and connection information for a current task may be saved for future use and flushed. Previously stored network configuration may be loaded in place of the network node and connection information for the current task, as described for example in commonly owned and co-pending U.S. patent application Ser. No. 13/487,576, entitled “DYNAMICALLY RECONFIGURABLE STOCHASTIC LEARNING APPARATUS AND METHODS”, incorporated, supra.
In one or more implementations, one or more portions of the apparatus 1150 may be configured to operate one or more learning rules, as described for example in commonly owned and co-pending U.S. patent application Ser. No. 13/487,576 entitled “DYNAMICALLY RECONFIGURABLE STOCHASTIC LEARNING APPARATUS AND METHODS”, filed Jun. 4, 2012, incorporated herein by reference in its entirety. In one such implementation, one block (e.g., the L3 block 1156) may be used to process input received via the interface 1160 and to provide a teaching signal to another block (e.g., the L2 block 1156) via interval interconnects 1166, 1168.
The partial trajectory training methodology (e.g., using the selective state space sampling) described herein may enable a trainer to focus on portions of particular interest or value e.g., more difficult trajectory portions as compared to other trajectory portions (e.g., 1004 compared to 1002 in
In some implementations, a user may elect to re-train and/or to provide additional training to a previously trained controller configuration for a given target trajectory. The additional training may be focused on a subset of the trajectory (e.g., one or more complex actions) so that to reduce training time and/or reduce over-fitting errors for trajectory portions comprising less complex actions.
In some implementations, the trajectory portion (e.g., the subset characterized by complex actions) may be associated with an extent of the state space. Based on the training of a controller to navigate the portion, the state space extent may be reduced and autonomy of the robotic device may be increased. In some implementations, the training may enable full autonomy so as to enable the robot to traverse the trajectory in absence of teaching input.
The selective state space sampling methodology may be combined with online training approaches, e.g., such as described in co-owned U.S. patent application Ser. No. 14/070,114 entitled “APPARATUS AND METHODS FOR ONLINE TRAINING OF ROBOTS”, filed Nov. 1, 2013, incorporated supra. During some implementations of online training of a robot to perform a task, a trainer may determine one or portions of the task trajectory wherein the controller may exhibit difficulty of controlling the robot. In one or more implementations, the robot may detect an ‘unknown state’ (e.g., previously not encountered). The robot may be configured to request assistance (e.g., teaching input) from one or more teachers (e.g., humans, supervisory processes or entities, algorithms, etc.). In accord with the selective state space sampling methodology, the trainer may elect to train the controller on the one or more challenging trajectory portions online thereby reducing and/or eliminating delays that may be associated with offline training approaches of the prior art that may rely on recording/replaying/review of training results in order to evaluate quality of training.
One or more of the methodologies comprising partial degree of freedom learning and/or use of reduced CDOF robotic controller described herein may facilitate training and/or operation of robotic devices. In some implementations, a user interface may be configured to operate a subset of robot's CDOF (e.g., one joint of a two joint robotic manipulator arm). The methodologies of the present disclosure may enable a user to train complex robotic devices (e.g., comprising multiple CDOF) using the reduced CDOF control interface. During initial training of a given CDOF subset, the user may focus on achieving target performance (e.g., placing the manipulator joint at a target orientation) without being burdened by control of the whole robotic device. During subsequent training trials for another CDOF subset, operation of the robot by the user (e.g., the joints 106) may be augmented by the controller output for the already trained CDOF (e.g., the joint 102 in
In some implementations, the training methodologies described herein may reduce cognitive load on a human trainer, e.g., by enabling the trainer to control a subset of DOF at a given trial, and alleviating the need to coordinate control signals for all DOF.
Dexterity constraints placed on the user may be reduced, when controlling fewer degrees of freedom (e.g., the user may use a single hand to train one DOF at a time of a six DOF robot).
The selective state space sampling methodology described herein may reduce training time compared to the prior art as only the DOF and/or trajectory portions that require improvement in performance may be trained. As training progresses, trainer involvement may be reduced over time. In some implementations, the trainer may provide corrections to DOF that need to improve performance, switching from one to the other as needed.
The selective state space sampling methodology described herein may enable development of robotic autonomy. Based on learning to navigate one or more portions of the task trajectory and/or operate one or more CDOF, the robot may gradually gain autonomy (e.g., perform actions in based on the learned behaviors and in absence of supervision by a trainer or other entity).
Dexterity requirements placed on a trainer and/or trainer may be simplified as the user may utilize, e.g., a single to train and/or control a complex (e.g., with multiple CDOF) robotic body. Using the partial degree of freedom (cascade) training methodology of the disclosure, may enable use of a simpler (e.g., a single DOF) control interface configured, e.g., to control a single CDOF to control a complex robotic apparatus comprising multiple CDOF.
Partial degree of freedom training and/or selective state space sampling training may enable the trainer to focus on a subset of DOF that may be more difficult to train, compared to other DOF. Such approach may reduce training time for the adaptive control system as addition as additional training time may be dedicated to the difficult to train DOF portion without retraining (and potentially confusing) a better behaving DOF portion.
It will be recognized that while certain aspects of the disclosure are described in terms of a specific sequence of steps of a method, these descriptions are only illustrative of the broader methods of the disclosure, and may be modified as required by the particular application. Certain steps may be rendered unnecessary or optional under certain circumstances. Additionally, certain steps or functionality may be added to the disclosed implementations, or the order of performance of two or more steps permuted. All such variations are considered to be encompassed within the disclosure disclosed and claimed herein.
While the above detailed description has shown, described, and pointed out novel features of the disclosure as applied to various implementations, it will be understood that various omissions, substitutions, and changes in the form and details of the device or process illustrated may be made by those skilled in the art without departing from the disclosure. This description is in no way meant to be limiting, but rather should be taken as illustrative of the general principles of the technology. The scope of the disclosure should be determined with reference to the claims.
Number | Name | Date | Kind |
---|---|---|---|
3920972 | Corwin, Jr. et al. | Nov 1975 | A |
4468617 | Ringwall | Aug 1984 | A |
4617502 | Sakaue et al. | Oct 1986 | A |
4638445 | Mattaboni | Jan 1987 | A |
4706204 | Hattori | Nov 1987 | A |
4763276 | Perreirra et al. | Aug 1988 | A |
4852018 | Grossberg | Jul 1989 | A |
5063603 | Burt | Nov 1991 | A |
5092343 | Spitzer | Mar 1992 | A |
5121497 | Kerr et al. | Jun 1992 | A |
5245672 | Wilson | Sep 1993 | A |
5303384 | Rodriguez et al. | Apr 1994 | A |
5355435 | DeYong | Oct 1994 | A |
5388186 | Bose | Feb 1995 | A |
5408588 | Ulug | Apr 1995 | A |
5467428 | Ulug | Nov 1995 | A |
5579440 | Brown | Nov 1996 | A |
5602761 | Spoerre et al. | Feb 1997 | A |
5612883 | Shaffer et al. | Mar 1997 | A |
5638359 | Peltola | Jun 1997 | A |
5673367 | Buckley | Sep 1997 | A |
5687294 | Jeong | Nov 1997 | A |
5719480 | Bock | Feb 1998 | A |
5739811 | Rosenberg et al. | Apr 1998 | A |
5841959 | Guiremand | Nov 1998 | A |
5875108 | Hoffberg | Feb 1999 | A |
5994864 | Inoue et al. | Nov 1999 | A |
6009418 | Cooper | Dec 1999 | A |
6014653 | Thaler | Jan 2000 | A |
6169981 | Werbos | Jan 2001 | B1 |
6218802 | Onoue et al. | Apr 2001 | B1 |
6243622 | Yim et al. | Jun 2001 | B1 |
6259988 | Galkowski et al. | Jul 2001 | B1 |
6272479 | Farry et al. | Aug 2001 | B1 |
6363369 | Liaw | Mar 2002 | B1 |
6366293 | Hamilton | Apr 2002 | B1 |
6442451 | Lapham | Aug 2002 | B1 |
6458157 | Suaning | Oct 2002 | B1 |
6489741 | Genov | Dec 2002 | B1 |
6493686 | Francone et al. | Dec 2002 | B1 |
6545705 | Sigel | Apr 2003 | B1 |
6545708 | Tamayama | Apr 2003 | B1 |
6546291 | Merfeld | Apr 2003 | B2 |
6581046 | Ahissar | Jun 2003 | B1 |
6601049 | Cooper | Jul 2003 | B1 |
6636781 | Shen | Oct 2003 | B1 |
6643627 | Liaw | Nov 2003 | B2 |
6697711 | Yokono | Feb 2004 | B2 |
6703550 | Chu | Mar 2004 | B2 |
6760645 | Kaplan et al. | Jul 2004 | B2 |
6961060 | Mochizuki et al. | Nov 2005 | B1 |
7002585 | Watanabe | Feb 2006 | B1 |
7024276 | Ito | Apr 2006 | B2 |
7243334 | Berger et al. | Jul 2007 | B1 |
7324870 | Lee | Jan 2008 | B2 |
7342589 | Miserocchi | Mar 2008 | B2 |
7395251 | Linsker | Jul 2008 | B2 |
7398259 | Nugent | Jul 2008 | B2 |
7426501 | Nugent | Sep 2008 | B2 |
7668605 | Braun | Feb 2010 | B2 |
7672920 | Ito | Mar 2010 | B2 |
7752544 | Cheng | Jul 2010 | B2 |
7849030 | Ellingsworth | Dec 2010 | B2 |
8015130 | Matsugu | Sep 2011 | B2 |
8145355 | Danko | Mar 2012 | B2 |
8214062 | Eguchi et al. | Jul 2012 | B2 |
8271134 | Kato et al. | Sep 2012 | B2 |
8315305 | Petre et al. | Nov 2012 | B2 |
8364314 | Abdallah et al. | Jan 2013 | B2 |
8380652 | Francis, Jr. | Feb 2013 | B1 |
8419804 | Herr et al. | Apr 2013 | B2 |
8452448 | Pack et al. | May 2013 | B2 |
8467623 | Izhikevich | Jun 2013 | B2 |
8509951 | Gienger | Aug 2013 | B2 |
8571706 | Zhang et al. | Oct 2013 | B2 |
8639644 | Hickman et al. | Jan 2014 | B1 |
8655815 | Palmer et al. | Feb 2014 | B2 |
8751042 | Lee | Jun 2014 | B2 |
8793205 | Fisher | Jul 2014 | B1 |
8924021 | Dariush et al. | Dec 2014 | B2 |
8958912 | Blumberg et al. | Feb 2015 | B2 |
8972315 | Szatmary et al. | Mar 2015 | B2 |
8990133 | Ponulak et al. | Mar 2015 | B1 |
9008840 | Ponulak et al. | Apr 2015 | B1 |
9015092 | Sinyavskiy et al. | Apr 2015 | B2 |
9015093 | Commons | Apr 2015 | B1 |
9047568 | Fisher et al. | Jun 2015 | B1 |
9056396 | Linnell | Jun 2015 | B1 |
9070039 | Richert | Jun 2015 | B2 |
9082079 | Coenen | Jul 2015 | B1 |
9104186 | Sinyavskiy et al. | Aug 2015 | B2 |
9144907 | Summer et al. | Sep 2015 | B2 |
9177245 | Richert et al. | Nov 2015 | B2 |
9186793 | Meier | Nov 2015 | B1 |
9189730 | Coenen et al. | Nov 2015 | B1 |
9193075 | Cipollini et al. | Nov 2015 | B1 |
9195934 | Hunt et al. | Nov 2015 | B1 |
9213937 | Ponulak | Dec 2015 | B2 |
9242372 | Laurent et al. | Jan 2016 | B2 |
20010045809 | Mukai | Nov 2001 | A1 |
20020038294 | Matsugu | Mar 2002 | A1 |
20020103576 | Takamura et al. | Aug 2002 | A1 |
20020158599 | Fujita et al. | Oct 2002 | A1 |
20020169733 | Peters | Nov 2002 | A1 |
20020175894 | Grillo | Nov 2002 | A1 |
20020198854 | Berenji et al. | Dec 2002 | A1 |
20030023347 | Konno | Jan 2003 | A1 |
20030050903 | Li Aw | Mar 2003 | A1 |
20030108415 | Hosek et al. | Jun 2003 | A1 |
20030144764 | Yokono et al. | Jul 2003 | A1 |
20030220714 | Nakamura et al. | Nov 2003 | A1 |
20040030449 | Solomon | Feb 2004 | A1 |
20040036437 | Ito | Feb 2004 | A1 |
20040051493 | Furuta | Mar 2004 | A1 |
20040128028 | Miyamoto et al. | Jul 2004 | A1 |
20040131998 | Marom et al. | Jul 2004 | A1 |
20040136439 | Dewberry | Jul 2004 | A1 |
20040158358 | Anezaki et al. | Aug 2004 | A1 |
20040162638 | Solomon | Aug 2004 | A1 |
20040167641 | Kawai et al. | Aug 2004 | A1 |
20040172168 | Watanabe et al. | Sep 2004 | A1 |
20040193670 | Langan | Sep 2004 | A1 |
20040267404 | Danko | Dec 2004 | A1 |
20050004710 | Shimomura | Jan 2005 | A1 |
20050008227 | Duan et al. | Jan 2005 | A1 |
20050015351 | Nugent | Jan 2005 | A1 |
20050036649 | Yokono et al. | Feb 2005 | A1 |
20050049749 | Watanabe et al. | Mar 2005 | A1 |
20050065651 | Ayers et al. | Mar 2005 | A1 |
20050069207 | Zakrzewski et al. | Mar 2005 | A1 |
20050113973 | Endo et al. | May 2005 | A1 |
20050119791 | Nagashima | Jun 2005 | A1 |
20050125099 | Mikami et al. | Jun 2005 | A1 |
20050283450 | Matsugu | Dec 2005 | A1 |
20060069448 | Yasui | Mar 2006 | A1 |
20060082340 | Watanabe et al. | Apr 2006 | A1 |
20060094001 | Torre | May 2006 | A1 |
20060129277 | Wu et al. | Jun 2006 | A1 |
20060129506 | Edelman et al. | Jun 2006 | A1 |
20060149489 | Joublin et al. | Jul 2006 | A1 |
20060161218 | Danilov | Jul 2006 | A1 |
20060161300 | Gonzalez-Banos et al. | Jul 2006 | A1 |
20060167530 | Flaherty et al. | Jul 2006 | A1 |
20060181236 | Brogardh | Aug 2006 | A1 |
20060189900 | Flaherty et al. | Aug 2006 | A1 |
20060207419 | Okazaki et al. | Sep 2006 | A1 |
20060250101 | Khatib et al. | Nov 2006 | A1 |
20070022068 | Linsker | Jan 2007 | A1 |
20070074177 | Kurita et al. | Mar 2007 | A1 |
20070100780 | Fleischer et al. | May 2007 | A1 |
20070112700 | Den et al. | May 2007 | A1 |
20070151389 | Prisco et al. | Jul 2007 | A1 |
20070176643 | Nugent | Aug 2007 | A1 |
20070200525 | Kanaoka | Aug 2007 | A1 |
20070208678 | Matsugu | Sep 2007 | A1 |
20070250464 | Hamilton | Oct 2007 | A1 |
20070255454 | Dariush et al. | Nov 2007 | A1 |
20070260356 | Kock | Nov 2007 | A1 |
20080024345 | Watson | Jan 2008 | A1 |
20080040040 | Goto et al. | Feb 2008 | A1 |
20080097644 | Kaznov | Apr 2008 | A1 |
20080100482 | Lazar | May 2008 | A1 |
20080112596 | Rhoads et al. | May 2008 | A1 |
20080133052 | Jones | Jun 2008 | A1 |
20080140257 | Sato et al. | Jun 2008 | A1 |
20080154428 | Nagatsuka | Jun 2008 | A1 |
20080162391 | Izhikevich | Jul 2008 | A1 |
20080294074 | Tong et al. | Nov 2008 | A1 |
20080319929 | Kaplan et al. | Dec 2008 | A1 |
20090037033 | Phillips et al. | Feb 2009 | A1 |
20090043722 | Nugent | Feb 2009 | A1 |
20090069943 | Akashi et al. | Mar 2009 | A1 |
20090105786 | Fetz et al. | Apr 2009 | A1 |
20090231359 | Bass, II et al. | Sep 2009 | A1 |
20090234501 | Ishizaki | Sep 2009 | A1 |
20090265036 | Jamieson et al. | Oct 2009 | A1 |
20090272585 | Nagasaka | Nov 2009 | A1 |
20090287624 | Rouat | Nov 2009 | A1 |
20090299751 | Jung | Dec 2009 | A1 |
20090312817 | Hogle et al. | Dec 2009 | A1 |
20100036457 | Sarpeshkar | Feb 2010 | A1 |
20100081958 | She | Apr 2010 | A1 |
20100086171 | Lapstun | Apr 2010 | A1 |
20100152896 | Komatsu et al. | Jun 2010 | A1 |
20100152899 | Chang et al. | Jun 2010 | A1 |
20100166320 | Paquier | Jul 2010 | A1 |
20100169098 | Patch | Jul 2010 | A1 |
20100198765 | Fiorillo | Aug 2010 | A1 |
20100222924 | Gienger | Sep 2010 | A1 |
20100225824 | Lazar | Sep 2010 | A1 |
20100228264 | Robinson et al. | Sep 2010 | A1 |
20100286824 | Solomon | Nov 2010 | A1 |
20100292835 | Sugiura et al. | Nov 2010 | A1 |
20100299101 | Shimada | Nov 2010 | A1 |
20100305758 | Nishi et al. | Dec 2010 | A1 |
20100312730 | Weng et al. | Dec 2010 | A1 |
20110010006 | Tani et al. | Jan 2011 | A1 |
20110016071 | Guillen | Jan 2011 | A1 |
20110026770 | Brookshire | Feb 2011 | A1 |
20110035052 | Mclurkin | Feb 2011 | A1 |
20110035188 | Martinez-Heras et al. | Feb 2011 | A1 |
20110040405 | Lim et al. | Feb 2011 | A1 |
20110060460 | Oga et al. | Mar 2011 | A1 |
20110060461 | Velliste et al. | Mar 2011 | A1 |
20110067479 | Davis et al. | Mar 2011 | A1 |
20110071676 | Sanders et al. | Mar 2011 | A1 |
20110107270 | Wang et al. | May 2011 | A1 |
20110110006 | Meyer et al. | May 2011 | A1 |
20110119214 | Breitwisch | May 2011 | A1 |
20110119215 | Elmegreen | May 2011 | A1 |
20110144802 | Jang | Jun 2011 | A1 |
20110158476 | Fahn et al. | Jun 2011 | A1 |
20110160741 | Asano et al. | Jun 2011 | A1 |
20110160906 | Orita et al. | Jun 2011 | A1 |
20110160907 | Orita | Jun 2011 | A1 |
20110196199 | Donhowe | Aug 2011 | A1 |
20110208350 | Eliuk et al. | Aug 2011 | A1 |
20110218676 | Okazaki | Sep 2011 | A1 |
20110231016 | Goulding | Sep 2011 | A1 |
20110244919 | Aller et al. | Oct 2011 | A1 |
20110282169 | Grudic et al. | Nov 2011 | A1 |
20110296944 | Carter | Dec 2011 | A1 |
20110319714 | Roelle et al. | Dec 2011 | A1 |
20120008838 | Guyon et al. | Jan 2012 | A1 |
20120011090 | Tang | Jan 2012 | A1 |
20120011093 | Aparin et al. | Jan 2012 | A1 |
20120017232 | Hoffberg et al. | Jan 2012 | A1 |
20120036099 | Venkatraman et al. | Feb 2012 | A1 |
20120045068 | Kim et al. | Feb 2012 | A1 |
20120053728 | Theodorus | Mar 2012 | A1 |
20120071752 | Sewell et al. | Mar 2012 | A1 |
20120079670 | Yoon et al. | Apr 2012 | A1 |
20120109866 | Modha | May 2012 | A1 |
20120143495 | Dantu | Jun 2012 | A1 |
20120144242 | Vichare et al. | Jun 2012 | A1 |
20120150777 | Setoguchi et al. | Jun 2012 | A1 |
20120150781 | Arthur | Jun 2012 | A1 |
20120173021 | Tsusaka | Jul 2012 | A1 |
20120185092 | Ku | Jul 2012 | A1 |
20120197439 | Wang | Aug 2012 | A1 |
20120209428 | Mizutani | Aug 2012 | A1 |
20120209432 | Fleischer | Aug 2012 | A1 |
20120221147 | Goldberg et al. | Aug 2012 | A1 |
20120296471 | Inaba et al. | Nov 2012 | A1 |
20120303091 | Izhikevich | Nov 2012 | A1 |
20120303160 | Ziegler et al. | Nov 2012 | A1 |
20120308076 | Piekniewski | Dec 2012 | A1 |
20120308136 | Izhikevich | Dec 2012 | A1 |
20130000480 | Komatsu et al. | Jan 2013 | A1 |
20130006468 | Koehrsen et al. | Jan 2013 | A1 |
20130019325 | Deisseroth et al. | Jan 2013 | A1 |
20130066468 | Choi et al. | Mar 2013 | A1 |
20130073080 | Ponulak | Mar 2013 | A1 |
20130073484 | Izhikevich | Mar 2013 | A1 |
20130073491 | Izhikevich | Mar 2013 | A1 |
20130073492 | Izhikevich | Mar 2013 | A1 |
20130073493 | Modha | Mar 2013 | A1 |
20130073495 | Izhikevich | Mar 2013 | A1 |
20130073496 | Szatmary | Mar 2013 | A1 |
20130073498 | Izhikevich | Mar 2013 | A1 |
20130073499 | Izhikevich | Mar 2013 | A1 |
20130073500 | Szatmary | Mar 2013 | A1 |
20130096719 | Sanders | Apr 2013 | A1 |
20130116827 | Inazumi | May 2013 | A1 |
20130151442 | Suh et al. | Jun 2013 | A1 |
20130151448 | Ponulak | Jun 2013 | A1 |
20130151449 | Ponulak | Jun 2013 | A1 |
20130151450 | Ponulak | Jun 2013 | A1 |
20130172906 | Olson et al. | Jul 2013 | A1 |
20130173060 | Yoo et al. | Jul 2013 | A1 |
20130206170 | Svendsen et al. | Aug 2013 | A1 |
20130218339 | Maisonnier et al. | Aug 2013 | A1 |
20130218821 | Szatmary | Aug 2013 | A1 |
20130245829 | Ohta et al. | Sep 2013 | A1 |
20130251278 | Izhikevich | Sep 2013 | A1 |
20130274924 | Chung et al. | Oct 2013 | A1 |
20130297541 | Piekniewski et al. | Nov 2013 | A1 |
20130297542 | Piekniewski | Nov 2013 | A1 |
20130310979 | Herr et al. | Nov 2013 | A1 |
20130325244 | Wang | Dec 2013 | A1 |
20130325766 | Petre et al. | Dec 2013 | A1 |
20130325768 | Sinyavskiy | Dec 2013 | A1 |
20130325773 | Sinyavskiy | Dec 2013 | A1 |
20130325774 | Sinyavskiy et al. | Dec 2013 | A1 |
20130325775 | Sinyavskiy | Dec 2013 | A1 |
20130325776 | Ponulak | Dec 2013 | A1 |
20130325777 | Petre | Dec 2013 | A1 |
20130345718 | Crawford et al. | Dec 2013 | A1 |
20130346347 | Patterson et al. | Dec 2013 | A1 |
20140012788 | Piekniewski | Jan 2014 | A1 |
20140016858 | Richert | Jan 2014 | A1 |
20140025613 | Ponulak | Jan 2014 | A1 |
20140027718 | Zhao | Jan 2014 | A1 |
20140032458 | Sinyavskiy et al. | Jan 2014 | A1 |
20140032459 | Sinyavskiy et al. | Jan 2014 | A1 |
20140052679 | Sinyavskiy et al. | Feb 2014 | A1 |
20140081895 | Coenen | Mar 2014 | A1 |
20140089232 | Buibas | Mar 2014 | A1 |
20140114479 | Okazaki | Apr 2014 | A1 |
20140122397 | Richert et al. | May 2014 | A1 |
20140122398 | Richert | May 2014 | A1 |
20140156574 | Piekniewski et al. | Jun 2014 | A1 |
20140163729 | Shi et al. | Jun 2014 | A1 |
20140187519 | Cooke et al. | Jul 2014 | A1 |
20140193066 | Richert | Jul 2014 | A1 |
20140222739 | Ponulak | Aug 2014 | A1 |
20140229411 | Richert et al. | Aug 2014 | A1 |
20140244557 | Piekniewski et al. | Aug 2014 | A1 |
20140277718 | Izhikevich et al. | Sep 2014 | A1 |
20140277744 | Coenen | Sep 2014 | A1 |
20140309659 | Roh et al. | Oct 2014 | A1 |
20140350723 | Prieto et al. | Nov 2014 | A1 |
20140358284 | Laurent et al. | Dec 2014 | A1 |
20140358828 | Phillipps et al. | Dec 2014 | A1 |
20140369558 | Holz | Dec 2014 | A1 |
20140371907 | Passot et al. | Dec 2014 | A1 |
20140371912 | Passot et al. | Dec 2014 | A1 |
20150032258 | Passot et al. | Jan 2015 | A1 |
20150094850 | Passot et al. | Apr 2015 | A1 |
20150094852 | Laurent et al. | Apr 2015 | A1 |
20150120128 | Rosenstein et al. | Apr 2015 | A1 |
20150127149 | Sinyavskiy et al. | May 2015 | A1 |
20150127154 | Passot et al. | May 2015 | A1 |
20150127155 | Passot et al. | May 2015 | A1 |
20150148956 | Negishi | May 2015 | A1 |
20150185027 | Kikkeri et al. | Jul 2015 | A1 |
20150204559 | Hoffberg et al. | Jul 2015 | A1 |
20150283701 | Izhikevich et al. | Oct 2015 | A1 |
20150283702 | Izhikevich et al. | Oct 2015 | A1 |
20150283703 | Izhikevich et al. | Oct 2015 | A1 |
20150306761 | O'Connor et al. | Oct 2015 | A1 |
20150317357 | Harmsen et al. | Nov 2015 | A1 |
20150338204 | Richert et al. | Nov 2015 | A1 |
20150339589 | Fisher | Nov 2015 | A1 |
20150339826 | Buibas et al. | Nov 2015 | A1 |
20150341633 | Richert | Nov 2015 | A1 |
20160004923 | Piekniewski et al. | Jan 2016 | A1 |
20160014426 | Richert | Jan 2016 | A1 |
Number | Date | Country |
---|---|---|
102226740 | Oct 2011 | CN |
2384863 | Nov 2011 | EP |
4087423 | Mar 1992 | JP |
2003175480 | Jun 2003 | JP |
2108612 | Oct 1998 | RU |
WO 2011039542 | Apr 2007 | WO |
2008083335 | Jul 2008 | WO |
2010136961 | Dec 2010 | WO |
WO-2012151585 | Nov 2012 | WO |
Entry |
---|
Abbott et al. (2000), “Synaptic plasticity: taming the beast”, Nature Neuroscience, 3, 1178-1183. |
Alvarez, ‘Review of approximation techniques’, PhD thesis, chapter 2, pp. 7-14, University of Bradford, 2000. |
Asensio et al., “Robot Learning Control Based on Neural Network Prediction” ASME 8th Annual Dynamic Systems and Control Conference joint with the JSME 11th Motion and Vibration Conference 2012 [Retrieved on: Jun. 24, 2014]. Retrieved fro internet: <http://msc.berkely.edu/wjchen/publications/DSC12—8726—Fl.pdf> (11 pgs). |
Bartlett et al., “Convexity, Classification, and Risk Bounds” Jun. 16, 2005, pp. 1-61. |
Bartlett et al., “Large margin classifiers: convex loss, low noise, and convergence rates” Dec. 8, 2003, 8 pgs. |
Bohte, ‘Spiking Nueral Networks’ Doctorate at the University of Leiden, Holland, Mar. 5, 2003, pp. 1-133 [retrieved on Nov. 14, 2012]. Retrieved from the Internet: <URL: http://holnepages,cwi ,n11-sbolltedmblica6ond)hdthesislxif>. |
Bouganis et al., Training a Spiking Neural Network to Control a 4-DoF Robotic Arm based on Spiking Timing-Dependent Plasticity in WCCI 2010 IEEE World Congress on Computational Intelligence Jul. 2010 [Retrieved on Jun. 24, 2014] Retrieved from internet: http://www.doc.ic.ac.uk/˜mpsha/IJCNN10a.pdf> (8 pgs). |
Brette et al., Brian: a simple and flexible simulator for spiking neural networks, The Neuromorphic Engineer, Jul. 1, 2009, pp. 1-4, doi: 10.2417/1200906.1659. |
Cessac et al. ‘Overview of facts and issues about neural coding by spikes.’ Journal of Physiology, Paris 104.1 (2010): pp. 5-18. |
Cuntz et al., ‘One Rule to Grow Them All: A General Theory of Neuronal Branching and Its Paractical Application’ PLOS Computational Biology, 6 (8), Published Aug. 5, 2010, pp. 1-14. |
Davison et al., PyNN: a common interface for neuronal network simulators, Frontiers in Neuroinformatics, Jan. 2009, pp. 1-10, vol. 2, Article 11. |
Djurfeldt, Mikael, The Connection-set Algebra: a formalism for the representation of connectivity structure in neuronal network models, implementations in Python and C++, and their use in simulators BMC Neuroscience Jul. 18, 2011 1 pg. (Suppl 1):P80. |
Dorval et al. ‘Probability distributions of the logarithm of inter-spike intervals yield accurate entropy estimates from small datasets.’ Journal of neuroscience methods 173.1 (2008): 129-139, doi:10.1016/ j.jneumeth.2008.05.013 28 pgs. |
Fidjeland et al. ‘Accelerated Simulation of Spiking Neural Networks Using GPUs’ WCCI 2010 IEEE World Congress on Computational Intelligience, Jul. 18-23, 2010—CCIB, Barcelona, Spain, pp. 536-543, [retrieved on Nov. 14, 2012]. Retrieved from the Internet: URL:http://www.doc.ic.ac.ukl-mpsha/IJCNN10b.pdf>,. |
Floreano et al., ‘Neuroevolution: from architectures to learning’ Evol. Intel. Jan. 1, 2008: pp. 47-62, [retrieved Dec. 30, 2013] [retrieved online from URL:<http://inforscience.epfl.ch/record/112676/files/FloreanoDuerrMattiussi2008.p df>. |
Froemke et al., “Temporal Modulation of Spike-Timing-Dependent Plasticity”, Frontiers in Synaptic Neuroscience, vol. 2, article 19, Jun. 2010, pp. 1-16. |
Gewaltig et al., ‘NEST (Neural Simulation Tool)’, Scholarpedia, 2007, pp. 1-15, 2(4): 1430, doi: 1 0.4249/scholarpedia.1430. |
Gleeson et al., NeuroML: A Language for Describing Data Driven Models of Neurons and Networks with a High Degree of Biological Detail, PLoS Computational Biology, Jun. 2010, pp. 1-19 vol. 6 Issue 6. |
Gollisch et al. ‘Rapid neural coding in the retina with relative spike latencies.’ Science 319.5866 (2008): pp. 1108-1111. |
Goodman et al., Brian: a simulator for spiking neural networks in Python, Frontiers in Neuroinformatics, Nov. 2008, pp. 1-10, vol. 2, Article 5. |
Gorchetchnikov et al., NineML: declarative, mathematically-explicit descriptions of spiking neuronal networks, Frontiers in Neuroinformatics, Conference Abstract: 4th INCF Congress of Neuroinformatics, doi: 1 0.3389/conf.fninf.2011.08.00098, 2 pgs. |
Graham, Lyle J., The Surf-Hippo Reference Manual, http:// www.neurophys.biomedicale.univparis5. fr/-graham/surf-hippo-files/Surf-Hippo%20Reference%20Manual.pdf, Mar. 2002, pp. 1-128. |
Grollman et al., 2007 “Dogged Learning for Robots” IEEE International Conference on Robotics and Automation (ICRA) (6 pgs). |
http://www.braincorporation.com/specs/BStem—SpecSheet—Rev—Nov 11—2013.pdf. |
Izhikevich et al., ‘Relating STDP to BCM’, Neural Computation (2003) 15, pp. 1511-1523. |
Izhikevich, ‘Polychronization: Computation with Spikes’, Neural Computation, 25, 2006, 18, 245-282. |
Izhikevich, ‘Simple Model of Spiking Neurons’, IEEE Transactions on Neural Networks, vol. 14, No. 6, Nov. 2003, pp. 1569-1572. |
Jin et al. (2010) ‘Implementing Spike-Timing-Dependent Plasticity on SpiNNaker Neuromorphic Hardware’, WCCI 2010, IEEE World Congress on Computational Intelligence, 8 pgs. |
Karbowski et al., ‘Multispikes and Synchronization in a Large Neural Network with Temporal Delays’, Neural Computation 12; (2000), pp. 1573-1606. |
Kasabov, “Evolving Spiking Neural Networks for Spatio-and Spectro-Temporal Pattern Recognition”, IEEE 6th International Conference Intelligent Systems 2012 (6 pgs) [Retrieved on Jun. 24, 2014], Retrieved from internet: <http://ncs.ethz.ch/projects/evospike/publications/evolving-spiking-neural-networks for-spatio-and-spectro-temporal-pattern-recognition-plenary-talk-ieee-is>. |
Khotanzad, ‘Classification of invariant image representations using a neural network’ IEEF. Transactions on Acoustics, Speech, and Signal Processing, vol. 38, No. 6, Jun. 1990, pp. 1028-1038 [online], [retrieved on Dec. 10, 2013]. Retrieved from the Internet <URL: http://www-ee.uta.edu/eeweb/IP/Courses/SRP/Reference/Khotanzad.pdf>. |
Laurent, ‘Issue 1—nnql—Refactor Nucleus into its own file—Neural Network Query Language’ 47 pgs [retrieved on Nov. 12, 2013]. Retrieved from the Internet: URL:https:// code.google.com/p/nnql/issues/detail?id=1. |
Laurent, ‘The Neural Network Query Language (NNQL) Reference’ [retrieved on Nov. 12, 2013]. 1 pg., Retrieved from the Internet: <URL'https://code.google.com/p/nnql/issues/detail?id=1>. |
Lazar et al. “Consistent recovery of sensory stimuli encoded with MIMO neural circuits.” Computational intelligence and neuroscience (2009): 2. (13 pgs). |
Lazar et al. ‘A video time encoding machine’, in Proceedings of the 15th IEEE International Conference on Image Processing (ICIP '08}, 2008, pp. 717-720. |
Lazar et al. ‘Multichannel time encoding with integrate-and-fire neurons.’ Neurocomputing 65 (2005): pp. 401-407. |
Makridakis et al., ‘Evaluating Accuracy (or Error) Measures’, INSEAD Technical Report, 1995/18/TM (41 pgs). |
Masquelier, Timothee. ‘Relative spike time coding and STOP-based orientation selectivity in the early visual system in natural continuous and saccadic vision: a computational model.’ Journal of computational neuroscience 32.3 (2012): pp. 425-441. |
Miller III, “Real-Time Application of Neural Networks for Sensor-Based Control of Robots with Vision”, IEEE Transactions on Systems, Man, and Cypernetics Jul./Aug. 1989, pp. 825-831, vol. 19, No. 4. |
Nguyen et al., “Estimating divergence functionals and the likelihood ratio by penalized convex risk minimization” 2007, pp. 1-8. |
Nichols, A Reconfigurable Computing Architecture for Implementing Artificial Neural Networks on FPGA, Master's Thesis, The University of Guelph, 2003, pp. 1-235. |
Paugam-Moisy et al., “Computing with spiking neuron networks” G. Rozenberg T. Back, J. Kok (Eds.), Handbook of Natural Computing, Springer-Verlag (2010) [retrieved Dec. 30, 2013], [retrieved online from link.springer.com], 5 pgs. |
Pavlidis et al. Spiking neural network training using evolutionary algorithms. In: Proceedings 2005 IEEE International Joint Conference on Neural Networkds, 2005. IJCNN'05, vol. 4, pp. 2190-2194 Publication Date Jul. 31, 2005 [online] [Retrieved on Dec. 10, 2013] Retrieved from the Internet <URL: http://citeseerx.ist.psu.edu/viewdoc/download?doi-10.1.1.5.4346&rep=rep1&type=pdf. |
PCT International Search Report and Written Opinion for PCT/US14/48512 dated Jan. 23, 2015, pp. 1-14. |
PCT International Search Report for International Application PCT/US2013/026738 dated Jul. 21, 2014 (10 pgs). |
PCT International Search Report for PCT/US2014/040407 dated Oct. 17, 2014 (2 pgs). |
Sato et al., ‘Pulse interval and width modulation for video transmission.’ Cable Television, IEEE Transactions on 4 (1978): pp. 165-173. |
Schemmel et al., Implementing synaptic plasticity in a VLSI spiking neural network model in Proceedings of the 2006 International Joint Conference on Neural Networks (IJCNN'06), IEEE Press (2006) Jul. 16-21, 2006, pp. 1-6 [online], [retrieved on Dec. 10, 2013]. Retrieved from the Internet <URL: http://www.kip.uni-heidelberg.de/veroeffentlichungen/download.egi/4620/ps/1774.pdf>. |
Simulink.RTM. model [online], [Retrieved on Dec. 10, 2013] Retrieved from URL: http://www.mathworks.com/products/simulink/index.html> (2 pgs). |
Sinyavskiy et al. ‘Reinforcement learning of a spiking neural network in the task of control of an agent in a virtual discrete environment’ Rus. J. Nonlin. Dyn., 2011, vol. 7, No. 4 (Mobile Robots), pp. 859-875, chapters 1-8 (Russian Article with English Abstract). |
Sjostrom et al., ‘Spike-Timing Dependent Plasticity’ Scholarpedia, 5(2):1362 (2010), pp. 1-18. |
Specification, figures and EFS receipt of U.S. Appl. No. 14/265,113, filed Apr. 29, 2014 and entitled “Trainable convolutional network apparatus and methods for operating a robotic vehicle” (71 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/285,385, filed May 22, 2014 and entitled “Apparatus and methods for real time estimation of differential motion in live video” (42 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/285,414, filed May 22, 2014 and entitled “Apparatus and methods for distance estimation using multiple image sensors” (63 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/285,466, filed May 22, 2014 and entitled “Apparatus and methods for robotic operation using video imagery” (64 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/321,736, filed Jul. 1, 2014 and entitled “Optical detection apparatus and methods” (49 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/326,374, filed Jul. 8, 2014 and entitled “Apparatus and methods for distance estimation using stereo imagery” (75 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/489,242, filed Sep. 17, 2014 and entitled “Apparatus and methods for remotely controlling robotic devices” (100 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/542,391, filed Nov. 14, 2014 and entitled “Feature detection apparatus and methods for training of robotic navigation” (83 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/637,138, filed Mar. 3, 2015 and entitled “Salient features tracking apparatus and methods using visual initialization” (66 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/637,164, filed Mar. 3, 2015 and entitled “Apparatus and methods for tracking salient features” (66 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/637,191, filed Mar. 3, 2015 and entitled “Apparatus and methods for saliency detection based on color occurrence analysis” (66 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/244,888, filed Apr. 3, 2014 and entitled “Learning apparatus and methods for control of robotic devices via spoofing” (100 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/244,890, filed Apr. 3, 2014 and entitled “Apparatus and methods for remotely controlling robotic devices” (91 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/244,892, filed Apr. 3, 2014 and entitled “Spoofing remote control apparatus and methods” (95 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/588,168, filed Dec. 31, 2014 and entitled “Apparatus and methods for training robots” (101 pages). |
Specification, figures and EFS receipt of U.S. Appl. No. 14/705,487, filed May 6, 2015 and entitled “Persistent predictor apparatus and methods for task switching” (119 pages). |
Szatmary et al., ‘Spike-timing Theory of Working Memory’ PLoS Computational Biology, vol. 6, Issue 8, Aug. 19, 2010 [retrieved on Dec. 30, 2013]. Retrieved from the Internet: <URL: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.10008 79#> (11 pgs). |
Walters, “Implementation of Self-Organizing Neural Networks for Visuo-Motor Control of an Industrial Robot”, IEEE Transactions on Neural Networks, vol. 4, No. 1, Jan. 1993, pp. 86-95. |
Wang ‘The time dimension for scene analysis.’ Neural Networks, IEEE Transactions on 16.6 (2005): pp. 1401-1426. |
A Neural Network for Ego-motion Estimation from Optical Flow, by Branka, Published 1995. |
Graham The Surf Hippo User Manual Version 3.0 B. Unite de Neurosiences Integratives et Computationnelles Institut Federatif de Neurobiologie Alfred Fessard CNRS. France. Mar. 2002 [retrieved Jan. 16, 2014]. [retrieved biomedical.univ-paris5.fr ]. |
Huang, Fall Detection Using Modular Neural Networks with Back-projected Optical Flow, Published 2007. |
Lazar et al. ‘Consistent recovery of sensory stimuli encoded with MIMO neural circuits.’ Computational intelligence and neuroscience (2010): 2. |
Visual Navigation with a Neural Network, by Hatsopoulos, Published 1991. |
Zhou, Computation of Optical Flow Usinga Neural Network, Published 1988. |
Chung Hyuk Park, et al., Transfer of Skills between Human Operators through Haptic Training with Robot Coordination. International Conference on Robotics and Automation Anchorage Convention District, 2010,Anchorage, Alaska, USA, pp. 229-235 [Online] [retrieved Dec. 3, 2015]. Retrieved from Internet: <URL:https://smartech.gatech.edu/bitstream/handle/1653/36279/IEE—2010—ICRA—002.pdf>. |
Number | Date | Country | |
---|---|---|---|
20150127155 A1 | May 2015 | US |