The disclosed embodiments generally relate to designs for photonic computing systems. More specifically, the disclosed embodiments relate to new parallel architectures for nanophotonic computing systems, which can be implemented using multi-stage nanophotonic circuits or multi-wavelength interconnections between nanophotonic neural circuits.
As computer systems become increasingly faster, communication delays are beginning to significantly constrain computational performance. Most modern computer systems are based on a “von Neumann architecture,” wherein data is retrieved from memory and is processed at a central processing unit (CPU). Unfortunately, as computer systems become faster, the limited data throughput that is available between CPU and memory (and between levels of cache within the CPU) is beginning to significantly limit computational performance and associated energy efficiency. This throughput-related performance limitation between CPU and memory is referred to as the “von Neumann bottleneck.”
To overcome the performance problems associated with the von Neumann bottleneck, a significant amount of research has been directed toward computing systems that use “photonic circuits” to facilitate various parallel processing operations. Some researchers have investigated using lenses to perform optical computations, which can potentially provide a high degree of parallelism. (For example, see Demetri Psaltis, David Casasent, Deborah Neft, Mark Carlotto, “Accurate Numerical Computation By Optical Convolution,” Proc. SPIE 0232, 1980 Intl Optical Computing Conf II, 22 Aug. 1980; https://doi.org/10.1117/12.958883, 1980, p. 6.) Unfortunately, it has proven impractical to perform computations in this way because of the limited scalability of such bulky optical components when considering large scale computing systems.
Other researchers have investigated using “photonic neuromorphic circuits,” which attempt to mimic the behavior of neural networks in the human brain. Neuromorphic circuits comprise a collection of elements that model individual neurons with synaptic interconnects, wherein each neuron receives input pulses from upstream neurons through synaptic interconnects (upstream) and generates output pulses that are directed to downstream neurons via synaptic interconnects (downstream). The large number of interconnections among individual neurons in a neuromorphic circuit makes it possible to achieve massively parallel computing and to overcome the limitations coming from the von Neumann bottleneck in conventional computing systems (also called “von Neumann computing systems”). These neuromorphic circuits provide energy efficiency and throughput improvements for certain types of computation tasks, such as pattern-recognition operations in relatively small-scale electronic neuromorphic computing systems as compared with electronic von Neumann computing systems. (Please see M. Davies, N. Srinivasa, T. Lin, G. Chinya, Y. Cao, S. Choday, G. Dimou, P. Joshi, N. Imam, S. Jain, Y. Liao, C. Lin, A. Lines, R. Liu, D. Mathaikutty, S. McCoy, A. Paul, J. Tse, G. Venkataramanan, Y. Weng, A. Wild, Y. Yang, and H. Wang, “Loihi: A Neuromorphic Manycore Processor with On-Chip Learning,” IEEE Micro, vol. 38, no. 01, pp. 82-99, 2018, doi: 10.1109/MM.2018.112130359. Also see P. A. Merolla, J. V Arthur, R. Alvarez-Icaza, A. S. Cassidy, J. Sawada, F. Akopyan, B. L. Jackson, N. Imam, C. Guo, Y. Nakamura, B. Brezzo, I. Vo, S. K. Esser, R. Appuswamy, B. Taba, A. Amir, M. D. Flickner, W. P. Risk, R. Manohar, and D. S. Modha, “A million spiking-neuron integrated circuit with a scalable communication network and interface,” Science (80-.)., vol. 345, no. 6197, pp. 668 LP-673, August 2014, doi: 10.1 126/science.1254642. Also see R. Carney, K. Bouchard, P. Calafiura, D. Clark, D. Donofrio, M. Garcia-Sciveres, and J. Livezey, “Neuromorphic Kalman filter implementation in IBM's TrueNorth,” 2017, doi: 10.1088/1742-6596/898/4/042021.)
It is widely anticipated that optical neuromorphic computing can bring far more advantages in energy-efficiency, throughput, and scalability compared with the electronic counterparts. However, it has proven to be quite challenging in practice to provide the required “synaptic weights” for the photonic interconnections between individual photonic neurons to effectively perform neuromorphic computations.
Hence, what is needed are new photonic neuromorphic computing systems that support a high degree of parallelism and a large set of synaptic weights, without the aforementioned shortcomings of existing electronic systems.
The disclosed embodiments relate to a nanophotonic computing system, which comprises a set of nanophotonic computing elements and an optical interconnect that interconnects the set of nanophotonic computing elements. More specifically, it is a photonic neuromorphic computing system comprising a large number of nanophotonic neurons that interconnect with many other nanophotonic neurons via photonic synaptic interconnects.
The photonic synaptic interconnects includes one or more photonic components comprising multiple waveguides. One particular example is arrayed-waveguide grating routers (AWGRs), which provide cyclic, single-wavelength, all-to-all routing between AWGR inputs and AWGR outputs, wherein each AWGR comprises free-propagation-region slabs connected by an array of waveguides to facilitate routing different wavelengths. In the cyclic AWGRs, the length of the arrayed waveguides increments linearly so that the optical phase increments linearly with the arrayed waveguides for the output waveguides of the AWGRs, which will receive signals of differing wavelengths to be routed. Hence, in this particular embodiment of the cyclic AWGR, they synaptic weight values are ‘1’ for the wavelengths routed and ‘0’ for the other wavelengths.
The inventive nanophotonic synaptic interconnects can achieve arbitrary values of synaptic weights for each input neuron-output neuron pair at each wavelength by adjusting optical path lengths or optical phase delays for each wavelength at these array waveguides. Hence, these waveguides include phase modulators for varying optical lengths of the waveguides, wherein varying the optical lengths of the waveguides facilitates adjusting weights on interconnections through the nanophotonic synaptic interconnects in a lossless manner. Lossless interconnects are represented by unitary matrices. Hence, the inventive nanophotonic synaptic interconnect device is a device that represents a unitary matrix at each wavelength. In some configurations, this inventive nanophotonic synaptic interconnect device (NSID) can represent an arbitrary unitary matrix at each wavelength independently of each other.
As previously stated, the neuromorphic computing system comprises many neurons interconnecting with many other neurons via the synaptic interconnects. In some embodiments, the set of nanophotonic computing elements comprises a set of spiking nanophotonic neurons, wherein each spiking nanophotonic neuron operates by integrating weighted outputs received from other spiking nanophotonic neurons, and producing a threshold-based nonlinear response that generates output pulses, which are broadcast to other spiking nanophotonic neurons.
In some embodiments, the set of spiking nanophotonic neurons is interconnected through the photonic synaptic interconnects to form a recurrent photonic neural network. The synaptic weights in this recurrent neural network can be adjusted by using the phase modulators in the array arms of the photonic synaptic interconnects to adjust corresponding synaptic weights in the photonic synaptic interconnect device.
In some embodiments, the synaptic weights can be positive weights or negative weights to represent positive or negative electrical field superpositions. More generally, the synaptic weight values are represented in phase and amplitude values.
In some embodiments, the nanophotonic computing system is organized as a set of interconnected neuron clusters, wherein each neuron cluster comprises: an array of spiking nanophotonic neurons; an input synaptic coupler comprising an input NSID connecting inputs of the neuron cluster to inputs of the array of spiking nanophotonic neurons; and an output synaptic coupler comprising an output NSID connecting outputs of the array of spiking nanophotonic neurons to outputs of the neuron cluster.
In some embodiments, the set of detectors is implemented within a field-programmable gate array (FPGA).
In some embodiments, the reconfigurable couplers comprise 2×2 Nano-Electro-Mechanical System (NEMS)-Mach-Zehnder interferometer (MZI) synapses to achieve optical reconfiguration with nearly zero static energy consumption.
In some embodiments, the NEMS-MZI synapses include tunable NEMS phase shifters.
In some embodiments, the reconfigurable couplers comprise 2×2 synapses composed of a phase-change material embedded in an MZI.
In some embodiments, the phase-change material comprises GeSbTe (GST) or Ge2Sb2Se4Te1(GSST).
In some embodiments, the phase modulators in the NSID comprise thermo-optic phase modulators or electro-optic phase modulators.
In some embodiments, the phase modulators in the NSID comprise resonant rings that are over-coupled to corresponding waveguides in the NSID so that optical loss is nearly negligible regardless of wavelength, wherein the resonant rings can be thermally or electro-optically tuned to have resonant wavelengths on the blue or red side of a corresponding laser wavelength so that the optical phase can be modulated from zero to 2π.
In some embodiments, each nanophotonic neuron comprises: an excitatory-input photo detector that converts an optical excitatory input signal into a corresponding electrical excitatory input signal; an inhibitory-input photo detector that converts an optical inhibitory input signal into a corresponding electrical inhibitory input signal; an electrical neuron that receives the electrical excitatory and inhibitory input signals, and generates an electrical output signal, which includes periodic voltage spikes that are triggered by integration of the electrical excitatory and inhibitory input signals; and a light-emitting output device, which converts the electrical output signal into a corresponding optical output signal.
In some embodiments, each electrical neuron in a nanophotonic neuron implements an integrate-and-fire model, wherein the electrical excitatory and inhibitory input signals are integrated until a firing threshold is reached, which causes the electrical neuron to fire and generate a voltage spike on the electrical output signal.
In another embodiment, the nanophotonic computing system can utilize metaphotonics with optical waves propagating in free space instead of waveguides. Such a nanophotonic computing system comprises: an optical source; a stack of photonic layers composed of metalenses and intervening specialized modulators, wherein each metalens comprises a flat lens metastructure composed of subwavelength scale elements, and wherein each specialized modulator comprises a modulator metastructure composed of subwavelength scale elements; and an optical detector array. During operation, this nanophotonic computing system is configured to channel light emanating from the optical source through the stack of photonic layers and onto the optical detector array to facilitate optical computing operations
In some embodiments, each specialized modulator comprises a liquid-crystal-on-silicon-based spatial light modulator.
In some embodiments, the metalenses perform wavelength-dependent diffraction, focusing and collimating operations.
In some embodiments, the system includes an optical or electrical feedback path that facilitates cycling through the stack of photonic layers to perform multi-layer optical processing operations, as performed in multi-layer deep neural networks.
In some embodiments, the optical computing operations can include: a Fourier transform operation; a convolution operation; a matrix-multiplication operation; and an arbitrary algebraic operation.
In some embodiments, the system can be programmed to perform various operations, including: feature recognition operations; associative memory operations; correlation operations; and neural network processing operations.
The disclosed embodiments also relate to a universal optical waveform transformer, which includes a metaphonic mode multiplexer that facilitates arbitrary beamforming, and a metaphonic mode demultiplexer, which facilitates arbitrary decomposition. It also includes a unitary photonic matrix element, coupled between the metaphonic mode multiplexer and the metaphonic mode demultiplexer, which facilitates converting any input spatial mode to any output spatial mode.
In some embodiments, the metaphonic mode multiplexer comprises an orbital angular momentum (OAM) state multiplexer, and the metaphonic mode demultiplexer comprises an OAM state demultiplexer.
In some embodiments, the OAM state multiplexer and the OAM state demultiplexer each comprise: a circular arrangement of apertures; a set of phase-matched waveguides coupled to the circular arrangement of apertures; and a star coupler coupled to the set of phase-matched waveguides.
In some embodiments, the unitary photonic matrix element comprises a photonic mesh that connects a set of input waveguides to a set of output waveguides. This photonic mesh incorporates 2×2 Mach-Zehnder interferometer blocks that facilitate a matrix multiplication of the complex values (the amplitude and phase of the optical fields) in the set of input waveguides to produce a result encoded in corresponding complex values on the set of output waveguides.
The following description is presented to enable any person skilled in the art to make and use the present embodiments, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present embodiments. Thus, the present embodiments are not limited to the embodiments shown, but are to be accorded the widest scope consistent with the principles and features disclosed herein.
The data structures and code described in this detailed description are typically stored on a computer-readable storage medium, which may be any device or medium that can store code and/or data for use by a computer system. The computer-readable storage medium includes, but is not limited to, volatile memory, non-volatile memory, magnetic and optical storage devices such as disk drives, magnetic tape, CDs (compact discs), DVDs (digital versatile discs or digital video discs), or other media capable of storing computer-readable media now known or later developed.
The methods and processes described in the detailed description section can be embodied as code and/or data, which can be stored in a computer-readable storage medium as described above. When a computer system reads and executes the code and/or data stored on the computer-readable storage medium, the computer system performs the methods and processes embodied as data structures and code and stored within the computer-readable storage medium. Furthermore, the methods and processes described below can be included in hardware modules. For example, the hardware modules can include, but are not limited to, application-specific integrated circuit (ASIC) chips, field-programmable gate arrays (FPGAs), and other programmable-logic devices now known or later developed. When the hardware modules are activated, the hardware modules perform the methods and processes included within the hardware modules.
The disclosed embodiments provide a system comprising energy-efficient bio-inspired nanophotonic neurons together with synapses and neural networks to interconnect them. Biological neurons are known to emit electrical pulses, or a series of stereotyped action potentials, or spikes, after receiving stimuli. Coding of information in the form of the timing of the spikes (temporal coding) and the spike rate (rate coding) has been a subject of active research. In designing nanophotonic spiking neural networks, three fundamental elements, namely the neuron, the synapses, and the coding scheme, should preferably be designed together to have: (1) weighted addition—the ability to sum weighted inputs; (2) integration—the ability to integrate the weighted sum over time; (3) thresholding—the ability to make a decision whether or not to send a spike (all-or-none); (4) reset—the ability to have a refractory period during which no firing can occur immediately after a spike is released; and (5) pulse generation—the ability to generate new pulses.
In the nanophotonic neurons shown in
The disclosed embodiments provide an energy-efficient, high-throughput, hardware-reduced, scalable, robust, and accurate machine learning system based on a spiking nanophotonic neural reservoir computing (SNNRC) system 300, which is conceptually illustrated in
As illustrated in more detail in
The self-optimizing nanophotonic synaptic interconnect 310 illustrated in
Referring to
Unlike previous optical interconnects, this can be accomplished without having to throw away any of the optical power unless necessary for the desired linear mapping. There also exist a number of simple techniques for configuring such networks, which have been successfully demonstrated in such meshes. In particular, such networks can be configured by a simple training operation that maps arbitrary inputs to specific outputs. This training requires no calculations or calibrations to set the mesh components, and the mesh can even realign to compensate for any drifts in components or to new training vectors. These techniques work by using a succession of simple feedback loops on Mach-Zehnder settings, which are based on power minimization in the embedded detectors D11-D31 illustrated in
Alternatively, the 2×2 MZI synapses can be implemented by incorporating phase-change materials instead of MEMS or NEMS. For example, see “Low-Loss and Broadband Nonvolatile Phase-Change Directional Coupler Switches,” Peipeng Xu, Jiajiu Zheng, Jonathan K. Doylend and Arka Majumdar, ACS Photonics 2019, 6, 2, 553-557, Jan. 7, 2019. This article demonstrates how GeSbTe (GST) or Ge2Sb2Se4Tet(GSST) materials embedded in an MZI can be used to implement a nonvolatile 2×2 synapse.
We now discuss how a single-wavelength spiking neural network can be extended to a WDM spiking neural reservoir computing network with far greater interconnectivity. If each spiking neuron emits at its own characteristic wavelength and receives spikes at multiple wavelengths, then significant enhancement of interconnectivity is possible (by a factor of w, where w is the number of wavelength channels). For example, in the context of the system 300 illustrated in
Because the N×N WDM coupler illustrated in
The N×N WDM coupler illustrated in
This type of modulation can be implemented using a ring-assisted Mach-Zehnder modulator, as is described in “Differential Microring Modulators for Intensity and Phase Modulation: Theory and Experiments,” Chia-Ming Chang, Guilhem de Valicourt, S. Chandrasekhar and Po Dong, Journal of Lightwave Technology, vol. 35, issue 15, August 2017. These resonators can be wavelength-tuned using thermal, electro-optical and mechanical-optical mechanisms, and also by incorporating phase-change materials into the resonators.
Phase-change-materials, such as GST, can be used in a Fabry-Perot filter to selectively transmit or block the part of the spectrum of interest. By forming an aperture and placing the GST layer in between the top and the bottom distributed Bragg reflectors (DBRs), one can create a phase-change-tunable-filter. Application of long (short) electrical pulses to the GST layer will change the phase of the GST layer from amorphous to crystalline (crystalline to amorphous), which will change the optical refractive index from 6.2 to 3.5 (3.5 to 6.2) while the material loss is relatively low. See S. J. Ben Yoo, “Nanophotonic computing: scalable and energy-efficient computing with attojoule nanophotonics,” in 2017 IEEE Photonics Society Summer Topical Meeting Series (SUM), 2017, pp. 1-2. Also, see J. Hu, K. Zhang, and S. J. Ben Yoo, “Hardware-Based Simulation of Optoelectronic Spiking Neuromorphic Computing Network,” in Conference on Lasers and Electro-Optics, 2019, p. JTh2A.68.
For a description of how phase change materials in optical resonators can be more readily integrated with planar waveguides, such as AWGRs, see “All-optical non-volatile tuning of an AMZI-coupled ring resonator with GST phase-change material,” Hanyu Zhang, Linjie Zhou, Jian Xu, Liangjun Lu, Jianping Chen, and B. M. A. Rahman, Optics Letters, vol. 43, Issue 22, pp. 5539-5542, 2018. Also see “Optical switching at 1.55 μm in silicon racetrack resonators using phase change materials,” Miquel Rudé, Josselin Pello, Johann Osmond, Gunther Roelkens, Jos J. G. M. van der Tol, and Valerio Pruneri, Appl. Phys. Lett. 103, 141119, 2013.
Nanophotonic Computing System that Uses Metalenses
Recent developments in metaphotonics and integrated photonic technologies have resulted in flat optical lenses that can be integrated on LCDs and detectors in vertical stacks, suggesting a path toward scalable multi-layer convolutional neural networks. Also, very complex artificial neural networks (ANN) that support reinforcement learning and unsupervised learning have been developed. Hence, it is now possible to envision new computing systems comprising metaphotonic and optoelectronic components with 2D and 3D integration.
Referring to
For example,
In the system illustrated in
In order to decompose and process optical information and to reconstruct optical information, we investigated all optical spatial multiplexers and demultiplexers utilizing silicon photonics. Our initial demonstration utilized orbital angular momentum (OAM) state multiplexing and demultiplexing. (See L. Allen, M. W. Beijersbergen, R. J. C. Spreeuw, and J. P. Woerdman, “Orbital angular momentum of light and the transformation of Laguerre-Gaussian laser modes,” Physical Review A, vol. 45, pp. 8185-8189, 1992.)
It is presently possible to create a 2D spatial mode multiplexer/demultiplexer utilizing multiples of the OAM mode multiplexer/demultiplexer at multiple radii.
As discussed above and as is illustrated in
Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention. Thus, the present invention is not limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
The foregoing descriptions of embodiments have been presented for purposes of illustration and description only. They are not intended to be exhaustive or to limit the present description to the forms disclosed. Accordingly, many modifications and variations will be apparent to practitioners skilled in the art. Additionally, the above disclosure is not intended to limit the present description. The scope of the present description is defined by the appended claims.
This application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Application Ser. No. 62/891,986, entitled “Nanophotonic Parallel Processors,” by inventor Sung-Joo Ben Yoo, filed on 27 Aug. 2019 (Attorney Docket No. UC19-093-1PSP). This application also claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Application Ser. No. 62/881,241, entitled “Multi-Wavelength Nanophotonic Neural Computing,” by inventor Sung-Joo Ben Yoo, filed on 31 Jul. 2019 (Attorney Docket No. UC19-788-1PSP). The contents of the above-listed applications are hereby incorporated by reference herein.
This invention was made with U.S. government support under grant number FA9550-18-1-0186 awarded by the Air Force Office of Scientific Research (AFOSR). The U.S. government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62881241 | Jul 2019 | US | |
62891986 | Aug 2019 | US |