Embodiments of the present disclosure generally relate to the field of optoelectronics, and more particularly, to techniques and configurations for an optical matrix multiplier for optical neural networks.
Machine learning architectures are typically based on artificial neural networks (ANNs). Optical neural networks (ONNs) are a type of physical implementation of artificial neural network (ANN) that use optical components as a building blocks. The basic building blocks of an optical neural network (ONN) typically include interconnected Mach-Zehnder interferometers (MZI) that perform unitary transformations on an array of optical signals. However, due to the size and configuration of MZIs, a network of interconnected MZIs may be less compact than desired. Furthermore, ANN computations typically implement nonlinear functions (such as Rectified Linear Unit ReLu, Sigmoid S-shape, Tan h, or threshold step-like functions), in addition to the linear algebra matrix computations. However, in optical domain, there appear to be no equivalent optical nonlinear activation functions and/or devices for ONN.
Embodiments will be readily understood by the following detailed description in conjunction with the accompanying drawings. To facilitate this description, like reference numerals designate like structural elements. Embodiments are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings.
Embodiments of the present disclosure describe techniques and configurations for a nonlinear optical device used to construct an optical neural network (ONN) with one or more layers of matrix multipliers. Embodiments include coupling the nonlinear optical device to the outputs of the layers comprising optical coherent matrix multipliers to apply amplification, attenuation, and nonlinearity functions. In embodiments, the nonlinear optical device includes a waveguide to receive optical input and a gain medium coupled with the waveguide, to amplify the received optical input, to provide an output that is amplified in a nonlinear manner in response to the optical input reaching saturation, where the nonlinearly amplified output is to provide a nonlinear activation function for an ONN.
In embodiments, the nonlinear optical device provides optical amplification to compensate for waveguide propagation loss needed to emulate the multiple layers of the ONN. In embodiments, a III-V gain medium is bonded to silicon photonics to provide amplification, where the gain medium provides a linear amplification, and a nonlinear amplification function in response to the input power reaching a saturation level. In embodiments, a carrier-injection PIN diode can be added to couple with the amplification function to provide light attenuation control to not overload the subsequent layer or photodiode array (PDA).
In the following description, various aspects of the illustrative implementations will be described using terms commonly employed by those skilled in the art to convey the substance of their work to others skilled in the art. However, it will be apparent to those skilled in the art that embodiments of the present disclosure may be practiced with only some of the described aspects. For purposes of explanation, specific numbers, materials, and configurations are set forth in order to provide a thorough understanding of the illustrative implementations. However, it will be apparent to one skilled in the art that embodiments of the present disclosure may be practiced without the specific details. In other instances, well-known features are omitted or simplified in order not to obscure the illustrative implementations.
In the following detailed description, reference is made to the accompanying drawings that form a part hereof, wherein like numerals designate like parts throughout, and in which is shown by way of illustration embodiments in which the subject matter of the present disclosure may be practiced. It is to be understood that other embodiments may be utilized and structural or logical changes may be made without departing from the scope of the present disclosure. Therefore, the following detailed description is not to be taken in a limiting sense, and the scope of embodiments is defined by the appended claims and their equivalents.
For the purposes of the present disclosure, the phrase “A and/or B” means (A), (B), or (A and B). For the purposes of the present disclosure, the phrase “A, B, and/or C” means (A), (B), (C), (A and B), (A and C), (B and C), or (A, B, and C).
The description may use perspective-based descriptions such as top/bottom, in/out, over/under, and the like. Such descriptions are merely used to facilitate the discussion and are not intended to restrict the application of embodiments described herein to any particular orientation.
The description may use the phrases “in an embodiment,” or “in embodiments,” which may each refer to one or more of the same or different embodiments. Furthermore, the terms “comprising,” “including,” “having,” and the like, as used with respect to embodiments of the present disclosure, are synonymous.
The term “coupled with,” along with its derivatives, may be used herein. “Coupled” may mean one or more of the following. “Coupled” may mean that two or more elements are in direct physical or electrical contact. However, “coupled” may also mean that two or more elements indirectly contact each other, but yet still cooperate or interact with each other, and may mean that one or more other elements are coupled or connected between the elements that are said to be coupled with each other. The term “directly coupled” may mean that two or more elements are in direct contact.
As used herein, the term “optical waveguide” can refer to any physical device or structure that guides light (e.g., an optical signal) in a confined manner. In embodiments, the optical waveguides include silicon-based optical waveguides having a core for confinement of light and formation of modes surrounded by a cladding or substrate, having a lower refractive index than the core.
Within the ONN 102, a laser diode array (LDA) 110 together with optical modulators 112 (hereinafter referred to as “modulator 112”) provides optical input to a first layer 105. A photodetector array 114 will receive optical output from the third layer 107, and convert that output into digital signals. In this example, light signals are sent from layer 1105, to layer 2104, and then to layer 3107. Each layer is made up of an optical unitary matrix multiplier (that may include a plurality of optical unitary matrix multipliers) and non-linear optical devices (e.g., nonlinear optical amplifiers 124 described below). In embodiments, the ONN 102 including array (LDA) 110, modulator 112, multiple layers 105, 104, 107, and PDA 114 can be implemented in a heterogeneously integrated photonics circuit, such as a single silicon photonics die or single semiconductor substrate 150.
Nonlinear optical amplifiers 124 may be needed to be coupled to the optical unitary matrix multiplier 122 due to the linear nature of the optical signal processing from the optical unitary matrix multipliers 118, 120, 122. The optical signal, including noise added to the optical signal, may be linearly increased during operation of the ONN 102, and may result in a final signal intensity from the Un optical unitary matrix multiplier 122 that is too high. This signal intensity may cause optical inputs to overload a subsequent layer 107, or overload the PDA 114.
The nonlinear optical amplifier 124 may comprise multiple nonlinear optical devices. An example nonlinear optical device 128 shown in
The equation I out=f(Iin)eiΔϕ on the output of 126 shown in
For example, if the signal output 126 level represents 8 bits, it may be desirable for the nonlinear optical device 128 to clean up the representation of a low bit to 0, and a high bit to be put into the upper limits as a saturation function. This will enhance the performance of optical signal output to proceed to the next layer in the linear functions of the various optical matrix multipliers.
Gain medium 238 may be bonded on the silicon photonics waveguide 234 to provide nonlinear output power change with optical input power, in bias current vs voltage operation for activation functions in optical amplifying, attenuating and saturating. The gain medium has a gain which is highly nonlinear as a function of the input optical power and the unsaturated model gain. Since the gain depends on the input optical power, the gain medium is nonlinear. The nonlinearity occurs primarily due to gain saturation. When the gain medium operates in the linear regime where optical input power Pin is relatively low, the gain of the gain medium can linearly increase as the input optical power increases, leading to the output optical power Pout increase. Because the optical input power Pin increases relatively high when the gain medium operates in the non-linear saturation regime, the output optical power (Pout) does not change relative to Pin, where the saturated output power is defined as Psat. and described below in greater detail. Gain compression in the gain medium occurs primarily due to insufficient photons generated due to the stimulated recombination limited by the supply of free charge carriers to the active region. The saturated operating point of the gain medium can be operated as optical analogue of nonlinear-like Sigmoid or Tan h functions.
The operation of the gain medium may be described by the following equation:
P
out
=P
in
+G(Pin)
where Pin is the input optical power to the gain medium, and Pout is the output power at the end of the gain medium. The gain medium has a gain G and it depends on the input optical power to the gain medium G(Pin). The optical power can be measured in dBm, and Gain G can be measured in dB. Bias-voltage control of the amplifier 230 nonlinear functions may result in very different optical output power levels. If the light power is too low, or on-chip loss is too high, the amplifier 230 may exhibit a combination of nonlinear amplification behavior, and begin to behave like a sigmoid function.
In a design for nonlinear gain medium, there are two key parameters. The first, gain factor G, which itself is highly nonlinear. The second is saturation power Psat for input optical power and output optical power. For a given layer of an ONN they can be customized for in design gain-medium geometry, for example III-V layer cavity length, width, gain medium mesa structure, epitaxial layer stack, and/or geometry may form sigmoid-like optical nonlinear functions.
In the equation for G, g is the material gain of the active region Γa is the mode confinement factor of the active region, α is the modal loss, equal to the loss of each region weighted by its mode confinement factor. L is the length of gain medium. In the equation for Psat, Aeff is an effective area of the optical mode, aN is the differential gain, and τS is a recombination time.
An example nonlinear optical device 250 (similar to 128 of
Alternatively, the carrier injection diode 260 can also be added before the gain medium 238 (as shown in the nonlinear optical device 270, similar to 128 of
Since the amplifier gain depends on the input power, the amplifier is nonlinear. The nonlinearity is due to gain saturation. When input optical power is less than Psat, the amplifier gain G equals the unsaturated value G. As the input optical power Pin increases, the output optical power Pout also linear increases. As the output optical power becomes large enough to cause a significant reduction in the carrier density, and when the carrier density decreases, the gain G becomes saturated, then output optical power Pout is nearly not changing with the increasing input optical power. The add-on PIN pre- and post-diode provide the extra control of both the input optical power as well as the output optical power.
As shown, a free-carrier injection forwarded biased PIN diode 260 is added to enhance nonlinear activation functions in light attenuation. As a result of the edition of the carrier injection diode 260, the power function is:
P
out
=P
in
−e
−(Δα
+Δα
)*L
+G(Pin)
where Pin, Pout and G(Pin) are defined as the above equations. Δαe and Δαh are the electrons and holes carrier-induced electro-absorption propagation loss coefficient expressed in cm−1, respectively. L is the propagation distance along the free carrier-injection diode in waveguide expressed in cm. The carrier-induced electro-absorption propagation loss by the factor of exp (−Δα*L) is expressed in optical power loss in dB. The optical power is measured in dBm, and gain G(Pin) is measured in dB, in embodiments.
In embodiments, a quantum well or a quantum dot gain medium may be used for highly efficient optical nonlinear activation functions and/or devices. This function is described in more detail for amplifier 230 in reference to
During operation light signal 446 flows from the silicon rib 434, gets drawn up into the quantum well layer 442 next to the III-V material 444, and then subsequently flows back down into the silicon rib layer 434. During this process, amplification of the light occurs. In embodiments, the III-V material 444 acts as a semiconductor amplifier in the forward bias and acts as absorption as a attenuating in the reverse bias.
Attributes of the structures of the quantum well layer 442 and the III-V material 444 may be changed in order to affect the amount of amplification, including whether the amplification is linear or nonlinear. For example, lengths of the cavity in which the light 446 flows in the quantum well layer 442 may be shortened to make nonlinear amplification faster. By adjusting the quantum well layer 442 and the III-V material, the amplification range, saturation range, or saturation attenuation points may be adjusted. Note that if the amplification or intensity of the light 446 is too high, the light will become saturated and perform as a nonlinear function. In embodiments, the quantum well layer 442 may be a quantum dot.
Diagram 450 shows a cross-section perpendicular to the silicon rib 434, and includes the silicon substrate 438, the box layer 436, and the silicon rib 434. The waveguide 435 is shown on top of the box layer 436, and is surrounded by channels of air or silicon oxide between the waveguide 435 and the rest of the silicon rib 434. In some embodiments, the semiconductor layer 435 can be provided on high index waveguide platforms, such as silicon nitride (Si3N4) waveguide-buried oxide (BOX)-Si substrate, or silicon oxynitride (SiON) waveguide-BOX-Si substrate, higher index delta silicon oxide (SiOx) waveguide-BOX-Si substrate.
In embodiments, the gain material of the III-V structure 444 may include III-V compounds. For example, the gain material may include a combination of elements in group III of the periodic table (e.g., aluminum, gallium, indium, etc.) and elements in group V of the periodic table (e.g., phosphorous, arsenic, antimony, etc.) For example, the III-V compounds may include gallium nitride (GaN), gallium arsenide (GaAs), indium phosphide (InP), aluminum arsenide (AlAs), various combinations of the elements or compounds (e.g., InGaAsP, InAlGaAs, etc.), or some other III-V compound. I n some embodiments, the gain material may include glass, ceramic, or some other material. In some embodiments the gain material may include various layers of the above compounds or some other type of gain material. Any other materials suitable for generation of the non-linear function in the gain medium may be used.
On top of the waveguide 435 and the silicon rib 434 is a III-V layer 448. In embodiments, a cathode 452 may be applied to the III-V layer 448, and an anode 454 applied to the top of III-V layer 444.
In embodiments, a forward bias or a reverse bias may be applied to the layer 448, and an anode 454 may be provided to the top of the II-V layer 454. In embodiments, the amplifier 400 may perform the function of amplifying light when a voltage is applied to the cathode 452 that is less than a voltage applied to the anode 454, creating a forward bias. The amplifier 400 may perform the function of absorbing light when a voltage is applied to the anode 454 that is greater than a voltage applied to the cathode 452, creating a reverse bias.
In
In
Turning back to
Some example applications of the nonlinear optical devices in ONN are described below. Specifically,
Note that for clarity, only one of 2×2 directional optical matrices 702 and one of 2×2 unitary multi-mode interference (MMI) optical couplers 704 is labeled. For matrix multiplier 701, a plurality of 2×2 directional optical matrices 702 are optically coupled together to receive an array of optical signal inputs at 705 in
Note that the array of optical signal inputs 707 for matrix multiplier 701 (and optical signal inputs 711 for matrix multiplier 703) include n optical inputs and n optical signal outputs where n=8. In embodiments, the matrix multipliers each include n (n−1)/2 2×2 unitary optical matrices (e.g., n (n−1)/2 2×2 optical matrices. Although n=8 in
Accordingly, as described in connection with
Note that a tuning allows the modes of the first optical signal and the second optical signal interfere in the MM waveguide to output an optical signal at a power ratio that can be adjusted according to a U(2) matrix algebra.
In embodiments, the device 801 (and/or computing device 800) may include or be used in general matrix multiplier (GEMM) or convolutional (CONV) neural network accelerators, heterogeneous artificial intelligence (AI) media inferencing accelerators, domain-specific machine-learning and deep learning accelerators (Neuro/Memory/inferencing/training), or data-centric neural network computing processors.
As shown, computing device 800 may include a one or more processors or processor cores 803 and memory 804. In some embodiments, the device 801 may be integrated with the processors 803. In embodiments, memory 804 may be system memory. For the purpose of this application, including the claims, the terms “processor” and “processor cores” may be considered synonymous, unless the context clearly requires otherwise. The processor 803 may include any type of processors, such as a central processing unit, a microprocessor, and the like. The processor 803 may be implemented as an integrated circuit having multi-cores, e.g., a multi-core microprocessor. The computing device 800 may include mass storage devices 806 (such as diskette, hard drive, volatile memory (e.g., dynamic random-access memory (DRAM), compact disc read-only memory (CD-ROM), digital versatile disk (DVD), and so forth). In general, memory 804 and/or mass storage devices 806 may be temporal and/or persistent storage of any type, including, but not limited to, volatile and non-volatile memory, optical, magnetic, and/or solid state mass storage, and so forth. Volatile memory may include, but is not limited to, static and/or dynamic random-access memory. Non-volatile memory may include, but is not limited to, electrically erasable programmable read-only memory, phase change memory, resistive memory, and so forth.
The computing device 800 may further include input/output (I/O) devices 808 (such as a display (e.g., a touchscreen display), keyboard, cursor control, remote control, gaming controller, image capture device, and so forth) and communication interfaces 810 (such as network interface cards, modems, infrared receivers, radio receivers (e.g., Bluetooth), and so forth). In some embodiments, the communication interfaces 810 may include or otherwise be coupled with integrated photonics device 801, as described above, in accordance with various embodiments.
The communication interfaces 810 may include communication chips that may be configured to operate the device 800 in accordance with a Global System for Mobile Communication (GSM), General Packet Radio Service (GPRS), Universal Mobile Telecommunications System (UMTS), High Speed Packet Access (HSPA), Evolved HSPA (E-HSPA), or Long-Term Evolution (LTE) network. The communication chips may also be configured to operate in accordance with Enhanced Data for GSM Evolution (EDGE), GSM EDGE Radio Access Network (GERAN), Universal Terrestrial Radio Access Network (UTRAN), or Evolved UTRAN (E-UTRAN). The communication chips may be configured to operate in accordance with Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Digital Enhanced Cordless Telecommunications (DECT), Evolution-Data Optimized (EV-DO), derivatives thereof, as well as any other wireless protocols that are designated as 3G, 4G, 5G, and beyond. The communication interfaces 810 may operate in accordance with other wireless protocols in other embodiments.
The above-described computing device 800 elements may be coupled to each other via system bus 812, which may represent one or more buses. In the case of multiple buses, they may be bridged by one or more bus bridges (not shown). Each of these elements may perform its conventional functions known in the art. In particular, memory 804 and mass storage devices 806 may be employed to store a working copy and a permanent copy of the programming instructions for the operation of integrated photonics device. The various elements may be implemented by assembler instructions supported by processor(s) 803 or high-level languages that may be compiled into such instructions.
The permanent copy of the programming instructions may be placed into mass storage devices 806 in the factory, or in the field, through, for example, a distribution medium (not shown), such as a compact disc (CD), or through communication interface 810 (from a distribution server (not shown)). That is, one or more distribution media having an implementation of the agent program may be employed to distribute the agent and to program various computing devices.
The number, capability, and/or capacity of the elements 808, 810, 812 may vary, depending on whether computing device 800 is used as a stationary computing device, such as a server computer in a data center, or a mobile computing device, such as a tablet computing device, laptop computer, game console, or smartphone. Their constitutions are otherwise known, and accordingly will not be further described.
For one embodiment, at least one of processors 803 may be packaged together with computational logic 822 configured to practice aspects of optical signal transmission and receipt described herein to form a System in Package (SiP) or a System on Chip (SoC).
In various implementations, the computing device 800 may comprise one or more components of a data center, a laptop, a netbook, a notebook, an ultrabook, a smartphone, a tablet, a personal digital assistant (PDA), an ultra mobile PC, a mobile phone, or a digital camera. In further implementations, the computing device 800 may be any other electronic device that processes data.
According to various embodiments, the present disclosure describes a number of examples.
Example 1 is an apparatus, comprising a waveguide to receive optical input; and a gain medium coupled with the waveguide, to amplify or attenuate the received optical input, to provide an output that is generated in a nonlinear manner in response to the optical input reaching saturation, wherein the nonlinearly generated output is to provide a nonlinear activation function for an optical neural network (ONN).
Example 2 is the apparatus of Example 1, further comprising an optical matrix multiplier of the ONN coupled with the waveguide, wherein an output of the optical matrix multiplier comprises the optical input to the waveguide.
Example 3 is the apparatus of Example 1, wherein the apparatus to provide the nonlinear activation function includes to provide at least one of: amplification, saturation, rectification, or attenuation of the optical input.
Example 4 is the apparatus of Example 1, wherein the gain medium includes a III/V material.
Example 5 is the apparatus of Example 4, wherein the gain medium is characterized by one or more parameters, wherein the parameters include a gain factor G, wherein the gain factor G is a non-linear parameter.
Example 6 is the apparatus of Example 1, wherein the gain medium includes a multiple quantum well (MQW).
Example 7 is the apparatus of Example 6, wherein the gain medium comprises a quantum dot gain medium.
Example 8 is the apparatus of Example 1, further comprising a carrier injection diode coupled with the gain medium, to provide attenuation control to the output.
Example 9 is the apparatus of Example 1, wherein the carrier injection diode comprises a PIN diode.
Example 10 is the apparatus of Example 1, wherein the apparatus comprises a nonlinear optical device.
Example 11 is the apparatus of Example 1, wherein the apparatus is integrated in an ONN integrated circuit.
Example 12 is an apparatus for an optical neural network (ONN), comprising: an optical matrix multiplier provided in a semiconductor substrate to receive an array of optical signal inputs and to linearly transform the plurality of optical signal inputs into an array of optical signal outputs; a nonlinear optical device coupled with the optical matrix multiplier, to receive the optical signal outputs from the optical matrix multiplier, and to provide a first optical output that is generated in a linear manner in response to the optical signal outputs of the matrix multiplier operating in a first power range, and to provide a second optical output that is generated in a nonlinear manner in response to the optical signal outputs of the optical matrix multiplier operating in a second power range, wherein the second power range has a higher power limit than the first power range, wherein the nonlinearly generated optical output of the nonlinear optical device is to provide a nonlinear activation function for the ONN.
Example 13 is the apparatus of claim 12, further comprising: an array of light sources provided in the semiconductor substrate to generate an array of light signals; and a plurality of optical modulators coupled to the array of light sources in the semiconductor substrate to modulate data onto the light signals to generate the array of optical signal inputs, to be provided to the optical matrix multiplier.
Example 14 is the apparatus of Example 13, wherein the optical matrix multiplier is coupled with the plurality of optical modulators.
Example 15 is the apparatus of Example 12, wherein the nonlinear optical device includes: a waveguide to receive the optical signal outputs from the optical matrix multiplier; and a gain medium coupled with the waveguide, to provide the first and second optical outputs.
Example 16 is the apparatus of Example 13, wherein the semiconductor substrate is a single semiconductor substrate, wherein the array of light sources, the plurality of optical modulators, the optical unitary matrix multiplier, and the nonlinear optical device are heterogeneously integrated in the single semiconductor substrate.
Example 17 is a system, comprising: an optical neural network (ONN) integrated circuit (IC), including: an optical matrix multiplier provided in a semiconductor substrate to receive an array of optical signal inputs and to linearly transform the plurality of optical signal inputs into an array of optical signal outputs; and a nonlinear optical device coupled with the optical matrix multiplier, to receive the optical signal outputs from the optical matrix multiplier, and to provide an optical output that is generated in a nonlinear manner in response to the optical signal outputs the optical matrix multiplier reaching saturation, wherein the nonlinearly amplified generated output of the nonlinear optical device is to provide a nonlinear activation function for the ONN; and a processor coupled to the ONN IC to provide the ONN with data to modulate onto the array of optical signal inputs to be linearly transformed by the optical matrix multiplier.
Example 18 is the system of Example 17, wherein the ONN IC further includes: an array of light sources provided in the semiconductor substrate to generate an array of light signals; and a plurality of optical modulators coupled to the array of light sources in the semiconductor substrate to modulate data onto the light signals to generate the array of optical signal inputs, to be provided to the optical matrix multiplier.
Example 19 is the system of Example 18, wherein the nonlinear optical device includes: a waveguide to receive the optical signal outputs from the optical matrix multiplier; and a gain medium coupled with the waveguide, to provide the optical output that is generated in the nonlinear manner in response to the optical signal outputs of the optical matrix multiplier reaching saturation.
Example 19 is the system of Example 18, wherein the processor is included in one of: general matrix multiplier (GEMM) neural network accelerator, a convolutional (CONV) neural network accelerator, a domain-specific machine-learning accelerator, or a deep learning accelerator.
Various embodiments may include any suitable combination of the above-described embodiments including alternative (or) embodiments of embodiments that are described in conjunctive form (and) above (e.g., the “and” may be “and/or”). Furthermore, some embodiments may include one or more articles of manufacture (e.g., non-transitory computer-readable media) having instructions, stored thereon, that when executed result in actions of any of the above-described embodiments. Moreover, some embodiments may include apparatuses or systems having any suitable means for carrying out the various operations of the above-described embodiments.
The above description of illustrated implementations, including what is described in the Abstract, is not intended to be exhaustive or to limit the embodiments of the present disclosure to the precise forms disclosed. While specific implementations and examples are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the present disclosure, as those skilled in the relevant art will recognize.
These modifications may be made to embodiments of the present disclosure in light of the above detailed description. The terms used in the following claims should not be construed to limit various embodiments of the present disclosure to the specific implementations disclosed in the specification and the claims. Rather, the scope is to be determined entirely by the following claims, which are to be construed in accordance with established doctrines of claim interpretation.