The present invention relates generally to programmable integrated optical circuits. In particular, the invention relates to devices for performing universal linear optics transformations on waveguides.
Advances in deep learning technology require an increased demand for computing power. Photonic devices that can perform an arbitrary linear operation on modulated light are an important component for quantum mechanics and classical information processing. A photonic matrix-vector multiplier is a key element for implementing analog neural networks with photonic circuits. In addition, such a component provides a platform for multistate systems of quantum particles for quantum computation and quantum information processing. On the other hand, an integrated matrix-vector multiplier is intrinsically a universal multiport feed-forward optical device with a wide range of applications such as integrated switching, filtering, and mode division multiplexing/demultiplexing, but not limited thereto. This becomes particularly important when a linear optical device becomes reconfigurable to implement photonic devices for a wide range of applications from classical and quantum information processing to sensing and metrology or other science and quantum technologies.
In one aspect, provided is an optical device for performing unitary matrix computations comprises a light source configured to generate first optical signals; an array of waveguides, including: inputs that receive the first optical signals from the light source; a plurality of channels positioned in parallel for transmitting the first optical signals along a length of the waveguides; and outputs for outputting second optical signals generated according to a matrix multiplication operation from the first optical signals. The optical device further comprises phase shifters constructed and arranged in a cascade structure at the channels of the waveguides, the waveguides include sections or directional couplers between adjacent phase shifters. The matrix multiplication operation includes coupling coefficient values between adjacent waveguides and length values of the sections of the waveguides.
In another aspect, a scaled-up photonic device for performing general non-unitary matrix computations, comprises a light source configured to generate a plurality of first optical signals (see claim 1); two N×N optical devices performing a unitary matrix multiplication from the first optical signals; and an array of amplitude modulators interlaced between the two unitary matrix multiplication devices.
In another aspect, a method for finding parameters that realize a desired optical matrix-vector multiplier comprises executing a gradient-based optimization method to find the parameters that minimize a L2-norm of the difference between a target matrix and an approximation produced by the factorized matrix in Eq. (1) for the corresponding parameters according to the case-I or case-II herein.
The present invention will become more apparent in view of the attached drawings and accompanying detailed description. The embodiments depicted therein are provided by way of example, not by way of limitation, wherein like reference numerals refer to the same or similar elements. In the drawings:
In brief overview, the present inventive concept provides a real-time matrix-vector multiplication apparatus and method that are useful for classical and quantum optical information processing applications, especially with respect to photonic acceleration in artificial intelligence and neural networks. In classical computing, matrix-vector multipliers are core for developing photonic chips for unconventional computing with photonic spin simulators, neuromorphic computing, and machine learning with analog photonic neural networks. In quantum information processing and quantum machine learning, a similar functionality is required for performing linear operations in photonic integrated circuits that manipulate quantum states of light. In addition to applications in optical computing, a photonic device having a multiplier can serve as a universal multi-input, multi-output device for ultra-high-speed manipulation of light in photonic integrated circuits for a broad spectrum of purposes ranging from analog optical signal processing to metrology and sensing and involve applications in telecommunication and light detection and ranging (LiDAR), but not limited thereto.
As previously mentioned, matrix-vector multipliers are important for neural networks. Deep learning extends on machine learning by using more-complex neural networks to tackle more-complex tasks, including speech recognition and autonomous driving. However, deep learning requires processing large volumes of data using complex processes such as matrix vector multiplication. In order to keep up with the increasing demand for increases in processing, developers are looking to universal linear optical devices.
This disclosure describes photonic integrated circuits that perform analog matrix-vector multiplications with light. A universal photonic device that can perform arbitrary linear operations is an indispensable component of classical and quantum optical computing, while it also serves as a critical multiport circuit for advanced manipulation of light in photonic integrated circuits. One of the main limitations of the existing integrated photonic matrix multipliers is their relatively large (compared to the wavelength of light) feature sizes which prevents their scaling to large numbers of ports. The existing solutions are based on integrated Mach-Zehnder interferometers, which are inherently bulky elements.
In accordance with embodiments, provided is a novel architecture for realizing compact photonic circuits that perform arbitrary complex linear operations by interlacing or cascading two building blocks: waveguides and phase shifters. Thus, non-unitary matrix computations can be implemented by interlacing an array of amplitude modulators between two unitary matrix multiplication devices. By utilizing this architecture, a scalable and energy-efficient integrated photonic circuits that perform arbitrary complex linear operations with light is provided. Furthermore, by incorporating tunable phase shifters, a programmable photonic circuit for general-purpose applications is realized.
In some embodiments, the optical device 100 includes a plurality of evanescently-coupled optical waveguides 106 and cascaded layers of phase elements 108, such as phase modulators, shifters, or the like, constructed and arranged in an array, for example, an N×N array formed of N input ports and N output ports, where N is an integer greater than 1. In some embodiments, the optical waveguides 106 and phase elements 108 are integrated into a single photonic chip and constructed and arranged for arbitrary linear operations.
The optical waveguides 106 are coupled to a light source 12, or in some embodiments, a beam splitter. The input ports 102 and output ports 104 of the waveguides 106 are arranged as N×1 arrays. In some embodiments, each waveguide 106 may include at least one modulator for encoding amplitude and phase information for the optical signals transmitted through the waveguide.
The phase elements 108 are integrated into a single photonic chip that is electrically and optically interfaced for settings of the phase elements 108. The phase elements 108 are arranged along the waveguides 106, and more specifically, positioned at waveguide sections with different lengths for rebalancing loads between the parallel waveguides 106. The waveguide sections or layers may be referred to as directional couplers. In some embodiments, the phase shifters are cascaded in series and can be calibrated accordingly to achieve a desired phase resolution. For example, the phase elements 108 are tunable to establish a relative phase shift of the optical signals required to compute matrix-vector multiplications. The phase elements 108 modulate the phase only of a transmission wave along a waveguide 106 without changing the amplitude, so that the total of amplitudes of the optical signals is maintained along the length of the channels. The power tuning function can be obtained by the phase elements 106 to form the arbitrary linear optical device.
The light source 12 encodes the input vector A to be multiplied. This is generated from a monochromatic light source injected into an array of Mach-Zehnder interferometer that splits the signal into the N required components of A. During operation, input vectors are loaded on a wavelength generated by the light source 12. The optical device 100 represents the unitary matrix B that performs a linear matrix operation on the complex modal amplitudes of light at the input ports 102 (A) to create the complex modal amplitudes of light at the output ports 104 (C=B·A). The lengths of the waveguide layers and the phase of the phase shifters are predetermined so that the device can perform a desired unitary matrix operation U. See Eq. (1) below.
To implement an arbitrary linear unitary operation U acting on N optical modes to perform an N×N matrix operation, a cascaded series of phase elements 108 are utilized. In doing so, M-1 layers of phase modulations are considered (each containing N phase elements) and sandwiched between M waveguide sections with different lengths. Here, M is fixed so that the minimum total number of parameters (N2) to be optimized is met or exceeded (overdetermined problem). The design is based on the hypothesis that an arbitrary unitary matrix can be decomposed in the form:
U=e
iP
e
iQ
e
iP
. . . e
iQ
e
iP
e
iQ
e
iP
(1)
where, the matrices eiP
Here, ϕnm(n=1, . . . , N and m=1, . . . , M−1) represent the phases imposed by the N phase elements in the mth layer, m represent the length of the coupled waveguide sections and κn,n+1m(n=1, . . . , N−1 and m=1, . . . , M−1) is the distributed coupling rate between two adjacent waveguides n and n+1 in the mth layer.
As shown in m, the input and output values of this waveguide array 206 become related according to the following relation:
where it is convenient to use the equivalent definition
with θnm parameters to be optimized.
The phase shifter array 308 is described through the following input-output relation:
where,
The desired multiplicative unitary matrix B=Ut to represent the unitary matrix-vector product C=B·A is defined. Henceforth, Ut is interchangeably referred to as a target matrix. The device 100 is capable of represent the target matrix, provided that the appropriate parameters are identified, such as the phase elements, waveguide section lengths and coupling parameters. m are yet to be found.
Be the set of phases ϕ={ϕnm}n=1,m=1N,M-1, waveguide parameters θ={θnm}n=1,m=1N-1,M, and waveguide array lengths ={
m}m=1M, along with the L2-norm (also known as Frobenius norm) ∥F∥:=√{square root over (tr(FF†))} associated with a given complex-valued matrix F. Two different optimizations can be performed to parametrize the target matrix Ut, denoted Case-I (see method 500A) and Case-II (see method 500B) as follows:
Case-I: The factorized matrix in Eq. (1) is parametrized in terms of the sets θ and ϕ as U(θ, ϕ), where ϕnm∈(−π, π) and θnm∈. This leads to 2NM−(N+M) parameters to optimize. That is, the product of waveguide section lengths with the coupling parameters θnm=
mκn,n+1 and phase modulations ϕnm must be determined.
Case-II: The factorized matrix in Eq. (1) is parametrized in terms of the sets and ϕ as U(
, ϕ), where ϕnm∈(−π, π) and
m>0. This leads to M(N+1)−N parameters to optimize. That is, the waveguide section lengths
m and phase modulations ϕnm must be determined.
At step 410, a gradient-based optimization method is executed to find the parameters that minimize the L2-norm of the difference between the target matrix and the approximation produced by the ansatz of relation Eq. (1). That is,
for the case-I and case-II, respectively. The L2-norm of a vector can be used for calculating the error in machine learning models. Numerical results show that a general unitary matrix with up to N=10 can be approximated with the factorization of the form (1) while the error becomes arbitrarily small by increasing the number of optimization iterations.
At step 420, assuming the existence of a factorization of the form (1), a systematic technique can be applied for finding the set of parameters {θnm, ϕnm} or {m, ϕnm} for the case-I and case-II, respectively. This can be achieved by identifying a recursive method for reducing the rank of the factorized elements.
At step 510, the set of parameters of method 500A {θnm, ϕnm} or method 500B {m, ϕnm} are initialized by randomly assigning their values within their domain, as described above. In some embodiments, the unitary matrix can have a value of N=10, but not limited thereto.
At step 520, the gradients are calculated.
At step 530, the parameters {θnm, ϕnm} or {m, ϕnm} are updated.
At step 540, a matrix norm is calculated, for example, according to:
At decision diamond 550, a determination is made whether ΔE<ε. If not, then the recursive method returns to step 520. If yes, then the method ends.
The disclosed device enables analog devices that perform linear discrete operations. In addition, by using controllable phase elements, i.e., by using phase modulators, this device becomes a programmable multiport circuit. Such a device can have a wide range of applications in classical and quantum information processing as discussed in previous sections.
The inventive concept described herein has been intended to work on the optical frequency domain, where waveguide arrays are light carriers and also allow for coupling between neighboring elements. However, the concept is not limited to the optical frequency domain. Nevertheless, other embodiments of the inventive concept can apply to the microwave domain using microstrip lines suitable for microwave transport, and interdigital capacitor for evanescent mode couplings between neighbors. Thus, a microwave device may be provided instead of an optical device.
The inventive concept can be scaled-up to perform more general matrix multiplications, i.e., by considering a non-unitary matrix F. This is done using the well-known singular value decomposition F=UΣV†, where U and V† are unitary matrices that can be represented through the invention, and Σ is a positive-definite matrix that can be implemented using amplitude modulators (no phase-modulation required in this layer). Then, the general non-unitary matrix-vector multiplication C=F·A can be implemented, where A is the N input vector and C the N output vector. The scaled-up device is shown in
Accordingly, in some embodiments, two N×N optical devices perform a unitary matrix multiplication operation from the received optical signals that are transmitted through a plurality of waveguides. An array of amplitude modulators are interlaced between the two unitary matrix multiplication devices. A plurality of waveguides for transmitting the first optical signals. For non-unitary operations, total amplitude is not preserved. The modulators are constructed and arranged for encoding amplitude and phase information for the optical signals transmitted through the waveguides.
While the invention has been described with reference to certain embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof to adapt to particular situations without departing from the scope of the disclosure. Therefore, it is intended that the claims not be limited to the particular embodiments disclosed, but that the claims will include all embodiments falling within the scope and spirit of the appended claims.
This application claims priority to U.S. Provisional Application Serial No. 63/359,994 filed Jul. 11, 2022, entitled “UNIVERSAL LINEAR OPTICAL DEVICE,” the entirety of which is incorporated by reference herein.
This invention was made with government support under grant number FA9550-22-1-0189 awarded by Air Force Office of Scientific Research. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63359994 | Jul 2022 | US |