Many emerging technologies in photonics rely on multiport interferometers. A multiport interferometer is a linear input-output circuit whose transfer matrix is programmed by the user. A major challenge for multiport interferometers is the presence of hardware errors (e.g., fabrication imperfections) that lead to deviations from ideal behavior. Since hardware errors cascade as light propagates through a circuit, their effects are especially severe in large circuits and limit the scalability of many photonic systems. To address this problem, many error correction techniques have been proposed, including global optimization, local correction, and progressive self-configuration. However, all of these approaches require accurate pre-calibration of the errors, additional components, or both.
We disclose programming methods for multiport interferometers, based on measurement-assisted matrix diagonalization, that correct for hardware errors in a near-optimal fashion without extra hardware complexity or pre-calibration, as well as multiport interferometers programmed or configured according to these methods. In addition, we present modified circuit architectures that further improve scaling: (1) a design based on modified tunable couplers to span a wider range of splitting ratios, and (2) a design based on the generalized FFT butterfly. These new architectures, coupled with error-tolerant programming methods, significantly relax the scaling constraints that hardware errors pose for linear photonic circuits.
One method involves configuring a multiport interferometer comprising a mesh of Mach-Zehnder interferometers characterized by a transfer matrix U=VDW and hardware component errors, where W is a first matrix representing a first portion of the mesh of Mach-Zehnder interferometers, V is a second matrix representing a second portion of the mesh of Mach-Zehnder interferometers, and D is a phase screen. This method includes, for each Mach-Zehnder interferometer in the mesh of Mach-Zehnder interferometers, performing Givens rotations that diagonalize a matrix X=V†UW†, iteratively updating the first matrix based on the Givens rotations, and for each column of the first matrix, setting a variable internal phase and a variable external phase of a corresponding Mach-Zehnder interferometer in the mesh of Mach-Zehnder interferometers to direct light to a predetermined port of the corresponding Mach-Zehnder interferometer.
The mesh of Mach-Zehnder interferometers can be a rectangular mesh of Mach-Zehnder interferometers, in which case the method may also include iteratively updating the second matrix based on the Givens rotations and, for each column of the second matrix, setting the variable internal phase and the variable external phase to direct light to a given port of that Mach-Zehnder interferometer. Alternatively, the mesh of Mach-Zehnder interferometers can be a triangular mesh of Mach-Zehnder interferometers and one of the first matrix or the second matrix is equal to the identity matrix I. Each Mach-Zehnder interferometer in the mesh of Mach-Zehnder interferometers can include three beam splitters or two beam splitters and a waveguide crossing.
Performing the Givens rotations can include performing a first Givens rotation that zeros a first element of the matrix X.
Setting the variable internal phase and the variable external phase can involve inputting a signal to the mesh of Mach-Zehnder interferometers representing a corresponding column of one of the first matrix or the second matrix, measuring an output of the mesh of Mach-Zehnder interferometers, and setting the variable internal phase shift and the variable external phase shift based on the output. Inputting the first signal to the mesh of Mach-Zehnder interferometers can comprise directing 100% of the signal coupled to that Mach-Zehnder interferometer out of one port of that Mach-Zehnder interferometer.
Setting the variable internal and external phases may include selecting them such that an inner product of the output and of a corresponding column of a matrix V representing a second portion of the mesh of Mach-Zehnder interferometers equals zero.
Each Mach-Zehnder interferometer in the mesh of Mach-Zehnder interferometers can be set to an approximate cross state before performing the Givens rotations.
The method may also include other steps for each column in the second matrix after setting the variable internal and external phases for each Mach-Zehnder interferometer. These steps include injecting a third signal representing a corresponding column of the second matrix, measuring a third output of the mesh of Mach-Zehnder interferometers, and adjusting the variable internal phase shifts of the Mach-Zehnder interferometers based on the third output.
Another method of configuring a multiport interferometer comprising a mesh of Mach-Zehnder interferometers includes setting an internal phase of a first Mach-Zehnder interferometer in a diagonal of Mach-Zehnder interferometers to produce a desired power ratio at an output of the mesh of Mach-Zehnder interferometers. This method also includes setting an internal phase of at least one second Mach-Zehnder interferometer in the diagonal of Mach-Zehnder interferometers to align phases of partial output vectors at the output of the mesh of Mach-Zehnder interferometers. And it includes setting an internal phase of a last Mach-Zehnder interferometer in the diagonal of Mach-Zehnder interferometers to produce a desired global phase at the output of the mesh of Mach-Zehnder interferometers.
Setting the internal phase of the first Mach-Zehnder interferometer can involve measuring, at the output of the mesh of Mach-Zehnder interferometers, a combination of light that bypasses the first Mach-Zehnder interferometer, light that exits a first output port of the first Mach-Zehnder interferometer, and light that exits a second output port of the first Mach-Zehnder interferometer.
The mesh of Mach-Zehnder interferometers can be a triangular mesh or a rectangular mesh with a diagonal of drop ports. The mesh of Mach-Zehnder interferometers can include N Mach-Zehnder interferometers characterized by an error with a magnitude σ, in which case the method corrects the error with a stability following an asymptotic form proportional to Nσ². The mesh of Mach-Zehnder interferometers can also include N Mach-Zehnder interferometers and be characterized by a transfer matrix U and a normalized matrix error ε=∥ΔU∥rms/√N, in which case the method suppresses the normalized matrix error to ε²/√6.
All combinations of the foregoing concepts and additional concepts discussed in greater detail below (provided such concepts are not mutually inconsistent) are part of the inventive subject matter disclosed herein. In particular, all combinations of claimed subject matter appearing at the end of this disclosure are part of the inventive subject matter disclosed herein. The terminology used herein that also may appear in any disclosure incorporated by reference should be accorded a meaning most consistent with the particular concepts disclosed herein.
The skilled artisan will understand that the drawings primarily are for illustrative purposes and are not intended to limit the scope of the inventive subject matter described herein. The drawings are not necessarily to scale; in some instances, various aspects of the inventive subject matter disclosed herein may be shown exaggerated or enlarged in the drawings to facilitate an understanding of different features. In the drawings, like reference characters generally refer to like features (e.g., functionally similar and/or structurally similar elements).
1.1. Accurate Self-Configuration of Rectangular Multiport Interferometers
Multiport interferometers based on integrated beam splitter meshes are widely used in photonic technologies. While rectangular meshes of integrated beam splitters are favored for their compactness and uniformity, their geometry resists conventional self-configuration approaches, which are useful for programming large meshes in the presence of fabrication error. A configuration process related to the 2×2 block decomposition of a unitary matrix overcomes this limitation. This configuration process is robust to errors, requires no prior knowledge of the process variations, and relies only on external sources and detectors. Self-configuration using this technique reduces the effect of fabrication errors by the same quadratic factor observed in triangular meshes. This relaxes a significant limit to the size of multiport interferometers, removing a major roadblock to the scaling of optical quantum and machine-learning hardware.
Both multiport interferometers are scalable meshes of integrated Mach-Zehnder interferometers (MZIs) 104 (diamonds), each of which has a variable or tunable internal phase shift θ and a variable or tunable external phase shift φ.
A major challenge to scaling MZI meshes like those in
The self-configuration process in
V=(T_{2,1}T_{2,2})(T_{4,1} ⋯ T_{4,4}) ⋯ (T_{N−2,1} ⋯ T_{N−2,N−2})
W=(T_{N−1,N−1} ⋯ T_{N−1,1}) ⋯ (T_{3,3} ⋯ T_{3,1})(T_{1,1}) (1)
where Tmn is the 2×2 block unitary corresponding to the nth crossing (MZI/phase-shift pair) of the mth diagonal. Following the order in
While the process in
The self-configuration process given immediately above can also be described as follows. First (step 1 above), the MZIs in the mesh of MZIs are initialized to approximately the cross state. Next (step 2 above), loop over a sequence of measurement-assisted nulling steps, with one loop per MZI in the mesh of MZIs. In the first pass through this sequence, first (step 2a) find a target rotation T11 that zeroes the upper right element of a target matrix X. Then, on a computer, update the matrix W by right multiplication (step 2b). Finally, program the corresponding MZI through a physical measurement that involves injecting an optical signal encoded with a vector w* (the last column of the complex conjugate matrix W*) into the mesh and adjusting the phases of the corresponding MZI to null a particular signal (the inner product ⟨v⃗i|a⃗out(θ,ϕ)⟩). This is a measurement-assisted form of matrix diagonalization. In effect, this physically zeros an off-diagonal element of the target matrix X in hardware, in a manner that is robust to imperfections in the mesh of MZIs.
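The numerical part of one nulling step (steps 2a and 2b) can be illustrated with a short sketch. It is not the disclosed hardware procedure: the function name `zero_by_right_rotation`, the example matrix, and the particular form chosen for the 2×2 block are assumptions made for illustration. It only shows how a 2×2 unitary applied by right multiplication zeroes one chosen matrix element.

```python
import math

def zero_by_right_rotation(X, row, col):
    """Right-multiply X by a 2x2 unitary acting on columns (col, col + 1)
    so that element (row, col + 1) of the result vanishes.
    Returns the updated matrix and the 2x2 block G (hypothetical helper)."""
    a, b = complex(X[row][col]), complex(X[row][col + 1])
    norm = math.hypot(abs(a), abs(b))
    if norm == 0.0:
        G = [[1 + 0j, 0j], [0j, 1 + 0j]]  # nothing to rotate
    else:
        # Columns of G are orthonormal, so G is unitary.
        G = [[a.conjugate() / norm, -b / norm],
             [b.conjugate() / norm, a / norm]]
    Y = [list(r) for r in X]
    for r in Y:
        xa, xb = r[col], r[col + 1]
        r[col] = xa * G[0][0] + xb * G[1][0]
        r[col + 1] = xa * G[0][1] + xb * G[1][1]
    return Y, G

# Hypothetical 3x3 example matrix (any complex matrix works for this step).
X = [[0.5 + 0.1j, 0.3 - 0.2j, 0.7 + 0.0j],
     [0.2 + 0.0j, 0.9 + 0.1j, -0.1 + 0.3j],
     [0.6 - 0.4j, 0.0 + 0.2j, 0.4 + 0.4j]]
Y, G = zero_by_right_rotation(X, row=0, col=1)
print(abs(Y[0][2]))  # magnitude of the upper right element after the rotation
```

In the process described above, the same rotation is then realized physically by nulling a measured inner product rather than by trusting the computed phases, which is what makes the procedure tolerant of component errors.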
In other words, in step 2, loop through the MZIs (crossings in the mesh) along the diagonals in the order shown in
Steps 2a and 2b of the process above are a restatement of the Clements factorization given in
To test this self-configuration process, we performed numerical experiments on Clements meshes of size up to 128×128. The tests were implemented in PYTHON and are available as part of the M
In the uncorrected case, each MZI introduces a mean error ∥ΔU∥rms=√2σ. These errors add in quadrature, leading to an overall normalized error ε0=√(2N)σ, which grows with mesh size. This is understandable given that the circuit depth grows as O(N), with each layer contributing O(σ) error and the layers adding in quadrature.
This form can be derived rigorously from the distribution of MZI splitting angles over the Haar measure, where errors arise solely from MZIs whose target splitting ratios cannot be realized in the imperfect hardware. Since εc∝ε0², self-calibration leads to a quadratic suppression of errors: the smaller the initial error, the greater the benefit of error correction.
To illustrate the benefits of this error reduction, consider as an example deep neural network (DNN) inference on optical hardware. DNNs process data in a sequence of layers, each of which includes (linear) synaptic connections (diagonal lines) and (nonlinear) neuron activations (circles) as shown in
The self-configuration process for rectangular MZI meshes given above uses external sources and (coherent) detectors and does not rely on an accurate characterization of device errors. This method is based on the diagonalization of a unitary matrix by Givens rotations, with a specific set of measurements performed to ensure that the Givens rotations are properly implemented in the hardware. For sufficiently small hardware errors, this self-configuration process leads to perfect realization of the target matrix. For large errors, it achieves the same quadratic reduction ε→ε²/√6 observed for local correction processes and self-configuration on triangular meshes. The self-configuration technique increases the robustness of optically accelerated DNNs to hardware error, particularly in the region around a σ≈2% characteristic of directional couplers in silicon.
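The scaling relations quoted in this section, ε0=√(2N)σ and ε→ε²/√6, can be evaluated numerically to get a feel for the size of the improvement. The sketch below is only an arithmetic illustration; the function names are hypothetical, and the corrected value applies to the large-error regime in which the quadratic reduction holds, not to the small-error regime in which the target matrix is realized perfectly.

```python
import math

def uncorrected_error(n_ports, sigma):
    """Normalized error without correction: eps0 = sqrt(2 N) * sigma."""
    return math.sqrt(2 * n_ports) * sigma

def corrected_error(n_ports, sigma):
    """Normalized error after the quadratic reduction eps0 -> eps0**2 / sqrt(6)."""
    eps0 = uncorrected_error(n_ports, sigma)
    return eps0 ** 2 / math.sqrt(6)

# Example coupler error sigma = 0.02 (the ~2% figure mentioned in the text).
for n in (16, 64, 128):
    print(n, uncorrected_error(n, 0.02), corrected_error(n, 0.02))
```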
1.2. Accurate Self-Configuration for Triangular (Reck) Meshes
One aspect of self-configuring a Clements mesh is realizing that it can be divided into two triangles with a phase screen in the center: U=VDW. By configuring one diagonal at a time, alternating between V and W, it is possible to zero all the elements of the target matrix X=V†UW†, thus realizing the desired unitary. This procedure is simplified when configuring a Reck mesh, which can be expressed in terms of a single triangle:
U=D(ΠmnTmn) for W, or U=(ΠmnTmn)D for V.
The first case U=DW corresponds to a mesh with an output phase screen, as shown in
The MZI order in
Like the self-configuration of the Clements mesh, the self-configuration process for the Reck mesh above is designed, at each time step, to properly set the configured MZIs in the shaded rectangular [resp. triangular] block to realize W [resp. V]. The process in
The self-configuration processes in
The self-configuration processes for Reck and Clements can be mapped to a general-purpose subroutine that self-configures any MZI mesh of the form U=VDW, provided that the geometry admits a matrix diagonalization by way of Givens rotations.
1.3. Correlated Errors
In realistic MZI meshes, the beam splitter errors αn, βn tend to be strongly correlated, since the process variations that lead to errors (waveguide thickness and spacing, partial etch depth, slab height) usually have correlation lengths much longer than the size of an MZI. In general, the matrix error ∥ΔU∥² depends on both the error amplitudes (αn², βn²) and their correlations (αmαn, etc.). For an individual matrix U, the dependence on correlations can be significant. However, in an ensemble average over the Haar measure, this dependence becomes very small for most inter-MZI correlations because of the random phase shifts between pairs of MZIs. Only intra-MZI correlations αn/βn have a significant effect on the ensemble-averaged matrix error.
Consider the extreme case of full correlation αn=βn=μ. This case is realized, for instance, when the dominant error source arises from operating the mesh away from the coupler design wavelength. The coverage, uncorrected error, and corrected error are calculated to be:
cov(N)=e^{−(2/3)N³μ²}
ε0=2√N μ
εc=√(8/9) Nμ²
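The coverage and corrected error for the fully correlated case can be cross-checked against the relations stated in the error-correction discussion of this disclosure, namely the probability p_unsat(k)≈k(α+β)² that an MZI of rank k is unrealizable, the count of (N−k) MZIs of rank k, and the matrix error ∥ΔU∥²≈(1/18)N³(α+β)⁴. The following is a sketch of that substitution with α=β=μ, keeping only leading order in N; it is a consistency check rather than a rigorous derivation.

```latex
\begin{align}
p_{\mathrm{unsat}}(k) &\approx k(\alpha+\beta)^2 = 4k\mu^2, \\
\mathrm{cov}(N) &= \prod_{k=1}^{N-1}\bigl[1-p_{\mathrm{unsat}}(k)\bigr]^{N-k}
  \approx \exp\Bigl(-4\mu^2\sum_{k=1}^{N-1}k(N-k)\Bigr)
  \approx e^{-(2/3)N^3\mu^2}, \\
\|\Delta U\|^2 &\approx \tfrac{1}{18}N^3(\alpha+\beta)^4 = \tfrac{8}{9}N^3\mu^4
  \;\Rightarrow\;
  \varepsilon_c = \frac{\|\Delta U\|}{\sqrt{N}} = \sqrt{8/9}\,N\mu^2 .
\end{align}
```

The second line uses \(\sum_{k=1}^{N-1}k(N-k)=N(N^2-1)/6\approx N^3/6\).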
1.4. Neural Network Model
In the optical neural network model described above with respect to
The activation function at the inner layer is realized with an electro-optic nonlinearity: a fraction of each output field is fed into a detector that drives a Mach-Zehnder modulator, while the remaining light passes through the modulator. This implements the activation function:
f(E)=√(1−α)e−i(g|E|
where α is the power tap fraction, g is the modulator phase induced per unit optical power, and ϕ is the phase in the absence of power. Here, we choose α=0.1, g=π/20, and ϕ=π, which causes f(E) to approximate the form of a leaky ReLU in the right power regime.
As explained above, realistic multiport interferometers (beam splitter meshes) are sensitive to component imperfections, and this sensitivity increases with mesh size (number of ports). Self-configuration techniques like those disclosed above can be employed to correct these imperfections, but not all self-configuration techniques are equal. This section highlights the benefits of stability in self-configuration. Naïve approaches based on sequentially setting matrix elements tend to be unstable and to perform poorly for large meshes, while techniques based on power ratios perform well in all cases, even in the presence of large errors. A self-configuration process for triangular meshes based on this insight uses only external detectors and works without prior knowledge of the component imperfections. This self-configuration process extends to the rectangular mesh by adding a single array of detectors along the diagonal.
Photonic technologies increasingly rely on programmable and reconfigurable circuits. One component in such circuits is the universal multiport interferometer, which, as described above, is an optical device with N>2 inputs and outputs, whose linear input-output relation (transfer matrix) is set by the user. Multiport interferometers are indispensable in applications ranging from linear optical quantum computing and radio-frequency (RF) photonics to signal processing and machine learning acceleration. They also play a useful role in proposed photonic field-programmable gate arrays. Interferometer size (i.e., number of ports) is one figure of merit for these applications. Scaling up multiport interferometers is an active field of research. Recent advances in silicon photonics are promising, allowing the scale-up from small proof-of-concept designs to large (and therefore technologically useful) systems.
Again, component imperfections are a major challenge to scaling the size of multiport interferometers. This is because large multiport interferometers are usually based on dense meshes of tunable beam splitters, whose circuit depth grows with size. Most non-recirculating designs are variants of the triangular Reck or rectangular Clements beam splitter mesh, both of which encode an N×N unitary transfer matrix into a compact mesh of programmable MZIs. These circuits have O(N) depth, meaning that component errors cascade as light propagates down the mesh. The upshot is that scaling in size should be accompanied by scaling in precision to preserve the accuracy of the input-output map. This challenge tends to be most acute for optical machine learning applications, which rely on very large mesh sizes for performance, where fabrication errors from even state-of-the-art technology are predicted to significantly degrade optical neural network (ONN) accuracy in hardware.
Several self-configuration techniques can suppress the effect of component imprecisions. For machine learning applications, the MZI phase shifts can be learned by in situ training, but this involves extra hardware (inline power detectors) and the learned weights are specific to the given device. Alternatively, if the chip has been pre-calibrated so the imperfections are known, global optimization can be used to find the phase shifts offline; however, this approach is time-consuming and typically requires that the hardware imperfections be known to high accuracy. MZI errors can also be eliminated by pairing MZIs, though this doubles the loss and chip area. Finally, for triangular meshes, the MZIs of each diagonal can be configured sequentially. This approach, however, also uses O(N²) inline power detectors (or pre-calibrated MZIs that can be configured to a perfect “bar” or “cross” state). In short, these other configuration schemes rely on either (i) additional hardware complexity, such as inline detectors or MZI pairing, or (ii) accurate pre-calibration of the mesh's component errors.
The self-configuration processes disclosed here can use only external detectors and do not rely on prior calibration of the MZI mesh. Not all self-configuration processes are created equal, and stability distinguishes good self-configuration processes from bad ones: for example, a straightforward approach based on sequentially matching matrix elements works in principle but performs poorly in the presence of large errors. Based on this insight, we propose a self-configuration process based on orthogonality and power ratios that performs well in all cases, even in the presence of large errors. This scheme is directly applicable to triangular meshes but can also be extended to a rectangular mesh with the addition of a single array of inline power detectors along the diagonal.
2.5. Statistics of Imperfect Meshes
Two common multiport interferometer designs are the Reck triangle, shown in
U=(ΠnTn)D,
where D is a phase mask and the Tn are 2×2 block matrices representing a phase shifter cascaded into an MZI crossing as shown in
Here (θn,ϕn) are the phases programmed by the user, e.g., through thermo-optic or MEMS phase shifters, while (π/4+αn,π/4+βn) are the coupler angles, a property of the circuit and its imperfections (αn,βn). These angles are π/4 in an ideal MZI, which enables perfect contrast on each MZI output. In such a device, the phase shifts can be found by a procedure that diagonalizes U with a sequence of 2×2 rotations. (Another convention is to place the phase mask at the end, U=DΠnTn, and put the phase shifter ϕn at the beginning of the unit cell: Tn=S(π/4+βn)P1(θn)S(π/4+αn)P1(ϕn)).
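To make the unit cell concrete, the following sketch builds the 2×2 block Tn=S(π/4+βn)P1(θn)S(π/4+αn)P1(ϕn) from the alternative convention quoted above. The explicit matrix forms are assumptions, since they are not reproduced in this text: S(η) is taken to be a symmetric coupler [[cos η, i sin η],[i sin η, cos η]] and P1 a phase shift on the top waveguide. The helper names are hypothetical.

```python
import cmath
import math

def splitter(eta):
    """Assumed symmetric coupler matrix S(eta); eta = pi/4 is a 50:50 splitter."""
    c, s = math.cos(eta), math.sin(eta)
    return [[c, 1j * s], [1j * s, c]]

def phase_top(phase):
    """Assumed P1(phase): phase shift applied to the top waveguide only."""
    return [[cmath.exp(1j * phase), 0j], [0j, 1 + 0j]]

def matmul2(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def mzi(theta, phi, alpha=0.0, beta=0.0):
    """T = S(pi/4 + beta) P1(theta) S(pi/4 + alpha) P1(phi)."""
    T = phase_top(phi)
    for M in (splitter(math.pi / 4 + alpha), phase_top(theta),
              splitter(math.pi / 4 + beta)):
        T = matmul2(M, T)  # each later element acts on the left
    return T

ideal = mzi(0.0, 0.0)                           # perfect couplers
flawed = mzi(0.0, 0.0, alpha=0.02, beta=0.03)   # example coupler errors
print(abs(ideal[0][0]) ** 2, abs(flawed[0][0]) ** 2)
```

Under these assumed conventions, θ=0 corresponds to the cross state, and the imperfect device leaks a fraction sin²(α+β) of the power into the bar port, which illustrates why imperfect couplers cannot reach perfect contrast.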
2.6. Component Errors
Component errors (deviations from design) may perturb the input-output matrix. The magnitude of this perturbation ΔU can be quantified by the Frobenius norm ∥ΔU∥²=Σij|ΔUij|² and normalized to define an error measure:
This metric can be interpreted as an average relative error per entry in the matrix U; for small ε, the quantity (1−ε) plays a role similar to the fidelity of a quantum operation, an analogy that can be made rigorous in the case of linear optical quantum computing. To first order, the effect of component errors is linear:
Applying the equations above yields ∂U/∂αn=UpreS′(π/4)Upost (and likewise for ∂U/∂βn). Since we are interested here in the magnitude ∥ΔU∥ and matrices (Upre, Upost) are unitary, it follows that:
Thus, the mesh is equally sensitive to all beam splitters, irrespective of geometry.
At this point, it becomes helpful to introduce an error model, since perturbations from nearby crossings may lead to correlated errors in U. While real imperfections are correlated, this adds significant complexity to the math that obfuscates the insights. Moreover, while correlations may affect the error measure of a particular matrix, when considering ensembles of matrices, they are expected to average to zero. Therefore, we assume an uncorrelated error model where (αn)rms=(βn)rms=σ for a given error amplitude σ.
Under the uncorrelated model, the error terms in the equation above for ΔU add in quadrature over the N(N−1) couplers to give:
ε=√(2(N−1))σ∼(2N)^{1/2}σ
Since the depth of the circuit is O(N), and independent errors in each layer add in quadrature, it is not surprising that ε∝N^{1/2}. Component precision should therefore increase as meshes are scaled up in order to maintain a desired matrix accuracy.
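The quadrature sum behind this scaling can be written out explicitly. The sketch below assumes that the coupler matrix S is a 2×2 rotation-like matrix whose derivative has squared Frobenius norm ∥S′(π/4)∥²=2 (an assumption about the explicit form of S, which is not reproduced here), and that each of the N(N−1)/2 MZIs contributes two independent coupler errors of rms amplitude σ.

```latex
\begin{align}
\langle\|\Delta U\|^2\rangle
  &= \sum_{n=1}^{N(N-1)/2}\bigl\|S'(\pi/4)\bigr\|^2
     \bigl(\langle\alpha_n^2\rangle+\langle\beta_n^2\rangle\bigr)
   = 2N(N-1)\,\sigma^2, \\
\varepsilon
  &= \sqrt{\langle\|\Delta U\|^2\rangle / N}
   = \sqrt{2(N-1)}\,\sigma \sim (2N)^{1/2}\sigma .
\end{align}
```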
2.7. Error Correction
As mentioned above, if the beam splitter imperfections (αn,βn) are known, correction schemes can be used to program a unitary to relatively good accuracy. 2×2 unitaries with the same power splitting ratio are equivalent up to external phase shifts. In the presence of imperfections, the MZI does not achieve perfect contrast in both output ports. The range of splitting angles is truncated to:
2|α+β|≤θ≤π−2|α−β|
If this inequality is satisfied, a perfect MZI T(θn,ϕn) can be replaced by an imperfect MZI with external phase shifts:
(The extra phase shifts can be absorbed into the neighboring MZIs so that the number of physical phase shifters on the mesh does not increase.) Provided that the inequality is satisfied for all MZIs in the ideal Reck/Clements decomposition, this procedure can lead to a perfect representation of the matrix. The fraction of unitary matrices realizable by an imperfect mesh is called the coverage, cov(N, σ). If some MZIs do not satisfy the inequality above, we pick the closest possible internal phase setting for the MZI, θn∈{θmin, θmax}, which leads to an error in the matrix:
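The clipping rule described in this paragraph can be sketched in a few lines. The sketch takes the realizable range 2|α+β|≤θ≤π−2|α−β| stated above and returns the closest realizable internal phase; the function names are hypothetical, and the coupler errors are assumed to be known, as in the local correction scheme discussed here.

```python
import math

def realizable_range(alpha, beta):
    """Range of splitting angles an imperfect MZI can reach:
    2|alpha + beta| <= theta <= pi - 2|alpha - beta|."""
    return 2 * abs(alpha + beta), math.pi - 2 * abs(alpha - beta)

def clip_theta(theta_target, alpha, beta):
    """Return (theta, realizable): the closest reachable splitting angle and a
    flag telling whether the target itself was inside the reachable range."""
    theta_min, theta_max = realizable_range(alpha, beta)
    realizable = theta_min <= theta_target <= theta_max
    theta = min(max(theta_target, theta_min), theta_max)
    return theta, realizable

# Example with coupler errors alpha = 0.02, beta = 0.03 (illustrative values).
print(clip_theta(0.01, 0.02, 0.03))  # target below the range: clipped to theta_min
print(clip_theta(1.00, 0.02, 0.03))  # target inside the range: unchanged
```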
Not all unitaries are equally easy to express on an MZI mesh. For this reason, when analyzing the efficiency of an error correction scheme, it helps to specify the probability distribution of U. Here, we consider random unitaries under the Haar measure, a distribution that samples uniformly from the space of unitary matrices. Under this measure, the phase shifts θn∈[0,π) and ϕn∈[0,2π) are uncorrelated and distributed according to:
where k is the row index of the Reck mesh, starting from k=1 at the bottom as labeled in
p_unsat(k)=∫_0^{2|α+β|} P(θ|k) dθ ≈ k(α+β)²
For an N×N unitary, there are (N−k) MZIs of rank k. The coverage is equal to the probability, under the Haar measure, that all θn are realizable. Since the θn are uncorrelated, this is a product of the probabilities for each MZI:
This is vanishingly small for large MZI meshes: for example, taking a reasonable value of σ=0.02, even a 32×32 mesh has a coverage around 1%.
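The quoted coverage figure can be reproduced approximately from the two ingredients given above: the probability p_unsat(k)≈k(α+β)² that an MZI of rank k is unrealizable, and the count of (N−k) MZIs of rank k. The sketch below makes one extra simplifying assumption that is not stated in the text: (α+β)² is replaced by its mean value 2σ² for independent errors of rms amplitude σ. The function name is hypothetical.

```python
import math

def coverage(n_ports, sigma):
    """Approximate coverage: product over ranks k of (1 - p_unsat(k))**(N - k),
    with p_unsat(k) ~ k * <(alpha + beta)^2> = 2 * k * sigma**2 (assumption)."""
    log_cov = 0.0
    for k in range(1, n_ports):
        p_unsat = 2 * k * sigma ** 2
        if p_unsat >= 1.0:
            return 0.0  # small-error approximation no longer meaningful
        log_cov += (n_ports - k) * math.log1p(-p_unsat)
    return math.exp(log_cov)

for n in (4, 8, 16, 32):
    print(n, coverage(n, 0.02))
```

With σ=0.02 this estimate gives a coverage on the order of 1% for a 32×32 mesh, in line with the figure quoted above.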
Even if most unitaries cannot be realized exactly, they can be approximated to much better accuracy than the uncorrected result. Each MZI with an unrealizable θn will lead to a matrix error. Over the Haar measure, the average error induced by a particular MZI is thus:
Assuming the errors are uncorrelated (see below for the correlated case), they add in quadrature: ∥ΔU∥²=Σk(N−k)∥ΔU∥²_k≈(1/18)N³(α+β)⁴=(2/3)N³σ⁴. The corrected normalized error is therefore:
εcorr=√(2/3) Nσ²
This is plotted in
2.8. Direct Self-Configuration Process
In many circumstances, the error correction procedure in the section immediately above cannot be applied because the errors in an MZI mesh are not known to sufficient accuracy. Nevertheless, for triangular meshes, “progressive” self-configuration strategies can still be used. As noted above, many other self-configuration processes rely on inline photodetectors or pre-calibration, which limits their usefulness in many systems. Here, we disclose a simple “direct” self-configuration method, also called the direct method or process, that is based on sequentially setting matrix elements, and a “ratio” method based on setting power ratios. Both methods can be run on uncalibrated hardware with only external detectors.
One way to program a triangular mesh is to set the MZI phases one at a time to match the target matrix elements Ûij. This can be understood by considering first the case of a tunable 1:N splitter and later generalizing to an N×N unitary.
This direct self-configuration process works for triangular matrices. Triangularity guarantees that tuning steps do not disturb the matrix elements that have already been set, provided that the order above is followed. To apply the direct self-configuration to a Clements matrix, the Clements matrix can be divided into two triangles, which can be separately configured as explained below.
2.9. Ratio Self-Configuration Process
The ratio self-configuration process, also called the ratio method or process, is based on setting power ratios and can be applied to Reck and Clements meshes. Like the direct self-configuration process, it can be run on uncalibrated hardware with only external detectors. The ratio method is robust in the presence of large errors, which makes it especially useful for configuring large multiport interferometers.
Step N: Set the final phase shift to align the phase of u⃗ with û.
The Reck triangle is configured one diagonal at a time in the order shown in
The goal of the ratio self-configuration method is to configure the MZI so that this output best approximates ûm, the mth column of target matrix Û. This is done by minimizing the L2 norm ∥u⃗m−ûm∥. Since T and Upost are unitary, a⃗⊥b⃗⊥c⃗ and |T11|²+|T21|²=1. Applying these relations yields:
The first two terms drop out as constants since they do not depend on the optimization variables θmn,ϕmn (which determine (T11, T21)). Since each MZI is preceded by a phase shifter, the phase of ψ is also freely tunable; it can be helpful to perform the following maximization:
max_{θ,ϕ} max_ψ Re[s e^{iψ}(T11⟨ûm|b⃗⟩+T21⟨ûm|c⃗⟩)]
∝ max_{θ,ϕ} |T11⟨ûm|b⃗⟩+T21⟨ûm|c⃗⟩|
subject to the constraint |T11|2+|T21|2=1. This is just optimizing a dot product, which amounts to setting the amplitude ratio:
We may not be able to measure T11, T21, b⃗, or c⃗ directly in an experiment. Instead, we proceed as follows: first sweep the value of ψ to obtain a⃗, which is the value of ui averaged over opposite phases ψ:
Next, once a⃗ is found, set (θ,ϕ) to maximize the quantity:
max_{θ,ϕ} |⟨ûm|u⃗m(θ,ϕ)−a⃗⟩|
which is independent of ψ and relies only on external output measurements. This can be done because we are measuring the output u⃗m(θ,ϕ) and have knowledge of a⃗ and the target vector ûm, so the maximization can be implemented in a control loop that varies (θ,ϕ) until the quantity is maximized.
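The two measurement steps just described (averaging over opposite phases ψ to extract the bypass term, then maximizing the overlap with the target column) can be illustrated with a toy simulation. Everything specific in the sketch is an assumption made for illustration: the output is modeled as u = a + s e^{iψ}(T11 b + T21 c), consistent with the maximization above; `measure_output` stands in for a hardware measurement; the parametrization of (T11, T21), the three-port vectors, and the coarse grid search are all hypothetical choices, not the disclosed control loop.

```python
import cmath
import math

def inner(x, y):
    """Inner product <x|y>, conjugating the first argument."""
    return sum(xi.conjugate() * yi for xi, yi in zip(x, y))

# Hypothetical 3-port example: a, b, c are mutually orthogonal partial outputs.
A = [0.5 + 0j, 0j, 0j]
B = [0j, 1 + 0j, 0j]
C = [0j, 0j, 1 + 0j]
S = 0.8  # amplitude of the light that reaches the MZI being configured

def mzi_column(theta, phi):
    """Assumed parametrization of (T11, T21) with |T11|^2 + |T21|^2 = 1."""
    return cmath.exp(1j * phi) * math.sin(theta / 2), math.cos(theta / 2) + 0j

def measure_output(theta, phi, psi):
    """Stand-in for an external coherent measurement of the mesh output."""
    t11, t21 = mzi_column(theta, phi)
    amp = S * cmath.exp(1j * psi)
    return [a + amp * (t11 * b + t21 * c) for a, b, c in zip(A, B, C)]

def extract_bypass(theta, phi, psi=0.0):
    """Average the outputs at opposite phases psi and psi + pi to isolate a."""
    u0 = measure_output(theta, phi, psi)
    u1 = measure_output(theta, phi, psi + math.pi)
    return [(x + y) / 2 for x, y in zip(u0, u1)]

def configure(u_target, steps=200):
    """Coarse grid search maximizing |<u_target | u(theta, phi) - a>|."""
    a_est = extract_bypass(0.3, 0.0)  # any starting MZI setting works here
    best = (-1.0, 0.0, 0.0)
    for i in range(steps + 1):
        theta = math.pi * i / steps
        for j in range(steps):
            phi = 2 * math.pi * j / steps
            u = measure_output(theta, phi, 0.0)
            diff = [x - y for x, y in zip(u, a_est)]
            score = abs(inner(u_target, diff))
            if score > best[0]:
                best = (score, theta, phi)
    return best

# Target column whose ideal setting is sin(theta/2) = 0.6, phi = 0.7.
u_hat = [0.5 + 0j, 0.8 * 0.6 * cmath.exp(0.7j), 0.8 * 0.8 + 0j]
score, theta_opt, phi_opt = configure(u_hat)
print(score, theta_opt, phi_opt)
```

A practical control loop would likely replace the grid search with a gradient-free or gradient-based optimizer, but the objective, which uses only external output measurements, would be the same.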
This method of self-configuring, self-calibrating, and/or error-correcting multiport interferometers based on power ratios works without any internal detectors or knowledge of the component imperfections. This ratio self-configuration method, or ratio method, is applicable to any triangular mesh, but can be extended to rectangular meshes by adding a single diagonal of drop ports, a small amount of additional complexity as the mesh size grows large. The accuracy of this method is guaranteed by the algorithmic stability of unitary matrix diagonalization and follows the asymptotic form εcorr∝Nσ² over the Haar measure with independent Gaussian component errors as explained below. Employing this ratio self-configuration method suppresses matrix errors by a quadratic factor: εcorr=ε²/√6, allowing MZI meshes to scale to large sizes (e.g., N>64) without unreasonable demands on fabrication tolerance.
2.10. Direct Versus Ratio Self-Configuration Performance Comparison
To compare the direct and ratio self-configuration processes, we simulate the self-calibration of Reck meshes in the presence of component errors. Here, the target unitaries Û are sampled from the Haar measure, with random, Gaussian-distributed errors in the beam splitter angles. We consider mesh sizes in the range 8≤N≤64 to analyze the scaling of the algorithms with mesh size. Error correction should allow perfect configuration when errors are low enough (coverage is order unity), and an error reduction of εcorr∝ε² in the uncorrectable case.
Combining the two fits in
εcorr=ε²/√6
This relation is independent of N, and can be used to test the scaling of the algorithms as the matrix dimension increases.
When following the ratio method, errors do not build up. To understand the stability of the ratio method, it is helpful to relate the method to the 2×2 block decomposition of a unitary matrix as shown in
Because of MZI errors, not all off-diagonal terms can be zeroed. If a term cannot be zeroed, it leaves a residual term (|αmn−βmn|−θmn) below the diagonal, where θmn is the target splitting ratio for MZI (m, n), which is unrealizable since |αmn−βmn|>θmn. Let V(mn)=Umn† ⋯ U12†U11†Û be the matrix after configuring Umn and define
which is the sum of squares of all elements Vij(mn) in the zero region below the diagonal (lighter shading in
which is the same result as for the local scheme disclosed in Saumil Bandyopadhyay, Ryan Hamerly, and Dirk Englund, “Hardware error correction for programmable photonics,” Optica 8, 1247-1255 (2021).
2.11. Rectangular Mesh
Compared to the Reck triangle, the rectangular Clements mesh has advantages of increased compactness, reduced circuit depth, and relative insensitivity to fixed component losses. However, it cannot be written as a cascade of diagonals, so the direct and ratio self-configuration techniques cannot be applied to it directly. A simple modification to a Clements mesh 1600 (placing a diagonal of tunable drop ports and detectors 1602) suffices to make the mesh programmable as shown in
First, following the Clements representation of an ideal MZI mesh, the target matrix is decomposed into two components Û=Û2Û1, with Û1 for the left triangle and Û2 for the right triangle. Next, the diagonal ports are set to the “cross” state to collect all of the light along the diagonal. This allows the left triangle to be programmed to Û1 as shown in
In the reciprocal form of the ratio method, instead of sending light into a single port and matching the output vector to a column of Û, we send in a column um* of Û† as input and try to direct all the power to a single output. This is analogous to the Reverse Local Light Interference Method (RELLIM) but does not require internal detectors. The MZIs are programmed along falling diagonals, but the order is reversed (bottom to top, down each diagonal).
When configuring MZI (m, n), the upstream components have been configured, while downstream components have not. The input-output relation is the product v=UpostTUpreûm, where Upre has been configured but Upost has not.
$v_m = a + s\,e^{i\psi}(T_{21}\beta + T_{22}\gamma)$
The MZI should be configured to direct its output power to the bottom port (T21/T22=(β/γ)*). In RELLIM, this is accomplished with the use of internal detectors. However, external detectors can also be used, even if the downstream MZIs have not been calibrated to a “cross” state. As above, first the phase is swept and the value of a is extracted from the average:
Next, the phases of the MZI are set to maximize:
$\max_{\theta,\phi} |v_m - a|$
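The two steps above (sweep the phase to isolate the background a, then maximize |vm−a|) can be illustrated numerically. The following is a minimal sketch under stated assumptions, not the disclosed hardware procedure: the values of a, s, β, and γ are hypothetical, the bottom row of the MZI is parametrized as (T21, T22) = (e^{iϕ} sin(θ/2), cos(θ/2)), the function `output_amplitude` stands in for a measurement, and the routine is assumed to have access to the complex amplitude vm. All function names are illustrative.

```python
import cmath
import math

def bottom_row(theta, phi):
    """Assumed parametrization of the MZI's bottom row (T21, T22)."""
    return cmath.exp(1j * phi) * math.sin(theta / 2), math.cos(theta / 2)

def output_amplitude(theta, phi, psi, a, s, beta, gamma):
    """Stand-in for a measurement: v_m = a + s e^{i psi} (T21 beta + T22 gamma)."""
    t21, t22 = bottom_row(theta, phi)
    return a + s * cmath.exp(1j * psi) * (t21 * beta + t22 * gamma)

def estimate_background(theta, phi, a, s, beta, gamma, steps=64):
    """Average v_m over one full sweep of psi; the swept term averages out."""
    total = 0j
    for k in range(steps):
        psi = 2 * math.pi * k / steps
        total += output_amplitude(theta, phi, psi, a, s, beta, gamma)
    return total / steps

def configure_mzi(a, s, beta, gamma, grid=181):
    """Grid-search (theta, phi) to maximize |v_m - a_est| at fixed psi = 0."""
    a_est = estimate_background(0.3, 0.1, a, s, beta, gamma)
    best = (-1.0, 0.0, 0.0)
    for i in range(grid):
        theta = math.pi * i / (grid - 1)
        for j in range(grid):
            phi = -math.pi + 2 * math.pi * j / (grid - 1)
            v = output_amplitude(theta, phi, 0.0, a, s, beta, gamma)
            best = max(best, (abs(v - a_est), theta, phi))
    return a_est, best

# Hypothetical values chosen only for illustration.
a, s = 0.2 - 0.1j, 0.7
beta, gamma = 0.5 + 0.2j, 0.3 - 0.4j
a_est, (peak, theta_opt, phi_opt) = configure_mzi(a, s, beta, gamma)
t21, t22 = bottom_row(theta_opt, phi_opt)
print("background error:", abs(a_est - a))
print("T21/T22 found:", t21 / t22, "target (beta/gamma)*:", (beta / gamma).conjugate())
```

In this sketch the maximizer lands, to within the grid resolution, on the condition T21/T22=(β/γ)* stated above, and the peak value approaches s·sqrt(|β|²+|γ|²). A coarse grid search is used only for transparency; any derivative-free optimizer could take its place.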
As with the Reck mesh self-configuration scheme, the error resistance of this configuration method can be seen by visualizing how each configuration step zeroes out entries in the target matrix. Each matrix has ≈¾ $N^2$ free entries and ≈¼ $N^2$ zeroes below the diagonal. Each mesh is self-configured to eliminate the remaining nonzero elements in the lower triangle, which takes ≈¼ $N^2$ steps, half as many as for the Reck triangle.
2.12. Correlated Errors
Above, we assumed an uncorrelated error model, where αn and βn are independent random variables sampled from a zero-mean Gaussian with standard deviation σ. In practice, the fabrication imperfections that lead to splitter errors have a long correlation length, so errors tend to be strongly correlated. In this section, we show that: (1) averaged over the Haar measure, inter-MZI correlations cancel out, so only correlations within each MZI (i.e., between αn and βn) affect the result; and (2) for most unitaries, the effect of the symmetric error αn+βn is dominant.
The first-order error metric given above can be expanded to second order, accounting for all correlations:
with the matrix inner product $\langle V|W\rangle = \mathrm{tr}(V^\dagger W)$.
where S is the symmetric splitter matrix:
and σx=[[0,1], [1,0]] is the Pauli matrix.
The terms P(p) and Q (q) above correspond to paths of light that pass through at most one splitter. These are mutually orthogonal and do not contribute to the correlation in the second-order error metric. Error correlations arise only from the first term, corresponding to paths of light that pass through both splitters p and q. The resulting inner product is independent of Upre and Upost and takes the form (at p=q=0):
In general, this quantity is nonzero. However, the phases (ϕ,ϕ′) are uniformly distributed over [0,2π) for Haar-random unitaries. Therefore, in the ensemble average over the Haar measure, the phase-dependent terms in the inner product cancel out. This means that each path from splitter p to q, which has a separate phase dependence, can be considered separately in the inner product. Consider the path from output i∈{1,2} to input j∈{1,2} (i=j=1 shown in
This is very small for the majority of MZIs, where the splitter angles (θm, θn) cluster tightly around zero. Moreover, while it is always possible to find a matrix decomposition with only positive θ, one can also sample from the Haar measure employing both positive and negative θ with equal probability; in this case, under the ensemble average, the inter-MZI correlations are zero.
On the other hand, correlations within an MZI, i.e., between αn and βn, matter. The matrix error of a single MZI is:
For the majority of MZIs, θn≈0 and the symmetric error dominates ∥ΔU∥2. The normalized error for the whole mesh is approximately:
$\varepsilon \approx \sqrt{N}\,(\alpha+\beta)_{rms}$
which in the case of uncorrelated (α,β) reduces to the form derived above: $\varepsilon = \sqrt{2N}\,\sigma$.
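The reduction to √(2N)σ follows because, for independent zero-mean Gaussian α and β with standard deviation σ, the symmetric error α+β has an rms value of √2·σ. A quick Monte Carlo check, offered as a sketch (the sample count, seed, and function names are arbitrary choices, not part of the disclosure):

```python
import math
import random

def symmetric_rms(sigma, samples=200_000, seed=0):
    """Monte Carlo estimate of (alpha + beta)_rms for independent Gaussian errors."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(samples):
        alpha = rng.gauss(0.0, sigma)
        beta = rng.gauss(0.0, sigma)
        acc += (alpha + beta) ** 2
    return math.sqrt(acc / samples)

def mesh_error_estimate(n_modes, sigma):
    """epsilon ~ sqrt(N) * (alpha + beta)_rms, per the approximation in the text."""
    return math.sqrt(n_modes) * symmetric_rms(sigma)

sigma = 0.01
ratio = symmetric_rms(sigma) / (math.sqrt(2) * sigma)
print("(alpha+beta)_rms / (sqrt(2) sigma) =", ratio)
```

The printed ratio should be close to one, up to sampling noise.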
For completeness, we now consider the case of fully correlated splitter errors, i.e., αn=βn=μ, where μ is a constant. Systematic effects such as imperfect coupler design or fabrication errors with long correlation length can lead to this situation, as can operating the mesh away from the coupler design wavelength. The uncorrected error amplitude is:
$\varepsilon = 2\sqrt{N}\,\mu$
The coverage of unitary space is:
cov(N)=e−(2/3)N
In the presence of error correction, the matrix error becomes:
Two modified mesh architectures can address the component errors that limit the scaling of multiport interferometers based on MZI meshes: (1) a three-splitter MZI for generic errors, and (2) a broadband MZI+Crossing design for correlated errors. Because these designs allow for perfect realization of the cross state, the matrix fidelity is rendered independent of mesh size, allowing scaling to arbitrarily large meshes. These architectures support progressive self-configuration, are more compact than other MZI-doubling schemes, and do not require additional phase shifters. This eliminates a major obstacle to the development of very-large-scale linear photonic circuits.
3.1. Alternative Mesh Architectures to Overcome Scaling Limitations
Alternative mesh architectures may overcome this scaling limitation. Non-compact structures such as binary trees avoid the extreme splitting-ratio requirements but occupy large chip areas and have many crossings. A complementary approach is to stick to conventional mesh geometries but insert redundant MZIs to realize the full range of splitting ratios even in imperfect hardware. This solves the scaling problem, but at the cost of a 1.5-2× increase in the number of MZIs and phase shifters. The resulting effect on chip area and pad count makes this option unappealing.
Both the 3-splitter MZI 2102′ and the MZI+Crossing 2102″ take up significantly less chip area than the “perfect” redundant MZIs, and do not require additional phase shifters. Moreover, they support progressive self-configuration, allowing for error correction even when the hardware errors are unknown. This enables the development of freely scalable, broadband, and compact linear photonic circuits.
Splitting ratios can be visualized as points on the Riemann sphere, where hardware imperfections lead to forbidden regions around the poles (bar- and cross-state), where the probability density is at a maximum. To avoid this unfortunate coincidence, the architectures in
3.2. Error Correction Formalism
$U = D\,(T_{N-1,1}) \cdots (T_{2,N-2} \cdots T_{21})(T_{1,N-1} \cdots T_{11})$
where Tmn is the nth MZI of the mth diagonal. We configure the mesh by building up matrix W in a sequence of steps designed to diagonalize a target matrix X=UW†. In each step, we add one crossing to W, performing the update W→TmnW, which right-multiplies the target matrix X→XTmn† as shown in
$[u \;\; v]\,T^\dagger = [* \;\; 0] \;\Leftrightarrow\; T_{11}/T_{12} = u/v$
This is illustrated in
Mathematically, nulling corresponds to matching the complex splitting ratio s≡T11/T12=−(T22/T21)* to a target value ŝ≡u/v. This is not always possible in practice, as the range of splitting ratios is constrained by hardware imperfections like those in
tan|α+β|≤|s|≤cot|α−β|
Here α,β are the splitting angle errors for the splitters in a standard MZI. These imperfections lead to forbidden regions, shown in
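The nulling condition and the realizability bound can be made concrete with a short sketch. It assumes one particular unitary T satisfying T11/T12=u/v (the overall phases of T are a free choice here), and uses hypothetical values for u, v, α, and β; the function names are illustrative only.

```python
import math

def nulling_matrix(u, v):
    """One unitary T with T11/T12 = u/v, so that [u v] T^dagger = [r 0]."""
    r = math.sqrt(abs(u) ** 2 + abs(v) ** 2)
    return [[u / r, v / r],
            [-v.conjugate() / r, u.conjugate() / r]]

def row_times_dagger(row, t):
    """Return the length-2 row vector row @ T^dagger."""
    return [sum(row[k] * t[j][k].conjugate() for k in range(2)) for j in range(2)]

def is_realizable(s, alpha, beta):
    """Check tan|alpha+beta| <= |s| <= cot|alpha-beta| for splitter errors alpha, beta."""
    lower = math.tan(abs(alpha + beta))
    diff = abs(alpha - beta)
    upper = math.inf if diff == 0 else 1.0 / math.tan(diff)
    return lower <= abs(s) <= upper

# Hypothetical matrix elements and splitter errors (radians).
u, v = 0.3 + 0.4j, 0.1 - 0.2j
T = nulling_matrix(u, v)
out = row_times_dagger([u, v], T)
s_target = T[0][0] / T[0][1]

print("nulled row:", out)
print("target |s|:", abs(s_target))
print("realizable with alpha=0.05, beta=0.02:", is_realizable(s_target, 0.05, 0.02))
print("near-cross target |s|=0.01 realizable:", is_realizable(0.01, 0.05, 0.02))
```

The second entry of the nulled row vanishes and the first carries the full magnitude sqrt(|u|²+|v|²). A target close to the cross state (small |s|) falls below tan|α+β| and is therefore not realizable, which is the situation that leaves a residual.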
If nulling cannot be achieved perfectly, the “zero” region of matrix X is left with a residual of magnitude:
where ŝ is the target splitting ratio, s is the closest physically realizable value, and d(s,ŝ) is the Euclidean distance on the Riemann sphere with stereographic projection s=(x+iy)/(1+z). As a fidelity metric, we consider the normalized matrix error $\varepsilon_c = \|\Delta U\|_{rms}/\sqrt{N}$, which is approximately the quadrature sum of these residuals:
Here, ⟨…⟩ refers to the ensemble average over Haar-distributed unitaries U. The residual post-correction error is highly sensitive to this distribution, since rmn is nonzero only within the forbidden regions. For large meshes, this distribution clusters tightly near the cross state, or the upper pole of the Riemann sphere as shown in
This is uniform for the lowest row of crossings and becomes increasingly concentrated as one approaches the triangle's apex.
We calculate the mean residual $\langle r^2\rangle$ by averaging the residual over the distribution P(s). This is simplified in the case of small hardware errors, because the forbidden region is correspondingly small and we can assume P(s) is approximately constant over it:
where s0 and R0 are the center and radius of the forbidden region. Consider the forbidden region near the upper pole (the lower region plays a negligible role for large meshes), where s0=0 and R0=2|α+β|. The residual is also proportional to the quantity $q_{mn} = |u|^2 + |v|^2$. Following the Gaussian elimination procedure of a Haar matrix, this evaluates to $q_{mn} = (n+1)/(N+1-m)$, yielding:
Assuming an uncorrelated Gaussian perturbation model with αrms=βrms=σ, the resulting matrix error is:
In contrast, if the mesh were straightforwardly programmed without taking any account of the imperfections (“uncorrected” error), the error under an uncorrelated perturbation model would be $\varepsilon_0 = \sqrt{2N}\,\sigma$. Self-configuration therefore leads to a quadratic suppression of errors: $\varepsilon_c = (\varepsilon_0)^2/3$. However, εc still increases with mesh size, so fabrication imperfections will still set a limit on the scaling of meshes based on this architecture.
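As a worked illustration of the quadratic suppression, the two expressions can be evaluated side by side. This sketch simply evaluates the uncorrected error √(2N)σ and the corrected error ε0²/3 stated above; the mesh sizes and the value σ=0.02 are arbitrary examples, and the function names are illustrative.

```python
import math

def uncorrected_error(n_modes, sigma):
    """epsilon_0 = sqrt(2N) * sigma for uncorrelated Gaussian splitter errors."""
    return math.sqrt(2 * n_modes) * sigma

def corrected_error(n_modes, sigma):
    """epsilon_c = epsilon_0^2 / 3 after self-configuration (uncorrectable regime)."""
    return uncorrected_error(n_modes, sigma) ** 2 / 3

sigma = 0.02  # example splitter-angle error in radians
for n_modes in (16, 64, 256):
    e0 = uncorrected_error(n_modes, sigma)
    ec = corrected_error(n_modes, sigma)
    print(f"N={n_modes:4d}  eps0={e0:.4f}  epsc={ec:.4f}")
```

Because εc is proportional to Nσ², doubling the mesh size doubles the corrected error, whereas the uncorrected error grows only as √N; the correction nevertheless wins as long as ε0 is well below unity.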
Some matrices can be exactly realized even with an imperfect mesh. Again, the coverage is the fraction of such unitaries under the Haar measure. This is computed by summing up the probabilities that individual MZIs fall within the forbidden region:
In practice, even moderately sized meshes will have vanishingly small coverage, so the error metric ε is usually more relevant.
3.3. Three-Splitter MZI
One challenge to error correction in multiport interferometers is that the forbidden regions overlap with the peak of the probability distribution (cross state s=0). Adding redundant components (e.g., MZI doubling) solves this problem by eliminating the forbidden regions altogether, but at the cost of added optical and electrical complexity. An alternative approach is to displace the forbidden regions away from the cross state rather than eliminating them altogether. This can be performed by placing a third splitter at the input of the MZI, as shown in
For a 50:50 splitting ratio (η=π/4), the extra splitter maps the bar and cross states to s=±i as shown in
The 3-splitter MZI can realize the full range of (absolute value) splitting ratios |s|∈[0, ∞), and for certain parameter choices, this ratio is wavelength-independent. However, the presence of forbidden regions means that the relative phase of this splitter cannot be fully controlled, so errors can still occur when programming the mesh. With the 3-splitter structure, though, MZIs fall into the forbidden regions only rarely; consequently, the error metric of the 3-splitter MZI is much lower than that of the standard MZI and, perhaps more strikingly, does not increase with mesh size N.
The mean residual left by MZI Tmn is:
The factor of two above arises because both forbidden regions contribute equally.
where γe≈0.5772 is the Euler-Mascheroni constant. For most mesh sizes, the matrix error εc is 1-2 orders of magnitude smaller for the 3-splitter MZI design. Remarkably, the error actually decreases with increasing mesh size, scaling as $\varepsilon_c \propto \sqrt{\log(N)/N}$. In the asymptotic limit N→∞, matrices can be programmed perfectly.
This non-intuitive effect arises from the fact that, under the Haar measure, only a small fraction of MZIs have significant probability density near s=±i, where the forbidden regions are centered. This probability decreases exponentially with the distance from the triangle's base (or from the edge in the case of the rectangular mesh) due to the $2^{-n}$ dependence in the distribution of the residuals. Therefore, although the mesh has N(N−1)/2 MZIs, only O(N) contribute significantly to the matrix error under self-configuration. A naïve estimate assuming uncorrelated errors would give $\|\Delta U\| \propto \sqrt{N}\,\sigma^2$, which would lead to a constant εc. However, during the self-configuration process, subsequent MZIs can partially correct for errors in earlier MZIs that cannot be properly configured. Only the residual terms remain, as these cannot be cancelled out by subsequent Givens rotations. The end result is to reduce the error of each MZI by a factor $q_{mn} = |u|^2 + |v|^2$, which is often ≪1. Properly accounting for this factor explains the further reduction to $\varepsilon_c \propto \sqrt{\log(N)/N}$.
Another advantage of the 3-splitter MZI is that the threshold for perfect error correction is higher. This threshold is found by computing the coverage:
The threshold scales as $\sigma_{th} \propto N^{-1/2}$, as opposed to the $N^{-3/2}$ scaling observed for a standard MZI. Consequently, perfect error correction is available under a much larger range of conditions as shown in
3.4. MZI+Crossing
The 3-splitter MZI described above is well-suited to the case of generic, uncorrelated component errors. However, since the correlation lengths of process variations tend to be larger than a single MZI, errors are often correlated in practice. This is especially true for broadband couplers based on multimode interference (MMI), subwavelength gratings, and asymmetric designs, all of which are highly dependent on the device geometry, which can vary slightly from run to run. Moreover, even with perfect 50:50 couplers, the splitting ratios are still wavelength-dependent. Operating the mesh away from its design wavelength can lead to correlated device errors, so sensitivity to these errors is closely tied to the operational bandwidth of the device.
Consider the case of a constant offset μ for all splitting ratios: α=β=μ. In a standard MZI, the forbidden region near s=∞ disappears since |α−β|=0, while the region near s=0 (the peak of the probability distribution) remains in place as shown in
As above, we use the residual formula to calculate the matrix error. In this case, there is only one forbidden region, centered at s0=∞, with R0=4μ. Only the MZIs in the bottom row contribute to the sum because the probability distribution vanishes at s=∞ for the upper rows. This yields:
As an added bonus, the MZI+Crossing design also reduces the effect of errors in the absence of correction. To see how, consider the transfer matrix of the standard MZI:
where $S(\eta) = e^{i\eta\sigma_x}$
$\|\Delta T\|_{MZI}^2 = 2\left[\cos^2(\theta/2)(\alpha+\beta)^2 + \sin^2(\theta/2)(\alpha-\beta)^2\right]$
The sensitivity to correlated errors α=β is maximized when the MZI is in the cross state θ=0. Since most MZIs in large meshes are close to the cross state, the matrix error $\varepsilon_0 = 2\sqrt{N}\,\mu$ for the overall mesh is larger by a factor of $\sqrt{2}$ compared to the uncorrelated case. For the MZI+Crossing, assuming a perfect crossing, the transfer matrix is:
Up to some irrelevant output phases, the effect of the crossing is to flip the relative sign of α and β, so the component errors appear anticorrelated. As a result, $\|\Delta T\|_{MZI+X} \propto \sin(\theta/2)\,\mu$, which is zero for the cross state. Of course, the MZIs in a realistic mesh are not exactly in the cross state, so the actual error will be nonzero. Adding the errors ∥ΔTmn∥ in quadrature and averaging over the probability distribution $P_{mn}(\theta) = n\sin(\theta/2)\cos(\theta/2)^{2n-1}$ gives:
$\varepsilon_0 = 2\sqrt{2(\log N + \gamma_e - 2)}\,\mu$
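The logarithmic scaling can be cross-checked by summing the per-MZI errors numerically. The sketch below rests on several assumptions that should be read as such: the standard-MZI error norm is taken as 2[cos²(θ/2)(α+β)² + sin²(θ/2)(α−β)²]; the crossing is modeled purely as a sign flip of β; a triangular mesh is assumed to contain N−n MZIs with index n; and the normalized error is taken as the square root of the summed mean squared errors divided by N. Function names are illustrative.

```python
import math

EULER_GAMMA = 0.5772156649015329

def mzi_error_sq(theta, alpha, beta):
    """Squared error norm of a standard MZI with splitter errors alpha, beta."""
    c2 = math.cos(theta / 2) ** 2
    s2 = math.sin(theta / 2) ** 2
    return 2 * (c2 * (alpha + beta) ** 2 + s2 * (alpha - beta) ** 2)

def mzi_crossing_error_sq(theta, alpha, beta):
    """MZI+Crossing, modeled here as flipping the relative sign of beta."""
    return mzi_error_sq(theta, alpha, -beta)

def mean_over_angle(func, n, steps=2000):
    """Midpoint-rule average of func over P_n = n sin(t/2) cos(t/2)^(2n-1)."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * h
        p = n * math.sin(theta / 2) * math.cos(theta / 2) ** (2 * n - 1)
        total += p * func(theta) * h
    return total

def uncorrected_error_crossing(n_modes, mu):
    """Quadrature sum over the mesh for fully correlated errors alpha = beta = mu."""
    total = 0.0
    for n in range(1, n_modes):
        avg = mean_over_angle(lambda t: mzi_crossing_error_sq(t, mu, mu), n)
        total += (n_modes - n) * avg
    return math.sqrt(total / n_modes)

def asymptotic_error_crossing(n_modes, mu):
    """Closed-form estimate 2 sqrt(2 (log N + gamma_e - 2)) mu from the text."""
    return 2 * math.sqrt(2 * (math.log(n_modes) + EULER_GAMMA - 2)) * mu

n_modes, mu = 64, 0.01
numeric = uncorrected_error_crossing(n_modes, mu)
estimate = asymptotic_error_crossing(n_modes, mu)
print("numeric sum:", numeric, "asymptotic formula:", estimate)
print("standard MZI, cross state:", mzi_error_sq(0.0, mu, mu))
print("MZI+Crossing, cross state:", mzi_crossing_error_sq(0.0, mu, mu))
```

Under these assumptions the numerical sum and the closed-form expression agree to within a few percent at N=64, with the gap shrinking as N grows, and the cross-state error of the MZI+Crossing vanishes while that of the standard MZI is 8μ².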
Correlated errors (both corrected and uncorrected) tend to be important because they are tightly connected to the operational bandwidth of the mesh. Directional couplers are generally dispersive, and this dispersion leads to a correlated wavelength-dependent splitter error, which can usually be expanded to first order μ≈(dμ/dλ)Δλ. Two useful wavelength-dependent figures of merit are (1) the bandwidth and (2) the tuning range.
The bandwidth is related to the number of wavelength channels that can be simultaneously processed by the mesh. Since the mesh is usually programmed at a specific wavelength (the center channel), the other channels usually have distorted matrices. Given a target error εmax, the bandwidth is the range over which ε0(λ)<εmax:
The tuning range refers to the range of λ over which the mesh can be programmed to a given accuracy. This is governed by the condition εc(λ)<εmax, which yields:
As
Real crossings have a small amount of nonzero crosstalk, quantified by the S-matrix element S21; scattering into the forward-facing port leads to a perturbation S(π/2)→S(π/2+γ) in the transfer matrix, where
γ=10−S
This does not degrade the effectiveness of self-configuration, since the additional scattering angle merely rotates the Riemann sphere, shown in
In addition to the forward-scattered light, a 90° crossing can scatter light into the backward-facing port. Back-reflected light can be subsequently reflected in other crossings, leading to a spurious signal that interferes with the forward-propagating light. Provided that the phases of reflected beams are random, these add in quadrature: with amplitude $\gamma^2$ and $O(N^2)$ scattering paths, this can induce an $O(N\gamma^2)$ error, which may be uncorrectable and set a limit on scaling. However, if this effect is small, gradient-based methods or iterative self-configuration may enable correction of these errors.
Adding a single passive component (either a splitter or a waveguide crossing) to the MZI makes it possible to recover behavior that is asymptotically perfect—that is, the average normalized matrix error decreases with size. The design choices are motivated by the elegant theory of self-configuration by matrix diagonalization, where splitting ratios are set to successively zero the off-diagonal elements of the target unitary. By visualizing the MZI state on the Bloch sphere, we can intuitively understand the increased error robustness of our designs in terms of “rotating” the forbidden regions away from the peak probability density. This leads to a several-orders-of-magnitude reduction in post-correction errors, compared to the standard MZI mesh. The ability to achieve near-perfect and freely scalable MZI meshes with less complexity than the MZI-doubled designs (especially with respect to the number of active components and pads) removes a major obstacle to the realization of very-large-scale photonic circuits.
The block singular-value decomposition of a unitary matrix can be used as the basis for a linear photonic circuit architecture. This decomposition splits a size-N mesh into four size-(N/2) quadrants connected by a central crossing layer, and when applied recursively yields a generalized fast Fourier transform (FFT) butterfly fractal. In addition to the benefit of modularity, the butterfly fractal exhibits greater robustness to hardware errors due to its more uniform distribution of coupler angles. Numerical simulations confirm this intuition, demonstrating reduced (post-correction) matrix error and improved scaling compared to the conventional mesh. A truncated butterfly can implement weight pruning in optical neural networks.
As photonic circuits grow larger, modularity and robustness to error become paramount architectural concerns. In modern process nodes, device yield limits the practical reticle size, necessitating multi-chip-module (MCM) architectures for high-end processors. Photonic MCMs are traditionally employed for heterogeneous integration of III/V lightsources; however, as MZI meshes grow in size, even these passive circuits may use a multi-chiplet architecture, so modularity is useful.
Hardware errors also limit the scaling of MZI meshes. With a fixed MZI error O(σ) and a circuit depth that scales as O(N), the overall matrix error scales as $O(\sqrt{N}\,\sigma)$ (assuming uncorrelated errors). Even for medium-sized meshes, errors noticeably degrade the performance in quantum and machine-learning applications. Errors can be corrected or compensated by a variety of techniques including global optimization, local correction, and self-configuration; however, for large meshes these corrections may be at best approximate and can lead to a matrix error that scales as $O(N\sigma^2)$, which is at most a quadratic reduction. The limits to error correction are closely tied to the (in)ability of physical MZIs to realize the splitting ratios demanded by the mesh. This problem is severe in standard mesh geometries, where the distribution of target coupler angles clusters tightly around the “cross” state, which is not realizable in an imperfect MZI. MZI pairing and MZI+Crossing designs like those disclosed above can circumvent this difficulty, but at the cost of additional hardware complexity.
The butterfly fractal is a generalization of the standard FFT butterfly. Like the FFT, it has a self-similar (and therefore modular) structure. As a result of this structure, the fractal has a much more uniform distribution of coupler angles, which facilitates effective error correction even in very large meshes. On the other hand, while the FFT and its photonic analogs can represent only restricted unitaries, the butterfly fractal is a universal N×N coupler. Based on this observation, the butterfly and its fractal variant are extreme cases of a spectrum of MZI meshes. Truncating the fractal is a way of trading off efficiency and generality, analogous to pruning or dropout in conventional neural networks.
4.1. The FFT Butterfly Fractal
The transfer matrix for an MZI mesh is the product of a diagonal phase screen and a sequence of 2×2 block matrices, where, by convention, the phase screen is usually placed at the output of the mesh as in
In the Reck and Clements meshes, the MZIs couple nearest-neighbor waveguide pairs, i.e., they have a stride of one. Projected to the waveguide mode set being coupled, each waveguide transfer matrix Tm takes the form:
If N is even, the matrix U can be divided into size-(N/2) quadrants Uij, (i,j)∈{1,2}. For unitary U, the matrices Uij share singular vectors, so U admits a block singular-value decomposition (SVD) of the form:
This equation divides a size-N Clements mesh into four size-(N/2) blocks, each realizing one of the matrices V1,2, W1,2. These blocks are joined by a central layer of stride-(N/2) MZIs, implementing the matrix
where (Θ,Φ) are size-(N/2) diagonal matrices that encode the phase shifts of the central layer. (Technically the SVD returns real-valued Dij; however, this can be put into the form immediately above by propagating the unwanted phases to the matrices V1,2, W1,2).
The block SVD is over-parameterized, with a total of
degrees of freedom. The redundant variables come from the output phase screens of W1,2, which lie in the interior of the mesh. These can be eliminated by propagating the phases to the output, following the rule:
This process can be applied recursively to the blocks V1,2, W1,2, constructing a self-similar fractal. For a radix-2 mesh size, the fractal terminates at N=2, yielding the generalized FFT butterfly, which is shown in
Like the Clements mesh, the butterfly fractal has O(N) depth and can represent all unitaries. One difference between the butterfly fractal and the Clements mesh is the presence of MZIs with stride s>1. There are $N/2^{k+1}$ layers of stride $s=2^k$, interleaved between layers of ordinary (s=1) MZIs. The large number of waveguide crossings is a liability of this scheme, since crossings have nonzero loss and crosstalk. One way to mitigate the effects of imperfect crossings is to terminate the fractal at a reasonable Nblk>2 (for instance, splitting a 256×256 mesh into an 8×8 grid of 32×32 Clements blocks). In this case, the crossings are dealt with as follows:
Component imperfections lead to perturbations in the transfer matrix for each MZI. On most integrated platforms, splitter and phase errors dominate. For a fully programmable device, phase errors can be absorbed into the phase shifter values (θ,ϕ). Splitter errors can be a more fundamental problem, since they limit the range of MZI splitting ratios, which can lead to uncorrectable errors in the mesh. Splitter errors can perturb the MZI transfer matrix as follows:
This relationship limits the effective splitting angle of an MZI to the range:
The performance of error correction is fundamentally limited by “uncorrectable” errors arising whenever this inequality is violated.
To analyze the effectiveness of error correction, we must first specify (1) an error distribution and (2) a distribution for the target unitary. For simplicity, consider the case of uncorrelated Gaussian errors: α,β∼N(0,σ). (The case of correlated errors is qualitatively similar since the effects of most correlations vanish over the Haar measure). Following Refs., this note uses an error metric based on the Frobenius norm $\varepsilon = \|\Delta U\|/\sqrt{N}$, which quantifies the average relative error per matrix element. Target unitaries are sampled over the Haar measure. In the absence of error correction, the matrix error for both meshes (Clements and FFT) is:
$\varepsilon_0 = \sqrt{2N}\,\sigma$
Since the uncorrected error is too large for many applications, error correction techniques should be employed in large meshes. The effects of error correction are twofold:
These equations for the coverage and residual post-correction error reveal that the effectiveness of error correction is strongly dependent on the splitting-angle probability distribution Pn(θ), especially its behavior near the poles {0, π}. Because the Jacobian in polar coordinates vanishes at the poles, P(0)=P(π)=0. Expanding to first order around these points, the coverage and corrected error are:
For the Reck triangle and Clements rectangle, $P(\theta) = k\sin(\theta/2)\cos(\theta/2)^{2k-1}$ where k∈{1, . . . , N−1} is the MZI's “rank”, and there are (N−k) MZIs of rank k. The larger the mesh, the higher the rank of the average MZI, and the tighter P(θ) clusters around the cross state θ=0. This gives a coverage of:
cov(N)=e−N
Unlike the Reck and Clements meshes, the FFT butterfly fractal is configured from a top-down block decomposition of the matrix. If the matrix U is sampled over the Haar measure, the singular-vector matrices V1,2, W1,2 are also Haar-random and independent of each other. As a result, the distribution Pn(θ) depends only on the MZI's stride, not on its location in the mesh.
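The stride structure of the butterfly fractal can be tallied with a short sketch. It takes the layer count stated earlier (N/2^{k+1} layers of stride 2^k) and assumes that every layer holds N/2 MZIs, which is an assumption of this example; the function name is illustrative.

```python
def butterfly_stride_counts(n_modes):
    """Map stride -> (layers, MZIs) for a size-N butterfly fractal, N a power of two.

    Assumes N / (2 * stride) layers per stride and N / 2 MZIs per layer.
    """
    if n_modes < 2 or n_modes & (n_modes - 1):
        raise ValueError("n_modes must be a power of two and at least 2")
    counts = {}
    stride = 1
    while stride < n_modes:
        layers = n_modes // (2 * stride)
        counts[stride] = (layers, layers * (n_modes // 2))
        stride *= 2
    return counts

for n_modes in (8, 256):
    counts = butterfly_stride_counts(n_modes)
    total_layers = sum(layers for layers, _ in counts.values())
    total_mzis = sum(mzis for _, mzis in counts.values())
    print(n_modes, counts, "layers:", total_layers, "MZIs:", total_mzis)
```

Under these assumptions the totals come out to N−1 layers and N(N−1)/2 MZIs, matching the O(N) depth and the MZI count of a universal N×N mesh, with half of all MZIs having stride one.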
The partial sum in the denominator converges very quickly to $\pi^2/8$. Approximating the sum by this value leads to $|P'_n(0)| = |P'_n(\pi)| \approx 4s/\pi^2$. Substituting into the expressions above for the coverage and corrected error and summing over the mesh (there are $N^2/(4s)$ MZIs of stride s) leads to:
The following table lists the uncorrected and corrected error ε0, εc for the Clements and FFT butterfly meshes over a range of mesh sizes.
To test the validity of the above expressions,
which is a factor of 3.6× for N=256. This factor, while modest, grows as $\sqrt{N/\log_2(N)}$, suggesting that the FFT butterfly mesh enjoys improved scaling under error correction compared to Reck and Clements. Intuitively, the improved scaling results from the more uniform distribution of phase angles (shown in
While various inventive embodiments have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the inventive embodiments described herein. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the inventive teachings is/are used. Those skilled in the art will recognize or be able to ascertain, using no more than routine experimentation, many equivalents to the specific inventive embodiments described herein. It is, therefore, to be understood that the foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, inventive embodiments may be practiced otherwise than as specifically described and claimed. Inventive embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the inventive scope of the present disclosure.
Also, various inventive concepts may be embodied as one or more methods, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts simultaneously, even though shown as sequential acts in illustrative embodiments.
All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
The indefinite articles “a” and “an,” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean “at least one.”
The phrase “and/or,” as used herein in the specification and in the claims, should be understood to mean “either or both” of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with “and/or” should be construed in the same fashion, i.e., “one or more” of the elements so conjoined. Other elements may optionally be present other than the elements specifically identified by the “and/or” clause, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
As used herein in the specification and in the claims, “or” should be understood to have the same meaning as “and/or” as defined above. For example, when separating items in a list, “or” or “and/or” shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as “only one of” or “exactly one of,” or, when used in the claims, “consisting of,” will refer to the inclusion of exactly one element of a number or list of elements. In general, the term “or” as used herein shall only be interpreted as indicating exclusive alternatives (i.e., “one or the other but not both”) when preceded by terms of exclusivity, such as “either,” “one of,” “only one of,” or “exactly one of.” “Consisting essentially of,” when used in the claims, shall have its ordinary meaning as used in the field of patent law.
As used herein in the specification and in the claims, the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, “at least one of A and B” (or, equivalently, “at least one of A or B,” or, equivalently “at least one of A and/or B”) can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
In the claims, as well as in the specification above, all transitional phrases such as “comprising,” “including,” “carrying,” “having,” “containing,” “involving,” “holding,” “composed of,” and the like are to be understood to be open-ended, i.e., to mean including but not limited to. Only the transitional phrases “consisting of” and “consisting essentially of” shall be closed or semi-closed transitional phrases, respectively, as set forth in the United States Patent Office Manual of Patent Examining Procedures, Section 2111.03.
This application claims the priority benefit, under 35 U.S.C. 119(e), of U.S. Application No. 63/196,301, which was filed on Jun. 3, 2021, and is incorporated herein by reference in its entirety for all purposes.
This invention was made with government support under FA9550-20-1-0113 awarded by the Air Force Office of Scientific Research. The government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
20220397383 A1 | Dec 2022 | US |
Number | Date | Country | |
---|---|---|---|
63196301 | Jun 2021 | US |