This disclosure relates generally to the design of physical devices for fabrication, and in particular but not exclusively, relates to photonic devices including but not limited to optical multiplexers and demultiplexers.
Fiber-optic communication is typically employed to transmit information from one place to another via light that has been modulated to carry the information. For example, many telecommunication companies use optical fiber to transmit telephone signals, internet communication, and cable television signals. But the cost of deploying optical fibers for fiber-optic communication may be prohibitive. As such, techniques have been developed to more efficiently use the bandwidth available within a single optical fiber. Wavelength-division multiplexing is one such technique that bundles multiple optical carrier signals onto a single optical fiber using different wavelengths. Fabrication techniques used to create physical devices such as photonic devices for use in these contexts can create complex physical structures, but a fabricated physical device may differ from the design of the physical device due to specifics of the fabrication process as implemented by a particular fabrication system.
In some embodiments, a non-transitory computer-readable medium is provided. The computer-readable medium has logic stored thereon that, in response to execution by one or more processors of a computing system, cause the computing system to perform actions for deriving a fabrication model for a fabrication system using an inverse design process. The actions include determining a test design for a test physical device, measuring performance of an instance of the test physical device fabricated by the fabrication system using the test design to determine an as-fabricated performance metric, optimizing the test design using a first loss function based on differences in a simulated performance metric of the test design and the as-fabricated performance metric to determine an as-fabricated design, optimizing a fabrication model using a second loss function based on differences between the test design and the as-fabricated design, and storing the optimized fabrication model for use in optimization of a new design for a new physical device.
In some embodiments, a non-transitory computer-readable medium is provided. The computer-readable medium has logic stored thereon that, in response to execution by one or more processors of a computing system, cause the computing system to perform actions for deriving a fabrication model for a fabrication system using an inverse design process. The actions include determining a test design for a test physical device, measuring performance of an instance of the test physical device fabricated by the fabrication system using the test design to determine an as-fabricated performance metric, determining structural parameters for a simulation based on the test design and the fabrication model, simulating performance of the test design using the structural parameters to determine a simulated performance metric, optimizing the fabrication model using a loss function based on differences between the simulated performance metric and the as-fabricated performance metric, and storing the optimized fabrication model for use in optimization of a new design for a new physical device.
Non-limiting and non-exhaustive embodiments of the invention are described with reference to the following figures, wherein like reference numerals refer to like parts throughout the various views unless otherwise specified. Not all instances of an element are necessarily labeled so as not to clutter the drawings where appropriate. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles being described. To easily identify the discussion of any particular element or act, the most significant digit or digits in a reference number refer to the figure number in which that element is first introduced.
In the illustrated embodiment, optical communication device 102 includes a controller 104, one or more interface device(s) 112 (e.g., fiber optic couplers, light guides, waveguides, and the like), a multiplexer (mux), demultiplexer (demux), or combination thereof (MUX/DEMUX 114), one or more light source(s) 116 (e.g., light emitting diodes, lasers, and the like), and one or more light sensor(s) 118 (e.g., photodiodes, phototransistors, photoresistors, and the like) coupled to one another. The controller includes one or more processor(s) 106 (e.g., one or more central processing units, application specific circuits, field programmable gate arrays, or otherwise) and memory 108 (e.g., volatile memory such as DRAM and SAM, non-volatile memory such as ROM, flash memory, and the like). It is appreciated that optical communication device 120 may include the same or similar elements as optical communication device 102, which have been omitted for clarity.
Controller 104 orchestrates operation of optical communication device 102 for transmitting and/or receiving optical signal 110 (e.g., a multi-channel optical signal having a plurality of distinct wavelength channels or otherwise). Controller 104 includes software (e.g., instructions included in memory 108 coupled to processor 106) and/or hardware logic (e.g., application specific integrated circuits, field-programmable gate arrays, and the like) that when executed by controller 104 causes controller 104 and/or optical communication device 102 to perform operations.
In one embodiment, controller 104 may choreograph operations of optical communication device 102 to cause light source(s) 116 to generate a plurality of distinct wavelength channels that are multiplexed via MUX/DEMUX 114 into a multi-channel optical signal 110 that is subsequently transmitted to optical communication device 120 via interface device 112. In other words, light source(s) 116 may output light having different wavelengths (e.g., 1271 nm, 1291 nm, 1311 nm, 1331 nm, 1506 nm, 1514 nm, 1551 nm, 1571, or otherwise) that may be modulated or pulsed via controller 104 to generate a plurality of distinct wavelength channels representative of information. The plurality of distinct wavelength channels are subsequently combined or otherwise multiplexed via MUX/DEMUX 114 into a multi-channel optical signal 110 that is transmitted to optical communication device 120 via interface device 112. In the same or another embodiment, controller 104 may choreograph operations of optical communication device 102 to cause a plurality of distinct wavelength channels to be demultiplexed via MUX/DEMUX 114 from a multi-channel optical signal 110 that is received via interface device 112 from optical communication device 120.
It is appreciated that in some embodiments certain elements of optical communication device 102 and/or optical communication device 120 may have been omitted to avoid obscuring certain aspects of the disclosure. For example, optical communication device 102 and optical communication device 120 may include amplification circuitry, lenses, or components to facilitate transmitting and receiving optical signal 110. It is further appreciated that in some embodiments optical communication device 102 and/or optical communication device 120 may not necessarily include all elements illustrated in
As illustrated in
In the illustrated embodiment of
As illustrated in
In the illustrated embodiment each of the plurality of output regions 304 are parallel to each other one of the plurality of output region 304. However, in other embodiments the plurality of output regions 304 may not be parallel to one another or even disposed on the same side (e.g., one or more of the plurality of output regions 304 and/or input region 302 may be disposed proximate to sides of dispersive region 332 that are adjacent to first side 328 and/or second side 330). In some embodiments adjacent ones of the plurality of output regions are separated from each other by a common separation distance when the plurality of output regions includes at least three output regions. For example, as illustrated adjacent output region 308 and output region 310 are separated from one another by distance 306, which may be common to the separation distance between other pairs of adjacent output regions.
As illustrated in the embodiment of
It is noted that the first material and second material of dispersive region 332 are arranged and shaped within the dispersive region such that the material interface pattern is substantially proportional to a design obtainable with an inverse design process, which will be discussed in greater detail later in the present disclosure. More specifically, in some embodiments, the inverse design process may include iterative gradient-based optimization of a design based at least in part on a loss function that incorporates a performance loss (e.g., to enforce functionality) and a fabrication loss (e.g., to enforce fabric ability and binarization of a first material and a second material) that is reduced or otherwise adjusted via iterative gradient-based optimization to generate the design. In the same or other embodiments, other optimization techniques may be used instead of, or jointly with, gradient-based optimization. Advantageously, this allows for optimization of a near unlimited number of design parameters to achieve functionality and performance within a predetermined area that may not have been possible with conventional design techniques.
For example, in one embodiment dispersive region 332 is structured to optically separate each of the four channels from the multi-channel optical signal within a predetermined area of 35 μm×35 μm (e.g., as defined by width 324 and length 326 of dispersive region 332) when the input region 302 receives the multi-channel optical signal. In the same or another embodiment, the dispersive region is structured to accommodate a common bandwidth for each of the four channels, each of the four channels having different center wavelengths. In one embodiment the common bandwidth is approximately 13 nm wide and the different center wavelengths is selected from a group consisting of 1271 nm, 1291 nm, 1311 nm, 1331 nm, 1506 nm, 1514 nm, 1551 nm, and 1571 nm. In some embodiments, the entire structure of demultiplexer 316 (e.g., including input region 302, periphery region 318, dispersive region 332, and plurality of output regions 304) fits within a predetermined area (e.g., as defined by width 320 and length 322). In one embodiment the predetermined area is 35 μm×35 μm. It is appreciated that in other embodiments dispersive region 332 and/or demultiplexer 316 fits within other areas greater than or less than 35 μm×35 μm, which may result in changes to the structure of dispersive region 332 (e.g., the arrangement and shape of the first and second material) and/or other components of demultiplexer 316.
In the same or other embodiments the dispersive region is structured to have a power transmission of −2 dB or greater from the input region 302, through the dispersive region 332, and to the corresponding one of the plurality of output regions 304 for a given wavelength within one of the plurality of distinct wavelength channels. For example, if channel 1 of a multi-channel optical signal is mapped to output region 308, then when demultiplexer 316 receives the multi-channel optical signal at input region 302 the dispersive region 332 will optically separate channel 1 from the multi-channel optical signal and guide a portion of the multi-channel optical signal corresponding to channel 1 to output region 308 with a power transmission of −2 dB or greater. In the same or another embodiment, dispersive region 332 is structured such that an adverse power transmission (i.e., isolation) for the given wavelength from the input region to any of the plurality of output regions other than the corresponding one of the plurality of output regions is −30 dB or less, −22 dB or less, or otherwise. For example, if channel 1 of a multi-channel optical signal is mapped to output region 308, then the adverse power transmission from input region 302 to any other one of the plurality of output regions (e.g., output region 310, output region 312, output region 314) other than the corresponding one of the plurality of output regions (e.g., output region 308) is −30 dB or less, −22 dB or less, or otherwise. In some embodiments, a maximum power reflection from demultiplexer 316 of an input signal (e.g., a multi-channel optical signal) received at an input region (e.g., input region 302) is reflected back to the input region by dispersive region 332 or otherwise is −40 dB or less, −20 dB or less, −8 dB or less, or otherwise. It is appreciated that in other embodiments the power transmission, adverse power transmission, maximum power, or other performance characteristics may be different than the respective values discussed herein, but the structure of dispersive region 332 may change due to the intrinsic relationship between structure, functionality, and performance of demultiplexer 316.
In one embodiment a silicon on insulator (SOI) wafer may be initially provided that includes a support substrate (e.g., a silicon substrate) that corresponds to substrate 334, a silicon dioxide dielectric layer that corresponds to dielectric layer 336, a silicon layer (e.g., intrinsic, doped, or otherwise), and a oxide layer (e.g., intrinsic, grown, or otherwise). In one embodiment, the silicon in the active layer 338 may be etched selectively by lithographically creating a pattern on the SOI wafer that is transferred to SOI wafer via a dry etch process (e.g., via a photoresist mask or other hard mask) to remove portions of the silicon. The silicon may be etched all the way down to dielectric layer 336 to form voids that may subsequently be backfilled with silicon dioxide that is subsequently encapsulated with silicon dioxide to form cladding layer 340. In one embodiment, there may be several etch depths including a full etch depth of the silicon to obtain the targeted structure. In one embodiment, the silicon may be 206 nm thick and thus the full etch depth may be 206 nm. In some embodiments, this may be a two-step encapsulation process in which two silicon dioxide depositions are performed with an intermediate chemical mechanical planarization used to yield a planar surface.
It is appreciated that in the illustrated embodiments of demultiplexer 316 as shown in
As illustrated in
The first material 410 (i.e., black colored regions within dispersive region 406) and second material 412 (i.e., white colored regions within dispersive region 406) of photonic demultiplexer 400 are inhomogeneously interspersed to create a plurality of interfaces that collectively form material interface pattern 420 as illustrated in
As illustrated in
In some embodiments, material interface pattern 420 includes one or more dendritic shapes, wherein each of the one or more dendritic shapes are defined as a branched structure formed from first material 410 or second material 412 and having a width that alternates between increasing and decreasing in size along a corresponding direction. Referring back to
In some embodiments, the inverse design process includes a fabrication loss that enforces a minimum feature size, for example, to ensure fabricability of the design. In the illustrated embodiment of photonic demultiplexer 400 illustrated in
As illustrated, system 500 includes controller 512, display 502, input device(s) 504, communication device(s) 506, network 508, remote resources 510, bus 534, and bus 520. Controller 512 includes processor 514, memory 516, local storage 518, and photonic device simulator 522. Photonic device simulator 522 includes operational simulation engine 526, fabrication loss calculation logic 528, calculation logic 524, adjoint simulation engine 530, and optimization engine 532. It is appreciated that in some embodiments, controller 512 may be a distributed system.
Controller 512 is coupled to display 502 (e.g., a light emitting diode display, a liquid crystal display, and the like) coupled to bus 534 through bus 520 for displaying information to a user utilizing system 500 to optimize structural parameters of the photonic device (i.e., demultiplexer). Input device 504 is coupled to bus 534 through bus 520 for communicating information and command selections to processor 514. Input device 504 may include a mouse, trackball, keyboard, stylus, or other computer peripheral, to facilitate an interaction between the user and controller 512. In response, controller 512 may provide verification of the interaction through display 502.
Another device, which may optionally be coupled to controller 512, is one or more communication device(s) 506 for accessing remote resources 510 of a distributed system via network 508. Communication device 506 may include any of a number of networking peripheral devices such as those used for coupling to an Ethernet, Internet, or wide area network, and the like. Communication device 506 may further include a mechanism that provides connectivity between controller 512 and the outside world. Note that any or all of the components of system 500 illustrated in
Controller 512 orchestrates operation of system 500 for optimizing structural parameters of the photonic device. Processor 514 (e.g., one or more central processing units, graphics processing units, and/or tensor processing units, etc.), memory 516 (e.g., volatile memory such as DRAM and SRAM, non-volatile memory such as ROM, flash memory, and the like), local storage 518 (e.g., magnetic memory such as computer disk drives), and the photonic device simulator 522 are coupled to each other through bus 520. Controller 512 includes software (e.g., instructions included in memory 516 coupled to processor 514) and/or hardware logic (e.g., application specific integrated circuits, field-programmable gate arrays, and the like) that when executed by controller 512 causes controller 512 or system 500 to perform operations. The operations may be based on instructions stored within any one of, or a combination of, memory 516, local storage 518, physical device simulator 522, and remote resources 510 accessed through network 508.
In the illustrated embodiment, the components of photonic device simulator 522 are utilized to optimize structural parameters of the photonic device (e.g., MUX/DEMUX 114 of
As illustrated in
Each of the plurality of voxels 612 may be associated with a structural value, a field value, and a source value. Collectively, the structural values of the virtual prototype 606 describe the structural parameters of the photonic device. In one embodiment, the structural values may correspond to a relative permittivity, permeability, and/or refractive index that collectively describe structural (i.e., material) boundaries or interfaces of the photonic device (e.g., material interface pattern 420 of
In the illustrated embodiment, the photonic device corresponds to an optical demultiplexer having a design region 614 (e.g., corresponding to dispersive region 332 of
However, in other embodiments, the entirety of the photonic device may be placed within the design region 614 such that the structural parameters may represent any portion or the entirety of the design of the photonic device. The electric and magnetic fields within the virtual prototype 606 (and subsequently the photonic device) may change (e.g., represented by the field value of the individual voxel that collectively correspond to the field response of the virtual prototype) in response to the excitation source. The output ports 604 of the optical demultiplexer may be used for determining a performance metric of the photonic device in response to the excitation source (e.g., power transmission from input port 602 to a specific one of the output ports 604). The initial description of the photonic device, including initial structural parameters, excitation sources, performance parameters or metrics, and other parameters describing the photonic device, are received by the system (e.g., system 500 of
Once the operational simulation reaches a steady state (e.g., changes to the field values in response to the excitation source substantially stabilize or reduce to negligible values) or otherwise concludes, one or more performance metrics may be determined. In one embodiment, the performance metric corresponds to the power transmission at a corresponding one of the output ports 604 mapped to the distinct wavelength channel being simulated by the excitation source. In other words, in some embodiments, the performance metric represents power (at one or more frequencies of interest) in the target mode shape at the specific locations of the output ports 604. A loss value or metric of the input design (e.g., the initial design and/or any refined design in which the structural parameters have been updated) based, at least in part, on the performance metric may be determined via a loss function. The loss metric, in conjunction with an adjoint simulation, may be utilized to determine a structural gradient (e.g., influence of structural parameters on loss metric) for updating or otherwise revising the structural parameters to reduce the loss metric (i.e. increase the performance metric). It is noted that the loss metric may be further based on a fabrication loss value that is utilized to enforce a minimum feature size of the photonic device to promote fabricability of the device, and/or other loss values.
In some embodiments, iterative cycles of performing the operational simulation, and adjoint simulation, determining the structural gradient, and updating the structural parameters to reduce the loss metric are performed successively as part of an inverse design process that utilizes iterative gradient-based optimization. An optimization scheme such as gradient descent may be utilized to determine specific amounts or degrees of changes to the structural parameters of the photonic device to incrementally reduce the loss metric. More specifically, after each cycle the structural parameters are updated (e.g., optimized) to reduce the loss metric. The operational simulation, adjoint simulation, and updating the structural parameters are iteratively repeated until the loss metric substantially converges or is otherwise below or within a threshold value or range such that the photonic device provides the desired performed while maintaining fabricability.
One problem in designing physical devices such as the photonic devices described above is that fabrication systems generally do not produce photonic devices that have the precise structure that is simulated using the techniques described above. The fabrication of photonic devices using fabrication systems such as semiconductor foundries involves a complex, multi-stage process of transferring the design from a photomask to a silicon wafer. Despite pre-corrections made to the photomask to compensate for distortion effects introduced by the photolithography process, the fabricated physical device still may exhibit shape distortions including but not limited to rounding of sharp corners, erosion or dilation of shape contours, oblique sidewall angles, and non-uniform layer thicknesses. These distortions are due to statistical variations of semiconductor processing and can affect performance of the manufactured physical devices in unexpected ways, given the delicate wave-interference physics which underlies the performance of physical devices such as the photonic devices described herein.
Characterizing these non-uniformities is a priority for operators of fabrication systems and typically requires a large sample of wafer runs. A process design kit (PDK) provided by a given fabrication system/foundry typically includes specifications and tolerances for critical features sizes such as the minimum line width and spacing, curvature, as well as minimum area (e.g, islands) and enclosed area (e.g., holes). However, the detailed processing capability including the optical proximity correction (OPC) and yields of a given foundry node are trade secrets, and are not generally available during the design process of the physical device to be fabricated. To ensure reliable and consistent manufacturing of inverse-designed physical devices, it is desirable to know a priori what the manufactured structure in the silicon wafer will be for a given photomask/design. It is further desirable for this information to be available without any detailed knowledge of the capabilities of the fabrication system beyond what is made publicly available (e.g., as part of the PDK).
One approach to generating a foundry-fabrication model is to take a scanning electron microscope (SEM) image or processing-simulation-generated prototype of the fabricated structure and come up with a series of differentiable operations (i.e., convolutions/filters, projections/thresholds, etc.) which can be applied sequentially to transform the photomask design to the image/prototype. Each of these individual operations is parameterized by some degrees of freedom and the problem is to determine these unknowns using gradient-based optimization via reverse-mode automatic differentiation of a scalar loss function. The transformation function can have an arbitrary number of parameters in order to incorporate the equivalent of the combined effects of the foundry OPC and processing.
Generally, it is challenging to generate a large set of input data using SEM images or simulated prototypes for training the foundry-fabrication model. There are three main challenges with this approach in which the inputs to the fabrication model are the photomask design and either SEM images or prototypes of the fabricated device:
What is desired are design techniques that can accurately model fabrication processes within fabrication systems without requiring proprietary information regarding the manufacturing process in order to increase the performance of the designed physical devices.
In some embodiments of the present disclosure, a new approach to creating a fabrication model for modeling the foundry-fabrication process is based on combining optical measurements of the fabricated device with topology optimization and full-wave electromagnetic simulations. In some embodiments, these techniques start by determining a photomask design and then using topology optimization to evolve the initial 3D structure based on the 2D photomask design until its simulated performance matches experimental measurements of an as-fabricated design. Similar to the SEM image/prototype-based approach described above, the optimization problem involves finding a set of parameters of a sequence of differentiable operations which minimize, for example, a difference of the simulated performance of the virtual prototype and the measured optical response of an as-fabricated design (e.g., minimizing a mean-squared error over a set of frequencies). This approach can use the same or a similar topology optimization framework that was initially used to design the photomask to match a pre-specified target output. Because the technique starts with a known initial design and is applying a similar set of constraints in the two stages of topology optimization, the design space is sufficiently limited such that the final design is likely to match the as-fabricated design.
Many benefits arise from using approaches in which the fabrication model is created based on the photomask design and the optical measurement of the fabricated device, including but not limited to the following:
As illustrated in
After receiving the initial design 730, the operational simulation 702 uses a fabrication model 744 to simulate the fabrication of the photonic device based on the initial structural parameters to create structural parameters 706 to be simulated in the simulation portion 742. In some embodiments, the fabrication model 744 is a sequence of differentiable operations that embodies differences from the initial design 730 that will be introduced by a fabrication system during fabrication. In some embodiments, instead of a sequence of differentiable operations, the fabrication model 744 may be represented by a neural network. Parameters for the fabrication model 744 may be learned using an optimization process as described in further detail below.
After the structural parameters 706 are determined using the fabrication model 744, the operational simulation 702 proceeds to a simulation portion 742. The simulation portion 742 occurs over a plurality of time-steps (e.g., from an initial time step to a final time step over a pre-determined or conditional number of time steps having a specified time step size) and models changes (e.g., from the initial field value 710) in electric and magnetic fields of a plurality of voxels describing the virtual prototype and/or photonic device that collectively correspond to the field response. More specifically, update operations (e.g., update operation 712, update operation 714, and update operation 716) are iterative and based on the field response, structural parameters 706, and one or more excitation sources 708. Each update operation is succeeded by another update operation, which are representative of successive steps forward in time within the plurality of time steps. For example, update operation 714 updates the field values 734 (see, e.g.,
Once the final time step of the simulation portion 742 is performed (although three update operations are illustrated for the sake of clarity and brevity, in some embodiments more than three update operations are performed), a simulated performance metric 718 is used to determine a performance loss value 720 associated with the structural parameters 706. In some embodiments, the simulated performance metric 718 is a characterization of the performance of the simulated device at one or more points (e.g., at the output ports of the device) and at one or more bandwidths, similar to the illustration in
From the loss metric 722, a loss gradient 724 may be determined. The loss gradient 724 may be treated as adjoint or virtual sources (e.g., physical stimuli or excitation source originating at an output region or port) which are backpropagated in reverse (from the final time step incrementally through the plurality of time steps until reaching the initial time step via update operation 726, update operation 750, and update operation 752) to determine gradient 728. Because it is determined based on the simulated performance metric 718, the gradient 728 is associated with the initial design 730 as modified by the fabrication model 744.
In the illustrated embodiment, the FDTD solve (e.g., simulation portion 742 of the operational simulation 702) and backward solve (e.g., adjoint simulation 704) problem are described pictorially, from a high-level, using only “update” and “loss” operations as well as their corresponding gradient operations. The simulation is set up initially in which the structural parameters, physical stimuli (i.e., excitation sources), and initial field states of the virtual prototype (and photonic device) are provided (e.g., via an initial description and/or input design). As discussed previously, the field values are updated in response to the excitation sources based on the structural parameters. More specifically, the update operation is given by ϕ, where =ϕ() for =1, . . . , . Here, corresponds to the total number of time steps (e.g., the plurality of time steps) for the operational simulation, where corresponds to the field response (the field value associated with the electric and magnetic fields of each of the plurality of voxels) of the virtual prototype at time step corresponds to the excitation source(s) (the source value associated with the electric and magnetic fields for each of the plurality of voxels) of the virtual prototype at time step , and corresponds to the structural parameters describing the topology and/or material properties of the physical device (e.g., relative permittivity, index of refraction, and the like).
It is noted that using the FDTD method, the update operation may specifically be stated as:
ϕ()=A()+B()
That is to say the FDTD update is linear with respect to the field and source terms. Concretely, A()∈N×N and B()∈N×N are linear operators which depend on the structural parameters, , and act on the fields, , and the sources, , respectively. Here, it is assumed that ∈N where N is the number of FDTD field components in the operational simulation. Additionally, the loss operation (e.g., loss function) may be given by L=ƒ(, . . . , ), which takes as input the computed fields and produces a single, real-valued scalar (e.g., the loss metric) that can be reduced and/or minimized.
In terms of revising or otherwise optimizing the structural parameters of the physical device, the relevant quantity to produce is
which is used to describe the influence of changes in the structural parameters of the initial design 730 on the loss value and is denoted as the gradient 728 illustrated in
which include
The update operation 714 of the operational simulation 702 updates the field value 734, , of the plurality of voxels at the ith time step to the next time step (i.e., +1 time step), which correspond to the field values 736, . The gradient 738 are utilized to determine for the backpropagation (e.g., update operation 726 backwards in time), which combined with the gradient 740 are used, at least in part, to calculate the structural gradient,
is the contribution of each field to the loss metric, L. It is noted that this is the partial derivative, and therefore does not take into account the causal relationship of →. Thus,
is utilized which encompasses the → relationship. The loss gradient,
may also be used to compute the structural gradient,
and corresponds to the total derivative of the field with respect to loss value, L. The loss gradient,
at a particular time step, , is equal to the summation of
which corresponds to the field gradient, is used which is the contribution to
from each time/update step.
In particular, the memory footprint to directly compute
is so large that it is difficult to store more than a handful of state Tensors. The state Tensor corresponds to storing the values of all of the FDTD cells (e.g., the plurality of voxels) for a single simulation time step. It is appreciated that the term “tensor” may refer to tensors in a mathematical sense or as described by the TensorFlow framework developed by Alphabet, Inc. In some embodiments the term “tensor” refers to a mathematical tensor which corresponds to a multidimensional array that follows specific transformation laws. However, in most embodiments, the term “tensor” refers to TensorFlow tensors, in which a tensor is described as a generalization of vectors and matrices to potentially higher dimensions (e.g., n-dimensional arrays of base data types), and is not necessarily limited to specific transformation laws. For example, for the general loss function ƒ, it may be necessary to store the fields, , for all time steps, . This is because, for most choices of ƒ, the gradient will be a function of the arguments of ƒ. This difficulty is compounded by the fact that the values of for larger values of are needed before the values for smaller due to the incremental updates of the field response and/or through backpropagation of the loss metric, which may prevent the use of schemes that attempt to store only the values
at an immediate time step.
An additional difficulty is further illustrated when computing the structural gradient,
which is given by:
For completeness, the full form of the first term in the sum,
is expressed as:
Based on the definition of ϕ as described by equation (1), it is noted that
which can be substituted in equation (3) to arrive at an adjoint update for backpropagation (e.g., the update operations such as update operation 726), which can be expressed as:
The adjoint update is the backpropagation of the loss gradient (e.g., from the loss metric) from later to earlier time steps and may be referred to as a backwards solve for
More specifically, the loss gradient may initially be based upon the backpropagation of a loss metric determined from the operational simulation with the loss function. The second term in the sum of the structural gradient,
corresponds to the field gradient and is denoted as:
for the particular form of ϕ described by the first equation above. Thus, each term of the sum associated depends on both for >= and for <. Since the dependency chains of these two terms are in opposite directions, it is concluded that computing
in this way requires the storage of xi values for all of . In some embodiments, the need to store all field value may be mitigated by a reduced representation of the fields.
Returning to
Accordingly,
It is appreciated that method 800 is a process that may be accomplished by performing operations with a system to perform iterative gradient-based optimization of a loss metric determined from a loss function that includes at least a comparison between a simulated performance metric 718 and an as-fabricated performance metric 748. In the same or other embodiments, method 800 may be included as instructions provided by at least one machine-accessible storage medium (e.g., non-transitory memory) that, when executed by a machine, will cause the machine to perform operations for generating and/or improving the fabrication model 744. It is further appreciated that the order in which some or all of the process blocks appear in method 800 should not be deemed limiting. Rather, one of ordinary skill in the art having the benefit of the present disclosure will understand that some of the process blocks may be executed in a variety of orders not illustrated, or even in parallel.
From a start block, the method 800 proceeds to block 802, where a test design (e.g., initial design 730) of a test physical device such as a photonic integrated circuit is received. In some embodiments, the physical device may be expected to have a certain functionality (e.g., perform as an optical demultiplexer). The test design may describe desired structural parameters of the physical device that are to be used for both fabrication and simulation. The virtual prototype may include a plurality of voxels that collectively describe the structural parameters of the test device. Each of the plurality of voxels may be associated with a structural value to describe the structural parameter, a field value to describe the field response (e.g., the electric and magnetic fields in one or more orthogonal directions) to physical stimuli (e.g., one or more excitation sources), and a source value to describe the physical stimuli.
It is appreciated that the “initial” test design may be a relative term. Thus, in some embodiments an initial description may be a first description of the test device described within the context of the virtual prototype (e.g., a first input design for performing a first operational simulation). However, in other embodiments, the term initial description may refer to an initial description of a particular cycle (e.g., of performing an operational simulation 702, operating an adjoint simulation 704, and updating the structural parameters). In such an embodiment, the test design or design of that particular cycle may correspond to a revised description or refined design (e.g., generated from a previous cycle). In some embodiments, the virtual prototype includes a design region that includes a portion of the plurality of voxels which have structural parameters that may be updated, revised, or otherwise changed to optimize the structural parameters. In the same or other embodiments, the structural parameters are associated with geometric boundaries and/or material compositions of the physical device based on the material properties (e.g., relative permittivity, index of refraction, etc.) of the virtual prototype.
At block 804, the test design is provided to a fabrication system to fabricate a test physical device. The fabrication system may use any suitable technique or combination of techniques to fabricate the test physical device, including but not limited to lithographic techniques such as photolithography and electron-beam lithography, sputtering, thermal evaporation, and physical and/or chemical vapor deposition. As discussed further below, while the method 800 describes receipt and processing of a single test design at once for the sake of clarity, in some embodiments multiple test physical devices based on multiple test designs may be received and provided to the fabrication system at once for fabrication on a single wafer. In some embodiments, up to tens of thousands of test physical devices may be fabricated on a single wafer and tested by wafer-scale testing techniques, thus allowing many test designs to be efficiently processed and used to improve the fabrication model even further.
At block 806, performance of the test physical device is measured to determine an as-fabricated performance metric. The performance of the test physical device may be measured at one or more points of the test physical device, including but not limited to one or more output ports. The performance may be measured by providing various inputs, including but not limited to light sources that produce various wavelengths, to one or more input ports of the physical device. In some embodiments, performance may be measured using one or more test-only input ports or output ports on the physical device that are used during testing but that are not taken into account during optimization of the design. Further, the performance of the test physical device may be tested both within frequencies that are intended for use and outside of frequencies that are intended for use in order to increase the amount of data that is available for the optimization process. Also, similar to the discussion above, though the method 800 illustrates and describes measuring performance of a single test physical device, in some embodiments the method 800 may use wafer-scale testing techniques to test more than one test physical device at a time.
At block 808, the fabrication model is used to determine structural parameters based on the test design. Because the fabrication model is intended to represent changes from structures specified in the test design that are introduced by the fabrication process used by the fabrication system, the structural parameters will likely be at least slightly different from the structure specified in the test design. In some embodiments, the fabrication model may initially not make any changes to the structure provided in the test design, but may begin to make changes once one or more iterations of the method 800 are completed. In some embodiments, the actions of block 808 may be skipped, and structural parameters of the test design may be provided directly to the virtual prototype without being altered by the fabrication model.
At block 810, a virtual prototype is configured to be representative of the structural parameters. Once the structural parameters have been determined using the fabrication model, the virtual prototype is configured (e.g., the number of voxels, shape/arrangement of voxels, and specific values for the structural values, field values, and/or source values of the voxels are set based on the structural parameters). In some embodiments the virtual prototype includes a design region optically coupled between a first communication region and a plurality of second communication regions. In some embodiments, the first communication region may correspond to an input region or port (e.g., where an excitation source originates), while the second communication may correspond to a plurality of output region or ports (e.g., when designing an optical demultiplexer that optically separates a plurality of distinct wavelength channels included in a multi-channel optical signal received at the input port and respectively guiding each of the distinct wavelength channels to a corresponding one of the plurality of output ports). However, in other embodiments, the first communication region may correspond to an output region or port, while the plurality of second communication regions corresponds to a plurality of input ports or region (e.g., when designing an optical multiplexer that optically combines a plurality of distinct wavelength signals received at respective ones of the plurality of input ports to form a multi-channel optical signal that is guided to the output port).
Block 812 shows mapping each of a plurality of distinct wavelength channels to a respective one of the plurality of second communication regions within the virtual prototype. The distinct wavelength channels may be mapped to the second communication regions by virtue of the test design. For example, the test design may associate a performance metric of the physical device with power transmission from the input port to individual output ports for mapped channels. In one embodiment, a first channel included in the plurality of distinct wavelength channels is mapped to a first output port, meaning that the performance metric of the physical device for the first channel is tied to the first output port. Similarly, other output ports may be mapped to the same or different channels included in the plurality of distinct wavelength channels such that each of the distinct wavelength channels is mapped to a respective one of the plurality of output ports (i.e., second communication regions) within the virtual prototype. In one embodiment, the plurality of second communication regions includes four regions and the plurality of distinct wavelength channels includes four channels that are each mapped to a corresponding one of the four regions. In other embodiments, there may be a different number of the second communication regions (e.g., 8 regions) and a different number of channels (e.g., 8 channels) that are each mapped to a respective one of the second communication regions.
Block 814 illustrates performing an operational simulation of the structural parameters in response to one or more excitation sources to determine a simulated performance metric. More specifically, in some embodiments an electromagnetic simulation is performed in which a field response of the photonic integrated circuit is updated incrementally over a plurality of time steps to determine how the field response of the simulated physical device changes due to the excitation source. The field values of the plurality of voxels are updated in response to the excitation sources and based, at least in part, on the structural parameters of the integrated photonic circuit. Additionally, each update operation at a particular time step may also be based, at least in part, on a previous (e.g., immediately prior) time step.
Consequently, the operational simulation simulates an interaction between the photonic device (i.e., the photonic integrated circuit) and a physical stimuli (i.e., one or more excitation sources) to determine a simulated output of the photonic device (e.g., at one or more of the output ports or regions) in response to the physical stimuli. The interaction may correspond to any one of, or combination of a perturbation, retransmission, attenuation, dispersion, refraction, reflection, diffraction, absorption, scattering, amplification, or otherwise of the physical stimuli within electromagnetic domain due, at least in part, to the structural parameters of the photonic device and underlying physics governing operation of the photonic device. Thus, the operational simulation simulates how the field response of the virtual prototype changes due to the excitation sources over a plurality of time steps (e.g., from an initial to final time step with a pre-determined step size).
In some embodiments, the simulated output may be utilized to determine one or more simulated performance metrics. For example, the excitation source may correspond to a selected one of a plurality of distinct wavelength channels that are each mapped to one of the plurality of output ports. The excitation source may originate at or be disposed proximate to the first communication region (i.e., input port) when performing the operational simulation. During the operational simulation, the field response at the output port mapped to the selected one of the plurality of distinct wavelength channels may then be utilized to determine a simulated power transmission of the photonic integrated circuit for the selected distinct wavelength channel. In other words, the operational simulation may be utilized to determine the simulated performance metric that includes determining a simulated power transmission of the excitation source from the first communication region, through the design region, and to a respective one of the plurality of second communication regions mapped to the selected one of the plurality of distinct wavelength channels. In some embodiments, the excitation source may cover the spectrum of all of the plurality of output ports (e.g., the excitation source spans at least the targeted frequency ranges for the bandpass regions for each of the plurality of distinct wavelength channels as well as the corresponding transition band regions, and at least portions of the corresponding stopband regions) to determine a performance metric (i.e., simulated power transmission) associated with each of the distinct wavelength channels for the photonic integrated circuit. In some embodiments, one or more frequencies that span the passband of a given one of the plurality of distinct wavelength channels is selected randomly to optimize the design (e.g., batch gradient descent while having a full width of each passband including ripple in the passband that meets the target specifications). In the same or other embodiments, each of the plurality of distinct wavelength channels has a common bandwidth with different center wavelengths.
The method 800 then proceeds to a continuation terminal (“terminal A”). From terminal A (
At decision block 818, a determination is made regarding whether the loss metric substantially converges such that the difference between the simulated performance metric and the as-fabricated performance metric is within a threshold range and/or has ceased improving. In some embodiments, the structural parameters of the design region of the integrated photonic circuit are revised when performing the iterations to match the structure of the fabricated physical device as closely as possible, which is indicated by this convergence. In some embodiments, the determination at decision block 818 is merely based on whether a predetermined number of iterations has been performed.
If the determination is that the loss metric has not converged, then the result of decision block 818 is NO, and the method 800 proceeds to block 820. Block 820 illustrates backpropagating the loss metric via the loss function through the virtual prototype to determine an influence of changes in the structural parameter on the loss metric (i.e., a structural gradient). The loss metric is treated as an adjoint or virtual source and is backpropagated incrementally from a final time step to earlier time steps in a backwards simulation to determine the structural gradient.
Block 822 shows updating the structural parameters of the test design based on the structural gradient to adjust the loss metric. In some embodiments, adjusting for the loss metric may reduce the loss metric. However, in other embodiments, the loss metric may be adjusted or otherwise compensated in a manner that does not necessarily reduce the loss metric. In one embodiment, adjusting the loss metric may maintain fabricability while providing a general direction within the parameterization space to obtain designs that will ultimately result in performance that more closely matches the as-fabricated performance metric. In some embodiments, the revised description is generated by utilizing an optimization scheme after a cycle of operational and adjoint simulations via a gradient descent algorithm, Markov Chain Monte Carlo algorithm, or other optimization techniques. Put in another way, iterative cycles of simulating the physical device, determining a loss metric, backpropagating the loss metric, and updating the structural parameters to adjust the loss metric may be successively performed until the loss metric substantially converges such that the difference between the simulated performance metric and the as-fabricated performance metric is within a threshold range. In some embodiments, the term “converges” may simply indicate the difference is within the threshold range and/or below some threshold value. In some embodiments, the term “converges” may indicate that the error between the simulated performance metric and the as-fabricated performance metric is no longer shrinking after one or more optimization iterations (i.e., a local or global minimum has been reached). The method 800 then returns to block 808 via a continuation terminal (“terminal B”) to iterate using the updated test design.
Returning to decision block 818, if the determination is that the loss metric has converged, then the result of decision block 818 is YES and the method 800 advances to block 824. At block 824, the test design as updated by one or more iterations of the method 800 is designated as an as-fabricated design. That is, the optimized test design is considered to accurately represent the actual structure of the test physical device as it was fabricated by the fabrication system.
The method 800 then proceeds to a decision block 826, where a determination is made regarding whether more devices remain to be processed. As discussed above, in some embodiments the method 800 may be performed using a plurality of devices in order to stress a wide variety of functionality of the fabrication system. In some embodiments, the method 800 may include testing a plurality of devices created on a single wafer. Since a large number of devices may be created on a single 300 mm wafer, this may allow a large number of different designs to be tested efficiently. In some embodiments, multiple devices fabricated on a single wafer may be tested using wafer-scale testing tools. In some embodiments, the plurality of devices created on the wafer may be designed using a “design for testing” approach in order to exercise the full range of capabilities of the fabrication system. Further, the simulated performance metric and as-fabricated performance metric may be collected using wavelengths other than those intended for device operation to provide additional data for the optimization process.
Accordingly, if the method 800 is processing more than one device and more devices remain to be processed, then the result of decision block 826 is YES, and the method 800 returns to block 802 via a continuation terminal (“terminal C”) to process the test design for the next device. On this subsequent iteration, if the test designs for each device were provided together to the fabrication system in order to fabricate the test physical devices on a single wafer, then at least some of the actions of block 802, block 804, and/or block 806 may have been performed during the previous iteration for the next device, and may be skipped in the subsequent iteration.
Returning to decision block 826, if all of the devices have been processed, then the result of decision block 826 is NO, and the method 800 proceeds to block 828. Block 828 illustrates updating the fabrication model based on differences between the as-fabricated designs and the test designs. In some embodiments, the fabrication model includes a sequence of differentiable operations (or composable functions), each of which is parameterized by some degrees of freedom. Some non-limiting examples of differentiable operations within the fabrication model may include a filter operation (e.g., convolution with a square, diamond, or conical shape function, etc.) and a projection operation (e.g., sigmoid, etc.). The updates may be applied to the fabrication model using any suitable technique. For example, since the fabrication model includes differentiable operations, a gradient of the error of the fabrication model may be determined, and the fabrication model may be updated based on the gradient. As another example, the JAX machine learning framework provided by Google, LLC may be used to optimize/update the parameters of the fabrication model. In some embodiments, the fabrication model may include a neural network which may be updated using any suitable technique, including but not limited to gradient descent or an Adam optimizer.
At block 830, the updated fabrication model is provided for simulations of new initial designs. The updated fabrication model may be used in an iterative design process similar to that illustrated above in
The method 800 then proceeds to an end block and terminates.
It will be appreciated that in the method 800, the as-fabricated design for each test design is derived. One advantage of using the method 800 to optimize the fabrication model is that the pairs of test designs and as-fabricated designs may be stored for future reference, and multiple different fabrication models (including fabrication models of different architectures) may be derived using the stored test design/as-fabricated design pairs without having to re-process the test designs (that is to say, once the pairs of test designs and as-fabricated designs are obtained, block 828 can be executed multiple times in order to optimize different fabrication models without repeating the previous portions of method 800). This allows for greater exploration of the architecture of the fabrication model with a reduced computational load.
That said, it may be desirable in some embodiments to reduce the amount of computation time needed for the training of a given fabrication model. Accordingly, in some embodiments, the optimization technique may directly optimize the fabrication model instead of deriving the as-fabricated designs as an intermediate step.
From a start block, the method 900 proceeds to block 902, where a test design of a test physical device is received. At block 904, the test design is provided to a fabrication system to fabricate a test physical device. At block 906, performance of the test physical device is measured to determine an as-fabricated performance metric. At block 908, the fabrication model is used to determine structural parameters based on the test design. At block 910, a virtual prototype is configured to be representative of the structural parameters. Block 912 shows mapping each of a plurality of distinct wavelength channels to a respective one of the plurality of second communication regions within the virtual prototype, and block 914 illustrates performing an operational simulation of the structural parameters in response to one or more excitation sources to determine a simulated performance metric.
It will be appreciated that the steps of method 900 from block 902 to block 914 are the same as the steps of method 800 from block 802 to block 814. Accordingly, the steps of block 902 to block 914 are not described again here for the sake of brevity. It should be noted, however, that while the use of the fabrication model to determine the structural parameters based on the test design was optional at block 808, these actions are not optional in block 908 because the fabrication model is being optimized within the iterative loop in order to affect the structural parameters.
From block 914, the method 900 proceeds to a continuation terminal (“terminal A”). From terminal A (
At decision block 918, a determination is made regarding whether the loss metric substantially converges such that the difference between the simulated performance metric and the as-fabricated performance metric is within a threshold range and/or has ceased improving. In some embodiments, the structural parameters generated by the fabrication model are revised when performing the iterations to match the structure of the fabricated physical device as closely as possible, which is indicated by this convergence. In some embodiments, the determination at decision block 918 is merely based on whether a predetermined number of iterations has been performed.
If the determination is that the loss metric has not converged, then the result of decision block 918 is NO, and the method 900 advances to block 920. At block 920, the loss metric is backpropagated through the virtual prototype to determine a structural gradient. Again, block 920 is similar to block 820 of method 800, and so is not described again here for the sake of brevity.
At block 922, the fabrication model is updated based on the structural gradient to adjust the loss metric. This is a departure from the actions of method 800, in which the structural gradient was used to adjust the test design. Instead, updating the fabrication model means that instead of the test design being optimized to more closely resemble the as-fabricated structure of the test physical device, the fabrication model is optimized such that the structural parameters it generates, given the test design, more closely resemble the as-fabricated structure of the test physical device. The structural gradient may be used to update the fabrication model in any suitable way. In some embodiments, the structural gradient may be used to determine a gradient of the fabrication model, and the gradient of the fabrication model may then be used in an optimization process. In some embodiments, backpropagation of the loss metric may determine the gradient of the fabrication model as well. In some embodiments, the optimization process may include JAX, gradient descent, or an Adam optimizer as discussed above. The method 900 then returns to block 908 via a continuation terminal (“terminal B”) to iterate using the updated fabrication model.
Returning to decision block 918, if the determination is that the loss metric has converged, then the result of decision block 918 is YES and the method 900 advances to decision block 924. At decision block 924, a determination is made regarding whether more devices remain to be processed. As discussed above and similar to method 800, in some embodiments the method 900 may be performed using a plurality of devices in order to account for a wide variety of functionality of the fabrication system. Accordingly, if the method 900 is processing more than one device and additional devices remain to be processed, then the result of decision block 924 is YES, and the method 900 returns to block 902 via a continuation terminal (“terminal C”) to process the test design for the next device. On this subsequent iteration, if the test designs for each device were provided together to the fabrication system in order to fabricate the test physical devices on a single wafer, then at least some of the actions of block 902, block 904, and/or block 906 may have been performed during the previous iteration for the next device, and may be skipped in the subsequent iteration.
Returning to decision block 924, if all of the devices have been processed, then the result of decision block 924 is NO, and the method 900 proceeds to block 926, where the updated fabrication model is provided for simulations of new initial designs. Again, the updated fabrication model may be used in an iterative design process similar to that illustrated above in
The method 900 then proceeds to an end block and terminates.
In the preceding description, numerous specific details are set forth to provide an understanding of various embodiments of the present disclosure. One skilled in the relevant art will recognize, however, that the techniques described herein can be practiced without one or more of the specific details, or with other methods, components, materials, etc. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring certain aspects.
Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.
The order in which some or all of the blocks appear in each method flowchart should not be deemed limiting. Rather, one of ordinary skill in the art having the benefit of the present disclosure will understand that actions associated with some of the blocks may be executed in a variety of orders not illustrated, or even in parallel.
The processes explained above are described in terms of computer software and hardware. The techniques described may constitute machine-executable instructions embodied within a tangible or non-transitory machine (e.g., computer) readable storage medium, that when executed by a machine will cause the machine to perform the operations described. Additionally, the processes may be embodied within hardware, such as an application specific integrated circuit (“ASIC”) or otherwise.
The above description of illustrated embodiments of the invention, including what is described in the Abstract, is not intended to be exhaustive or to limit the invention to the precise forms disclosed. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes, various modifications are possible within the scope of the invention, as those skilled in the relevant art will recognize.
These modifications can be made to the invention in light of the above detailed description. The terms used in the following claims should not be construed to limit the invention to the specific embodiments disclosed in the specification. Rather, the scope of the invention is to be determined entirely by the following claims, which are to be construed in accordance with established doctrines of claim interpretation.