The various embodiments of the present disclosure relate generally to systems and methods for enhanced engineering design and optimization capable of providing feasible optical response solutions for design parameters and output characteristics, and more particularly to systems and methods for enhanced engineering design and optimization of complex electromagnetic structures.
While electronic design is well established, nanophotonic integrated circuits are a fast-emerging domain and present design technologies are inadequate. Traditional design and optimization approaches for such nanophotonic structures rely on using analytical or semi-analytical modeling or even brute-force analysis. Such approaches are limited to structures having relatively simple designs due to the complex computational requirements to completely study and model such structures.
Further, as researchers are able to form increasingly more complex nanostructures with multiple design parameters, these traditional design methods and optimization approaches become less and less feasible. For example, as the number of design parameters increases so does the computational requirements for generating and analyzing such designs. Additionally, as such nanostructures become more and more complex, it becomes ever more important to understand and quickly identify whether a variety of design options and parameters may achieve a feasible optical response.
Some emerging solutions have suggested combatting these problems by utilizing neural networks, however such solutions have been limited to the simple nanostructures due to the reduced computational complexity afforded due to the one-to-one relationship between design parameters and output response. However, such approaches fail to account for the fact that most of the most promising nanostructures do not exhibit such a one-to-one relationship, thus such solutions ultimately provide little to no improvement over the brute force techniques. Additionally, these methods tend to focus on optimization of a nanostructure rather than learning the physical phenomena or possible responses that can be achieved from a structure or alternatively, the potential of a given structure providing a range of possible responses. Learning this information can assist in designing considerably simpler and fabricationally favorable structures.
Accordingly, there is a need for systems and methods for assessing feasibility of a desired response of a given design and feasibility for design parameters exhibiting a many-to-one relationship between design parameters and output characteristics. Specifically, there is a need for systems and methods for assessing feasibility of desired optical responses of complex electromagnetic nanostructures. Examples of the present disclosure are directed to these and other considerations.
Examples of the present disclosure comprise systems and methods for detecting feasible optical response performances from a structure having design parameters and limitation parameters, and more particularly to systems and methods for identifying feasibility of desired responses achievable in a photonic nanostructure.
An exemplary embodiment of the present disclosure provides a system for detecting a feasible optical response performance from a structure. The system can comprise one or more processors and at least one memory in communication with the one or more processors and configured to store instructions. The instructions, when executed by the one or more processors, can be configured to cause the system to collect input design data, identify limitation data, generate simulation data, train a first multi-layer neural network, train a second neural network, generate an optimization convex hull, and invert a design space and a response space to generate the feasible optical response performance from the structure.
In some embodiments, identifying limitation data can be based on the input design data. Generating simulation data can be based on the limitation data and can comprise a design space and a response space. Training the first multi-layer neural network can utilize the response space. The first multi-layer neural network can be trained to generate a reduced response space having reduced dimensionality compared to the response space. The first multi-layer neural network can comprise an encoding layer and a decoding layer. Train the second neural network can utilize the design space and the response space. The second neural network can be trained to generate a reduced design space having reduced dimensionality compared to the design space. The optimization convex hull can be generated by cascading the second neural network with the decoding layer of the first multi-layer neural network. Inverting the design space and the response space can use the optimization convex hull to generate the feasible optical response performance from the structure.
In some embodiments, wherein the instructions can be further configured to cause the system to determine a designation of overlapping or non-overlapping of a desired design space of a desired structure. The designation of overlapping or non-overlapping can be determined by using the optimization convex hull.
In some embodiments, the instructions can be further configured to cause the system to validate, by using validation data, the optimization convex hull.
In some embodiments, the input design data can comprise a plurality of randomly generated patterns of a simulated structure.
In some embodiments, limitation data can comprise structural limitation data relating to physical properties of a photonic nanostructure.
In some embodiments, the photonic nanostructure can comprise a metasurface.
In some embodiments, the first multi-layer neural network can be an autoencoder.
In some embodiments, the autoencoder can utilize mean squared error as a cost function.
In some embodiments, the simulation data can comprise a multi-dimensional response space.
In some embodiments, the simulation data can comprise at least a six-dimensional response space.
In some embodiments, the response space and the reduced response space have a one-to-one dimensional relationship.
In some embodiments, the reduced design space and the reduced response space have a one-to-one dimensional relationship.
An exemplary embodiment of the present disclosure provides a method comprising identifying a first plurality of data points, computing a first convex hull, merging the first convex hull with previous convex hulls, and determining a feasible optical response performance within an electromagnetic nanostructure. The first plurality of data points can comprise a design space and a response space. The first convex hull can include all data points in the first plurality of data points. Merging the first convex hull with previous convex hulls can form an optimized convex hull. The previous convex hulls can comprise a second plurality of data points comprising previous design spaces and previous response spaces. The feasible optical response performance within an electromagnetic nanostructure can be determined by using the optimized convex hull.
In some embodiments, the method can further comprise inverting, using the optimized convex hull, the design space and the response space to generate the feasible optical response performance from the electromagnetic nanostructure.
In some embodiments, inverting, using the optimized convex hull, the design space and the response space can further comprise applying a one-class support vector machine algorithm.
In some embodiments, inverting, using the optimized convex hull, the design space and the response space can further comprise designating a desired design space of a desired electromagnetic nanostructure as overlapping or non-overlapping the optimized convex hull.
In some embodiments, the method can further comprise merging, using the optimized convex hull and third plurality of data points from a desired electromagnetic nanostructure structure, a re-optimized convex hull when the desired electromagnetic nanostructure comprises a desired design space designated as non-overlapping.
An exemplary embodiment of the present disclosure provides a system for detecting a feasible optical response performance from an electromagnetic nanostructure. The system can comprise one or more processors and at least one memory in communication with the one or more processors and configured to store instructions. The instructions, when executed by the one or more processors, can be configured to cause the system to collect input electromagnetic nanostructure design data; identify, based on the input electromagnetic nanostructure design data, structural limitation data comprising material properties, potential nanostructure geometry, periodic/non-periodic, unit-cell structure, and fabrication limitations; generate, based on the structural limitation data, electromagnetic simulation data comprising a design space, the design space comprising a set of design patterns and a corresponding response space comprising a corresponding set of response patterns; train, utilizing the corresponding response space, a first multi-layer neural network to generate a reduced response space having reduced dimensionality compared to the corresponding response space, the first multi-layer neural network comprising an encoding layer and a decoding layer; train, utilizing the design space and the corresponding response space, a second neural network to generate a reduced design space having reduced dimensionality compared to the design space; generate, by cascading the second neural network with the decoding layer of the first multi-layer neural network, an optimization convex hull; and invert, using the optimization convex hull, the design space and the corresponding response space to generate the feasible optical response performance from the electromagnetic nanostructure.
In some embodiments, the instructions can be further configured to cause the system to determine, by using the optimization convex hull, a designation of overlapping or non-overlapping of a desired design space of a desired structure.
In some embodiments, the electromagnetic simulation data can comprise a multi-dimensional response space ranging from about 2-dimensional to about 6-dimensional.
Further features of the disclosed design, and the advantages offered thereby, are explained in greater detail hereinafter with reference to specific examples illustrated in the accompanying drawings, wherein like elements are indicated by like reference designators.
These and other aspects of the present disclosure are described in the Detailed Description below and the accompanying drawings. Other aspects and features of embodiments will become apparent to those of ordinary skill in the art upon reviewing the following description of specific, exemplary embodiments in concert with the drawings.
While features of the present disclosure may be discussed relative to certain embodiments and figures, all embodiments of the present disclosure can include one or more of the features discussed herein. Further, while one or more embodiments may be discussed as having certain advantageous features, one or more of such features may also be used with the various embodiments discussed herein. In similar fashion, while exemplary embodiments may be discussed below as device, system, or method embodiments, it is to be understood that such exemplary embodiments can be implemented in various devices, systems, and methods of the present disclosure.
The following detailed description of specific embodiments of the disclosure will be better understood when read in conjunction with the appended drawings. For the purpose of illustrating the disclosure, specific embodiments are shown in the drawings. It should be understood, however, that the disclosure is not limited to the precise arrangements and instrumentalities of the embodiments shown in the drawings.
To facilitate an understanding of the principles and features of the present disclosure, various illustrative embodiments are explained below. The components, steps, and materials described hereinafter as making up various elements of the embodiments disclosed herein are intended to be illustrative and not restrictive. Many suitable components, steps, and materials that would perform the same or similar functions as the components, steps, and materials described herein are intended to be embraced within the scope of the disclosure. Such other components, steps, and materials not described herein can include, but are not limited to, similar components or steps that are developed after development of the embodiments disclosed herein.
Here, the inventors present a new approach based on geometric deep learning to measure the viability of certain optical responses from a class of nanostructures. Design of nanostructures always comes with having some constraints (e.g., size, shape, and material properties). Due to these constraints, some responses are not practical for a certain class of nanostructure by any means (any set of design parameters). In case a given response is impossible to achieve using a class of nanostructure, instead of searching blindly over all possible design parameters, which is expensive in time and resource, this approach leads to a change in the structure. The algorithm can first reduce the dimensionality of the response space using the ground truth data generated by commercial full-wave simulators. Then, the platform can be trained to find the optimum convex hull to bound the feasible responses in the latent space. This can be done through an iterative process until convergence. Next, a one-class SVM algorithm can be applied to find the non-convex geometry of achievable responses. The method was applied to two different classes of metasurfaces (MSs) in the visible range: (i) digital MSs consisting of 7×7 and 14×14 binary plasmonic nano-cubes associated with sophisticated numerical reflectance responses, and (ii) engineered MSs comprising of a square-lattice array of dielectric nano-ellipsoids associated with numerical and experimental Fano-type sharp resonances. The systems and methods described herein can accelerate current design approaches and grant priceless information about what a specific class of photonics nanostructure is capable to offer.
As shown in
Input design data 12 can include data characterizing one or more nanostructures having randomly generated patterns using a full-wave EM simulation software (e.g., EM simulation software can include suitable computational tools such as Numerical Electromagnetics Code (NEC), Momentum, High-Frequency Structure Simulator (HFSS), XFdtd, AWR Axiem, AWR Analyst, JCMsuite, COMSOL Multiphysics, FEKO, and Elmer FEM). Additionally or alternatively thereto, input design data 12 can include data characterizing an EM wave solver using an analytic or a semi-analytic model.
As shown in
Based on limitation data 14, system 10 can be caused to generate simulation data 15. Simulation data 15 can include a plurality of design spaces 16 (e.g., all possible designs) and a plurality of response spaces 17 (e.g., all possible responses). The relationship between design space 16 and response space 17 for a structure can be a one-to-one relationship or a many-to-one relationship. A many-to-one relationship between a design space 16 and a response space 17 may indicate that multiple sets of design parameters may result in a given response. As will be appreciated, such a many-to-one relationship can lead to a non-uniqueness problem, meaning that there may be more than one solution for a given desired response. As will be further appreciated, many-to-one problems can also involve a great deal of computational complexity in order to arrive at one of the many nonunique solutions. It is such problems that system 10 seeks to solve.
As depicted in
In some embodiments, system 10 can further be caused to train second neural network 22 using the design space 16 and response space 17. Second neural network 22 can be trained to generate a reduced design space 24 having reduced dimensionality compared to design space 16. The second neural network 22 can be similar to first multi-layer neural network 18 and can include an encoding layer, a decoding layer, and optionally one or more hidden layers. In some embodiments, the second neural network 22 can be an autoencoder that dimensionally reduces design space 16 to generate reduced design space 24.
As depicted in
After forming optimization convex-hull 30, system 10 can further be caused to use optimization convex hull 30 to invert the design space 16 and response space 17 and generate the feasible optical response performances that a structure can achieve.
In some embodiments, system 10 can additionally be caused to use optimization convex hull 30 to determine a designation of either overlapping or non-overlapping of a desired design space of a desired structure. When system 10 designates a desired design space as overlapping, system 10 can provide design parameters possible to generate the desired optical response. Alternatively, when system 10 designates a desired design space as non-overlapping, system 10 can inform that a desired optical response is not possible with the desired design space and thereby accelerate alternative design approaches.
The simulation devices 120A-120n can represent computer simulation devices and/or one or more neural networks that have been pre-trained based on simulation data. The server 130 may belong to a third-party aggregator, for example, that stores data, such as neural network training data, simulation data, or other data necessary to implement the methods described herein.
The user device 140 can be, for example, a personal computer, a smartphone, a laptop computer, a tablet, a wearable device (e.g., smart watch, smart jewelry, head-mounted displays, etc.), or another computing device. An example computer architecture that can be used to implement the user device 140 is described below with reference to
Examples of electromagnetic nanostructures are illustrated in
An example electromagnetic nanostructure is provided in
Examples of electromagnetic nanostructures are illustrated in
As shown in
The following examples further illustrate aspects of the present disclosure. However, they are in no way a limitation of the teachings or disclosure of the present disclosure as set forth herein.
Photonic nanostructures have been of great recent interest due to their unique capabilities to manipulate the properties of electromagnetic (EM) waves beyond what conventional bulk materials can do. Owing to their constituent nanoscale features, which can spectrally, spatially, or temporally control the optical state of EM waves with subwavelength resolution, nanophotonic devices extend all the functionalities realized by conventional bulky optical devices in much smaller footprints. Combined with the advances in nanofabrication technologies, these nanostructures have been used to demonstrate devices with enormous potential for groundbreaking technologies such as computing, imaging, and energy harvesting, to name a few.
Design of photonic devices in the nanoscale regime outperforming the bulky optical components has been a long-lasting challenge in some state-of-the-art applications. Accordingly, devising a comprehensive model to understand and explain the fundamental physics of light-matter interactions in these nanostructures is a substantial step toward the realization of novel nanophotonic devices. To this end, existing modeling methods can be categorized into two main groups; single- and multi-objective approaches. Single-objective approaches either rely on exhaustive design parameter sweeps using a brute-force EM solver (e.g., based on the finite element method) or evolve from an initial guess to a final result through evolutionary methods (e.g., genetic algorithm). While the former requires extensive computation, the latter highly depends on the initial guess and in most cases converges to a local optimum. Both of these single-objective approaches are computationally demanding and fail when the input-output relation is complex, or the number of desired features for a nanostructure grows. On the other hand, multi-objective methods deal with formation of a model to optimize a certain class of problems. Although these methods are more computationally efficient, obtaining an optimal solution is not guaranteed.
Deep learning (DL)-based design approaches, combined with limited exhaustive searches, have proven to be a potent solver of multi-objective optimization problems by learning the input-output relation. Although DL-based approaches can be applied to nanophotonic design problems, by finding the geometry of the data while reducing the dimensionality of the response and design patterns, they can go much further and provide considerably more information and intuition about the dynamics of light-matter interaction in nanostructures with the hope of uncovering new physical phenomena that can be used to form completely new type of devices. Unfortunately, there has been little effort on using these techniques to obtain detailed knowledge about the physics of light-matter interaction in EM nanostructures (e.g., metasurfaces (MSs)). The change of focus of using DL techniques from “optimization” to “learning” can open a new research area with potentially transformative results in the entire field of nanophotonics. Examples of these “learning” paradigms include assessing the feasibility of a desired response using a given structure as well as the potential of a given structures in providing a range of possible responses. Such information can enable the evolution of the design from an initially selected nanostructure to a considerably simpler and fabricationally favorable structure The focus of this disclosure is to study these feasibility issues in nanostructures while discovering latent optical phenomena.
Knowing the feasibility of a desired response offered by a photonic nanostructure is very helpful prior to any design or optimization effort in avoiding suboptimal designs or convergence issues. It also guides us to modify the initial structure to achieve the desired response. This important concept has not been considered in existing optimization and inverse design approaches, which provides a solution to any inverse design problem regardless of its feasibility.
A geometric deep learning (GDL)-based technique is presented by forming the smallest convex set (i.e., convex hull) to discover hidden optical phenomena while analyzing the feasibility of having a desired optical response from a certain class of EM nanostructures. GDL is a term for techniques aims to generalize DL approaches by considering the non-Euclidean domain such as manifolds. These methods reduce the dimension of the patterns while finding the governing geometry of the patterns in low-dimensional space which Euclidean distance can be a good measure for similarity of the patterns. The developed approach is based on reducing the dimensionality of the response space (RS) of a given EM nanostructure and finding the convex hull that contains achievable responses in the latent RS. The dimensionality reduction (DR) implementation is based on the autoencoder (see Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504-507 (2006)), and the Quickhull (see Barber, C. B., Dobkin, D. P., Dobkin, D. P. & Huhdanpaa, H. The quickhull algorithm for convex hulls. ACM Transactions on Mathematical Software (TOMS) 22, 469-483 (1996)) algorithm is used to form the convex hull in the latent RS. The technique uses the numerical simulation of the response of the system for a series of randomly selected design parameters (called training sets) and another series of similar simulation results for validation of the technique. After initial training and validation, the algorithm finds the optimal bounded subset, which contains all feasible responses. The optimal region that contains the feasible responses might not be convex in many cases, and it is better to also find a tighter bound over feasible responses in the latent RS. For this purpose, the inventors use the one-class support vector machine (SVM) algorithm to find the non-convex geometry. One-class SVM also provides information about the level of feasibility (or non-feasibility) of a response and the possibility of trading an acceptable error (or a small change in the desired response) to get the closest feasible response from a non-feasible one. Despite being implemented for the EM nanostructures (especially dielectric and plasmonic MSs), the disclosed technique can be applied to a wide variety of applications once the training data can be provided. Some example extensions include thermal structures, fluidic systems, mechanical platforms, and acoustic metamaterials.
The rest of the disclosure is organized as follows. Section 2 describes the details of the approach. Section 3 demonstrates the application of the approach to two classes of important MSs. Section 4 is devoted to the comparison of the findings of the disclosed technique with experimental data. It is followed by discussion in Section 5 and conclusion in Section 6.
The convex hull of a set of points is defined as the smallest convex set that contains all those points. A d-dimensional convex hull can be represented using its vertices and (d-1)-dimensional facets. The ridges of the convex hull are (d-2)-faces, which are the intersection of the vertices in two neighboring facets. There are different algorithms presented in geometrical computation to form the convex hull of a given set of points. One of the most effective and well-known algorithms is Quick-hull, which forms the convex hull using an incremental method based on Grunbaum's Beneath-Beyond theorem (see below). For a typical problem, the Quickhull algorithm starts with a set of given (or training) points and forms the initial convex hull. The points that lie outside the initial convex hull are considered as the outside set. The farthest point from the initial convex hull (i.e., the point with the maximum Euclidean distance from its nearest facet) is found at each iteration and the facets, ridges, and vertices are updated based on Grunbaum's Beneath-Beyond theorem. These steps are repeated until the algorithm converges.
While the convex hull algorithm is capable of finding a convex geometry for feasible responses, it has some limitations. If the optimum feasible region is not convex, inevitably some unfeasible regions in the latent RS will be included in the convex hull to reach a convex region. This limits the efficiency of the algorithm for such structures due to the false-positive errors. Moreover, the algorithm acts as a binary classifier and classifies responses into two classes: feasible (achievable) and unfeasible (unachievable). In most practical cases, it is desirable to know how far an unfeasible response is from feasible responses. It is also helpful to know whether it is possible to push an unfeasible response toward the feasible region by accepting some error. Unfortunately, the Euclidean distance of a given point in the latent RS from the geometry of the convex hull is not a good measure for feasibility of the corresponding response. To address this limitation, the inventors use one-class SVM in the latent RS as the alternative algorithm.
One-class SVM is an algorithm that separates the patterns into two regions (e.g., feasible and unfeasible in the example case). In addition, the Euclidean distance between any point in the space and the geometry of the one-class SVM is a good measure of this separation (e.g., a good measure of the feasibility of a response in the example cases disclosed herein). Mathematically, a one-class SVM forms a nonlinear geometry by projecting patterns xi through a nonlinear function ϕ to a higher-dimensional space F. This mapping helps to separate linearly non-separable patterns in the input space I in a high-dimensional space by a hyperplane (represented with wT+b=0, w∈F and b∈R). By projecting this hyperplane from the high-dimensional space back to the original space, the algorithm finds the equivalent non-convex decision geometry. In this projection, the resulting region for the desired (or feasible) class of data may not only have a non-convex geometry, but it may also exclude smaller closed regions within the geometry. The implementation of the one-class SVM has considerable flexibility through two parameters ν and γ which control the tightness of the geometry of the decision region and the maximum ratio of the given training patterns that fall outside the geometry (and thus, contribute to classification error). By using different values of γ, one can find a series of boundaries with different levels of classification errors for the ground-truth data. However one-class SVM shows is capable of finding the non-convex geometry of latent patterns, computation complexity of validating ν and γ in each iteration prevents using it as a prelimenary approach of forming the geometry. Further details about the one-class SVM are provided in Example 9 below.
To demonstrate the potentials of the disclosed technique, the inventors apply it to the investigation of possible optical reflection responses from plasmonic and dielectric MSs as two popular classes of photonic nanostructures.
The design patterns in each case are achieved by random selection of the binary inclusions, and the calculated reflection spectra are sampled uniformly in the 400-800 nm wavelength range with 2 nm resolution to form a vector with dimensionality of 200 forms the response patterns. Due to the iterative nature of the algorithms in
Table 1. Average distance of different classes of test data (14×14 and 7×7 responses as well as Fano and Lorenzian lineshape resonances) from the highest confident region border for one-class SVM. Distances for random samples represented in the most-right column is also represented. The distances are calculated using Eq. S8.
After obtaining the training dataset, the first step of implementation is the dimensionality reduction of the RS by training an autoencoder. To find the optimum dimensionality of the latent RS and the number of layers of the autoencoder, the inventors use an ad-hoc approach by using different structures and dimensionalities and calculating the mean squared error (MSE) for each case. The details of this approach are explained in Kiarashinejad, Y., Abdollahramezani, S., Zandehshahvar, M., Hemmatyar, O. & Adibi, A. Deep learning reveals underlying physics of light-matter interactions in nanophotonic devices. Advanced Theory and Simulations (2019).
It is important to note that despite training with a non-aggressive success rate of 95%, the convex hull algorithm is capable of identifying all non-feasible responses as well as a large portion of the feasible responses. Nevertheless, the convex hulls in
Each added region corresponds to a different level of non-feasibility of a response that lies outside the highest confidence region. A quantitative measure for the level of feasibility of a response in this one-class SVM is the minimum distance of that response form the geometry of the highest confidence region. The calculated distance in a 6D one-class SVM for a series of responses of the structure in
To show the applicability of the disclosed technique in practical problems, without loss of generality, the inventors choose the reflective structure of a low-loss dielectric MS, which can be experimentally fabricated and characterized.
To evaluate the convex hull experimentally, the inventors fabricated dielectric MSs with symmetric unit cells (i.e., px=py=p) with 250 nm<p<450 nm consisting of symmetric nanopillars (i.e., rx=ry=r) with 0.65 p<r<0.75 p (see Example 7 below). The scanning electron microscopy (SEM) image for the fabricated MS with p=450 nm and r=0.75 p is shown in
The results in previous sections clearly show the power of GDL algorithm in assessing the feasibility of a desired response given a specific nanostructure design. They also show the advantage of one-class SVMs in providing a more quantitative measure for the feasibility of the desired response. This advantage comes from the fact that in one-class SVM, the geometric distance of a point in the latent RS from the geometry of the one-class SVM is a good gauge for the feasibility of the structure while in general convex hulls, this relation does not hold. This advantage comes at the expense of more sophisticated training as the optimum hyper-parameters ν and γ in SVM are not usually trivial to find. In practice, the inventors first find the convex hull of the feasible responses, and use it to find proper values of ν and γ. Nevertheless, convex hulls are helpful in providing quick evaluations of the feasible responses. The training process can also be simplified if more error is accepted. Note also that finding the actual geometry of the convex hull and one-class SVM may not be important in design and optimization problems as the points on the boundaries are less reliable to be feasible. We prefer the desired response to be in the middle of one-class SVM.
In addition to the boundaries of convex hull and one-class SVM in the latent RS, the area that is covered in that space by these shapes has important practical implications. The larger the area, the more capable the structure is in forming output responses.
Note that the dimensionality reduction algorithm implemented by the autoencoder is an important step in reducing the computing requirements for the convex hull and one-class SVM. For any particular problem, the optimum dimension of the latent RS depends on the selection of the design and the redundancy of the response (i.e., the level of non-uniqueness). Thus, finding the optimum size of the latent RS is the initial step in implementing the algorithms of this disclosure. Once the size of the latent RS is selected, the required computation for the calculation of the convex hull and one-class SVM are primarily for the training algorithm. In this disclosure, the inventors mainly used the brute-force approach in starting with a training dataset and expanding it until the convex hull (and subsequently the one-class SVM) pass the validation test. Further rigorous approaches muse be developed to minimize the computation for training. One can also take advantage of the trade0off between the accuracy (or the error) and the computation requirement as explained above.
Although the focus of this disclosure was the first demonstration of a GDL-based technique for studying the feasibility of a given response, this technique can be adopted for obtaining far more detailed information about the physics of nanostructures. As an example,
Numerical Simulations. All numerical simulations throughout this disclosure were carried out in COMSOL Multiphysics commercial software interfaced to MATLAB to facilitate the process. For the design of unit elements, periodic geometry conditions and perfectly match layers were considered in the lateral and vertical directions, respectively. A TM-polarized light in the range of 400-800 nm is launched into the simulation domain, and the co-polarized reflection coefficient was calculated at the location of the input port. The optical constants of Al, Al2O3 in
Fabrication process. The dielectric MS shown in
Having a set of points, there are different ways to find a boundary that bounds these points like Simplex, Voronoi Diagram, Convex hull, etc. The convex hull of a set of points is the smallest convex set that contains the points. Considering x1, x2, . . . , xk∈X, the convex combination of these points is defined as θ1x1+θ2x2+ . . . +θkxk where θi≥0 and θ1+θ2+ . . . +θk=1. A set is convex if and only if it contains all the convex combination of its points. The convex hull of the set of points, X, is denoted as conv X and is defined as:
conv X={θ1x1+θ2x2+ . . . +θkxk|xi∈X, θi≥0, i=1, 2, . . . , k, θ1+θ2+ . . . +θk} (S1)
Considering the convex hull operator on a set of points, it is (1) Extensive (i.e. Convex hull of all sets in X is a superset of X), (2) Non-decreasing (i.e. convex hull of a subset of set X, is a subset of convex hull of X), and (3) Idempotent(i.e. Convex hull of the convex hull of X is same as convex hull of X). The convex hull of any set of points is also unique and closed set.
There are different algorithms presented in Geometrical Computation to form the convex hull of a given set of points. One of the most effective and well-known algorithms is Quickhull. This algorithms find the convex hull of a set of points in d dimension using an effective method both in memory and computation. Given a set of n data points with r processed point, the algorithm is O(nlogr) if the dimension of the convex hull is less than or equal to 3 and is O(nfr/r) for d being more than 3 (fr is the maximum number of facets for r vertices). Let's define extreme points of a convex hull as those points that are vertices of the boundary of the convex hull. The running time of the, algorithm as mentioned, will be output dependent since it depends on the number of facets and vertices. Therefore, for those sets that the inventors have less extreme points, it takes less time for the algorithm to find the solution. A d-dimensional convex hull can be shown using its vertices and (d-1)-dimensional faces. The ridges of the convex hull are (d-2)-faces which are the intersection of the vertices in two neighboring facets. Quickhull forms the convex hull using an incremental method. First consider the Grunbaum's Beneath-Beyond theorem is used in this incremental algorithm.
Consider H as the convex hull of a set of points in Rd and a point p in Rd−H. F is a facet of conv(HUp) if and only if:
(1) F is a facet of H and p is below F, or
(2) F is not a facet of H. and its vertices are p and the vertices of a ridge of H that has one incident facet below p and one above p.
The quickhull algorithm starts with a set of points (i.e., a random subset of all datapoints) and forms the initial convex hull. All the points that lies outside of the initial convex hull are considered as the outside set. The furthest point from the outside set is found at each iteration and based on theorem 1, the facest, ridges, and vertices will be updated. This process will continue until convergence. The resulting convex hull consists all the datapoints. The random based methods, however, consider a random point from outside set at each iteration. This makes the process time consuming and the running time of the algorithm will be much more than quickhull.
After forming the convex hull for set X in the space, the inventors need to find out whether a given point p lies inside the convex hull or not. First consider one random point a outside of the convex hull. Then connect x and a with a line segment. Find the number of intersection of the line xa and every vertex of the convex hull. If the number of intersections is odd, the point lies inside the convex hull. Otherwise, if the number of intersections is even or zero, this point is outside the convex hull.
As discussed before, the convex hull method just provides us binary decisions about the feasibility of the responses. To tackle this limitation, one-class SVM is used. Assume that the training data are x1, x2, . . . , xN∈X where N is the number of datapoints. Considering the mapping ϕ(x) from feature space, X, to a dot product space F, the kernel function is defined as:
k(xi, xj)=(ϕ(xi), ϕ(xj)) (S2)
There are different choices for the kernel function like Gaussian and polynomial kernel. In this research, the inventors used the Gaussian kernel.
One-class SVM then can be formulated as an optimization problem which finds a hyperplane to separate datapoints in X from the origin in F and has the maximum distance from the origin. This problem is formulated as a quadratic program:
Here ν∈(0, 1] is a free parameter of the algorithm. The slack variables ξi let the algorithm to misclassify some points to have a better generalization over unseen datapoints. Therefore, the free parameter ν penalizes the number of misclassified points. For ν=0, the penalty for the slack variables is infinity and all the algorithm will overfit to the training data while for larger ν more slack variables can have non zero value and the algorithm underfits. It is more practical to solve the dual problem for one-class SVM.
By solving this optimization problem, which is a quadratic programming, the decision function becomes:
Here p can be recovered using the dual variables. Those datapoints xi that the optimized value αi is non-zero are called support vectors. These datapoints are mainly close to the boundary and enforces the complexity of the boundary.
The dimension of the original response space is 200. This high dimensional space results in two major issues that should be solved. First, due to the curse of dimensionality the distances and patterns in high dimensional space cannot be interpreted and results in low performance. Second, running time of the Quickhull algorithm increases as the dimensionality increase and forming the convex hull in such high dimensional space is not practical. To address these problems the inventors used auto-encoder to reduce the dimensionality of the response space. We reduce the dimensionality to 2D and 3D for visualization. However, to find the optimum dimensionality, the inventors need to find the reconstruction error. The MSE is shown in
To have a better sense of the efficiency of the algorithm, the inventors define point-to-point error (Errorp). Assume that the inventors have n response patterns and each response pattern achieved by discretizing the response by measuring reflectance (r and {dot over (r)} represent ground truth and estimated reflectance respectively) in m different wavelengths (i.e λ). The point-to-point error becomes:
To understand the capabilities and limitations of the binary structures, the inventors tested the algorithm with Fano-lineshapes. These type of resonances can be observed in the reflectance response of the all-dielectric MS consisting of HfO2 NPs shown in
where a, b, and c are the constant real numbers, ω0 is the central resonant frequency, and γ is the overall damping rate of the resonance. The Q is calculated by Q=ω0/γ.
To show the capabilities of the disclosed approach, the inverse design of metasurfaces was studied with reflection responses of Fano-like lineshape using the unit-cell structures shown in
For the AI analysis, a total of 8000 random sets of design parameters for the five unit-cell structures are generated, and the corresponding reflection responses are computed using three-dimensional finite-difference time-domain (3D FDTD) simulations, implemented using Lumerical, in the 300<λ<850 nm range, where λ is the wavelength. The incident beam is a normally-incident plane-wave from the top with linear polarization in the x direction in
To form the feasible set of responses for each nanostructure in
The latent-space representation of the responses (see
Comparing the feasible responses of structures with different unit cells in
To better quantify the effectiveness of each unit-cell structure in
To find a structure that generates a desired reflection response, the first step is to find the corresponding point in the latent space by reducing the dimensionality of the desired response using the trained AE (see the two examples in
To form the latent space of the responses, an AE is trained on a total of 6000 reflection responses obtained by 3D FDTD simulations for the random sets of design parameters of the structures in
To find the optimum design parameters, a feed-forward NN is trained from the design to the response space. The network has 8 layers with 9, 20, 50, 100, 100, 200, 400, 500 nodes at each layer. The activation functions of the hidden layers is tanh(.). The design parameters are normalized to have zero mean and unit standard deviation. The weights of the NN are trained using the Adam optimizer in Python to minimize the MSE.
To perform the inverse design with a desired reflection response and a given design complexity (i.e., a given unit-cell structure in
The 3D FDTD simulations are conducted with the commercial software Lumerical. The simulation domain is limited to one period (p) in the lateral directions (i.e., x and y in
An example systematic approach for the inverse design of non-unique nanophotonic structures can be based on reducing the dimensionality of the design space and response space (DS and RS, respectively). The inverse design problem can be solved using the reduced DS (RDS) and the reduced RS (RRS), and the computation requirements are reduced by orders of magnitude. The inverse design approach is based on dividing the large overall non-unique (and thus, non-invertible) problem into a combination of a large invertible problem (between the RS and the RDS) and a small non-invertible problem (between the RDS and the original DS). To demonstrate this approach's unique features, the example method as described above was applied to design standard multilayer thin-film structures composed of consecutive layers of silica (SiO2) and titania (TiO2). The detailed comparison with the alternative approach based on training a conventional NN without dimensionality reduction shows an improvement by two to three orders of magnitude in the number of floatingpoint operations per second (FLOPS) without imposing a significant error.
where y=F(x), y*=F(x*), and Loss(y, y*) is considered as the MSE (i.e., ∥y−y*∥22). Although the FNN is significantly faster than an EM simulation software for searching over the DS, the computation will increase as the number of design parameters and the complexity of the structure increase [since a network with more nodes and layers needs to be trained). The first step in the disclosed dimensionality-reduction-based design approach is to reduce the dimensionality of the RS using an autoencoder (AE). FIG. 20B shows the schematic of an AE, which is composed of an encoder that maps the RS to the RRS (Φ:→) and a decoder that reconstructs the original response from the RRS (Ψ:→). Due to the high redundancy in the response of a photonic nanostructure, the dimensionality of the RS can be reduced extensively without significant error. The optimum dimensionality of the RRS can be found using an ad hoc method by changing the size of the bottleneck layer in
As discussed above, the relation between the RDS and the RS in
To investigate the existence of non-uniqueness in the disclosed dataset, the inventors produce sets of design parameters with nearly identical optical spectrums with an accelerated brute-force approach. We use the trained FNN in
To reduce the dimensionality of the design and RSs for the eight-layer structure, the inventors first train the AE in
It is important to note that the numbers of nodes in the layers of the PE after the bottleneck layer (i.e., the RDS-to-RRS network) do not affect the computation advantage of the PE in inverse design, as the only part used for the final search is the DS-to-RDS network. Nevertheless, it is important to optimize the dimensions of the RDS and RRS to ensure a one-to-one relation between the RDS and the RS to enable the simple inversion from the RS to the RDS. This is currently done by trying different dimensions for the RRS and the RDS. Future research should be performed to develop more rigorous approaches for finding such optimal dimensions.
While the PE-based approach is computationally favorable over the FNN during the inverse design phase, it requires more computation during the training phase since the PE requires training of two separate networks: the AE for the dimensionality reduction of the RS, and the PE for that of the DS. However, training is performed only once for all the inverse design attempts using a given photonic device architecture. Thus, the added training computation is not a major disadvantage of the PE-based approach. Although the inventors considered thin-film structures with 20 layers in this disclosure, the disclosed dimensionality-reduction approach can be used to reduce the computation requirements for structures with any number of layers. The computation advantage of the approach will be even more for more complex structures, especially with careful optimization of the dimension of the latent space based on the acceptable reconstruction error. Note that there is a trade-off between the dimensionality of the latent space (and thus, the computation requirements) and the error in reconstruction of a given response by the AE, as can be seen in
A unique feature of the demonstrated approach is its generality and applicability for designing and investigating a variety of different nanophotonic structures for different applications, as long as the response features are covered in the training phase. This is in contrast to conventional design approaches where the entire design process has to be repeated once the desired response changes (even slightly).
To summarize, the inventors demonstrated here a reliable and computationally superior AI approach based on dimensionality reduction for analysis and inverse design of photonic nanostructures. The PE-based approach has two to three orders of magnitude reduction in the required computation for the inverse design of a typical photonic nanostructure without imposing much error compared to using a FNN. It also applies to non-unique problems with no major difference. By breaking the large non-unique inverse design problem into a large one-to-one problem and a small non-unique problem, the disclosed PE-based approach can further facilitate the inverse design of photonic nanostructures, especially through employing more rigorous optimization techniques for the last stage (from the RDS to the DS), while such rigorous techniques cannot usually be employed for the original non-unique problem due to the excessive computation requirements.
It is to be understood that the embodiments and claims disclosed herein are not limited in their application to the details of construction and arrangement of the components set forth in the description and illustrated in the drawings. Rather, the description and the drawings provide examples of the embodiments envisioned. The embodiments and claims disclosed herein are further capable of other embodiments and of being practiced and carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein are for the purposes of description and should not be regarded as limiting the claims.
Accordingly, those skilled in the art will appreciate that the conception upon which the application and claims are based may be readily utilized as a basis for the design of other structures, methods, and systems for carrying out the several purposes of the embodiments and claims presented in this application. It is important, therefore, that the claims be regarded as including such equivalent constructions.
Furthermore, the purpose of the foregoing Abstract is to enable the United States Patent and Trademark Office and the public generally, and especially including the practitioners in the art who are not familiar with patent and legal terms or phraseology, to determine quickly from a cursory inspection the nature and essence of the technical disclosure of the application. The Abstract is neither intended to define the claims of the application, nor is it intended to be limiting to the scope of the claims in any way.
This application is a continuation application of U.S. patent application Ser. No. 17/233,140, filed on 16 Apr. 2021, which is a continuation application of U.S. patent application Ser. No. 17/010,262, filed on 2 Sep. 2020, which claims the benefit of U.S. Provisional Application Ser. No. 62/895,466, filed on 3 Sep. 2019, the contents of which are incorporated herein by reference in its entirety as if fully set forth below.
This invention was made with government support under grant/award number N00014-18-1-2055 awarded by the Office of Naval Research. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62895466 | Sep 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 17233140 | Apr 2021 | US |
Child | 17474523 | US | |
Parent | 17010262 | Sep 2020 | US |
Child | 17233140 | US |