The present invention is related to physics-informed machine learning, and, in particular, utilizing physics-informed neural networks to examine internal structures and defects of solid materials, including (1) geometry identification and (2) design of optimized and/or improved geometry.
Deep learning approaches play an increasingly significant role in a wide range of technologies that benefit computer vision, natural language processing, and other data-rich areas of societal interest. Despite the evolving sophistication of data analytics and neural networks (NNs), much of this work to date has not been predicated on a large volume of scientific data, through which predictive models can be constructed using experimentally validated mechanistic inferences and laws of physics. In most scientific applications, by contrast, physical conservation laws (such as those for momentum and energy) are framed by highly general, mathematical formulations (e.g., those invoking partial differential equations (PDEs) in areas such as solid mechanics, fluid mechanics, and material diffusion), along with experimental authentication by recourse to laboratory tests.
According to one aspect, a method for analyzing an aspect of a solid material/structure is provided. The method includes receiving one or more geometric variables as one or more inputs to physics-informed neural networks (PINNs). The method also includes characterizing/parametrizing a first geometry according to the one or more geometric variables. The method further includes identifying one or more aspects of the solid material/structure based on the first geometry, wherein characterizing/parametrizing a first geometry is performed in a trainable manner.
According to another aspect, a method for analyzing an aspect of a solid material is provided. The method includes receiving one or more geometric variables as one or more inputs to physics-informed neural networks (PINNs). The method also includes generating features and parameters of a second geometry of one or more aspects of a solid material, wherein generating features and parameters of a second geometry achieves an improved performance based on one or more objectives, wherein an improved performance based on one or more objectives is performed in a trainable manner.
In some exemplary embodiments of the methods, the one or more aspects of the solid material/structure includes at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries. In some exemplary embodiments of the methods, the one or more aspects of the solid material/structure includes one or more defects in the solid material/structure.
In some exemplary embodiments of the methods, the trainable manner includes substituting one or more geometric variables such as geometry trainable variables, geometry-dependent training points, and/or making the gradient with respect to geometry tractable.
In some exemplary embodiments, characterizing/parametrizing a first geometry includes inversely characterizing a first geometry of at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries according to displacement data. In other exemplary embodiments, generating features and parameters of a second geometry includes inversely designing the second geometry of at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries according to a predefined objective function.
In some exemplary embodiments, identifying one or more aspects of the solid material/structure based on the first geometry includes concurrently identifying one or more full field stresses, strains, and/or displacements in the one or more aspects of the solid material/structure.
In some exemplary embodiments of the methods, a framework for inversely characterizing and/or designing involves unknown/moving domains directly parameterizing a computational domain/geometry with material and geometry parameterization. In some exemplary embodiments, inversely characterizing a first geometry includes minimizing a discrepancy/loss between the displacement data and one or more results of a forward solver. In other exemplary embodiments, inversely designing a second geometry includes minimizing the objective function.
In some exemplary embodiments of the methods, characterizing/parametrizing a first geometry or generating features and parameters of a second geometry includes representing of at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries by analytical function(s), parameterized function(s), a non-uniform rational basis spline (NURBS) or other neural network(s). Additionally, a shape of the at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries are simple or arbitrarily complicated.
In some exemplary, embodiments, concurrently identifying the full field stresses, strains, displacements in the one or more solid materials/structures includes one or more different shapes and/or topologies of the one or more solid materials/structures and different constitutive models for describing the mechanical properties of the one or more solid materials/structures. Further, the different constitutive models for describing the mechanical properties of the one or more solid materials/structures include measuring linear elasticity, nonlinear elasticity or hyperelasticity, and plasticity.
In some exemplary embodiments of the methods, the trainable manner includes a pretraining process for the PINNs. This pretraining process includes maintaining one or more estimated unknown parameters θ defined as fixed/not trainable and updating one or more trainable parameters of a neural network (NN) λ for one or more iterations. The pretraining process also includes solving one or more forward problems to capture a qualitative pattern of a displacement field and a stress field. The pretraining process further includes solving one or more forward problems until both a loss function and one or more estimated geometric parameters reach a relative plateau following pretraining of the PINNs. Additionally, both λ and θ are then trainable, with λ converged towards to a desired local minimum, and the pretraining process stabilizes the trainable manner of the PINNs. In some exemplary embodiments, an estimation of geometric parameters is automatically updated as the PINNs minimize the loss function during the pretraining and/or training process for the PINNs, including enforcing one or more diverse types of conditions in problem definition for integration into the PINNs in the form of the loss function during the pretraining and/or training process for the PINNs.
In some exemplary embodiments of the methods, the one or more geometric variables parameterize the computational domains of partial differential equations (PDEs) and boundary conditions. Additionally, the one or more geometric variables are first defined as trainable before expressing one or more locations of residual points as functions of the one or more geometric variables. In some exemplary embodiments, the one or more locations of the residual points are automatically updated as an estimation of the one or more geometric variables are updated throughout the training process. Additionally, the one or more locations of the residual points for one or more different conditions are in their correct domains, allowing the capturing of a gradient of a loss function with respect to the one or more geometric variables. Further, the residual points for the one or more different conditions allows for the PINNs to correctly update estimation and/or design of the one or more geometric variables throughout the training process and characterize/design the at least one of one or more internal structures, internal surfaces/boundaries, external structures, or external surfaces/boundaries and/or defects.
In some exemplary embodiments, characterizing/parametrizing a first geometry includes accurately estimating the one or more geometric variables and one or more material parameters with limited non-destructive measurements. Accurately estimating the unknown geometric and one or more material parameters includes a relative error O(10−2) when proper displacement data are supplied to ensure identifiability. Additionally, characterizing/parametrizing a first geometry further includes placing one or more displacement measurement points only on an boundary of a solid material/structure.
According to another aspect, a method for utilizing physics-informed neural networks (PINNs) to examine internal structures and defects of solid materials/structures is provided. The method includes applying a neural network to approximate the primary solution fields. The method also includes integrating one or more mechanical laws into the PINN by deriving relevant mechanical quantities of interest from one or more neural network (NN) outputs, such as strain, stress, and/or residual of equilibrium partial differential equations (PDEs). The method further includes formulating a loss function (λ, θ), wherein the loss function
(λ, θ) measures a discrepancy between predicted mechanical quantities of interest and their respective true values provided by the one or more mechanical laws and measured data. The method further includes conducting parameter estimation through a training of the PINN, wherein the training of the PINN includes updating/training unknown parameters θ=(θmat, θgeo) and neural networks parameters λ to minimize the loss function.
According to another aspect, a method for utilizing physics-informed neural networks (PINNs) to examine internal or external structures and defects of solid materials/structures is provided. The method includes minimizing PDE(λ, θ),
BC(λ, θ), wherein minimizing
PDE(λ, θ),
BC(λ, θ) includes satisfying a governing partial differential equation (PDE) and one or more boundary conditions as the PINN seeks to minimize a loss function. Additionally, one or more constraints are satisfied through minimizing
cnstr(λ, θ), wherein the one or more constraints are directly incorporated through designing an architecture of PINNs. Further, a design target is achieved through minimizing
target(λ, θ), and one or more relevant geometric parameters in θ are adjusted to minimize the loss function and realize a design of optimal geometry.
The present disclosure will be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the disclosure are shown. This disclosure may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Like numbers refer to like elements throughout. Other objects, features and advantages of the present disclosure will become apparent from the detailed description of the disclosure, which follows when considered in light of the accompanying drawings in which:
Aspects of the present disclosure include methods involving physics-informed deep learning to help solve inverse problems of solid materials/structures related to unknown geometry. Practically, this method helps to (1) identify and characterize unknown materials/structures and defects at great levels of accuracy and predictive capability with limited non-destructive measurements, and (2) design geometrical features and parameters of solid materials and structures to achieve optimized and/or improved performance.
Emerging research reveals the profound untapped potential of physics-based, multidisciplinary, deep learning approaches with unprecedented opportunities for scientific and engineering advances in molecular analysis, design of materials with improved properties and performance in structural and functional applications, and unique pathways for the characterization of properties of materials. To further realize this potential, broadly applicable methodologies in the area of NNs are needed to address a variety of issues that underpin deep learning analyses, governed by physical laws and guided by mathematical formulations. To this end, a physics-informed deep learning approach has recently been proposed for the simulation of systems governed by physical laws that are represented by PDEs. While traditional methods based on deep learning encode such formulations implicitly by feeding training data governed by equations, this approach explicitly encodes known physical or scaling laws in the form of mathematical equations into the standard structure of NNs, formulating the so-called Physics-Informed Neural Networks (PINNs). Such an approach integrates any existing knowledge expressible in terms of PDEs during the learning process, thereby markedly improving predictability while reducing the amount of data required to achieve a desired level of accuracy. Studies have shown the applicability of PINNs in addressing a wide spectrum of forward and inverse problems spanning disciplines such as fluid mechanics, quantum mechanics, and solid mechanics. Such applications have shown promise for enhancing predictability when the amount of data is limited or when the problem is ill-posed, situations in which existing methods are not likely to yield accurate and reliable results. This approach has been further extended to offer new pathways to address relevant mathematical formulations, such as stochastic PDEs and fractional PDEs.
Aspects of the present disclosure address this urgent need and overcome the various disadvantages of previously known systems and methods by addressing geometry identification and geometry design problems in the broad field of continuum solid mechanics with potential applications in many branches of engineering, sciences and medicine. In a geometry identification problem, the unknown geometric features and parameters are determined in a solid material/structure given measured material response under static or dynamic loading, thereby characterizing unknown structures including internal defects or boundaries such as voids, vacancies or holes, inclusions and reinforcements, and/or cracks. In a geometry design problem, one needs to identify the optimal geometric feature so that the performance of the material/structure is optimized and/or improved under certain metrics. Traditionally, computational algorithms for geometry identification/design are established based on the finite element method (FEM) as the forward solver. Beyond the forward solver, considerable effort is required for the design and implementation of iterative algorithms for updating the estimated/designed values of geometric parameters. Through the iterative algorithm, for identification problems, the discrepancy (loss) between the observed data and the results of the forward solver is minimized; for design problems, the objective function is minimized. However, the embedded forward FEM solver as a mesh-based method inherently brings about complications in these algorithms. The estimated/designed geometry is updated by repeatedly remeshing the domain through iterations. Alternatively, the unknown domain is embedded in a larger fixed domain while introducing an auxiliary field to track the presence of material. The problem becomes even more challenging when large deformations (i.e., geometric nonlinearity) and nonlinear mechanical properties (i.e., highly nonlinear constitutive behavior of the solid material) are involved. Available methods are cumbersome and resource-intensive for deriving automated solutions to such inverse problems involving unknown geometry.
The present disclosure relates to a novel method based on physics-informed machine learning, and, in particular, utilizing physics-informed neural networks to examine internal structures and defects in materials. It is noted that these unknown internal structures and defects can also be referred to as aspects. It is also noted that these materials can be solid or non-solid, despite illustrative embodiments treating the materials as primarily solid. This method solves inverse problems related to unknown geometry, which in practice can be applied to (1) identify and characterize unknown internal structures and defects with limited non-destructive measurements, and (2) design geometry of solids to achieve optimized and/or improved mechanical performance including maximizing or minimizing individually or in certain combinations the stresses, strains, displacements, elastic or inelastic energies, etc. According to some embodiments of the present disclosure, a first geometry can refer to the identification and characterization of unknown internal structures and defects with limited non-destructive measurements, and a second geometry can refer to designing geometry of solids to achieve optimized and/or improved mechanical performance including maximizing or minimizing individually or in certain combinations the stresses, strains, displacements, elastic or inelastic energies, etc.
The instant disclosure presents a unique, systematic approach based on PINNs for solving geometry identification and geometry design problems in continuum solid mechanics. PINNs integrate known PDEs of importance in solid mechanics with NNs, composing a unified computational framework involving both the forward solver and the inverse algorithm. Notably, the disclosure provides a method for directly parameterizing the geometry of the solid in a differentiable and trainable manner (i.e., geometry as trainable variables; geometry-dependent training points; making the gradient with respect to geometry tractable). According to the present disclosure, these geometric variables can be inputted into PINNs to advance the systems and methods described herein. An additional technique (i.e., pretraining procedure) is important for stabilizing the training process of the PINN. For identification problems, this method can concurrently solve unknown material parameters in addition to geometric parameters.
According to the exemplary embodiments described herein in detail, for identification problems, the PINN inversely characterizes the geometry of the void/inclusion according to the displacement data. For design problems, the PINN inversely design the optimized and/or improved geometry of the void/inclusion according to a predefined objective function. According to the present disclosure, this design process can also be referred to as generating the optimized geometry. According to some examples, optimized performance can refer to improving performance. As the method completes the identification/design task, it concurrently identifies the full field stresses, strains, displacements in the solid structure. This framework for inverse problems involving unknown (moving) domains directly parameterizes the computational domain (geometry), and can deal with material and geometry parameterization, even for large deformation.
To test the performance of the method with various parametric assessments, the instant disclosure builds a set of detailed cases for identification and design problems. The examples include different shapes and topologies of the structure and different constitutive models for describing the mechanical properties of the solid material, including linear elasticity, hyperelasticity, and deformation plasticity. For the particular case of inclusion in identification problems, the PINN is also required to estimate the unknown material parameter of the inclusion, through which the instant disclosure demonstrates the capability of our model in solving combined material and geometry identification problems. As a proof of concept for the method of the subject applications, the instant disclosure focuses on two-dimensional plane strain static problems. Result is accurate (relative error no more than O(10−2)).
According to the exemplary embodiments described herein in detail, the present disclosure establishes the general formulation of PINNs in continuum solid mechanics involving both (material and geometry) identification and design problems.
The present disclosure firstly summarizes the basic workflow of PINNs for identification problems as follows: First, it is contemplated that one can apply a neural network (NN; with trainable parameters λ) to approximate the primary solution fields (top left panels in (λ, θ), which measures the discrepancy between the predicted mechanical quantities of interest and their respective true values provided by mechanical laws and measured data (bottom right panels in
where the α's with various subscripts refer to the weights of corresponding loss terms and the three loss terms correspond to PDEs, boundary conditions (BCs), and data, respectively. Each loss term j(λ, θ) (where j=PDE, BC, Data) is the mean squared error evaluated on Nj residual points
where rj is the residual of the condition j at the ith residual point Xj(i)(θgeo). The Nj residual points for condition j are distributed in the domain of condition j to correctly evaluate j(λ, θ). The coordinates of the residual points Xj(i) depends on geometric parameters θgeo because of the variable computational domain, which will be explained in detail in subsequent portions of the disclosure. Lastly, it is contemplated that one can conduct parameter estimation through the training of the PINN (bottom left panels in
where the hat symbol refers to the value of these trainable parameters after the training process completes. As the solution to the inverse problem, the estimation of the unknown parameters is {circumflex over (θ)}. More details of the formulation are shown throughout the remaining disclosure.
The aforementioned summary describes the procedure for solving geometry (and material) identification problems with the proposed methods. With minimum change of the loss function, this method can be applied to geometry design problems for optimized and/or improved performance, as also shown in
where the additional two loss terms cnstr(λ, θ) and
target(λ, θ) correspond to the constraint in the design problem and the target function to be minimized in the problem. For example, in structural engineering, one often needs to design a structure with the maximum stiffness (hence minimum compliance) under a constraint for the total weight/volume of the structure. In this case,
cnstr(λ, θ) penalize the violation of the weight/volume constraint, and
target(λ, θ) may be defined as the compliance of the structure. In the training process, as the PINN seeks to minimize the loss function, the governing PDE and the boundary conditions are satisfied through minimizing
PDE(λ, θ),
BC(λ, θ); the constraints are approximately satisfied through minimizing
cnstr(λ, θ); the design target is achieved through minimizing
target(λ, θ). In this process, any relevant geometric parameters in θ are adjusted in order to minimize the loss function, thereby realizing the design of optimal geometry. Note that, in some situations, constraints can alternatively be directly incorporated through designing the architecture of PINN instead.
According to the exemplary embodiments described herein in detail, geometric parameters θgeo play an essentially different role in the inverse problem compared to material parameters θmat. Material parameters parameterize the governing PDEs of mechanics, which are naturally endowed with trainability through automatic differentiation of (physics-informed) neural networks. As a result, material parameters can be directly estimated using the standard formulation of PINNs for inverse problems. Geometric parameters θgeo, on the other hand, parameterize the computational domains of the PDEs and boundary conditions, which do not naturally serve as trainable parameters in the framework of PINNs. To make the geometric parameters θgeo differentiable and hence trainable in a similar way to material parameters θmat, the present disclosure parameterizes the coordinates of residual points by geometric parameters θgeo. Technically, it is contemplated that such parameterization can be implemented by utilizing the definition of trainable variables in deep learning libraries (e.g., TensorFlow, Pytorch): the present disclosure first defines the geometric parameters θgeo as trainable variables; then, expresses the locations of residual points as functions of these trainable variables. As a result, the coordinates of residual points are automatically updated as the estimation of θgeo are updated throughout the iterative training process. In with respect to the geometric parameters θgeo, which otherwise could not be realized using the standard formulations of PINNs. With the geometry-parameterized residual points, the PINN can correctly update the estimated/designed geometric parameters θgeo throughout the training process, thereby characterizing/designing the unknown geometry. It is contemplated that such form of parameterization invoking PINNs to solve geometry identification problems have hitherto not been addressed. Practically, θgeo may be defined by assuming the shape of the void (e.g., circle, ellipse) and then assigning the unknown parameters to θgeo; alternatively, for more flexibility, the void shape, or other shapes, may be represented by analytical function(s), parameterized function(s), non-uniform rational basis spline (NURBS) or another neural network, where a large number of parameters in θgeo represents the irregular shape of the void. In this way, the shape of the at least one of one or more internal structures, internal surfaces, internal boundaries, external structures, external surfaces, or external boundaries can be arbitrarily complicated or simple (including but not limited to circular and elliptical shapes), or, as complicated as the methods of the present disclosure can describe.
In another aspect of the present disclosure, it is contemplated that one can find it important to pretrain the model before using the model to characterize/design unknown geometry. If the model is directly applied without pretraining, the estimated/designed geometric parameters rapidly depart from physically admissible values (e.g., void located outside the matrix) after a few iterations. Inspired by the transfer learning technique, the present disclosure proposes to maintain all the estimated unknown parameters θ fixed (not trainable) and only update the trainable parameters of the NN λ for the first few iterations. During this pretraining process, the PINN essentially solves a forward problem, seeking to roughly capture the qualitative pattern of the displacement field and the stress field. After this pretraining process, both λ and θ can be made trainable, which initiates the parameter estimation/design. Such a pretraining procedure induces λ to converge towards to the desired local minimum, hence serving as a good initialization for geometry identification and design problems. For the prototypical problem of the current disclosure, technically, the PINN needs to be pretrained until there emerged a qualitative pattern indicating the existence of a stress concentration around the curved boundary or interface.
The present disclosure demonstrates the efficacy of the method for geometry identification problems by modeling a two-dimensional prototypical problem on a matrix-void/inclusion system as a proof of concept (see
It is noted that, in accordance with exemplary embodiments, the solution of the aforementioned Cases 0-5 in
Another aspect of the present disclosure adapts a finite element solver (e.g., commercial software such as Abaqus, open-source library such as FEniCS) to generate the computational examples. In this disclosure, Abaqus can be adopted as the finite element solver. Specifically, the present disclosure presets reference values of unknown parameters to be θ* and conducted forward simulations, which generated the displacement data provided to the PINN and ground-truth full-field solution for assessing the performance of the PINN. The PINN initialized the estimation of unknown parameters to be θ0. The PINN firstly went through a pretraining procedure for stabilizing the forward prediction, where the estimated parameters were fixed to be θ*. As the PINN initiated parameter estimation through the iterative training process, it is contemplated that the estimated parameters θ would migrate towards the correct value θ*. The training process terminated after the loss function and the estimated parameters reached a relative plateau, yielding the parameter estimation results {circumflex over (θ)}. The detailed setups of the prototypical problem, the finite element solver, and the hyperparameters of the PINN for geometry identification problems are included in Paragraphs 00085-000105.
The present disclosure provides the results for Cases 0-5 in the main text. The results of parameter estimation for Cases 0-5 are shown in view 50 of
It is worth noting that the estimated shear modulus of the inclusion μi in Case 5 has an error slightly more than 10%. To improve the accuracy of Case 5, the present disclosure supposes that five additional data points inside the solid are available as in Case 2. The present disclosure re-trains the PINN with the expanded measurement data and append the results as the modified Case 5, labeled Case 5 (With Internal Data) in
It is noted that, in accordance with exemplary embodiments, the method of the present disclosure is not only capable of estimating unknown parameters, but also providing quantitative measures of the deformed patterns of the solid as well. Specifically, the present disclosure applies the estimation results {circumflex over (λ)} and {circumflex over (θ)}geo (see Eq. 3) on the neural network part of the PINN (top left panels in
In addition to the reconstruction of deformation pattern, the method of the present disclosure is also capable of reconstructing the distribution of related mechanical quantities within the entire volume of the solid, including strain and stress components, principal stresses, elastic energy density, etc. For Case 4, for example, can be shown in
In addition to the aforementioned Cases 0-5 on inverse material/geometry identification problems, the present disclosure also considers illustrative examples of inverse problems on the design of optimal geometry. As a first illustrative example, this disclosure considers a problem on designing fiber-reinforced materials (called as Case 6 hereinafter), as shown in diagram 90 of
To solve this problem, the subject disclosure proposes to represent the fiber-matrix interface with a NN which takes x1 (the horizontal coordinate) as input and outputs/(the thickness of the fiber at the given x1). The trainable parameters of this NN are viewed as unknown geometric parameters to be designed. A representative volume element (the dashed box in cnstr is defined to make sure that fiber makes up 20% of the area/volume. Uniform strain loading is applied on left and right boundaries.
target is defined as the mean normal stress in the x1 direction on residual points. As the shape of the interface changes, the present disclosure anticipates that this mean normal stress response is maximized, which corresponds to the optimal design with maximum stiffness. Theoretical derivations for composite materials have revealed that the maximum stiffness is achieved under the isostrain state, which means that the thickness of the fiber can be uniform (in our case t(x1)=0.2 for all x1) for achieving maximum stiffness in x1 direction.
Continuing to graph 100 in
An additional illustrative example (Case 7) is presented in views 120 and 122 respectively of
The instant disclosure considers two situations regarding the geometry parameterization: (1) the void is constrained to be a centered ellipse (Case 7A in
Notably, in this example, the instant disclosure enforces the constraint of the area not through the addition of loss term cnstr as a soft constraint. Instead, the constraint is realized in a hard way by normalizing the void geometry following the output of the NN for describing r(θ), so that the area is fixed whatever the trainable parameters are.
This aspect of the instant disclosure examines the results in Case 7A from the PINN for designing the aspect ratio a/b and the tilting angle θ to minimize the maximum value of the maximum principal tensile stress in the entire solid. Due to the setup of the biaxial loading with P1:P2=2:1, if the void is circular, the largest tensile stress will be located at the top/bottom of the void. To avoid this, the void is anticipated to increase its aspect ratio and align its major axis along x1 direction, so that a less sharp (a larger curvature radius) at the top/bottom of the elliptical void reduces the local stress concentration.
The results of the void outline for Case 7A are shown in graph 130 of
Graph 160 of
In addition to the aforementioned Cases 7A and 7B for the matrix-void system, the present disclosure also examines the method performance in terms of more complicated problems involving multiple mechanical loading conditions.
The instant disclosure considers an example (Case 8) of designing the shape of a simple bridge as a proof of concept for demonstrating the practical applicability of our method in mechanical and civil engineering as shown in diagram 190 of
The present disclosure includes an additional computational case considering multiple loading conditions in addition to the aforementioned Case 7C. The illustrative example (Case 9) considers applying multiple loadings on a structure and seeking to optimize the performance of the structure under a metric that involves these loading conditions. Technically, to implement such problems, it is noted that the PINN is slightly modified to include an additional pseudo-time/as the input of the neural network, which essentially indicates the loading steps. As a proof of concept, the illustrative example of the present disclosure considers a curved beam structure 232, as shown in view 230 of
The illustrative example of the present disclosure seeks to find a target value θ through these two design targets, so that the homogenized (equivalent) stiffness of the curved beam 232 satisfies the desired values under both small and large deformations.
The results are shown in graphs 240 and 250 of
This aspect of the present disclosure further provides the detailed formulation of PINNs for forward and inverse problems in continuum solid mechanics. This focus concerns the PINN for hyperelasticity (specifically, incompressible Neo-Hookean material) as most of the computational examples of the present disclosure adopt this material. To better clarify the quantitative formulation, here the present disclosure denotes all the material and geometric parameters of interest as θmat and θgeo, respectively. For incompressible Neo-Hookean materials, the only material parameter is the shear modulus μ so that θmat=μ. The unknown part of θ=(θmat, θgeo) in the inverse problem is denoted as θunk.
The workflow of PINNs comprises four steps in the present disclosure. First, it is contemplated that one can apply a NN to approximate the primary solution fields (top left panel in
Secondly, it is contemplated that one can integrate mechanical laws into the PINN architecture (top right panel in
where I is the identity tensor, Eq. A1 is kinematics, and Eq. A2 is the constitutive relation for incompressible Neo-Hookean materials. The residuals of the equilibrium PDE and the incompressibility condition at X are expressed by:
The residuals of Dirichlet/displacement and Neumann/traction boundary conditions at X are:
where N is the outward unit normal vector on the boundary, and u and T are the specified displacement and traction on the boundary, respectively. For inverse problems, the present disclosure provides displacement data{u*(i)}i=1N
Thirdly, it is contemplated that one can formulate the loss function according to the foregoing residuals from mechanics and data (bottom right panel in
and the loss function is:
where αPDE, αinc, αD, αN, αu are the weights of the loss terms. Note that the two loss terms D and
N for the two types of boundary conditions are simplified into
BC in Eq. 1 in the main text.
Lastly, it is contemplated that one can conduct parameter estimation through training/loss minimization (bottom left panel in
With the PINN adjusting λ to minimize the loss function, it is contemplated that one can anticipate that all the mechanical laws will be approximately satisfied, making the NN serve as an approximation to the primary solution fields. Furthermore, the residual of displacement observations in the loss function guides the estimated unknown parameters to evolve towards their respective target values. In this way, the PINN is able to solve inverse problems.
The present disclosure further contemplates the differences of PINNs for linear elasticity (
To build the PINN for linear elasticity, kinematics in Eq. A1 is replaced by the definition of the infinitesimal strain tensor εij=∂ui/∂xj, where u=(u1, u2) is the displacement. Stress-strain relationship in Eq. A2 is replaced by the stress-strain relationship
where λ and μ are Lamé constants. The equilibrium equation similar to Eq. A3 is ∂σij/∂xj=0. Note that the loss function for (compressible) linear elasticity does not require the term for incompressibility (see Eq. 1 and Eq. A13).
The PINN for the power-law deformation plasticity can be constructed based on the PINN for linear elasticity. In this case, the nonlinear stress-strain relation is expressed as
where E (Young's modulus), α, αY (yield stress), and n (hardening exponent) are material parameters. s is the deviatoric stress tensor
is the von Mises stress.
According to Eq. A16, stress cannot be explicitly expressed by strain for power-law deformation plasticity, unlike linear elasticity (
According to the exemplary embodiments herein, in the case of inverse problems for multiple materials as in Case 5 in the main text, the present disclosure applies two independent NNs to approximate the displacement and pressure fields of the matrix and inclusion materials, respectively. Such approximation can be expressed by
In addition to the foregoing loss terms for each material, an additional loss term int can be utilized to force the continuity of displacement and traction on the interface Γint(θgeo) of two materials. To do this, the residuals can be written first on a single point for the two conditions of continuity. The residual of displacement continuity is
and the residual of stress continuity is
Then, place residual points XΓ(i)(θgeo) (i∈{1,2, . . . , NΓ}) on Γ(θgeo), and construct the loss. component on the interface of two materials by taking a weighted sum as
Hence, the loss function in the case of multiple materials is the summation of the loss function for each single material and the additional loss int for the material interface.
For forward problems with no unknown parameter (θunk=Ø, hence θ=Ø for convenience), measurement data may not be available. For incompressible materials, for example, the loss function can be written as a weighted sum of all the four loss terms that correspond to PDEs, the incompressibility condition, displacement boundary conditions, and traction boundary conditions, respectively:
where αPDE, αinc, αD, αN are the weights of the loss components. Each loss term is defined by
The solution of forward problems using PINNs can be expressed as:
The displacement solved by the PINN is ũ(X; {circumflex over (λ)}) for X∈Ω.
According to the exemplary embodiments of the present disclosure, it is noted that for Case 0, the Young's modulus of the matrix is E=1.0, and the Poisson's ratio is v=0.3. For Cases 1, 2, 4 and 5, the shear modulus of the matrix (incompressible Neo-Hookean material) is μi=0.333, and the external load is P0=0.3. For Case 3, the parameters fed to the PINN include material parameters E=1.0, α=0.1, n=10, and σY=0.005 and the external load P0=0.002. For Cases 0, 1, 3, 4, 5, there are Nu=1.0 measurement points of on each edge of the matrix. For Case 2, measurement points are inside the solid, with 10 of them uniformly placed on the line, X1=−0.15, X1=0.15, X2=−0.45, X2=0.45 respectively. For the modified Case 5 with additional measurements, the five additional measurement points are located at (−0.3,0.05), (−0.3,0.10), (−0.3,0.15) in the matrix and (−0.05,0.10), (0.15,0.10) in the inclusion. However, these are merely illustrative embodiments, and not meant to be comprehensive of all possibilities when considering identification problems utilizing the PINNs of the present disclosure.
The present disclosure can utilize TensorFlow 1.14 to build up our PINN. Each NN in the PINN has 4 hidden layers, each with 30 neurons. For Cases 0, 1, 2 and 4, there is one single NN in the PINN. There are two NNs in the PINN in Case 3 (one NN for (u1, u2), the other NN for (σ11, σ12, σ22)) and Case 5 (one NN for each material). The present disclosure can adopt the layer-wise adaptive “tanh” function as the activation function for the NNs. Among the trainable parameters of the NN, weights are initialized with Xavier initialization, biases are initialized as zeros, and the variable of adaptive activation are initialized as ones. The illustrative example of the present disclosure utilizes the Adam optimizer to train the network. The learning rate is 0.001 for all cases except Cases 2 and 3.
For the illustrative embodiment in Case 2, the unknown geometric parameters directly defined in the code are the location of the center of the slit (X1(c), X2(c)), half length of the slit L, and tilting angle of the slit Γ, which are post-processed to be the coordinates of the locations of the centers of the slit tips (X1(1)=X1(c)−L sin Γ, X2(1)=X2(c)+L cos Γ, X1(2)=X1(c)+L sin Γ, X2(2)=X2(c)−L cos Γ). The learning rate is 0.001 for Γ and 0.0002 for (X1(c), X2(c), L). For Case 3, the learning rate is 0.00005.
For the illustrative embodiments Cases 0, 1, 3 and 5, the assignments of residual points are similar. 3200 internal points for the PDE and incompressibility are assigned for the illustrative embodiment (800 points for each of the four sub-regions; 40 along the circumferential direction and 20 along the radial direction). For Case 5 only, 400 residual points are assigned in the inclusion material. The illustrative embodiment of the present disclosure places 160 residual points on the outer boundary for enforcing traction boundary conditions. Another 160 residual points are placed on the inner boundary of the matrix, which enforce the traction-free boundary conditions for Cases 1 and 3 and the interface conditions for Case 5. The points of displacement measurements are uniformly placed on the outer boundary of the matrix.
For the illustrative embodiment Case 2, the smallest length scale is determined by the W, which poses a limitation for the interval of residual points. To accurately resolve the displacement field, especially around the slit tip where the displacement changes drastically and the stress concentration exists, a large density of residual points may be needed in the computational domain. For this case, there are 49000 internal points, 700 points on the outer boundary, and 700 points on the inner boundary. For the illustrative embodiment Case 4, the square region can be divided into two parts, each with one void. For each part, residual points are assigned in a similar way to Case 1.
For the illustrative embodiments Cases 0-5, the weights of the loss terms in Eq. A13 to be αPDE=αinc=αD=αu=1 are designated when applicable. The weight for traction boundary conditions αN is also set to be 1, but the present disclosure evaluates this part separately for each of the five boundaries (left, right, top, bottom, inner), leading to five loss terms related to traction boundaries, each with weight 1, in the total loss. In Case 2, the calculation of the mean squared error of the PDE loss term PDE(λ, θ) is further weighted inversely by the local density of residual points, so that different spatial regions contribute equally to the loss term of PDEs.
For illustrative embodiments Cases 0 and 3, the small values of loading (roughly O(10−3)) cause difficulties in training the PINN. To mitigate this issue, the external loading (and σY=0.005 in Case 3) can be scaled up 100 times and feed the scaled loading into the PINN. According to Eqs. A15 and A16, such scaling only results in 100 times the original displacement (and hence strain) and does not essentially changes the characteristics of the solution. After obtaining the solution from the PINN, the displacement/strain solution can be scaled back (0.01 times) to its original value.
Illustrative embodiments of the present disclosure may use Abaqus as the FEM solver to generate the displacement data {u*(i)}i=1N
According to exemplary embodiments of the present disclosure, a simple comparison on the model performance in terms of accuracy and efficiency with different strategies on optimizers based on Case 4. In the main text, the Adam optimizer can be adopted as the only optimizer throughout the entire training process with 1M iterations (called strategy 1A in this section), after which both the parameter estimations and loss function reach a relative plateau. Such a strategy gives high accuracy and can help facilitate study the convergence history as a fundamental characteristic of our method. To achieve a reasonable accuracy practically, one may not need as many as 1M iterations. Here, the present disclosure considers training the PINN over 200K iterations only (called strategy 1B). To further improve the computational efficiency, it is also contemplated that one may consider using Adam for the first few iterations (40K iterations in our case), and then switching to L-BFGS (48) (called strategy 2), which is common for training PINNs practically. Then, one can compare the results of the three strategies (1A, 1B and 2) in view 220 of
The present disclosure demonstrates the capability of the new method based on PINNs in effectively solving inverse problems in solid mechanics involving parameterized geometry. Specifically, the present disclosure focuses on the analyses of internal structures and defects in engineering solids for two types of problems: (1) geometry (and material) identification; (2) design of optimized and/or improved geometry. Using illustrative examples, this disclosure demonstrates that the presented method works for different behavior of the solid material, including (1) linear elasticity, (2) hyperelasticity, and (3) deformation plasticity. The present disclosure demonstrates that, for identification problems, the present framework is able to accurately estimate the unknown geometric and material parameters with a relative error O(10−2) when proper displacement data are supplied to ensure identifiability. For design problems, the present disclosure demonstrates that the presented method is able to design optimized and/or improved geometry with a relative error of O(10−3) compared with reference optimal design. In addition to identifying the unknown geometry or optimal geometry, the present disclosure demonstrates the capability of the method in reconstructing the full-field mechanical quantities corresponding to the identified/designed geometry, such as displacement, strain, and stress.
According to the exemplary embodiments described herein in detail, the approach presented in the present disclosure possesses some unique characteristics, endowing this method with some distinct advantages. It is contemplated that the method inherits and extends the applicability and advantages of original PINNs. It is contemplated that the present disclosure provides a unified framework for solving forward problems and inverse problems with unknown parameters in PDEs (material identification) and/or domains (geometry identification; geometry design. Unlike traditional methods based on FEM, there is neither the need to design problem-specific algorithms to update estimated unknown parameters beyond the forward solver, nor the need to repeatedly re-mesh the computational domain throughout the iterations in the present disclosure. In the present disclosure, the update of geometry is realized by the automated process built in the deep learning algorithms. In particular, the estimation of geometric parameters is automatically updated as the PINN seeks to minimize the loss function through the iterative training process. With open-source deep learning libraries (e.g., TensorFlow, Pytorch), the entire length of the PINN code of the present disclosure for the current work is merely a few hundred lines. From the perspective of both design and implementation of the algorithm, PINNs significantly reduce the human effort and related costs in setting up algorithms for inverse problems. Diverse types of conditions in the problem definition can be enforced and integrated into the training guideline of the PINN in the form of loss function. For example, displacement and traction boundary conditions or data (for identification problems) can be accurately enforced in the training process; maximum stiffness or minimum principal stress as the objective function (for design problems) can be integrated into the training guideline within the PINN framework. On the other hand, compared to typical data-driven deep learning approaches, PINNs have the advantage of utilizing well-established mechanics formulations as training guidelines, thereby requiring data only for the current instance of the problem setup and ensuring data-efficiency. It is noted that data-driven deep learning methods, on the other hand, require a large dataset containing numerous problem setups.
It is noted that, in accordance with exemplary embodiments, the present disclosure adopts the Adam optimizer as the optimization algorithm to achieve best accuracy and to study the convergence history as a fundamental characteristic of our method. The PINN is trained until both loss function and the estimated parameters reach a relative plateau. With such a setup, the computational time for Case 4 of identification, for example, is around 11 hours on a typical machine (with CPU only) to complete the entire 1M iterations and achieve high accuracy. It is contemplated by the present disclosure that reasonable accuracy has been achieved within the first 200K iterations. One may further combine Adam and the L-BFGS optimizer to achieve similar accuracy within much less computational time (around 30 minutes; see Paragraph 000105 and
The present disclosure focuses on the matrix-void/inclusion system in Cases 0-5 as a simple proof of concept for identification problems, seeking to characterize the internal structures with static loading on outer boundaries. According to Saint-Venant's principle, under static loading, the inhomogeneous stress and deformation states caused by the internal void/inclusion decays as the distance from the void/inclusion increases. Subsequently, the measurements on outer boundaries essentially provide the PINN with limited amount of information regarding the internal void/inclusion. Modern experimental techniques have adopted dynamic external loading such as ultrasound to acquire time-dependent measurements, through which it is anticipated that the performance of the method of the present disclosure will benefit from more information provided by measurements.
The applications presented in this disclosure can involve a wide range of engineering problems. Geometry design problems are prevalent in mechanical, civil and material engineering, where one needs to design the geometry, topology, and/or internal structures of materials/structures for achieving optimized and/or improved performance based on certain objectives as guidelines. Geometry identification problems are closed related to defect detection, which represents a broad class of practical engineering needs in various fields, where identification and characterization of internal structures and defects in materials are essential. Illustrative embodiments of the present disclosure include experimental techniques, which have so far been developed for different materials based on ultrasound, active thermography, eddy current, optical coherent tomography, and microwave. By integrating the respective physical principles in these problems, it is contemplated that the approach of the present disclosure can be combined with these techniques for dealing with unknown/moving geometries, which extends our method beyond continuum solid mechanics.
Embodiments of the above-described systems and methods can be implemented in digital electronic circuitry, in computer hardware, firmware, software and combinations thereof. The implementation can be as a computer program product. The implementation can, for example, be in a machine-readable storage device, for execution by, or to control the operation of, data processing apparatus. The implementation can, for example, be a programmable processor, a computer, and/or multiple computers.
A computer program is provided in any form of programming language, including compiled and/or interpreted languages, and the computer program can be deployed in any form, including as a stand-alone program or as a subroutine, element, and/or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site.
Method steps can be performed by one or more programmable processors executing a computer program to perform functions of the invention by operating on input data and generating output. Method steps can also be performed by and an apparatus can be implemented as special purpose logic circuitry. The circuitry can, for example, be a FPGA (field programmable gate array) and/or an ASIC (application-specific integrated circuit). Subroutines and software agents can refer to portions of the computer program, the processor, the special circuitry, software, and/or hardware that implement that functionality.
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor receives instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer can include, can be operatively coupled to receive data from and/or transfer data to one or more mass storage devices for storing data (e.g., magnetic, magneto-optical disks, or optical disks).
Data transmission and instructions can also occur over a communications network. Information carriers suitable for embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices. The information carriers can, for example, be EPROM, EEPROM, flash memory devices, magnetic disks, internal hard disks, removable disks, magneto-optical disks, CD-ROM, and/or DVD-ROM disks. The processor and the memory can be supplemented by, and/or incorporated in special purpose logic circuitry.
To provide for interaction with a user, the above described techniques can be implemented on a computer having a display device. The display device can, for example, be a liquid crystal display (LCD) monitor. The interaction with a user can, for example, be a display of information to the user and a keyboard and a pointing device (e.g., a mouse or a trackball) by which the user can provide input to the computer (e.g., interact with a user interface element). Other kinds of devices can be used to provide for interaction with a user. Other devices can, for example, be feedback provided to the user in any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback). Input from the user can, for example, be received in any form, including acoustic, speech, and/or tactile input.
The above described techniques can be implemented in a distributed computing system that includes a back-end component. The back-end component can, for example, be a data server, a middleware component, and/or an application server. The above described techniques can be implemented in a distributing computing system that includes a front-end component. The front-end component can, for example, be a client computer having a graphical user interface, a Web browser through which a user can interact with an example implementation, and/or other graphical user interfaces for a transmitting device. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include a local area network (LAN), a wide area network (WAN), the Internet, wired networks, and/or wireless networks.
The system can include clients and servers. A client and a server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
Packet-based networks can include, for example, the Internet, a carrier internet protocol (IP) network (e.g., local area network (LAN), wide area network (WAN), campus area network (CAN), metropolitan area network (MAN), home area network (HAN)), a private IP network, an IP private branch exchange (IPBX), a wireless network (e.g., radio access network (RAN), 802.11 network, 802.16 network, general packet radio service (GPRS) network, HiperLAN), and/or other packet-based networks. Circuit-based networks can include, for example, the public switched telephone network (PSTN), a private branch exchange (PBX), a wireless network (e.g., RAN, bluetooth, code-division multiple access (CDMA) network, time division multiple access (TDMA) network, global system for mobile communications (GSM) network), and/or other circuit-based networks.
The transmitting device can include, for example, a computer, a computer with a browser device, a telephone, an IP phone, a mobile device (e.g., cellular phone, personal digital assistant (PDA) device, laptop computer, electronic mail device), and/or other communication devices. The browser device includes, for example, a computer (e.g., desktop computer, laptop computer) with a world wide web browser (e.g., Microsoft® Internet Explorer® available from Microsoft Corporation, Mozilla® Firefox available from Mozilla Corporation). The mobile computing device includes, for example, a smartphone or tablet (e.g., iPhone®, iPad®, Android® device, Windows Phone®, etc.).
The terms comprise, include, and/or plural forms of each are open ended and include the listed parts and can include additional parts that are not listed. The term and/or is open ended and includes one or more of the listed parts and combinations of the listed parts.
One skilled in the art will realize the invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting of the invention described herein. Scope of the invention is thus indicated by the appended claims, rather than by the foregoing description, and all changes that come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
The foregoing and other objects, features and advantages will be apparent from the following more particular description of the embodiments, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the embodiments.
Although various aspects of the present disclosure are described herein in terms of various exemplary embodiments, it should be understood that variations and modifications may be made to the disclosure described herein to adopt it to various usages and conditions within the scope of applicant's invention as claimed herein.
The recitation of a listing of elements in any definition of a variable herein includes definitions of that variable as any single element or combination (or sub-combination) of listed elements. The recitation of an embodiment herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.
All patents and publications mentioned in this specification are herein incorporated by reference to the same extent as if each independent patent and publication was specifically and individually indicated to be incorporated by reference.
The present application claims the benefit of U.S. provisional application No. 63/307,478, filed Feb. 7, 2022, the entire contents of which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/012420 | 2/6/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63307478 | Feb 2022 | US |