Physics based animation generally facilitates producing realistic video animations wherein objects, particularly soft objects, deform in response to forces. The computation of object deformation, especially for complex objects such as trees, animals, and people, can be time consuming. Therefore, approaches to accelerate such computations are of interest in the field.
Model reduction methods can substantially accelerate deformable simulations and have become popular in many computer graphics applications, ranging from animation generation and control, material design, to realistic sound synthesis. In these methods, a small number of deformation basis vectors or modes are computed beforehand. The online simulation then constrains the deformation to a subspace spanned by the modes, tremendously reducing the simulation degrees of freedom. While enjoying fast simulation performance, model reduction methods need to carefully construct a set of modes that well express possible deformations during the simulation. This is usually an expensive task that can take hours to obtain plausible modes.
The conventional wisdom here is to tax the preprocessing step in exchange for runtime performance. Indeed, if the object geometry and material properties have been decided, it is worthwhile and affordable to precompute once for repeated online simulations. However, when the shape or material is frequently altered—for instance, in the case where a user is exploring different animation settings, a long precomputation time would drastically slow down the work flow because every geometric and material update dictates a re-computation of the reduced model. There is a need in the art to accelerate precomputation in a typical reduced deformable simulation pipeline, a problem that has been largely overlooked.
Methods, devices, and computer readable media are disclosed to precompute reduced deformable models for objects, e.g., objects which may be represented graphically by input meshes. Some example methods may include applying a Krylov subspace iteration to construct a series of inertia modes for the input mesh; condensing the inertia modes into a mode matrix; sampling a set of cubature points from the input mesh, and calculating cubature weights of the set of cubature points for each of the inertia modes in the mode matrix; generating a training dataset by iteratively adding training samples to the training dataset until a training error metric converges, wherein each training sample is generated from an inertia mode in the mode matrix and corresponding cubature weights; and generating the reduced deformable model, wherein the reduced deformable model includes inertia modes in the training dataset and corresponding cubature weights.
Example computing devices according to this disclosure may generally comprise a processor and memory including computer readable instructions to perform the methods disclosed herein. Computer readable media may according to this disclosure may generally comprise computer readable instructions to perform the methods disclosed herein. Further aspects and embodiments are described in detail below.
Various features and attendant advantages of the disclosed technologies will become fully appreciated when considered in conjunction with the accompanying drawings, in which like reference characters designate the same or similar parts throughout the several views, and wherein:
Prior to explaining embodiments of the invention in detail, it is to be understood that this disclosure is not limited to the details of construction or arrangements of the components and method steps set forth in the following description or illustrated in the drawings. Embodiments of this disclosure are capable of other embodiments and of being practiced and carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein are for the purpose of the description and should not be regarded as limiting.
At a “Generate Inertia Modes” operation 101, in some embodiments, a Krylov subspace iteration may be applied to construct a series of inertia modes for a given input mesh. The series of inertia modes may comprise linear inertia modes and nonlinear inertia modes. Linear inertia modes may result from the Krylov subspace iteration. Nonlinear inertia modes may be generated by computing asymptotic inertia derivatives for the input mesh, as described in detail herein. Also, in some embodiments, a Gram-Schmidt orthogonalization scheme may be applied to regularize the constructed series of inertia modes, as described in detail herein.
At a “Condense Inertia Modes” operation 102, in some embodiments, the inertia modes generated at operation 101 may be condensed into a mode matrix. For example, a random projection method may be used to condense the inertia modes in the mode matrix, as described in detail herein.
At a “Sample Cubature Points and Calculate Cubature Weights” operation 103, in some embodiments, a set of cubature points may be sampled from the input mesh, and cubature weights may be calculated for the set of cubature points, for each of the linear inertia modes in the mode matrix. In some embodiments, the set of cubature points may be uniformly sampled from the input mesh, as described in detail herein.
At a “Generate Training Dataset” operation 104, in some embodiments, a training dataset may be generated by iteratively adding training samples to the training dataset until a training error metric converges, wherein each training sample is generated from an inertia mode in the mode matrix and corresponding cubature weights. Iteratively adding training samples to the training dataset may comprise adding more than one inertia mode at each iteration, as described in detail herein.
At a “Generate Reduced Deformable Model” operation 105, in some embodiments, a reduced deformable model may be generated and/or may be embodied by the completed training dataset. The reduced deformable model may include inertia modes in the training dataset and corresponding cubature weights.
At a “Use Reduced Deformable Model for Simulation” operation 106, in some embodiments, the reduced deformable model generated by operations 101-105 may be used to generate simulations of the object represented by the input mesh. For example, the reduced deformable model may be used in one or more online simulations.
This disclosure generally provides technologies to accelerate the precomputation step in a reduced deformable simulation pipeline. The standard precomputation in a typical reduced model method undergoes three expensive sub-steps. First, the reduced modes are typically extracted using the modal analysis or principle component analysis (PCA) methods, both of which rely on a generalized Eigen analysis or singular value decomposition. For a high-resolution finite element mesh, this is costly. Secondly, when used in a nonlinear deformable simulation, model reduction methods need to evaluate the object's nonlinear internal forces at runtime. Such runtime evaluations are accelerated using a cubature scheme, but the precomputation of the cubature points and their weights is known to be computationally expensive. Lastly, to facilitate the cubature training, one also needs to prepare for a set of training poses and simulate them using a full-space simulation, which is typically expensive. In addition, there is always a question of what kind of training poses to be included in the training dataset. Too few training poses lead to unsatisfactory subspace construction, while too many poses unnecessarily increase the cost for data preparation.
As a response to the aforementioned challenges, we offer three techniques to accelerate these precomputation sub-steps: First, for modal construction, we augment the linear inertia mode technique originally designed for substructured objects. This method (§ 4), based on Krylov iterations, allows us to sidestep the expensive Eigen decomposition in traditional modal construction methods. We devise a reduced Gram-Schmidt orthogonalization scheme to quickly regularize the modal vectors, and derive a formula to compute nonlinear reduced modes up to an arbitrary order. Furthermore, to extract a compact set of modes, we propose a random projection algorithm inspired by techniques in the field of data mining. As a result, our method at this sub-step may in some embodiments achieve results 20-40× faster than standard modal analysis-based methods.
Second, we accelerate the precomputation of cubature points and weights. Based on the Newton-Cotes rule, our cubature points may be sampled uniformly in the solid body, but the cubature weights may be trained individually and in parallel across all reduced modes. As a result, some embodiments of this disclosure can potentially finish cubature training within milliseconds.
Third, our new cubature training scheme further allows us to answer the question of how to effectively prepare training samples. We interpret the training process as a Monte Carlo integration for finding the expected error of the reduced simulation. From this point of view, we disclose an algorithm to incrementally add training samples and stop when no more samples are needed. As a result, it saves us from generating unnecessary training samples and ultimately reduces the precomputation cost.
Putting together all these techniques, we significantly accelerate the precomputation step. Compared to the standard pipeline, some embodiments may gain orders-of-magnitude speedups in preprocessing while retaining the runtime simulation quality comparable to the standard methods. Furthermore, the disclosed methods are independent from any particular online reduced simulation methods, so the disclosed methods can work in tandem with different types of simulators, including simulators for single-body and substructured deformable simulation as well as sound simulation.
The Finite Element Method (FEM) based framework has become one of the primary numerical tools to handle elastic deformable objects. The governing differential equilibrium equations across the deformable body are numerically solved with the discretization of many interconnected small elements yielding realistic results. Various material properties can also be accommodated in this framework based on well-established theories from continuum mechanics.
High computational cost is a major drawback associated with FEM, especially for meshes of large size. In order to make deformable models more efficient for interactive applications, numerous contributions have been proposed in the decades. Multi-resolution and adaptive simulations use hierarchical spatial discretization of the simulation domain to accelerate the computation. These types of techniques use high-level bases to represent general deformations and refined ones for more detailed local deformations if necessary. Similarly, other mesh variations like embedded mesh, coarsened mesh, caged mesh, or skeletonized mesh offer extra control over the deformation with auxiliary grids.
Another important line of research focusing on improving the simulation performance of FEM is well-known as model reduction. Model reduction is designed to accelerate the simulation speed by reducing the numbers of degrees of freedom (DoFs) of the simulation system. It is achieved based on the assumption that the displacement of the deformable object can be well approximated using a small number of basis vectors. By projecting the original high-dimensional system onto the subspace spanned by these basis vectors, a reduced deformable model can be obtained. While the technique appears straightforward, it is rather challenging to construct a “good” subspace because the deformation that an elastic body will undergo is unknown. One of the most widely-used techniques is developed based on vibrational analysis or modal analysis. The basis vectors are calculated as the vibrational modes at different frequencies. High frequency modes are of higher elastic energies and less likely to occur in the upcoming simulation. Therefore, they are discarded for the system reduction. Modal analysis was employed for interactive deformable simulation as early as the late 1980s. Many significant contributions have been developed afterwards based on this technique such as character skinning, deformable animation control and editing, as well as subspace collision/contact. It has also been extended to simulate large deformation based on co-rotational elasticity, nonlinear deformable material and multi-domain simulation.
Besides modal analysis, some other methods are also used for constructing subspace bases. Some approaches used selected key points to build the subspace. Others utilized previously simulated results to create a time-varying subspace. Others used PCA to prune a pose set to obtain basis vectors. Recent contributions create a subspace by enriching local deformable features.
Reduced Model Simulation.
For a given finite element mesh with n nodes, model reduction assumes that its displacement can be expressed as a linear combination of a set of modal vectors U, or u=Uq, which leads to a reduced Euler-Lagrange equation:
Mq{umlaut over (q)}+ƒdis(q,{dot over (q)})+ƒ(q)=ƒext, Equation 1:
where Mq=UTMU, ƒdis and ƒ are the reduced mass matrix, dissipative force, and internal force respectively. The reduced internal elastic force ƒ is a function of the modal displacement q, and its Jacobian matrix is often referred to as the reduced tangent stiffness matrix Kq (q). With Rayleigh damping, ƒdis can be evaluated as (ζMq+ξKq(q)){dot over (q)}. In practice, q ∈T is a vector of much lower dimension (i.e., r<<3n). Thus, solving the nonlinear equation (1) is significantly faster than solving the its unreduced counterpart.
Precomputation Pipeline.
The linear and nonlinear modes may be condensed into a final mode matrix U using a random projection method (§ 4.3). Next, we uniformly sample a set of cubature points on the object. The cubature weights of those points are calculated for every reduced mode (§ 5).
The training proceeds in an iterative process: in each iteration, we incrementally add a few training poses (also referred to herein as training samples), and stop the iteration when a training error metric converges (§ 6). At that point, a reduced deformable model, including a set of deformation modes and related cubature weights, are ready for subsequent online simulations. The reduced deformable model may be simulated, e.g., using a Jacobian-free Newton-PCG solver at runtime.
We start by describing our fast algorithm for precomputing reduced modes. We build our reduced modes based on the Krylov subspace method, a technique for model reduction in structural analysis. We propose a method to accelerate the mode construction process (§ 4.2) and further extend the Krylov iteration to handle higher-order and nonlinear deformations (§ 4.3).
4.1 Background on Linear Inertia Mode
Traditional reduced mode construction involves eigenvalue decomposition through a principle component analysis or linear modalanalysis. Eigen-decomposition is generally expensive and has limited space to accelerate. Instead, we use the Krylov subspace method, which has been used in structural analysis and in graphics for computing substructured modes. The resulting modes, known as linear inertia modes, up to an order of m are computed recursively:
U(m)=Am-1U(0),where A=K−1 Equation 2:
where K and M are respectively the rest shape stiffness matrix and mass matrix, and U(0) is for mode initialization. A typical choice is the six infinitesimal rigid-body modes of the object (i.e., U(0)=Ur), wherein MU(0) can be interpreted as the inertia forces induced by rigid-body accelerations. Equation (2) essentially constructs a Krylov subspace of order m, denoted as K(m)span(U(1)) ∪ . . . ∪ span(U(m)), where span(B) stands for the column space of a matrix B.
Unconstrained Inertia Mode.
Eq. (2) may be used to construct reduced modes of substructure components with well-defined boundary conditions. However, when an object is unconstrained, K is singular, and thus Eq. (2) is unusable. We notice that a deformable object's motion is a superposition of its rigid-body motion ur ∈ span(Ur) and a pure deformation ud. The subspace of reduced modes approximating ud should therefore be orthogonal to Ur, resulting in a constrained linear system for computing the unconstrained inertia modes.
where λ is a Lagrange multiplier. This formula of constructing unconstrained modes is new.
Numerical Challenges.
In essence, constructing the Krylov subspace in Eq. (2) amounts to partially performing linear modal analysis, which solves an Eigen problem of KU=MUS (or equivalently AU=US−1). This is because K(m) is a good approximation of the subspace spanned by leading eigenvectors and has been used in classic Eigen solvers such as Arnoldi and Lanczos methods. However, Krylov iterations undermine the linear independence among modal vectors: after few iterations, the mode matrix quickly becomes ill-conditioned. A recipe to address this problem is to apply regularization methods such as a mass Modified Gram-Schmidt (mass-MGS) process after each iteration. This process involves frequent computation of mass inner product between two modes ui and uj (i.e., <ui,uj>MūiTMuj), and prunes the i-th mode ui using previously regularized modes uj, j<i:
The time complexity of processing all modes is O(nr2), which greatly increases the precomputation cost for a high-resolution mesh with a moderate number of reduced modes.
4.2 Reduced Mass-MGS
We propose a reduced mass-MGS to regularize modal vectors during the mode construction. We first accelerate the mass inner product, which effectively reduces the O(n) factor in the O(nr2) complexity. We further lessen the cost of repeated subtraction in Eq. (4) to lower the O(r2) factor.
Sparse Inner Product. The mass inner product between two displacement vectors ui and uj is a numerical discretization of a volume integral in the continuous setting,
(ui,uj)M≈∫Ωρ(x)ui(x)uj(x)dV, Equation 5:
where ui,j(x) is a continuous displacement field corresponding to the discretized displacement vector ui,j. Such a domain integral can be numerically evaluated using the Newton-Cotes integration rule, which sums up the integrand values at a set of sample points S over Ω,
where ui(p) and uj(p) are 3D displacement vectors at a sample point p. wp is a nonnegative weight associated with p. In the rest of this section, we use <⋅, ⋅>s to denote our sparse inner product.
The Newton-Cotes rule requires sample points be evenly placed over the object volume Ω. To this end, we create an axis-aligned bounding box of the mesh and subdivide the bounding box along three axes into cubic boxes. If a box B intersects with the input mesh, we add the finite element node nearest to the center of B into S and compute its weight as the total mass inside B, wp=∫B∩Ωρ(x)dV. The section entitled, “Error Analysis of the Sparse Inner Product” provides an analytical error analysis of the sparse inner product. For at least some implementations, we find that setting |S| ∝ log(n) provides a good balance between efficiency and accuracy as shown in
In
Subtraction Reduction.
Next, we reduce the cost of O(r2) subtraction (i.e., Eq. (4)) in the mass-MGS. One observation is that among all pairs of linear inertia modes, a considerable portion is already near-orthogonal pairs even without the mass-MGS regularization. In other words, (ui, uj)M is small for many pairs of ui and uj (see statistics in
Inspired by the above observation that among all pairs of linear inertia modes, a considerable portion is already near-orthogonal pairs even without the mass-MGS regularization, we define the sparse cosine as
and use it as a sparse metric of the orthogonality test between ui and uj. If |α| is smaller than a threshold ατ, the corresponding subtraction (i.e., Eq. (4)) may be skipped. We outline the implementation pseudo-code with subtraction reduction in the section entitled, “Implementation Details of rMGS with Subtraction Reduction,” where some implementation details are also highlighted. A visualization of the mass-orthogonality of the resulting mode matrix processed with our method is shown in
4.3 Nonlinear Inertia Derivatives
Linear modes are insufficient to capture nonlinear deformations. To address this limitation, a method of computing modal derivatives, the first-order directional derivatives of Eigen vectors, may be employed to expand the subspace and thereby incorporate the nonlinear deformations. While such methods are based on linear modal analysis, we show that our Krylov-based inertia modes can also be extended for capturing nonlinear deformations (§ 4.3.1). We call those nonlinear modes the inertia derivatives. More importantly, we derive an asymptotic formula for computing the inertia derivatives of an arbitrary order. Additionally, to refine the modal bases, we propose a novel random projection scheme as a faster alternative to the classic PCA (§ 4.3.2).
4.3.1 Generation of Nonlinear Inertia Derivatives
Recall that the linear inertia modes are computed recursively using U(m)=K−1 MU(m-1). When the nonlinear deformation is considered, the stiffness matrix K is no longer constant. Instead, it depends on the current displacement u. Consider a small perturbation of K=K(0), that is, K(Δu)=K+ΔK. We then expand the inverse of K(Δu) using Taylor series and obtain:
K(Δu)−1=(K+ΔK)−1=K−1−K−1ΔKK−1+O(∥Δu∥2). Equation 7:
Applying Eq. (7) to the computation of inertia modes reveals an asymptotic expansion of nonlinear approximation:
K−1MU(m-1)−K−1ΔKK−1MU(m-1)+ . . . =U(m)−K−1ΔKU(m)+ . . . ,
Here the first term is the linear inertia modes. To compute the first-order nonlinear modes, we express ΔK using the directional derivative of K along a direction u,
K−1ΔKU(m)=K−1(H:u)U(m),
where H=∇K is the stiffness Hessian, a third-order tensor, and the semicolon operator indicates a tensor contraction. When u is chosen as individual linear modes, we obtain the formula of computing the first-order nonlinear inertia derivatives,
vij(1)=K−1(H:ui)uj,
where both ui and uj are linear inertia modes, and the superscript of vij indicates that it is a first-order nonlinear mode. We notice that this formula of computing vij(1) echoes a modal derivative formula, although our linear modes are computed in a different way.
More importantly, this line of derivation facilitates the computation of arbitrarily high-order nonlinear modes. Further expansion of Eq. (7) shows that K(Δu)−1=K−1−K−1 ΔKK−1+K−1 ΔKK−1 ΔKK−1+ . . . , from which we can compute the second-order inertia derivatives,
V(2)=K−1(H:u)V(1),
where we use V(1) to denote the first-order nonlinear modal matrix, and V(2) is the second-order modal matrix. In general, the nonlinear inertia derivatives of order k can be written as:
V(k)=K−1(H:v)V(k-1),
where v is a reduced mode in span (V(1)) ∪ span(V(2)) ∪ . . . ∪ span(V(k-1)).
Discussion.
In theory, by carefully selecting first-order or even linear modes, one can always capture a complex deformation. So there exists a philosophical question: should one favor the increment of low-order modes over the use of higher-order ones? If some deformation priors are known—for instance, in a case where the external forces are given—one can carefully choose the initial Krylov vectors (i.e., U(0)) to construct a fine-tuned subspace. One may also directly interpolate reduced modes according to anticipated shape changes. However, for geometrically complex models with large deformations, manually picking the “right” modes may not be straightforward. In those cases, using high-order inertia derivatives can be a simple and robust solution (
4.3.2 Random Projection
As one chooses to use increasingly higher-order modes, the total number of modes increases exponentially (i.e., the column size of V(k) increases exponentially with respect to k), and thus the reduced simulation quickly slows down. Most existing work use PCA to select the most prominent modes out of the constructed ones, and compelling results have been reported. PCA has a time complexity of O(min((r2)3, n3)), assuming that the number of constructed modes comprising both linear and nonlinear modes is O(r2). In traditional modal-analysis-based precomputation, this is hardly a bottleneck, as the Eigen-decomposition for modal analysis is usually more expensive. However, our mode construction method has eliminated the use of Eigen-decomposition, leaving PCA indeed a performance bottleneck.
As a faster alternative of PCA, we propose an efficient method, which we call Random Projection (RP). This method is based on an observation: in a high-dimensional vector space, it is very likely that two random vectors are almost orthogonal to each other. This suggests that we can simply use a random thin matrix to condense the constructed modes.
Concretely, we first normalize the “importance” of the modes with various frequencies by scaling them according to their generalized Rayleigh quotient. Suppose there are m constructed modes, including both the linear inertia modes and nonlinear inertia derivatives. For every mode ui, i=1 . . . m, we compute ui←ui (uTKui)/(uTMui). We then concatenate all the ui into a superset matrix Ũ, and compute the final modal matrix U for the online simulation using U=ŨR, where R is a m×r matrix to condense the number of modes from m to r. The entry of R is randomly generated using:
When m>>r, the column vectors of R are almost always near-orthogonal to each other. Thus, ŨR is approximately a projection of span(Ũ) on a denser space determined by R. Since R is sparse, this matrix multiplication is much faster than running PCA.
Unlike PCA, this projection cannot choose the most salient subspace indicated by eigenvalues.
As shown in
Using a reduced model for an efficient nonlinear deformable simulation requires a fast runtime computation of the internal elastic force and its Jacobian. Building on the numerical cubature scheme, a fast method for the runtime internal force evaluation may be used. During precomputation, it selects a set of cubature elements ε on the mesh, and computes a weight we for each element e ∈ ε. Then the reduced internal force ƒ(q) and its Jacobian ∂ƒ/∂q are respectively computed as weighted summations over all the cubature elements:
where ge(q) is the reduced internal force induced by a modal displacement q at a cubature element e. While this scheme enjoys a fast runtime computation, the precomputation is highly expensive: it incrementally adds cubature elements following the residual gradient, which is calculated by exhaustive nonnegative least-squares (NNLS) solves.
Rationale.
We seek to accelerate the cubature precomputation to achieve an interactive performance. One straightforward attempt is again to exploit the Newton-Cotes rule: using evenly spaced cubature points to avoid repeated NNLS solves. However, to maintain sufficient approximation accuracy using the Newton-Cotes rule, we need to sample cubature points densely, which in turn burdens the NNLS in precomputation and the force evaluation at runtime. We propose a simple solution to address this dilemma: instead of using a single weight at each cubature point, we prepare multiple weights, each for an individual reduced coordinate. Our experiments (see
Method.
Concretely, for every component ƒj of the reduced internal force ƒ, we precompute a set of cubature weights wej, and approximate r as
where ej is the canonical unit basis vector of r (e.g., e1=[1, 0, . . . ]T); gej is the j-th component of ge, the internal force at a cubature element e. We stack wej for all e ∈ ε into a vector wj and precompute it by solving a NNLS problem, Ajwj=bj, where Aj and bj are constructed based on a training set with T samples. Specifically:
where ƒj,i denotes the j-th component of a reduced internal force ƒi in the i-th training example.
To distinguish from the standard cubature training, to which we refer as Optimized Cubature (OC), we refer to our cubature scheme as Modal Cubature (MC). Using MC, the reduced internal force Jacobian can be written as
We also sample the positions of cubature points evenly using axis-aligned voxels, as used in the sparse inner product step (§ 4.2). While we need to solve a NNLS problem for every single component of the reduced coordinate, the size of each NNLS problem is much smaller, and all the solves can be performed in parallel, yielding an interactive precomputation performance (1000-5000×) faster calculation.
Extension.
It is noteworthy that the Jacobian matrix resulting from Eq. (9) is not necessarily symmetric. One can approximately symmetrize the matrix Kq using Kq←½(Kq+KaT). On the other hand, we propose a new Newton-PCG solver, which requires no runtime evaluation of the Jacobian matrix, and thus completely sidesteps the asymmetry problem. We describe this runtime solver in the section entitled, “Jacobian-free Newton-PCG Solver,” as an extension of our precomputation pipeline.
Most reduced simulation methods compute the cubature weight from a training data , which is often taken as granted. One reasonable strategy, when there does not exist a prior training set, is to blindly sample a deformation subspace. For instance, the modal displacement following a normal distribution may be sampled and used in a full-space simulation to evaluate the corresponding internal forces and eventually, assemble the training dataset. With our goal of expediting the entire precomputation pipeline, we wish to carefully generate the training samples to avoid full-space simulation as much as possible. To the best of our knowledge, this problem has been largely unexplored.
the size of the initial training set
stores poses with top 20% fitting error
recenter samples
Our modal cubature scheme (§ 5) allows us to reduce the cost of training sample generation by incrementally expanding the training dataset. MC differs from the traditional optimized cubature scheme, wherein the training data is given, and their goal is to find the best set of cubature points. In contrast, with a fixed set of cubature points, we seek for a proper size of training dataset.
6.1 Observations and Rationale
Our training data generation algorithm is inspired by two observations, which can be understood with reference to
In a first observation, it is known that the numerical accuracy of approximating a force integral using the Newton-Cotes rules is bounded by the sampling interval (see the section entitled, “Error Analysis of the Sparse Inner Product”). This implies that the increase of training samples has a diminishing return on the accuracy improvement, as demonstrated in
Our second observation looks at the change of the normalized NNLS fitting error shown in
Initially, when T is small, the NNLS problem is under-constrained, and thus the fitting error is low. As more samples are added in the cubature training, the fitting error grows, but eventually becomes bounded from above. One interpretation of this observation is from the Monte Carlo integration point of view. The error defined in Eq. (10) computes the averaged fitting error across all training samples. As the number of samples increases, it is equivalent to evaluating, using Monte Carlo integration, the expected fitting error in the deformation subspace. Since the number of cubature points is fixed, the expected fitting error is bounded.
6.2 Incremental Training Samples
The above interpretation from a Monte Carlo point of view suggests that the error metric defined in Eq. (10) can be a natural indicator for when to stop adding training samples. One simple algorithm is as follows: we incrementally add samples into the training dataset. Every time when adding a training sample, we generate a reduced modal pose whose component qi follows a Gaussian distribution, qi˜(Ei, σi), where Ei=0 and σi=1/√{square root over (Qi)}. Here Qi is the generalized Rayleigh quotient of the modal vector ui. It is an estimation of the effective eigenvalue of ui, so the low-frequency mode will produce samples with larger variance. We use the generated reduced coordinate to evaluate a displacement vector and resulting internal forces using a full-space simulation. After we add a few samples, we update the fitting error Eq. (10), and evaluate its corresponding change Δe. We stop generating new training samples, if Δe is smaller than a threshold eτ (eτ=0.005 in all of our examples).
We further improve our training pose sampling algorithm by adding more poses in under-sampled regions in the reduced deformation space. As detailed in Algorithm 1, we start by generating 3/2|ε| samples using the Gaussian distribution described above (line 7-11 of Algorithm 1), so the resulting NNLS problem is over-constrained. We then iteratively add more samples. In each iteration, we add NI samples. (NI=⅓|ε| in all our examples). At the end of each iteration, we adjust Ei and σi for subsequent sampling (line 14-18 of Algorithm 1) based on the training poses that have large fitting errors. To this end, we maintain a set of training poses that have the top 20% fitting error among currently assembled training poses. For the next sample generation, we set Ei as the averaged modal pose of , and adjust σi such that they are in the radius of the original Gaussian distribution (line 17 of Algorithm 1). The iteration process stops if the error change Δe is below a threshold. The efficacy of this improved algorithm is shown in
In every iteration, we generate NI training samples in parallel. With our incremental training generation, we are able to stop computing training samples whenever they become unnecessary. As a result, the training process is largely accelerated. Even for large-scale models, the entire precomputation, including the generation of training data, can be completed within tens of seconds. Of course, if is given a priori in certain cases, (or not required, for instance one may choose to use geometrical warping to produce nonlinear deformation as in the example shown in
In experiments to test the proposed precomputation pipeline to evaluate its performance, scalability, quality, and versatility, the disclosed methods may be implemented, e.g., using Microsoft Visual Studio 2010 on a Windows 8.1×64 desktop equipped with an Intel i7-5960 3.0 GHz CPU (with eight physical cores) and 32G on-board RAM. pThread may be used to parallelize the computation for modal cubature training and training data generation. Most proposed numerical algorithms (e.g., sparse inner product, pMGS, nonnegative least square and Jacobian-free Newton-PCG solver) may be implemented from scratch with the help of the Intel MKL libraries.
7.1 Validation
Scalability Test.
The scalability of our modal construction algorithm may be tested using the gargoyle model. To ease the control of number of elements, we voxelize the model, so we can generate tetrahedral meshes whose sizes range from 50K to 1M, such as illustrated in
The computation time for generating linear inertia modes (our method) and linear Eigen modes (previous method) may be recorded in a table. In some embodiments, the linear inertia modes may be computed using the PARDISO solver shipped with MKL. The modal analysis-based modes may be computed using Matlab 2015a's built-in eigs function with multi-threading enabled. We found that eigs is faster than ARPACK++ or SLEPc, another two widely-used open source packages for sparse Eigen problems. This may be because Matlab uses MKL to handle sparse linear system, which has better performance on our hardware platform (Intel CPU). These tests show that with the help of rMGS, the construction of inertia modes may be up to 20-30× faster than linear modal analysis on average. Under such circumstance, mode refinement with PCA becomes a costly operation in the precomputation. As plotted in
Simulation Quality.
It can be seen in
Modal Cubature.
We compare the performance of the modal cubature with the standard optimized cubature on a training set , T=1000. We adopt the lazy cubature optimization strategy, wherein the new seed element is picked out of a subset of 1,000 elements. Both training strategies are tested using the Armadillo model. We choose multiple NNLS fitting error levels. For each error level, we examine the number of cubature elements needed to reach the error level, and compare the cubature training performances of both schemes.
When training poses are not available, we incrementally generate while tracking the change of the NNLS fitting error (i.e., Algorithm 1). This strategy need only involve a few hundred training poses (although more may be used if desired) and produces deformable animation of similar quality to the ones generated using a larger training set. Because of the per-mode weight storage in MC training, MC consumes r times more memory than OC does. One drawback of MC is associated with the asymmetry of the resulting Jacobian matrix, which can result in a larger numerical error than OC when used to compute the force gradient. Fortunately, the proposed Jacobian-free PCG solver in the section entitled, “Jacobian-free Newton-PCG Solver,” addresses this issue. In all our examples, our inner PCG solver is able to converge within 3 to 5 iterations (we set the convergence threshold as 0.001). The St. Venant-Kirchhoff material model is adopted in both comparisons.
Free-moving deformable bodies can be well accommodated within our framework.
7.2 Applications
Application I: Multi-domain Simulation with High-order Nonlinear Modes. Our precomputation pipeline can work in tandem with different types of simulation methods and hyperplastic materials. Beside the typical single-domain solid deformable simulation, here we illustrate the application of our precomputation in a substructured simulation. Following the state-of-the-art domain decomposition methods, precomputation may be localized at small-size domains.
Application II: Simulation-in-the-loop Character Animation. The proposed precomputation pipeline allows a fast preview of physics-based animation on the top of the classic skeleton-driven character animations. In this application, the user is able to tweak material parameters of the character. Owing to the proposed fast precomputation, we are able to update the reduced model and rerun the simulation all interactively.
Application III: Nonlinear Sound Synthesis. Lastly, fast deformable precomputation also allows a quick forelook for sound synthesis.
We present a comprehensive solution to accelerate the precomputation of nonlinear elastic deformable models based on Krylov iteration. We optimize the three performance bottlenecks along the traditional precomputation pipeline: the mode construction and its regularization, the cubature training and the generation of the training poses. Together with the devised Jacobian-free Newton-PCG solver, expensive precomputation is now made interactive or nearly interactive while the online simulation remains in real-time.
Error Analysis of the Sparse Inner Product
Eq. (6) is essentially an open-type Newton-Cotes formula, wherein the midpoint rule with a constant-value interpolating function is used. Eq. (6) corresponds to the open-type formula because we do not use values of target function at end points of an interval. Instead, the value at the midpoint is used.
where H=b−a is the size of the integration interval. ƒ(k) denotes the k-th order derivative function of ƒ(x). v is some value between [a, b]. It is noteworthy that Eq. (11) gives the approximation error of adopting sparse inner product with respect to Eq. (5), the inner product between two vector-valued functions, while our real approximating target is the full-size mass inner products. Therefore, the numerical error induced by adopting the sparse inner product is bounded by O((H−He)3), where He is the maximum size of the element on the mesh.
∪* hosts the reglarized modes
α is the sparse cosine
re normalize vi
incremental norm evaluation
Implementation Details of rMGS with Subtraction Reduction
The pseudo-code outlining the proposed rMGS is given in Algorithm 2. It can be seen that rMGS needs to update the sparse-norm of vi (i.e., variable l in the pseudo-code) immediately, after a projection-subtraction is executed in order to evaluate α for the next loop. This subroutine sits in the innermost loop of rMGS and can be sped up by updating its sparse norm incrementally:
This equation (line 15 of Algorithm 2) however, could accumulate roundoff error and lead to negative square rooting when α goes large and l gets smaller. To maintain the numerical stability, the values of 1−α2 and l are regularly checked (line 12 of Algorithm 2). If necessary, we fresh evaluate l directly using sparse inner product and re sparse-normalize vi (line 13 of Algorithm 2).
Jacobian-Free Newton-PCG Solver
The Newton's method is a common choice for online subspace integration. At each time step, Newton's method seeks for an incremental displacement Δq as the residual minimizer iteratively. It typically requires the explicit formulation of the current tangent stiffness matrix, which is an O(|ε|r2) procedure. Besides, an accurate force gradient may not be available with MC scheme, recalling that Eq. (9) does not even preserve its symmetry. To tackle this limitation associated with MC training, we do not use any direct solvers (e.g., LU, Cholesky) to calculate Δq within a Newton iteration. Instead, a preconditioned conjugate gradient (PCG) solver is adopted, which only needs the evaluation of the matrix-vector product. We approximate these matrix-vector products numerically instead of resorting to the analytical evaluation of the force Jacobian. Suppose that the implicit Newmark time integration is used. Each Newton iteration needs to solve a r-dimension linear system of AΔq=−e, where
Here δqi=qi+1−qi is the displacement deviation at current time step. {dot over (q)}i and {dot over (q)}i are the known reduced velocity and acceleration at the previous step. ζ and ζ are damping coefficients. ƒext is the reduced external force. α1, α2, . . . α6 are constant coefficients computed as:
where β=½, γ=1 are two parameters of the Newmark integrator. h is the size of each time step.
Matrix-vector product between the system matrix A and a certain vector p in the PCG solver can be written as the summation of two items according to the formulation of A:
The first term on the r.h.s can be directly evaluated as Mq is a constant matrix. The second term is essentially the scaled directional derivative of the reduced internal force, where the notation of DΠ (x)[u] stands for the directional derivative of a function Π at x in the direction of u. Understanding this important fact allows us to use the numerical directional derivative to approximate the matrix-vector product associated with the reduced tangent stiffness matrix:
The choice of ε in Eq. (13) is not trivial: if ε is too large, the derivative is poorly approximated and if it is too small the result of the finite difference is contaminated by floating-point roundoff error. We follow the choice used in NITSOL package:
where εmachine is the machine epsilon. It is typically set as 10−6 for 64-bit double precision and is regarded as the most suitable number. The coefficient of
makes sure that the final adopted ε is not impaired by an over-scaled p.
Preconditioning.
The preconditioner plays a critical role for the PCG solver. Unfortunately, there does not yet exist a well-established theory finding the best preconditioner for every case. Most preconditioning methods such as Jacobi, Gauss-Seidel, or SOR preconditioning require the information of the system matrix A, which is not available in our case as the tangent stiffness matrix is unknown. Alternatively, we design the preconditioner P as the largest invariant portion of A:
P=(α1+ζα4)Mq+(1+ξα4)Kq(0). Equation 15:
We find that using the preconditioner defined in Eq. 15 is able to double the convergence rate. The initial guess of the PCG is set as δqi at the very beginning of each time step and as a zero vector for the rest Newton iterations following the logic that the current Δq should be similar to the previous one at the first the Newton iteration, while it should quickly converge to a zero vector as Newton iteration moves forward.
Example Computing Device
Depending on the desired configuration, processor 2110 may be of any type including but not limited to a microprocessor (μP), a microcontroller (μC), a digital signal processor (DSP), or any combination thereof. Processor 2110 may include one or more levels of caching, such as a level one cache 2111 and a level two cache 2112, a processor core 2113, and registers 2114. Processor core 2113 may include an arithmetic logic unit (ALU), a floating point unit (FPU), a digital signal processing core (DSP Core), or any combination thereof. A memory controller 2115 may also be used with processor 2110, or in some implementations memory controller 2115 may be an internal part of processor 2110.
Depending on the desired configuration, system memory 2120 may be of any type including but not limited to volatile memory (such as RAM), non-volatile memory (such as ROM, flash memory, etc.), or any combination thereof. System memory 2120 typically includes an operating system 2121, one or more applications 2122, and program data 2125. In some embodiments, operating system 2121 may comprise a virtual machine that is managed by a Virtual Machine Manager (VMM). Applications 2122 may include an accelerated precomputation application 2123 which may implement any or all of the various precomputation techniques disclosed herein, and optionally a simulator application 2124 which may perform simulations using reduced deformable models generated by the precomputation application 2123. Program data 2125 may include data 2126 which may include any or all of the various data inputs and outputs described in connection with the disclosed precomputation techniques. Computing device 2100 may also connect with other computing devices 2190 to access cloud or remote data base storage.
Computing device 2100 may have additional features or functionality, and additional interfaces to facilitate communications between the basic configuration 2101 and any required devices and interfaces. For example, a bus/interface controller 2140 may be used to facilitate communications between the basic configuration 2101 and one or more data storage devices 2150 via a storage interface bus 2141. The data storage devices 2150 may be removable storage devices 2151, non-removable storage devices 2152, or a combination thereof. Examples of removable storage and non-removable storage devices include magnetic disk devices such as flexible disk drives and hard-disk drives (HDD), optical disk drives such as compact disk (CD) drives or digital versatile disk (DVD) drives, solid state drives (SSD), and tape drives, to name a few. Example computer storage media may include volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data.
Level 1 cache 2111, level 2 cache 2112, system memory 2120, removable storage 2151, and non-removable storage devices 2152 are all examples of computer storage media. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium that may be used to store the desired information and that may be accessed by computing device 2100. Any such computer storage media may be part of device 2100.
Computing device 2100 may also include an interface bus 2142 for facilitating communication from various interface devices (e.g., output interfaces, peripheral interfaces, and communication interfaces) to the basic configuration 2101 via the bus/interface controller 2140. Example output devices 2160 include a graphics processing unit 2161 and an audio processing unit 2162, which may be configured to communicate to various external devices such as a display or speakers via one or more A/V ports 2163. Example peripheral interfaces 2170 may include a serial interface controller 2171 or a parallel interface controller 2172, which may be configured to communicate through either wired or wireless connections with external devices such as input devices (e.g., keyboard, mouse, pen, voice input device, touch input device, etc.) or other peripheral devices (e.g., printer, scanner, etc.) via one or more I/O ports 2173. Other conventional I/O devices may be connected as well such as a mouse, keyboard, and so forth. An example communications device 2180 includes a network controller 2181, which may be arranged to facilitate communications with one or more other computing devices 2190 via one or more communication ports 2182.
In some embodiments, computing device 2100 may be implemented as a business or personal use computer including both laptop computer and non-laptop computer configurations. In some embodiments, computing device 2100 may be implemented as one or more servers, e.g., servers in a data center or servers in an animation studio.
The foregoing detailed description has set forth various embodiments of the devices and/or processes via the use of block diagrams, flowcharts, and/or examples. Insofar as such block diagrams, flowcharts, and/or examples contain one or more functions and/or operations, it will be understood by those within the art that each function and/or operation within such block diagrams, flowcharts, or examples can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or virtually any combination thereof. In addition, those skilled in the art will appreciate that the mechanisms of the subject matter described herein are capable of being distributed as a program product in a variety of forms, and that an illustrative embodiment of the subject matter described herein applies regardless of the particular type of signal bearing medium used to actually carry out the distribution. Examples of a signal bearing medium include, but are not limited to, the following: a recordable type medium such as a floppy disk, a hard disk drive, a Compact Disc (CD), a Digital Video Disk (DVD), a digital tape, a computer memory, etc.; and a transmission type medium such as a digital and/or an analog communication medium (e.g., a fiber optic cable, a waveguide, a wired communications link, a wireless communication link, etc.).
Those skilled in the art will recognize that it is common within the art to describe devices and/or processes in the fashion set forth herein, and thereafter use engineering practices to integrate such described devices and/or processes into data processing systems. That is, at least a portion of the devices and/or processes described herein can be integrated into a data processing system via a reasonable amount of experimentation. Those having skill in the art will recognize that a typical data processing system generally includes one or more of a system unit housing, a video display device, a memory such as volatile and non-volatile memory, processors such as microprocessors and digital signal processors, computational entities such as operating systems, drivers, graphical user interfaces, and applications programs, one or more interaction devices, such as a touch pad or screen, and/or control systems including feedback loops and control motors (e.g., feedback for sensing position and/or velocity; control motors for moving and/or adjusting components and/or quantities). A typical data processing system may be implemented utilizing any suitable commercially available components, such as those typically found in data computing/communication and/or network computing/communication systems.
It will be understood by those within the art that, in general, terms used herein, and especially in the appended claims (e.g., bodies of the appended claims) are generally intended as “open” terms (e.g., the term “including” should be interpreted as “including but not limited to,” the term “having” should be interpreted as “having at least,” the term “includes” should be interpreted as “includes but is not limited to,” etc.). It will be further understood by those within the art that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases “at least one” and “one or more” to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles “a” or “an” limits any particular claim containing such introduced claim recitation to inventions containing only one such recitation, even when the same claim includes the introductory phrases “one or more” or “at least one” and indefinite articles such as “a” or “an” (e.g., “a” and/or “an” should typically be interpreted to mean “at least one” or “one or more”); the same holds true for the use of definite articles used to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should typically be interpreted to mean at least the recited number (e.g., the bare recitation of “two recitations,” without other modifiers, typically means at least two recitations, or two or more recitations). Furthermore, in those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). In those instances where a convention analogous to “at least one of A, B, or C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, or C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). It will be further understood by those within the art that virtually any disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms. For example, the phrase “A or B” will be understood to include the possibilities of “A” or “B” or “A and B.”
While various embodiments have been disclosed herein, other aspects and embodiments will be apparent to those skilled in art.
Priority is claimed to U.S. Provisional Application No. 62/213,760, filed on Sep. 3, 2015, entitled “Expediting Precomputation for Reduced Deformable Simulation.” The prior application is incorporated by reference in its entirety.
This invention was made with Government support under Agreement CRII-1464306 awarded by the National Science Foundation. The Government has certain rights in this invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2016/050073 | 9/2/2016 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2017/040905 | 3/9/2017 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6862374 | Nagai | Mar 2005 | B1 |
7663648 | Saldanha | Feb 2010 | B1 |
8924186 | Zhao | Dec 2014 | B1 |
9063882 | Zhao | Jun 2015 | B1 |
9378589 | Kim | Jun 2016 | B2 |
9601109 | Horesh | Mar 2017 | B2 |
20020024517 | Yamaguchi | Feb 2002 | A1 |
20020130853 | Horowitz | Sep 2002 | A1 |
20030184603 | Marshall | Oct 2003 | A1 |
20050128211 | Berger | Jun 2005 | A1 |
20060028466 | Zhou | Feb 2006 | A1 |
20060071932 | Weese | Apr 2006 | A1 |
20060122999 | Sosnov | Jun 2006 | A1 |
20060139347 | Choi | Jun 2006 | A1 |
20080180448 | Anguelov | Jul 2008 | A1 |
20080309664 | Zhou | Dec 2008 | A1 |
20090002376 | Xu | Jan 2009 | A1 |
20090284529 | De Aguiar | Nov 2009 | A1 |
20100020073 | Corazza | Jan 2010 | A1 |
20100053186 | DeRose | Mar 2010 | A1 |
20100111370 | Black | May 2010 | A1 |
20110275921 | Revishvili | Nov 2011 | A1 |
20110296331 | Iyer | Dec 2011 | A1 |
20110305379 | Mahfouz | Dec 2011 | A1 |
20120035459 | Revishvili | Feb 2012 | A1 |
20120081357 | Habbecke | Apr 2012 | A1 |
20120162217 | Lim | Jun 2012 | A1 |
20120221617 | Subbarao | Aug 2012 | A1 |
20120281019 | Tamstorf | Nov 2012 | A1 |
20120281873 | Brown | Nov 2012 | A1 |
20120290976 | Lahm | Nov 2012 | A1 |
20130162633 | Berger | Jun 2013 | A1 |
20130187919 | Medioni | Jul 2013 | A1 |
20130271449 | Lee | Oct 2013 | A1 |
20130286012 | Medioni | Oct 2013 | A1 |
20140015852 | Kantartzis | Jan 2014 | A1 |
20140028673 | Gregson | Jan 2014 | A1 |
20140043328 | Chen | Feb 2014 | A1 |
20140092090 | Fleury | Apr 2014 | A1 |
20140198107 | Thomaszewski | Jul 2014 | A1 |
20140198108 | Sigal | Jul 2014 | A1 |
20140333614 | Black | Nov 2014 | A1 |
20150074158 | Kimmel | Mar 2015 | A1 |
20150088225 | Noble | Mar 2015 | A1 |
20150088473 | Liu | Mar 2015 | A1 |
20150161987 | Horesh | Jun 2015 | A1 |
20150206341 | Loper | Jul 2015 | A1 |
20150213646 | Ma | Jul 2015 | A1 |
20150317808 | Tian | Nov 2015 | A1 |
20160048618 | Shimizu | Feb 2016 | A1 |
20160140255 | Kim | May 2016 | A1 |
20160202389 | Malvesin | Jul 2016 | A1 |
20170004621 | Maranzana | Jan 2017 | A1 |
20170018118 | Li | Jan 2017 | A1 |
20170032055 | Eisemann | Feb 2017 | A1 |
20170032579 | Eisemann | Feb 2017 | A1 |
20170061683 | Dorin | Mar 2017 | A1 |
20170118011 | Shibutani | Apr 2017 | A1 |
20170147874 | Perbet | May 2017 | A1 |
20170193692 | Huang | Jul 2017 | A1 |
20170213381 | Bronstein | Jul 2017 | A1 |
20170221274 | Chen | Aug 2017 | A1 |
20170320346 | Zhou | Nov 2017 | A1 |
20170330375 | Chen | Nov 2017 | A1 |
20170337732 | Tamersoy | Nov 2017 | A1 |
20170351793 | Tamstorf | Dec 2017 | A1 |
20180061141 | Choi | Mar 2018 | A1 |
20180130245 | Kozlov | May 2018 | A1 |
20180130256 | Wampler | May 2018 | A1 |
20180144533 | Brossard | May 2018 | A1 |
20180165860 | Noh | Jun 2018 | A1 |
20180181802 | Chen | Jun 2018 | A1 |
20180210413 | Frangos | Jul 2018 | A1 |
20180315230 | Black | Nov 2018 | A1 |
20180321347 | Wang | Nov 2018 | A1 |
20190012831 | Mitchell | Jan 2019 | A1 |
20190019331 | Alliez | Jan 2019 | A1 |
20190087979 | Mammou | Mar 2019 | A1 |
20190096116 | Cheong | Mar 2019 | A1 |
20190096127 | Huang | Mar 2019 | A1 |
20190108300 | Soler Arasanz | Apr 2019 | A1 |
20190108396 | Dal Mutto | Apr 2019 | A1 |
Number | Date | Country |
---|---|---|
2017040905 | Mar 2017 | WO |
Entry |
---|
Kim et al., Skipping Steps in Deformable Simulation with Online Model Reduction, 2009 (Year: 2009). |
Kim et al. Physics-based Character Skinning using Multi-Domain Subspace Deformations, 2011 (Year: 2011). |
Pernice et al., NITSOL: A Newton Iterative Solver for Nonlinear Systems, 1998 (Year: 1998). |
Xu, H., Li, Y., Chen, Y., and Barbic, J. 2015. Interactive Material Design Using Model Reduction. ACM Trans. on Graphics 34, 2, 14 pages. |
Yang, Y., Xu, W., Guo, X., Zhou, K., and Guo, B. 2013. Boundary-aware multidomain subspace deformation. IEEE Transactions on Visualization and Computer Graphics 19, 10, 1633-1645. |
Zheng, C., and James, D. L. 2011. Toward high-quality modal contact sound. ACM Transactions on Graphics (Proceedings of SIGGRAPH 2011) 30, 4 (Aug.). |
Zou, H., Hastie, T., and Tibshirani, R. 2006. Sparse Principal Component Analysis. Journal of Computational and Graphical Statistics, vol. 15, No. 2, pp. 265-286. |
Alanelli, M., and Hadjidimos, A. 2004. Block gauss elimination followed by a classical iterative method for the solution of linear systems. Journal of Computational and Applied Mathematics 163, 2, 381-400. |
An, S. S., Kim, T., and James, D. L. 2008. Optimizing cubature for efficient integration of subspace deformations. ACM Trans. Graph. 27, 5 (Dec.), 165:1-165:10. |
Atkinson, K. 1989. An Introduction to Numerical Analysis, 2 edition ed. Wiley, New York, Jan. |
Baraff, D., and Witkin, A. 1992. Dynamic simulation of non-penetrating flexible bodies. SIGGRAPH Comput. Graph. 26, 2 (July), 303-308. |
Barbic , J., and James, D. L. 2005. Real-time subspace integration for st. venant-kirchhoff deformable models. In ACM SIGGRAPH 2005 Papers, ACM, SIGGRAPH '05, 982-990. |
Barbic, J., and James, D. L. 2010. Subspace self-collision culling. ACM Trans. Graph. 29, 4 (July), 81:1-81:9. |
Barbic, J., and Popovic, J. 2008. Real-time control of physically based simulations using gentle forces. ACM Trans. on Graphics (SIGGRAPH Asia 2008) 27, 5, 163:1-163:10. |
Barbic , J., and Zhao, Y. 2011. Real-time large-deformation substructuring. In ACM SIGGRAPH 2011 Papers, SIGGRAPH '11, 91:1-91:8. |
Barbic, J., Da Silva, M., and Popovi'c, J. 2009. Deformable object animation using reduced optimal control. ACM Trans. Graph. 28, 3, 53:1-53:9. |
Barbic, J., Sin, F., and Grinspun, E. 2012. Interactive editing of deformable simulations. ACM Trans. Graph. 31, 4 (July), 70:1-70:8. |
Berrut, J.-P., and Trefethen, L. N. 2004. Barycentric lagrange interpolation. SIAM Review 46, 3, 501-517. |
Bingham, E., and Mannila, H. 2001. Random projection in dimensionality reduction: Applications to image and text data. KDD '01, 245-250. |
Bonet, D. J., and Wood, D. R. D. 2008. Nonlinear Continuum Mechanics for Finite Element Analysis. Cambridge University Press. |
Brown, P., and Saad, Y. 1990. Hybrid krylov methods for nonlinear systems of equations. SIAM Journal on Scientific and Statistical Computing 11, 3, 450-481. |
Capell, S., Green, S., Curless, B., Duchamp, T., and Popovi'c, Z. 2002. Interactive skeleton-driven dynamic deformations. SIGGRAPH '02, 586-593. |
Capell, S., Green, S., Curless, B., Duchamp, T., and Popovi'c , Z. 2002. A multiresolution framework for dynamic deformations. In Proceedings of the 2002 ACM SIGGRAPH/Eurographics Symposium on Computer Animation, SCA '02, 41-47. |
Chan, T., and Jackson, K. 1984. Nonlinearly preconditioned krylov subspace methods for discrete newton algorithms. SIAM Journal on Scientific and Statistical Computing 5, 3, 533-542. |
Choi, M. G., and Ko, H.-S. 2005. Modal warping: Real-time simulation of large rotational deformation and manipulation. IEEE Transactions on Visualization and Computer Graphics 11, 1 (Jan.), 91-101. |
Dasgupta, S. 2000. Experiments with random projection. In Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence, UAI '00, 143-151. |
García, F. G., Paradinas, T., Coll, N., and Patow, G. 2013. *cages:: A multilevel, multi-cage-based system for mesh deformation. ACM Trans. Graph. 32, 3 (July), 24:1-24:13. |
Golub, G. H., and Van Van Loan, C. F. 1996. Matrix Computations (Johns Hopkins Studies in Mathematical Sciences)(3rd Edition). Johns Hopkins University Press. |
Grinspun, E., Krysl, P., and Schröder, P. 2002. Charms: A simple framework for adaptive simulation. SIGGRAPH '02, 281-290. |
Harmon, D., and Zorin, D. 2013. Subspace integration with local deformations. ACM Trans. Graph. 32, 4 (July), 107:1-107:10. |
Hauser, K. K., Shen, C., and O'Brien, J. F. 2003. Interactive deformation using modal analysis with constraints. In Graphics Interface, CIPS, Canadian Human-Computer Commnication Society, 247-256. |
Hecht, F., Lee, Y. J., Shewchuk, J. R., and O'Brien, J. F. 2012. Updated sparse cholesky factors for corotational elastodynamics. ACM Trans. Graph. 31, 5 (Sept.), 123:1-123:13. |
Huang, J., Tong, Y., Zhou, K., Bao, H., and Desbrun, M. 2011. Interactive shape interpolation through controllable dynamic deformation. Visualization and Computer Graphics, IEEE Transactions on 17, 7 (July), 983-992. |
Hughes, T. J. R. 2000. The Finite Element Method: Linear Static and Dynamic Finite Element Analysis (Dover Civil and Mechanical Engineering). Dover Publications. |
James, D. L., and Pai, D. K. 2004. Bd-tree: Output-sensitive collision detection for reduced deformable models. SIGGRAPH '04, 393-398. |
Kavan, L., Sloan, P.-P., and O'Sullivan, C. 2010. Fast and efficient skinning of animated meshes. Computer Graphics Forum 29, 2, 327-336. |
Kharevych, L., Mullen, P., Owhadi, H., and Desbrun, M. 2009. Numerical coarsening of inhomogeneous elastic materials. ACM Trans. Graph. 28, 3 (July), 51:1-51:8. |
Kim, T., and James, D. L. 2009. Skipping steps in deformable simulation with online model reduction. ACM Trans. Graph. 28, 5 (Dec.), 123:1-123:9. |
Kim, T., and James, D. L. 2011. Physics-based character skinning using multi-domain subspace deformations. SCA 11, 63-72. |
Koh, W., Narain, R., and O'Brien, J. F. 2014. View-dependent adaptive cloth simulation. In Proceedings of the ACM SIGGRAPH /Eurographics Symposium on Computer Animation, 1-8. |
Kry, P. G., James, D. L., and Pai, D. K. 2002. Eigenskin: Real time large deformation character skinning in hardware. SCA '02, 153-159. |
Meyer, M., and Anderson, J. 2007. Key point subspace acceleration and soft caching. In ACM SIGGRAPH 2007 Papers, SIGGRAPH '07. |
Miller, K. S. 1981. On the inverse of the sum of matrices. Mathematics Magazine 54, 2, pp. 67-72. |
Müller, M., Dorsey, J., McMillan, L., Jagnow, R., and Cutler, B. 2002. Stable real-time deformations. SCA '02, 49-54. |
Nealen, A., MãcÂijller, M., Keiser, R., Boxerman, E., and Carlson, M. 2006. Physically based deformable models in computer graphics. Computer Graphics Forum 25, 4, 809-836. |
Nesme, M., Kry, P. G., Je{hacek over ( )}rábková, L., and Faure, F. 2009. Preserving topology and elasticity for embedded deformable models. SIGGRAPH '09, 52:1-52:9. |
Parlett, B. N. 1987. The Symmetric Eigenvalue Problem (Classics in Applied Mathematics). Society for Industrial and Applied Mathematics. |
Pentland, A., and Williams, J. 1989. Good vibrations: Modal dynamics for graphics and animation. SIGGRAPH '89, 215-222. |
Pernice, M., and Walker, H. 1998. Nitsol: A newton iterative solver for nonlinear systems. SIAM Journal on Scientific Computing 19, 1, 302-318. |
Saad, Y. 1981. Krylov subspace methods for solving large unsymmetric linear systems. Mathematics of Computation 37, 155, pp. 105-126. |
Shabana, A. A. 2005. Dynamics of Multibody Systems, third ed. Cambridge University Press. Cambridge Books Online. |
Shewchuk, J. R. 1994. An introduction to the conjugate gradient method without the agonizing pain. Tech. rep., Pittsburgh, PA, USA. |
Teng, Y., Otaduy, M. A., and Kim, T. 2014. Simulating articulated subspace self-contact. ACM Trans. Graph. 33, 4 (July), 106:1-106:9. |
Terzopoulos, D., Platt, J., Barr, A., and Fleischer, K. 1987. Elastically deformable models. SIGGRAPH Comput. Graph. 21, 4 (Aug.), 205-214. |
Von Tycowicz, C., Schulz, C., Seidel, H.-P., and Hildebrandt, K. 2013. An efficient construction of reduced deformable objects. ACM Trans. Graph. 32, 6 (Nov.), 213:1-213:10. |
International Search Report and Written Opinion for PCT/US2016/050073, International Searching Authority, dated Nov. 3, 2016. |
Mitchell, N. et al., GRIDiron: An interactive authoring and cognitive training foundation for reconstructive plastic surgery procedures, Jul. 2015. |
Number | Date | Country | |
---|---|---|---|
20180247158 A1 | Aug 2018 | US |
Number | Date | Country | |
---|---|---|---|
62213760 | Sep 2015 | US |