The present invention relates generally to the field of device model approximation. More particularly, the present invention relates to an efficient system and method for approximating analytical circuit device models.
Simulation is an indispensable verification step before committing an integrated circuit to an expensive manufacturing process. An important step in circuit simulation is model evaluation. Modern analytical models such as bsim3 and bsim4 have become more and more complicated and expensive to evaluate in simulation. The percentage of model evaluation time in circuit simulation can be as high as 70–80%. This number can grow to 90% if one includes simulation time spent in using all the individual device evaluations to generate the circuit equations. As a result, model evaluation has become a bottle neck in the improvement of circuit simulation efficiency.
It is becoming more and more difficult to maintain some degree of smoothness in the approximated analytical model due to its complexity. As a result, many resources have been spent in checking the smoothness of device models and the consistency between function values and their derivatives in both the Electronic Design Automation (EDA) and the semi-conductor industries.
A high-order accurate table model is a potential solution to the above problems. Table models provide polynomial approximations of analytical models based on table-grid interpolations of the original current-voltage and charge-voltage (i-v/q-v) curves. Most existing table models use spline interpolation based on a fixed stencil consisting of a fixed group of consecutive interpolation points. Fix stencil interpolations work well if the curve to be approximated is globally smooth. The problem associated with fixed stencil interpolations, however, is that for curves containing a C0 corner point (i.e. curves that are continuous but not differentiable at the comer point), fixed stencil interpolation of second or higher order accuracy will be necessarily oscillatory near the C0 comer point.
The impact of this oscillatory behavior can be illustrated using the example in
Besides causing non-diminishing oscillations, existing table model-based simulation methods also lead to unnecessary memory usage and inefficient memory access. This is because these methods construct and store all the measurement data entries (i. e. table entries) at the beginning of simulation. Storing all the data entries is not necessary since simulations usually only concentrates on “local” data entries surrounding one or more operating points. Moreover, these data entries may contain redundant information since boundaries of these data entries may overlap. Storing these redundant information is certainly unnecessary. Moreover, storing all the table entries at the beginning of simulation makes it impractical to build the table entries adaptively and accurately due to the huge memory consumption.
There is therefore a need to develop a method and system for obtaining smooth, accurate, and computationally efficient approximation of analytical device models.
The presentation invention includes a method for obtaining smooth and accurate approximations of analytical device models, comprising the steps of identifying a first set of measurement units; locating two or more sets of units that neighbor one or more of said measurement units; for each set of the two or more sets of neighbor units, obtaining the union of one or more of said sets of neighbor units and the first set of measurement units; calculating the smoothness of the analytical device model within one or more of said unions; and selecting at least one of said unions within which the analytical device model is the smoothest as the new set of measurement units.
The present invention also includes a method for obtaining smooth and accurate approximations of analytical device models in a (m+1)-dimensional manifold where m is an integer, comprising the steps of identifying a first set of measurement units in the (m+1)-dimensional manifold; locating two or more sets of units that neighbor one or more of said measurement units; obtaining, for each m-dimensional boundary of the (m+1)-dimensional manifold, an approximation of the analytical device model based on the two or more sets of neighboring points and the first set of measurement units; and blending the approximations obtained for each m-dimensional boundary of the (m+1)-dimensional manifold to produce an approximation of the analytical device model.
The present invention further includes a method for reducing memory usage during circuit simulations, comprising the steps of identifying one or more operating points; obtaining, for each of the one or more operating points, a set of measurement points surrounding said operating point; eliminating overlapping boundaries among each set of measurement points; and constructing a hash table for storing each set of measurement points.
One preferred embodiment of the present invention uses dynamic stencils to obtain smooth and accurate approximations of analytical device models. A flow chart of this embodiment is depicted in
The above approximation process can be performed in any subspace of the measurement space and the corresponding measurement units in the subspace can be of any shape. In one embodiment, the approximation process further comprises, before step 210, the steps of providing a partition of the measurement space, and for each partition, executing steps 210–260. The approximated analytical device model is obtained at step 280 based on the final set of measurement units for each partition.
Examples of the dynamic stencil scheme include the Essentially Non-Oscillatory (ENO) schemes, which are described by Harten et al. in [1], the contents of which is hereby incorporated in its entirety by reference. In one embodiment, the original charge (q) curve describing the analytical device model is described by the mapping q: V→R, where V is a m-dimensional manifold representing the measurement index and R is the set of real numbers representing the measurement value of the original curve. Measurement units and neighboring units are identified based on a measurement index contained in manifold V. The measurement index can be of any shape. In one embodiment, this index is a grid consisting of cells of equal or variable size. Measurement units and neighboring units thus obtain their value from selected grid-points on the grid.
The dynamic stencil grows based on the smoothness calculation of the original curve within sets of stencil points. In one embodiment, the smoothness of the original curve may be obtained by calculating the Newton divided difference of the original curve within the sets of stencil point. Specifically, when the index grid is one-dimensional, the measurement units become the “stencil points”. The smoothness calculation and the selection of stencil points can be described as follows:
Given a grid
define cells, cell centers, and cell sizes by
Denote Δv as the maximum cell size and represented it by Δv≡maxI≦i≦NΔv. The approximation problem can be defined as:
Given the cell average of the original curve q(v):
find a polynomial pi(v), of degree at most k−1, for each cell Ii, such that it is a k-th order accurate approximation to the function q(v)inside Ii, i.e.
pi(v)=q(v)+O(Δvk),vεIi,i=1, . . . , N. (1)
This gives approximations to the function at cell boundaries which are also -th order accurate, i.e.
Function pi(v)can be replaced by other simple functions, such as exponential functions [2]. For large-scale circuit simulation, polynomial may be the most efficient to evaluate.
The 0-th degree Newton divided differences of function are defined by
In general, the j-th degree Newton divided differences for j≧1, are defined inductively by
The Newton form of the k-th degree interpolation polynomial P(v), which interpolates q(v) at k+1 points starting from grid point
can be expressed using the divided differences by
We can take the derivative of P(v)to obtain p(v):
An important property of divided difference is:
for some ξ inside the stencil
as long as the function q(v) is smooth inside the stencil. Thus the divided difference is a measurement of the smoothness of the curve inside the stencil.
Based on these definitions, the dynamic selection of stencils can be described as follows. Suppose we want to find a stencil of k+1 consecutive points, which must include
such that q(v)is the “smoothest” in this stencil comparing with other possible stencils. This job may be performed in the preferred embodiment by the following steps. In each step only one point may be added to the stencil. Start with the two point stencil
The linear (1-th degree) interpolation on the stencil can be written in the Newton form as
At the next step, since the grid is one-dimensional, only two choices exist to expand the stencil by adding one point: either adding the left neighbor
resulting in the following quadratic interpolation
or the right neighbor
resulting in the following quadratic interpolation
Note that the derivation of the above two interpolations have the same element
multiplied by two different constants
These two constants are the two second degree divided differences of q(v)in two different stencils. Note that a smaller divided difference implies the function is “smoother” in that stencil. The selection of stencil can thus be performed by comparing the two relevant divided differences and picking the one with a smaller absolute value. Thus, if
The 3 point stencil will be selected as
Otherwise, the stencil will be selected as
The step can be continued, with one point added to the stencil at each step, until the desired number of points in the stencil is reached. In general, to obtain a m-th order interpolating polynomial, a stencil with m+1 points is required.
If the index grid is uniform (Δvi=Δv), undivided differences
can be used instead. In this case, the Newton formulae (1) and (2) for obtaining interpolating polynomials should also be adjusted accordingly. This saves computation time and reduces round-off errors.
It is shown in [1] that for a piecewise smooth function q(v), ENO-based method starting with a two point stencil has the following properties:
In practice, a wrong stencil may be selected due to a round-off error of the divided difference calculation. That is, when both divided differences are near zero, a small change at the round off level would change the direction of the inequality (1) and hence the stencil selection. To remedy this problem, a “biasing” strategy may be introduced to weigh the selection towards a preferred stencil. In one embodiment, the biasing strategy is implemented as follows.
First, identifying a preferred stencil
Next, replacing (3) by
i.e. if the left-most point ν
of the current stencil has not reached the left-most point v
of the preferred stencil; otherwise if
one replace (3) by
Where b>1 is the biasing parameter. Analysis in [3] indicates a good choice of the parameter is b=2. The philosophy is to stay as close as possible to the preferred stencil, unless the alternative is a factor b>1 better in smoothness.
Device models are usually described with charge and current functions with three voltages as independent variables. To best reflect the original model, the preferred embodiment uses multidimensional grids to describe the independent variables and uses multidimensional tables to store the grid indices and values of stencil points.
In obtaining approximation of device models in the general m-dimensional space Ω⊂Vm, enclosed by boundary δΩ, the most natural and computationally efficient extension of polynomial approximations to multi-dimension appears through the use of tensor products.
Let Dk, k=1, . . . , K be K non-overlapping general hexahedrals such that
where δDkk=Dk∩δΩ and δDki=Dk∩Di for i≠k, and there exists a diffeomorphism, Ψ:Dk→I, where I⊂Vm is the unit cube, i.e.I∈[−1,1]m. The introduction of the diffeomorphism ensures that a standard interpolation and blending scheme described in [2] can be applied in the unit cube. The existence of the diffeomorphism is guaranteed since a circuit simulator is always free to decompose the manifold. The non-overlapping Dk is usually rectangular in shape with different size. This is because devices have fine local behavior or sharp transitions such as the sub-threshold behavior of MOSFETS. In order to resolve sharp transitions and keep the table size small, a non-uniform Dk may be used, i.e. smaller sizes are used at sharp transition regions and bigger sizes at smooth regions.
For simplicity, Dk is denoted as D with boundary δD, referring to any rectangular grid table cell. The coordinates of D will be denoted as (v1, v2, v2) or (ξ,η,ζ), interchangeably. Define
through the subsets
Associated with these sets are nodal sets
with the global nodal set,
ΛL(ΞL)=ΛL=ΛLξ×ΛLη×ΛLζ
The nodal sets are assumed to be ordered such that ξ0=η0=ζ0=−1, ξL
Likewise, for the construction of the polynomial approximation inside the domain I, introduce the global set
with the corresponding nodal sets
For simplicity, the notation
and likewise were used for various combinations of the sets.
To establish a one to one mapping between the unit cube and the general rectangle-shaped cell, a global map Ψ:Dk→I, may be constructed in one embodiment using the so-called transfinite blending functions, which are described in [2], the content of which is hereby incorporated in its entirety by reference. The map x=Ψ−1 (ξ) in one embodiment may be derived from the Boolean sum
and likewise for Pη(q),Pζ(q), are denoted as “face projectors”. The “shape functions”
associated with these face projectors are nothing more than the interpolating ENO polynomials based on nodal sets
respectively. Using the properties of projectors, the “edge projectors” become
and likewise for Pηζ(q),Pξζ(q). The “vertex projector” becomes
To simplify things further, the following transfinite blending function may also be applied to construct the face projectors.
and similarly for other face projectors. With the above reconstruction procedure, it is guaranteed that the device model approximations are continuous over all the interfaces between cells. This is because they are constructed uniquely from the node points and some neighboring points on the plane. By using the previously-mentioned biasing scheme in ENO approximation, the derivatives are also continuous in smooth regions where the ENO approximation in each dimension of the gird chooses the points on the same side.
With the above formulations, the inverse of the global map can be constructed in a preferred embodiment as follows. Assuming the shape functions are linear, i.e.
Lξ=Lη=Lζ=1,
ΛLξ=ΛLn=ΛLζ={−1,1}
N0ξ=(1−ξ)/2,N1ξ=(1+ξ)/2
and similarly for the other shape functions:
Thus, to construct the map a parametric form is established for the edges enclosing D, e.g.
Where Liα(ξ) is the ENO approximating polynomial based on the nodal set ΛNξ, employed in the unite cube I. For a through discussion of transfinite blending functions and their properties, please refer to [2].
Though computation required in table model evaluation is just a small percentage of analytical device model evaluation, what could cause inefficiency is the time for gathering the multi-dimensional table data. To solve this problem, one embodiment uses a cell-based data storage scheme that achieves efficient table model evaluation.
The scheme is described here assuming the grid is two-dimensional (2-D). In other embodiments, this scheme is extended to other dimensions. The building block of 2-D table model storage scheme consists of four neighboring cells spanning in both dimensions, as shown in
With this scheme, the left-top cell contains all the information it needs for table evaluation. The right-top and the left-bottom cells only need to access one neighboring building block, as shown by the arrows in
In one embodiment, a dynamic table construction strategy is used to further reduce the overlap. Specifically, when model evaluation is required at an operating point, only the building blocks containing the operating point and some neighboring building blocks are constructed. This leads to big savings in memory in many cases. The building blocks may be stored in a hash table with efficient look-up schemes for easy and fast access. Keys of the hash table may be integers representing operating points. Values of the hash table can be build blocks containing operating points. Hash functions can be developed using well-known algorithmic methods to ensure a one-to-one mapping between the keys and values of the hash table and efficient look-up.
The building blocks or grid cells may be of either uniform or non-uniform size. For evaluation of electronic device models, the grid is non-uniform in nature since electronic devices, such as transistors, may exhibit sharp transition behavior in certain grid regions and finer cells are required to reflect these sharp transitions.
In one embodiment, it possible to combine ENO approximating of different orders is combined on different edges. For example, a third-order approximation in the exponential region of BJTs while using quadratic or even linear approximation in other regions may be applied. With the use of transfinite blending functions, the approximation is still smooth across shared boundaries.
One embodiment has been tested in the simulation of thousands of digital, analog, and mixed-signal circuits. Results show that the method described above turns out to be very robust and does not have convergence problems.
The speed and accuracy of the ENO-based table model method described above can be further demonstrated by additional simulations.
While the above invention has been described with reference to certain preferred embodiments, the scope of the present invention is not limited to these embodiments. One skilled in the art may find variations of these preferred embodiments which, nevertheless, fall within the spirit of the present invention, whose scope is defined by the claims set forth below.
Number | Name | Date | Kind |
---|---|---|---|
4890242 | Sinha et al. | Dec 1989 | A |
5465323 | Mallet | Nov 1995 | A |
5542030 | Gutfinger | Jul 1996 | A |
5793371 | Deering | Aug 1998 | A |
5844564 | Bennis et al. | Dec 1998 | A |
5923329 | Beale | Jul 1999 | A |
6208939 | Kunii | Mar 2001 | B1 |
6256038 | Krishnamurthy | Jul 2001 | B1 |
Number | Date | Country | |
---|---|---|---|
20040260523 A1 | Dec 2004 | US |