Traditionally, the design of an analog integrated circuit is completed by solving analytic equations that link design parameters with performance metrics. To automate the sizing of the components (transistors, capacitors, inductors, etc.) of an analog circuit, multi-objective optimization problems are formulated with analytic equations. The generated Pareto fronts provide a means to analyze the tradeoffs in circuit performance. However, with technology scaling, the knowledge-based approaches are limited, as additional effort is required to tune the circuit to resolve any mismatch between theoretically optimized results and simulation results.
Simulation-based approaches emerge as a substitute that addresses the challenges associated with knowledge-based optimization methods. Data mining and machine learning techniques may be used to extract modeling and design information from simulation data in a bottom-up approach. Representative techniques include stochastic pattern search, Bayesian optimization, and deep neural networks. Prior work has shown that simulation-based methods are successful in the design of analog circuits. However, improvements are needed with regard to the following.
Sample efficiency: Simulation-based methods rely on real-time sampling and optimization with simulation tools. The slow numerical solvers used for simulation limit the size of the dataset. To improve sample efficiency, a technique that samples from high-dimensional black-box functions with Duchon pseudo-cubic splines has been proposed. Another solution proposes using Bayesian neural networks that approximate the Pareto front with a reduced number of samples. Reducing the number of samples required by the optimization process to shorten the design time remains an open challenge.
Specification-driven design considerations: Based on the circuit requirements, analog design specifications may be grouped into two categories: 1) figure of merit (FoM) constraints that require optimization, and 2) hard constraints that must only be sufficiently met. As an example, power is treated as an FoM constraint when a design priority is to minimize the power consumption. In contrast, power consumption is treated as a hard constraint, specifically a power budget, when other circuit metrics are more critical.
FoM constraints are commonly optimized by regression models. In practice, a limited number of circuit performance metrics are considered FoM constraints, for two primary reasons. First, when less important metrics are overemphasized, the search space is narrowed unnecessarily, which results in a more difficult or even infeasible search. Second, when more than two metrics are concurrently considered as FoM constraints, the Pareto fronts generated by multi-objective optimization algorithms such as NSGA-II may be hard to visualize and apply. Tradeoff curves between two circuit metrics are meaningful only when the remaining specification-based metrics are satisfied.
In practice, specifications are often listed in the form of hard constraints, where the objective is to meet the set of target values. Applying classification to predict whether a candidate design point satisfies the specifications is well suited for analysis with hard constraints. In one approach, support vector machines (SVMs) are introduced to classify the performance space of analog circuits. One-class classifiers are favored over two-class classifiers as the latter suffers from a large dimensionality of the parameter space. Specifically, the proportion of design points that yield the desired performance parameters is likely to be small in an initial randomly sampled dataset. The dimensionality of the design space, therefore, limits the application of binary classifiers. Additionally, classifiers may only be applied for the analysis of the circuit performance space rather than for the design of the circuit.
Interpretability: Techniques that automate the design of a circuit must be interpretable and easy to use such that human efforts to apply the tools and algorithms are minimized. The black-box models and complex decision processes used by existing methods may not be interpretable. Beyond the generation of design solutions, information such as performance tradeoffs, design space partitioning information, and importance rankings (sensitivity analysis) of design variables provides utility.
The automation of analog circuit design has drawn particular interest among the research community. The synthesis flow of an analog circuit consists of topology selection, component sizing, and physical design. The sizing of components, which includes both passive and active devices, is a critical step that ensures the selected circuit topology satisfies the target specifications. More recently, machine learning has been explored as a means to facilitate the optimization of the sizing of an analog circuit.
The goal of applying machine learning is to learn and develop models to map from the design space to the performance space. A multi-label regression or classification problem is formulated, which is then solved by optimization algorithms. Classifiers may be applied to predict whether a design point satisfies the provided specifications. However, the interdependence among output labels has not been fully explored or used. The prediction models trained for the sizing of an analog circuit are improved by accounting for the relationships among the circuit performance metrics.
Another challenge for the automation of analog circuit design is the proper consideration of the effects that variations have on the output performances. Analog integrated circuits are sensitive to both inter-chip variations introduced by the fabrication process and intra-chip variations resulting from the discrepancy among parameters of individual transistors, such as deviations in the oxide thickness or the threshold voltage. In addition, during circuit operation, environmental effects including changes in temperature result in deviations in the performance of a circuit. With the variations in circuit parameters resulting in yield loss or improper operation, compensating for the effects of the variations increases the design complexity.
Therefore, design methodologies may account for the effects caused by circuit variations, while limiting any increase in design complexity. In past solutions, multiple variation-aware analog circuit sizing frameworks have been proposed. Direct optimization methods target maximizing the yield characterized by Monte Carlo analysis, which is an effective approach to simulate and model the effects due to variations. Probability density functions are generated from the density estimates of the Monte Carlo samples. However, the use of the Monte Carlo method is computationally expensive. Corner analysis is another approach for the characterization of the effects on circuit performance due to variations. In digital circuits, corner analysis is applied specifically to account for the effects of variations on timing and power consumption. Typically, in addition to temperature and voltage, five process corners may be considered: typical-typical (TT), fast-fast (FF), slow-slow (SS), slow-fast (SF), and fast-slow (FS). The utilization of electronic design automation (EDA) tools such as Cadence allows for the numerical simulation of circuit performances at different process corners and temperatures. Design solutions are considered robust if the specifications are satisfied for all corner cases, or fall within a certain standard deviation from the specifications for all corner cases. Designing for the worst case guarantees robustness but usually results in over-design or renders the problem infeasible to implement. Another approach is design centering, which selects design solutions that are farthest from the specification boundaries so that process and environmental variations are tolerated.
The inventors propose a simulation-based optimization framework that sizes analog circuit components to meet the design specifications while constraining the variations in the performance of the circuit across all corners of interest within a set bound. Classifier chains are used that represent the relationships among output parameters to improve the model accuracy and to provide additional design insight after the completion of the automated sizing methodology.
The figures supplement the description.
1. Classification with Adaptive Labeling Thresholds for Analog Circuit Sizing
To address the limitations of existing techniques, the inventors developed a batch-mode online method to design analog integrated circuits through classification with adaptive labeling thresholds (CALT). The method may be an improvement over the art for the following reasons: 1) the application of classifiers for both the modeling of the performance space and the sizing of an analog circuit, 2) the use of interpretable tree-based algorithms for surrogate modeling, and 3) a strategy to adaptively set the labeling thresholds for the training of the classifiers such that the lack of positively labeled data is resolved.
1.1 Proposed Methodology
With CALT, the sizing of the components of an analog circuit is performed by the sequential completion of two tasks: multi-output classification for performance modeling of a circuit, and optimization for the generation of the component sizes for the circuit.
1.2.1 Classification with Adaptive Labeling Thresholds
Given the problem of sizing the components of an analog circuit, denote the design space as X ⊆ R^d and the performance space as Y ⊆ R^k. Initially, a dataset U = ((x^(1), y^(1)), . . . , (x^(n), y^(n))) ∈ (X × Y)^n is sampled from the design space. Latin Hypercube Sampling (LHS) is applied, where LHS is a Monte Carlo method that provides a quasi-random sampling distribution. For a pre-specified sample size n, the range of each design variable is partitioned into n equal intervals, and the points are drawn so that each interval of each variable contains exactly one sample.
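The LHS step described above may be sketched as follows. This is a minimal illustration using only the Python standard library; the function name `latin_hypercube`, the seed, and the two-variable bounds are assumptions made for the example and are not taken from the source.

```python
import random


def latin_hypercube(n, bounds, seed=0):
    """Draw n points so that, in every dimension, each of the n equal-width
    intervals of the variable's range contains exactly one sample."""
    rng = random.Random(seed)
    columns = []
    for low, high in bounds:
        # One uniform draw inside each of the n strata of the unit interval.
        strata = [(i + rng.random()) / n for i in range(n)]
        # Shuffle so the strata are paired randomly across dimensions.
        rng.shuffle(strata)
        columns.append([low + u * (high - low) for u in strata])
    # Transpose the per-dimension columns into a list of n design points.
    return [tuple(col[j] for col in columns) for j in range(n)]


# Hypothetical two-variable design space: a bias voltage (V) and an inductance (nH).
samples = latin_hypercube(5, [(0.0, 1.2), (0.01, 12.0)])
for point in samples:
    print(point)
```

Shuffling each dimension independently is what distinguishes this from a regular grid: every one-dimensional projection is evenly stratified while the joint placement stays random.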
After the initial dataset is generated, binary labels are assigned to each data point for each circuit performance metric based on whether a target threshold is met. The labeled space is denoted as Ŷ ⊆ {±1}^k. The objective then becomes to train a classifier h_k: X → Ŷ_k for the kth circuit performance metric that, given a new instance x ∈ X, predicts ŷ_k = h_k(x) ∈ Ŷ_k. A multi-output classification problem is, therefore, formulated.
A possible choice for the labeling threshold is the design specification. However, if the dimensionality of the design space is large, the initial dataset is unlikely to contain sufficient data points with positive labels for training. Instead, for a target specification, the labeling threshold is set to the εth percentile of the distribution of a given circuit performance metric in the dataset U when the specification is a lower bound, and to the (100−ε)th percentile of the distribution when the specification is an upper bound. If the corresponding specification is less stringent than the percentile value, the dataset contains enough positively labeled data points and the threshold is, therefore, set to the specification. Given the design specification set S ⊆ R^k for s ∈ S, the labeling threshold set T ⊆ R^k for t ∈ T is generated as given by
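A minimal sketch of one way to realize this thresholding rule is given below. It assumes a linearly interpolated percentile and treats each specification as either a lower bound (metric ≥ s) or an upper bound (metric ≤ s); the helper names `percentile`, `adaptive_threshold`, and `label` are hypothetical, as are the sample values.

```python
def percentile(values, q):
    """Linearly interpolated q-th percentile, 0 <= q <= 100 (an assumed convention)."""
    data = sorted(values)
    pos = (len(data) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


def adaptive_threshold(values, spec, kind, eps=95):
    """Return the labeling threshold t for one performance metric.

    kind == "lower": specification of the form metric >= spec.
    kind == "upper": specification of the form metric <= spec.
    """
    if kind == "lower":
        # Use the percentile unless the specification is already easier to meet.
        return min(spec, percentile(values, eps))
    return max(spec, percentile(values, 100 - eps))


def label(values, threshold, kind):
    """Assign +1 / -1 labels against the threshold."""
    if kind == "lower":
        return [1 if v >= threshold else -1 for v in values]
    return [1 if v <= threshold else -1 for v in values]


# Hypothetical sampled gains (dB); no sample reaches a 30 dB specification.
gains = list(range(21))
t_gain = adaptive_threshold(gains, spec=30, kind="lower")
labels = label(gains, t_gain, "lower")
print(t_gain, labels.count(1))
```

As the dataset grows with points closer to the specification, the percentile value moves toward the specification and the threshold eventually coincides with it.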
Precision and Recall may be used to evaluate the performance of the classifiers, which are defined as follows:
Precision=# of true positives/# of positive predictions EQ. 1.1
Recall=# of true positives/# of positive instances EQ. 1.2
Combining Precision and Recall results in the F1-score, which is used as a single metric that evaluates the performance of a classifier, as given by Equation 1.3.
F1-score=2×(precision×recall)/(precision+recall) EQ. 1.3
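Equations 1.1 through 1.3 may be computed as in the following sketch for labels in {+1, −1}. The function name and the example label vectors are illustrative assumptions; returning 0 when a denominator is zero is also an assumed convention.

```python
def precision_recall_f1(y_true, y_pred):
    """Compute Precision (EQ. 1.1), Recall (EQ. 1.2), and F1-score (EQ. 1.3)."""
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    predicted_pos = sum(1 for p in y_pred if p == 1)
    actual_pos = sum(1 for t in y_true if t == 1)
    # Guard against empty denominators (assumed convention: report 0.0).
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    recall = true_pos / actual_pos if actual_pos else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical labels for eight design points.
y_true = [1, 1, 1, -1, -1, -1, -1, -1]
y_pred = [1, 1, -1, 1, -1, -1, -1, -1]
print(precision_recall_f1(y_true, y_pred))
```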
1.2.2 Applying Random Forest for Classification
Decision tree (DT) algorithms may be applied to map from the circuit specifications to the circuit topology by using past designs as reference. In this work, DT-based algorithms are used due to the following advantages:
To train a decision tree, the Gini index G_I is applied as the node splitting criterion, which is defined as:
G_I = 1 − Σ_i f(i)^2 EQ. 1.4
where f(i) is the fraction of the samples at the node that belong to class i.
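As an illustration of Equation 1.4, the sketch below computes the Gini index of a node and the size-weighted Gini index of a candidate split, which is the quantity a tree learner typically minimizes when choosing a split. The function names are hypothetical.

```python
from collections import Counter


def gini_index(labels):
    """Gini index of a node: 1 minus the sum of squared class fractions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_gini(left, right):
    """Size-weighted Gini index of the two children produced by a split."""
    n = len(left) + len(right)
    return (len(left) * gini_index(left) + len(right) * gini_index(right)) / n


print(gini_index([1, 1, -1, -1]))      # evenly mixed node
print(split_gini([1, 1], [-1, -1]))    # split that separates the classes
```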
Ensemble techniques are applied to reduce model overfitting, which results from using single tree models. In the inventors' work under this section, the random forest algorithm is used, which draws samples with replacement from the dataset for the training of a bag of deep trees with a subset of the features. The final prediction is obtained by averaging the individual predictions produced by the models, as given by
The execution of the random forest algorithm provides the importance ranking of the design variables. During each iteration of bootstrap training, a single tree model is trained from the bootstrap samples and tested with the remaining samples. The comparison of the samples results in an out-of-Bag (OOB) error. The average of the OOB errors from all runs of bootstrap training is an estimate of the performance of the ensemble. Through random permutations of a feature set, the importance of a design parameter is determined by characterizing the impact of the changes on the OOB error, as described by the pseudocode provided as
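The permutation idea behind the importance ranking may be sketched as follows. For brevity, this example scores a fixed stand-in model on held-out points rather than scoring each tree of a random forest on its own out-of-bag samples; the stand-in model, the data, and the function names are assumptions made for illustration only.

```python
import random


def error_rate(predict, X, y):
    """Fraction of points whose predicted label differs from the true label."""
    return sum(1 for xi, yi in zip(X, y) if predict(xi) != yi) / len(y)


def permutation_importance(predict, X, y, n_repeats=10, seed=0):
    """Mean increase in error when one feature column is randomly permuted."""
    rng = random.Random(seed)
    baseline = error_rate(predict, X, y)
    importances = []
    for j in range(len(X[0])):
        increase = 0.0
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)
            # Rebuild the rows with only feature j permuted.
            X_perm = [row[:j] + (v,) + row[j + 1:] for row, v in zip(X, column)]
            increase += error_rate(predict, X_perm, y) - baseline
        importances.append(increase / n_repeats)
    return importances


# Stand-in model that depends only on the first design variable.
model = lambda x: 1 if x[0] > 0.5 else -1
X = [(i / 10, ((7 * i) % 10) / 10) for i in range(10)]
y = [model(x) for x in X]
importances = permutation_importance(model, X, y)
print(importances)
```

A feature the model ignores keeps an importance of zero, while a feature that drives the prediction shows a positive increase in error once its values are scrambled.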
1.2.3 Optimization-Based Active Querying
After the classifiers for each performance metric are trained, qualified designs are determined from the intersection of the feasible regions of all models. A multi-objective search is executed for each iteration of the simulation loop to search for points such that the predicted probability scores of all models are simultaneously maximized. The candidate solutions are given as:
x* ∈ argmax_x (p_1(x), . . . , p_k(x)) EQ. 1.5
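Because Equation 1.5 maximizes several probability scores at once, its solutions form a set of non-dominated points rather than a single optimum. The sketch below shows only the non-dominated filtering step over already-evaluated candidates; it does not implement the search itself, and the candidate names and scores are hypothetical.

```python
def dominates(a, b):
    """True if score vector a is at least as good as b everywhere and better somewhere."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(candidates):
    """Keep the candidates whose score vectors are not dominated by any other.

    candidates: list of (design_point, scores) pairs, where scores holds the
    predicted probabilities p_1(x), ..., p_k(x) for that design point.
    """
    return [
        (x, s)
        for x, s in candidates
        if not any(dominates(t, s) for _, t in candidates)
    ]


# Hypothetical candidates scored by two classifiers.
candidates = [
    ("a", (0.90, 0.20)),
    ("b", (0.60, 0.70)),
    ("c", (0.50, 0.60)),
    ("d", (0.20, 0.95)),
]
front = pareto_front(candidates)
print([name for name, _ in front])
```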
1.2.4 Summary of the Design Flow of CALT
As shown in
1.3 Simulation Results
CALT is applied to the design of an inductively degenerated differential low noise amplifier (LNA), which is shown in
The design set may include nine variables: the sizes of the inductors Lg1, Ld1, and Ls1, the widths of transistors M1, M3, and M5, the size of capacitor Cg1, and the biasing voltages Vb1 and Vb3. Due to the symmetry of the differential structure, the remaining variables are set to the same values as the corresponding counterparts. The transistor length is set to the minimum of 65 nm. The performance set includes the power gain, noise figure (NF), third-order intercept point (IP3), and power consumption. The target design variables are constrained as:
60 nm≤transistor widths≤900 μm,
0.01 nH≤inductor sizes≤12 nH,
30 fF≤capacitor sizes≤20 pF, and
0 V≤biasing voltages≤1.2 V EQ. 1.6
Two different sets of design specifications are targeted, the first given as Specification Set 1:
Gain≥10 dB,
NF≤3 dB,
IP3≥−5 dBm, and
Power≤10 mW EQ. 1.7
and the second given as Specification Set 2:
Gain≥14 dB,
NF≤2.8 dB,
IP3≥−5 dBm, and
Power≤20 mW EQ. 1.8
An initial dataset of 1000 points is sampled with LHS. After verifying that the dataset contains no points that satisfy all of the specifications, CALT is executed to solve for the nine design variables for both sets of target specifications, where ε, as described herein, is set to 95. Five runs of CALT are executed for each of the target specification sets. A summary of the results is provided in
As indicated by the results listed in
The total execution time for CALT consists of the time for initial sampling, offline model training and optimization, and verification through online simulation. As listed in
The data plotted in
The F1-scores of the classifiers for each of the four performance metrics when solving for Specification Set 2 are shown in
After sizing the components of the LNA with CALT, importance rankings of the design variables are extracted from the random forest models, as shown in
As a final step, decision trees are trained with the final dataset. A tree for NF prediction trained with the final dataset generated from completion of the CALT sizing methodology on Specification Set 2 is shown in
1.4 Discussion
If the topology and technology node are fixed, the design space of an analog circuit may also be fixed. The necessary partitioning details of the design space are, therefore, learned by CALT from simulation data. With the binary classifiers, decision boundaries between feasible and infeasible regions are identified for a given specification. The optimizations are used to search for design points in the common feasible regions of all models. As new design points are actively queried, more information on both the design space and the performance space is gathered. The performance of the classifiers, therefore, improves, which results in the convergence to a design solution.
Fine-tuning of the surrogate models is performed with the proposed closed-loop learning system. The dataset determined during the final iteration of the sizing flow is considered as the minimum required for convergence to a design solution. The CALT framework is driven by the circuit specifications, which allows for customized designs of analog circuits, where the specifications are adjusted based on the design needs.
2. Introduction & Background—Variation-Aware Analog Circuit Sizing with Classifier Chains
2.1 Introduction
The inventors propose a simulation-based optimization framework that sizes analog circuit components to meet the design specifications while constraining the variations in the performance of the circuit across all corners of interest within a set bound. Classifier chains are used that represent the relationships among output parameters to improve the model accuracy and to provide additional design insight after the completion of the automated sizing methodology.
2.2 Proposed Methodology
For the component sizing of an analog circuit, the design space is denoted as X ⊆ R^d and the performance space as Y ⊆ R^k. Assume an initial dataset U = ((x^(1), y^(1)), . . . , (x^(n), y^(n))) ∈ (X × Y)^n is randomly sampled from the design space. Binary labels are then assigned to each selected data point for each circuit performance metric based on whether a target threshold is met. The labeled space is denoted as Ŷ ⊆ {±1}^k. One classifier is trained to map h_k: X → Ŷ_k for the kth circuit performance metric. Therefore, a multi-label classification problem is formulated. A technique may use adaptive labeling thresholds to train the classifiers. The procedure to determine the set of performance metric thresholds T from the provided specification set S is given by
2.2.1 Variation-Aware Circuit Sizing
When considering the effects of variations on circuit performance, simulations for each design point are acquired at each corner of interest. The standard deviations of the performance variations across all of the corners for each design point are then calculated. Design points with performance fluctuations that fall below the set threshold Tthre of the standard deviation are assigned with positive labels, while all other points are assigned negative labels. The pseudo-code to set the robustness labels is given by
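The labeling rule described above may be sketched as follows. The example assumes the population standard deviation and treats a value equal to the threshold as acceptable; the data layout, metric names, and limits are hypothetical.

```python
from statistics import pstdev


def robustness_labels(corner_results, sigma_limits):
    """Label each design point +1 / -1 per metric.

    corner_results: one dict per design point, mapping a metric name to the
        list of values simulated at each corner of interest.
    sigma_limits: mapping from metric name to the allowed standard deviation.
    """
    labels = []
    for point in corner_results:
        labels.append({
            metric: 1 if pstdev(point[metric]) <= limit else -1
            for metric, limit in sigma_limits.items()
        })
    return labels


# Two hypothetical design points, each simulated at three corners.
results = [
    {"gain": [10.0, 10.5, 9.5], "power": [15.0, 16.0, 17.0]},
    {"gain": [8.0, 12.0, 10.0], "power": [10.0, 20.0, 30.0]},
]
limits = {"gain": 1.0, "power": 5.0}
labels = robustness_labels(results, limits)
print(labels)
```

The resulting labels can be appended to the performance labels so that robustness is learned by the same kind of classifier as the nominal specifications.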
2.2.2 Classifier Chains
For a multi-label classification problem, the traditional approach is to train one binary classifier for each labeled target performance metric as shown in
Herein classifier chains may be adopted to model the interdependencies among the outputs. A representation of a classifier chain is shown in
With classifier chains, the correlation among target labels is considered. The order in which the classifiers are organized is a key parameter that affects the performance of the model. The first model in the chain is a classifier trained on the original input features, while additional output features are included as training features in the remaining models of the chain.
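A minimal sketch of the chaining mechanism is shown below. A one-nearest-neighbor classifier is used purely as a stand-in base learner to keep the example self-contained; the class names, the chain order, and the toy data are assumptions made for illustration.

```python
class NearestNeighbor:
    """Stand-in base classifier (1-NN), used here only for illustration."""

    def fit(self, X, y):
        self.X, self.y = list(X), list(y)
        return self

    def predict_one(self, x):
        dists = [sum((a - b) ** 2 for a, b in zip(x, xi)) for xi in self.X]
        return self.y[dists.index(min(dists))]


class ClassifierChain:
    def __init__(self, order, make_base=NearestNeighbor):
        self.order = order          # label indices in chain order
        self.make_base = make_base  # factory for one base classifier per link

    def fit(self, X, Y):
        self.models = []
        for pos, lab in enumerate(self.order):
            # Inputs are augmented with the true labels of all earlier links.
            X_aug = [
                tuple(x) + tuple(y[prev] for prev in self.order[:pos])
                for x, y in zip(X, Y)
            ]
            self.models.append(self.make_base().fit(X_aug, [y[lab] for y in Y]))
        return self

    def predict_one(self, x):
        predicted = {}
        features = tuple(x)
        for lab, model in zip(self.order, self.models):
            predicted[lab] = model.predict_one(features)
            # Pass this link's prediction on to the next link as a feature.
            features = features + (predicted[lab],)
        return [predicted[lab] for lab in sorted(predicted)]


# Toy data: one design variable, two labels; label 1 is predicted first.
X = [(0.0,), (1.0,), (2.0,), (3.0,)]
Y = [(-1, -1), (-1, 1), (1, 1), (1, 1)]
chain = ClassifierChain(order=[1, 0]).fit(X, Y)
print(chain.predict_one((1.0,)))
```

During training each link sees the true labels of the earlier links, whereas during prediction it sees their predicted labels, which is why errors made early in the chain can propagate to later links.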
2.2.3 Ensemble of Classifier Chains (ECC)
An ensemble improves the prediction accuracy of a model and reduces model overfitting. When considering an ensemble of classifier chains, a total of m chains are trained, which are denoted as C1, C2, . . . , Cm. If enumeration of all the possible orders of the chain is possible, then m=k! classifier chains are trained, where k is the number of output labels. When m<k!, each classifier chain in the ensemble is trained with a random ordering of individual classifiers. The predictions are summed and averaged for each label. A threshold is used to determine the final predicted labels. The pseudo-code for prediction by the ensemble of classifier chains is given by
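The ordering and voting steps described above may be sketched as follows. The threshold of 0.5, the function names, and the example scores are assumptions; the per-chain scores would come from the trained chains.

```python
import random


def random_orders(k, m, seed=0):
    """Draw m random orderings of the k label indices, one per chain."""
    rng = random.Random(seed)
    orders = []
    for _ in range(m):
        order = list(range(k))
        rng.shuffle(order)
        orders.append(order)
    return orders


def ensemble_predict(chain_predictions, threshold=0.5):
    """Average the per-label scores of all chains, then threshold each label.

    chain_predictions: one list per chain holding a score (0/1 vote or
        probability) for every label, all in the same label order.
    """
    m = len(chain_predictions)
    k = len(chain_predictions[0])
    averaged = [sum(pred[j] for pred in chain_predictions) / m for j in range(k)]
    labels = [1 if score >= threshold else -1 for score in averaged]
    return averaged, labels


orders = random_orders(k=4, m=10)
# Hypothetical scores from three chains for three labels.
averaged, labels = ensemble_predict([
    [0.9, 0.2, 0.6],
    [0.7, 0.4, 0.4],
    [0.8, 0.3, 0.8],
])
print(labels)
```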
1.2.4 Design Flow for Variation-Aware Component Sizing with Classifier Chains
The classifiers for predicting the circuit parameters are trained on a ‘default’ standard corner, which was set to TT at 20° C. in this work. The classifiers for predicting robustness are trained with the standard deviations calculated for all corners (e.g., process, voltage, and temperature) and for each design point (selected component sizes).
After the training of the ensemble of classifier chains, for each iteration of the design loop shown in
x* ∈ argmax_x (p_1(x), . . . , p_k(x), r_1(x), . . . , r_k(x)) EQ. 2.1
The proposed framework is applied to the design of a differential low-noise amplifier (LNA), which is shown in
60 nm≤transistor widths≤900 μm,
0.01 nH≤inductor sizes≤12 nH,
30 fF≤capacitor sizes≤20 pF, and
0 V≤biasing voltages≤1.2 V EQ. 2.2
The specifications for the performance metrics of the circuit are given as:
Gain≥10 dB,
NF≤2.8 dB,
IP3≥−5 dBm, and
Power≤20 mW EQ. 2.3
and the robustness requirements of the circuit are given as
σGain≤1 dB,
σNF≤0.5 dB,
σIP3≤1 dBm, and
σPower≤5 mW EQ. 2.4
The initial dataset contains 1000 design points. 15 corner cases are simulated for each design point, which are given by the combination of the three temperatures of 20° C., 80° C., and 120° C., and the five process corners of TT, FF, SS, SF, and FS. The TT process corner at 20° C. is considered as the default standard case, while the remaining corner cases are acquired primarily for characterizing the variations in the performance of the circuit. For the initial acquired dataset, the distribution of standard deviations of the variations in the performance across the 15 corner cases is shown in
After the initial dataset is acquired, the design flow is executed with the adaptive labeling threshold method applied to the performance specifications. The term, as defined in
Classifier chains are first trained only on data from the standard corner case without considering variations. For the four circuit performance metrics, there are 4! (24) combinations of possible orderings of the chain. Classifier chains with all 24 possible combinations are trained. The results from the characterization of the performance of the models are shown in
From the results shown in
Among the six optimal classifier chains, the prediction of the noise figure is always placed last in the chain, which indicates that the prediction of the noise figure is the least accurate and provides the least information to the training of the other classifiers. The classifier chain provides higher performance when models with the highest confidence are placed first, while inaccurate models are placed near the end of the chain so that any error is minimally propagated along the chain.
Including an additional four classifiers to account for the prediction of the robustness of the circuit to variation, the chain now consists of eight classifiers. As the enumeration of 8! combinations to identify the optimal chain sequence is computationally expensive, 10 random combinations of the eight classifiers are trained. Execution of
The design flow, as depicted in
2.2.4 Discussion
With the proposed methodology, in addition to training four classifiers to predict the four performance specifications of the circuit, four additional constraints are added to predict the variations in the performance parameters across different corner cases of interest. The adoption of classifier chains allows for the modeling of the interdependencies among output labels. Based on the results of the performance of the models shown in
Two implementations of classifier chains are presented. The first approach is to enumerate all of the possible orders of the chain and adopt the optimal order of classifiers. Execution of the first approach is feasible when the number of output labels is small (i.e., four). When the number of output labels is large, the number of possible orderings of a classifier chain grows factorially (for example, 8! = 40,320 orderings for eight labels), which indicates that the second approach of applying an ensemble of randomly selected chains is a better option. The number of individual classifiers required when either the binary relevance or classifier chain technique is applied scales linearly with the size of the label set. Therefore, there is no overhead in computational resources required to train classifier chains. However, during the optimization phase, each seed requires m times more functional evaluations when the ensemble is used as compared to applying only the best chain, where m is the number of classifier chains in the ensemble.
Among the generated candidate solutions, the plot of the standard deviation of the power consumption variation as a function of the power consumption is shown in
Corner analysis is based on the assumption of a fixed value in the variation of a physical parameter pi. The relation between the variation in a physical parameter and the variation in a circuit performance parameter y is given by:
The sensitivity term, ∂y/∂pi, varies for each performance metric as the sizes of the components and the bias voltages differ. Therefore, in the ideal case, corner models must be generated for each performance metric separately. Applying fixed variations for pi regardless of the sensitivities results in inaccuracies in the model. However, the proposed design methodology still applies as the simulation data is considered as ground truth.
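For reference, the standard first-order (linearized) relation between parameter variations and a performance variation is written out below. It is offered as an assumption about the form such a relation usually takes, with Δp_i denoting the variation in physical parameter p_i and Δy the resulting variation in the performance parameter y.

```latex
\Delta y \;\approx\; \sum_{i} \frac{\partial y}{\partial p_i}\,\Delta p_i
```

Under this approximation, the same fixed Δp_i produces a different Δy for each performance metric and each design point, because the sensitivities ∂y/∂p_i change with the component sizes and bias voltages.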
An additional alternative to account for the effects of variations on the performance of the circuit is to train classifiers that consider the worst cases of the dataset. The limitation of the approach is that the worst-case performance often occurs in different corner cases for different target circuit specifications. As an example, the worst-case power consumption occurs when the FF corner is considered at 120° C. However, the worst-case gain occurs when the SS corner is considered at 20° C. Designing for the worst cases, therefore, results in ambiguous outcomes. Comparatively, designing based on the TT corner while constraining all the performance variations across all corners within a certain standard deviation, as proposed herein, provides viable solutions.
1. A method for generating optimal sizing solutions for devices of an analog circuit that satisfy the design specifications on circuit performance parameters and robustness parameters, wherein at each iteration of determining the optimal sizing solution, prediction models are trained and optimization is executed on the prediction models, and the iteration stops when a qualified solution is found or a preset maximum number of iterations is reached, wherein an ensemble of classifier chain models is trained to predict each target circuit performance parameter based on device sizes by training on circuit data.
2. The method of embodiment 1, wherein a multi-objective genetic algorithm is executed on m ensembles of the classifier chain to simultaneously maximize a probability that each of m performance specifications is satisfied.
3. The method of embodiment 2, wherein the performance specifications of an analog circuit are generated with a SPICE solver that randomly generates combinations of transistor sizes; then binary labels are assigned by a classification algorithm that adaptively sets labeling thresholds.
4. The method of embodiment 3, wherein classification is performed while using the algorithm, wherein a threshold is specified on an ε percentile of data values of a performance parameter to resolve class imbalance in a sampled dataset; wherein if the ε percentile value exceeds a specification value of the performance parameter, the threshold is set to the specification value.
5. The method of embodiment 4, wherein binary labels are assigned with reference to the threshold.
6. The method of embodiment 2, wherein one ensemble model is comprised of a number of decision-tree classifiers and a final prediction of the ensemble is calculated as an average of the predictions of all the classifiers.
7. The method of embodiment 1, wherein to account for effects of circuit variations on circuit performance, standard deviations are calculated on evaluations of a performance parameter at all process, voltage, and temperature corners considered in an application of a set of transistor sizes.
8. A method for sizing analog circuit components using a simulation-based optimization framework using classifier chains that represent relationships among output parameters to improve framework accuracy, wherein when considering effects of design variations on circuit performance, simulations for each design point are acquired at each corner of interest and the standard deviations of the performance variations across all of the corners for each design point are then calculated, wherein design points with performance fluctuations that fall below a set threshold Tthre of the standard deviation are assigned with positive labels, while all other points are assigned negative labels.
2.5 Conclusions
In this section, a simulation-based optimization framework is proposed that determines the sizing of components of an analog circuit to meet target design specifications while also satisfying the robustness specifications set by the designer. The robustness is guaranteed by setting a limit on the standard deviations of the variations in the performance parameters of a circuit across all process and temperature corners of interest. Classifier chains are used that, in addition to modeling the relationship between inputs and outputs, learn the relationships amongst output labels. The proposed methodology is applied to the design of an LNA in a 65 nm fabrication process. The use of classifier chains and the ensemble of classifier chains provides an improvement in the prediction accuracy as compared to the binary relevance method. A qualified design solution is generated that satisfies both the performance and robustness specifications across all of the corners considered. The gain, noise figure, IP3, and power consumption of the design of the LNA are 10.76 dB, 2.78 dB, −2.65 dBm, and 16.8 mW, while the standard deviations across all considered corners are 0.74 dB, 0.45 dB, 0.47 dBm, and 4.9 mW, respectively.
While the invention has been described with reference to the embodiments above, a person of ordinary skill in the art would understand that various changes or modifications may be made thereto without departing from the scope of the claims.
This invention was made with government support under Contract No. CNS-1751032 awarded by the National Science Foundation. The government has certain rights in the invention.
Number | Date | Country
---|---|---
63320774 | Mar 2022 | US