Not applicable
Not applicable
This application relates to a hybrid fuel cell system with an improved power control scheme based on transient recognition control.
Fuel cells have attracted much attention as an efficient, scalable, low-pollution means of generating electrical power. A short list of potential fuel cell applications includes distributed power generation, auxiliary and primary generation in transportation systems, consumer electronics and backup generation. Load transients in many fuel cell applications involve significant peaks in power relative to the steady-state demand.
The effects of load transients can be reduced by combining fuel cells with energy storage devices such as capacitors or batteries to form a hybrid system [1], [2], [3], [4], [5], [6], [7], [8]. Hybrid fuel cell systems have been proposed for improved transient response in several scenarios. For example, Proton Exchange Membrane (PEM) fuel cells are considered in combination with lead acid batteries, Li-ion batteries, and capacitors as energy storage elements for portable military electronics and communication applications. See e.g., [1], [4], [6], [7], [8]. Hybridsystems are more efficient in working with impulse type loads, more reliable in general, and offer smaller size and weight than the PEM fuel cells working alone. A fuel cell system coupled with a superconducting magnetic energy storage system (SMES) is proposed in [5] for a distributed generation system. SMES enables the hybrid system to compensate for transients in the utility system caused by the sudden change of real and reactive power load requirements. The authors of [2] investigate the hybridization of a direct methanol fuel cell (DMFC) with an all-solid-state super-capacitor, showing that the cell power is improved by over 30% compared with that of the DMFC alone at 25° C. The authors in [9], [10], [11], [12], [13] consider combining fuel cells with batteries or super-capacitors for electric vehicle applications. Hybrid systems are preferred in electric vehicles because they provide flexibility and efficiency in operating the fuel cells [11].
Simple hybrid fuel cell systems connect the energy storage device in parallel with the fuel cell. During a transient, the portion of current delivered from the fuel cell is determined implicitly by the impedances of the fuel cell and the storage device. See e.g., [3]. Some hybrid systems control the fuel cell to output approximately constant power. In these applications, the storage device handles all of the transient energy, and the fuel cell operates like a battery charger. See e.g., [1], [4], [5], [6].
The aforementioned means of minimizing the effects of load transients are problematic, in part because they do not effectively anticipate the future behavior of the transient. There is a need for improved transient performance of a hybrid fuel cell system that can determine the future behavior of the transient. In particular, there is a need for a fuel cell, or other critical source, that can be controlled so that it does not respond to the large initial current typical of a load transient. Citation of any reference in this section of the application is not to be construed as an admission that such reference is prior art to the present application.
The present invention relates to a method for controlling a power generation system comprising using a transient recognition control module to estimate the steady state information of load transients. The transient recognition control module can be based upon a cluster weighted modeling algorithm. The power generation system can be a hybrid fuel cell system. The cluster weighted modeling algorithm can be formulated recursively and provide useful feed-forward information in real time.
The present invention also relates to a method for controlling a power generation system comprising: a) providing a power electronics source which is electrically connected to a power generation source, an energy storage device, a transient recognition control device, and a DC bus; b) providing a current sensor that is electrically connected to a transient recognition control device; c) estimating the steady state load transient information in the transient recognition control device using a computer weighted modeling algorithm; d) adjusting the use of energy from the storage device and the power generation source based upon an input command signal from the transient recognition control system to the power electronics; and
d) regulating the voltage on the DC bus based upon a signal from the power electronics.
For a more complete understanding of the present invention and for further advantages thereof, reference is made to the following drawings:
The present invention is directed to a novel power control scheme that uses feed-forward information about load transient behavior to manage the flow of energy between components of a hybrid fuel cell system. This control scheme may allow designers to minimize energy storage requirements, improve the system reliability, reduce internal loss, and extend the lifetime of fuel cell systems. The methods of the invention may also be applicable to hybrid systems with critical sources other than fuel cells.
The present invention relates to a more flexible system motivated by the near real-time transient recognition capabilities of the Nonintrusive Load Monitor (NILM) in [23], [14]. The NILM is capable of desegregating and recognizing individual loads from a current waveform using a library of transient signatures. The hybrid control problem of this invention differs from NILM in that the identification of the long-range transient behavior happens quickly, on the time-scale of the transient. To address the unique constraint of the hybrid system control the present invention uses a modified Cluster-Weighted Modeling (CWM) scheme. CWM, introduced in [16], [17], [18] is a novel method that can establish complex functional relationships between input patterns and associated outputs. However, like NILM and most other pattern recognition schemes, CWM nominally requires the arrival of a full pattern. The latency associated with this requirement is unacceptable in the hybrid control scenario. The present invention provides a solution to this problem in the form of a recursive modification of the cluster weighted modeling scheme. See [19]. The present invention is directed to the performance of a prototype hybrid system comprises a fuel cell and battery using commands from a simulated CWM-based transient recognition control.
This invention proposes a novel power control scheme using transient recognition control (TRC). This is developed in the context of a fuel cell system, but applies as well to similar systems with transient-sensitive sources. The following gives an overview of the recursive CWM method developed in [19]. The examples provide a few simulation results demonstrating the capabilities of recursive CWM in this application and experimental results showing the behavior of the power electronics and loads under transient recognition control. A hybrid system with transient recognition control is compared to simple connection of power electronics to a fuel cell.
cell could respond to the transient given the estimate of the long-term transient behavior {tilde over (l)}ss. Given a class of transients, a conventional control could be designed to minimize the overshoot, fuel cell thermal excursions, or any other criterion of interest. However, that conventional control would be a compromise solution. A control that recognizes the “fingerprint” of an incoming load can provide a response that is optimal for that load.
The difficult part of the transient recognition control scheme is to ensure that the delay in estimating a useful {tilde over (l)}ss is as short as possible. We address this problem by adapting a recursive modification of cluster-weighted modeling to the hybrid control problem. More details of recursive cluster weighted modeling appear in [19].
In this section, the CWM algorithm is first briefly reviewed based on the presentation and notation in [17] where the complete derivations can be found. Then, the implementation of RLTR by CWM is detailed, including a recursive modification to the conventional CWM prediction procedure to improve the real time performance of RLTR, a transient data scaling scheme to improve data storing effectiveness, and transient pattern design issue under CWM scheme. A list of explanations for the notations used in this section is summarized in
Cluster-Weighted Modeling (CWM) [16], [17], [18] is an iterative scheme of finding the probability density distributions and building up a functional mapping over a sample set {yn,xn}N, n=1, where N is the total number of samples. CWM is useful for pattern recognition, and is appealing for stochastic time series analysis and synthesis [18], [20], [21], and non-linear modeling [22]. A cluster in the CWM scheme is described by three priors, {P(cm),p(xn|cm),p(yn|xn, cm)}. P(cm) is the weight of cluster cm, reflecting the relative importance of one cluster on the overall modeling problem. p(xn|cm) is the local influence of one cluster on the input pattern. p(yn|xn, cm) reflects the local statistical relationship between the output and the input on one cluster. In the context of load transient recognition problem, xn can be configured as a load transient pattern, and yn is the load steady state value associated with that transient. cm represents one cluster for load transient modeling. One class of load transients which are similar up to a scale factor in magnitude can be modeled by one or more clusters.
The density distribution models for the priors are assumed to be Gaussian, i.e.,
wherein D is the dimension of transient pattern xn. There is not a requirement that the actual forms of p(xn|cm) and p(yn|xn, cm) are Gaussian. Equations (1) and (2) can be generic basis functions in a generalized version of CWM. However, the Gaussian has an intuitive probabilistic interpretation and good localization properties. Arbitrary distributions can be approximated by overlapping a set of Gaussian equations.
In (1), the covariance matrix [covij]D×D is approximated by the diagonal variance matrix [σ2m,d]D×D. Although using the full covariance matrix would let the CWM be more flexible, it is not preferred if the pattern dimension is large. For a covariance matrix, the storage requirement and computational cost in matrix inversion grows quadratically with matrix dimension, while for a diagonal matrix, cost is linearly related to dimension. Furthermore, it is easier to prevent singular matrix inversion problems in the variance matrix than in the covariance matrix.
In (2), an “expectation function” ƒ(xn, βm) is used in the Gaussian distribution, instead of a constant expectation value. Accordingly, a local model ƒ(xn, βm) with parameter vector βm is defined for each cluster that reflects the local functional relationship between the input patterns and the output observations. βm is naturally determined as a part of the estimate of p(yn|xn, cm). The global mapping is a “cluster weighted” combination of the local mappings for each cluster. This is apparent in output prediction step,
In (3) M is the total number of clusters, p(xn|cm) is the likelihood of xn to cluster cm, and the term (xn|cm)P(cm) is a measure of importance that one local model has on a given input. For example, a local model formulated for the cluster of bulb transients will probably have little influence on an input pattern of lathe transient because p(clathe|cbulb) is small. The denominator in (3) is a normalization factor for the weighted combination. Suggested by (3), it is apparent that (2) not only is a basis function, but also provides a mechanism for recovering the functional relationship between the input patterns and the output observations. The local model could be assumed to be as simple as the linear type,
if enough number of clusters are defined, so that the linear model has sufficient local accuracy. However, note that the global mapping expressed by (3) is highly non-linear with respect to the input. CWM has valuable modeling capability suggested by its non-linear essential.
In the load transient recognition problem, the CWM can be interpreted as that of a set of filters f(*,βm) used for transient processing. Each filter is specially designed to give the perfect filtering for one class of transient signals. During the recognition process, which filter will be chosen is determined by the likelihood of the input transient to the transient class associated with this filter. The CWM scheme matches perfectly to the filter bank idea mentioned in the section entitled Novel Power Control Scheme for Hybrid Fuel Cell Systems for building up TRC.
There are two coexisting processes in the CWM scheme. The prediction process uses equation (3) to give the CWM output. The training process is performed to find the parameters of the priors. In a given iteration of training, the posterior p(yn|xn, cm) is evaluated based on the priors {P(Cm), p(xn|cm), p(yn|xn, cm)} estimated at the last step. The priors are then updated using the new posterior information. The estimates of the priors maximize the joint log likelihood ΣNn=1 log [p(yn,xn)]. Reference [17] provides a detailed presentation of the CWM training process.
Equation (3) is the foundation of implementing the TRC for load transient steady state demand estimation, where <y|xn> equals iss in a hybrid fuel cell system shown in
where xn(1:K) is the partial transient received by the step K, and p(xn(1:K)|cm) is the partial likelihood evaluated at step K(K≦D). The first item at the right hand side of equation (5) has a form similar to equation (3). This item is the conventional CWM prediction at step K, based on the available transient information xn(1:K). The second item in the right hand side of equation (5) is the cluster weighted correction to the model prediction at step K, based on available prior information μm(K+1:D) learned through the training process. Equation (5) is rewritten in equation (6). The vector [x(1+K)Tμm(K+1:D)T]T is a recovery of complete vector xn(1:D) at step K by cluster cm, where μm(K+1:D) is the most likely prediction by each cluster of the un-received part of the transient. This recursive recognition is summarized as: Recover the complete transient data xn(1:D) by producing the un-received part xn(K+1:D) with available prior information by each cluster, then use the recovered complete transient vector and the partial likelihood p(xn(1:K)|cm) evaluated at the current step to compute the CWM model prediction as defined in equation (6).
Transient data scaling is an important issue for TCR implementation. A good scaling strategy will reduce the amount of the information needed to be stored. It is generally not necessary to record the transients of all possible loads in a target system because different loads in the same class normally have transients that are similar up to a scale factor in magnitude. Similar transients can be matched to one normalized transient exemplar. For example, NILM determines the scale factor between the transient and the exemplar as part of the event classifying problem. See [14]. One way of scaling transients is to normalize the data by the associated steady state values. After normalization, the transients will have a steady state load current of one. The load transients in the same class would be expected to merge into one cluster. In the training process, the normalized transient samples are used to find the maximum likelihood prior parameters for the CWM model. In the following, we mark the prior parameters with superscript “tilde” to note that they are obtained through the training with the normalized samples, for example, μm, σ2m,i.
In the production process, because the steady state information of the income transients is not available until the transient periods finish, a suitable scale factor should be determined recursively by the following two-step procedure instead.
Step 1: At step K, determine a maximum likelihood scale factor amML between the transient data xn(1:K) received and each cluster cm by solving the constrained optimal problem as
maxa
subject to: amε(ammin ammax),
where K=1, 2, . . . , D; m=1, 2, . . . , M, and
{right arrow over ({tilde over (x)}
n
(1:K)
=a
m·({right arrow over (x)}n(1:K)−b), (8)
is a scaled estimate of xn(1:K) for cluster cm, and b is the offset which equals the steady state value on the load bus before the transient occurs. The limits ammin and ammax determine the reasonable scaling range for the transients in cluster cm. A detailed derivation is contained in reference [19].
Step 2: Compare the likelihoods p(xn(1:K)|cm, amML) for different clusters to find which cluster has the maximum likelihood and use the corresponding scale factor for xn(1:K), i.e.,
aML=aMML, (9)
M=arg{a
m
ML:maxm[p({right arrow over ({tilde over (x)}n(1:K)|cm,amML)]}. (10)
The same result for amML can be derived by solving the following weighted least squares problem,
As suggested by equation (11), the best scaling factor is determined not only by minimizing the Euclidian distance between the cluster centers and the scaled transient data, but also by considering the scale ranges of the clusters.
The offset b can be evaluated by a sliding window average. This offset reflects the steady state status on the load bus and can replace the iss in
Load transients can be characterized by narrow segments with relatively high derivative or mean value variation information, called v-sections (see e.g.,
Using v-sections instead of the entire transient to do the load transient recognition problem is preferred because the v-sections are less likely to overlap in a transient stream than an entire transient. See e.g., [15]. NILM incorporates two steps in recognizing the load transient. See e.g., [23]. The first step is to match the stored v-sections to the incoming data, and the second step is to “connect” the matched v-sections to interpret potential load transient events.
It is difficult to “connect” v-sections in real time in order to match with the real time operation of RLTR. However, a tradeoff can be made. The individual v-section can be viewed as an independent transient exemplar because it is only necessary to predict the steady state value change following each v-section. For example, in
Another important issue in transient exemplar design is to determine the length of the v-sections. Because of the errors from scaling and measurement, clusters for short v-sections could match well to the former part of some long v-sections. To prevent the false matching between the short and long v-sections, short v-sections can be extended with a segment of dummy steady state data. On the other hand, in order to save computational cost and storage space, long v-sections can be truncated by some extent, because from experience it is not necessary to investigate the complete data set of a long v-section in making good recognition. There are tradeoffs between how many points are patched to the short v-sections and how many points are truncated from long v-sections. The longer the exemplar v-section designed, the more accurate the final recognition result. On the other side, the shorter the v-section designed, the less probability of overlapping occurs during recognition process, and the less computational cost and storage space required.
Based upon the work for TRC implementation, an operation flowchart of the TRC is outlined below:
Step 1: Doing pre-processing for the input transient signal x(1:K) to eliminate noise;
Step 2: Determine the maximum likelihood scale factor aML by the two-step procedure described in the Transient Data Scaling section, and compute the instant scaled transient data as
x
(1:K)
=a
MLx(x(1:K)−b); (14)
Step 3: Compute the partial likelihood p(x(1:K)|cm, aML) for each cluster;
Step 4: Recover the scaled complete transient vector by maximum likelihood prediction for each cluster,
x
(1:K)
=[x
(1:K)μm(K+1:D)T]T (15)
Step 5: Calculate <{tilde over (y)}|x(1:K)> according to equation (6);
Step 6: Recover the un-scaled output,
<{tilde over (y)}|x(1:K)>=1/aML·<{tilde over (y)}|x(1:K)>+b (16)
The invention is further defined by reference to the following examples. The examples are representative, and should not be construed to limit the scope of the invention.
In the examples, the effectiveness of the TRC was first verified using the true load transient data for training and testing. The hybrid fuel cell system was then built up in hardware for verifying the power control scheme illustrated in section II.
Several types of true load transient data were gathered from DC/AC inverter and AC loads systems. The recorded data stream was first processed for eliminating the coupled noise from the instruments. Then informative v-sections of load transients were extracted, sorted to different transient classes, and synchronized to each other within the same class. Each class of transients is split into two parts, the training subset and the testing subset. The transients in training sub-set were first shifted so that the starting offset was approximately zero. The data was then normalized by the corresponding steady state values. The instrumentation for transient setup comprises a DC power source, a DC/AC inverter, a voltage/current sensor board installed on the DC side of the inverter, a USB real time data gathering card, and a host computer for data recording. The specifications of the instruments are summarized in Table II. Data was collected at a sampling rate of 1 kHz for the purpose of limiting the amount of the data needs to be processed. DC bus voltage was fixed to 12 volts. Five types of transients were chosen, including a lathe, bulb, computer monitor, drill, and vacuum cleaner. The monitor transient includes 2 separate v-sections which were treated as two individual transients. A total of 265 transients were recorded. The numbers of the recorded transients for the training and testing subsets in different transient classes are summarized in Table III.
In our implementation, it was necessary to filter transient data before use due to the inverter switching noise being coupled into the transient signals. The switching noise was highly harmonic, about 120 Hz, and can be effectively filtered using a low-pass filter with reasonable length. In the experiment, a moving average filter was used with the first notch of it being intently set to 120 Hz. The delay introduced by the moving average filter can be less than other kinds of advanced low-pass filters. In the experiment, the delay was not significant compared to the length of the transients gathered. For example, the length of the transients collected is about 250 to 2500 sample points given a 1 kHz sampling rate, and the filter length was 17. It may be desirable to use a passive power-level filter to attenuate switching noise.
The training was performed for finding a suitable set of cluster parameters, such as cluster mean vectors, variances, and local model parameters. More than one cluster may be used to model a class of transients if the class is highly variable. Cluster probability P(cm) is initialized as 1/M. Local model's parameter βm is initialized with a small random number. The initial positions of clusters should not be initialized randomly, because this will cause an unreasonably long time for the clusters to converge to the expected positions. See e.g., [18]. Therefore random initialization may not guarantee the clusters to converge perfectly according to the variations of each transient class. A practical way of initializing the cluster centers is to predefine the number of clusters needed for each transient class modeling. This is mainly determined by the variability of the transient class. Then the set of clusters used to model one class are initialized to the mean position of that class, plus small random perturbations. This implies that the transients in one class initially have equal likelihoods, because no prior information is available before training. Adding small random perturbations to the initial cluster centers effectively dispatches clusters to span the transient variability. Variances of p(xn|cm) and P(yn|xn, cm) are initialized to be a constant 100. During training, it is also necessary to add small constants to the variances, preventing them from shrinking to zero. See e.g., [17].
Before training, the gathered transients were downsampled by 10 to further save the computational cost. The transient length was set to D=50 after down-sampling. 27 clusters were used to model six classes of transients. (Monitor transients have two v-sections.) The allocation of clusters for different transient classes is summarized in Table IV. 5000 iterations were performed for training.
After training, the cluster probability P(cm) was checked to ensure that no clusters had infinitesimal probability. In this was the case, the number of clusters and the clusters allocation for transient classes were adjusted, and the training was re-performed. The converged clusters' positions were also checked relative to the position for each transient class. If any cluster converged far from the transients, the raining was also re-performed. The criterion for training is to maximize the joint log-likelihood.
V(h)=ΣNn=1 log p(yn,xn)
h=1, 2, . . . , 5000, is iteration number.
The convergence curve of V(h) is shown in
A separate set of transient data, not issued for training, was used for testing. The testing procedure is essentially the same as the procedure for TRC working in real time applications, which is outlined in the TRC Operation Flowchart.
Recursive recognition results are shown in
a
m
typical
/a
m
min
≦R,a
m
max
/a
m
typical
≦R
where R=4 for this experiment. amtypical for a cluster is determined interactively after training.
Two fuel cell hardware systems were built up and tested in the experiments. One is the hybrid fuel cell system with the proposed novel power control scheme illustrated in section II, making use of the transient steady state information supplied by TRC. The other is a simple fuel cell power supply system without using the transient recognition information and fast sources. The purpose of the simple fuel cell system is to show a “worst” case of the load of transient response by fuel cell, and using it as a reference for measuring the quality improvement of the load transient response of the hybrid fuel cell system.
The composition of the two fuel cell power systems are illustrated in the black diagrams in
Two load transients were used in experiments for testing the current responses of the fuel cell systems: one is a bulb transient, the other is a lathe transient.
Through the comparison of the load transient responses of the two fuel cell systems, the quality of the load transient response of the hybrid fuel cell system is apparently improved by applying the TRC based power control scheme. In the hybrid fuel cells system, the possible overshoot caused by the load transient requirement was prevented from the fuel cell output current response. However, in the simple fuel cell system, the output current of the fuel cell passively followed the load transient requirements, and the overshoots in the fuel cell transient response were still significant, although the transient had been smoothed by some extension of the fuel cell slow dynamics. It must be noted that only adjusting the dynamic of converter and inverter, instead of making use of the predicted load transient steady state information, can not help eliminate all possible load transient shock on fuel cell output.
The experimental results on TRC testing suggest that it is not necessary to use the physical length of a transient as a complete pattern. An effective fraction of a transient pattern can be determined empirically. For example, the effective length of a pattern in
Various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. The foregoing disclosure includes all the information deemed essential to enable those skilled in the art to practice the claimed invention.
A number of references have been cited, the entire disclosure of which are incorporated by reference.
This application claims priority to U.S. PCT Application No. 60/569,644, filed May 11, 2004, the contents of which are incorporated herein by reference in their entirety.
This invention was partially made with government support under United States Department of Energy Grant No. R02413060 through Pacific Northwest National Laboratory Grant #3917. The U.S. government has certain rights in this invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2005/016170 | 5/11/2005 | WO | 00 | 7/23/2008 |
Number | Date | Country | |
---|---|---|---|
60569644 | May 2004 | US |