The present invention relates to a computer-implemented method for predicting a residual service life of vehicle batteries of a fleet of electric vehicles.
In electric vehicles, the residual service life of vehicle batteries, also known as the state of health or the SoH value, constitutes an important parameter for their economic value and their capacity. Knowledge about the way in which the SoH value decreases over time is consequently helpful or necessary. Generally, the SoH value is not measured in the vehicle because the required sensor equipment would be too expensive. The SoH value is therefore estimated in the vehicle using a processor model. The estimation relates to the current SoH value. A prediction of the vehicle-individual state of health values is not available in current vehicle battery management systems. The driver or the vehicle fleet company (such as a car rental or car leasing company) is unable to calculate the future residual service life that remains for each individual vehicle. A prediction possibility is desirable so that the vehicle-individual residual service life can be predicted as early as possible and measures for extending the residual service life of the vehicle battery may possibly be triggered.
There is special interest in predicting the instant at which the residual service life of the vehicle battery will drop below a specific value, e.g., the value of 80% of the initial service life. In particular when many measured values are not yet available (incomplete parameter space), there is a lack of robust methods for a robust estimation and prediction of the state of health and for preventing an over-adaption of the processor model used in the vehicle for learning the current state of health with the aid of machine learning.
According to an example embodiment, the present invention provides a method in which:
In its device aspects, the present invention provides a device which is set up to predict a residual service life of vehicle batteries of a fleet of electric vehicles. According to an example embodiment of the present invention, the device includes means for measuring parameters of the vehicle batteries that arise during an operation of the electric vehicles, and mean s for transmitting the measured parameters to a server, the server being set up to
In addition, the present invention relates to a computer program product which includes instructions that upon a program execution by a computer, induce the computer to carry out the method/steps of the method of the present invention.
According to an example embodiment of the present invention, the method in particular relates to a method for a vehicle-individual prediction of the vehicle state of health which uses the Cloud connectivity of electric fleet vehicles. To allow for precise SoH calculations and predictions, the present invention makes it possible to make predictions regarding the residual service life (e.g., 80% residual capacity limit) already in an early phase of the service life of vehicle batteries when the actual parameter conditions of the fleet vehicles are not yet fully covered, even if only a limited number of training data for predictive machine learning of the residual service life are available. In addition, uncertainties can be evaluated by a quantified probability analysis which allows for a long-term prediction of the SoH values of the vehicle battery that manifests itself in an expected service life of each individual vehicle battery. The method according to the present invention furthermore is scalable to a high degree and steadily improves its accuracy the more data are available.
In one advantageous example embodiment of the present invention, operating parameters of the vehicle battery are modified if its residual service life undershoots a predefined value. For example, the maximally withdrawable power is restricted and/or the maximally permitted charge currents are restricted. It is also possible that the recharging is restricted to a limit value below the maximum capacity. The limit value is preferably dependent on the temperature. Preferably, the higher the temperature of the vehicle battery, the lower the limit value since high charges in conjunction with high temperatures accelerate the ageing of the vehicle battery.
In one preferred example embodiment of the present invention, parameters measured during the operation of the electric vehicles for a vehicle battery are combined to a feature vector characterizing the specific vehicle battery.
It is also preferred that the conditional probability is determined as a quotient whose denominator depends on a probability that the specific vehicle battery has a certain feature vector at the point in time lying in the past.
It is furthermore preferred that the denominator
In a further preferred example embodiment of the present invention, the quotient has a numerator which is a function of the joint probability that the vehicle battery having the specific feature vector has a residual service life that undershoots the predefined limit value at the point in time lying in the past.
It is also preferred that the joint probability is modeled by a Bayesian network, that is, a directed cyclic graph B = (v,ε), where v is a set of vertices representing the variables, and ε forming the set of edges that encode the dependencies between the variables.
It is furthermore preferred that the Bayesian network has a vertex without parents.
In another preferred example embodiment of the present invention, the vehicle batteries are labeled by a binary classifier, which has a first value, in particular the value zero, for vehicle batteries whose residual service life is greater than a threshold value, and which has a second value that differs from the first value, in particular the value 1, for vehicle batteries whose residual service life is less than a threshold value.
It is also preferred that a probability that the binary classifier has the second value at a point in time T that is later than a specific point in time t, is described by a survival function which is estimated by the conventional Kaplan-Meier estimator.
In addition, it is preferred that the probability of an event, that is, the probability that the binary classifier assumes the second value, is calculated by the cumulative death distribution function from the survival analysis.
In a further preferred example embodiment of the present invention, the structure of the Bayesian network is determined using the criterion of the minimum description length.
Additional advantages and example embodiments of the present invention result from the disclosure herein.
It is understood that the above-mentioned features and the features still to be described in the following text may be used not only in the individually indicated combination but also in other combinations or by themselves without departing from the framework of the present invention.
Exemplary embodiments of the present invention are shown in the figures and will be described in greater detail in the following description. Identical reference numerals in different figures describe the same elements or elements that are at least comparable in their function.
In detail,
Each electric vehicle also has calculation means (i.e., devices) 18.1, 18.2, ...,18.n, which calculate from the data supplied by sensor system 16.1, 16.2, ...,16.n an instantaneous SoH of the drive battery with the aid of machine learning using a calculation model. This calculation model is preferably independent of the prediction according to the present invention, less accurate and requires more measured values (i.e., a more complete parameter space).
In addition, each electric vehicle 10.1, 10.2, ...,10.n has a Cloud connectivity in the form of mobile radio communication means (i.e., devices) 20.1, 20.2, ...,20.n by which electric vehicle 10.1, 10.2, ...,10.n is able to exchange information with server 12 of vehicle fleet 10 and/or other components of a Cloud 22.
The present invention is realized by an interaction of the components of the distributed system illustrated in
With the aid of the parameters measured for one of vehicle batteries 14.1, 14.2, ...,14.n in each case, feature vectors are formed for respective vehicle battery 14.1, 14.2, ...,14.n. The measurements are undertaken at specific points in time. The variable to be determined is (initially) a remaining service life of a respective vehicle battery 14.1, 14.2, ...,14.n at an instantaneous point in time.
The variable to be determined is defined as the conditional probability that a certain event (such as reaching/undershooting a residual service life) occurs in a vehicle battery 14.1, 14.2, ...,14.n having a specific feature vector (x(t) at a specific point in time t.
Let it be assumed that data from n vehicle batteries 14.1, 14.2, ...,14.n, which are independent from one another and identical, are collected by m different data sensors of sensor system 16.1, 16.2, ...,16.n of vehicles 10.1, 10.2, ...,10.n that allow for a continual measurement in each case. Examples of such data are electrical voltages and temperatures of vehicle batteries 14.1, 14.2, ...,14.n, but the data are not restricted to these examples.
Despite continual measurements, only discretized measurements are considered for the evaluation. That means, the measurements received from an ith data source (e.g., a sensor of a vehicle battery) are transformed into a range Ric Q. The discretized data collected during the measurements are represented as feature vector x ɛ Ri× ... × Rm. The data are collected with discrete time stamps, which are denoted by t1 <t2...tK, that is, ti∈ ℤ for all 1 ≤i ≤ K. For all 1 ≤ i≤ n, Ti £ Z is defined as the event time, and C¡E{t1...,tK} as the censor time (that is, the given object is no longer monitored). It is assumed that Ci ≤ Ti applies. However, Ti >tK is also possible, which means that the event has occurred by the time of the last time stamp. The event status at instant tK is defined as δik = [Tj ≤tk], where [.] is denoted as the Iverson bracket, that is,[true] = 1 and [false ] = 0.
A set of indexes Ti<={1...,K} is introduced so that tk ≤ min(Ci,Ti) applies, and Xik are available for all k E Γi.In this context, Xik is the feature vector of the ith vehicle battery at time tk. The data collected for the ith vehicle battery are represented as Di= {(xik,δik)|k∈Ti}. The entire dataset is denoted byD = Di U ... U Dn .
A scenario is examined where data are available only for a few events at the instantaneous point in time tc = tK. The goal is the prediction of an event status at instant tf, where tf > tc and thus lies in the future. The event status for the vehicle battery i is denoted by yj(tc) ɛ {0.1} .
A binary classifier is generated by using Yi(tc) as a class label. If Yi(tc) = 1, then the event for vehicle battery i has occurred at instantaneous point in time tc. In contrast, if Yi(tc) = 0, then the event has not yet occurred at the instantaneous point in time tc.
The goal is to calculate the conditional probability
where xrepresents the feature vector for a given vehicle battery. The determination of this probability is represented by a second step 200 of the flow diagram. An event prediction based on the evaluation of probabilities is described in the paper “A Bayesian Perspective on Early Stage Event Prediction in Longitudinal Data”, IEEE Transactions on Knowledge and Data Engineering, 28 (12):3126 - 3139, December 2016.
In the denominator of the fraction, the probability represents that the vehicle battery has the specific feature vector x at the specific point in time t.
The numerator of the fraction represents the joint probability
that the specific event has already occurred in a vehicle battery that has the specific feature vector x at the specific instant t.
The quotient thus is the probability that an individual vehicle battery has already undershot a residual service life of 80% of its expected total service life at an instant t in the past.
To model the joint probability P(y(tc),x,t≦tc), step 200 includes the definition of a Bayesian network, that is, a directed cyclic graph B = (v, ɛ),in which v is the set of vertices that the variables represent, and ɛ forms the set of edges that encode the dependencies between the variables. For example, a Bayesian network is described in paper “Bayesian network classifiers”, Machine Learning, 29(2-3): 131- 161, November 1997.
For this purpose, a random variable for each sensor measurement xi with t1 ≤ i ≤ is initially examined for each feature vector x, and an additional variable that corresponds to the class label y(tc) is examined. To this end, the notation π(xi)for the set of the parents of the vertex belonging to x¡ is used. It is assumed that π(y(tc)= ø applies, that is, that no parents exist for the vertex belonging to y(tc).
The joint probability may then be factorized to
Therefore, the following is obtained: [..., sic]
The numerator P(x,t≤tc) is able to be estimated via the empirical distribution on the basis of the event frequencies, that is,
As an alternative, a parametric distribution may be assumed for p(x,t < tc) such as the normal distribution or the uniform distribution.
To calculate the probability for the vertex without parents, theory from the field of survival analysis is used, based on a survival analysis:
The present invention relates to a scenario in which only a limited set of data is available for estimating the a priori probability (prior probability) P(y(tc) = 1,t ≤ tc) of an event. Some of the available data are incomplete, that is, censored data are present.
For each time ti, all events are labeled either as event or as event-free. The survival function S(t) = P(T > t) is estimated to calculate the labeling. This function indicates the probability that the instant T of an event occurrence is later than an instant t indicated in the network.
The conventional Kaplan-Meier estimator is used for the estimation
where, di represents the number of events at instant ti, and ni is the number of objects that remain in the study at time ti. The probability of an event Fe(t) is calculated with the aid of the cumulative death distribution function
In addition, let it be assumed that Q(t) = P(C > t), which indicates the probability that time C of the censoring is later than a specific time t. The Kaplan-Meier estimator for Q(t) takes the form of
The censoring probability is calculated as
At point in time t, the event label is assigned to all instances if
In the other case, all instances are labeled as event-free.
The use of the labeling makes it possible to collect instances that are labeled as an event, and the experimental probability distribution
is able to be calculated.
However, a parametric distribution F(t) is used instead. A popular example is the conventional Weibull distribution having two parameters a and b, that is,
This parametric distribution is dependent on data.
To learn the structure, that is, the edge quantity of the Bayesian network, the criterion of the minimum description length
Is able to be used, where
is the number of free parameters in the network. The log-likelihood function may be defined as
where Type equation here,
If the empirical distribution P(.) is assumed, which is defined by the frequency of the events in the training set, that is
for each event
the log-likelihood function is able to be written as
which is maximized as
This criterion MDL(B|D) may be minimized by a local search algorithm (e.g., by the conventional hill climbing algorithm).
With the aid of the Bayesian network determined in this way, the numerator of the conditional probability
is able to be calculated.
The resulting knowledge of the conditional probability
allows for a prediction of the residual service life of vehicle batteries of the fleet, which occurs in step 300, as a function of the conditional probability, as will be described in the following text.
The value of the probability P(y(tc) = 1|x,t ≤ tc) that an event has occurred by the instantaneous point in time tc amounts to between 0 and 1 according to the definition.
Probability P(y(tc) = 0|x,t ≤ tc) complementary thereto is able to be calculated on the basis of general characteristics of the probability as
Therefore, if the probability that an event has occurred by the instantaneous point in time tc can be calculated, then it is also possible to calculate the probability that the event has not occurred by instantaneous point in time tc. In the present case, the latter probability is of interest, on the one hand. On the other hand, the training data for the complementary probability P(y(tc) = 0|x,t≤ tc) are available. For that reason, this probability is calculated at the outset, for instance. The calculated value may then be used to calculate the probability actually of interest in that tc is replaced by tf.
Number | Date | Country | Kind |
---|---|---|---|
10 2020 209 339.3 | Jul 2020 | DE | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/070019 | 7/16/2021 | WO |