The present invention relates to collecting data stored in a storage device in an electronic control unit (ECU) of each vehicle during driving of the vehicle, and generating normal value data of various driving parameters as a reference of a failure diagnosis.
There is known a diagnosis unit as a tool for diagnosing a failure of a vehicle such as automobile. In recent years, there is considered that a so-called expert system for registering information on failures occurring in the past in database and estimating a cause of a newly occurring failure is used for a diagnosis machine. The expert system includes a rule base estimation system in which human experience knowledge is registered in database for retrieval and a model base estimation system in which behaviors of the system during normal time or failure are learned for retrieving a failure cause via simulation.
In the former system, a mechanism is simple and a relatively high reliability can be expected but the knowledge is difficult to rule and a rule needs to be added or modified each time the target system is changed.
On the other hand, in the latter system, even if an operating staff does not have sufficient experience or knowledge, a failure site can be estimated and a certain degree of versatility can be expected in terms of the change in the target system. Particularly, a method for comparing data during normal time with data during failure under the same driving environmental condition is one of the most effective means for finding a cause of a failure.
However, since various environmental conditions or travel patterns are present for the driving of a vehicle and a range of normal values, which a large number of respective driving parameters can take depending on a respective situation, can be independently changed, the data during normal time needs to be collected under various driving situations in order to obtain normal value data for comparing items of data under similar conditions and to make an accurate decision.
For the purpose, a dedicated test vehicle provided with a driving data accumulation unit is utilized for conducting a large amount of travel tests, and data collection and accumulation needs a lot of labor.
Japanese Patent Application Laid-Open No. 62-261938 Publication (Patent Literature 1) describes therein a diagnosis unit comprising a knowledge data storing means for storing accurate information on a relationship between a symptom of a failure and a cause thereof, and a rare case storing means for storing inaccurate information.
Japanese Patent Application Laid-Open No. 6-95881 Publication (Patent Literature 2) describes therein analyzing design data or past failure data and inputting it as EMEA in a positive estimation system in relational database, creating a modified EMEA and then an event sequence diagram, creating a failure retrieval three for reference to a rule base, and digitizing an expert know-how to create a rule base.
Patent Literature 1: Japanese Patent Application Laid-Open No. 62-261938 Publication
Patent Literature 2: Japanese Patent Application Laid-Open No. 6-95881 Publication
If much driving data can be obtained from ordinary vehicles driving in a city or other places on a daily basis instead of data collection through the driving of a test vehicle provided with a driving data accumulation unit, the data collection through the driving of the test vehicle is not required.
However, the driving data accumulation performed in ordinary vehicles is limited to data during the occurrence of a failure which is recoded as diagnosis information in an ECU when the failure occurs.
The data recorded and accumulated as the data during the occurrence of a failure in the ECU is a collection of various driving parameters, where part of the parameters relating to the failure have abnormal values and most of the parameters have normal values.
Thus, it is an object of the present invention to provide an apparatus for utilizing travel data such as a time when a failure occurs, which is accumulated in an ordinary vehicle travelling in a city or other places in a daily basis, to generate data during normal driving.
In order to solve the above problem, the present invention provides a reference value generating apparatus for failure diagnosis that compares time sequence electronic control unit (ECU) data with a reference value for a plurality of driving parameters during the occurrence of a failure which is stored in a storing device of the electronic control unit (ECU) of a vehicle when the failure occurs in the vehicle, and that makes a failure diagnosis of the vehicle. The reference value generating apparatus comprises an accumulation unit that sequentially accumulates and stores the time sequence ECU data obtained from a large number of vehicles, and means for generating numerical vectors of the time sequence ECU data accumulated in the accumulation unit.
The numerical vectors are clustered and classified into a plurality of clusters according to the characteristics, a range of high occurrence rate values is obtained for the parameter values of the respective driving parameters in each of the plurality of clusters, the range of high occurrence rate values, which is obtained by the occurrence rate calculating means, is then stored as a range of the normal values of the driving parameters, and the range of the normal values is set as the reference value for a failure diagnosis.
According to one embodiment, the means for generating a numerical vector divides the time sequence ECU data into a plurality of time zones, and generates the numerical vector for each time zone. The numerical vectors for the time zones are clustered.
An embodiment of the present invention will be described below with reference to the drawings.
The F system network sends control system data such as vehicle speed data 11A, engine water temperature data 11B and engine revolution speed data 11C, and a failure code 11E indicative of a failure detected by the ECU to the data collection unit 14. The B system network sends data from a body's electric component ECU such as data 11F indicative of the state of an accessory (ACC) such as air conditioner and a failure code 11G indicative of a failure detected by the ECU to the data collection unit 14.
The data collection unit 14 itself is a ECU. The data collection unit 14 is controlled by a controller 14H whose main element is a microprocessor. A receiving part 14A sequentially receives the latest data indicative of a state of the vehicle from the vehicle-mounted network 11, and the controller 14H sequentially and temporarily stores the data in a vehicle state data memory 14B. The vehicle state data memory 14B is a random access memory and stores therein the latest data having a predetermined time interval of 20 seconds, for example, which is sequentially rewritten with new data every 0.2 second in first-in first-out (FIFO) shift register system.
A nonvolatile memory 14D is configured of a backup memory for maintaining the memory that a hold current is supplied by a buttery also during power-off or a rewritable ROM such as EEPROM. When receiving a failure code (DTC: Diagnosis Trouble Code), the controller 14H reads data for 15 seconds before the failure code occurs from the vehicle state data memory 14B and stores it in the nonvolatile memory 14D. The data is called onboard snap shot (OBS). The processing is executed each time a failure code occurs, and the nonvolatile memory 14D stores a plurality of OBSs corresponding to a plurality of failure codes.
When the vehicle is brought to a service shop for repair, a service staff connects a connection terminal of a failure diagnosis machine 16 to output terminals of the ECUs, reads the data stored in the nonvolatile memory 14D into the diagnosis machine, and utilizes the diagnosis machine to make a failure diagnosis.
On the other hand, the read data stored in the nonvolatile memory 14D is accumulated in a data accumulation unit 20 as travel data to which the present invention is to be applied. A reference data creation unit 30 is provided for the data accumulation unit 20.
The data stored in the nonvolatile memory 14D may be sent from a vehicle-mounted communication unit to the data accumulation unit 20 until the vehicle is brought to a service shop.
Table 1 shows an example of the OBSs stored in the nonvolatile memory 14D in response to a failure code. In the example, the driving parameters are denoted as R (engine revolution speed), V (vehicle speed) and T (temperature of engine cooling water), but the OBSs contain data for a large number of (50 to 60 depending on a vehicle type, for example) parameters such as an output value of an air-fuel ratio sensor, an output value of a 02 sensor, a fuel ignition time and an air-fuel ratio.
Time is indicated using a minus sign with a time 0.2 second before assuming that the occurrence time of the failure code is 0 second as reference.
How to extract data reference values (normal values) from the OBSs will be described herein. Even the OBS data stored when a failure code occurs has a normal parameter value non relating to the failure. Even the parameter value relating to the failure is considered partially normal during 15 seconds before the failure occurs. Thus, when the OBS data for many failures is collected for statistical processing, an abnormal value is made inconspicuous among many normal values or is excluded, and consequently the parameter values during normal driving can be extracted.
For that purpose, the more the samples is the better. In the example, the number of vehicles as samples is assumed as 13000 and the OBS data is obtained as described above from ordinary users' vehicles actually travelling in a city.
The range of normal values to be extracted is considered existing per “certain state (certain condition)”. Thus, when the items of approximate OBS data are collected as cluster (group) and a range of appropriate values is extracted for each cluster, the range of normal values can be extracted for each “certain state.”
Specifically, the clustering method is used to classify the items of approximate OBS data as a cluster (group). A range of appropriate values of the parameters is extracted for each cluster and is set as a reference value to be referred during the failure diagnosis.
The clustering is a data analysis method for grouping the data without an external reference, and a method called K-means method is employed in the example. The data is plotted in a space having a dimension corresponding to the number of parameters, and the data is classified depending on its distance.
The reference data creation unit 30 collects the OBSs (one per vehicle) from 13000 vehicles, for example, for one vehicle type.
Next, the collected OBS data is clustered into “20” by the characteristic amounts of three parameters such as “engine revolution speed: R”, “vehicle speed: V” and “cooling water temperature: T” (primary clustering).
Further, the “20” clusters extracted in the primary clustering are further clustered into “30” by all the parameters to extract “600” clusters, that is, driving conditions (secondary clustering).
The primary clustering is roughly performed by the three parameters such as “engine revolution speed: R”, “vehicle speed: V” and “cooling water temperature: T” on which the primary clustering particularly depends among the driving parameters relating to the occurrence of a failure, and the characteristic amounts of the three driving parameters are first extracted from the respective OBSs.
An example of reference value generation will be described with reference to
The inclination described herein is the important characteristic amount which may be a judgment index indicative of how much the vehicle is accelerating, decelerating or cruise driving (constant speed driving), and is the value obtained by differentiating the amount of change in the parameter value f(x) relative to an elapsed time (h) to be expressed by the following formula. The inclination can be obtained by the operation of the differential of the parameter value f(x) in the digital operation.
With reference to
In the example, the respective numerical vectors in the five time zones for the three parameters (Rn, Vn, Tn) such as “engine revolution speed: R”, “vehicle speed: V” and “cooling water temperature: T” are classified into, for example, 20 primary clusters Dj by the primary clustering (33). For the initial condition, a main core vector of the 20 primary clusters is randomly defined. The initial value of the core vector can be defined according to an empirical rule from a limited number of items of experimental data. When the main core vector of the j-th cluster is indicated as (Rav-cj, Rmx-cj, Rmn-cj, Rin-cj, Vav-cj, Vmx-cj, Vmn-cj, Vin-cj, Tav-cj, Tmx-cj, Tmn-cj, Tin-cj) assuming that j is an integer from 1 to 20, a Manhattan distance D1nj between the numerical vector in the first time zone in the above table and the respective core vectors of the 20 clusters Dj is calculated in the following formula.
D1nj=|Rnav-1-Rav-cj|+|Rnmx-1-Rmx-cj|+|Rnnm-1-Rmn-cj|+|Rnin-1-Rin-cj|+|Vnav-1-Vav-cj|+|Vnmx-1-Vmx-cj|+|Vnmn-1-Vmn-cj|+|Vnin-1-Vin-cj|+|Tnav-1-Tav-cj|+|Tnmx-1-Tmx-cj|+|Tnmn-1−Tnmn-cj|+|Tnin-1-Tnin-cj| (1)
The operation of the formula (1) with j=1, 2, 3, . . . 20 is performed to find the Manhattan distance for one numerical vector (Rn, Vn, Tn) in the first time zone to cause the numerical vector (Rn, Vn, Tn) to belong to the cluster Dj of the core vector having the smallest value. The similar operation is performed on n=1, 2, 3, . . . , 13000 to cause the respective numerical vectors (Rn, Vn, Tn) to belong to the cluster Dj having the smallest Manhattan distance relative to the core vector. In this way, the numerical vector in the first time zone of the OBS data obtained from n vehicles are classified into 20 clusters Dj.
The Manhattan distance D2nj between the numerical vector in the second time zone and the cluster D2j in the second time zone in the above table is similarly calculated in the following formula also for the second time zone.
D2nj=|Rnav-2-Rav-cj|+|Rnmx-2-Rmx-cj|+|Rnmn-2-Rmn-cj|+|Rin-2-Rin-cj|+|Vnav-2-Vav-cj|+|Vnmx-2-Vmx-cj|+|Vnmn-2-Vmn-cj|+|Vnin-2-Vin-cj|+|Tnav-2-Tav-cj|+|Tnmx-2-Tmx-cj|+|Tnmn-2-Tmn-cj|+|Tnin-2-Tin-cj| (2)
The numerical vector in the second time zone of the OBS data obtained from n vehicles is classified into 20 primary clusters Dj based on the Manhattan distance. Similarly, the numerical vector in the third time zone is classified into 20 primary clusters Dj, and the numerical vector in the fourth time zone and the numerical vector in the fifth time zone are classified into 20 primary clusters Dj, respectively.
Then, an average value of the numerical vectors which belongs to each of the 20 primary clusters Dj is calculated and the average value is set as a core vector of each cluster. The thus-updated core vector is used to perform the above clustering again. When the core vector of each cluster is updated in the second clustering, the updated core vector is used to further perform the third clustering. The repetition is performed until the core vector converges or until a preset number of times of trials ends.
Since the way of the convergence is different depending on the core vector at the initial setting, for example, ten different initial settings are randomly prepared to repeat the clustering using a respective initial setting, and consequently the clustering result whose trial is converged the best can be employed. In this manner, the final core vectors (centers of gravity) can be obtained for the 20 primary clusters Dj, respectively.
The final core vectors are used to perform the clustering on the 20 primary clusters, thereby completing the primary clustering for 65000 items of data (13000 vehicles×5 items/vehicle). Since the most important driving parameters are used to perform the primary clustering, thereby grouping the numerical vectors having a high degree of approximation, all the items of data are roughly grouped depending on a difference in a driving state having a high importance. Thus, the respective groups, that is, clusters are formed corresponding to a driving state having a relatively large difference.
Then, the processing proceeds to the secondary clustering (35). In the secondary clustering, the clustering is further performed on the OBS numerical vectors (obtained by extending the numerical vectors of
Since the respective primary clusters are subjected to the secondary clustering to form the 30 secondary clusters, 600 (20×30) secondary clusters in total are obtained. The respective OBS numerical vectors (65000 items of data in the example) belong to one of the 600 secondary clusters.
The initial values of the core vectors of the 30 secondary clusters used for the secondary clustering can be randomly defined similar to the primary clustering. The initial values can be defined according to an empirical rule in terms of the past data. The core vectors are updated by the same method as the primary clustering to find the final core vectors, and the 30 final core vectors are used to perform the final clustering, thereby forming the 600 secondary clusters.
Next, the processing proceeds to an occurrence rate calculating step 37, where a 20-divided histogram is created from the thus-obtained data of the 600 secondary clusters with the horizontal axis as the value of parameter and the vertical axis as the number of items of data per parameter (
At this time, the values of a % and b % are adjusted by the parameter residue of the clustering as a guide of the dispersion of the OBS data in the secondary cluster. The parameter residue is indicative of how long a respective parameter value of the numerical vector is away from the core vector as the center of gravity of the cluster to which the numerical value belongs. In the example, a difference between a respective numerical vector and the center of gravity of its belonging cluster is taken and the root-mean-square of the difference (square root of square sum) indicates the residue. Since a simple difference has a sign of plus or minus, the root-mean-square is used to know a magnitude of the difference.
For the parameter in which the parameter residue is larger and the dispersion is larger, a % and b % are reduced to widely extract a range of normal values (
The method described with reference to
The count bars having a % or less of the total number of counts are deleted. Assuming that a % is 1%, the bars having the number of counts of 1 or less are deleted and the bars in table 3 remain. In the example of
Adjacent bars are combined and grouped in the mauling bars. In this example, the bars are grouped as shown in table 4.
The groups having the total number of counts of b % or less are deleted. For example, the groups having 20% or less are deleted, the group having the group ID of 3 is deleted and the groups in table 5 remain. In the example of
The processing then proceeds to a reference value setting step 39, where the range of the columns in the remaining groups, that is, the characteristic amounts of the driving parameters is set as a range of the normal values. In the example, 0.0 to 0.1 and 0.3 to 0.7 are the ranges of the normal values. In order to simplify the data processing, the ranges of the normal values are collected into one to be a range of reference values used for the failure diagnosis. In the example, the range of the reference values is between 0.0 and 0.7. In the example of
As described above, in the embodiment, the range of the normal values of the respective driving parameters can be generated under a condition in which 600 driving situations are different.
In order to make a diagnosis of a filed vehicle using the normal value data, the ECU data to be diagnosed in the failed vehicle is read by the diagnosis machine and the most similar driving situation data is retrieved from among the normal value data (600 items of driving situation data as different driving situations). Then, the normal value data of the retrieval result and the ECU data to be diagnosed are compared with each other to judge which parameter is off the reference value (the range of the normal values) among the driving parameters under the same condition, thereby retrieving a failure portion based on the judgment.
There are various methods for retrieving the most similar items of driving situation data, and as one example thereof, the characteristic amount of each driving parameter of the ECU data is calculated as described in the embodiment, and the items of normal value data similar to the ECU data are approximately retrieved from the 600 items of normal value data to select the target normal value data.
The present invention has been described above with the specific embodiment. The present invention is not limited to the embodiment.
Number | Date | Country | Kind |
---|---|---|---|
2008-264593 | Oct 2008 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2009/004818 | 9/24/2009 | WO | 00 | 4/8/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/041382 | 4/15/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5170343 | Matsuda | Dec 1992 | A |
5638296 | Johnson et al. | Jun 1997 | A |
5727383 | Yamashita et al. | Mar 1998 | A |
6006146 | Usui et al. | Dec 1999 | A |
6053147 | Hemmerlein et al. | Apr 2000 | A |
6076504 | Stavnheim et al. | Jun 2000 | A |
6134488 | Sasaki et al. | Oct 2000 | A |
6314375 | Sasaki et al. | Nov 2001 | B1 |
6484109 | Lofall | Nov 2002 | B1 |
6581371 | Orzel et al. | Jun 2003 | B1 |
7042663 | Shimotono et al. | May 2006 | B2 |
7383238 | Iverson | Jun 2008 | B1 |
7765039 | Hagenbuch | Jul 2010 | B1 |
20010049579 | Fujino et al. | Dec 2001 | A1 |
20020059075 | Schick et al. | May 2002 | A1 |
20020188392 | Breed et al. | Dec 2002 | A1 |
20030050747 | Kamiya | Mar 2003 | A1 |
20030055666 | Roddy et al. | Mar 2003 | A1 |
20040003401 | Gutta et al. | Jan 2004 | A1 |
20040054507 | Mott | Mar 2004 | A1 |
20040181712 | Taniguchi et al. | Sep 2004 | A1 |
20050049832 | Gorinevsky | Mar 2005 | A1 |
20060030993 | Goodell et al. | Feb 2006 | A1 |
20060196167 | Odajima et al. | Sep 2006 | A1 |
20060200258 | Hoffberg et al. | Sep 2006 | A1 |
20060260573 | Urushihata et al. | Nov 2006 | A1 |
20070005206 | Zhang et al. | Jan 2007 | A1 |
20080086257 | Iihoshi et al. | Apr 2008 | A1 |
20080103675 | Ishizuka et al. | May 2008 | A1 |
20080117020 | Martin | May 2008 | A1 |
20080137961 | Ishida et al. | Jun 2008 | A1 |
20080162017 | Nagata et al. | Jul 2008 | A1 |
20080172169 | Kuronita et al. | Jul 2008 | A1 |
20090138153 | Mylaraswamy et al. | May 2009 | A1 |
20090216584 | Fountain et al. | Aug 2009 | A1 |
20090254243 | Uchihashi et al. | Oct 2009 | A1 |
20090306841 | Miwa et al. | Dec 2009 | A1 |
20100101409 | Bromberg et al. | Apr 2010 | A1 |
20110153154 | Hagenbuch | Jun 2011 | A1 |
20110196572 | Tsuchikiri et al. | Aug 2011 | A1 |
20110208567 | Roddy et al. | Aug 2011 | A9 |
20120036016 | Hoffberg et al. | Feb 2012 | A1 |
20120209634 | Ling et al. | Aug 2012 | A1 |
Number | Date | Country |
---|---|---|
62-261938 | Nov 1987 | JP |
06-095881 | Apr 1994 | JP |
2004227439 | Aug 2004 | JP |
2005-117756 | Apr 2005 | JP |
2007-281959 | Oct 2007 | JP |
Entry |
---|
International Search Report, PCT/JP2009/004818, Nov. 17, 2009, a total of 3 pages. |
Number | Date | Country | |
---|---|---|---|
20110196572 A1 | Aug 2011 | US |