A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
This invention relates to determining vehicle insurance risk and more specifically to development and usage of an insurance risk database that is referenced to elements of a transportation network.
There is a need in the automotive insurance industry to accurately predict the risk of claims being made and the costliness of claims being made and adjusting the insurance rate charged to an individual or for a vehicle accordingly. The more accurate the prediction, the lower the premiums can become, making the insurer more competitive and presumably profitable, and/or the insurer may choose to not insure individuals or vehicles of the perceived greatest risk or smallest profit potential.
It is known in the art to base premiums on such thing as the geographic area where a driver lives, or potentially the area s/he drives through on a regular basis. It is also known to further evaluate rates based on the historical location and frequency of accidents, crime rates, traffic flow and/or claims made in the vicinity of geographic area used as a rating territory. It is further known to adjust the rates based on the drivers past driving history with respect to insurance claims and driving record.
One of the many problems with existing insurance risk rating systems is that they are too granular or non-specific. For example, typically a geographic area for rating would be based on the address of the owner of the vehicle. This would mean that all the residents of a given area or neighborhood would be lumped into the same rate category. These rates could be adjusted for factors such as the type of car being insured on how expensive claims are for that particular type of car in the area of interest, however this type of rating system generally does not take into account the areas typically driven through on a regular basis by the driver.
Another issue with current insurance risk rating systems is that assumptions made in the systems may not be valid. For example, most would agree that if a person obeys the traffic laws, then that person's driving risk would be less. This may not be the case and the embodiments of the present invention make no such assumptions.
There is a need in the industry to have a vehicle insurance risk system based on one or several parameters that are spatially referenced with respect to the transportation network the vehicles travel on and further based on the driving habits of individual drivers that are insured or desire insurance. Knowing when and where a driver drives and knowing the historical risk associated with driving a given route at a particular time, a formula can be derived to predict risk for individual driver which in turn can be used to set rates. Because the parameters related to driving/insurance risk and driving habits of a given driver are associated with transportation system elements in the present invention, a more refined model of risk is possible than for insurance risk solutions in the art. Determining premiums based on a single point or region (for example a residential address) does not take into account where a person drives on a regular basis.
As the correlation between one or more risk parameters and insurance risk may vary over time and may vary regionally, it may be needed to statistically analyze the parameters used in a model and continually change them over time. In addition, historical parameters used may lose relevance with time and will need to be retired or withdrawn from the determination of risk—relying on more recent data.
Real time information (while the insured is driving) may be much more relevant to risk. For example, if the road is icy, the likelihood of making a claim is potentially higher, than if the only information available is that is likely to be icy at the timeframe when driving.
With a dynamic rating system that is continually updated and also has a real-time component, it is further possible to compel drivers to adjust driving habits based on the real-time information to reduce the risk. For example, if a particular route is known to be icy, and the course the driver is taking is being monitored, and the monitoring system further suggest an alternate non-icy route, then the driver can be rewarded for avoiding risky conditions by a reduced premium, or by monthly rebate checks or similar.
Real-time information can come from a variety of sources such as wireless acquired weather information and traffic reports. This information can further be statistically aggregated to produce historical weather/traffic risk information likelihood indices that are spatially and temporally indexed. Metadata associated with the historical information can then be used to cull older information and continually update the indices with the latest information. Also continuous, real time, accumulation of accident reports with root causes can be helpful to asses and distribute that risk across the total driving space of some geographic region.
Glossary:
Driver Insurance Risk: The probability that an insured will make a claim and for how much given a variety of measured factors. It could also refer simply to the probability of being in an accident.
Transportation Network: A system of road, streets, paths, sidewalks, trails, waterways or other ways that a vehicle or pedestrian travels along. A transportation network can be subdivided by the type of vehicle or pedestrian that is intended to be used for. For example, roads and streets may be used by cars, trucks and busses. Trails and sidewalks may be used by pedestrians and perhaps bicycles. Transportation networks are generally stored in a Geographic information System that documents the location and interaction of various components of the transportation network. Attribution is also associated with the various components of the network.
Element: Is a distinct component of a transportation network that has an associated geographic coordinate/s. Examples of elements are road segments where the road begins and ends at an intersection; an intersection between two or more roads; or the boundary of a lake.
Attribution: Attribution associated with a transportation network includes any piece of information that can be related to a spatially referenced element or component of the transportation network. Examples are such things as speed limits, number of lanes, connections between components, or type of vehicle that can traverse the component. Attribution, in addition to being spatially referenced may have a temporal (time) component expressed as, for example, time of day, time of week, or time of year. An example of this is the speed limit in a school zone.
Metadata: Metadata is a special kind of attribution associated with the quality of components of transportation network. Metadata can be associated with individual geographic components, attribution or the source of the geography or attribution. Metadata may be associated with precision or accuracy of the components or source. Metadata may have a component that list the age of the source.
Index: two meaning are used:
Maneuver/Complex Maneuver: A maneuver is an attribute associated with an action that can be either perform or not performed and which is associated with one or more components of a transportation network. For example, a no-left-turn at an intersection is an example of a prohibited maneuver. A complex maneuver is generally associated with more than one component of a transportation network—for example, what is known as a Michigan Left Turn, in which a vehicle desires to turn left at an intersection, but in order to do this has to turn right, cross one or more lanes, then cross a median on an avenue, then turn left, is a complex maneuver.
Parameters: Any factor that may be directly or indirectly be related to insurance risk.
Geocode: Process of taking a street address and determining a geo-referenced coordinate usually a latitude and longitude and further determining the associated transportation segment associated to the street address.
Snapping: Refers to the process of finding the nearest transportation segment (via perpendicular distance) to a given geo-spatial coordinate location.
Multivariate Analysis: A class of statistical analysis used to determine the relevance of one or more parameters in predicting an outcome and used to build a predictive function base on one or more of the analyzed parameters. In this case the outcome is the prediction of insurance risk or driving hazard assessment. An example of a multivariate analysis is an Artificial Neural Network (or simply a neural network). Another example is any form of machine learning.
Threshold: In multivariate analysis, several factors contribute to the predictive model. Some factors can be more relevant or more influential than others. For example the number of accidents in the past along a particular road segment, may be a better predictor of insurance risk of driving that segment than the average vehicle speed along the segment. However a relative weighting of the two parameters may predict better than either one used singly. So if a predictive model, when using a particular factor in the prediction, does not perform appreciably better than if the factor was not incorporated in the model, the factor can be removed from consideration. When this happens is when the difference in the two predictions is less than a preset threshold value.
Accident Count: The number of accidents that occur for a given element of the transportation network over a given time. This may be further subdivided based on weather conditions and/or time of day, time of week or based on other attributes that may influence accident occurrence.
Incident: A single occurrence of a measured parameter. For example an individual accident report is an incident of the parameter accidents; a recorded speed of an individual driver along a segment of road is an incident of speed of travel for that segment.
Granularity: This term is used to refer to the specificity of either an attribute or index. For example, if an accident count is based simply on the transportation element it took place on, it is less granular than if the accident count is based on the location (element) and the time.
Insurance Risk: This term is used collectively for all embodiments of the present invention to encompass the desired outcome of an insurance risk model. Examples of desired output are the probability of: having an accident, making an insurance claims, or making an insurance claim within defined monetary limits.
Crowd Sourced: Information that is gathered from voluntary (or otherwise) information that is contributed to a website or webservice via an internet link. This information can be anything from verbal reports concerning traffic, to GPS trails that observe a drivers location and speed in real-time, which can then subsequently be used to update maps and other information pertaining to traffic or hazard.
Outside Sourced: all sourcing of risk factor information that are not part of vehicle tracking and sensor analysis. This can include crowd sourcing, police reports, accident reports from insurance and/or police, weather from weather bureaus or crowd sourced, pavement conditions from highway departments or state government, traffic data from published or crowd sourced services and many others.
Statistically Significant: refers to a minimum amount of information that can be used to achieve acceptable predictions of risk or hazard. For example if a predictive function relies heavily on a variable such as the average speed of vehicle passage for each road segment, then wherever there is no information concerning the average speed for any segment, then an average speed would have to be assumed. You could default to the speed limit for example. The more road segments that have an estimated average speed, the poorer the prediction of risk will be. A threshold needs to be in place to exclude information that is below a pre-defined value of percent coverage.
Statistical Relevance: in any form of multivariate analysis, one or more measurements or parameters are used to predict an outcome. In this case an outcome is the risk associated with driving along a transportation element. In the process of developing the prediction function, it may be found that removal of certain parameters or measurements from the predictive function, does not appreciably change the prediction. A threshold can be set, pertaining to how much a specific parameter influences the prediction and if the correlation between an actual outcome and the predicted outcome does not improve about the threshold, then the parameter can be dropped from consideration. This is not to say that it could not be re-introduced when more or better data is available, or used in other geographic areas.
Sensor derivative: Sensors that are incorporated in a vehicle or are within a vehicle (accelerometers in a smartphone where the smartphone is in the vehicle for example) can have the output evaluated and turned into a parameter. For example if an accelerometer indicates rapid acceleration in the direction of the front of the vehicle and a tire spin sensor records an event, this may be registered as a sensor derivative called dangerous acceleration. If there is a rapid acceleration to the left followed by a rapid acceleration to the right, this may be registered as a dangerous lane change event.
Below are examples of elements of a vehicle insurance risk database. Some or all of these elements may be used to develop a risk model or risk indices.
A primary object of the present invention is a method to develop a database comprising parameters that are related to insurance risk and/or driving hazard to be used for vehicle insurance rating and/or pricing and furthermore, where the parameters are related to transportation network elements.
Another object of the invention is to determine which parameters or combination of parameters best predicts insurance risk for individual drivers or individual vehicles.
A further object of the present invention is a maintenance and update method for the above mentioned databases.
Yet another object of the present invention is to track and parameterize the driving habits of individual drivers and to compare those driving habits to historical parameters and habits of other driver in order to predict individual insurance risk or driving hazard.
It is a further object of the present invention to influence the driving habits of individual drivers by suggesting safer routes or driving habits and to reward or penalize individual driver based on their utilization or lack of utilization of suggestions.
It is an object of the present invention to develop a system that comprises a database, software and hardware to predict insurance risk or driving hazard, to mitigate insurance risk or driving hazard while individuals are driving and to set insurance premiums based on the database and real-time input.
It is an object of the present invention to develop an insurance rating system based on accident counts for individual elements of a transportation network and how frequently a driver travels elements with accident risk.
It is an object of this invention to display driving hazard or insurance risk relative to transportation segments on a map of a transportation network.
It is an object of this to route from an origin to a destination taking into account hazards and risk data from the hazard/risk database.
The drawings constitute a part of this specification and include exemplary embodiments to the invention, which may be embodied in various forms. It is to be understood that in some instances various aspects of the invention may be shown exaggerated or enlarged to facilitate an understanding of the invention.
It is not presumed that relationships between parameters and risk hold true over large areas—there may be locally relevant predictors that are not as significant as in other areas. Certain historical datasets or parameters may not be as readily available in some areas as they are in others. For example, reports documenting accidents and accident locations may be more readily available and more easily input into a database for an urban area than for a rural area. Or accident reports may not be available, but traffic counts which may indicate accidents may be available.
Ideally the attribution used for insurance rating will be easier to deal with if it is consistent throughout the entire rating area. To accommodate this, it may be necessary to approximate a parameter stored in the database with input from a related parameter. For example, from the previous paragraph, you may wish to store accident occurrences associated with each road segment. If accident reports are not available for an area of interest but traffic flow information is, you may be able to infer that while traffic stops or slows way down that this is caused by an accident. This could then be reflected as an accident occurrence. This inferred accident occurrence could further be reflected in the metadata as the source for the accident count and an indication that the count is less reliable than an actual accident count. Another means of getting the proxy is the road quality, like road maintenance, and quality of the road surface type.
Accordingly as shown in
Any model or predictive function could be greatly influenced by information that is acquired in real-time or near real time from drivers. This information could comprise things such as speed of travel, braking, engine function, acceleration, route taken and many others. If this information is readily available, it will influence the design of the predictive database. Therefore sources of pertinent real time information need to be identified 104. Real-time information could come from insurance subscribers that opt into an insurance plan that mandates monitoring or could be crowd sourced by volunteers. Additionally real-time information could come from sources such as commercial traffic information providers or local government highway or police departments.
Based on what historical information that is available and what quantity there is and what type of real time information can be acquired, the database schema or design can then be created 106. All parameters to be stored in the database will be geographically referenced 114 relative to an underlying GIS database 112 of the transportation network. Certain parameter (for example a speed limit) may also be temporally referenced.
Once a rating system is running based on the database, some of the data in the database may be retired based on age or when more accurate information becomes available. Therefore metadata about the age and quality of the data needs to be documented 110.
As is well known, the probability of an accident will increase with increased traffic density and/or due to inclement weather. This information may be available 210 with incoming accident reports or may be available via other sources such from a weather service which then can be related to an accident incident via location and time.
The probably of an accident may increase based on the time. For example the probability of an accident most likely increases at 2 AM (2:00) on New Year's day as opposed to any other day at the same time. Therefore any form of attribution that can be associated with an incident should be added 212 so that it can be analyzed to see if there is any correlation with risk.
The granularity of associated information will vary. For example if a traffic flow was associated with a particular accident and that traffic flow information was acquired from a Traffic Messaging Channel (TMC), this information may not be associated with the exact location of the accident and therefore may be suspect. The quality of the associated attribution for accident reports needs to be documented as metadata 214.
It should be noted that initially accident reports (and other parameters) would come from historical data such as police reports, however, this could be supplanted by real time information coming from vehicle sensors. For example, if an insurance subscriber allowed access to the insurer for output from car sensors, an accident incident could be recorded at the gps location of the vehicle when there was signal indicating that the air-bag was deployed. Once again the source of the report or parameter should be included as part of the metadata and be used as a measure of quality. Other driving telemetry obtaining devices which may be installed on the vehicle (perhaps at the behest of the insurance company) would be used to obtain additional pertinent information.
Examples are shown below of incidents that can be recorded in a risk database and which can subsequently be used to determine driving/insurance risk. Examples of associated attribution are also provided. These are examples only and is not an exhaustive list.
The following list are examples of information that may be recorded for an individual driver and may come from either/or questionnaires or real-time sensor information: Type of car; where you drive; when you drive; snow tires during winter; previous tickets
Once a historical database of incidents, for example, accidents and traffic violations is developed and referenced to transportation elements, then analysis can be performed to determine relationships to risk and hazard. Once again, no a priori assumptions are made about a correlation between a particular parameter and risk other than initial assumptions that are made to run and test a multivariate model.
In an embodiment, incidents are evaluated based on the quantity and quality of information available and also the extent over which the information is available. The goal is to create a risk and/or hazard index or indices based on one or more of the type of incidents recorded related to elements of the transportation network.
In an embodiment, what is desired, is a function to predict the likelihood that a given driver will make an insurance claim and for how much or for example the likelihood the driver will be involved in an accident. The likelihood of claims and cost of those claims or the likelihood of being in an accident can be a function of:
These parameter can be further broken down into:
The factors that may influence the number and amount of insurance claims or the risk of being in an accident may be exceedingly complex. This is why the analysis lends itself to a form of multivariate analysis. Typically a human can only visualize the relationship between 2, maybe 3 variables at a time and a parameter my not be directly related to a cause of an incident, but may provide an indication of the cause. For example in one area, it may be found that the instance of traffic accidents at 2 AM is far greater than in another area. Therefore you could conclude that time of night is not a very good overall predictor of having an accident. However if you also observe that in the first area, the instance of arrest for drunk and disorderly is higher than the second area, the combination of time and arrests for intoxication, may be a much better predictor. If yet more variables are introduced, then the relationship may get more complicated and more poorly understood without some form of multivariate statistical correlation.
In another example the quality of the information will influence the predictive model. It is well known that ice formation on a road is a function of temperature, humidity and barometric pressure. However if the weather conditions in an accident report are based on the general weather conditions for the region from a weather report, this data will not take into account, subtle weather variations that may be available from in-car sensors. A difference of a degree in temperature could make the difference between ice and no ice.
As shown in
It is desirable to have as much granularity in the observed information as possible in order to determine what information correlates more strongly to risk and hazard. Using the accident report example, we want to predict insurance risk. Therefore, for all the accidents that occur in a region, if we have information on the insurance pay-out, a model can be developed that uses part of the information as a training set 306, for example in a neural network predictive model known in the art and part of the data to test the prediction 308.
In many multivariate analysis methods, initial assumptions need to be made to come up with a working predictive function 306. For example, initial weighting or correlation values might need to be assigned to the input variables. An educated guess may be that the number of pot holes in a road is about half as important to risk as the number of drunk driving arrests.
Once an initial model is generated, an iterative process 310 is used to converge on a reasonable predictive model. This is done by modifying the weighting of input parameters slightly 312, then rerunning the new predictive function and observing the correlation statistics until an optimal correlation is arrived at.
In an embodiment, the input for a model may need to be parameterized in such a way as it can be used into a model. An example of parameterization would be to characterize incidents into a grouping. For example, it may be desirable to collectively refer to accidents counts falling into a range of 1-10 accidents per year as a “low” accident count and have “medium” and “high” counts as well.
As was previously pointed out, the parameters that could be used to predict insurance risk and/or driving hazard and the resulting model could be exceedingly complex. Compiling information from a variety of sources to populate a given parameter may be difficult and if available data is insufficient, may also result in a poor prediction. Therefore, in order to keep the cost of the rating system low and to facilitate rapid development, it may be desirable to limit the data/parameters that are utilized and make some simplifying assumptions.
In an embodiment, the assumption is made that insurance risk for driving on a particular transportation element is directly correlated to the number of accidents reported on that element over a set time period. Therefore the risk database could simply contain accident incidents that are related to individual transportation segments. If available, additional attribution that may be recorded with accident incidents are, for example, direction of travel, time of day, date, and weather variables. In this embodiment, it is further assumed that insured drivers have agreed to have their driving habits monitored. If the person is applying for insurance, an initial insurance premium could be based partially on the area of residence and some average of accident risk within a geographic radius of the residence.
Alternatively or after an initial rate is applied, the weekly habits (or longer duration) of the driver could be monitored. By monitoring when and where the driver has been, then, for example, it could be determined all the transportation elements the driver has traversed for a given rating interval and how many times they have been traversed.
As shown in
To determine an insurance premium based on the above accident counts, then the risk associated with an individual's driving habits needs to be assessed. The can be done by collection of data while an individual is driving 406. Data to be collected comprises when and where a person is driving and then relating that information to the transportation elements a person drives on and the frequency they drive on them. From this information, a basic Hazard Index (I) can be developed 408. In its simplest form, the Hazard Index is the summation of the accident count for all elements traversed multiplied by the number of traversals for a given time period. Finally insurance premiums could be adjusted based on the individual Hazard Index when compared to other individuals.
Yet more refinement of an individual Hazard Index could be made by further subdividing the index based on additional attribution such as weather and road conditions provided that the accident count database has this amount of granularity.
As more drivers are monitored, gradually, historical data gleaned from accident reports could be replaced by, for example, air bag deployments sensor information from insured drivers. The air bag deployment could be related to accident occurrence and severity and would make it unnecessary to acquire accident information from other sources such as accident reports from the police.
Granularity can be further increased by further analysis of recorded data about the vehicle. For example, there is possibly a correlation between driving behavior just prior to an accident and the probability of the accident happening. So if a driver is accelerating rapidly or changing lanes frequently, this may indicate increased probability of having an immediate accident.
In an embodiment of the present invention, once a database of insurance risk is established and maintained with current information, then commercial risk products can be created that map the associated driving risk to transportation elements. This product can be sold to municipalities and other entities responsible for safety on transportation networks.
Yet another embodiment of the present invention is a method to reduce driver/insurance risk utilizing one of the above described risk and driver habit databases and monitoring of a driver activities and habits in real-time. The system utilizes a navigation device located in a vehicle. The navigation device is either in communication with a risk database and a driving habits database or the databases are stored within the navigation device. The navigation device can be integral to the vehicle, a stand-alone device or software implemented on a computer, smartphone or tablet device. Generally location is determined by a GPS which is part of the navigation device. Based on former analysis and part of the driver habits database, the system pre-determines routes that the driver in question has historically taken. It further determines the propensity of the driver to deviate from safe driving habits such as driving faster than the speed limit or swerving in the other lane or using a mobile phone (as determined from blue-tooth usage for example).
As shown in
It should be noted that insurance premiums based in part on driving habits, can be underwritten in a conventional manner for a vehicle, or underwritten for a specific driver as long as when monitoring a vehicle, the driver is identified in some manner and the data acquired and stored is referenced to the specific driver.
In addition, deviations from safer driving habits are monitored during driving 512. If a bad driving habit are detected—say, for example, exceeding the speed limit—advice can be displayed to slow down. If the advice is taken, then this information can be treated as in the above safe route scenario 516.
The present invention may be conveniently implemented using one or more conventional general purpose or specialized digital computers or microprocessors programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
In some embodiments, the present invention includes a computer program product which is a non-transitory storage medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. For example, although the illustrations provided herein primarily describe embodiments using vehicles, it will be evident that the techniques described herein can be similarly used with, e.g., trains, ships, airplanes, containers, or other moving equipment. It is intended that the scope of the invention be defined by the following claims and their equivalence.
This application claims the benefit of priority to U.S. Provisional Patent Application titled “Risk Based Automotive Insurance Rating”, Application No. 61/968904 filed on Mar. 21, 2014 which is herein incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61968904 | Mar 2014 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14460868 | Aug 2014 | US |
Child | 16525384 | US |