The present invention relates to systems and methods for estimating traffic signal information. Traffic signals have been an indispensable element of our transportation networks since their inception and are not likely to change form or function in the foreseeable future. While traffic signals ensure safety of conflicting movements at intersections, they also cause much delay, wasted fuel, and tailpipe emissions. Frequent stops and goes induced by a series of traffic lights often frustrate drivers. In arterial driving, the complex and unknown switching pattern of traffic signals often makes accurate travel time estimation or optimal routing impossible, even with modern traffic-aware in-vehicle navigation systems.
Many of these difficulties arise due to the lack of information about the current and future states of traffic signals. In an ideal situation where the state of a signal's timing and phasing is known, the speed of a vehicle could be adjusted for a timely arrival at green. One can expect considerable fuel savings in city driving with related art predictive cruise control algorithms. When idling at red becomes unavoidable, knowledge of the remaining red time can determine if an engine shut-down is worthwhile. A collision warning system can benefit from the signal timing information and warn against potential signal violations. Future navigation systems that have access to the timing plan of traffic lights can find arterial routes with less idling delay and can also provide more accurate estimates of trip time.
The main technical challenge to deploying such in-vehicle functionalities is in the reliable estimation and prediction of Signal Phase and Timing (SPaT) information. Uncertainties arising from the clock drift of fixed-time signals, various timing plans of actuated traffic signals, and traffic queues render this a challenging and open-ended problem. Direct access to signal timing plans and the real-time state of the signal is prohibitively difficult, due to the hundreds of local and federal entities that manage the more than 300,000 traffic lights across the United States alone. Even when such access is granted, much effort and time must be spent in structuring information from various municipalities in standard and uniform formats. The recent emphasis on Dedicated Short Range Communication (DSRC) technology for communicating the state of traffic signals to nearby vehicles has safety benefits, but requires heavy infrastructure investments and is limited by its short communication range.
In recent years several related art methods have been developed that utilize mobile phone or vehicle probe data for estimation of traffic flow. Today many traffic information providers, such as Google®, INRIX®, and Waze®, use data from vehicle and cellular phone probes, as well as other devices, to estimate the severity of traffic on highways nearly in real-time. However, such algorithms perform relatively poorly in arterial networks, because traffic signals induce complex queue and stop-and-go dynamics. Other related art methods have focused on estimating queue lengths and determining locations of traffic signals and stop signs by using vehicle probe data. However, the related art does not provide a systematic attempt to derive SPaT information from available vehicle data streams. M. Kerper, C. Wewetzer, A. Sasse, and M. Mauve, “Learning traffic light phase schedules from velocity profiles in the cloud,” in Proceedings of 5th International Conference on New Technologies, Mobility and Security (NTMS), 2012, pp. 1-5, describes a simulation study that is performed to show the feasibility of determining SPaT information using probe data. However, the results are limited by the assumptions that the data is updated at a high rate of approximately 1 Hz, and that the penetration level is high.
Unfortunately, currently one cannot expect high update rates from public fleets that broadcast their information, nor is there a proliferation of vehicle probes. Most related art vehicle probes provide only event-based updates, for example at a time of a crash or an air-bag deployment. Some data sources, such as San Francisco taxi cab data available through the Cabspotting program, have update rates of only once per minute. Slightly more frequent updates are available through NextBus®, a service that provides a real-time eXtensible Markup Language (XML) feed of a global positioning system (GPS) time stamp, position, velocity, and several other attributes of transit buses of a few cities in North America. Some instances of this feed, such as San Francisco MUNI stream, have update rates on the order of twice per minute. Further, intersections along a bus route are generally traversed by a bus every few minutes during the day.
Accordingly, it would be advantageous to provide systems and methods for estimating traffic signal phase and timing information from statistical patterns in low-frequency probe data. For example, it would be advantageous to estimate cycle times and durations of reds and greens for fixed-time traffic lights, and to estimate the future starts of greens in real-time. Such information about traffic signals' phase and timing may be valuable in enabling new fuel efficiency and safety functionalities in connected vehicles. For example, velocity advisory systems could use the estimated timing plan to calculate velocity trajectories that reduce idling time at red signals and therefore improve fuel efficiency and lower emissions. In addition, advanced engine management strategies could shut down the engine in anticipation of a long idling interval at red. Further, intersection collision avoidance and active safety systems could also benefit from the predictions. Various applications of SPaT information are discussed in copending U.S. application Ser. No. 13/840,830, the entire disclosure of which is incorporated by reference.
Exemplary embodiments of the invention provide systems and methods for estimating traffic signal information. According to an aspect of the invention, positioning system data is obtained from a plurality of vehicles. Each data set includes the position and the velocity of a vehicle as functions of time. For an intersection having a traffic signal, an average acceleration of the vehicles when leaving the intersection is estimated, and an average deceleration of the vehicles when approaching the intersection is estimated. For each of a subset of the vehicles, a stop duration at the intersection is estimated based on the average acceleration, the average deceleration, and the positioning system data for the respective vehicle. A duration of a red phase of the traffic signal is estimated based on the stop duration of each of the subset of the vehicles.
The average acceleration and the average deceleration may be estimated based on the positioning system data for the vehicles and a location of a stop bar behind which the vehicles stop at the intersection. The average acceleration and the average deceleration may be estimated by using a least-square estimation and removing outlier data points.
The subset of the vehicles may be determined by selecting vehicles whose positioning system data include data points within a distance interval surrounding the intersection, wherein at least a first one of the data points is before the intersection and at least a second one of the data points is after the intersection; removing vehicles whose velocity is lower than a threshold; for each of the remaining vehicles, estimating an intersection delay based on the positioning system data for the respective vehicle; and removing vehicles whose intersection delay is negative or zero.
For each of the subset of the vehicles, estimating the stop duration may include estimating a stop time at which the respective vehicle stops at the intersection based on the positioning system data for the respective vehicle and the average deceleration; estimating a start time at which the respective vehicle leaves the intersection based on the positioning system data for the respective vehicle and the average acceleration; and estimating the stop duration based on the stop time, the start time, the average deceleration, and the positioning system data for the respective vehicle. Vehicles for which the stop time is greater than the start time may be removed.
Each data set may be obtained from a cellular telephone and/or a navigation device within the vehicle. The duration of the red phase of the traffic signal may be estimated by determining a maximum stop duration of the subset of the vehicles during a time period with relatively light traffic. The update frequency of the positioning system data may be not greater than twice per minute.
According to another aspect of the invention, positioning system data is obtained from a plurality of vehicles. Each data set includes the position and the velocity of a vehicle as functions of time. For an intersection having a traffic signal, an average acceleration of the vehicles when leaving the intersection is estimated. For each of a subset of the vehicles, a start time at which the respective vehicle leaves the intersection is estimated based on the positioning system data for the respective vehicle and the average acceleration. A cycle time of the traffic signal is estimated by calculating a difference between the start times of the vehicles for each pair of consecutive vehicles in the subset, and solving an optimization problem based on the differences and the cycle time. A Gaussian mixture model may be fit to a histogram of a remainder of a division of the difference and the cycle time, and a signal offset caused by a schedule change of the traffic signal may be estimated based on clusters within the Gaussian mixture model.
According to yet another aspect of the invention, positioning system data is obtained from a plurality of vehicles. Each data set includes the position and the velocity of a vehicle as functions of time. For an intersection having a traffic signal, an average acceleration of the vehicles when leaving the intersection is estimated. A start time at which the respective vehicle leaves the intersection is estimated based on the positioning system data for the respective vehicle and the average acceleration for each of a subset of the vehicles, and a start of a future green phase of the traffic signal is estimated by calculating a moving average of the start times within a time interval defined by a cycle time of the traffic signal.
The moving average may be calculated by mapping the start times within the time interval onto a circle, such that each of the start times is represented as a vector with an angle; determining an average angle as a direction of a vector sum of the vectors; and determining the moving average by mapping the average angle to a time axis. A variance of the moving average may be calculated based on a minimum cyclic distance to the moving average for a plurality of times of day, and a schedule change of the traffic signal may be detected based on a spike in the variance of the moving average as a function of time.
According to still another aspect of the invention, positioning system data is obtained from a plurality of vehicles. Each data set includes the position and the velocity of a vehicle as functions of time. Vehicles are selected whose positioning system data include data points within a distance interval surrounding the intersection, wherein at least a first one of the data points is before the intersection and at least a second one of the data points is after the intersection. For each of the selected vehicles, an intersection delay is estimated based on the positioning system data for the respective vehicle. Vehicles whose intersection delay is non-zero are removed. For each of the remaining vehicles, a time at which the traffic signal was green is determined by interpolating between a first time of the first one of the data points and a second time of the second one of the data points. An interval during which the traffic signal was green is estimated based on the times at which the traffic signal was green.
The interval during which the traffic signal was green may be estimated by aggregating the times at which the traffic signal was green by mapping the times at which the traffic signal was green onto a reference interval. A probability that the traffic signal will be green in the future may be predicted based on the interval during which the traffic signal was green.
Other objects, advantages, and novel features of the present invention will become apparent from the following detailed description of the invention when considered in conjunction with the accompanying drawings.
Exemplary embodiments of the present invention provide systems and methods for estimating traffic signal information.
For example, the positioning system data may be acquired from bus movements within a city. The bus data feed may be obtained from NextBus®, which provides data feeds for a number of North American cities in XML. NextBus® provides GPS data that includes the position and velocity of each bus, along with the time stamp and the bus identification number. The bus route data and the location of bus stops may be extracted from the same data stream.
Exemplary embodiments of the invention determine if a bus was stopped at an intersection, estimate how long the bus was stopped at the intersection, and at what time the bus left the intersection. By aggregating this information for many buses, exemplary embodiments of the invention estimate traffic signal information, such as the duration of a red phase, the cycle length, and the start of a green phase. Because the update points for each bus are sporadic, a bus trajectory is approximated between each pair of update points.
Exemplary embodiments of the invention select bus passes that have update points within a given interval before and after that intersection. For example, for the Clement intersection shown in
To determine whether a bus stopped at an intersection, the intersection delay td may be approximated by subtracting the projected travel time from the actual travel time as follows:
where x1, v1, and t1 are the position, velocity, and time stamp, respectively, of the last update of a bus before an intersection of interest, and x2, v2, and t2 are the position, velocity, and time stamp, respectively, of the first update of that bus after the intersection. Therefore t2−t1 is the actual travel time and
is the estimated travel time if the velocity of the bus changed linearly between v1 and v2. If td≦0, it may be assumed that the bus had no delay, and that it passed the intersection during a green interval. Otherwise, it may be assumed that the delay was caused by a stop at a red signal, which may be further confirmed as described below.
If td>0, the consistency of the trajectory shown in
If tstop>tstart, the postulated trajectory is invalid, and the associated bus pass may be discarded. On the other hand, if tstop≦tstart, the trajectory may be accepted as valid, and it is determined that the bus came to a full stop at a red light. The duration of red “observed” by a particular bus may then be estimated as:
where
is the time it takes a bus to come to a full stop after the driver detects that the signal is red. Aggregating tred for a sufficiently large number of bus passes may provide an estimate of total red duration of a phase.
The above calculations assume that the acceleration and deceleration of the buses are known constants. The average acceleration and deceleration of the buses may be estimated in any appropriate manner. For example, the average acceleration and deceleration may be obtained from published values in the literature. As an alternative, probe data may be used to approximate the average acceleration and deceleration of the bus fleet. As discussed below, tred is not highly sensitive to reasonable variations in the value of the acceleration.
Because of data sparsity, it is not possible to estimate the acceleration or deceleration of an individual bus. However, as shown in
v
2(x)=2āacc(x−xsignal) (5)
where āacc is the average acceleration to be estimated from the data. A similar equation may be written for a deceleration interval. By defining y=x−xsignal, Ψ=v2(x), and
Equation (5) may be reorganized in the following linear parameterized form:
y=θΨ (6)
Several data points may be stacked in a least-square approach to estimate the parameter θ and therefore āacc. As shown in
The sensitivity of the tred estimate in Equation (4) to variations in acceleration (and similarly deceleration) may be calculated as
Because v2 is at most around 20 m/s for a city bus and aacc and adec are greater than 1 m/s2, even a 20% error in approximation of aacc(δaacc/aacc=±0.2) results in a maximum error of 2 seconds for tred. The error is much smaller in most instances where v2 is much less than 20 m/s.
Exemplary embodiments of the invention may obtain the baseline timing for traffic lights by offline aggregation and averaging of crowd-sourced bus data. In particular, the duration of reds/greens of a phase and the cycle time of a traffic signal may be determined. Mere knowledge of the baseline schedule, obtained offline and using only historical data, has statistical value even when a signal's clock-time is unknown. For example, as discussed in G. Mahler and A. Vahidi, “Reducing idling at red lights based on probabilistic prediction of traffic signal timings,” in Proceedings of the American Control Conference, Montreal, Quebec, 2012, pp. 6557-6562, the baseline schedule of a signal may be used to predict the chance of a future green for an eco-driving application.
The following examples are based on results for a segment of Van Ness street, between the Lombard and Washington intersections. This is a sometimes congested street and is therefore suited to test exemplary embodiments of the invention under (relatively heavy) city traffic conditions. Additionally, the actual signal timing cards of the intersections of Van Ness may be used to verify the validity of the estimates. Most intersections on this segment of Van Ness are fixed time intersections with the same cycle time and red duration throughout all days of the week. For most of these traffic signals, only offset times change during rush hour schedule, as described below. In these examples, one month of data (September 2012) is aggregated from two bus routes, route 47 and route 49, in the southbound direction, totaling 4289 bus passes. This data is used to estimate signals' cycle time and the timing of the phases controlling southbound traffic on Van Ness, as explained below.
As shown in
For example, for the southbound phase on Van Ness street at the Lombard intersection, 347 bus passes remained after applying the filters described above to the 4289 total passes.
Also, many bus drivers may treat a yellow as red, thereby increasing their observed red time to a maximum of 63.5 seconds. Exemplary embodiments of the invention may account for this by subtracting 3.5 seconds from the observed red time. The process described above was repeated for a few other intersections on Van Ness, and the results are summarized in Table I. In most cases the estimated red durations are very close to the actual red durations. This is while, unlike the Lombard intersection, many of these intersections had a short red interval and a green-wave design that allowed most buses to pass through their green period; thus offering a smaller number of usable data points.
As shown in
b
g(j)=tstart(j+1)−tstart(j) (7)
For a given cycle time C, the remainder of division of bg and C may then be calculated as follows:
modC(bg)=bg−round(bg/C)C (8)
where the function round(•) rounds its argument to the nearest integer and the function modC(•) is a modified definition of remainder of division by C that allows negative values. For example mod10(12)=2 and mod10(8)=−2.
It is expected that modC(bg) will be close to zero on average, if the cycle time is fixed at C and signal clock drift between two qualifying bus passes is small. Therefore C may be approximated by solving the following optimization problem:
where it is assumed that there are n+1 qualifying bus passes during the interval of interest and therefore n calculations of bg. Observing that
the remainders may be normalized by C/2 to ensure that all values of C generate equivalent costs.
Because a signal cycle time is normally an integer and has a limited range, one can conveniently solve the above optimization problem by trying every feasible C. In the present example, integer values between 1 and 120 seconds were tried when determining cycle time of signals on Van Ness. To reduce the influence of signal clock drift, the choice of bg may be limited to those within a few hours, such as 5 hours in this example. Using one month of data, the estimated cycle time for the Lombard intersection was 90 seconds in this example, perfectly matching its actual value. This is visually illustrated in
Table I summarizes cycle estimates for a number of other intersections along Van Ness. For most intersections, the estimated and actual cycle times are identical. For the Washington intersection, the cycle time was estimated at exactly half of its actual value. This is partly due to a lack of enough qualifying bus passes for this intersection. There were only 94 bus passes that qualified the filters for the Washington intersection as compared to 347 passes for the Lombard intersection.
For real-time in-vehicle applications, it is important to have an estimate of the starts of future green (or red) phases. Estimating the start of a green is a challenging problem: even for fixed-time signals that have fixed cycles, a periodic projection of starts of greens can be inaccurate due to signal clock drift throughout a day. To address this problem, exemplary embodiments of the invention continuously estimate the start of a green phase based on the movement of buses that accelerate from a stop at an intersection.
As shown in
by applying the mode operator, e.g. for the ith qualifying bus pass, as follows:
t
i=modC(tstart(i)) (10)
An average estimate of the start of green may then be created in this reference interval. Note that a simple “linear” average will, in general, produce an erroneous estimate due to the cycle periodicity. For example, in the schematic shown in
on the circle. The average angle
Here m represents the number of samples used to calculate the moving average. The average start of the green is obtained by mapping back the average angle to the time axis:
The variance of this estimate is then obtained based on the minimum cyclic distance to the average, equivalently calculated by:
As discussed below, in some instances, the accuracy of
The traffic signals in the example on Van Ness street have 3 different schedules. While cycle times remain constant across multiple schedules for these intersections, each signal's offset with respect to other signals and with respect to a reference clock switches as the schedule changes. For example, at the Lombard intersection and during weekdays, the start of the cycle is moved backward by 34 seconds at 6 AM and at 3 PM, and moved forward at 10 AM and 7 PM. Exemplary embodiments of the invention may estimate the change in offset and time of this change, in order to rely on crowd-sourced data for predicting the start of a green.
A change in signal offset/schedule may be detected by keeping track of the starts of greens and detecting when a start of a green shifts significantly from its periodic prediction. A smaller value of variance calculated in Eq. (13) indicates that the corresponding m estimates of starts of greens are consistent with each other and are a multiple of C seconds apart. Right after a schedule change when the starts of greens are shifted by the offset times, the variance is expected to temporarily increase, until it is corrected by newer estimates of starts of greens. Jumps in the value of variance can then be indications of a change in the signal schedule/offset times.
In one example, three months of data are combined, and the variance of the moving average is calculated as a function of time of day.
As discussed above,
The exemplary embodiments described above have been based on movements of buses that stopped at an intersection. These embodiments filtered out bus passes that had no intersection delay, e.g. those that cruised through a green. This approach discards a substantial amount of data, in particular for signals that either are often green or are timed in a green wave. But there is useful information that can be extracted from passes during a green. For example, it is possible to interpolate a point in time that a phase was green based on the bus data before and after an intersection.
As shown in
Next an interpolation between update times t1 and t2 may be used to determine the point in time at which the signal was green at step 430. For the constant acceleration case, the position of the traffic signal is given by:
where
is the constant acceleration between two update points. Here tg denotes a time at which the signal was green, which is the feasible solution to the above quadratic equation:
Exemplary embodiments of the invention aggregate all point calculations of tg to estimate intervals of green at step 440. For signals with a fixed and known cycle time C, this can be done by mapping all values of tg onto a reference interval [0,C].
Because of the cyclic periodicity, the data can be better visualized if mapped onto a polar histogram in which one revolution corresponds to one cycle time.
The probability that the traffic signal will be green in the future may be predicted based on the interval during which the traffic signal was green. The probability may be determined by normalizing the histograms shown in
To determine the accuracy of the estimates, in particular the starts of greens, on-site ground truth tests were performed at the intersection of Lombard and Van Ness streets on Jun. 6, 2013. Between the hours of 7 AM and 4 PM, the actual start of a green of the southbound phase on Van Ness was recorded almost every 15 minutes as the ground truth. This was done with the aid of a computer program that upon a key press would log the time as synchronized with the NIST time server. The human observer's reaction time was determined to be less than 0.3 seconds.
Concurrently, the starts of greens were estimated according to the exemplary embodiments described above. This was done in real-time via a crowd-sourcing backend server. The XML updates from routes of interest are continuously parsed and the data is written to a SQL data server. Another computational node constantly monitors the data to estimate starts of greens and records the results on the SQL server. As an alternative, the agreement between actual starts of greens and crowd-sourced starts-of-greens could be monitored in real-time via a PHP web interface.
After each qualifying bus pass, new estimates for starts of greens were generated using i) the last data point only, ii) minimum-variance average of 3 samples chosen out of last 6 data points, and iii) minimum-variance average of 2 samples chosen out of last 4 data points. Note that crowd-sourced estimates of greens are sparse in time due to the fact that the bus data that qualifies the filters described above is infrequent. Therefore, between two actual estimates, the starts of greens are cyclically mapped using the estimated cycle time of the traffic signal. Also the change in signal offset during schedule change is accounted for in this process. The estimated values for starts of greens are then compared to the actual ground readings of the starts of greens.
The methods discussed above are executed by a computer processor that is programmed to perform the methods so that the processor executes the programming to perform the methods. Such a processor is needed to handle the large volumes of data and to perform the complex and computationally-intensive analysis of the methods discussed above. In addition, the processor is required to perform the methods in a commercially viable timeframe. Accordingly, it is necessary to quickly process large and complex data sets.
According to another exemplary embodiment of the invention, there is provided a non-transitory computer-readable medium encoded with a computer program for estimating traffic signal information. The term “computer-readable medium” as used herein refers to any medium that participates in providing instructions for execution. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, a hard disk, magnetic tape, any other magnetic medium, a CD-ROM, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EPROM, any other memory chip or cartridge, and any other non-transitory medium from which a computer can read.
The foregoing disclosure has been set forth merely to illustrate the invention and is not intended to be limiting. Since modifications of the disclosed embodiments incorporating the spirit and substance of the invention may occur to persons skilled in the art, the invention should be construed to include everything within the scope of the appended claims and equivalents thereof.