The present disclosure relates to the field of urban traffic estimation, and specifically, to a realtime urban traffic status monitoring method based on privacy-preserving compressive sensing.
In recent years, with the acceleration of urbanization in China, more vehicles are driving on urban roads, and this intensifies traffic congestion. Long-time traffic congestion not only causes serious waste of time and energy, affecting efficiency of urban economy to a certain extent, but also increases a probability of traffic accidents. Compared with extension of urban roads, vehicles are obviously growing faster. This means that urban traffic congestion will become increasingly serious. In recent years, people have been widely aware of the importance of realtime urban traffic monitoring. Traffic congestion can be effectively alleviated if drivers obtain a road congestion status in real time. Conventional traffic congestion estimation methods are all based on static traffic sensors such as a closed circuit camera and an inductive loop detector. These conventional methods are not suitable for large-scale use because of high deployment and maintenance costs. As vehicle-mounted sensors have increasingly good performance, vehicle data collection has become an important means for obtaining a traffic congestion status, and for example, is used for Google and Baidu maps. However, due to a large quantity of roads, it is necessary to collect and analyze mass vehicle data to obtain the traffic congestion status. This imposes new requirements for data storage and data processing. Therefore, it is urgent to own a device that can provide sufficient data storage and have a powerful data processing capability.
To implement traffic monitoring through vehicle data analysis, the vehicle data needs to be outsourced to a cloud. However, the vehicle data includes some sensitive information such as global positioning system (GPS) positions, report time, and speeds. Therefore, it needs to be ensured that the outsourced data is not leaked without authorization. If an attacker continuously intercepts data such as a speed and a GPS position, a driving route of a driver can be predicted by using a mining technology. What's more, GPS position information also reveals places the driver often visits, such as homes, offices and restaurants. In addition, a destination of the driver may also be predicted by comparing a start position and data collected from an odometer. Therefore, it is extremely important to protect privacy of the data. Data encryption algorithms include a homomorphic encryption (HE) algorithm and secure multiparty computation (MPC). However, even if the outsourced data stored in the cloud is encrypted, it is still difficult to speed up operation efficiency of the data encryption algorithm without damaging privacy of original data. In addition, with a rapid increase in urban roads and vehicles, it is impossible to collect data of all roads because this will increase a time delay of traffic monitoring. Therefore, it is necessary to use a little data to estimate a traffic congestion status of an entire road network.
In view of this, an objective of the present disclosure is to provide a realtime urban traffic status monitoring method based on privacy-preserving compressive sensing, to remarkably enhance a capability of protecting privacy of vehicle data, ensure rapid and accurate data processing, reduce energy consumed for urban traffic estimation, and shorten required traffic estimation time.
To achieve the above objective, the present disclosure uses the following technical solutions.
A realtime urban traffic status monitoring method based on privacy-preserving compressive sensing includes the following steps:
step S1: dividing vehicle data under privacy preserving into two parts, and sending the two parts to two different road side units (RSU) for preprocessing;
step S2: outsourcing, by the two different RSUs, preprocessed vehicle data to two cloud platforms (CP) respectively, and
designing a data encryption execution protocol based on a finally expected operation result and interactive operation between the two CPs, to encrypt the data; and
step S3: receiving, by a navigation service provider (NSP), encrypted data from the CPs, decrypting the received encrypted data, and estimating an urban traffic status by using a compressive sensing technology.
Further, the step S1 specifically includes the following substeps:
step S11: completing a setting that there are n roads in a city, a total of m pieces of data are uploaded by all vehicles on a road section j within a time period T, and each piece of data is in a format of u=[t,v,lat,lon], where t is uploading time, v is an instantaneous speed of a vehicle, and lat and lon are a longitude and a latitude of the vehicle;
step S12: constructing, by a system, the reported data as a vector Uj=[u(j,1), u(j,2), . . . , u(j,m)]T and a matrix M=[U1, U2, . . . , Un]:
randomly dividing the matrix M into M′ and M″:
where u(i,j) represents the ith data set reported on the jth road section, and u(i,j)′ and u(i,j)″ represent two random components of u(i,j), namely, u(i,j)=u(i,j)′+u(i,j)″:
step S13: sending the two random components to the different RSUs respectively;
step S14: processing, by the RSU when receiving data U, the data, constructing a vector V=[v1, v2, v3 . . . , vm] based on an instantaneous speed in each piece of data, and computing an average speed on the road section according to the following formula:
step S15: completing a setting that a quantity of known road sections is w, and obtaining the average speed V⊥=[v(⊥,1), v(⊥,2), . . . , v(⊥,w)]; and then dividing, by the RSU, the average speed into two vectors V⊥′=[v(⊥,1)′, v(⊥,2)′, . . . , v(⊥,w)′] and V⊥′=[v(⊥,1)″, v(⊥,2)″, . . . , v(⊥,w)″], where same processing is performed on the longitude and the latitude; and
step S16: outsourcing, to the CPs, vectors N′ and N″ obtained by dividing N, in other words, sending the vector N′ to CP A (CPA) and the vector N″ to CP B (CPB).
Further, the data encryption execution protocol is a data encryption protocol based on secure two-party computation, and includes a basic secure computation protocol and a matrix secure computation protocol.
Further, the basic secure computation protocol includes a secure multiplicative protocol, a secure division protocol, a secure cosine protocol, a secure sine protocol, an extended secure cosine protocol, a reverse exponential protocol, an extended multiplicative protocol, and a secure arccosine protocol.
Further, the matrix secure computation protocol specifically refers to extended secure computation protocols Mmul and Mdiv that are designed by means of hadamard product, where Mmul and Mdiv are respectively obtained by replacing random numbers in Mul and Div with matrices.
Further, the data encryption protocol based on secure two-party computation further integrates the compressive sensing theory. Specifically, a measurement matrix and a sparse representation matrix that are required for compressive sensing are also used as keys for vehicle data encryption, and then are integrated into the data encryption protocol based on secure two-party computation.
Further, the step S3 specifically includes the following substeps:
step S31: completing a setting that there are n road sections in a city, and computing traffic congestion rates of w road sections within a time interval T′ (w<<n), where CPA has a vector C′=[c1′, c2′, . . . , cw′] and CPB has a vector C″=[c1″, c2″, . . . , cw″];
step S32: generating, by a third-party service provider (TPS), a random measurement matrix Φ∈, and receiving, by the NSP, component vectors C′, C″, Φ, Ψ1, and Ψ2, to obtain a formula C′+C″=Φ×(Ψ1+Ψ2)×C, where C includes the traffic congestion rates of all the road sections; and
step S33: obtaining, by the NSP, C based on an optimization algorithm, and sending a road traffic status to a vehicle by using a navigation application.
A realtime urban traffic status monitoring system based on privacy-preserving compressive sensing includes data vehicles (DVs), two RSUs, two CPs, a TPS, and an NSP.
The DVs are configured to obtain vehicle traveling data.
The RSU is configured to preprocess the vehicle traveling data, and outsource preprocessed vehicle data to the two CPs respectively.
The CP is configured to store and further process the data.
The TPS is trusted by all parties and responsible for generating a required random number.
The NSP is configured to receive ciphertext from the CP, estimate a traffic status of a road section, and transmit an estimated traffic congestion rate to a vehicle.
Compared with the prior art, the present disclosure has the following beneficial effects:
The present disclosure can remarkably enhance a capability of protecting privacy of vehicle data, ensure rapid and accurate data processing, reduce energy consumed for urban traffic estimation, and shorten required traffic estimation time.
The present disclosure is described in more detail with reference to the accompanying drawings and embodiments.
Refer to
In this embodiment, a realtime urban traffic status monitoring method based on privacy-preserving compressive sensing includes the following steps:
Step S1: Divide vehicle data under privacy preserving into two parts, and send the two parts to the two different RSUs for preprocessing.
1) The vehicle data needs to be securely uploaded to the RSUs. It is set that there are n roads in a city, a total of m pieces of data are uploaded by all vehicles on a road section j within a time period T, and each piece of data is in a format of u=[t,v,lat,lon], where t is uploading time, v is an instantaneous speed of a vehicle, and lat and lon are a longitude and a latitude of the vehicle.
2) The system constructs the reported data as a vector Uj=[u(j,1), u(j2), . . . , u(j,m)]T and a matrix M=[U1, U2, . . . , Un]:
Then, the matrix M is divided into M′ and M″:
where i(i,j) represents the ith data set reported on the jth road section, and u(i,j)′ and u(i,j)″ represent two random components of u(i,j), namely, u(i,j)=u(i,j)′=u(i,j)″: Then, the two parts are sent to the different RSUs.
3) When receiving data U, the RSU processes the data, constructs a vector V=[v1, v2, v3, . . . vm] based on an instantaneous speed in each piece of data, and computes an average speed on the road section according to the following formula:
If it is set that a quantity of known road sections is w, the average speed is V⊥=[v(⊥,1), v(⊥,2), . . . v(⊥,w)]. Then, the RSU divides the average speed into two vectors V⊥′=[v(⊥,1)′, v(⊥,2)′, . . . v(⊥,w)′] and V⊥″=[v(⊥,1)″, v(⊥,2)″, . . . v(⊥,w)″]. Likewise, the longitude and the latitude are processed.
The following describes a computation process performed by the RSU.
4) Finally, the system outsources, to the CPs, the vectors N′ and N″ obtained by dividing N, in other words, sends the vector N′ to CPA and the vector N″ to CPB.
Step S2: The two different RSUs outsource preprocessed vehicle data to the two CPs respectively. A data encryption execution protocol is designed based on a finally expected operation result and interactive operation between the two CPs, to encrypt the data.
Step S3: The NSP receives encrypted data from the CPs, decrypts the received encrypted data, and estimates an urban traffic status by using a compressive sensing technology.
In this embodiment, the data encryption execution protocol is a data encryption protocol based on secure two-party computation, and includes a basic secure computation protocol and a matrix secure computation protocol.
Preferably, the basic secure computation protocol includes a secure multiplicative protocol, a secure division protocol, a secure cosine protocol, a secure sine protocol, an extended secure cosine protocol, a reverse exponential protocol, an extended multiplicative protocol, and a secure arccosine protocol. Specifically:
(1) Secure multiplicative protocol (Mul): It is assumed that CPA has a random number x∈+ and CPB has a random number Y∈+. Mul is intended to obtain a value of x·y without leaking an intermediate result.
The protocol is executed according to the following four steps:
1) The TPS generates random numbers ra∈+, rb∈+, ka∈+, and kb∈+, where ka+kb=ra·rb. Then, the TPS sends (ra, ka) to CPA through a secure communication channel to CPA, and (rb,kb) to CPB through a secure communication channel to CPB.
2) CPB performs computation according to y′y+rb, and sends y′ to CPA. Then, CPA performs computation according to x′=x+ra, and sends a computation result to CPB.
3) CPB generates a random number vb, performs computation according to t=x′·y+(kb−vb), and sends a computation result to CPA.
4) CPA performs computation according to va=t+ka−(ra·y′).
After the Mul protocol is executed, CPA obtains va and CPB obtains vb This process is represented as (va:vb)←Mul(x:y)
without leaking an intermediate result. The protocol is executed according to the following four steps:
1) The TPS generates random numbers ra∈+, rb∈+, ka∈+, and kb∈Q+, where ka+kb=ra·rb. Then, the TPS sends (ra,ka) to CPA through a secure communication channel to CPA, and (rb,kb) to CPB through a secure communication channel to CPB.
2) CPA performs computation according to x′=x+ra, and sends x′ to CPB. Then, CPB performs computation according to
and sends y′ to CPA.
3) CPB generates a random number vb, performs computation according to
and sends a computation result to CPA.
4) CPA performs computation according to va=t+ka−(ra·y′).
After the Div protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Div(x:y).
(3) Secure cosine protocol (Cos): It is assumed that CPA has a random number x∈+ and CPB has a random number y∈+. Cos is designed to obtain a value of va+vb=cos(x+y) without leaking an intermediate result.
The protocol is executed according to the following three steps:
1) CPA performs computation according to c1=cos(x) and s1=sin(x). CPB performs computation according to c2=cos(y) and s2=sin(y).
2) Use the Mul protocol to compute (va′+vb′)←Mul(cos(x):cos(y)) and (va″+vb″)←Mul(sin(x): sin(y)).
3) CPA performs computation according to va=va′−va″. CPB performs computation according to vb=vb′−Vb″.
After the Cos protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Cos(x:y).
(4) Secure sine protocol (Sin): It is assumed that CPA has a random number x−+ and CPB has a random number y∈+. Sin is designed to obtain a value of va+vb=sin(x+y) without leaking an intermediate result.
The protocol is executed according to the following three steps:
1) CPA performs computation according to x′=sin(x) and x″=cos(x). CPB performs computation according to y′=sin(y) and y″=cos(y).
2) Use the Mul protocol to compute (va′:vb′)←Mul(x′:y′) and (va″:vb″)←Mul(x″:y″).
3) CPA performs computation according to va=va′+va″. CPB performs computation according to vb=vb′+vb″.
After the sin protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Sin(x:y).
(5) Extended secure cosine protocol (Ecos): It is assumed that CPA has (x1, y1) and CPB has (x2,y2). Ecos is designed to obtain a value of va+vb=cos(x1+x2)cos(y1+y2) without leaking an intermediate result.
The protocol is executed according to the following two steps:
1) Execute (va′,vb′)←Cos(x1:x2), (va″, vb″)←Cos(y1:y2), (va
2) CPA performs computation according to va=va′·va+va
After the Ecos protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Ecos((x1,y1):(x2,y2)).
(6) Reverse exponential protocol (Rexp): It is assumed that CPA has a random number x∈+ and CPB has a random number y∈+. Rexp is designed to obtain a value of va+vb=e−(x+y) without leaking an intermediate result.
The protocol is executed according to the following four steps:
1) The TPS generates random numbers ra∈+, rb∈+, ka∈+, and kb∈+, where ka+kb=ra·rb. Then, the TPS sends (ra,ka) to CPA through a secure communication channel to CPA, and (rb,kb) to CPB through a secure communication channel to CPB.
2) CPB performs computation according to y′=e−y+rb, and sends y′ to CPA. Then, CPA performs computation according to x′=e−x+ra, and sends x′ to CPB.
3) CPB generates a random number vb, performs computation according to t=x′·e−y+kb−vb, and sends a computation result to CPA.
4) CPA performs computation according to va=t+ka−ra·y′.
After the Rexp protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Rexp(x:y).
(7) Extended multiplicative protocol (Emul): It is assumed that CPA has random numbers x1∈+ and y1 ∈+, and CPB has random numbers x2∈+ and y2∈+. Emul is designed to obtain a value of va+vb=(x1+x2)·(y1+y2) without leaking an intermediate result.
The protocol is executed according to the following two steps:
1) Execute (va′:vb′)←Mul) and (va″:vb″)←Mul(y1: x2) and then send (va′,va″) to CPA and (vb′,vb″) to CPB.
2) CPA performs computation according to va=x1·y1+va′+va″. CPB performs computation according to vb=x2·y2+vb′±vb″.
After the Emul protocol is executed, CPA obtains va and CPB obtains vb. This process is represented as (va:vb)←Emul((x1,y1):(x2,y2)).
(8) Secure arccosine protocol (Arccos): It is assumed that CPA has a random number x∈+ and CPB has a random number y∈+. Emul is designed to obtain a value of va+vb=arccos(x+y) without leaking an intermediate result. Based on a series expansion, an arccosine value can be computed according to the following formula:
In fact, a McLaughlin expansion may be used, and an approximate arccosine value is selected, so that
where χ is a part of the McLaughlin series expansion, and is described as follows:
Therefore, x may be used to replace X+y to obtain the approximate arccosine value. Then, the Binomial Theorem is used to compute a value of (x+Y)n.
(x+y)n=Cn0xny0+Cn1xn-1y1+ . . . +Cnnx0yn
Herein, the Mul protocol may be invoked to compute the arccosine value. Therefore, the Arccos protocol may be expressed as (va:vb)←Arccos(x+y).
Preferably, in this embodiment, the matrix secure computation protocol specifically refers to extended secure computation protocols Mmul and Mdiv that are designed by means of hadamard product, where Mmul and Mdiv are respectively obtained by replacing random numbers in Mul and Div with matrices.
In this embodiment, after CPA and CPB respectively receive the vectors N′ and N″, CPA and CPB further need to perform interactive computation to obtain a traffic congestion rate C and a spatio-temporal correlation matrix G.
A traffic congestion rate set C of a known road section is computed as follows:
First, when intermediate values in an interactive computation process performed by CPA and CPB are not leaked, a formula
is used for computation. Next, CPA and CPB generate vectors S′=[1/2, 1/2, . . . , 1/2]1×w and S″=[1/2, 1/2, . . . , 1/2]1×w, and use the vectors for subsequent computation, where w is a quantity of known road sections. Finally, CPA and CPB perform interactive computation and execute (C′,C″)←Mdiv((V⊥′,S′):(V⊥″,S″)). CPA obtains a result C′, and CPB obtains a result C″.
The following describes in detail how to compute the traffic congestion rates C′ and C″.
Next, the spatio-temporal correlation matrix G is computed. First, a Euclidean distance between the CPs is computed. It is assumed that there are two GPS points a=(lata,lona) and b=(latb,lonb). A Euclidean distance L may be computed according to the following formula:
L=R·arccos(L1+L2)
In the foregoing formula, L1 cos(lata)·cos(latb)·cos(lona-lonb) and L2=sin(lata)·sin(latb). The two GPS points a and b each are divided into two parts. For example, a is divided into a′=(lat′a,lona′) and a″=(lata″,lona″) and b is divided into b′=(latb′, lonb′) and b″=(latb″,lonb″). a′ and b′ exist at the same time, and a″ and b″ also exist at the same time. We design a secure distance computation protocol Dis for computing the Euclidean distance between a and b under privacy preserving, in other words, (l1:l2)←Dis(a:b). L is specifically computed according to the following steps:
1) Execute (va
2) CPA performs computation according to va
3) Compute (va
4) According to the protocol (va:vb)←Arcos(va
For an entire urban road network, the system selects, from N′ and N″, elements reflecting spatial-temporal features, and constructs matrices ′=[[lat(⊥,1)′,lon(⊥,1)′], . . . , [lat(⊥,n)′,lon(⊥,n)′]] and ″=[[lat(⊥,1)″,lon(⊥,1)″], . . . , [lat(⊥,n)″,lon(⊥,n)″]] that include only GPS information. Then, the Mdis protocol is used to compute latitude and longitude distances between any two GPS points, in other words, a and b in Dis are replaced by ′ and ″ to obtain (D′:D″)←Mdis(′:″), where D=D′+D″. Assuming that there are n road sections in a city, dimensions of D are n×n.
The following describes in detail how to compute D.
= [[lat(⊥,1)″, lon(⊥,1)″], . . . , [lat(⊥,n)″, lon(⊥,n)″]]
Preferably, in this embodiment, traffic monitoring of a few road sections is implemented by using the compressive sensing technology. A Gaussian kernel function
is introduced into our mechanism, where d is defined as a Euclidean distance between any two GPS points, and σ is a correlation coefficient used to determine impact between two GPS points. A value of σ may be obtained by training data based on the Bayesian framework. The Gaussian kernel function
is used to generate a correlation matrix G, where G is in the following form:
CPA has the matrix D′, and CPB has the matrix D″, where D=D′+D″. D is computed according to the following steps:
1) Execute (La′:Lb′)←Mmul(D′:D″).
2) CPA performs computation according to
3) CPB performs computation according to
4) Execute (La:Lb)←Rexp(W1:W2).
After the foregoing protocol is executed, an orthogonal feature vector basis can be generated by G that is generated by using the Gaussian kernel function. According to a formula G=ΨΛΨ−1, G is diagonalized into three parts: Ψ, Λ, and Ψ−1. Herein, Ψ indicates a compressive sensing spare representation matrix, and is randomly divided into Ψ1 and Ψ2. Then, Ψ1 and Ψ2 are sent to CPA and CPB respectively. Then, Ψ1 and Ψ2 may be used by the NSP to estimate a traffic congestion rate of an unknown road section. A complicated cloud computing process is shown in
In this embodiment, the step S3 specifically includes the following substeps:
Step S31: Complete a setting that there are n road sections in a city, and compute traffic congestion rates of w road sections within a time interval T′ (w<<n), where CPA has a vector C′=[c1′, c2′, . . . , cw′] and CPB has a vector C″=[c1″,c2″, . . . ,cw″].
Step S32: The TPS generates a random measurement matrix Φ∈, and the NSP receives component vectors C′, C″, Φ, Ψ1, and Ψ2, to obtain a formula C′+C″=Φ×(Ψ1+Ψ2)×C, where C includes the traffic congestion rates of all the road sections.
Step S33: The NSP obtains C based on an optimization algorithm, and sends a road traffic status to a vehicle by using a navigation application.
The afore-mentioned are only preferred embodiments of the present disclosure, and all equivalent changes and modifications made in accordance with the claims of the present disclosure shall fall within the scope of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202010649250.3 | Jul 2020 | CN | national |
Number | Name | Date | Kind |
---|---|---|---|
10545247 | Zheng | Jan 2020 | B2 |
20130151088 | Ricci | Jun 2013 | A1 |
20160189544 | Ricci | Jun 2016 | A1 |
Number | Date | Country | |
---|---|---|---|
20220014506 A1 | Jan 2022 | US |