This application claims priority to foreign French patent application No. FR 2307321, filed on Jul. 7, 2023, the disclosure of which is incorporated by reference in its entirety.
The invention relates to the field of modeling the dispersion of air pollution, in particular in an urban environment.
It would be advantageous to be able to identify (locate and estimate the shape and amplitude of) sources of air pollution in urban areas, and in particular in the urban canopy, based on a set of localized measurements.
Thus, the problem addressed is that of identifying sources of pollution in urban areas by means of a network of low-cost sensors. Such sources emit pollutants (gases, particles) that are dispersed by the wind. The complex geometry of urban buildings generates a wind field that is non-uniform on a district scale.
Given a choice of model and a set of noisy measurements (sensor network or mobile sensors), it is a question of solving an inverse problem to estimate the position and characteristics of the sources. The problem is one of robust fusion of spatio-temporal data in the presence of spatial constraints and measurement uncertainties.
Known reliable methods are methods based on physical models in which one or more unknown input terms applied to partial differential equations (PDEs) are sought by minimizing the error between the measurements and the output of a mathematical model representing the physical system, said model therefore being sensitive to unknown inputs that must be estimated.
As regards identification of sources of pollution, a number of methods have been applied in the prior art, such as Kalman filtering and its variants (Defforge, 2019; Mons et al., 2017), the calculus of variations (Kumar et al., 2015) sometimes associated with the use of reduced models (Khodayi-mehr, 2019) or even particle filters (Septier et al., 2020).
These methods have the drawback of being computationally expensive, or, in the context of use of computationally less expensive reduced models, of requiring a training database to be created for each dispersion scenario.
In addition, air pollution is a three-dimensional phenomenon that to be modeled requires fine and complex computational spaces.
The source-term estimation approach employed in the present invention uses an LS-RBF-FD approach (LS-RBF-FD standing for Least-Squares Radial-Basis-Function Finite Differences) to create an accurate numerical model of pollutant dispersion. Modelling based on LS-RBF-FD has recently been described in the literature (Tominec et al., 2021). Similar to the RBF-FD approach, which has been known for about 10 years (Flyer et al., 2012), LS-RBF-FD shares the rapidity of RBF-FD but displays greater robustness in the computation of solutions, in particular when it is a question of computing three-dimensional field variables.
The LS-RBF-FD approach is used in one embodiment on a pollutant-transport equation called the advection-diffusion equation and denoted ADPDE. This equation models transport of pollutants considered not to react and governs a scalar concentration field u: x∈Ω=Ω°∪∂Ω, t∈Tu(x, t). This equation is written
with t∈T time, x∈Ω⊂3 position, Ui, Ki the components of the velocities and diffusivities of the mean wind and diffusion fields, u pollutant concentration, s a pollution source term.
Ω represents the spatial domain where the advection-diffusion differential equation is applied while ∂Ω=ΓD∪ΓN represents the edges of the domain where boundary conditions are applied. The situation of interest is one where ΓD is the edge domain where Dirichlet conditions apply and ΓN the edge domain where no-flux conditions apply.
The weather, namely the wind field U and diffusion field K, is obtained using a third-party model.
Advection/diffusion equations are difficult to model in an urban environment with spatial constraints and are computationally expensive. It is necessary to provide a solution allowing computation to be made accessible.
The ADPDE is solved in a steady state. In addition, the physical model may be reformulated in a condensed manner by expressing the differentiation operators applied to c as a generic differential operator in Ω° and where the boundary conditions apply, namely ∂Ω=ΓD∪ΓN such that,
In the ADPDE, s is the source term and g=0.
RBFs are known to satisfactorily interpolate spaces (2D, 3D and 4D spaces in particular).
Any function f: y∈Ωf(y)∈f(Ω)⊂d (d being the dimension of the space) can be reconstructed by a number of (RBF) basis functions ϕ centered on nodes {xi}i=1N
Solving the inverse problem is considered equivalent to execution of the workflow with methods based on models.
A vector of unknowns θ is the variable to be estimated, and is initialized to the value θ0. For example in the above advection-diffusion equation, Equation (1), θ may correspond to the whole source vector (for example θ=
The measurements um sample the field variable u, for example the concentration field of the advection-diffusion equation. They are generally noisy, and the algorithm implementing the inverse method may be provided with a noise model if the noise of the sensors is characterized.
The physical constraint model (dispersion model) m applies to the unknowns θ input into the model, and hence ue=m(θ). Thus, the computed field variable ue is of the same nature as um. A measurement model h is applied to match the measurements um (in particular their positions) with m(θ) to compute a (generally quadratic) deviation of the type r2=[um−h(m(θ))]2 that allows an error term to be expressed.
Variational or Bayesian methods optimize this error term to return an estimate {circumflex over (θ)}, optionally with ranges of uncertainties. Estimation generally requires the physical model m to be called upon several times, this being computationally expensive.
The result of the workflow is estimated parameters and an uncertainty if the method is Bayesian.
The inverse problem addressed is reconstruction of the discretized source term
Radial basis functions, their finite-difference variants and their least-squares finite-difference variants (LS-RBF-FD) are known. LS-RBF-FD is used here to model air pollution and to solve the inverse problem of pollution-source estimation, in combination with a model reduction.
The article (Khodayi-mehr, 2019) uses reduced models for source identification. The model reduction is obtained by principal component analysis (PCA). To construct these models, the authors created several steady-state dispersion scenarios by varying only the position of a pollution source. To solve their inverse problem, they used an adjoint variational method that makes multiple calls to the reduced model trained by PCA on two cases of steady-state two-dimensional studies. However, heightwise transport is non-negligible. It is thus preferable to use three-dimensional transport models. In addition, PCA requires training data. If the physical domain changes or if the weather or the shape of the pollutant sources changes, a new database needs to be built.
The article (Liu et al., 2019) addresses modeling air pollution using RBF-FD not solved either three-dimensionally or by least squares. The authors use a two-dimensional RBF-FD non-steady-state physical model of pollutant transport. They computed the weather and reused the mesh as computation points in their model. The computational domain contains obstacles of rectangular shape. A complex urban geometry would require the inner courtyards of buildings to be handled, which it would be necessary to close beforehand as not doing so would run the risk of generating model indeterminacy. In addition, heightwise transport plays a non-negligible role in dispersion.
The article (Tominec et al., 2021) uses the LS-RBF-FD technique. The LS-RBF-FD method is a generalization of RBF-FD. It is based on two sets of computation points X and Y of Nx and Ny points.
Y is anchored by physics and contains the boundary-condition constraints of the geometric shapes. X is less dense than Y and covers the physical domain without needing to be confined by the edges of the domain (X may have points beyond the domain, or even solely inside the domain). Thus, as with RBF-FD, it is possible to construct a differentiation matrix D that discreetly models by RBF-FD the operators and defined inside Ω, Ω° and its edges ∂Ω so that the solution of the system comparable to (2) may be expressed by:
The article (Tominec et al., 2021) uses uniform-diffusion equations with hybrid (Dirichlet, Neumann) boundary conditions. No three-dimensional solution is disclosed or discussed. Modelling of air pollution is not mentioned. Inverse problems are not addressed.
The principle of RBF and in particular RBF-FD is presented, inter alia, in (Barnett, 2015) p. 2-13.
RBFs have spatial interpolation properties. Let a finite set Y⊂Ω that partitions Ω and allows the variations of y in Equation (3) to be represented by yj∈Y={yj}j=1N
A set Y of evaluation nodes made up of two subsets of nodes YΩ° and Y∂Ω corresponding to nodes inside the domain, i.e. in YΩ°, and those present on the edge, Y∂Ω, respectively. Y is generated in a manner similar to X. The following notion is used: Ny
The idea in this method is to approximate the constraints imposed on a variable u governed by a partial differential equation at a point in space y=yk∈Y. Basis centers/PDE points (X) neighboring the evaluation point are used to express these constraints. This set of neighboring points in X form a set of points centered on yx∈Y. This set is called Sy
In the above equation, wi and λi are weighting coefficients of the approximation and the pi are monomials used to regularize the approximation to impose Σi=1nwipj(xi)=0 ∀j∈{1, . . . , m}. The local interpolating function of u is denoted ua. The vector of the ua(xiy
The vectors
In a similar manner, it is possible to locally interpolate a differential operator on u at a point yk∈Y such that,
Equation 11 can be reformulated in the manner of Equation 8:
By introducing (10) into (12) the following is obtained,
By positing, [Eq. 14] (,)=((Φ)(yk))T, (P)T(yk))A−1, the following is obtained after removing the polynomial terms, which cancel each other out,
The above is equivalent to ua(yk)=(ua(x0y
To express the finite differences applied to the operator u(y)|y=y
It is thus possible to approximate each differential operator u(yk)∀yk∈Y as a weighted sum of n neighbouring values of u defined in X. It is possible to perform this approximation on other differential operators such as that of the boundary conditions. Thus it is possible to create a discrete differentiation matrix D of Ny×Nx size that for any yx∈Y expresses the differentiation weights that apply to it. The weights are stored row by row such that ∀yk∈Y,
Zeroes (0's) are added to the weight vectors of D for nodes of X not forming part of the stencil Sy
The invention pertains to modeling, in particular inverse modeling of urban air pollution, and uses an LS-RBF-FD approach to modelling. LS-RBF-FD is used to express the direct air-pollution model and a least-squares solver is used to obtain the solution, in particular to advection-diffusion transport equations, the RBF-FD method in contrast merely achieving collocation.
The invention achieves model reduction without requiring a training method, through selection of a small X solution space, this allowing computation to be sped up and being useful for inversion.
One advantage of this method is the reduction in the size of the unknown that is the source vector: the problem is thus better stated.
Generally, RBFs make it possible to project the sought variables into spaces of small dimensions.
The invention also consists in an original and effective adjoint quadratic solution. It is effectively implemented on a simple case study.
The invention relates to unprecedented modeling of transport of atmospheric pollutants with the LS-RBF-FD method. This method is fast and has an inherent ability to effect model reduction, this paving the way to discovery of the source.
Thus, in order to improve existing techniques for searching for sources, a method is provided for determining spatial coordinates of a source causing a dispersion effect in a domain, the dispersion effect being approximated by a system of partial differential equations, the method comprising computer-implemented steps,
The domain is typically part of an urban canopy, i.e. a district of a city or a peri-urban area, the atmosphere of which is being modeled, up to about 50 m above the ground, for example. The diffusion source may be a stationary car emitting exhaust gases, for example.
The method is special because the system of equations is expressed in terms of finite differences generated by radial basis functions, the system of equations being solved in a least-squares context, using a least-squares solver, the system of equations having been subject to adjoint quadratic solution with radial basis functions chosen from polyharmonic-spline functions, inverse-multiquadric functions, or Gaussian functions, with regularization polynomials of order 1 or higher.
Advantage is taken of the fact that discretization is oversampled in the LS-RBF-FD method and hence it allows a solution to a least-squares problem.
The invention is based on the distinction between two sets of computation points X, Y, one of which has a cardinal greater than the other by a factor of at least 3.
The method used guarantees digital stability in the processor that conducts the computations.
By virtue of this method based on LS-RBF-FD, it is possible to employ an approximation space (X) that is not very fine, while obtaining quality solutions. This choice allows the size of the unknown (the values of the sought source vector) to be decreased, and the computations to be sped up.
The proposed method cleverly makes use of a constraint relaxation effect, and has more evaluation points than unknowns. What is sought is a source term
According to advantageous and optional features
The invention will be better understood and other advantages will become apparent on reading the following non-limiting description with reference to the appended figures, in which:
The LS-RBF-FD approach employs two distinct sets of computations such that Y≠X and card(Y)/card(X)≥3. The value of 3 or more is a threshold that has proved reasonable.
The technique then consists in solving a least-squares problem in the case of the presented steady-state ADPDE,
the, agglomerated source term defined in the coordinates of Y comprising the zero Neumann and Dirichlet boundary conditions and the source term such that
The least-squares formulation of Equation 18 has a number of advantages:
As a result of the way in which D is defined, with
it is possible to extract from D two sub-matrices, DΩ° and D∂Ω, that stack the L(yk) for all the yk∈YΩ° and B(yk) for all the yk∈Y∂Ω, respectively.
Equation (18) admits a single solution when D is of full rank with [Eq. 20]
Thus, a direct Eulerian model of pollution has been disclosed that allows a dispersion of pollutants to be computed for a given input topology and heterogeneous weather data. The computation is carried out using an LS-RBF-FD approach. The solution to the steady-state problem of Equation 1 is computed efficiently by a least-squares solver.
The inverse problem consists in simultaneously estimating the source term
The measurement operator denoted C is similarly defined as an operator of interpolation from the field space X to the measurement space, such that
[Eq. 22]
For the purpose of estimating the source term, the following least-squares problem is solved—it is a question of the regularized minimization of the least-squares error between the output of the LS-RBF-FD model and the measurements—
The optimal solution (
To provide confirmation of this, the Lagrangian of Equation 23,
is introduced. Lagrangian theory makes it possible to deduce the necessary conditions of optimality with:
Furthermore, by introducing the expression for
The invertibility of the matrix E is guaranteed if and only if the differentiation matrix D in Equation 18 is of full rank. This condition may be met by choosing a suitable cardinality for X and Y. The linear system of Equation 24 of 2Nx×2Nx size is solved without explicit matrix inversion to speed up the computations for large domains, in particular through use of a LSQR least-squares solver.
In the main embodiment, these input field data are weather data, but in an embodiment pertaining to diffusion of heat through a material, they may be density in the material, electrical charge, or even salinity.
The geometric information 10, the model 20 based on continuous partial differential equations, and the input field data 30 are raw input data of the model.
The geometric information 10 is processed to obtain the boundary conditions of the domain, in a step 40. Processed input data are thus obtained, by creating a computational domain including the boundary conditions.
In a step 50, a discretization of the PDE model 20 is performed, with a view to discretizing it by means of LS-RBF-FD radial basis functions. This is done taking into account both the computational domain established in step 40, and the input field data 30. Discretized physical operators are obtained.
The method naturally comprises obtaining measurements 60.
Next, on the basis of the measurements 60 and of the discretized physical operators obtained in step 40, the inverse model 70 is implemented, this including iterative solution based on explicitation, in the processor and associated memory, of the adjoint equation of the discretized operators by matrix inversion, the radial basis functions having been chosen in order to make this inversion possible. The inverse model thus comprises solving a least-squares problem for the purposes of the iterative solution: a value representative of an error between the estimations made by the model and the measurements is minimized.
At the end of the inverse model 70, the coordinates—i.e. the positions—of the sources (or of the source, in the case of a single source) and optionally estimated emission rates are obtained. The intensities (often concentrations) at the various points of the domain are also optionally obtained. These are the results 80.
PHS5 radial basis functions were used, according to the equation
Regularization polynomials of order 2 were used. The direct LS-RBF-FD model of Equation 21 was used with 15000 evaluation nodes to ensure a good resolution. The set X contained many fewer nodes with Nx=2060, this naturally leading to a model reduction allowing the size of the vectors of unknowns
Weather data was used as input information for the computations. The mean wind fields were generated using the software package Parallel-Micro-SWIFT-SPRAY (PMSS), its Eulerian quasi-steady state wind-field model Micro-SWIFT-M, which solves Reynolds-averaged Navier-Stokes equations, being used to compute therefrom a mean wind field U and a diffusion K that expresses turbulent wind fluctuations that may be likened, over long time scales, to a diffusive effect. Weather data computed by models other than Micro-SWIFT-M could in principle be suitable for the LS-RBF-FD model presented above.
The measurements
[Eq. 29]
[Eq. 30] s: xas exp((x−xs)/(√{square root over (2)}σs))2, x∈Ω° with as, xs, σs=1, [0.42Lx, 0.39Ly, 0.02Lz] and [8 m, 8 m, 8 m]. A 2D cross section of the source term is shown in
Two inversion experiments were carried out with additive noise using 1500 and 200 measurements distributed throughout the three-dimensional computational domain.
However, the field resulting from an estimation of the source term based on 1500 measurements (part B of
A number of parameters may have an impact on the quality of an estimation of the source term. Among them, the number of nodes in X and Y and the weight of the regularization in the ratio b/r have an impact on the estimated pattern of the source and its order of magnitude.
The computation time is reasonably short. Once the operator matrix D had been computed, the estimation of the source and of its resulting field were obtained in about 4 minutes in this example with an Intel Xeon E-2276M 6-core processor and 32 Gb of RAM executing a still poorly optimized Python program.
In
Thus, in the left-hand part of
Thus, the left-hand part of
The adjoint method based on RBF-FD models therefore works with noisy measurements, obtained from a very fine LS-RBF-FD model, and produces estimations of source terms using 2060 unknowns when provided with 1500 or 200 measurements scattered over the domain.
In the case where the simulated measurements are provided by PMSS, i.e. a model different from the LS-RBF-FD model that was professionally developed and that has been validated in a wind tunnel (Moussafir et al., 2004), and where the measurements are located at an altitude of about 2 meters, the source is located very rapidly and very accurately with the inverse method presented above.
The inversion procedure may incorporate Kalman filtering, which then allows the estimation to be updated by adding measurements without having to repeat a least-squares computation on a whole batch of measurements. An ensemble Kalman filter may be used and makes it possible not to have to explicitly compute inverses. This is particularly useful when the transient regime is of interest. Ensemble Kalman filters are suitable for problems of large dimensions.
According to yet another variant, an iterative adjoint procedure is carried out: this makes it possible to iterate toward the solution without having to store dense matrices in memory, and is therefore suitable for large domains.
According to yet another variant, a computationally efficient implicit time advancement scheme employing a physical solver is used to make the concentration states vary over time:
The transient physical solver lays the foundation for the prospect of transient identification. It is formulated as follows: the non-steady-state problem creates a minimization problem similar to the steady-state problem where the computed solution, ūa it will be recalled, satisfies the minimization problem,
In the non-steady-state problem, the variation as a function of time in the field variable u is added to the minimization constraint.
To advance in time in a discrete computation scheme, a discrete derivative is expressed at a following time t+1 via an implicit scheme, the non-steady-state problem being expressed as
with D0 an RBF-FD operator with the derivative order 0 (interpolator) of YΩ° in X when it acts on nodes yk∈YΩ° and 0 otherwise. Hence,
where
Thus,
is solved, with
Therefore, the following is solved
This expression is solvable with LSQR or L-BFGS-B, conjugate gradients or other solvers.
Inverse modeling based on LS-RBF-FD is applicable to any search for unknowns, source terms or even field variables in partial differential equations. Thus, it may be employed to identify (locate) hot spots in batteries, or to estimate a source in a CBRN-E context (for example biological/chemical contamination in a water network, an induced leak, etc.).
In the case of pollution, the method is a useful way of identifying sources of pollution for high-risk services (firefighters, army, etc.).
The invention may also be used to locate acoustic sources in complex environments.
Number | Date | Country | Kind |
---|---|---|---|
2307321 | Jul 2023 | FR | national |