Method and apparatus for SAR image data enhancement, and storage medium

Information

  • Patent Grant
  • 11428803
  • Patent Number
    11,428,803
  • Date Filed
    Friday, August 2, 2019
  • Date Issued
    Tuesday, August 30, 2022
Abstract
Disclosed are a method and apparatus for SAR image data enhancement, and a storage medium. The method includes: processing an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image; and processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image.
Description
CROSS REFERENCE TO RELATED APPLICATION

The disclosure relates to the field of image data processing technologies, and in particular to a method and apparatus for synthetic aperture radar (SAR) image data enhancement, and a storage medium.


FIELD

The disclosure relates to the field of image data processing technologies, and in particular to a method and apparatus for SAR image data enhancement, and a storage medium.


BACKGROUND

At present, with the progress of the construction of Guangdong-Hong Kong-Macao Greater Bay Area, application demands on SAR images in such safety detection fields as detection of remote sensing aviation aircraft, dynamic monitoring of ship targets and dynamic detection of oil spill pre-warning are increasing. Large-scene SAR target recognition in the Guangdong-Hong Kong-Macao Greater Bay Area relies on a large number of labeled samples to construct a classification model, which is limited by the spatial range and regional accessibility and has a high cost. Meanwhile, the cost of annotating data limits the construction of a large-scene SAR database in the Greater Bay Area. Therefore, it is urgent to solve the problems of insufficient labeled training samples and insufficient data richness. The traditional method uses contrast enhancement in spatial and frequency domains of images for data enhancement. However, a sample set acquired by this method has high redundancy, and some details of the SAR target image generated are missing, thus failing to accurately reflect information contained in SAR images, which is not conducive to large-scene SAR image target recognition in the Greater Bay Area.


SUMMARY

To solve the foregoing technical problems, an objective of the disclosure is to provide a method and apparatus for SAR image data enhancement, and a storage medium, for alleviating the problems of inadequate SAR target image samples and insufficient data richness by generating a large number of training samples in combination with SAR electromagnetic simulation images and a generative adversarial network.


The technical solution adopted by the disclosure to solve the problem thereof is as follows.


In a first aspect, the disclosure provides a method for SAR image data enhancement, including the following steps of:


processing an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image; and


processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image.


In some embodiments, processing an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image includes:


constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;


processing the electromagnetic simulation model by electromagnetic simulation software to obtain radar cross section (RCS) data of the SAR target image; and


obtaining the SAR electromagnetic simulation image by inverse imaging processing on the RCS data.


In some embodiments, processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image includes:


constructing a generator and a discriminator in the generative adversarial network;


inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;


inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; and


outputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.


In some embodiments, the generative adversarial network is GraphGAN.


In some embodiments, the generator adopts a convolutional neural network structure.


In a second aspect, the disclosure provides an SAR image data enhancement apparatus, including:


at least one control processor, and


a memory communicating with the at least one control processor and having instructions stored thereon, and the instructions being executable by the at least one control processor to enable the at least one control processor to perform the method for SAR image data enhancement described in the first aspect.


In a third aspect, the disclosure provides a computer readable storage medium having computer executable instructions stored thereon, wherein the instructions, when executed by a computer, cause the computer to perform the method for SAR image data enhancement described in the first aspect.


The disclosure has the following beneficial effects. In the disclosure, an SAR target image is processed by electromagnetic simulation to acquire a multi-azimuth SAR electromagnetic simulation image, which overcomes the shortage of SAR target image samples in traditional acquisition, and provides sufficient data input for subsequently solving the problem of scarce training data in the process of deep learning training. A mapping relationship between the SAR electromagnetic simulation image and the SAR target image is learned by a generative adversarial network, which improves data richness of the SAR electromagnetic simulation image, thus expanding the SAR target images that lack certain angles and providing strong support for the subsequent recognition and detection of the SAR target image.





BRIEF DESCRIPTION OF THE DRAWINGS

The disclosure is further described below with reference to the accompanying drawings and examples.



FIG. 1 is a brief schematic flowchart of a method for SAR image data enhancement according to the disclosure;



FIG. 2 is a brief schematic flowchart of processing an SAR target image by electromagnetic simulation according to the disclosure;



FIG. 3 is a brief schematic flowchart of processing an SAR electromagnetic simulation image and the SAR target image by a generative adversarial network according to the disclosure;



FIG. 4 is a schematic flowchart of a method for SAR image data enhancement according to the disclosure;



FIG. 5 is a schematic diagram of an RCS reconstructed geometry according to the disclosure;



FIG. 6 is a schematic diagram of an electromagnetic simulation imaging geometry according to the disclosure; and



FIG. 7 is a schematic diagram of a connection relationship between modules for processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network according to the disclosure.





DETAILED DESCRIPTION OF THE EMBODIMENTS

The disclosure provides a method and apparatus for SAR image data enhancement, and a storage medium, which can solve the problems of traditional inadequate acquisition of SAR target image samples and insufficient data richness, thus providing strong support for the subsequent recognition and detection of SAR target images.


Embodiments of the disclosure are further set forth below with reference to the accompanying drawings.


Referring to FIG. 1, an embodiment of the disclosure provides a method for SAR image data enhancement, including the following steps of:


step S100: processing an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image; and


step S200: processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image.


In this embodiment, in step S100, an SAR electromagnetic simulation image at a multi-azimuth angle of 0-360 degrees can be obtained by processing an SAR target image by electromagnetic simulation, which solves the problem of traditional inadequate acquisition of SAR target image samples, and has a low acquisition cost and few restrictions, thus providing sufficient data input for subsequently solving the problem of scarce training data in the process of deep learning training.


In step S200, a mapping relationship between the SAR electromagnetic simulation image and the SAR target image is learned by a generative adversarial network to obtain a set of virtual samples of the SAR target image, which improves data richness of the SAR electromagnetic simulation image, thereby expanding the SAR target image lacking angle richness and providing strong support for the subsequent recognition and detection of the SAR target image.


Further, referring to FIG. 2, another embodiment of the disclosure further provides a method for SAR image data enhancement, wherein processing an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image includes:


step S110: constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;


step S120: processing the electromagnetic simulation model by electromagnetic simulation software to obtain RCS data of the SAR target image; and


step S130: obtaining the SAR electromagnetic simulation image by inverse imaging processing on the RCS data.


In this embodiment, in step S110, a target scene is modeled based on real data of an SAR target image, and simulation parameters are set for the real data (mainly including resolution, incidence angle, carrier frequency band, etc.) to construct an electromagnetic simulation model. The electromagnetic model is established to mainly include point targets and ground targets. Modeling and simulation of point targets are mainly to verify the inverse imaging performance of RCS data and a relative positional relationship of the targets. Simulation of the ground targets mainly provides data input for subsequent deep learning. In electromagnetic simulation of high-frequency radar targets, corner reflectors or small-sized metal balls are often used to simulate point-target imaging, so the small-sized metal balls are used in electromagnetic modeling of the point targets. For the electromagnetic modeling process of the ground targets, in the embodiment of the disclosure, a 3D model is designed based on 3D Studio Max software and imported into a CST simulation software platform for experiments. Further adjustment on the grid of the imported model can ensure the simulation speed and a better effect. In addition, in the embodiment of the disclosure, the construction of the electromagnetic simulation model for the SAR target image has few restrictions and a low acquisition cost.


In step S120, simulation parameters of the constructed electromagnetic simulation model are set in CST simulation software, and RCS data, i.e., a scattering coefficient radar cross-sectional area, of the SAR target image corresponding to incident-angle azimuth is obtained with an algorithm A by simulation. The entire process of calculating the scattering coefficient radar cross-sectional area of the SAR target image is implemented by the CST simulation software. The scattering coefficient radar cross-sectional area is a converted area of a target, which is used to measure the intensity of echo generated by the target under the irradiation of radar waves. To determine a scattering coefficient radar cross-sectional area, the radar energy reflected by a target towards an observer is measured or calculated at first, then the size of a reflecting sphere (an optical equivalent can be a spherical mirror) that can return the same radar energy is calculated, and a projected area of the sphere (i.e. the area of a circle) is the scattering coefficient radar cross-sectional area of the target.
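
The equivalent-sphere definition above can be illustrated numerically. The following is a minimal sketch, assuming a perfectly conducting sphere that is large compared with the wavelength, so that its RCS equals its projected area πr². The function names are hypothetical and are not part of the CST simulation software.

```python
import math


def sphere_rcs(radius_m: float) -> float:
    """RCS (m^2) of a large conducting sphere: its projected circular area."""
    return math.pi * radius_m ** 2


def equivalent_sphere_radius(rcs_m2: float) -> float:
    """Radius (m) of the sphere whose projected area equals the given RCS."""
    return math.sqrt(rcs_m2 / math.pi)


def to_dbsm(rcs_m2: float) -> float:
    """Express an RCS in decibels relative to one square metre."""
    return 10.0 * math.log10(rcs_m2)


if __name__ == "__main__":
    rcs = sphere_rcs(0.5)  # hypothetical 0.5 m radius sphere
    print(rcs, equivalent_sphere_radius(rcs), to_dbsm(rcs))
```

A target whose simulated RCS is σ therefore returns the same energy as a sphere of radius √(σ/π) under this assumption.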


In step S130, since the process of calculation of the RCS data of the SAR target image by electromagnetic simulation is similar to that of radar acquisition of echo signals in rotation target imaging, inverse imaging of the RCS data can be achieved based on the rotation imaging principle. The imaging principle will be briefly introduced below.


The reconstructed geometry is as shown in FIG. 5, where (x,y) is a target coordinate system and (u,v) is a radar coordinate system. In the simulation process, the target and the radar move relatively, and a rotation angle is θm.


According to the coordinate transformation formula, a coordinate system transformation formula from x-y to u-v is:

u=x cos θ+y sin θ
v=−x sin θ+y cos θ


A coordinate system transformation formula from u-v to x-y is:

x=u cos θ−v sin θ
y=u sin θ+v cos θ
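
The two transformations above are inverses of each other. A minimal sketch that checks this round trip is given below; the function names are hypothetical and θ is assumed to be in radians.

```python
import math


def target_to_radar(x: float, y: float, theta: float) -> tuple:
    """Map target coordinates (x, y) to radar coordinates (u, v)."""
    u = x * math.cos(theta) + y * math.sin(theta)
    v = -x * math.sin(theta) + y * math.cos(theta)
    return u, v


def radar_to_target(u: float, v: float, theta: float) -> tuple:
    """Map radar coordinates (u, v) back to target coordinates (x, y)."""
    x = u * math.cos(theta) - v * math.sin(theta)
    y = u * math.sin(theta) + v * math.cos(theta)
    return x, y


if __name__ == "__main__":
    u, v = target_to_radar(3.0, 4.0, 0.7)
    print(radar_to_target(u, v, 0.7))  # recovers (3.0, 4.0) up to rounding
```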


According to the geometric relationship in FIG. 5, a distance from any point P on a target surface to the radar can be expressed as:

$$R_\theta(x,y)=\sqrt{(R_0+v)^2+u^2}=\sqrt{R_0^2+x^2+y^2+2R_0(y\cos\theta-x\sin\theta)}$$


As the electromagnetic simulation process simulates rotation target imaging, the radar transmits a linear frequency modulated signal and receives an echo signal. The echo signal after mixing is:

$$S(k,\theta)=\iint f(x,y)\exp\{-j2\pi k[R_\theta(x,y)-R_0]\}\,dx\,dy$$


where S(k,θ) is received echo data, ƒ(x,y) is an electromagnetic scattering characteristic function of the target, k is a radar observing frequency, and C is the velocity of light. That is,

$$k=\frac{2}{C}\left(f_0+at\right)$$

As the electromagnetic simulation process meets a far-field condition, a distance R0 from the radar to a center point of a scene is far greater than that from the target to the center point of the scene; in this case, the formula $R_\theta(x,y)=\sqrt{(R_0+v)^2+u^2}=\sqrt{R_0^2+x^2+y^2+2R_0(y\cos\theta-x\sin\theta)}$ can be simplified as:

Rθ(x,y)≈R0+v=R0−x sin θ+y cos θ
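
The size of the error introduced by this far-field simplification can be checked numerically. The sketch below compares the exact slant range with R0+v; the point (3 m, 4 m) and R0 = 10 km are hypothetical values chosen only for illustration, and the function names are not from the disclosure.

```python
import math


def exact_range(x: float, y: float, theta: float, r0: float) -> float:
    """Exact distance sqrt((R0 + v)^2 + u^2) from point (x, y) to the radar."""
    u = x * math.cos(theta) + y * math.sin(theta)
    v = -x * math.sin(theta) + y * math.cos(theta)
    return math.hypot(r0 + v, u)


def far_field_range(x: float, y: float, theta: float, r0: float) -> float:
    """Far-field approximation R0 + v = R0 - x sin(theta) + y cos(theta)."""
    return r0 - x * math.sin(theta) + y * math.cos(theta)


if __name__ == "__main__":
    for deg in range(0, 360, 90):
        th = math.radians(deg)
        err = exact_range(3.0, 4.0, th, 1.0e4) - far_field_range(3.0, 4.0, th, 1.0e4)
        print(deg, err)  # residual is roughly u^2 / (2 * R0)
```

The neglected term is approximately u²/(2R0), so it shrinks as the radar moves farther from the scene center.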


Further, the simplified slant-range formula is substituted into the formula $S(k,\theta)=\iint f(x,y)\exp\{-j2\pi k[R_\theta(x,y)-R_0]\}\,dx\,dy$, and a relationship between the target echo S(k,θ) and the electromagnetic scattering characteristic function ƒ(x,y) of the target can be obtained:

$$S(k,\theta)=\iint f(x,y)\exp\{-j2\pi k[-x\sin\theta+y\cos\theta]\}\,dx\,dy$$


According to the formulas x=u cos θ−v sin θ and y=u sin θ+v cos θ, the formula $S(k,\theta)=\iint f(x,y)\exp\{-j2\pi k[-x\sin\theta+y\cos\theta]\}\,dx\,dy$ is coordinate-transformed to obtain:

$$\begin{aligned}S(k,\theta)&=\iint f(u\cos\theta-v\sin\theta,\ u\sin\theta+v\cos\theta)\exp(-j2\pi kv)\,du\,dv\\&=\int p_\theta(v)\exp(-j2\pi kv)\,dv\end{aligned}$$

where $p_\theta(v)=\int f(u\cos\theta-v\sin\theta,\ u\sin\theta+v\cos\theta)\,du$ indicates the projection of the electromagnetic scattering characteristic function ƒ(x,y) of the target on the v axis, and from the second equality above, $S(k,\theta)=\int p_\theta(v)\exp(-j2\pi kv)\,dv$,
it can be known that, in the case of a fixed observation angle, the echo data is obtained by conducting Fourier transform on $p_\theta(v)$. A two-dimensional spectrum slice of ƒ(x,y) can be expressed as F(k,θ). According to the formula $S(k,\theta)=\iint f(x,y)\exp\{-j2\pi k[-x\sin\theta+y\cos\theta]\}\,dx\,dy$, an expression of the slice can be obtained as $S(k,\theta)=F(-k\sin\theta,\ k\cos\theta)$.
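
The slice relation can be verified in one step. The derivation below assumes the two-dimensional Fourier transform of ƒ(x,y) uses the sign and 2π convention shown in its first line; the text does not state its convention, so this is an assumption.

```latex
F(k_x,k_y)=\iint f(x,y)\exp\{-j2\pi(k_x x+k_y y)\}\,dx\,dy
\quad\Longrightarrow\quad
F(-k\sin\theta,\ k\cos\theta)
  =\iint f(x,y)\exp\{-j2\pi k[-x\sin\theta+y\cos\theta]\}\,dx\,dy
  =S(k,\theta)
```

Each observation angle θ therefore samples the spectrum along a radial line through the origin, in the direction (−sin θ, cos θ).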


The RCS data of the target obtained by electromagnetic simulation at each observation angle is a slice sample of the two-dimensional spectrum F(kx,ky) of the target at that observation angle, and the spatial-domain scattering characteristic expression ƒ(x,y) of the target can then be obtained by inversion through a two-dimensional inverse Fourier transform.
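
The inversion step can be illustrated with a small numerical sketch. It is not the disclosed implementation: it simulates S(k,θ) for a single ideal point scatterer and inverts it by direct, unweighted summation over the sampled slices instead of an FFT-based two-dimensional transform. The scatterer position, the sampled k values and the grid are hypothetical.

```python
import cmath
import math


def simulate_echo(scatterers, ks, thetas):
    """S(k, theta) = sum_i a_i * exp(-j*2*pi*k*(-x_i*sin(theta) + y_i*cos(theta)))."""
    echo = {}
    for k in ks:
        for th in thetas:
            total = 0j
            for x, y, amp in scatterers:
                v = -x * math.sin(th) + y * math.cos(th)
                total += amp * cmath.exp(-2j * math.pi * k * v)
            echo[(k, th)] = total
    return echo


def back_project(echo, xs, ys):
    """Coherently sum every sample with the conjugate phase at each pixel."""
    image = {}
    for x in xs:
        for y in ys:
            acc = 0j
            for (k, th), sample in echo.items():
                v = -x * math.sin(th) + y * math.cos(th)
                acc += sample * cmath.exp(2j * math.pi * k * v)
            image[(x, y)] = abs(acc)
    return image


# Hypothetical sampling: 16 spatial frequencies, 72 angles over 0-360 degrees.
ks = [1.0 + 0.5 * i / 15 for i in range(16)]
thetas = [2.0 * math.pi * i / 72 for i in range(72)]
echo = simulate_echo([(2.0, -1.0, 1.0)], ks, thetas)
grid = [float(i) for i in range(-3, 4)]
image = back_project(echo, grid, grid)
peak = max(image, key=image.get)  # pixel with the strongest response
```

At the true scatterer position all phase terms cancel and the samples add coherently, so the strongest pixel is expected at the simulated position (2.0, −1.0).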


As the electromagnetic simulation process is similar to rotation imaging, the results obtained through RCS inversion imaging are slant-plane images, which need to be corrected to the ground plane. As the simulation geometry is a broadside side-looking view, the imaging geometric relationship is as shown in FIG. 6:


where A and B are two targets in the simulation process, R1 denotes a distance between the two targets on the ground plane, R2 denotes a distance between the two targets on the slant plane, and θ is an incident angle. Then, a conversion relationship between the ground plane and the slant plane is:

R2=R1 cos(θ)


The correction process adopts point-by-point correction. Firstly, a ground-plane scene area is selected, and the corresponding position in the slant image is calculated according to the formula R2=R1 cos(θ), and a pixel value of a target point is obtained by interpolation.
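
The point-by-point correction can be sketched for a single range line as follows. This is a simplified one-dimensional illustration, assuming linear interpolation, θ in radians between 0 and π/2, and uniform pixel spacings; the function and parameter names are hypothetical.

```python
import math


def slant_to_ground(slant_profile, slant_spacing, ground_spacing, n_ground, theta):
    """Resample a slant-range line onto ground range using R2 = R1 * cos(theta)."""
    corrected = []
    last = len(slant_profile) - 1
    for i in range(n_ground):
        r1 = i * ground_spacing            # ground-plane distance of this pixel
        r2 = r1 * math.cos(theta)          # corresponding slant-plane distance
        pos = r2 / slant_spacing           # fractional index in the slant line
        lo = int(pos)
        if lo >= last:
            # On the last sample keep its value; beyond the line fill with zero.
            corrected.append(slant_profile[last] if pos == last else 0.0)
            continue
        frac = pos - lo
        corrected.append((1.0 - frac) * slant_profile[lo] + frac * slant_profile[lo + 1])
    return corrected


if __name__ == "__main__":
    print(slant_to_ground([0.0, 10.0, 20.0, 30.0], 1.0, 1.0, 5, math.pi / 3))
```

For a full image the same lookup would be repeated for every range line of the selected ground-plane scene area.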


The method of acquiring SAR image data by electromagnetic simulation has a relatively low cost and is suitable for acquisition of large-scale remote sensing data.


Further, referring to FIG. 3, another embodiment of the disclosure further provides a method for SAR image data enhancement, wherein processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image includes:


step S210: constructing a generator and a discriminator in the generative adversarial network;


step S220: inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;


step S230: inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; and


step S240: outputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.


In this embodiment, the generative adversarial network is a generative model. The input of the model is an SAR electromagnetic simulation image, and the output is a set of virtual samples of the SAR target image similar to a real SAR target image. The difference between the input and output data distributions is a loss function of the generative model.


The generative adversarial network generates data by simultaneously training two models: a generator G and a discriminator D. The network structure is as shown in FIG. 7. G receives the SAR electromagnetic simulation image and generates a generative sample similar to a real SAR target image, processed so that it looks as real as possible. D is a binary classifier whose input is either the real SAR target image or the generative sample generated by the generator. D outputs the probability that the input image is from the real SAR target image: a high probability if the input image is from the real SAR target image, and a low probability otherwise. The process is essentially a model training process, which is a zero-sum game between G and D. In the loss function, the generator and the discriminator are trained alternately: G is trained to minimize the probability that D correctly identifies the sample generated by G, while D is trained to maximize the probability of correct discrimination. D receives the real SAR target image and a “false” SAR target image generated by G, and the parameters of both sides are adjusted and optimized according to the final output. If D makes a correct judgment, the parameters of G need to be adjusted to make the generated “false” SAR target image more “realistic.” If D makes a wrong judgment, the parameters of D need to be adjusted to avoid an error in the next similar judgment. In the process of training, the fit improves continuously, from unstable at the beginning to stable at the end. Ideally, when the final training is completed, the generative sample generated by the generator G is indistinguishable from the real SAR target image, and the discriminator D can no longer distinguish whether the image is from the real SAR target image or the generative sample of G. The product after training is a high-quality automatic generator G and a discriminator D with good judging capability.
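
The zero-sum game described above can be made concrete with the standard GAN value function, which is assumed here for illustration: V = mean(log D(real)) + mean(log(1 − D(generated))). D tries to raise V and G tries to lower it. The function name and the probability values are hypothetical.

```python
import math


def gan_value(d_real, d_fake):
    """V(G, D) computed from D's output probabilities on real and generated images."""
    real_term = sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term


if __name__ == "__main__":
    # A confident, correct discriminator yields a high value.
    print(gan_value([0.9, 0.95], [0.1, 0.05]))
    # A discriminator that cannot tell the two apart outputs 0.5 everywhere.
    print(gan_value([0.5, 0.5], [0.5, 0.5]))
```

When D outputs 0.5 for every input, the value equals −2·log 2, which corresponds to the ideal end state in which D can no longer distinguish the generative samples from real images.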


In this embodiment, SAR image data is expanded by using a trained generative model to obtain a set of virtual samples of the SAR target image. The SAR electromagnetic simulation image is processed by a generative adversarial network, and data richness of the SAR electromagnetic simulation image is further enhanced on the premise of obtaining a multi-azimuth SAR electromagnetic simulation image.


Further, another embodiment of the disclosure further provides a method for SAR image data enhancement, wherein the generative adversarial network is GraphGAN.


In this embodiment, some symbolic definitions are given first: $\mathcal{G}=(\mathcal{V},\mathcal{E})$ denotes a given network, $\mathcal{V}=\{v_1,\ldots,v_V\}$ denotes a set of nodes, and $\mathcal{E}=\{e_{ij}\}_{i,j=1}^{V}$ denotes a set of edges. For a given node $v_c$, $N(v_c)$ denotes the set of nodes (first-order neighbors) directly adjacent to the node $v_c$, and $p_{true}(v|v_c)$ denotes the true conditional distribution of nodes in the network about $v_c$, indicating the connection preference of the node $v_c$. From a certain point of view, $N(v_c)$ can be regarded as a sample set drawn from $p_{true}(v|v_c)$.


There are mainly two models in GraphGAN:

    • (1) generator G(v|vc; θG): a generative model G mainly tries to fit or estimate the real connection distribution probability as much as possible, so as to select a node most likely to be connected to vc from a set of nodes V;
    • (2) discriminator D(v, vc; θD): a discriminative model D mainly distinguishes a real node pair from a generative node pair, and calculates the possibility of an edge between the output nodes v and vc.


G is intended to generate nodes that are similar to the neighbor nodes actually connected to vc to fool the discriminator D. D is intended to determine which of these nodes are true neighbors of vc and which are generated by its opponent G. Therefore, an objective function of a minimax game with two opponents is:

$$\min_{\theta_G}\max_{\theta_D}V(G,D)=\sum_{c=1}^{V}\Big(\mathbb{E}_{v\sim p_{true}(\cdot|v_c)}\big[\log D(v,v_c;\theta_D)\big]+\mathbb{E}_{v\sim G(\cdot|v_c;\theta_G)}\big[\log\big(1-D(v,v_c;\theta_D)\big)\big]\Big)$$


The objective function can be understood in two steps. D(v, vc; θD) outputs a scalar, which indicates the possibility of existence of an edge between the nodes v and vc. Then,

    • (1) for θD, the discriminator, of course, wants to be able to predict it correctly, that is, to make the probability value of the actual sample large, and to make the probability value of the sample generated by G small, that is, to make (1−D(v, vc; θD)) large, and thus the whole is a maximized objective;
    • (2) for θG, from the perspective of the generator, to fool the discriminator, that is, to make the discriminator incapable of distinguishing a generated sample and regard it as a real sample, i.e., the probability value of existence of an edge between the generated sample and vc is predicted to be large, namely, to make (1−D(v, vc; θD)) small, and thus it is a minimized objective.


A minimax objective function can be obtained by combining the two objectives, as shown by the equation of the objective function.


The parameters of the generator and the discriminator are continuously updated by alternate training. In each iteration, the discriminator D is trained by positive samples from ptrue(v|vc) and negative samples from G. The generator G is updated according to a gradient strategy under the guidance of D.


In this embodiment, the discriminative model D is implemented as a sigmoid function:

$$D(v,v_c)=\sigma\left(d_v^{T}d_{v_c}\right)=\frac{1}{1+\exp\left(-d_v^{T}d_{v_c}\right)}$$


where $d_v, d_{v_c}\in\mathbb{R}^k$ are the k-dimensional vector expressions of the nodes v and vc in the discriminator, so θD can be regarded as the set of all $d_v$.


Therefore, for a given node pair (v, vc), only the corresponding node vectors $d_v$ and $d_{v_c}$ are updated by gradient ascent:

$$\nabla_{\theta_D}V(G,D)=\begin{cases}\nabla_{\theta_D}\log D(v,v_c), & \text{if } v\sim p_{true};\\ \nabla_{\theta_D}\log\big(1-D(v,v_c)\big), & \text{if } v\sim G.\end{cases}$$
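
One gradient-ascent update of the discriminator vectors can be sketched as follows. The sketch assumes the sigmoid form of D given above and uses log(1 − D) for generated pairs, following the objective function; the learning rate and function names are hypothetical.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def discriminator(d_v, d_vc):
    """D(v, v_c) = sigmoid(d_v . d_vc): estimated probability of an edge."""
    return 1.0 / (1.0 + math.exp(-dot(d_v, d_vc)))


def ascent_step(d_v, d_vc, is_true_pair, lr=0.1):
    """One gradient-ascent step on log D (true pair) or log(1 - D) (generated pair)."""
    p = discriminator(d_v, d_vc)
    # d/ds log(sigmoid(s)) = 1 - sigmoid(s); d/ds log(1 - sigmoid(s)) = -sigmoid(s)
    coeff = (1.0 - p) if is_true_pair else -p
    new_d_v = [a + lr * coeff * b for a, b in zip(d_v, d_vc)]
    new_d_vc = [b + lr * coeff * a for a, b in zip(d_v, d_vc)]
    return new_d_v, new_d_vc


if __name__ == "__main__":
    d_v, d_vc = [0.1, 0.2], [0.3, -0.1]
    print(discriminator(d_v, d_vc))
    print(discriminator(*ascent_step(d_v, d_vc, is_true_pair=True)))
    print(discriminator(*ascent_step(d_v, d_vc, is_true_pair=False)))
```

A step on a true pair raises D(v, vc), while a step on a generated pair lowers it, which matches the maximized objective for θD.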






The objective of the generator is to minimize the minimax function, and thus it can be optimized and updated by gradient descent. A gradient of the generator is:

$$\begin{aligned}\nabla_{\theta_G}V(G,D)&=\nabla_{\theta_G}\sum_{c=1}^{V}\mathbb{E}_{v\sim G(\cdot|v_c)}\big[\log\big(1-D(v,v_c;\theta_D)\big)\big]\\&=\sum_{c=1}^{V}\mathbb{E}_{v\sim G(\cdot|v_c)}\big[\nabla_{\theta_G}\log G(v|v_c)\,\log\big(1-D(v,v_c)\big)\big]\end{aligned}$$


It should be noted that the gradient $\nabla_{\theta_G}V(G,D)$ can actually be regarded as an expected sum of the gradients $\nabla_{\theta_G}\log G(v|v_c)$ weighted by $\log(1-D(v,v_c;\theta_D))$. That is, if a generative node is identified as a negative sample node, the probability D(v, vc; θD) will be small, and the weight corresponding to the gradient of the generative node will be large, so that the entire gradient will become large.
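
The weighted-gradient identity can be checked numerically on a toy case. The sketch below is illustrative only: it assumes a softmax generator over a small hypothetical candidate set (excluding vc), evaluates the expectation exactly instead of by sampling, holds the discriminator outputs fixed, and differentiates only with respect to the vector of vc.

```python
import math


def softmax_probs(g_vc, candidates):
    """G(v | v_c) over the candidate vectors, computed in a numerically stable way."""
    scores = [sum(a * b for a, b in zip(g_v, g_vc)) for g_v in candidates]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def generator_objective(g_vc, candidates, d_probs):
    """E_{v ~ G}[log(1 - D(v, v_c))] for one center node, with D held fixed."""
    probs = softmax_probs(g_vc, candidates)
    return sum(p * math.log(1.0 - d) for p, d in zip(probs, d_probs))


def generator_gradient(g_vc, candidates, d_probs):
    """E_{v ~ G}[grad log G(v | v_c) * log(1 - D(v, v_c))] with respect to g_vc."""
    probs = softmax_probs(g_vc, candidates)
    dim = len(g_vc)
    mean_g = [sum(p * g[i] for p, g in zip(probs, candidates)) for i in range(dim)]
    grad = [0.0] * dim
    for p, g_v, d in zip(probs, candidates, d_probs):
        weight = math.log(1.0 - d)
        for i in range(dim):
            # grad of log softmax with respect to g_vc is g_v minus the mean vector
            grad[i] += p * weight * (g_v[i] - mean_g[i])
    return grad
```

Comparing `generator_gradient` with a finite-difference derivative of `generator_objective` confirms that the two lines of the gradient formula agree for this toy setting.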


The implementation of the generative model is defined by a softmax function:

$$G(v|v_c)=\frac{\exp\left(g_v^{T}g_{v_c}\right)}{\sum_{v\neq v_c}\exp\left(g_v^{T}g_{v_c}\right)}$$


where $g_v, g_{v_c}\in\mathbb{R}^k$ are the k-dimensional vector expressions of the nodes v and vc in the generator, so θG can be regarded as the set of all $g_v$.


Based on such a setting, the estimated connection distribution G(v|vc; θG) is first calculated according to the formula

$$G(v|v_c)=\frac{\exp\left(g_v^{T}g_{v_c}\right)}{\sum_{v\neq v_c}\exp\left(g_v^{T}g_{v_c}\right)},$$

then a sample set (v, vc) is obtained by random sampling according to the probability values, and finally θG is updated with a stochastic gradient descent (SGD) method.
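
The first two steps, computing the connection distribution and sampling from it, can be sketched as follows. The node names and vectors are hypothetical, and the SGD update itself is omitted.

```python
import math
import random


def connection_distribution(embeddings, center):
    """G(v | v_c) for every node v other than v_c, as a dict {node: probability}."""
    g_c = embeddings[center]
    scores = {
        node: sum(a * b for a, b in zip(vec, g_c))
        for node, vec in embeddings.items()
        if node != center
    }
    top = max(scores.values())  # subtract the maximum for numerical stability
    exps = {node: math.exp(s - top) for node, s in scores.items()}
    total = sum(exps.values())
    return {node: e / total for node, e in exps.items()}


def sample_neighbors(embeddings, center, count, rng):
    """Draw `count` nodes v for the pairs (v, v_c) according to G(v | v_c)."""
    dist = connection_distribution(embeddings, center)
    nodes = list(dist)
    return rng.choices(nodes, weights=[dist[n] for n in nodes], k=count)


if __name__ == "__main__":
    emb = {"a": [1.0, 0.0], "b": [0.5, 0.5], "c": [0.0, 1.0]}
    print(connection_distribution(emb, "a"))
    print(sample_neighbors(emb, "a", 5, random.Random(0)))
```

The sampled pairs are the generated "negative" pairs that would be passed to the discriminator in each training iteration.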


In this embodiment, the generative adversarial network uses GraphGAN to connect generated nodes, which has a great advantage in improving the performance, and the generation effect is better than that of an ordinary GAN.


Further, another embodiment of the disclosure further provides a method for SAR image data enhancement, wherein the generator adopts a convolutional neural network structure.


In this embodiment, a convolutional neural network is a deep neural network with a convolution structure. The basic idea thereof is to construct a multi-layer network to express a target in a multi-layer way, so as to represent abstract semantic information of data through high-level features of the multi-layer network and obtain better feature robustness. Meanwhile, the convolution structure can reduce the memory of the deep network. The generator in this embodiment adopts a convolutional neural network structure, which, on the one hand, reduces the number of weights and makes the network easy to optimize, and, on the other hand, reduces the complexity of the model, that is, reduces the risk of overfitting. The advantage is more obvious when the input of the network is an image, so that the image can be directly used as the input of the network, avoiding the complex process of feature extraction and data reconstruction in traditional recognition algorithms. It has a lot of advantages in the process of two-dimensional image processing, for example, the network can extract image features including color, texture, shape and topology structure of the image, and has good robustness and operation efficiency in the two-dimensional image processing, especially in displacement identification, zooming and other forms of distortion invariance applications.
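
The reduction in the number of weights can be illustrated with simple arithmetic. The sketch below compares one convolutional layer with a fully connected layer that maps the same input to an output of the same spatial size; the 64×64 image size, 3×3 kernel and channel counts are hypothetical values.

```python
def conv_weights(kernel: int, c_in: int, c_out: int) -> int:
    """Weights plus biases of a kernel x kernel convolution shared over the image."""
    return kernel * kernel * c_in * c_out + c_out


def dense_weights(height: int, width: int, c_in: int, c_out: int) -> int:
    """Weights plus biases of a fully connected layer with the same input and output size."""
    n_in = height * width * c_in
    n_out = height * width * c_out
    return n_in * n_out + n_out


if __name__ == "__main__":
    print(conv_weights(3, 1, 16))
    print(dense_weights(64, 64, 1, 16))
```

Because the convolution kernel is shared across all pixel positions, its parameter count does not grow with the image size, whereas the fully connected count grows with the square of the number of pixels.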


In addition, referring to FIG. 4, another embodiment of the disclosure further provides a method for SAR image data enhancement, including the following steps of:


step S110: constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;


step S120: processing the electromagnetic simulation model by electromagnetic simulation software to obtain RCS data of the SAR target image;


step S130: obtaining the SAR electromagnetic simulation image by inverse imaging processing on the RCS data;


step S210: constructing a generator and a discriminator in the generative adversarial network;


step S220: inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;


step S230: inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; and


step S240: outputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.


In this embodiment, in step S110, a target scene is modeled based on real data of an SAR target image, and simulation parameters are set for the real data (mainly including resolution, incidence angle, carrier frequency band, etc.) to construct an electromagnetic simulation model. The electromagnetic model is established to mainly include point targets and ground targets. Modeling and simulation of point targets are mainly to verify the inverse imaging performance of RCS data and a relative positional relationship of the targets. Simulation of the ground targets mainly provides data input for subsequent deep learning. In electromagnetic simulation of high-frequency radar targets, corner reflectors or small-sized metal balls are often used to simulate point-target imaging, so the small-sized metal balls are used in electromagnetic modeling of the point targets. For the electromagnetic modeling process of the ground targets, in the embodiment of the disclosure, a 3D model is designed based on 3D Studio Max software and imported into a CST simulation software platform for experiments. Further adjustment on the grid of the imported model can ensure the simulation speed and a better effect. In addition, in the embodiment of the disclosure, the construction of the electromagnetic simulation model for the SAR target image has few restrictions and a low acquisition cost.


In step S120, simulation parameters of the constructed electromagnetic simulation model are set in CST simulation software, and RCS data, i.e., a scattering coefficient radar cross-sectional area, of the SAR target image corresponding to incident-angle azimuth is obtained with an algorithm A by simulation. The entire process of calculating the scattering coefficient radar cross-sectional area of the SAR target image is implemented by the CST simulation software. The scattering coefficient radar cross-sectional area is a converted area of a target, which is used to measure the intensity of echo generated by the target under the irradiation of radar waves. To determine a scattering coefficient radar cross-sectional area, the radar energy reflected by a target towards an observer is measured or calculated at first, then the size of a reflecting sphere (an optical equivalent can be a spherical mirror) that can return the same radar energy is calculated, and a projected area of the sphere (i.e. the area of a circle) is the scattering coefficient radar cross-sectional area of the target.


In step S130, since the process of calculation of the RCS data of the SAR target image by electromagnetic simulation is similar to that of radar acquisition of echo signals in rotation target imaging, inverse imaging of the RCS data can be achieved based on the rotation imaging principle. The imaging principle will be briefly introduced below.


The reconstructed geometry is as shown in FIG. 5, where (x,y) is a target coordinate system and (u,v) is a radar coordinate system. In the simulation process, the target and the radar move relatively, and a rotation angle is θm.


According to the coordinate transformation formula, a coordinate system transformation formula from x-y to u-v is:

u=x cos θ+y sin θ
v=−x sin θ+y cos θ


A coordinate system transformation formula from u-v to x-y is:

x=u cos θ−v sin θ
y=u sin θ+v cos θ
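As a minimal sketch, the two transformations can be written as a pair of functions and checked to be mutual inverses. The function names and the sample point are illustrative assumptions only.

```python
import numpy as np

def xy_to_uv(x, y, theta):
    """Target coordinates (x, y) to radar coordinates (u, v)."""
    u = x * np.cos(theta) + y * np.sin(theta)
    v = -x * np.sin(theta) + y * np.cos(theta)
    return u, v

def uv_to_xy(u, v, theta):
    """Radar coordinates (u, v) back to target coordinates (x, y)."""
    x = u * np.cos(theta) - v * np.sin(theta)
    y = u * np.sin(theta) + v * np.cos(theta)
    return x, y

theta = np.deg2rad(30.0)   # hypothetical rotation angle
x0, y0 = 3.0, -2.0         # hypothetical point on the target
u0, v0 = xy_to_uv(x0, y0, theta)
x1, y1 = uv_to_xy(u0, v0, theta)
```

Because the transformation is a pure rotation, it preserves the distance of the point from the scene center, which is what allows u²+v² to be replaced by x²+y² in the slant-range expression.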


According to the geometric relationship in FIG. 5, a distance from any point P on a target surface to the radar can be expressed as:

$$R_\theta(x,y)=\sqrt{(R_0+v)^2+u^2}=\sqrt{R_0^2+x^2+y^2+2R_0(y\cos\theta-x\sin\theta)}$$


As the electromagnetic simulation process simulates rotation target imaging, the radar transmits a linear frequency modulated signal and receives an echo signal. The echo signal after mixing is:

S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy


where S(k,θ) is the received echo data, ƒ(x,y) is the electromagnetic scattering characteristic function of the target, k is the radar observing frequency, and C is the velocity of light. That is,

$$k=\frac{2}{C}\left(f_0+at\right)$$

As the electromagnetic simulation process meets a far-field condition, the distance R0 from the radar to the center point of the scene is far greater than that from the target to the center point of the scene; in this case, the formula $R_\theta(x,y)=\sqrt{(R_0+v)^2+u^2}=\sqrt{R_0^2+x^2+y^2+2R_0(y\cos\theta-x\sin\theta)}$ can be simplified as:

Rθ(x,y)≈R0+v=R0−x sin θ+y cos θ
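The quality of the far-field simplification can be checked numerically. The sketch below compares the exact slant range with the simplified one over a full rotation; the stand-off distance and the target point are hypothetical values chosen only for illustration.

```python
import numpy as np

def exact_range(x, y, theta, r0):
    """Exact slant range sqrt((R0 + v)^2 + u^2) from the radar to the point (x, y)."""
    u = x * np.cos(theta) + y * np.sin(theta)
    v = -x * np.sin(theta) + y * np.cos(theta)
    return np.sqrt((r0 + v) ** 2 + u ** 2)

def far_field_range(x, y, theta, r0):
    """Simplified slant range R0 + v = R0 - x sin(theta) + y cos(theta)."""
    return r0 - x * np.sin(theta) + y * np.cos(theta)

r0 = 10_000.0      # hypothetical radar-to-scene-center distance in metres
x, y = 5.0, -3.0   # hypothetical target point in metres
thetas = np.linspace(0.0, 2.0 * np.pi, 361)

error = np.abs(exact_range(x, y, thetas, r0) - far_field_range(x, y, thetas, r0))
max_error = error.max()
```

The neglected term is approximately u²/(2R0), so for these values the error stays below two millimetres; whether that is acceptable depends on the wavelength used in a given simulation.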


Further, the simplified slant-range formula is substituted into the formula S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy, and a relationship between the target echo S(k,θ) and the electromagnetic scattering characteristic function ƒ(x,y) of the target can be obtained:

S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy


According to the formulas x=u cos θ−v sin θ and y=u sin θ+v cos θ, the formula S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy is coordinate-transformed to obtain:

$$S(k,\theta)=\int_u\int_v f(u\cos\theta-v\sin\theta,\,u\sin\theta+v\cos\theta)\exp(-j2\pi kv)\,du\,dv=\int_v p_\theta(v)\exp(-j2\pi kv)\,dv$$

where pθ(v)=∫uƒ(u cos θ−v sin θ,u sin θ+v cos θ)du indicates the projection of the electromagnetic scattering characteristic function ƒ(x,y) of the target on the v axis. From the formula

$$S(k,\theta)=\int_u\int_v f(u\cos\theta-v\sin\theta,\,u\sin\theta+v\cos\theta)\exp(-j2\pi kv)\,du\,dv=\int_v p_\theta(v)\exp(-j2\pi kv)\,dv$$

it can be known that, for a fixed observation angle, the echo data is the Fourier transform of pθ(v). A two-dimensional spectrum slice of ƒ(x,y) can be expressed as F(k,θ). According to the formula S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy, the slice can be expressed as S(k,θ)=F(−k sin θ,k cos θ).
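The slice relationship S(k,θ)=F(−k sin θ,k cos θ) can be verified numerically for a function whose two-dimensional spectrum is known in closed form. The sketch below assumes a shifted Gaussian as a stand-in scattering function and approximates the echo integral by a Riemann sum; the Gaussian, the grid, and all names are illustrative assumptions rather than anything prescribed by the disclosure.

```python
import numpy as np

X0, Y0 = 0.5, -0.25  # hypothetical center of the test function

def gaussian(x, y):
    """Test scattering function f(x, y) = exp(-pi * ((x - X0)^2 + (y - Y0)^2))."""
    return np.exp(-np.pi * ((x - X0) ** 2 + (y - Y0) ** 2))

def gaussian_spectrum(kx, ky):
    """Closed-form F(kx, ky) for the convention exp(-j 2 pi (kx x + ky y))."""
    shift = np.exp(-2j * np.pi * (kx * X0 + ky * Y0))
    return np.exp(-np.pi * (kx ** 2 + ky ** 2)) * shift

def echo(k, theta, axis):
    """Riemann-sum approximation of S(k, theta) on a square grid."""
    step = axis[1] - axis[0]
    gx, gy = np.meshgrid(axis, axis, indexing="xy")
    phase = -2j * np.pi * k * (-gx * np.sin(theta) + gy * np.cos(theta))
    return np.sum(gaussian(gx, gy) * np.exp(phase)) * step * step

axis = np.linspace(-6.0, 6.0, 241)   # grid spacing of 0.05
k, theta = 0.8, np.deg2rad(40.0)     # hypothetical observation point

s_numeric = echo(k, theta, axis)
s_slice = gaussian_spectrum(-k * np.sin(theta), k * np.cos(theta))
```

Agreement between `s_numeric` and `s_slice` shows that each observation angle samples the two-dimensional spectrum along a radial line, which is why echoes collected over many angles fill in the spectrum.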


The RCS data of the target obtained by electromagnetic simulation at each observation angle is a slice sample of the two-dimensional spectrum F(kx,ky) of the target at that observation angle, and a scattering characteristic time-domain expression ƒ(x,y) of the target can then be obtained by inversion through a two-dimensional Fourier transform.
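The inversion step can be sketched for ideal point scatterers. Note that the example below does not use an FFT with polar-to-Cartesian interpolation; it swaps in a direct summation of the inverse Fourier kernel over the polar samples, which is slower but avoids any interpolation choice. The frequency band, angular span, scatterer positions, and function names are all hypothetical.

```python
import numpy as np

C = 299_792_458.0  # velocity of light in m/s

def simulate_echo(scatterers, k, theta):
    """Echo S[k, theta] of point scatterers given as (x, y, amplitude) tuples."""
    kk, tt = np.meshgrid(k, theta, indexing="ij")
    echo = np.zeros(kk.shape, dtype=complex)
    for x, y, amp in scatterers:
        echo += amp * np.exp(-2j * np.pi * kk * (-x * np.sin(tt) + y * np.cos(tt)))
    return echo

def reconstruct(echo, k, theta, xs, ys):
    """Direct inverse summation over the samples F(-k sin(theta), k cos(theta))."""
    kk, tt = np.meshgrid(k, theta, indexing="ij")
    kx = (-kk * np.sin(tt)).ravel()
    ky = (kk * np.cos(tt)).ravel()
    samples = echo.ravel()
    image = np.zeros((ys.size, xs.size), dtype=complex)
    for iy, y in enumerate(ys):
        # Conjugate kernel exp(+j 2 pi (kx x + ky y)) for every x in this row.
        phase = 2j * np.pi * (np.outer(xs, kx) + y * ky)
        image[iy] = np.exp(phase) @ samples
    return np.abs(image) / samples.size

freqs = np.linspace(9.35e9, 9.85e9, 64)           # hypothetical X-band sweep
k = 2.0 * freqs / C                               # k = 2 f / C
theta = np.deg2rad(np.linspace(-1.5, 1.5, 64))    # hypothetical angular span
scatterers = [(1.0, -1.5, 1.0), (-2.0, 0.5, 0.6)]
xs = np.linspace(-3.0, 3.0, 61)
ys = np.linspace(-3.0, 3.0, 61)

image = reconstruct(simulate_echo(scatterers, k, theta), k, theta, xs, ys)
iy, ix = np.unravel_index(np.argmax(image), image.shape)
```

With these assumed parameters the strongest pixel falls at the position of the stronger scatterer, and the resolution in both directions is roughly 0.3 m, set by the frequency bandwidth in range and by the angular span in cross-range.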


As the electromagnetic simulation process is similar to rotation imaging, the results obtained through RCS inversion imaging are slant-plane images, which need to be corrected to the ground plane. As the simulation process uses a positive side-looking (broadside) view, the imaging geometric relationship is as shown in FIG. 6:


where A and B are two targets in the simulation process, R1 denotes a distance between the two targets on the ground plane, R2 denotes a distance between the two targets on the slant plane, and θ is an incident angle. Then, a conversion relationship between the ground plane and the slant plane is:

R2=R1 cos(θ)


The correction is performed point by point. Firstly, a ground-plane scene area is selected; then, for each point in that area, the corresponding position in the slant-plane image is calculated according to the formula R2=R1 cos(θ), and the pixel value of the target point is obtained by interpolation.
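The point-by-point correction can be sketched as a resampling of each range line. The example assumes that range runs along the second image axis, that the image is real-valued (for example, an amplitude image), and that linear interpolation is adequate; the disclosure does not specify the interpolation kernel, and the function name and parameters are hypothetical.

```python
import numpy as np

def slant_to_ground(slant_image, slant_spacing, ground_spacing, incidence_deg, n_ground):
    """Resample every range line of a slant-plane image onto a ground-plane grid."""
    theta = np.deg2rad(incidence_deg)
    ground_range = np.arange(n_ground) * ground_spacing          # R1 of each output pixel
    slant_range = ground_range * np.cos(theta)                   # R2 = R1 cos(theta)
    slant_axis = np.arange(slant_image.shape[1]) * slant_spacing
    ground_image = np.empty((slant_image.shape[0], n_ground), dtype=float)
    for row in range(slant_image.shape[0]):
        # Linear interpolation of the slant-plane pixel values at the computed positions.
        ground_image[row] = np.interp(slant_range, slant_axis, slant_image[row])
    return ground_image

# Hypothetical test image: each pixel stores its own slant range in metres.
slant = np.tile(np.arange(8, dtype=float), (3, 1))
ground = slant_to_ground(slant, slant_spacing=1.0, ground_spacing=1.0,
                         incidence_deg=60.0, n_ground=8)
```

In this test image every ground pixel receives half of its ground range, since cos 60° = 0.5, which makes the mapping easy to inspect before applying it to real data.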


The method of acquiring SAR image data by electromagnetic simulation has a relatively low cost and is suitable for acquisition of large-scale remote sensing data.


In steps S210, S220, S230 and S240, the generative adversarial network is a generative model. The input of the model is an SAR electromagnetic simulation image, and the output is a set of virtual samples of the SAR target image similar to a real SAR target image. The difference between the input and output data distributions is a loss function of the generative model.


The generative adversarial network generates data by simultaneously training two models: a generator G and a discriminator D. The network structure is as shown in FIG. 7. G receives the SAR electromagnetic simulation image and generates a generative sample that is made as similar as possible to a real SAR target image. D is a binary classifier whose input is either the real SAR target image or the generative sample generated by G. D estimates the probability that the input image comes from the real SAR target images: it outputs a high probability if the input is a real SAR target image and a low probability otherwise. The process is essentially a model training process in the form of a zero-sum game between G and D. In terms of the loss function, D is trained to maximize the probability of correct discrimination, while G is trained to minimize the probability that D identifies its generated samples; the generator and the discriminator are trained alternately. D receives both the real SAR target image and the “false” SAR target image generated by G, and the parameters of both sides can be adjusted and optimized according to the final output. If D makes a correct judgment, the parameters of G need to be adjusted to make the generated “false” SAR target image more “realistic.” If D makes a wrong judgment, the parameters of D need to be adjusted to avoid an error in the next similar judgment. During training, the generated data are fitted ever more closely to the real data, and the training goes from unstable at the beginning to stable at the end. Ideally, when the training is completed, the generative sample generated by G is indistinguishable from the real SAR target image, and D can no longer tell whether an image comes from the real SAR target images or from the generative samples of G. The product of the training is a high-quality automatic generator G and a discriminator D with good judgment capability.
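The opposing objectives of D and G can be made concrete with a small sketch of the loss values computed from discriminator outputs. It assumes the standard binary cross-entropy formulation of a generative adversarial network and the commonly used non-saturating generator loss; the disclosure does not state which variant is used, and the function names are hypothetical.

```python
import numpy as np

def binary_cross_entropy(prob, label, eps=1e-12):
    """Mean binary cross-entropy between predicted probabilities and a constant label."""
    prob = np.clip(prob, eps, 1.0 - eps)
    return -np.mean(label * np.log(prob) + (1.0 - label) * np.log(1.0 - prob))

def discriminator_loss(d_real, d_fake):
    """D is penalized for scoring real images low or generated samples high."""
    return binary_cross_entropy(d_real, 1.0) + binary_cross_entropy(d_fake, 0.0)

def generator_loss(d_fake):
    """Non-saturating variant (an assumption): G is rewarded when D scores its samples as real."""
    return binary_cross_entropy(d_fake, 1.0)

# Hypothetical discriminator outputs at the ideal end of training:
# D can no longer separate real from generated images and outputs 0.5 for both.
undecided = np.full(4, 0.5)
d_loss_equilibrium = discriminator_loss(undecided, undecided)
g_loss_fooling = generator_loss(np.array([0.9]))
g_loss_caught = generator_loss(np.array([0.1]))
```

In an alternating scheme, one update step lowers `discriminator_loss` with G held fixed and the next lowers `generator_loss` with D held fixed; at the ideal equilibrium the discriminator loss settles at 2 ln 2.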


In this embodiment, SAR image data is expanded by using a trained generative model to obtain a set of virtual samples of the SAR target image. The SAR electromagnetic simulation image is processed by a generative adversarial network, and data richness of the SAR electromagnetic simulation image is further enhanced on the premise of obtaining a multi-azimuth SAR electromagnetic simulation image.


In addition, another embodiment of the disclosure further provides an SAR image data enhancement apparatus, including at least one control processor and a memory for communicating with the at least one control processor. The memory may store instructions executable by the at least one control processor, and the instructions, when executed by the at least one control processor, enable the at least one control processor to perform the method for SAR image data enhancement as described in any of the above embodiments.


In this embodiment, the apparatus for SAR image data enhancement includes at least one control processor and a memory which can be connected by a bus or in other manners.


As a nonvolatile computer readable storage medium, the memory can be used to store nonvolatile software programs, nonvolatile computer executable programs and modules, such as program instructions/modules corresponding to the method for SAR image data enhancement in the embodiments of the disclosure. The control processor performs various functional applications and data processing of the apparatus for SAR image data enhancement by running the nonvolatile software programs, instructions and modules stored in the memory, that is, implements the method for SAR image data enhancement in the above method embodiments.


The memory may include a program storage area and a data storage area, where the program storage area may store an operating system and an application required by at least one function; and the data storage area may store data created according to use of the apparatus for SAR image data enhancement. In addition, the memory may include a high-speed random access memory, and may further include a nonvolatile memory, for example, at least one disk storage device, a flash memory device, or other nonvolatile solid-state storage devices. In some embodiments, the memory optionally includes memories remotely disposed relative to the control processor, and these remote memories may be connected to the apparatus for SAR image data enhancement via a network. Examples of the network include but are not limited to, the Internet, an intranet, a local area network, a mobile communications network, or a combination thereof.


The one or more modules are stored in the memory, and when executed by the one or more control processors, perform the method for SAR image data enhancement described in the above method embodiments, for example, perform the functions of steps S100 to S200, S110 to S130 and S210 to S240 in the method for SAR image data enhancement described above.


In addition, another embodiment of the disclosure further provides a computer readable storage medium, wherein the computer readable storage medium may store computer executable instructions, and when the computer executable instructions are executed by one or more control processors, for example, one control processor, the one or more control processors can be caused to perform the method for SAR image data enhancement described in the above method embodiments, for example, perform the functions of steps S100 to S200, S110 to S130 and S210 to S240 in the method for SAR image data enhancement described above.


The apparatus embodiments described above are merely schematic. The units described as separate parts may or may not be physically separate, may be located in one position, or may be distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.


From the description of the embodiments above, those skilled in the art may clearly understand that the embodiments may be implemented by software plus a necessary universal hardware platform. Those of ordinary skill in the art may understand that implementation of all or some of steps in the method of the above embodiments may be completed by a program instructing relevant hardware, the program may be stored in a computer readable storage medium, and when the program is executed, the process of the above method embodiment may be included. The storage medium may be a magnetic disk, an optical disk, a Read Only Memory (ROM), a Random Access Memory (RAM), or the like.


The above only describes the preferred embodiments of the disclosure. The disclosure is not limited to the above embodiments. Any embodiment should be encompassed in the protection scope of the invention as long as it achieves the technical effect of the disclosure with the same means.

Claims
  • 1. A method of enhancing synthetic aperture radar (SAR) image data, the method comprising: processing an SAR target image by an electromagnetic simulation to acquire an SAR electromagnetic simulation image;processing the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image;wherein processing the SAR target image by the electromagnetic simulation to acquire the SAR electromagnetic simulation image comprises:constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;processing the electromagnetic simulation model by an electromagnetic simulation software to obtain radar cross section (RCS) data of the SAR target image; andobtaining the SAR electromagnetic simulation image by an inverse image processing on the RCS data;wherein obtaining the SAR electromagnetic simulation image by the inverse image processing on the RCS data comprises:transforming the x-y coordinate system to u-v according to coordinate transformation formulas including: x=u cos θ−v sin θ; andy=u sin θ+v cos θ;wherein the coordinate transformation formulas from coordinate system u-v to x-y are given as follow: u=x cos θ+y sin θ; andv=−x sin θ+y cos θ;expressing a distance from any point P on a target surface to a radar by utilizing a property that in a far-field condition, a distance from the radar to a center point of a scene is greater than that from the target to the center point of the scene: Rθ(x,y)≈R0+v=R0−x sin θ+y cos θwherein R0 indicates a distance from the radar to a center point of a scene, defining a target echo as: S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy wherein k is a radar observing frequency;substituting Rθ(x,y)≈R0+v=R0−x sin θ+y cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy to obtain a relationship between the target echo S(k,θ) and the electromagnetic scattering characteristic function ƒ(x,y) of the target: 
S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy obtaining a Fourier transform on pθ(v) by substituting x=u cos θ−v sin θ and y=u sin θ+v cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=∫u∫vƒ(u cos θ−v sin θ,u sin θ+v cos θ)• exp(−j2πkv)dudv =∫vpθ(v)exp(−j2πkv)dv wherein pθ(v)=ƒ(u cos θ−v sin θ,u sin θ+v cos θ) indicates a projection of the electromagnetic scattering characteristic function ƒ(x,y) of the target on the v axis, expressing a two-dimensional spectrum slice of ƒ(x,y) as F(k,θ);obtaining an expression of the slice according to S(k,θ)=∫y∫xƒ(x,y)exp {−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=F(−k sin θ,k cos θ)slice sampling the RCS data at the observation angle F(kk, ky);applying two-dimensional Fourier transformation to F(kk, ky) to obtain a scattering characteristic time-domain expression ƒ(x,y) of the target F(kk, ky).
  • 2. The method according to claim 1, wherein processing the SAR electromagnetic simulation image and the SAR target image by the generative adversarial network to obtain the set of virtual samples of the SAR target image comprises: constructing a generator and a discriminator in the generative adversarial network;inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; andoutputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.
  • 3. The method according to claim 1, wherein the generative adversarial network is GraphGAN.
  • 4. The method according to claim 2, wherein the generator adopts a convolutional neural network structure.
  • 5. An apparatus for enhancing synthetic aperture radar (SAR) image data, the apparatus comprising: at least one control processor, anda memory communicating with the at least one control processor and having instructions stored thereon, and the instructions being executable by the at least one control processor to enable the at least one control processor to: process an SAR target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image;process the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image;wherein processing the SAR target image by the electromagnetic simulation to acquire the SAR electromagnetic simulation image comprises:constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;processing the electromagnetic simulation model by an electromagnetic simulation software to obtain radar cross section (RCS) data of the SAR target image; andobtaining the SAR electromagnetic simulation image by an inverse image processing on the RCS data;wherein obtaining the SAR electromagnetic simulation image by the inverse image processing on the RCS data comprises:transforming the x-y coordinate system to u-v according to coordinate transformation formulas including: x=u cos θ−v sin θ; andy=u sin θ+v cos θ;wherein the coordinate transformation formulas from coordinate system u-v to x-y are given as follow: u=x cos θ+y sin θ; andv=−x sin θ+y cos θ;expressing a distance from any point P on a target surface to a radar by utilizing a property that in a far-field condition, a distance from the radar to a center point of a scene is greater than that from the target to the center point of the scene: Rθ(x,y)≈R0+v=R0−x sin θ+y cos θwherein R0 indicates a distance from the radar to a center point of a scene, defining a target echo as: S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy wherein k is a radar 
observing frequency;substituting Rθ(x,y)≈R0+v=R0−x sin θ+y cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy to obtain a relationship between the target echo S(k,θ) and the electromagnetic scattering characteristic function ƒ(x,y) of the target: S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy obtaining a Fourier transform on pθ(v) by substituting x=u cos θ−v sin θ and y=u sin θ+v cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=∫u∫vƒ(u cos θ−v sin θ,u sin θ+v cos θ)• exp(−j2πkv)dudv =∫vpθ(v)exp(−j2πkv)dv wherein pθ(v)=ƒ(u cos θ−v sin θ,u sin θ+v cos θ) indicates a projection of the electromagnetic scattering characteristic function ƒ(x,y) of the target on the v axis, expressing a two-dimensional spectrum slice of ƒ(x,y) as F(k,θ);obtaining an expression of the slice according to S(k,θ)=∫y∫xƒ(x,y)exp {−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=F(−k sin θ,k cos θ)slice sampling the RCS data at the observation angle F(kk, ky);applying two-dimensional Fourier transformation to F(kk, ky) to obtain a scattering characteristic time-domain expression ƒ(x,y) of the target F(kk, ky).
  • 6. The apparatus according to claim 5, wherein processing the SAR electromagnetic simulation image and the SAR target image by the generative adversarial network to obtain the set of virtual samples of the SAR target image comprises: constructing a generator and a discriminator in the generative adversarial network;inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; andoutputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.
  • 7. The apparatus according to claim 5, wherein the generative adversarial network is GraphGAN.
  • 8. The apparatus according to claim 6, wherein the generator adopts a convolutional neural network structure.
  • 9. A computer readable storage medium having computer executable instructions stored thereon, wherein the instructions, when executed by a computer, causes the computer to: process a synthetic aperture radar (SAR) target image by electromagnetic simulation to acquire an SAR electromagnetic simulation image;process the SAR electromagnetic simulation image and the SAR target image by a generative adversarial network to obtain a set of virtual samples of the SAR target image;wherein processing the SAR target image by the electromagnetic simulation to acquire the SAR electromagnetic simulation image comprises:constructing an electromagnetic simulation model by using the SAR target image and simulation parameters;processing the electromagnetic simulation model by an electromagnetic simulation software to obtain radar cross section (RCS) data of the SAR target image; andobtaining the SAR electromagnetic simulation image by an inverse image processing on the RCS data;wherein obtaining the SAR electromagnetic simulation image by the inverse image processing on the RCS data comprises:transforming the x-y coordinate system to u-v according to coordinate transformation formulas including: x=u cos θ−v sin θ; andy=u sin θ+v cos θ;wherein the coordinate transformation formulas from coordinate system u-v to x-y are given as follow: u=x cos θ+y sin θ; andv=−x sin θ+y cos θ;expressing a distance from any point P on a target surface to a radar by utilizing a property that in a far-field condition, a distance from the radar to a center point of a scene is greater than that from the target to the center point of the scene: Rθ(x,y)≈R0+v=R0−x sin θ+y cos θwherein R0 indicates a distance from the radar to a center point of a scene, defining a target echo as: S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy wherein k is a radar observing frequency;substituting Rθ(x,y)≈R0+v=R0−x sin θ+y cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[Rθ(x,y)−R0]}dxdy to obtain a relationship between the target echo 
S(k,θ) and the electromagnetic scattering characteristic function ƒ(x,y) of the target: S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy obtaining a Fourier transform on pθ(v) by substituting x=u cos θ−v sin θ and y=u sin θ+v cos θ into S(k,θ)=∫y∫xƒ(x,y)exp{−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=∫u∫vƒ(u cos θ−v sin θ,u sin θ+v cos θ)• exp(−j2πkv)dudv =∫vpθ(v)exp(−j2πkv)dv wherein pθ(v)=ƒ(u cos θ−v sin θ,u sin θ+v cos θ) indicates a projection of the electromagnetic scattering characteristic function ƒ(x,y) of the target on the v axis, expressing a two-dimensional spectrum slice of ƒ(x,y) as F(k,θ);obtaining an expression of the slice according to S(k,θ)=∫y∫xƒ(x,y)exp {−j2πk[−x sin θ+y cos θ]}dxdy: S(k,θ)=F(−k sin θ,k cos θ)slice sampling the RCS data at the observation angle F(kk, ky);applying two-dimensional Fourier transformation to F(kk, ky) to obtain a scattering characteristic time-domain expression ƒ(x,y) of the target F(kk, ky).
  • 10. The computer readable storage medium according to claim 9, wherein processing the SAR electromagnetic simulation image and the SAR target image by the generative adversarial network to obtain the set of virtual samples of the SAR target image comprises: constructing a generator and a discriminator in the generative adversarial network;inputting the SAR electromagnetic simulation image into the generator to obtain a generative sample similar to a real SAR target image sample;inputting the SAR target image or the generative sample into the discriminator to obtain feedback information; andoutputting, by the generator, the set of virtual samples of the SAR target image according to the feedback information.
  • 11. The computer readable storage medium according to claim 9, wherein the generative adversarial network is GraphGAN.
  • 12. The computer readable storage medium according to claim 10, wherein the generator adopts a convolutional neural network structure.
Priority Claims (1)
Number Date Country Kind
201910589014.4 Jul 2019 CN national
US Referenced Citations (3)
Number Name Date Kind
20160019458 Kaufhold Jan 2016 A1
20200193227 Zhou Jun 2020 A1
20200264300 Rostami Aug 2020 A1
Foreign Referenced Citations (3)
Number Date Country
106355151 Jan 2017 CN
106569191 Apr 2017 CN
109934282 Jun 2019 CN
Non-Patent Literature Citations (1)
Entry
Wang, H., Wang, J., Wang, J., Zhao, M., Zhang, W., Zhang, F., Xie, X., & Guo, M. (2018). GraphGAN: Graph Representation Learning With Generative Adversarial Nets. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/11872 (Year: 2018).
Related Publications (1)
Number Date Country
20210003699 A1 Jan 2021 US