Detection of objects with a synthetic antenna

FIELD OF THE INVENTION

The present invention relates to the field of the detection of objects based on sensor signals. More specifically, it relates to the detection of objects in synthetic-antenna signals, carried out by combining distance measurements at various points.

BACKGROUND

A sonar is a measuring device that is widely used in underwater navigation for detecting/locating objects in the water and measuring their distance. An active sonar operates as follows:

- a sound wave is emitted;
- the sound wave is reflected by objects present in the water and the seabed;
- the sound waves thus reflected are picked up at a point close to the point of emission. The time differences between emission and reception and the intensity of the reflected wave make it possible to determine the distance from the closest point (object or seabed) in a given direction;
- the distances for the various directions are aggregated to form a sonar image, which actually represents the distances from the points closest to the sonar in the various directions.

An object in a sonar image may be detected using the shape of the echo of the object in the image, but also using its shadow, that is to say the shape of the portion of the seabed that is not reached by the sound wave emitted by the sonar, because it is masked by the object.

In order to improve the resolution of a sonar, what is known as a synthetic-antenna sonar system may be used. Synthetic-antenna sonar aims to improve resolution at a given range without increasing the physical linear dimension of the reception antenna. The principle of synthetic-antenna sonar consists in using a composite physical antenna formed by a linear array of N transducers. In this type of sonar, when the carrier is moving forward, an emitter, or emission antenna, emits M successive pulses in an elementary sector that is fixed with respect to the carrier. The signals received by the N transducers of the physical reception antenna at M instants, and therefore at M successive locations, are used to form the beams of the synthetic antenna. The resolution of the images that are obtained, that is to say the resolution of the beams of the synthetic antennas (“array beam resolution”), is substantially equivalent to that of a virtual antenna the length of which corresponds to the length traveled by the physical antenna during these M successive instants.

Synthetic-antenna sonars are widely used because they make it possible to significantly improve sonar resolution without having to make any hardware changes.

However, the shadows of objects perceived by synthetic-antenna sonars are affected by a penumbra effect, also known as a parallax effect: since the angles of emission of the wave and of reception of the echoes are modified between each image capture, the directions of the shadows are modified between each image capture. When the synthetic image is generated, the shadows are thus blurred.

In some cases, for example when an object has its height stretched, or is floating between two bodies of water, the shadow may even become practically undetectable. This may be the case for example with a school of fish.

A similar problem may be encountered for other types of synthetic antennas, that is to say sensors operating on the principle of emitting and receiving waves at various points, and generating a synthetic image based on reflected waves received at the various points. This is the case for example for synthetic-aperture radars, or certain types of ultrasound.

There is therefore a need for improved detection, using synthetic antennas based on the emission of waves and the reception of waves reflected at various points, of objects the shadow cast by which is subject to penumbra effects.

SUMMARY OF THE INVENTION

To this end, one subject of the invention is a computer-implemented method comprising: receiving a series of distance measurements generated, from a plurality of respectively different positions, by a detection system that operates by: emitting a wave; receiving waves reflected by the environment; determining distances by computing differences between the time of emission of the wave and the times of reception of the reflected waves; generating, based on said series of distance measurements, a synthetic image representing the distances of the environment from a reference position; for each focusing distance of a plurality of focusing distances: generating, based on said series of distance measurements or said synthetic image, a synthetic image focused at said focusing distance by applying penumbra effect compensation; detecting the presence of an object in said focused synthetic image.

Advantageously, the detection system is a sonar system, and the generation of the synthetic image defines a synthetic-antenna sonar.

Advantageously, detecting the presence of an object in said focused synthetic image comprises applying a supervised machine learning engine trained with a learning base comprising focused images of shadows of objects of the same type as said object.

Advantageously, the method comprises: prior to the detection: computing, for each pixel of the focused synthetic image, a ratio between the intensities of the pixel in the synthetic image and the synthetic image; thresholding the pixels of the synthetic image for which this ratio is greater than a threshold; applying a mathematical morphology operation to the thresholded pixels; applying said detection to the output of said mathematical morphology operation.

Advantageously, the plurality of focusing distances comprises a plurality of initial focusing distances defined by a first distance pitch over a first range of focusing distances, the method comprising: a step of defining a plurality of refined focusing distances, which are defined by: a second range of focusing distances, narrower than the first, around a first focusing distance of said plurality of initial focusing distances, at which the presence of an object has been detected; a second distance pitch, less than the first; for each focusing distance from among said plurality of refined focusing distances, said generating, based on said series of distance measurements, a synthetic image focused at said focusing distance by applying penumbra effect compensation.

Advantageously, the step of generating, based on said synthetic image, a synthetic image focused at said focusing distance by applying penumbra effect compensation is carried out by applying a one-dimensional filter to the synthetic image.

Advantageously, the method as claimed in claim 1 comprises: generating a modified focused synthetic image by adding a shadow associated with a label to said synthetic image focused at said focusing distance; generating a modified synthetic image by applying an inverse filter of said one-dimensional filter to the modified focused synthetic image; enriching a learning base for detecting the presence of objects with the modified focused synthetic image, said focusing distance and said label.

Advantageously, the method comprises, in the event of detection of the presence of an object in said focused synthetic image, generating a composite image based on said synthetic image and said focused synthetic image.

Advantageously, said generating the composite image comprises: detecting shadows in the focused synthetic image; assigning the following for each pixel of the composite image: the intensity value of the corresponding pixel of the focused synthetic image for each pixel belonging to a shadow; the intensity value of the corresponding pixel of the synthetic image otherwise.

Advantageously, said generating the composite image comprises assigning, for each pixel of the composite image, an intensity value equal to the weighted sum of the intensity value of the corresponding pixel of the focused synthetic image and of the intensity value of the corresponding pixel of the synthetic image, where, for each pixel, the relative weight of the intensity value of the corresponding pixel of the focused synthetic image increases with an index of belonging to a shadow of the pixel.

Advantageously, the method comprises defining a region of interest of the synthetic image, wherein the steps of generating, based on said series of distance measurements, a synthetic image focused at said focusing distance by applying penumbra effect compensation and detecting the presence of an object in said focused synthetic image are carried out only in the region of interest.

Advantageously, the region of interest of the synthetic image comprises: displaying the synthetic image on a graphical interface; the user drawing a rectangle defining the region of interest.

Advantageously, the method comprises displaying the focused or composite image inside said rectangle.

Another subject of the invention is a computer program product comprising computer code instructions that, when the program is executed on a computer, cause said computer to execute the method according to one of the embodiments of the invention.

Another subject of the invention is a data processing system comprising a processor configured to implement the method according to one of the embodiments of the invention.

Another subject of the invention is a computer-readable recording medium comprising instructions that, when they are executed by a computer, cause said computer to implement the method according to one of the embodiments of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

Other features, details and advantages of the invention will become apparent on reading the description given with reference to the appended drawings, which are given by way of example and in which, respectively:

FIG. 1 shows one example of generating a flow of synthetic-antenna sonar images able to serve as a basis for object detection in one set of modes of implementation of the invention;

FIG. 2a shows a first example of a shadow of a sonar image able to be used by a method according to one set of embodiments of the invention;

FIG. 2b shows a second example of a shadow of a sonar image able to be used by a method according to one set of embodiments of the invention;

FIG. 3 shows one example of a penumbra effect able to be compensated for as part of one embodiment of the invention;

FIG. 4a shows a first example of a computer-implemented method in one set of embodiments of the invention;

FIG. 4b shows a second example of a method in one set of embodiments of the invention;

FIG. 5 shows one exemplary application of penumbra effect compensation in one set of embodiments of the invention;

FIG. 6 shows one example of a graphical interface for defining a region of interest, and for displaying the focused synthetic image or the composite image in this region of interest.

DETAILED DESCRIPTION

The invention will be illustrated by examples relating to the detection of objects using a synthetic-antenna sonar. However, it is more generally applicable to any type of synthetic antenna based on the emission of waves at various points, the reception of reflected waves, and the generation of a synthetic image based on the reflected waves.

FIG. 1 shows one example of generating a flow of synthetic-antenna sonar images able to serve as a basis for detecting objects in one set of modes of implementation of the invention.

In this example, a sonar S, mounted on a vehicle, moves along an arbitrary but generally straight trajectory above the seabed. At regular intervals, S emits a pulse for imaging the background along a line of sight that generally aims to the side of the trajectory. {right arrow over (X_n)} denotes the position 110 of the sonar during the pulse, {right arrow over (r_n)} denotes the unit vector 120 giving the pointing direction of the sonar, and {right arrow over (x_n)} denotes the vector 130 tangent to the trajectory of the vehicle at the n^themitted pulse.

The pulse is described as the complex function p, a function either of the time of flight t of the pulse or of the sonar range r at the time of emission, the two variables being linked by the relationship r=c×t, where c is the speed of sound. The pulse is narrowband, modulating a carrier of wavelength λ.

The operating principle of the synthetic-antenna sonar will be explained taking the example of an arbitrary, fixed target A (that is to say a point for which it is sought to ascertain whether or not it is occupied) in the scene, the position of which is denoted {right arrow over (A)}.

The sonar S is formed of an emission antenna and a reception antenna. The emission antenna is configured to emit energy in an antenna lobe. For example, the lobes 141 and 142 respectively represent the antenna lobes of the emission antenna for pulses of index n−1 and n−2. The antenna lobe has an aperture β in the horizontal plane and an aperture with a bearing large enough to illuminate a large area on the bottom of the body of water. If this lobe is large enough, the object A may be visible on multiple consecutive pulses, with maximum energy when the point is exactly in the line of sight of the sonar.

Assuming that A is the only target, the raw acoustic signal received by the sonar at the n^thpulse, for the range r, is modeled as being:

$\begin{matrix} s^{raw} (n, r) = K (\vec{r_{n}}, \vec{X_{n} A}) p (r -  \vec{X_{n} A} ) e^{- j \frac{2 π}{λ}  \vec{X_{n} A} } G (❘ ∠ (\vec{r_{n}}, \vec{X_{n} A}) ❘) & (Equation 1) \end{matrix}$

Where:

- j is the imaginary number;
- K is a link budget term related to the distance between the sonar and the target, and the nature of the target;
- G is a gain function related to the antenna and to the absolute value of the angle formed by the vector {right arrow over (r_n)} and the vector {right arrow over (X_nA)}. This gain function is at a maximum when the angle is zero, and decreases toward zero when the angle exceeds the aperture β. Hereinafter, for simplification, it will be assumed that the gain is unity if the absolute value of the angle is smaller than β and zero otherwise (the sonar illuminates significantly only at an angle β);
- < is the symbol of the computation of an angle between two vectors.

In reality, the scene is composed of a multitude of targets 160. The synthetic-antenna integration process is carried out as the pulses progress, as follows: at emitted pulse number n, generating a grid of reference points that are uniformly spaced by a pitch δr*{right arrow over (r_n)} on the axis orthogonal to the trajectory and a pitch δr*{right arrow over (x_n)} on the axis of the trajectory; the set of targets is therefore defined by:

$\begin{matrix} {P_{n} (s, k) = s . δ r . {\vec{r}}_{n} + k . δ x . {\vec{x}}_{n}}, s = 0 . . s_{\max}, k = - \infty . . + \infty & (Equation 2) \end{matrix}$

In which s_maxis chosen with respect to the maximum range of the sonar.

With a fixed index k, the points {p_n(s, k)} form what is referred to as the kth beam of the pulse n (or the k^thbeam of ping n) 160; the coordinates k and s are called beam index and sample index, respectively. The points {p_n(s, k)} are notional reference targets.

A subset of beams K_n={k_n,min. . . k_n,max} is then selected, such that the points of these beams are those that have a greater gain at the pulse n (with respect to the position and the attitude of the sonar at this pulse) than the gain obtained for these same points in space at the position and the attitude of the sonar at any other pulse (in other words, these are the points that are able to be imaged best at the pulse n).

For each point (s, k), k∈K_n, I(n, s, k), the pulse interval, where P_n(s, k) is illuminated significantly by the sonar, is determined as:

$\begin{matrix} I (n, s, k) \overset{Δ}{=} {m \in ℕ ❘ ❘ ∠ ({\vec{r}}_{m r}, \vec{X_{m} P_{n}} (s, k) ❘ < β} & (Equation 3) \end{matrix}$

The synthetic-antenna integration for the point {right arrow over (P_n)}(i, k) is then carried out by forming the coherent sum of the raw signals obtained for this point, for all of the pulses where this is able to be perceived by the sonar:

$\begin{matrix} s^{S A S} (\vec{X_{n} P_{n}} (s, k)) = s^{S A S} (n, s, k) = \sum_{m \in I (n, s, k)} e^{\frac{j 2 π}{λ}  \vec{X_{m} P_{n}} (s, k) } s^{R A W} (m,  \vec{X_{m} P_{n}} (s, k) ) & (Equation 4) \end{matrix}$

The datum S^SAS(n, s, k) for the beams K_n={k_n,min. . . K_n,max} is called SAS antenna for the pulse n, SAS being the acronym for synthetic-aperture sonar. Equation 4 is known as the generalized backpropagation equation.

Next, a function w^SAS(i, b), called waterfall function, is defined such that:

$\begin{matrix} {\begin{matrix} b (n, k) = k + \sum_{m = 0}^{n - 1} ❘ K_{m} ❘ \\ w^{SAS} (s, b (n, k)) = s^{SAS} (n, s, k) \end{matrix} & (Equation 5) \end{matrix}$

In which the notation |K_m| designates the cardinal of the set K_m. The waterfall may be seen as a complex 2D image, with the axis of the beam indices b being called the azimuth axis and the axis of the indices i being the radial axis. It is also possible to parameterize the waterfall in terms of oblique distance r=s×δr and in terms of curvilinear abscissa x=b×δx.

FIG. 2a shows a first example of a shadow of a sonar image able to be used by a method according to one set of embodiments of the invention.

FIG. 2b shows a second example of a shadow of a sonar image able to be used by a method according to one set of embodiments of the invention.

Shadows are elements that, in an SAS image, contribute greatly to the understanding of the scene for operators or to automatic object detection. Indeed, by the very nature of the imaging process, it appears that echoes are relatively compacted on the oblique distance axis, thereby greatly hindering shape recognition for a human operator or a machine algorithm. On the contrary, when the shadows are extended, they make it possible to guess the shape of the object.

FIGS. 2a and 2b show two examples of perception of shadows by way of a sonar image. They respectively show two scenes 210a, 210b on the seabed 211, and two sonar images 220a, 200b of these two scenes, respectively.

These two scenes respectively show an object 212a placed on the seabed, and an object 212b, of the same size, floating between two bodies of water and attached to the seabed by a cable 213b (rope) attached to a base 214b (sinker). The image is produced by a sonar located at the point 230.

In the case of the scene 210a, the sonar will perceive the echo 221a and the shadow 222a of the object 212a. In the case of the scene 210b, the sonar will perceive the echo 221b of the object 212b, of the cable 213b and of the base 214b, and also the shadow 222b of the object 212b and of the cable 213b.

The shadows are therefore defined as areas for which no echo is perceived, because they are masked by an object. It may be seen in FIGS. 2a and 2b that, at the same size, in the case of an object floating between two bodies of water (midwater) the size of the shadow is significantly greater.

FIG. 3 shows one example of a penumbra effect able to be compensated for as part of one embodiment of the invention.

A penumbra effect, or parallax effect, is obtained when a target is illuminated only during some of the image captures of a synthetic-antenna sonar.

For example, in the case of FIG. 3, a target A may be masked by an object T. For example:

- In iteration n−2, the sonar is located at the position {right arrow over (X_n−2)} 310, and perceives all targets located in the shaded cone 311;
- In iteration n−1, the sonar is located at the position {right arrow over (X_n−1)} 320, and perceives all targets located in the shaded cone 321;
- In iteration n, the sonar is located at the position {right arrow over (X_n)} 330, and perceives all targets located in the shaded cone 331;
- In iteration n+1, the sonar is located at the position {right arrow over (X_n+1)} 340, and perceives all targets located in the shaded cone 341.

The target A is therefore illuminated by the sonar in iterations n−2 and n+1, but not in iteration n−1 or in iteration n. FIG. 3 also shows that the shadow cast by the object T varies as a function of the image capture angle. In practice, this means that, in the synthetic-antenna integration, the shadows will not be sharp, but blurred and affected by a penumbra effect.

This penumbra effect may remain acceptable for an object close to the seabed, such as the object 212a in FIG. 2a, but become problematic for an object stretched lengthwise, or floating between two bodies of water (midwater or floating near the surface), such as the object 212b in FIG. 2b. In this case, the shadow seen by the synthetic-antenna sonar may be excessively blurry or deformed to be able to recognize the object.

FIG. 4a shows a first example of a computer-implemented method in one set of embodiments of the invention.

The method 400a is a computer-implemented method that aims to detect objects that are subject to penumbra effects in measurements from synthetic antennas.

The method 400a comprises a first step 410 of receiving a series of distance measurements generated, from a plurality of respectively different positions, by a synthetic-antenna detection system that operates by: emitting a wave; receiving waves reflected by the environment; determining distances by computing differences between the time of emission of the wave and the times of reception of the reflected waves.

The method 400a is thus applicable to any synthetic antenna based on the emission of waves at various points, and the reception of reflected waves. It is applicable for example to synthetic-antenna sonars, synthetic-antenna radars, or scanners.

In one set of embodiments of the invention, the detection system is a sonar system forming a synthetic-antenna sonar.

The method 400a then comprises a step 420 of generating, based on said series of distance measurements, a synthetic image representing the distances of the environment from a reference position.

This step consists in forming a synthetic image of distances for a given position. For example, in the example shown in FIG. 1, the measurements carried out at the times n−2, n−1, n, n+1 and n+2 may be integrated in order to generate a sonar image for the position 110 at the instant n.

This synthetic image will simply be called “synthetic image”, as opposed to the “focused synthetic images” introduced in the remainder of this document. It may also be called “reference synthetic image”.

The method 400a then comprises, for each focusing distance of a plurality of focusing distances:

- a step 430 of generating, based on said series of distance measurements or said synthetic image, a synthetic image focused at said focusing distance by applying penumbra effect compensation;
- a step 440 of detecting the presence of an object in said focused synthetic image.

In other words, for each distance of a given set of focusing distances, the method 400a, in step 430, will generate a synthetic image focused at the desired distance, and then, in step 440, will detect the presence of an object in the focused synthetic image.

Step 430 makes it possible to obtain a synthetic image for which the shadows located at the distance under study are sharp. This therefore makes it possible, in step 440, to carry out object detection based on an image in which the shadows are sharp at a given distance, and therefore to improve object detection.

Steps 430 and 440 may be iterated for each distance of the plurality of distances. Sharp shadows may therefore be obtained for detection, for all desired distances.

The method 400a thus enables improved object detection in synthetic images from synthetic antennas that are subject to the penumbra effect.

For step 430, it is possible to use various penumbra effect compensation methods that have the effect of producing sharp shadows at a given distance.

For example, the penumbra effect compensation is carried out using a method known as “FFSE” (fixed focus shadow enhancement), described for example by Groen, J., Hansen, R. E., Callow, H. J., Sabel, J. C., & Sabo, T. O. (2008). Shadow enhancement in synthetic aperture sonar using fixed focusing. IEEE journal of oceanic engineering, 34(3), 269-284. That method is presented in that publication in the case of an underwater vehicle moving on a straight trajectory, with a sonar aiming at 90° from the trajectory. In this case, applying FFSE consists in replacing the synthetic-antenna sonar integration equation with the following equation, for a distance r_tfrom an object:

$\begin{matrix} s^{S A S} (\vec{X_{n} P_{n}} (s, k)) = s_{r_{T}}^{FFSE} (n, s, k) = \sum_{m \in I (n, s, k)} e^{\frac{j 2 π}{λ}  \vec{X_{m} P_{n}} (t, k) } s^{R A W} (m,  \vec{X_{m} P_{n}} (s, k) ) & (Equation 6) \end{matrix}$

$With t = ⌊ \frac{r_{T}}{δ r} ⌋$

In this case, the image associated with a point P_n(s, k) becomes blurry, but the transition between the image and the shadow is sharp. This operation is tantamount to focusing, at the distance r_T, all of the points located further away in the range, hence the term fixed focus. The waterfall, or synthetic image associated with the FFSE image, has the same definition (that is to say number of pixels and resolution) as the waterfall, or synthetic image, of the synthetic-antenna sonar.

Another possible penumbra compensation method is what is known as the HVPC (high variant phase compensation) method. The HVPC algorithm may conceptually be seen as a refinement of FFSE, taking into account the height of the object that generated the shadow. This height may be determined in various ways, for example be determined using interferometry, or by a third-party sonar, such as a volume sonar. The height of the object that generated the shadow may also be obtained by trigonometry based on the length of the shadow projected onto the background by the object and knowing the altitude of the sonar (and possibly the local bathymetry, determined using interferometry or by a dedicated sonar).

FIG. 5 shows one exemplary application of penumbra effect compensation in one set of embodiments of the invention.

Two images of a seabed scene depicting a shipwreck are shown in FIG. 5.

On the left, the image 510 is obtained using a synthetic-antenna sonar. In this image, the wreck 511 and the shadow 512 generated by the wreck are blurred.

On the right, the image 520 is obtained using the same synthetic-antenna sonar, and has also benefited from the application of a penumbra compensation method, in this example FFSE, parameterized by the distance between the reference point of the image capture and the wreck. In this image, the wreck 521 and the shadow 522 generated by the wreck have become sharp.

This example demonstrates the ability of a penumbra effect compensation method to generate sharp shadows for a given obstacle distance in one set of embodiments of the invention. The shadow that is thus obtained, which is much sharper, may thus be used to carry out object detection more efficiently.

According to various embodiments of the invention, the plurality of focusing distances may be obtained in various ways. For example, a plurality of predefined distances may be used. The distances may be defined by applying a distance pitch over a given range. The distance pitch may be defined so as to ensure that a sufficiently sharp shadow will be obtained for all possible distances within the range.

The focusing distances may for example be defined as distances regularly spaced by a pitch 41 over a distance range [r_min; r_max], that is to say the focusing distances are defined by the set {r_min; r_min+Δ₁; r_min+2*Δ₁. . . r_max}. The parameters r_min, r_maxand Δ₁may be defined in various ways. For example, in one set of embodiments:

$r_{\min} = 2 * w_{\min} * \frac{δ_{x}}{λ},$

where:

- w_minis the minimum width of the objects to be detected;
- δ_xis the resolution of the sonar;
- λ is the emission wavelength of the sonar;
- r_maxis the maximum range of the sonar;

$Δ_{1} = \frac{r_{\min}}{2} .$

The values of r_minand r_maxmay also be defined as a function of the minimum and maximum expected heights of the objects, and also their minimum and maximum expected distances. Indeed, the minimum and maximum heights and distances make it possible to identify the minimum and maximum distances of the shadows cast for the objects. The values of r_minand r_maxmay thus be defined as the minimum and maximum focusing distances at which a shadow is expected to be identified in the rest of the sonar range.

More generally, the values r_minand r_maxmay be defined so as to determine the minimum and maximum focusing distances at which it is relevant to search for a cast shadow, thus making it possible to limit the computation to distances at which a shadow could be identified. For example, r_maxmay be defined as the minimum out of the maximum range of the sonar and the maximum focusing distance at which a shadow is expected to be identified in the rest of the sonar range.

These values provide a good compromise between an objective of limiting the number of focusing distances to be tested (and therefore computational complexity) and a detection efficiency objective. Indeed, the distance r_mincorresponds to a distance below which the parallax effect is not significant, and objects may be detected directly without processing; r_maxis the maximum range of the sonar, and detecting shadows beyond this distance therefore does not make sense; Δ₁=r_min/2 is a good compromise in terms of granularity for the focusing distances. These parameters therefore make it possible to test the entire range of relevant distances, while at the same time limiting the complexity of the method.

According to various embodiments of the invention, step 440 of detecting the presence of an object in the focused synthetic image may be carried out in various ways.

In general, any shadow shape detection method may be used. For example, it is possible to use a machine learning method, which may comprise for example the use of artificial neural networks and/or deep learning.

In one set of embodiments of the invention, detecting the presence of an object in said focused synthetic image comprises applying a supervised machine learning engine trained with a learning base comprising focused images of shadows of objects of the same type as said object.

The supervised machine learning engine may thus have been trained with a training base formed of focused shadows of the desired object. For example, if the purpose of the method is to detect the presence of schools of fish, a supervised machine learning engine may have been trained with a database of focused images of shadows of schools of fish.

Such a supervised machine learning engine has the advantage of being able to be trained to detect any type of object, and to provide very efficient detection.

In one set of embodiments of the invention, the method comprises:

- prior to the detection:
- computing, for each pixel of the focused synthetic image, a ratio I_r(b, s) between the intensities of the pixel of the synthetic image and of the focused synthetic image;
- thresholding the pixels of the synthetic image for which this ratio I_r(b, s) is greater than a threshold;
- applying a mathematical morphology operation to the thresholded pixels;
- applying said detection to the output of said mathematical morphology operation.

If a pixel belongs to a shadow at the focusing distance, it will be darker in the focused image, and therefore its intensity value in the focused synthetic image will be lower than its intensity value in the synthetic image. Therefore, the ratio of the intensity value in the synthetic image divided by the intensity value in the focused synthetic image will be greater than 1. This pixel will therefore be chosen as a potential shadow pixel before the mathematical morphology is applied. In one set of embodiments of the invention, the chosen threshold is however greater than 1, in order to avoid generating false alarms.

In other words, for each of the focusing distances, step 440 may comprise, prior to the detection itself, pre-processing consisting, first of all, in thresholding the pixels corresponding to a shadow, and in then retaining only significant shadows. The detection itself, for example the application of a supervised machine learning engine, then applies only to these pixels belonging to significant shadows.

This makes it possible to make the detection more robust by limiting the detection to the most significant shadows at a given focusing distance.

Once the detection has been carried out, it may be carried out in various ways. By way of non-limiting example:

An alert may be raised;

- The synthetic image associated with the detection may be presented to an operator;
- The image may be stored in an image bank;
- etc.

More generally, automatic object detection may be used in a large number of fields, and any action able to be associated with automatic image detection may be implemented at the end of the detection.

In one set of embodiments of the invention, the focused synthetic images are generated, for each focusing distance, directly based on the series of distance measurements. For example, in the case of an application of FFSE to measurements from a sonar, this means that the FFSE technique is applied directly, for each focusing distance, to the sensor signals forming SAS beams (SAS beamforming).

Although this solution naturally provides a solution for obtaining focused images, it may prove costly in terms of computing time when focused synthetic images have to be obtained for a large number of focusing distances.

In order to limit computational complexity, in one set of embodiments of the invention, the step of generating, based on said synthetic image, a synthetic image focused at said focusing distance by applying penumbra effect compensation is carried out by applying one-dimensional filtering to the synthetic image.

The costly step of generating a synthetic image based on distance measurements is thus carried out just once, and all of the focused synthetic images are then obtained by 1-dimensional filtering, which is far less resource-intensive.

This method for computing focused synthetic images will now be described in one exemplary application to sonar images using FFSE.

In the case of a sonar orientation at 90° on a linear trajectory, given the SAS waterfall w^SAS(b, s) obtained using the SAS process with sampling at a pitch δx on the azimuth axis (finer than the physical resolution of the synthetic antenna) and a pitch or on the distal axis, the FFSE image may be obtained, to within a constant multiplicative coefficient, based on said waterfall by 1D (1-dimensional) matched filtering (that is to say a 1D correlation), on each column of constant index s, using the following signal described in the spatial domain and centered around b=0:

$\begin{matrix} h_{r_{t}}^{A P F F S E} (b, s) = K . p^{2} (b) e^{j \frac{4 π f_{0}}{c_{0}} (\sqrt{{(s . δ r)}^{2} + {(b . δ x)}^{2}} - \sqrt{r_{t}^{2} + {(b . δ x)}^{2}})} & (Equation 7) \end{matrix}$

Where:

$s = ⌊ \frac{r}{δ r} ⌋;$

$b = ⌊ \frac{x}{δ x} ⌋;$

- K is a normalization multiplicative constant;
- p(b) is an apodization function, which may be Gaussian, by way of non-limiting example:

$\begin{matrix} p (b, s, β) = \exp (- \frac{{(b . δ x)}^{2}}{2 {(α . s . δ r . \tan (\frac{β}{2}))}^{2}}) & (Equation 8) \end{matrix}$

- α is an adjustment parameter, for example α=1;
- β is the angular aperture of the emitter.

The purpose of this apodization function is to limit the spatial support of the filter to the spatial domain where the target at the range s.δr is illuminated, this illumination taking place over a length of typical standard deviation s.ϵr·tan (β/2). This result may be applied generally to the case where depointing is no more than 90°, at the expense of making the equations slightly more complicated. In this case, it is indeed necessary to consider a depointing angle θ_sand to take this depointing into account for the computing of the convolution filter h_r_t^APFFSEor spectral filter H_r_t^APFFSE. The exact formulation will depend on the beam geometry of the synthetic image that is formed, and involves adding terms in cos(θ_s) and sin(θ_ss) to the formulations of h_r_t^APFFSEand H_r_t^APFFSE.

FFSE may then be applied, to obtain a focused synthetic image, by way of 1D filtering of the waterfall, which may be implemented by way of 1D correlation for each column s:

$\begin{matrix} w^{'} (b, s) = (h_{r_{t}}^{A P F F S E} ★ w^{S A S}) (b, s) & (Equation 9) \end{matrix}$

In equivalent fashion, this operation may be implemented by a product in the spectral domain:

$\begin{matrix} W^{'} (ξ, s) = W_{s}^{S A S} (ξ, s) \times H_{r_{t}}^{A P F F S E} (ξ, s) & (Equation 10) \end{matrix}$

Where W_s^SAS(ζ, s), H_r_t^APFFSE(ζ, s) and W′(ζ, s) denote the Fourier transforms according to b (or on the azimuth axis x) respectively of the reference SAS waterfall w^SAS(b, s), of the FFSE filter h_r_t^APFFSE(b, s) and of the refocused waterfall w′(b, s).

The refocusing of the SAS beams, that is to say the transformation of the synthetic image into a focused synthetic image, may thus be carried out by:

- carrying out the Fourier transform W_s^SAS(ζ, s) on the image beams w^SAS(b, s);
- multiplying this Fourier transform by the filter H_r_t^APFFSE(ζ, s), storing the result in W′(ζ, s);
- carrying out the inverse Fourier transform of W′(ζ, s) in order to obtain the refocused image beams w′(b, s).

The refocusing may thus be applied to an already formed SAS image and be efficiently carried out by way of Fast Fourier Transforms (FFT). The focusing may therefore be carried out quickly and inexpensively over a large number of focusing distances.

It is also possible to carry out the inverse operation, that is to say to switch from a focused synthetic image (for example an FFSE focused image) to a synthetic image (for example a reference SAS image), by convoluting the focused synthetic image using the inverse filter h_r_t^APFFSE−1this being tantamount to dividing by H_r_t^APFFSE(ζ, s) in the spectral domain.

In one set of embodiments, this operation may be used to enrich a learning database for object detection.

To this end, in one set of embodiments of the invention, the method 400 comprises:

- generating a modified focused synthetic image by adding a shadow associated with a label to said synthetic image focused at said focusing distance;
- generating a modified synthetic image by applying an inverse filter of said one-dimensional filter to the modified focused synthetic image;
- enriching a learning base for detecting the presence of an object with the modified focused synthetic image, said focusing distance and said label.

In other words, a sharp shadow corresponding to a known object is added to an image focused at the focusing distance, and then inverse 1D filtering is applied in order to retrieve a reference synthetic image (for example reference SAS image) comprising the unfocused shadow. This image is added to a training base for detecting the presence of objects, with an object label (defining the type of object) and the focusing distance. The combination of the modified synthetic image, the focusing distance and the label corresponding to the type of object thus makes it possible to train the object detection at various focusing distances (the focusing distances at which the object of said type should or should not be detected are then known).

This makes it possible to efficiently construct a training base, since this offers the possibility of generating a large number of images for the training base, with a large number of shadow images, for a large number of focusing distances. This therefore makes it possible to train the object detection efficiently and quickly, for example by training a supervised learning engine for object detection. This also limits the need to obtain real images, thereby greatly simplifying the construction of the training base.

FIG. 4b shows a second example of a method in one set of embodiments of the invention.

The method 400b comprises all of the steps of the method 400a, and also three optional steps 450b, 460b and 470b. It should be noted that, although the three steps are shown in FIG. 4b, the three steps 450b, 460b and 470b may be executed independently. According to various embodiments of the invention, a method according to the invention may therefore comprise these three steps, none of them, or any combination of these three steps.

In step 460b, the focusing distances are refined, that is to say focusing distances are tested more finely around a distance considered to be relevant.

In one set of embodiments of the invention, the plurality of focusing distances thus comprises a plurality of initial focusing distances defined by a first distance pitch over a first range of focusing distances, and the method comprising:

- a step 460b of defining a plurality of refined focusing distances, which are defined by:
- a second range of focusing distances, narrower than the first, around a first focusing distance of said plurality of initial focusing distances, at which the presence of an object has been detected;
- a second distance pitch, smaller than the first;
- for each focusing distance from among said plurality of refined focusing distances:
- the step 430 of generating, based on said series of distance measurements, a synthetic image focused at said focusing distance by applying penumbra effect compensation.

In other words, initial focusing distances may be the set of distances of a range [r_min; r_max] with a coarse pitch Δ₁(that is to say the initial distances are defined by the set {r_min; r_min+Δ₁; r_min+2*Δ₁. . . r_max}; a first focusing distance r_festis selected from among the focusing distances, and then a range of refined distances is defined around the first focusing distance r_festat which the presence of an object has been detected, with a finer pitch Δ₂<Δ₁. For example, the refined focusing distances may be selected with the pitch Δ₂over the interval [r_fest−Δ₁; r_fest+Δ_1].

Next, the focusing step 430 is carried out for each refined focusing distance.

This makes it possible to test the focusing distances with a finer pitch around a focusing distance at which the presence of an object has been detected, and thus to enable focusing at a distance potentially closer to the distance of the object.

This makes it possible to obtain accurate results while at the same time limiting the computational load required by the method, since the focusing distances are tested with a coarse pitch over the entire possible range, but with a finer pitch for a range of focusing distances of interest.

The first focusing distance is thus a focusing distance for which the presence of an object has been detected.

In other words, as soon as an object is detected in step 440, the focusing distances are refined around the focusing distance that led to the detection of the object.

This makes it possible to refine the detection of objects around distances generating a detection, and therefore around the most relevant distances for detecting objects.

In this case, the refinement may be referred to as autofixed focus, because it consists in automatically determining the focusing distance that produces the sharpest shadows.

Various sharpness indices may be used. By way of non-limiting example, the metrics introduced by A. Buffington, F. S. Crawford, R. A. Muller, A. J. Schwemin and R. C. Smits, “Correction of atmospheric distortion with an image-sharpening telescope”, Jour. Acoust. Soc. Am., Vol. 67, no. 3, pp. 298-303, March 1977 may be used.

In one set of embodiments of the invention, the method 400b comprises, in the event of detection of the presence of an object in said focused synthetic image, a step 470b of generating a composite image based on said synthetic image and said focused synthetic image.

Indeed, as mentioned above, the focused synthetic image makes it possible to obtain sharp echoes and shadows for a given focusing distance, but while blurring the rest of the image. On the contrary, the unfocused synthetic image is sharper over the entire image, but exhibits the penumbra effect. The composite image produced based on the synthetic image and the focused synthetic image may therefore, at the same time, be sharp overall over the entire image and not exhibit a penumbra effect but, on the contrary, exhibit sharp shadows at the focusing distance.

Compositing step 470b therefore makes it possible to obtain an image that is as relevant as possible to be presented to an operator, in which the shadows are sharp at a given focusing distance, without sacrificing the sharpness of the rest of the image. This step may be carried out in various ways.

In one set of embodiments of the invention, step 470b of generating the composite image comprises:

- detecting shadows in the focused synthetic image;
- assigning the following for each pixel of the composite image:
- the intensity value of the corresponding pixel of the focused synthetic image for each pixel belonging to a shadow;
- the intensity value of the corresponding pixel of the synthetic image otherwise.

In other words, shadows are detected in the focused synthetic image, and the pixels of the composite image come from:

- the focused synthetic image, for pixels belonging to a shadow;
- the synthetic image for the others.

This provides a simple and effective solution for obtaining a composite image with sharp shadows, without sacrificing the general quality of the image.

Other ways of generating the composite image may be envisaged.

For example, in one set of embodiments of the invention, said generating the composite image comprises assigning, for each pixel of the composite image, an intensity value equal to the weighted sum of the intensity value of the corresponding pixel of the focused synthetic image and of the intensity value of the corresponding pixel of the synthetic image, where, for each pixel, the relative weight of the intensity value of the corresponding pixel of the focused synthetic image increases with an index of belonging to a shadow of the pixel.

Thus, in this case, the pixel intensity values are not selected exclusively from one image or the other, but are defined as a weighted average of the intensity values in both images, where the relative weight of the intensity of the corresponding pixel of the focused synthetic image is greater when the pixel is considered to belong to a shadow.

This makes it possible to benefit from the sharpness of the shadow from the focused synthetic image, while at the same time benefiting from a more gradual transition between shadows and the rest of the image.

The relative weight of the pixels from the two images (synthetic image and focused synthetic image) may be computed in various ways. For example, a low-pass filter may be applied to the focused synthetic image. Indeed, the low-pass filter makes it possible to efficiently determine the pixels forming or not forming part of a shadow.

For example, if the synthetic image is denoted I_A(b, s), the focused synthetic image is denoted I_B(b, s), the composite image is denoted I_C(b, s), and the focusing distance under consideration is denoted r_T, then the composite image may be generated as follows:

- For all pixels corresponding to an area closer than r_T(s,δr<r_T), which by definition cannot correspond to a shadow generated by an object at this distance r_T, the value of the pixel will be taken from the synthetic image: I_C(b, s)←I_A(b, s)

For the other pixels (that is to say those such that s.δr≥r_T), there is a process of:

- Computing m_athe average of I_A(b, s) and m_bthe average of I_B(b, s) for all values of b, s;
- Generating I_Lpa low-pass-filtered image of |I_B| using a support filter with B beams and S samples, B and S being adjustment parameters of the filter. m_LPdenotes the average of I_LP;
- Generating a weight

$w_{C} (b, s) = \frac{1}{1 + e^{- \frac{20 \log_{10} ❘ I_{LP} (b, s) ❘ - T}{τ}}}$

for each pixel, where T is a shadow threshold in dB and t is a filter adjustment parameter (log-sigmoidal weighting);

- Computing

$l_{C} (b, s) ⟵ (\frac{w_{C} (b, s)}{m_{A}} I_{A} (b, s) + \frac{1 - w_{C} (b, s)}{m_{B}} l_{B} (b, s)) \times m_{A}$

Each pixel of the composite image is thus a weighted average of the corresponding pixels of the synthetic image and the focused synthetic image, giving more weight to the focused synthetic image in shadow areas, and more weight to the synthetic image in other areas, while at the same time ensuring a smooth transition between these two types of areas.

In one set of embodiments of the invention, the method 400b comprises defining 450b a region of interest of the synthetic image, and steps 430 of generating the focused synthetic image and 440 of detecting an object are carried out only in the region of interest.

This makes it possible to limit the complexity of the method, since steps 430 and 440 are carried out only on a region of interest considered to be relevant.

In one set of embodiments of the invention, the region of interest comprises the entire image based on the focusing distance. In other embodiments of the invention, the region of interest comprises only a sub-part of the image, resulting from a division of the total image into blocks of beams or defined by a user, for example.

FIG. 6 shows one example of a graphical interface for defining a region of interest, and for displaying the focused synthetic image or the composite image in this region of interest.

In one set of embodiments of the invention, defining the region of interest of the synthetic image comprises:

- displaying the synthetic image on a graphical interface 610, 620;
- a user drawing a rectangle 621 defining the region of interest.

FIG. 6 shows two successive states 610, 620 of such a graphical interface.

In a first state 610, the graphical interface represents a synthetic image, or waterfall, of a sonar. In this representation, the range is defined by the horizontal axis: the pixels furthest to the right correspond to the points furthest from the sensor; the beams are defined by the vertical axis: the lowest points correspond to the beams acquired at the oldest dates and the highest points correspond to the beams acquired at the most recent dates.

The user is able, in the interface, to trigger the definition of the region of interest, for example by pressing a button 630, and trigger the compositing, for example by pressing the button 631. However, these buttons are provided only by way of example, and other means may be used to trigger the definition of the region of interest and/or the compositing, such as keyboard shortcuts or voice commands, for example.

In the example of FIG. 6, when the button 630 is activated by the user, the graphical interface moves to a step 620, where a rectangle 621 representing the region of interest appears:

- The left-hand vertical edge of the rectangle 621 represents the shortest range covered by the region of interest;
- The right-hand vertical edge of the rectangle 621 represents the longest range covered by the region of interest;
- The upper horizontal edge of the rectangle 621 represents the most recent beams covered by the region of interest;
- The lower horizontal edge of the rectangle 621 represents the oldest beams covered by the region of interest.

In this example, the user may thus manually define the region of interest directly on the synthetic image. The user may modify the region of interest by modifying the rectangle, for example by modifying the rectangle using a moving cursor (which may for example be manipulated using an input interface such as a mouse or touch sensor): for example, the user may drag a corner, drag an edge, or drag-and-drop the entire rectangle.

In one embodiment of the invention, the method 400b comprises displaying the focused synthetic image or the composite image inside said rectangle.

In the example of FIG. 6, the composite image is generated for the region of interest defined by the rectangle 621, and displayed directly in this rectangle.

This allows the user to directly visualize the effect of the focusing on the image.

The composite image may be generated and displayed when the user presses the button 631, or makes another compositing command (keyboard shortcut, voice command, etc.).

As an alternative, the compositing and the display may be carried out in real time, when the rectangle 621 is displayed, and as soon as the user modifies the rectangle 621. This allows the user to visualize the result of the focusing and the compositing in real time.

In the example of FIG. 6, the focusing and the shadow detection are parameterized by a reference range r_Trepresented by the line 622, which may also be moved by the user to modify the parameterization. Like for the rectangle 621, a movement of the line 622 may result in the focusing, the shadow detection, the compositing and the displaying of the focused or composite synthetic image being carried out again in real time. The reference range r_Tmay be displayed 623 to the user.

The user is thus able, in real time, to modify the shadowing parameters and visualize the obtained result.

The above examples demonstrate the ability of the invention to improve the detection of objects in images from a synthetic antenna. However, they are given only by way of example and in no way limit the scope of the invention as defined in the claims below.

Detection of objects with a synthetic antenna

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

CROSS-REFERENCE TO RELATED APPLICATIONS

PCT Information