The present invention relates to a sensor device, for example a sensor device suitable for use in imaging applications, in particular, time of flight imaging.
Single photon lidar based on the principle of time of flight (ToF) has become a prominent tool for high-speed ToF imaging in recent years. At the core of the technique, the depth of an object of interest is measured directly through time-correlated single-photon counting data or by constructing a histogram of time delays.
The development of high rate, high resolution, low-power ToF image sensors is challenging due to large data volumes required. This may cause a major data processing bottleneck on the device when either the number of photons per pixel is large, the time resolution is fine or the spatial resolution is high, as the space requirement, power consumption and computational burden of the depth reconstruction algorithms scale with these parameters.
Known methods to mitigate data transfer issues include, for example: direct calculation of a depth from SPAD pulses; embedded histogram processing to estimate the histogram peak position; motion triggered in-pixel histograms; range-gating histograms, and discarding photon detections based on activity. These techniques may compromise, for example, computation, accuracy, or resolution to achieve reduced data rates which may limit their effectiveness.
Compressed sensing (CS) is a set of techniques that have been proposed to reduce the number of acquisitions (measurements) in sensing applications. Compressed sensing has been applied to MRI and in optical imaging. Known compressed sensing imaging solutions have been proposed but are based on spatial encoding or spatial temporal encoding (i.e. video) of the 2d or 3d image measurements. Compressive Learning (CL) is an extension of CS that generalizes CS theory to machine learning problems and the “measurement” and reconstruction of probability distributions. CL may be performed through a collection of “random” statistics including the random Fourier features, which are popular in machine learning. To date, known applications of CL may be found in machine learning, distributed machine learning and privacy-based machine learning.
In accordance with a first aspect there is provided a sensor device for photon-based imaging, comprising: one or more photon detectors configured to produce a plurality of photon detection signals in response to a plurality of photon detection events, wherein each photon detection event has a corresponding detection time and wherein the detection times of the plurality of photon detection events are distributed in accordance with a distribution over time; and processing circuitry configured to perform a process, for example, a sketching process, using timing information of the plurality of photon detection events to obtain a compressed representation of the distribution over time. The sketching process may comprise: generating a plurality of feature values based on the timing information of the plurality of photon detection events using one or more feature functions. The sketching process may further comprise combining the generated plurality of feature values to obtain the compressed representation of the distribution over time, wherein the feature functions have one or more properties such that combining the plurality of feature values generated using the one or more feature functions preserves signal information and/or supresses background information and/or distinguishes signal information from background information in the compressed representation. The compressed representation may be such that at least one or more desired parameters of the distribution over time can be estimated by performing a parameter estimation process using the compressed representation. The parameter estimation process may be based on a model of the distribution over time.
Each photon detector may comprise a single photon avalanche diode (SPAD). The compressed representation may be referred to as a sketch. The compressed representation may comprise sufficient information to provide an estimate of one or more parameters of the distribution over time. The compressed representation may be such that storage and/or transmission of the compressed representation uses less storage capacity and/or communication capacity than storage and/or transmission of the plurality of photon detection event signals and/or a histogram-based representation of the distribution over time. The compressed representation may be such that obtaining the compressed representation uses less computational resources than obtaining a histogram-based representation of the distribution over time. The compressed representation may be such that performing the parameter estimation process uses less computational resources than estimating the desired parameters from the plurality of photon detection event signals and/or a histogram-based representation.
The sensor device may be for use in an imaging process in which a scene comprising one or more targets is imaged and the distribution over time may be dependent on the scene being imaged. The scene being imaged may contain a number of targets and the size of the compressed representation may scale with at least the number of targets in the scene, per-pixel.
The plurality of photon detection events may comprise signal and background photon detection events. The signal photon detection events may correspond to detection of scattered and/or reflected and/or transmitted photons. The signal photon detection events may correspond to detection of photons produced by an illumination source associated with the sensor device and subsequently scattered and/or reflected and/or transmitted. The plurality of feature values produced by the sketching process may comprise signal and background feature values corresponding to feature values produced by signal photon detection events and background photon detection events, respectively.
The compressed representation is such that the storage capacity and/or communication capacity required by the compressed representation does not depend on the number of photon detection events and/or a detection time resolution. The distribution over time may comprise a time-delay distribution.
The one or more feature functions may be based on an expected background distribution of the photon detection events.
The one or more feature functions may comprise a functional form such that, when combining the feature values, the feature values associated with background photon detection events substantially cancel to provide a zero or at least substantially supressed contribution to the compressed representation.
The substantially supressed contribution may be relative to a corresponding contribution to the combination from feature values associated with signal photon detection events.
The one or more feature functions may comprise a mapping of timing information to a phase representation so that the feature values are representative of a phase, such that, when the feature values are combined, the phases for background events substantially cancel.
The one or more feature functions may comprise at least one of: a non-linear function, a periodic function, a continuous function. The one or more feature functions may be dependent on at least time.
The one or more feature functions may comprise a set of: periodic triangular functions, harmonic periodic functions, periodic complex representations. The one or more feature functions may produce non-zero valued feature values and/or wherein combining the feature values uses feature values obtained from each of the plurality of photon detection events.
The one or more feature functions may comprise at least one of: a polynomial function; a piecewise polynomial function; a spline function.
The feature functions may comprise or be configured to be formed from a set of basis functions. The basis functions may comprise spline or polynomial functions or sinusoidal functions. Using the one or more feature functions may comprise approximating the feature functions using the set of basis functions, for example, by forming a linear combination of their output. The set of basis function may comprise polynomial or spline functions or sinusoidal functions.
The set of basis functions may have one or more properties that allow a compressed representation to be formed using their output, for example, values generated using the set of basis functions. The one or more properties include: the set of polynomial basis functions may be combined to form an approximation of a sinusoidal function or other periodic, non-linear function; each basis function may comprise a local portion centred on a respective time; the respective times may correspond to regular time intervals across a time window; the respective times may correspond to irregular time intervals across the time window; the local portion of each basis function may overlap in time with one or more of the other basis functions; the local portion may be periodic; the local portion may comprise a polynomial function. At least one basis function may comprise an increasing portion over an interval of time and at least one other basis function may have a decreasing portion over said interval of time. The local portion may be non-zero over a time interval.
The sketching process and/or the parameter estimation process may comprise combining feature values generated by the set of basis functions, for example, a linear combination of feature values.
The photon detection events may occur over a pre-determined detection time period. The one or more feature function may comprise a periodic function characterised by a frequency that is a multiple of the inverse of the pre-determined detection time period. The frequency may be an integer multiple of the inverse of the pre-determined detection time period.
The sketching process comprises selecting more than one frequency for the one or more feature functions in accordance with a frequency-selection scheme. The sketching may further comprise generating feature values and combining said feature values for each selected frequency. The compressed representation may comprise at least one total value for each frequency. The compressed representation may be characterised by a sketch size. The sketch size may correspond to at least the number of frequencies selected, preferably twice the number of frequencies selected.
The selection of the frequency may be such that combining feature values generated by the sketching function at each of the selected frequencies suppresses the background information. The frequencies may be selected in accordance with a random distribution. A first pre-determined number of frequencies from a set of frequencies may be selected. A frequency equal to zero is selected to provide information related to the number of photon detection events.
The timing information may comprise a representation of the detection time of each photon detection event signal. The timing information may comprise a count of photon detection events in one or more pre-determined timing intervals. The sketching process may comprise obtaining one or more feature values from the one or more feature functions using the detection time of each photon detection event signal thereby to obtain the plurality of feature values. The processing circuitry may be configured to obtain a histogram-based representation of the timing information and performing the sketching process on the histogram-based representation.
The processing circuitry may be further configured to obtain at least one feature value for the one or more feature functions, dependent on the timing information.
Obtaining the feature values may comprise retrieving a value from a look-up table and/or performing a feature value generating algorithm, for example, digit-by-digit algorithm or CORDIC algorithm.
The device may further comprise processing circuitry associated with the one or more photon detectors. The processing circuitry may be configured to process the plurality of produced photon detection signals to produce a plurality of photon detection event signals. Each photon detection event signal may comprise time data representative of the detection time at which the photon detection event occurred.
Combining the plurality of the feature values may comprise storing a running total or average of the feature values and updating the running total or average using a new feature value in response to a photon detection event. At least one running total or average of the feature values may be stored and/or updated per selected frequency.
The running total or average may be represented by an intermediate compressed representation. The intermediate compressed representation may be stored and/or updated in response to a photon detection event.
The device may comprise a memory resource for storing the compressed representation and/or a communication interface for communication of the compressed representation to a further device.
The model of the distribution over time may be based on a photon time-of-flight observation model. The model of the distribution over time may comprise a first portion associated with signal photon detection events and a second function associated with background photon detection events. The model of the distribution over time may comprise a first portion representative of an expected distribution of signal photon detection events and/or a second portion representative of an expected distribution of background photon detection events.
The model of the distribution over time may comprise a model based on an impulse response function or an empirical approximation of the impulse response function. The model of the distribution over time may comprise a portion based on a Poisson distribution. The model of the distribution over time may comprise at least one Gaussian portion for the signal photon detection events. The model of the distribution over time may comprise a portion representative of background photon detection event. The model of the distribution over time may comprise a uniform distribution for the background photon detection events. The model of the distribution over time may comprise a first portion based on a model of an expected distribution of signal photon detection events and a second portion based on model of an expected distribution of background photon detection events.
The one or more desired parameters may comprise a measure of distance between the sensor device and a target in a scene and/or a measure of intensity of reflection of light by a target. The parameter may comprise a count of photon detection events.
The model of the distribution over time may comprise at least a portion that is dependent on intensity. One or more properties of the model of the distribution over time may be selected based on intensity.
The one or more parameters may be estimated from the compressed representation by performing a mathematical optimization process using the compressed representation.
The processing circuitry may be further configured to perform the parameter estimation process on the compressed representation thereby to determine an estimate for the one or more desired parameters.
The device may further comprise a communication resource for communicating the determined estimates to a further computing resource.
In accordance with a second aspect, there is provided an imaging system comprising the sensor device provided in accordance with the first aspect and an illumination source.
In accordance with a third aspect, which may be provided independently, there is provided a method comprising: performing a process, for example, a sketching process, using timing information of a plurality of photon detection events, wherein each photon detection event has a corresponding detection time and wherein the detection times of the plurality of photon detection events are distributed in accordance with a distribution over time, to obtain a compressed representation of the distribution over time. The sketching process may comprise: generating a plurality of feature values based on the timing information of the plurality of photon detection events using one or more feature functions. The sketching process may further comprise combining the generated plurality of feature values to obtain the compressed representation of the distribution over time, wherein the one or more feature functions have one or more properties such that combining the plurality of feature values generated using the one or more feature functions preserves signal information and/or supresses background information and/or distinguishes signal information from background information in the compressed representation. The compressed representation may be such that at least one or more desired parameters of the distribution over time can be estimated by performing a parameter estimation process using the compressed representation. The parameter estimation process may be based on a model of the distribution over time.
In accordance with a fourth aspect there is provided a non-transitory computer readable medium comprising instructions operable by a processor to perform the method of the third aspect.
In accordance with a fifth aspect, which may be provided independently, there is provided a sensor device for photon-based imaging, comprising: one or more photon detectors configured to produce a plurality of photon detection signals in response to a plurality of photon detection events, wherein each photon detection event has a corresponding detection time and wherein the detection times of the plurality of photon detection events are distributed in accordance with a distribution over time; and processing circuitry configured to perform a process, for example, a sketching process, using timing information of the plurality of photon detection events to obtain a compressed representation of the distribution over time. The process may comprise: generating a plurality of feature values based on the timing information of the plurality of photon detection events using one or more feature functions. The sketching process may further comprise combining the generated plurality of feature values to obtain the compressed representation of the distribution over time.
The feature functions may comprise a set of basis functions. The set of basis functions may comprise spline or polynomial functions or sinusoidal functions. At least two of the one or more feature functions may have non-zero portions that overlap in time. The one or more feature functions may comprise a set of functions such that at least two of the one or more feature functions have a non-zero value over at least part, optionally all of a time window. At least one basis function may comprise an increasing portion over an interval of time and at least one other basis function may have a decreasing portion over said interval of time.
The set of basis functions may have one or more properties that allow a compressed representation to be formed using the generated feature values. The one or more properties include: the set of basis functions may be combinable to form an approximation of a sinusoidal function or other periodic, non-linear function; each basis function may comprise a local portion centred or associated with a respective time; the local portion of each basis function may overlap in time with one or more portions of the other basis functions; the local portion may be periodic; the local portion may be polynomial.
Each basis function may comprise a local portion centred or associated with a respective time. The respective times of each basis function may correspond to regular time intervals across a time window. The respective times of each basis function may correspond to irregular time intervals across the time window. The one or more feature functions may comprise one or more functions locally centred at one or more times.
The compressed representation may preserve signal information and/or supress background information and/or distinguishes signal information from background information in the compressed representation. The compressed representation may be such that at least one or more desired parameters of the distribution over time may be estimated by performing a parameter estimation process using the compressed representation. The parameter estimation process may be based on a model of the distribution over time. The process and/or the parameter estimation process may comprise combining feature values generated by the set of basis functions, for example, a linear combination of feature values.
In accordance with a sixth aspect, which may be provided independently, there is provided a method comprising: performing a process, for example, a sketching process, using timing information of a plurality of photon detection events, wherein each photon detection event has a corresponding detection time and wherein the detection times of the plurality of photon detection events are distributed in accordance with a distribution over time, to obtain a compressed representation of the distribution over time. The process may comprise: generating a plurality of feature values based on the timing information of the plurality of photon detection events using one or more feature functions. The process may comprise combining the generated plurality of feature values to obtain the compressed representation of the distribution over time.
In accordance with a seventh aspect there is provided a non-transitory computer readable medium comprising instructions operable by a processor to perform the method of the sixth aspect.
There may be provided a method, system or device substantially as described herein with reference to the accompanying drawings.
Features in one aspect may be provided as features in any other aspect as appropriate. For example, features of a method may be provided as features of an apparatus and vice versa. Any feature or features in one aspect may be provided in combination with any suitable feature or features in any other aspect.
Various aspects of the invention will now be described by way of example only, and with reference to the accompanying drawings, of which:
Embodiments described in the following may address the bottleneck of known time of flight based imaging techniques, for example lidar techniques, by calculating a compressed representation, which may also be referred to as a sketch, of a time distribution of photon detection events. In particular, rather than construct and transfer a full time distribution of arrival times of photons (for example, a histogram) a compressed representation is generated for storage or transfer. As described in further detail in the following, the compressed representation provides on-the-fly summary statistics of the photon time stamps, based on samples of a characteristic function of a time distribution.
A number of advantages may be provided by the following embodiments. For example, the size of the sketch may scale with the number of degrees of freedom of the time of flight (ToF) model (for example, this may be the number of objects in the scene) rather than the number of photons or the fineness of the time resolution. This scaling may be possible without sacrificing precision in depth. The sketch may be computed for each incoming photon in an online fashion, only requiring a minimal amount of additional computation which can be efficiently performed in-chip. The sketch has sufficient information to provide an estimate of one or more parameters of the underlying distribution. It may be shown that the sketch captures all the salient information of the histogram, including the ability to compensate for background light or dark count effects, in a compact and data efficient form, suitable for either in-chip processing or for efficient data transfer for subsequent off-chip post processing.
Furthermore, embodiments allow off-chip image reconstruction algorithms to be developed in which the computational complexity is dependent on the size of the sketch. It has been found that, for some embodiments, resources used for data transfer may be reduced by up to and more than 30 times over transferring individual photon timing information or histogram counts while retaining almost full statistical efficiency. In the case of a simple two-component sketch (using polystyrene head data, as discussed with reference to, for example,
Embodiments described in the following, aim to acquire a compressed set of statistics. The compressed set of statistics may be, for example, Fourier series (FS) measurements of a photon arrival times over a set of pulses separately per pixel. From these statistics the reflection distances measured at each pixel can be calculated making use of the known SPAD temporal response function. Working per pixel makes separable processing possible and allows for low complexity acquisition that may be included on the back of the imaging pixel.
The lidar based imaging system of
Following illumination of the scene 14 by the illumination source 12, light is detected by the sensor device 10. As described in further detail with reference to
In known lidar-based imaging systems, the sensor device 10 produces data allowing a histogram of the arrival times of photons to be constructed. The histogram is representative of a time distribution of detected photons. In general, peaks of the histogram correspond to signal photons such that the location of a peak of the histogram can be used to determine a measure of distance between the sensor device 10 and the targets 16, allowing an image of the scene to be reconstructed. In contrast to known lidar-based imaging systems, and as described in further detail in the following, a sensor device 10 provided in accordance with embodiments is not configured to determine the full histogram representative of the time distribution but rather to generate a compressed representation of the time distribution. In other embodiments, the sensor device may first generate the full histogram and, rather than transfer the full histogram off the device, may form a compressed representation of the full histogram such that only the compressed representation is transferred off the device.
Turning to
Pixel device 20a has a photon detector, in the present embodiment, the photon detector is a single SPAD 22. Associated with the SPAD 22 is dedicated time-stamp processing circuitry 24, for example a Time to Digital Converter (TDC) and processing circuitry 26. The SPAD 22 is configured to produce photon detection signals in response to photon detection events. The TDC 24 processes the photon detection signals and provides photon detection event signals, for example in the form of time stamp data, digital timing signals or other representation of the time an event occurs. The processing circuitry 26 is provided per pixel to process photon detection event signals to perform a sketching process using the photon detection event signals.
In the presently described embodiment, each pixel device, for example, pixel 20a, has a single SPAD. However, it will be understood that in other embodiments, each pixel device has a SPAD array comprising a plurality of SPADs.
In some embodiments, the processing circuitry 26 is provided per pixel. In other embodiments, the processing circuitry 26 is provided as a shared component, for example, the processing circuitry or part of the processing circuitry is provided as part of a further processing resource configured to process photon detection event signals received from more than one or all pixels. In some embodiments, the TDC or at least some components of a TDC is shared between pixels. In such embodiments, timestamps or other timing information for a plurality of pixels may be generated by a common TDC or by TDCs sharing at least some common components.
The sensor 10 also includes a communications resource 28 in the form, in this embodiment, of a sensor output bus. The sensor output bus enables transmission of data from the sensor 10 to an external processing resource (not shown in
With reference to the SPAD 22, a time of arrival of a single photon is registered as a change in state of the SPAD, for example from an un-activated state to an activated state. Following activation, the SPAD has an intrinsic dead-time period that typically lasts 10s of nanoseconds. After the dead time period has expired the SPAD resets to its original, un-activated state.
As described above, contrary to known systems, rather than use the photon detection signals to produce a full representation of a time distribution for detected photons, a compressed representation, which may also be referred to as a sketch is produced. The processing circuitry 26 is configured to perform the sketching process. As part of the sketching process, the processing circuitry 26 is configured to obtain one or more values, herein referred to as feature values, using one or more feature functions.
The processing circuitry 26 may obtain output values from the feature values using different methods. In the present embodiment, the processing circuitry 26 calculates feature values using the detection time in accordance with any suitable algorithm. A non-limiting example of a suitable algorithm is a CORDIC algorithm.
The plurality of feature values are returned at a high rate, in the present embodiment, and this may be performed using an appropriate approximation of the feature function. In the present embodiment, the compressed representation is calculated by taking the running total (or an average) of multiple nonlinear functions of the detection time, tD, i.e. the trigonometric functions: sin (ωitD) or cos (ωitD) are used for a range of m frequencies, wi=ω1, . . . , ωm. These functions may be computed at a very high rate and in practice may be computed only approximately.
In other embodiments, a readable memory resource (not shown in
Regarding the feature function, the following comments regarding the feature function are provided. Further comments regarding the theoretical background and feature function are provided later in the specification. The one or more feature functions have a number of properties that allow for a compressed representation that provides sufficient information for the determination of parameters of the underlying time distribution.
In the present embodiment, a single, multivariate feature function is described that outputs a plurality of feature values. The single, multivariate feature function is a function of both time and frequency. It will be understood that such a single, multivariate feature function can equivalently be represented as a group of functions i.e. a group of time dependent functions at different frequencies. In the present embodiment, the equivalent group of functions are sinusoidal functions sin (ωitD) or cos (ωitD) for ωi=ω1, . . . , ωm.
In further detail, in the present embodiment, the multi-variate function is representative of a mapping between detection time tD and frequency ωi for a range of frequencies. For a single frequency ωi, the multi-variate function maps detection time to two real output values sin (ωitD) and cos (ωitD) . In further detail, for each frequency, the multi-variate function maps the detection time to the two real output values, and therefore, for m frequencies, the multi-variate function maps the time to 2m real values. It will be understood that, for a particular frequency, the output can be represented as a two-component vector or as a complex number. For a choice of m frequencies, the multi-variate feature function maps detection time onto 2m real output values sin (ωitD) or cos (ωitD) for ωi=ω1, . . . , ωm. The output from the multi-variate function is represented by a feature vector, in this case, as a 2m component vector where each component is a real number. Other representations of the feature vector are possible, for example, the feature vector could be represented as m complex values components.
While trigonometric functions are described above, these are provided as an example of a suitable non-linear, continuous and periodic function that can be used as part of the sketching process. It will be understood that other feature functions with similar properties, for example, non-linear periodic functions can be used. Other transforms, for example, triangular waves of different frequencies or structured time-domain transforms, may provide suitable properties. Other functions are suitable. For example, periodic triangular or square functions can be used. In addition, any harmonic periodic functions (where the harmonic is defined with reference to a time gated window) or a periodic complex representation may be suitable. The feature functions may be considered as mapping timing information to a phase representation such that when the feature values are combined, the phases for background events substantially cancel.
In particular, the one or more feature functions are such that the output of the one or more feature functions, the feature values, corresponding to signal photon detection events provide a larger contribution to the compressed representation than output of the one or more feature functions corresponding to background signal events. As described in the following, in some embodiments, the feature function also has the property that the sketching process includes selecting frequencies for the feature function such that the expected value for background photon detection events is substantially zero.
It will be understood that while different representation for the one or more feature functions may be suitable, the feature functions have properties such that the combination of their output i.e. the plurality of feature values, preserves signal information and/or supresses background information and/or distinguishes signal information from background information in the compressed representation.
In further embodiments, the feature functions may be a polynomial function or a piecewise polynomial function or a spline function. The feature functions may be a piecewise low order approximation of a sinusoidal function.
The one or more feature functions are time-dependent, and therefore the feature values obtained by the processing circuitry 24 depend on the detection time of the photon detection event. As the one or more feature functions either comprise a function that is dependent on frequency or a group of functions characterised by different frequencies, part of the sketching process requires a selection of one or more frequencies for the one or more feature functions. It has been found that particular choices of frequencies may provide additional advantages. In particular, for a pre-determined detection time period and where the one or more feature function is a periodic function characterised by a frequency, a suitable choice of frequency is a multiple of the inverse of the pre-determined detection time period.
In further detail, it is known to introduce a time window over which photon detection occurs (in lidar this is referred to as time gating). The time window may also be referred to as the acquisition time or detection time. The selected frequencies are dependent on the size of this time window, duration T, using trigonometric functions with frequencies that are a harmonics of 1/T. Using such frequencies for the one or more feature functions allows the combination of outputs to be “blind” to the background count. This provides a sketch that can be used to estimate range and reflectivity without considering the background count. In the simplest case, discussed in the theoretical background section, a sketch of size 2 is used to calculate the “circular mean” to calculate a single range.
In particular, selection of frequencies that correspond to the set of Fourier components of with frequencies that are a harmonics of 1/T provides sample values that are zero for background photon detection events. Therefore, this choice of frequencies eliminates the expected contribution to the compressed representation from background photon detection events to zero. In other embodiments, choices of frequency substantially supress the contribution to the compressed representation. There is a freedom in the choice of which frequencies to use. In one non-limiting example, the first m complex trigonometric functions over a time gated window are used. In a further non-limiting example, a set of trigonometric functions with harmonic frequencies are chosen. The choice of a larger set of trigonometric functions i.e. using a larger number of selected frequencies, allows a compressed representation for a more complex distribution over time to be determined. Further discussion regarding the selection of frequencies is provided in the following and in the theoretical background section.
While a selection of frequencies that allow construction of a compressed representation that is blind to the background, in some embodiments, background information should be retained. In such embodiments, the total photon count is calculated and stored. For a suitable choice of feature function, this count corresponds to the zero component frequency. This may be used, for example, in conjunction with the background blind sketch for the detection of the absence/presence of one or more targets, or to improve the range and refection estimation.
At step 302, timing information is obtained. The timing information is representative of a distribution over time of photon detection events. For the purposes of discussion of the present method, the timing information comprises a plurality of photon detection times.
For each photon detection time, a plurality of feature values is obtained using the feature function (in this embodiment, a multi-variate feature function discussed above). In the present embodiment, as discussed above, the feature function is a multi-variate representation of a family of trigonometric functions having different frequencies, such that, for a single detection time, the multi-variate function outputs a plurality of feature values. The frequencies are pre-determined, as discussed in further detail with reference to
It will be understood that feature values can be determined using different feature functions, different timing information or different hardware configurations. Further examples are provided with reference to
At step 306, a feature vector is formed. The feature vector comprises the outputs of the feature function and is a representation of the feature values obtained using the feature function. The feature vector has a size dependent on the number of frequencies selected. In this embodiment, as m frequencies are selected, 2m feature values are obtained and therefore the feature vector is a 1×2m dimensional array where each component is a real valued component.
At step 308, an intermediate instance of the compressed representation (also referred to as the intermediate compressed representation) is updated using the values of the feature vector determined at step 306. In the present embodiment, the feature vector values are combined with the intermediate compressed representation. In particular, the feature vector values are added to components of the intermediate compressed representation. In the present embodiment, the compressed representation is a 1×2m dimensional array and the value of each component of the current sketch is updated by adding the corresponding component of the 1×2m dimensional feature vector.
It will be understood that, for the first iteration, the intermediate compressed representation will be set to have zero value components such that the intermediate compressed representation of the method of
The method of
For the purposes of the following description, the method is considered as occurring over a predefined detection period. The detection of photons is time gated such that photons are only detected during a predefined time window corresponding to the predefined detection period. The detection period can be varied by setting an operational parameter. At step 402, at the beginning of the detection period, the illumination source 12 illuminates the scene 14 to be imaged. As described with reference to
In response to the illumination of the scene 14, photon detection events occur. In further detail, some light from illumination source 12 that is emitted into scene 14 is reflected by surfaces of targets and directed back to sensor device 10. In addition, background or ambient light is also present throughout the environment and is incident on the sensor device 10. Therefore, the photons of the light incident on sensor device 10 can be considered as a combination of signal photons, reflected from targets 16, photons and a plurality of background photons. The time distribution is over the detection period. The time distribution is dependent on transmission and reflection properties of the surfaces of the targets 16 in the scene 14.
The photon detection events are distributed over time in accordance with an underlying time distribution. The underlying time distribution is in accordance with the target scene and the SPAD impulse response function. Knowledge of the underlying time distribution allows a distance between the sensor device 10 and the illuminated surfaces to be determined. The underlying distribution over time therefore has a relationship with the scene being imaged and may therefore be considered as an inherent distribution of the scene being imaged.
In the present embodiment, as described in the following, a process is performed by the sensor device 10, per-pixel. In the present embodiment, the process is performed per-pixel, for each photon detection event over the detection period, as represented at step 404.
At step 406, a photon detection signal is produced by the SPAD 22 for each photon detection event. The photon detection event is characterised by a detection time corresponding to the time the photon is detected by the SPAD 22.
At step 408, the TDC 24 associated with the SPAD 22 produces a photon detection event signal by processing the photon detection signal. The photon detection event signal comprises time data representative of the detection time. The time stamp data provides timing information for the sketching process.
At step 410, the time stamp data from each photon detection event signal is used as part of a sketching process performed by processing circuitry 26. In the present embodiment, the photon detection event signal including the time stamp data is provided to the processing circuitry 26. The processing circuitry 26 uses the detection time or a signal that is representative of the detection time to obtain the plurality of feature values for the photon detection event signal, substantially as described with reference to
At step 412 the obtained feature values are combined as part of a combining process performed by processing circuitry 26. In the present embodiment, the obtained feature values are combined with an intermediate compressed representation to update the intermediate compressed representation, as described with reference to step 308 of
At step 414, the method returns to step 404 for the subsequent photon detection event in the detection period. In some embodiments, the method is performed until a pre-determined number of photon detection events occur. In such embodiments, a running total of the number of photon detection events processed is also stored until the pre-determined number of photon detection events is met. In some embodiments, the detection period has a pre-determine duration. These parameters can be selected by a user.
At step 416, a final compressed representation of the underlying distribution over time of the photon detection events, per pixel is provided. At this step, the generated compressed representation is transferred off the device using the communication resource 28.
Transmission of the compressed representation uses less transmission capacity than transmission of the plurality of photon detection event signals. The subsequent storage of the compressed representation uses less storage capacity than the storage of the plurality of photon detection event signals. In contrast to known methods, as only the compressed representation is transferred/stored in the above-described embodiment, as opposed to a full histogram, the transfer and/or storage of the compressed representation uses less communication and/or storage capacity that transfer and/or storage of the full histogram.
At step 418, the transferred compressed representation is used to obtain one or more parameters of the distribution over time. In particular, the compressed representation has sufficient information to allow an estimate of distance and intensity between sensor device 10 and targets to be determined. The techniques used to extract distance and intensity from the compressed representation are provided at the theoretical background section.
At step 416, the final compressed representation is transferred off the device to be processed using a parameter estimation process. It will be understood that, in other embodiments, the parameter estimation process can be performed by a processing resource of the sensing device such that only estimated parameters are stored or transferred (rather that the compressed representation).
In the above described embodiments, the raw measurements are not compressed in an invertible way. Rather, only the underlying statistical properties are captured. In some embodiments, properties of Fourier series (FS) measurements are exploited. FS averages have the special property that they explicitly remove any bias induced by background count, due to dark counts or background illumination. Essentially these measurements are blind to the background illumination and therefore may remove one of the biggest challenges in ToF imaging.
While in one embodiment, random FS averages are used, in other embodiments, the lowest frequency FS averages are used. In the simplest embodiment, a simple sine/cosine average pair of measurements (the first order Fourier Series component) are used to calculate a single reflection time of flight without background count bias. Multiple reflections per pixel (e.g. when imaging through glass) may be measured by taking a few FS averages: less than 20 may give sub bin accuracy.
In the above described embodiment, the pixel 20a is described as having processing circuitry for generating digital time stamp signals also referred to as photon detection event signals and this time stamp data is used to obtain a plurality of feature values. It will be understood that, in alternative embodiments, an alternative processing resource and/or hardware setup for obtaining feature values for a given photon detection event can be provided. A non-limiting example of an alternative setup is provided at
Performance of a sketching process using the device of
During the sample and hold process, the value of each sampled signal is digitised, for example, using an analogue to digital converter to give the feature values. In this way, the feature values are obtained. As described with reference to
In the above described embodiments, a method of obtaining a compressed representation in which timing information, in the form of detection times for each photon detection event is used. In other embodiments, aggregated or averaged timing information can be used to determine the compressed representation.
For this method, the sensing device is configured to generate a high-resolution histogram 602 over an acquisition time. For example, a SPAD of the device generates photon detection signals that are converted into time stamp events by the TDC and a histogram generation method is performed using the time stamp data. Known histogram generation methods can be used.
Following the acquisition time, a sketching process is then performed on the histogram 602. In this method, the bin-center for each histogram bin is first determined at step 606, using known method. Each bin-center value is provided to the multi-variate feature function (608) to obtain a plurality of feature vectors (610), each feature vector corresponding to a histogram bin. The process of obtaining a feature vector from timing information (in this case, a bin center) is substantially as described with reference to
For each bin, a weighted multiplication is performed using the photon count (604) for the bin and the feature vector (610) to produce weighted feature vectors. The weighted feature vectors are then summed together to produce a compact representation for the weighted histogram. In contrast to the method of
It will be understood that this method does not provide a reduction in storage on chip, as the histogram must be stored prior to performing the sketching process, however, this method will provide a reduction in data transfer off the device, as only the compressed representation is transferred.
The following comments are provided to describe the theoretical background of this work. In the following, solutions to the bottleneck of existing lidar techniques by calculating on-the-fly summary statistics of the photon time stamps, a so-called sketch, also referred to as a compressed representation, based on samples of the characteristic function of the ToF model, is explored. Distinct to compressive sensing, the goal is not to recover the photon counting data but rather the underlying probability distribution. In this sense, the probability model is directly estimated from a summary statistic and therefore the framework utilises much of the theory found in the generalised method of moments, empirical characteristic function and compressive learning literature. The size of the sketch scales with the degrees of freedom of the ToF model (i.e., number of objects in depth) and not with the number of photons or the fineness of the time resolution, without sacrificing precision in depth.
The sketch can be computed for each incoming photon in an online fashion, only requiring a minimal amount of additional computation which can be performed efficiently on-chip. The sketch can be shown to capture all the salient information of the histogram, including the ability to explicitly remove background light or dark count effects, in a compact and data efficient form, suitable for both on-chip processing or off-chip post processing. Furthermore, a compressive lidar image reconstruction algorithm is developed which has computational complexity dependent only on the size of the sketch. The described method paves the way for high accuracy 3D imaging at fast frame rates with low power consumption. In summary, the main contributions of this work may be: a principled approach for compressing time-of-flight information in an online fashion without the requirement to form a histogram and without compromising depth resolution; a compressive single-photon lidar algorithm is proposed which does not scale with either the number of photons or the time stamp resolution in terms of space and time complexity; statistical efficiency, given a compression rate (or sketch size), is quantified for different single-photon lidar scenarios, showing that only limited measurements of the characteristic function are needed to achieve negligible information loss.
The photon count at time stamp t∈[0, T−1] for an arbitrary pixel can be modelled as a Poisson distribution
t
|(r,b,tk)˜(rh(t−tk)+b), (1)
where r≥0 denotes the reflectivity of the detected surface, h(·) is the impulse response of the system and b defines the level of background photons. The number of discretized time stamp bins over the range of interest is denoted by T. The time stamp t is discretized over the range [0, T−1] dependent on the time-stamp resolution Δt. For simplicity, here it is assumed that the integral of the impulse response H=Σt=1Th(t) is constant although the proposed approach can accommodate more complex scenarios. If the lidar system is in free running mode where multiple acquisitions of a surface/object are obtained, then the interval [0, T−1] can be thought of as circular in the sense that time-stamp T is equivalent to the time-stamp0.
Alternatively, one can instead model the time of arrival of the pth photon detected. It is assumed that there are K distinct reflecting surfaces, where αk and α0 denote the probability that the detected photon originated from the kth surface and background sources, respectively. Let xp∈[0, T−1] denote the time stamp of the pth photon where 1≤p≤n, then xp can be described by a mixture distribution:
π(xp|α0, . . . , αK, t1, . . . , tK)=Σk=1Kαkπs(xp|tk)+α0πb(xp) (2)
where Σk=0Kαk=1. The distribution of the photons originating from the signal and background are defined by πs(xp|t)=h(xp−t)/H and the uniform distribution πb(xp|t)=1/T over [0, T−1], respectively. Often in practice, the signal distribution πs is modelled either using a discretized Gaussian distribution over the interval [0, T−1] or through the data driven impulse function which is calculated through experiments.
The acquisition goal is to obtain parameter estimates of the signal model in Eqn (2), given the time-stamp of photons detected. Parameter estimation usually involves the inference of a set of parameters θ∈Θ⊂2k+1 associated to a probability model π(·|θ) defined on some space x∈d. In the case of single-photon lidar, the dimension d=1. Typically, a finite dataset ={xi}i=1n of n samples is observed, which is assumed is sampled from the distribution given in Eqn. 2. Maximum likelihood estimation (MLE) is a traditional parameter estimation method whereby a likelihood function associated with the finite data is maximized with respect to the model parameters, e.g.
1) Generalised Method of Moments: In some cases, the likelihood function might not have a closed form solution nor a computationally tractable approximation. Generalised method of moments (GeMM) is an alternative parameter estimation method where one estimates θ by matching a collection of generalised moments with an empirical counterpart computed over a set of finite data sampled from the distribution π(x|θ). Given a nonlinear function g: d→m, then the expectation constraint is defined as
θg(x; θ0)=0 (4)
where θ denotes the expectation with respect to the probability distribution π(x|θ). Typically, the GeMM estimator is obtained by minimising a quadratic cost of the empirical discrepancy with respect to θ to try impose the moment constraints of (4). Let us define
calculated over ={xi}i=1n, then a GeMM classically takes the form:
where W is a symmetric positive definite weighting matrix that may depend on a θ.
2) Compressive Learning: Building on the concept of GeMM compressive learning utilises generalised moments of the data but with the distinct goal of reducing signal acquisition, space and time complexities. The link to GeMM is established by separating the function g into the following particular form:
g(x; θ)=Φ(x)−θΦ(x) (7)
where Φ: d→m is often referred to as the multi-variate feature function.
The separable form decouples the measured moments, Φ(x), from the parameters θ that are to be estimated. This is not a usual assumption in GeMM, although it may arise in particular cases. By denoting the empirical mean or the so-called sketch as
θ can be estimated solely from the sketch zn, by minimising
which is the particular compressive GeMM loss of Eqn (6). Elsewhere, the weighting matrix W for compressive single-photon lidar is defined.
The separable form of g in Eqn (7) allows a sketch statistic zn to be formed with a single pass of the data without the need to store the full dataset ={xi}i=1n and it can easily be updated on the fly with minimal computational cost. The sketch statistic has size m, or size 2m if decoupled into its real and imaginary components, which, fundamentally, scales independent of the dimensions of the dataset which in the case of single-photon lidar is the photon count n or the binning resolution T.
3) Empirical Characteristic Function: A specific type of GeMM is the empirical characteristic function (ECF) estimation, and occurs when the generalized moment is chosen to be Φ(x)=[e−iω
For a single depth observation model in Eqn (2) (i.e. K=1) and an impulse response function h, the characteristic function of the observation model is defined by
where sinc(x)=sin (x)/x and ĥ denotes the (discrete) Fourier transform of the impulse response function h. It should be noted that different distributions πb, and hence CFs, to model the detected photons originating from more complex background sources, although this is beyond the scope of this paper.
The feature function Φ(x) is a complex valued function of size m. With regards to hardware implementation, it is often preferable and convenient to work directly with real valued functions. The complex term eiωx can be alternatively written as cos (ωx)+i sin (ωx), where eiωx has been decoupled into its real and imaginary components. As a result, the feature function Φ can be equivalently written as a real valued feature function Φ: d2m, consisting of 2m real valued terms by stacking the real and complex components, for e.g.
For sake of fair comparison to existing hardware implemented methods in the literature, the results and figures presented represent a sketch of size 2m, consisting of 2m real valued measurements. The nature of the feature function, in terms of it being represented by a complex or real valued function, will be made clear in its context throughout the text.
The potential of using a sketch for single-photon lidar and to motivate the design of the sketch sampling procedure, which will be discussed in the following section.
In the absence of photons originating from background sources (i.e. SBR=∞) and the presence of a single surface or object, the sample mean of all the photon time-stamps (Φ(x)=x) is the simplest summary statistic for estimating the single location parameter t1. This only holds in the noiseless case as the sample mean estimate is heavily biased toward the centre of the histogram when background photons are detected.
Suppose, instead the cosine and sine of each photon count (x) is observed with angular frequency ω=2π/T , namely the feature function at this frequency is:
and denote zn the real valued sketch of size 2 computed over the dataset as in Eqn 8. It is possible to recover an estimate of the single depth location parameter t1 directly from the sketch, via the trigonometric mean
where arg denotes the complex argument. As the background photons are distributed uniformly over the interval
the expected moment of the photons originating from background sources is zero, x˜π
A simulated example is provided. A pixel of T=1000 histogram bins with a signal-to-background ratio (SBR) of 1 and a total of n=600 photons is simulated, where the time stamp of each photon is denoted by ={xi}i=1n. The data was simulated using a Gaussian impulse response function with σ=15 and a true position at time stamp t1=320. Computing the sketch zn from Eqn 8 and using Eqn. 12 the sketch estimate {circumflex over (t)}cm=323.3 is obtained and the sample mean estimate of {circumflex over (t)}=434.1 is obtained. The TCSPC histogram along with both the circular and standard mean estimates as well as the location parameter t1 are shown in
Importantly, the sketch formed using the moment in Eqn 11 is equivalent to the ECF sketch
sampled at
decoupled into both its real and complex parts. In fact, the estimate {circumflex over (t)}1 in Eqn 12 is the optimal estimator to the compressive ECF sketch detailed in Eqn 9. Principally, for this example, only 2 values need to be stored/transferred to estimate accurately the depth location of the object or surface, without the requirement to resort to the original photon time-stamped data.
For the remainder of this section, this example approach of forming a sketch is generalized to arbitrary size and sampling the ECF at multiple frequencies. This enables statistically efficient estimates for the single surface case to be obtained and to solve more complex lidar scenes including several surfaces with varying intensities where more salient information of the observation model is required.
Recall that the observation model π in Eqn 2 is discretized over the interval [0, T−1] which therefore can be considered to be a sufficient sampling if the distribution in Eqn 2 is approximately bandlimited. As a result, the characteristic function Ψπ(ω) has a finite basis characterized by the set of frequencies
The approach described above can be generalized by sampling multiple frequencies from the finite basis in order to construct the ECF sketch. As is the case for the circular mean, the frequencies ω=2πj/T for j∈[0, T−1] correspond to the zeros in the sinc function associated with the background pdf πb seen in Eqn (10). Therefore a sketch of arbitrary dimension that is also blind to photons originating from background sources can be constructed by avoiding the zero frequency ω=0 of the finite basis. As a result, the set of orthogonal frequencies are defined by
This set is referred to as the orthogonal frequencies as it defines regions over the interval of the observation model's characteristic function
Sampling Schemes: In order to construct a sketch, sufficient salient information of the characteristic function Ψπ should be retained such that identification and estimation of the unique location and intensity parameters θ of the observation model π(x|θ) defined in Eqn 2 is possible. It was discussed previously that the CF of a probability distribution decays in frequency. Furthermore, as the observation model is discretized over the interval, it is assumed that the characteristic function of the observation model is approximately band-limited. A natural sampling scheme would therefore be to sample the ECF at the first m frequencies of the orthogonal frequencies Ω to capture the maximum energy of the CF. In other words, the CF of the observation model can be truncated whilst avoiding the zero frequency.
Alternatively, provable guarantees for estimating mixture of Gaussian models may be provided, under certain conditions based on random sampling (cf. compressive sensing) of the CF. It is understood that the higher frequencies of the CF may provide further information to help discriminate distributions that are close in probability space. Moreover, if the CF decays slowly in frequency then the energy of the CF will be spread more throughout the set of orthogonal frequencies. Therefore an alternative sampling scheme is provided in which the set of orthogonal frequencies are randomly sampled with respect to some sampling law Λ. Orthogonal frequencies are sampled by:
(ω1, ω2, . . . , ωm)˜Λĥ (15)
where Λĥ denotes the cumulant distribution function of ĥ. To formalize, the following sampling schemes are described in order to construct the sketches:
Depending on the circumstances of the lidar device one or the other sampling scheme may perform better. It has been found that this is the case with both synthetic and real datasets.
Once the ECF sketch is constructed using either sampling scheme, the parameters θ of the observation model π(x|θ) are estimated from the obtained sketched (compressed representation), denoted zn. In general, there is no closed form expression to estimate θ from the sketch of arbitrary size as is the case for the circular mean estimate in Eqn 12. It is well documented in the ECF and GeMM literature that a complex valued ECF sketch zn of size m (or size 2m if decoupled into its real and complex components), computed over a finite dataset satisfies the central limit theorem. Formally, a complex valued sketch zn∈m converges asymptotically to a Gaussian random variable.
where Σθ∈m×m has entries (Σθ)ij=Ψπ(ωi+ωj)−Ψπ(ωi)Ψi(ωj)for i,j=1,2 . . . , m.
The asymptotic normality result in Eqn 16 naturally leads to a sketch maximum likelihood estimation (SMLE) algorithm that consists of minimising the following:
where for convenience zθ=[Ψπ(ωj)]j=1m. For an observation model consisting of K surfaces and a general impulse response function h, recall that
and θ=(α1, . . . , αK, t1, . . . , tK). Note that the sinc function is dropped on the assumption that one of the proposed sampling schemes is being used. Minimising Eqn 17 is equivalent to minimising the compressive GeMM objective function defined in Eqn 9 with the weighting function chosen to be W=Σθ−1. The weighting matrix W is asymptotically optimal in the sense that it minimises the variance of the estimator {circumflex over (θ)} from the sketch. In practice Σθ is θ dependent as it is a function of the underlying parameters θ that are to be estimated. There are various well established methods in the GeMM and ECF literature that tackle the difficulty of approximating Σθ and estimating θ simultaneously. For example, the K-L method may be used which iteratively estimates Σθ and θ in a two stage procedure by fixing and updating one at a time. Some known methods fix Σθ after only a few iterations of the K-L approach to reduce the computational complexity of the algorithm, although this typically comes at the cost of introducing sample bias. Occasionally, the covariance matrix is set throughout to be the identity, Σθ=1, reducing Eqn 17 to a standard least squares optimization but this generally results in a less statistically efficient estimator.
In this work, the method of estimating Σθ and θ simultaneously at each iteration is adopted. This approach is commonly referred to as Continuous Updating Estimator (CUE) and obtains estimates that do not produce sample bias like the two-step K-L approach and can often lead to more statistically efficient estimators. However, the SMLE method is not restricted to the CUE and in certain situations practitioners may choose to sacrifice unbiased and efficiently optimal estimators for a reduced computational complexity by considering the other methods discussed.
The optimisation problem in Eqn 17 is also typically non-convex and can suffer from spurious local minima. For the case when there is only a single surface, the SMLE algorithm is initialised using the analytic circular mean solution in Eqn 12 with minimal added computational overhead. From our experience with synthetic and real data, the circular mean estimate generally initialises the SMLE algorithm within the basin of the global minima, hence the issues associated with non-convex optimization are circumvented. For the case of multiple surfaces, a coarse uniform grid is formed across [0, T−1]K and initialised at the smallest SMLE loss.
One of the main advantages of the SMLE lidar approach from Eqn. 16 is that even at low photon levels (i.e. small n), the SMLE estimates quickly follow the central limit theorem (CLT) and provide a good approximation of its expectation. In contrast, known TCSPC histogram methods may be a poor approximation to its expectation as each time stamp bin t has only a small number of photons. Thus efficient processing of the full histogram data requires careful consideration of the underlying Poisson statistics. It was found that the estimate of the time parameter satisfies the central limit theorem with respect to photon count n, and that the sketched lidar CLT results of Eqn. 16 hold even for low photons levels, therefore the SMLE loss of Eqn. 17 is a well-justified loss to minimize.
It was found during the course of this work, that only a moderate real valued sketch of size 2m is needed to achieve negligible loss of information. The results are based on the asymptotic normality property discussed in Eqn 16, and, it was found that in practice this normality result holds even for small photon counts of n=10.
The following comments regarding experimental results are also provided. The compressive lidar framework described was evaluated on synthetic and real data with increasingly complex scenes. The method is compared with classical algorithms working on the full data space (i.e. no compression) namely log-matched filtering and expectation maximization (EM). Moreover, the results are also compared to the alternative compression technique of a known coarse binning. Both the log-matched filtering and EM algorithms estimate the location parameters using the full data and therefore the results obtained from these methods set a benchmark to the estimation accuracy when no compression takes place. The real valued sketch consisting of 2m real value measurements is used in the following results for the sake of fair comparison.
1) Processing: Restoration of depth imaging of single-photon lidar consists of estimating a 3D point cloud from a lidar data cube containing the number of photons ni,j,t in pixel (i, j) at time stamp t, where i∈[1, . . . , Nr], j∈[1, . . . , Nc] and t∈[0, T−1]. The average photon count for the whole data cube is denoted by
Each pixel (i, j) of the data cube is processed to estimate the true location and intensity parameter, denoted t1 and α, respectively. A data driven impulse response is given for each dataset and the characteristic function of the impulse response function can be obtained by using Eqn. 10.
2) Evaluation Metrics: Two different error metrics are used to evaluate the performance of our proposed sketched lidar framework. The root mean squared error (RMSE) between the reconstructed image and the ground truth is considered. Given that ti,j,k is the location of the kth peak in pixel (i, j) and {circumflex over (t)}i,j,k is the estimated counterpart, then the root mean squared error of the reconstructed image is
Secondly, the percentage of true detections Ftrue(τ) is compared with a resolution τ. That is, a true detection occurs if |ti,j,k−{circumflex over (t)}i,j,k|≤τ. When plotted as a function of τ, this metric quantifies the variability of the estimate.
The compression of both the sketched lidar and coarse binning approach is measured in terms of the dimension reduction achieved by the statistic with respect to the raw TCSPC data and is quantified by the metric
which is dependent on the dimensions, T and n, of the lidar scene. As previously discussed in the Theoretical Background section, we use the real-valued sketch of size 2m for sake of fair comparison with the technique of coarse binning.
The sketched lidar framework was also evaluated on two real datasets of increasing complexity. Firstly, a polystyrene head was imaged which consists mostly of a single peak and secondly, a scene where two humans are standing behind a camouflaged net, which contains 2 objects per pixel with varying intensity.
In above described embodiments, a model of the distribution over time based on an impulse response function or an empirical approximation of the impulse response function. In some embodiments, the impulse response function used is dependent on intensity of the signal received (for example, the intensity of the reflected light). In such embodiments, the shape or size of the impulse response function is dependent on the intensity. The most suited impulse response function may have a different functional form for different targets and scenes. As a first non-limiting example, light received from an example target may be modelled best by a sharply peaked function, for example, a delta function or a functional approximation. As a second non-limiting example, the target may reflect light at a lower intensity, such that light received from the target may be modelled by a wider peak. It has been found that using an intensity dependent model of the distribution over time, problems associated with Range-Walk (for example, including intensity dependent pile-up effects due to the SPAD detector dead time) may be addressed.
In the above described embodiments, a number of different functional forms for feature functions are described that have one or more properties such that the combination of feature values generated by the feature functions preserves signal information and/or supresses background information and/or distinguishes signal information from background information in the compressed representation. Examples provided above include functions that have properties such as periodicity and are non-linear (for example, periodic pairs of cosine and sine functions over a number of different frequencies). As described above, properties such as these allow compressed representations (sketches) to retain sufficient information to allow reconstruction of the time distribution off the device. Such feature functions may include sinusoidal feature functions.
In such embodiments, as described above, feature functions are used to generate feature values on the processing resource of the device (for example, by sampling a generated signal or using a suitable algorithm). As such, a low complexity approximation of the feature functions may be desirable, such as a CORDIC approximation, as described above.
In further embodiments, it has been found that approximating sinusoidal feature functions using local polynomial approximating functions (referred to in the following as spline functions) may offer additional advantages. The spline functions may be considered to form a basis set across a time period and may also be referred to as basis functions It has further been found that the set of spline functions have suitable properties that allow a compressed representation (sketch) to be formed either with or without the step of approximating the sinusoidal features.
In such embodiments, a set of N spline functions are used to form the compressed representation. The spline functions may also be referred to as spline feature functions and are piecewise polynomial functions. Each spline function has a local, functional portion (a polynomial portion) centred about a respective time. This portion is referred to a local portion in the following but may also be referred to a functional portion or a polynomial portion. In some embodiments, for example, those depicted in
In
In
The spline functions can be used to approximate sinusoidal or other periodic, non-linear functions that corresponding to the feature functions of to the embodiments described above. The spline functions can be interpolated to form piecewise polynomial functions.
As can be seen from
The right hand side of
It will be understood that, as was described with reference to other embodiments, the T of
The spline functions and their output can be used to form the compressed representation either by approximating non-linear, periodic functions or directly. In some embodiments the parameter estimation process may apply suitable algorithms directly to the compressed representation formed by the output of the spline functions, rather than approximating the sinusoidal functions. In such embodiments, the spline feature functions are used to from a distinct sketch independent of their ability to approximate sinusoidal functions. It will be understood that the spline set of functions may capture substantially the same distribution information as the sinusoidal functions.
Using the spline functions to form a compressed representation may offer advantages. For example, each spline function has a polynomial portion localized in time (i.e. non-zero over part of the period T) as opposed to spanning the entire period. Therefore, a smaller number of feature functions are required to be calculated for each photon detection compared to the corresponding sinusoidal feature function which may reduce computational requirements on the device. The number of spline functions returning feature values for each photon detection is one more than the order of the polynomial used (for example, for the linear function, two spline functions return a feature value for a photon detection and for the quadratic function, three spline functions return a feature value per photon detection). The basis functions depicted in
In addition, the set of spline functions can be used to represent the constant function (see the upper most function on the right hand side of
A skilled person will appreciate that variations of the enclosed arrangement are possible without departing from the invention. Accordingly, the above description of the specific embodiments are made by way of example only and not for the purposes of limitations. It will be clear to the skilled person that minor modifications may be made without significant changes to the operation described.
The project leading to this application has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (grant agreement No 694888).
Number | Date | Country | Kind |
---|---|---|---|
2101369.3 | Feb 2021 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2022/050263 | 2/1/2022 | WO |