The present application claims priority to Swedish patent application No. 2350006-9, filed Jan. 10, 2023, entitled “ILLUMINATOR SYSTEM FOR EYE-TRACKING SYSTEM”, and is hereby incorporated by reference in its entirety.
The present invention is directed to an illuminator system for an eye-tracking system for tracking movements of an eye. The illuminator system serves to enable glint detection as well as providing illumination to the eye area.
There is a growing interest in augmented reality (AR) or virtual reality (VR) systems, or more generally extended reality (XR) systems in various applications, including gaming, advertisement, medical, engineering, entertainment and more. These systems may also incorporate tracking of eye movements, i.e., eye tracking.
The illuminator system 200′ serves two main purposes, namely, to provide illumination to the eye area in order to allow a camera to capture images of the eye, and to project light onto the cornea of the eye so as to produce point-like (specular) reflections of the illuminators 210′. These reflections are referred to as “glints”. Depending on the relative position and orientation of the eye ball, some of the reflections as captured by the camera may fall within the iris/pupil region, while some of them may fall in the white of the eye. The corneal reflections that fall within the pupil/iris region are particularly useful as glints.
The purpose of the glints is to serve as a reference relative to which gaze may be determined. That is, the relative position and orientation of the eyeball may be determined by comparing the apparent position of the iris/pupil relative to the positions of the glints as captured by the camera.
As shown in
However, the prior-art illuminator system 200′ shown in
On the contrary, as can be seen in
In any event, because these additional reflections are variable and unpredictable, they are not useful as glints and cannot be used as reference for gaze estimation. Furthermore, in reality, the reflections of the illuminators 210′ may not be perfectly sharp images but may be blurred to some unknown extent. Therefore, the apparent size of the reflections may also be unpredictable and cannot be used as a feature for distinguishing genuine glints from artefacts during image processing. This is especially the case because the illuminators 210′ in prior art systems are simple point light sources and do not have any internal features from which the scaling of the reflections could be determined. For example, especially in the case of
As a result, in prior-art systems, glint detection is unreliable. When glints are incorrectly detected, incorrect gaze estimation may result and reduce the performance of an eye-tracking system. As a further result, in prior-art systems, glint matching is also unreliable. When glints are matched to the incorrect illuminators 210′, gaze estimation performance may also suffer.
There is thus a desire to improve the reliability of glint detection and/or glint matching, and also to reduce the power consumption of eye-tracking systems.
In accordance with the present invention, there is disclosed an illuminator system for an eye-tracking system for tracking movements of an eye, the system comprising a plurality of illuminators; wherein each of the plurality of illuminators is configured to emit light forming a respective predetermined reference pattern; wherein each of the reference patterns, when reflected off the cornea of the eye, produces a respective reflected pattern forming a single glint; wherein the reference patterns of at least two of the plurality of the illuminators are different from each other.
Each of the plurality of illuminators may comprise a light source; and a cover configured to selectively block light emitted by the light source to form the respective predetermined reference pattern.
The cover may part of a casing of the eye-tracking system.
In each of the plurality of illuminators, the illuminator may further comprise a light guide between the light source and the cover, so that light from the light source passes through the light guide before reaching the cover.
At least two of the illuminators may have a shared light source and a shared light guide, so that light from the shared light source passes through the shared light guide before reaching the respective covers of the at least two illuminators.
In each of the plurality of illuminators, the cover may comprise at least one window configured to allow the light forming the predetermined reference pattern to pass therethrough. The at least one window may be optically diffuse.
The one or more window may be configured to diffuse the light passing therethrough to achieve a substantially uniform light intensity.
Each of the plurality of illuminators may comprise a plurality of light sources positioned to form the respective predetermined reference pattern.
All of the light sources in each of the plurality of illuminators may be positioned to form the respective predetermined reference pattern.
Each of the plurality of illuminators may comprise further light sources which are non-operable and do not form part of the respective predetermined reference pattern.
The plurality of light sources may be arranged in a matrix, and the plurality of light sources may be selectively operable to form the predetermined reference pattern.
Optionally, none of the predetermined reference patterns is a scaled or rotated version of another one of the predetermined reference patterns.
Each of the predetermined reference patterns may have no rotational symmetry.
The reference pattern of each of the illuminators may be at least 2 mm in length and width.
The reference pattern of each of the illuminators may have a length which is not equal to its width.
The plurality of illuminators may comprise at least four illuminators.
There is also disclosed an eye-tracking system comprising the above illuminator system.
The eye-tracking system may comprise an extended reality headset; wherein the illuminator system is attached to or integrated in the extended reality headset.
The plurality of illuminators may be arranged to surround the eye.
The eye-tracking system may further comprise an image sensor configured to capture an image of one or more of the glints.
The eye-tracking system may further comprise a controller having stored thereon the predetermined reference patterns; wherein the controller is configured to recognise the one or more glints present in the captured image; compare the one or more glints with the predetermined reference patterns; and associate each of the one or more glints with a matching one of the predetermined reference patterns.
The controller may be further configured to, for each of the one or more glints: determine the apparent size of the glint; determine the glint as genuine if its apparent size falls within a predetermined size range; and determine the glint as an artefact if its apparent size falls outside the predetermined size range.
According to the present invention, there is also disclosed a method of eye tracking comprising: emitting light from a plurality of spatial positions towards an eye, wherein the emitted light from each of the positions forms a respective predetermined reference pattern; the reference patterns are different from one another; and each of the reference patterns, when reflected off the cornea of an eye, produces a respective reflected pattern forming a single glint; capturing an image of one or more of the glints; recognising the one or more glints present in the captured image; comparing the one or more glints with the predetermined reference patterns; and associating each of the one or more glints with a matching one of the predetermined reference patterns.
The method may comprise, for each of the one or more glints: determining the apparent size of the glint; determining the glint as genuine if its apparent size falls within a predetermined size range; and determining the glint as an artefact if its apparent size falls outside the predetermined size range.
According to the present invention, there is also disclosed a computer program comprising instructions to cause the above system to execute the steps of: capturing an image of one or more of the glints; recognising the one or more glints present in the captured image; comparing the one or more glints with the predetermined reference patterns; and associating each of the one or more glints with a matching one of the predetermined reference patterns.
The computer program may comprise further instructions to cause the system to execute the steps of determining the apparent size of the glint; determining the glint as genuine if its apparent size falls within a predetermined size range; and determining the glint as an artefact if its apparent size falls outside the predetermined size range.
There is also disclosed a computer-readable medium having stored thereon the computer program above.
Referring to
As shown in
As shown in
Various known patters may be employed for this purpose. For example, patterns commonly used in fiducial marking may be used as the predetermined reference patterns 211. As known, a fiducial mark is an object placed in the field of view of an imaging system to serve as a point of reference or measure. Fiducials of known pattern and size can serve as real world anchors of location, orientation and scale. They can establish the identity of the scene or objects within the scene. Examples of rectilinear patterns include ARTag, AprilTag and ArUco. ARToolKit is an example of non-rectilinear patterns. In general, any arbitrary pattern may serve as a reference pattern 211 provided that it is distinguishable from other reference patterns 211, and that it produces a reflected pattern that forms a single glint 212 (as opposed to disparate speckles of light).
Each of the predetermined reference patterns 211 may be formed of a single segment of light (i.e. a single continuous region of light), or may be formed as several segments of light (i.e. several discreet regions of light, separated by regions without light). In the case that the reference pattern 211 is formed of multiple segments, the segments may be arranged such that the reflection of the reference pattern 211 forms a glint which is recognisable as a single glint. For example, the single glint may appear as a “mosaic glint”, namely that it has a recognisable mosaic pattern which is distinct from the other reflected patterns. In other words, the reference patters 211 may be arranged such that each of them produces a “glint symbol”, i.e. each reflected pattern may be recognisable as a single symbol rather than multiple disparate patterns. More generally, the reference patterns 211 may be arranged such that each of the reflected patterns has a glint shape or glint structure which allows the reflected pattern to be recognisable as a single glint rather than several disparate patterns.
In
The size of a reference pattern 211 may be defined in terms of the diameter of a bounding circle encompassing the reference pattern 211. Accordingly, the distance between two illuminators 210 may be defined in terms of the centre-to-centre distance between the respective bounding circles.
In the examples of
In the simplest case, the pattern provided on the cover may have only two levels of opaqueness/transmittance, so that the resulting reference patters 211 is effectively monochrome. However, it is possible for the cover to be provided with areas of intermediate opaqueness/transmittance, so that the resulting reference pattern 211 has multiple levels of brightness.
Furthermore, as mentioned above, as the light source 213 also serves the purpose of illuminating the eye for image capture, the light source 213 may produce non-visible light, such as infrared light. In particular, near-infrared (NIR) light (e.g. 750 to 1,400 nm wavelength) may be used. For example, the light source 213 may be an LED emitting NIR light. Alternatively, the light source 213 may produce light within a broader spectral band, and may comprise a light filter to prevent light other than NIR light from being emitted.
As shown in
The casing 110 with integrated covers 214, as shown in
Alternatively, as shown in
As shown in
As noted above, in the example implementations of
The cover 214 may be manufactured by first forming the cover 214 out of an optically transmissive but diffuse material (e.g. a plastics material), followed by coating the cover 214 with an opaque material (e.g. paint), then selectively etching or burning away (e.g. using laser) some of the opaque material so as to form the windows 216.
Compared with using a single light source per illuminator 210, the plurality of light sources 213a in this example arrangement may individually have a lower power output. Because power output is lower, cheaper individual light sources 213a may be used. Of course, it is possible to achieve a lower power output by under-driving the light sources 213a. The combined light output of the plurality of light sources 213a may be similar to the example arrangements shown in
In this arrangement, because all of the light emitted by the light sources 213a is emitted towards the eye without any of it being blocked or otherwise absorbed, this arrangement may be more energy efficient.
As shown in
In some situations, the manufacturing process required to produce illuminators 210 with different arrangements of light sources 213a may be complex. Therefore, as an alternative to the arrangement shown in
To make a light source 213a non-operable, during manufacturing, the light source 213a may simply be not provided with a connection to the electrical supply. Alternatively, all of the light sources 213a may initially be connected, and the manufacturing process may include an additional step of severing the connections of the light sources 213a which are required to be non-operable.
As noted above, the plurality of light sources 213a may be arranged in an invariable matrix for all of the illuminators 210. However, as an alternative to rendering some of the light sources 213a non-operable, the required reference pattern 211 for each illuminator 201 may be formed by selectively operating some of the plurality of light sources 213a. This arrangement also has the advantage that only one design of illuminators 210 need be manufactured, which may reduce the complexity of the manufacturing process. Another advantage of this arrangement may be that the reference patterns 211 can be changed during use of the eye-tracking system 100. For example, this may be useful for adapting to different external light conditions.
As noted above, at least two of the reference patterns 211 should be different from each other, so that their corresponding glints can be distinguished from one another. A further benefit of this arrangement is that it may allow each glint 212 to be readily matched to the corresponding reference pattern 211. In a prior-art system 100′ such as shown in
A further cause for high power consumption associated with glint matching using prior-art systems 100′ is that substantial amount of computer processing is necessary for this type of glint matching. This is due to the need to analyse substantially the entire image segment containing the pupil region, and the corneal reflections (which may contain an unknown number and formation of glints) generally do not have a predictable pattern. That is, it is difficult to know a priori what pattern to expect in the corneal reflections. As a result, machine learning modules cannot be trained to search for glints by simply recognising specific patterns that are known a priori. Instead, complex and power-consuming algorithms are needed and must analyse the whole image segment, and even then the reliability of glint matching is poor.
By contrast, with the present invention, because each glint 212 has a unique pattern which is known a priori, for the purpose of glint matching, it may be computationally less demanding to recognise the pattern of a glint 212. This is because machine learning algorithms can be computationally efficient in recognising set patterns, and the use of the predetermined patterns 211 according to the present invention may enable the machine learning module to be specifically trained on these patterns. In other words, with the use of the predetermined patterns 211, the machine learning module may have a much more specific and well-defined task, which may lend itself to an implementation which requires little memory and processing power. This may in turn result in a lower power consumption associated with glint matching.
Furthermore, given the improved reliability in glint matching, it may be possible to use a small number of illuminators 210 whilst still maintaining adequate glint matching reliability. For example, as will be discussed in more detail below, as few as four illuminators 210 may be used.
As a further effect, with the present invention, it may be necessary to analyse only the image segments containing the glints 212. Because the total area of the image segments containing the glints 212 tends to be much smaller than the area of an image segment containing the entire pupil region, the amount of image data that needs to be analysed can be substantially reduced. Therefore, the processing power required may be reduced, which may in turn reduce the power consumption associated with glint matching.
To further ensure uniqueness of the reference patterns 211, they may be different from one another in the sense that none of the reference patterns 211 is a scaled or rotated version of another one of the reference patterns 211. This may be useful, for example, in case the user wears prescription glasses, which may produce complex reflections that rotate and/or scale the reference patterns 211. Therefore, by ensuring that none of the reference patterns 211 is a scaled or rotated version of another reference pattern 211, the glints 212 can be reliably distinguished from one another.
To further ensure uniqueness of the reference patterns 211, each of the reference patterns may be configured so that it has no rotational symmetry.
As noted above, one of the drawbacks of prior-art systems is that it is sometimes difficult to distinguish genuine glints from unwanted artefacts. In prior-art systems, one approach to addressing this drawback is to employ smaller light sources (and thus increasing the light intensity), so that the glints are significantly brighter than the general level of illumination provided to the eye area by the same light sources. That is, since the prior-art light sources have no other feature than being point-like, the glints can be made to be more recognisable by making them smaller and more intense. However, this approach creates another drawback which is that the glints may become over-exposed in the images captured by the camera. That is, given the limited dynamic range of the camera, it may be difficult to set the exposure level of the camera such that the dynamic range is used efficiently while preventing the image from being saturated by the extra-bright glints. When a glint is saturated, its outline becomes less defined and its centre may become more difficult to compute reliably. Therefore, saturation may add a degree of uncertainty in the determination of glint positions, which may in turn result in poor gaze estimation and poor eye-tracking reliability. We refer to this as the “exposure problem”.
In certain applications of prior-art systems, for example in AR glasses, a further technique which may be employed to make the genuine glints more distinguishable from unwanted artefacts is to increase the power level of the illuminators. This may produce glints which are brighter in comparison with external light sources. However, this may exacerbate the exposure problem.
In other words, prior-art systems are limited by the trade-off between glint detection accuracy and gaze estimation accuracy. That is, the brighter (i.e. greater intensity, but not necessarily greater light power) the glints, the more reliable the glint detection but gaze estimation reliability suffers. Conversely, reducing the brightness of the glints may improve the image quality of the features of the pupil/iris region, which is necessary for gaze estimation, but glint detection accuracy suffers. Without reliable glint detection, as noted above, gaze estimation also becomes unreliable. As a further drawback, increasing the brightness of the glints also increases power consumption.
In order to address the exposure problem, the reference pattern 211 of each of the illuminators 210 may be sufficiently large. For example, the reference patterns 211 may be at least 2 mm, at least 4 mm, at least 6 mm, at least 8 mm, at least 10 mm, or at least 12 mm in length and width.
As would be appreciated, although increasing the size of the reference patterns 211 may address the exposure problem, it may not be desirable to increase the size of the reference patterns 211 to an arbitrarily large size. For example, the large size of the reference patterns 211 may result in a bulkier illumination system 200 because the casing 110 would have to large enough to accommodate the larger illuminators 210. In addition, as the size of the reference patterns 211 increases, the risk of the reflected patterns colliding also increases.
This is especially the case if the user wears prescription glasses. Therefore, whilst it may be advantageous to address the exposure problem, it may also be desirable to limit the size of the reference patterns 211. For example, the size of the reference pattern 211 may be no more than 12 mm, no more than 8 mm, no more than 6 mm, no more than 4 mm, or no more than 2 mm.
In the examples shown in the figures, the reference patterns 211 are shown to have generally equal length and width (i.e. they have a generally square outline). However, it may be advantageous to employ reference patterns 211 which have a length that is not equal to its width. That is, each of the reference patterns 211 may have a generally rectangular outline. This may be advantageous because larger reference patterns 211 may be accommodated without increasing the thickness of the casing 110 housing the illuminator system 200. As such, the exposure problem may be more effectively addressed without rendering the eye-tracking system 100 bulkier.
As noted above, depending on the relative position and orientation of the eyeball, some of the reflected patterns may fall outside the iris/pupil region, and these reflections will not produce glints. Therefore, if only two illuminators 210 are provided, it is possible that, in some extreme angular position of the eyeball, no glint will be produced and gaze estimation may fail. Therefore, it may be desirable to provide more than two illuminators 210 so as to increase the likelihood that at least one detectable glint (per eye) is produced. For example, the number of illuminators 210 may be three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more. More preferably, it may be desirable to provide four illuminators 210 (as shown in
As shown in
The present invention is also directed to a complete eye-tracking system comprising the illuminator system 200. Depending on the implementation of the illuminator system 200, the illuminator system 200 may be attached to or integrated in the eye-tracking system 100. As noted above, the eye-tracking system 100 may comprise a pair of extended reality (XR) headset.
As noted above, in order to perform glint detection and gaze estimation, the eye-tracking system 100 may comprise an image sensor 300 (or camera) configured to capture an image of an eye, the image comprising one or more glints. The captured image may include features of the iris/pupil region, which may be necessary for gaze estimation. As shown in
In order to perform glint detection, the eye-tracking system may further comprise a controller (not shown) communicating with the image sensor 300. The controller may be used to analyse the image captured by the image sensor 300. Referring to
At step 530, the controller may analyse the captured images. Specifically, the controller may recognise the one or more glints 212 present in the captured images. For example, the controller may recognise the glints 212 by searching for image features that match the expected attributes of a glint 212. The controller may perform the recognition of one or more glints using image processing techniques, optionally in conjunction with machine learning algorithms. For example, glints 212 may have an expected general outline shape. The expected outline shape may be known based on the predetermined reference patterns stored on the controller and based on the known curvature of the cornea of the eye. Therefore, image features which appear to be glints but are heavily distorted may be excluded as possible candidates for genuine glints.
At step 540, the controller may compare the one of more recognised glints with the predetermined reference patterns 211 stored on the controller. The controller may compare each of the recognised glints with each of the predetermined reference patterns 211 that are known to be present in the illuminator system 200, so as to find the reference pattern 211 that matches the glint being compared. If a match is found, at step 550, the controller may associate the glint with the matching reference pattern 211. Therefore, with this arrangement, it is possible not only to detect the glints, but also to distinguish one glint from another. This may provide further certainty in the glint detection, which may in turn improve the reliability of gaze estimation.
As noted above, especially if the user is wearing prescription glasses, the images captured by the image sensor 300 may include numerous artefacts resulting from the reflection of the emitted light by the prescription glasses. Otherwise, artefacts may result from external light sources. As noted above, reflections produced by prescription glasses generally are distorted and have an apparent size which is inconsistent with genuine glints. Therefore, the step (step 540) of comparing the glints with the predetermined reference patterns may comprise determining the apparent size of each glint at step 541. The controller may determine that the glint is genuine if its apparent size falls within a predetermined size range, at step 542. On the contrary, the controller may determine that the glint is an artefact if its apparent size falls outside the predetermined range, at step 543. The predetermined size range may be known in advance based on the stored predetermined reference patterns and the known approximate curvature of the cornea of the eye.
The controller may be an embedded device within the eye-tracking system 100. Alternatively, the controller may be not embedded, but connected to image sensor 300 using a wired or wireless connection. For example, the controller may be provided in a separate unit that communicates with the extended reality headset (or another wearable component of the eye-tracking system 100).
Another aspect of the present invention is directed to a computer program comprising instructions to cause an eye-tracking system 100 to perform the method shown in
Number | Date | Country | Kind |
---|---|---|---|
2350006-7 | Jan 2023 | SE | national |