This disclosure relates to a head mounted display.
A technique is known in which the eyesight direction of a user is detected by irradiating the user's eyes with non-visible light such as near-infrared light and analyzing an image of the eyes that includes the reflected light. The detected eyesight direction is reflected on the monitor of, for example, a PC (Personal Computer) or a game console, which allows the user's gaze to be used as a pointing device.
Japanese Unexamined Patent Application, First Publication No. H2-264632
A head mounted display is an image display device that presents a three-dimensional image to a user wearing the device. Generally, the head mounted display is worn so as to cover the visual range of the user. For this reason, the outside view of a user wearing the head mounted display is blocked. When the head mounted display is used as a display device for images such as moving pictures or games, it is difficult for the user to visually recognize an input device such as a controller.
Therefore, it is convenient if the head mounted display can serve as a substitute for a pointing device by detecting the eyesight direction of the user wearing it. In particular, acquiring geometric information (information of spatial coordinates or a shape) of the user's cornea while the user wears the head mounted display is useful in estimating the eyesight direction of the user.
It could therefore be helpful to provide a technique of detecting geometric information of the cornea of a user wearing a head mounted display.
Provided is a line-of-sight detection system including a head-mounted display and a line-of-sight detection device. The head-mounted display includes: an image display element that includes a plurality of pixels, each pixel including sub-pixels that emit red, green, blue, and invisible light, and that displays an image to be viewed by a user; an imaging unit that images an eye of the user wearing the head-mounted display on the basis of the invisible light emitted from the sub-pixel that emits the invisible light; and a transmission unit that transmits a captured image captured by the imaging unit. The line-of-sight detection device includes: a reception unit that receives the captured image; and a line-of-sight detection unit that detects a line of sight of the eye of the user based on the captured image.
Further, in order to resolve the problem, a head-mounted display according to an aspect of the present invention is a head-mounted display used while mounted on the head of a user, and includes: a convex lens disposed at a position facing a cornea of the user when the head-mounted display is mounted; an image display element that includes a plurality of pixels, each pixel including sub-pixels that emit red, green, blue, and invisible light, and that displays an image to be viewed by the user; a camera that images a video including the cornea of the user as a subject; and a housing that houses the convex lens, the image display element, and the camera.
Further, the head-mounted display may include a control unit that selects, from among the plurality of pixels constituting the image display element, the pixel that emits invisible light, and causes the selected pixel to emit light.
Further, the control unit may change the pixel that emits the invisible light when a predetermined time elapses.
Further, the control unit may switch between a light emission timing of the sub-pixel that emits the invisible light and a light emission timing of the other sub-pixels.
Meanwhile, any combination of the aforementioned components, and implementation of our displays in the form of methods, devices, systems, computer programs, data structures, recording media, and the like may be considered part of this disclosure.
It is thus possible to provide a technique of detecting geometric information of the cornea of a user wearing a head mounted display.
The image reproducing device 200 generates an image displayed by the head mounted display 100. As a non-limiting example, the image reproducing device 200 is a device capable of reproducing an image, such as a stationary game console, a portable game console, a PC, a tablet, a smartphone, a phablet, a video player, or a television. The image reproducing device 200 connects to the head mounted display 100 in a wireless or wired manner. In an example shown in
Meanwhile,
The head mounted display 100 includes a housing 150, a mounting fixture 160, and a headphone 170. The housing 150 houses an image display system such as an image display element that presents an image to the user 300, or a wireless transmission module such as a WI-FI module or a BLUETOOTH (Registered Trademark) module which is not shown. The mounting fixture 160 mounts the head mounted display 100 on the head of the user 300. The mounting fixture 160 can be realized by, for example, a belt, an elastic band or the like. When the user 300 mounts the head mounted display 100 using the mounting fixture 160, the housing 150 is disposed at a position where the eyes of the user 300 are covered. For this reason, when the user 300 mounts the head mounted display 100, the visual range of the user 300 is shielded by the housing 150.
The headphone 170 outputs the audio of an image reproduced by the image reproducing device 200. The headphone 170 may be fixed to the head mounted display 100. Even in a state where the user 300 mounts the head mounted display 100 using the mounting fixture 160, the user can freely attach and detach the headphone 170.
The near-infrared light source 103 is a light source capable of emitting light of a near-infrared (approximately 700 nm to 2,500 nm) wavelength band. Near-infrared light lies in a non-visible wavelength band that generally cannot be observed with the naked eye of the user 300.
The image display element 108 displays an image for presentation to the user 300. The image displayed by the image display element 108 is generated by a GPU (Graphics Processing Unit), not shown, within the image reproducing device 200. The image display element 108 can be realized using, for example, a known LCD (Liquid Crystal Display), an organic EL display (Organic Electro-Luminescent Display) or the like.
When the user 300 mounts the head mounted display 100, the hot mirror 112 is disposed between the image display element 108 and the cornea 302 of the user 300. The hot mirror 112 has a property of transmitting visible light generated by the image display element 108, but reflecting near-infrared light.
The convex lens 114 is disposed on the opposite side to the image display element 108 with respect to the hot mirror 112. In other words, when the user 300 mounts the head mounted display 100, the convex lens 114 is disposed between the hot mirror 112 and the cornea 302 of the user 300. That is, when the head mounted display 100 is mounted to the user 300, the convex lens 114 is disposed at a position facing the cornea 302 of the user 300.
The convex lens 114 condenses image display light that passes through the hot mirror 112. For this reason, the convex lens 114 functions as an image enlargement unit that enlarges an image generated by the image display element 108 and presents the enlarged image to the user 300. Meanwhile, for convenience of description, only one convex lens 114 is shown in
The near-infrared light source 103 is disposed at the lateral side of the convex lens 114. The near-infrared light source 103 emits infrared light toward the inside of the convex lens 114. The convex lens 114 is provided with a plurality of reflection regions that reflect the infrared light inside the lens. These reflection regions can be realized by providing fine regions having different refractive indexes in the inside of the convex lens 114. Meanwhile, providing the regions having different refractive indexes in the convex lens 114 can be realized using a known laser machining technique. The reflection region is provided at a plurality of places in the inside of the convex lens 114.
Near-infrared light emitted toward the inside of the convex lens 114 by the near-infrared light source 103 is reflected from the reflection region inside the convex lens 114 and directed to the cornea 302 of the user 300. Meanwhile, since the near-infrared light is non-visible light, the user 300 can hardly perceive the near-infrared light reflected from the reflection region. In addition, the reflection region is as large as, or finer than, a pixel constituting the image display element 108. For this reason, the user 300 can hardly perceive the reflection region, and is able to observe image light emitted by the image display element 108. Meanwhile, the details of the reflection region will be described later.
Although not shown, the image display system 130 of the head mounted display 100 includes two image display elements 108, and can generate an image for presentation to the right eye of the user 300 and an image for presentation to the left eye independently of each other. For this reason, the head mounted display 100 can present a parallax image for the right eye and a parallax image for the left eye, respectively, to the right eye and the left eye of the user 300. Thereby, the head mounted display 100 can present a stereoscopic image having a sense of depth to the user 300.
As described above, the hot mirror 112 transmits visible light, and reflects near-infrared light. Therefore, image light emitted by the image display element 108 passes through the hot mirror 112 and reaches the cornea 302 of the user 300. In addition, infrared light emitted from the near-infrared light source 103 and reflected from the reflection region inside the convex lens 114 reaches the cornea 302 of the user 300.
The infrared light reaching the cornea 302 of the user 300 is reflected from the cornea 302 of the user 300, and directed toward the convex lens 114 again. This infrared light passes through the convex lens 114, and is reflected from the hot mirror 112. The camera 116 includes a filter that blocks visible light, and captures near-infrared light reflected from the hot mirror 112. That is, the camera 116 is a near-infrared camera that captures near-infrared light emitted from the near-infrared light source 103 and reflected from the cornea of the user 300.
The image output unit 118 outputs an image captured by the camera 116 to an eyesight detection unit that detects the eyesight direction of the user 300. The image output unit 118 also outputs the image captured by the camera 116 to a cornea coordinate acquisition unit that acquires spatial coordinates of the user's cornea. Specifically, the image output unit 118 transmits the image captured by the camera 116 to the image reproducing device 200. The eyesight detection unit and the cornea coordinate acquisition unit will be described later, but can be realized by an eyesight detecting program and a cornea coordinate acquiring program executed by a CPU (Central Processing Unit) of the image reproducing device 200. Meanwhile, when the head mounted display 100 has a computing resource of a CPU, a memory or the like, the CPU of the head mounted display 100 may execute a program to operate the eyesight detection unit.
Although a detailed description will be given later, the image captured by the camera 116 contains a bright point of the near-infrared light reflected by the cornea 302 of the user 300 and an image of the eye, including the cornea 302 of the user 300, observed in a near-infrared wavelength band.
In the convex lens 114, a plurality of reflection regions are formed so that the pattern of infrared light appearing on the cornea 302 of the user 300 forms structured light. “Structured light” refers to light used in a method of three-dimensional measurement of an object called the structured light method. More specifically, structured light is light emitted to cause a light pattern having a specific structure to appear on the surface of an object to be measured. Various patterns can be produced by structured light; examples include a plurality of dot patterns arrayed in a lattice shape, stripe-shaped patterns disposed at equal intervals, and a lattice pattern. In addition, the structured light may include not only single-color light, but also multi-color light (for example, red, green, and blue).
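As an illustration of the "dot patterns arrayed in a lattice shape" mentioned above, the following minimal Python sketch generates the center coordinates of such a pattern. The function name, the parameters `rows`, `cols`, and `pitch`, and the choice of centering the lattice on the origin are all hypothetical; the disclosure does not specify a layout or a unit of length.

```python
def lattice_dot_pattern(rows, cols, pitch):
    """Return (x, y) centers of dots arrayed in a lattice centered on the origin.

    rows, cols: number of dots vertically and horizontally (hypothetical values).
    pitch: spacing between adjacent dots, in arbitrary units (assumption).
    """
    x0 = -(cols - 1) * pitch / 2.0  # leftmost column
    y0 = -(rows - 1) * pitch / 2.0  # first row
    return [(x0 + c * pitch, y0 + r * pitch)
            for r in range(rows)
            for c in range(cols)]


# Example: a 3 x 4 lattice with a pitch of 0.5 units.
dots = lattice_dot_pattern(rows=3, cols=4, pitch=0.5)
```

A stripe or lattice-line pattern could be produced in the same way by emitting line segments instead of points.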
The structured light method is a known technique, and thus a detailed description thereof will not be given, but the structured light formed by the reflection region provided inside the convex lens 114 causes a pattern formed by a plurality of infrared light dots to appear in a region including the cornea 302 of the user 300.
As shown in
As shown in
As described above, the reflection region 120 is a region having a refractive index different from that of other regions in the convex lens 114. For this reason, the infrared light incident from the LED 104 is totally reflected from the reflection region 120 and directed to the cornea 302 of the user 300. Since the reflection region 120 is provided at a plurality of places in the convex lens 114, as many beams of infrared light as there are reflection regions 120 are directed to the cornea 302 of the user 300. Thereby, dot patterns corresponding to the arrangement of the reflection regions 120 can be formed on the cornea 302 of the user 300. Meanwhile, providing a region having a different refractive index in the convex lens 114 can be realized using a known laser machining technique.
As described above, the infrared light reaching the cornea 302 of the user 300 is reflected from the cornea 302 of the user 300, and directed toward the convex lens 114 again. In this case, when the infrared light reaches the reflection region 120, the infrared light is reflected by the reflection region 120 and is not able to pass through the convex lens 114. However, each of the reflection regions 120 is a narrow region, and the relative position between the reflection region 120 and the cornea 302 of the user 300 continually changes with a change in the eyesight of the user 300. For this reason, the probability that the infrared light reflected from the cornea 302 of the user 300 and directed to the convex lens 114 is reflected by the reflection region 120 is small, which does not lead to a problem.
Even if the infrared light reflected from the cornea 302 of the user 300 and directed to the convex lens 114 is reflected by the reflection region 120 at a certain timing, the relative position between the reflection region 120 and the cornea 302 of the user 300 changes at another timing, and the infrared light is then no longer reflected. Therefore, because the camera 116 captures the infrared light over time, even when reflected light from the cornea 302 of the user 300 is blocked by the reflection region 120 at a certain moment, the camera 116 can capture an image at another moment, which does not lead to a problem.
In
In
As shown in
The reception and transmission unit 210 executes the transmission of information between the image reproducing device 200 and the head mounted display 100. The reception and transmission unit 210 can be realized by a wireless communication module according to the standard of MIRACAST (Trademark), WIGIG (Trademark), WHDI (Trademark), or the like described above.
The image generation unit 220 generates an image displayed on the image display element 108 of the head mounted display 100. The image generation unit 220 can be realized using, for example, the GPU or the CPU described above.
The cornea coordinate acquisition unit 240 analyzes the interval between the dot patterns appearing on the cornea 302 of the user 300, and thus acquires a three-dimensional shape of the cornea 302 of the user 300. Thereby, the cornea coordinate acquisition unit 240 can also estimate position coordinates of the cornea 302 of the user 300 in a three-dimensional coordinate system using the camera 116 as an origin.
Meanwhile, the camera 116 may be a monocular camera, or may be a stereo camera including two or more imaging units. In the latter case, the cornea coordinate acquisition unit 240 analyzes the parallax image of the cornea 302 of the user 300 which is captured by the camera 116, and thus can more accurately estimate the position coordinates of the cornea 302 of the user 300 in the three-dimensional coordinate system using the camera 116 as an origin.
Meanwhile, the cornea 302 of the user 300 is generally aspherical rather than spherical. For this reason, in the above method in which the cornea 302 of the user 300 is assumed to be spherical, an estimation error may occur in the eyesight direction of the user 300. Consequently, the eyesight detection unit 230 may perform calibration for eyesight direction estimation before the user 300 starts to use the head mounted display 100.
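The disclosure does not state how the parallax image is analyzed. As one hedged illustration, the sketch below applies the standard triangulation for a rectified pinhole stereo pair, in which depth is the focal length times the baseline divided by the disparity. The rectified-pair model, the placement of the origin at the left imaging unit, and every parameter name (`focal_length_px`, `baseline_mm`, `cx`, `cy`) are assumptions made for this example only.

```python
def depth_from_disparity(focal_length_px, baseline_mm, disparity_px):
    """Depth (mm) of a point seen by a rectified stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_length_px * baseline_mm / disparity_px


def triangulate(u_left, v_left, u_right, focal_length_px, baseline_mm, cx, cy):
    """Return (x, y, z) in mm, with the left imaging unit as the origin.

    (u_left, v_left): pixel position of a corneal feature in the left image.
    u_right: horizontal pixel position of the same feature in the right image.
    (cx, cy): principal point of the left image (assumed known).
    """
    z = depth_from_disparity(focal_length_px, baseline_mm, u_left - u_right)
    x = (u_left - cx) * z / focal_length_px
    y = (v_left - cy) * z / focal_length_px
    return (x, y, z)
```

For instance, with a focal length of 800 pixels and a baseline of 10 mm, a disparity of 40 pixels corresponds to a depth of 200 mm under this model.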
Thereafter, errors are similarly stored for the nine directions $P_1 - Q_1$, $P_2 - Q_2$, ..., $P_9 - Q_9$ corresponding to the points $Q_1$ to $Q_9$, and thus the eyesight detection unit 230 can acquire a correction table to correct the direction of the normal line 306 obtained by calculation. The eyesight detection unit 230 acquires the correction table in advance through calibration, and corrects the direction of the normal line 306 obtained by the aforementioned method, thereby allowing higher-accuracy eyesight direction detection to be realized.
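The text does not say how the correction table is applied to a new measurement. The following sketch shows one simple possibility, stated here as an assumption: each estimated point P_i and displayed point Q_i is treated as a 2D point on the display surface, the table stores the error P_i - Q_i, and a new estimate is corrected by subtracting the error of the nearest calibration entry. The function names are hypothetical.

```python
def build_correction_table(estimated, displayed):
    """Pair each estimated point P_i with its error vector P_i - Q_i.

    estimated[i]: point P_i obtained by calculation during calibration.
    displayed[i]: point Q_i actually shown to the user.
    """
    return [(p, (p[0] - q[0], p[1] - q[1]))
            for p, q in zip(estimated, displayed)]


def correct(table, p):
    """Correct estimate p using the error of the nearest calibration point.

    Nearest-neighbor lookup is an assumption of this sketch, not a method
    stated in the disclosure; interpolation would be another option.
    """
    _, err = min(table,
                 key=lambda e: (e[0][0] - p[0]) ** 2 + (e[0][1] - p[1]) ** 2)
    return (p[0] - err[0], p[1] - err[1])
```

With nine calibration points, `build_correction_table` would receive nine pairs; the lookup itself is independent of the number of entries.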
It is also considered that, after the user 300 mounts the head mounted display 100 on the head and the eyesight detection unit 230 performs calibration, a relative positional relationship between the head of the user 300 and the head mounted display 100 changes. However, when the eyesight direction of the user 300 is detected from the shape of the cornea 302 of the user 300 described above, the relative positional relationship between the head of the user 300 and the head mounted display 100 does not influence the accuracy of detection of the eyesight direction. Therefore, it is possible to realize robust eyesight direction detection with respect to a change in the relative positional relationship between the head of the user 300 and the head mounted display 100.
Regarding the above, a method in which the eyesight detection unit 230 detects the eyesight direction of the user 300 using a geometric method has been described. The eyesight detection unit 230 may execute eyesight direction detection based on an algebraic method using coordinate transformation described below, instead of the geometric method.
In
A matrix M having a size of $3 \times 2$ is defined as Expression (1).
In this case, when the matrix M satisfies Expression (2), the matrix M becomes a matrix to project the eyesight direction of the user 300 onto a moving image surface displayed by the image display element 108.
$P_N = M Q_N \quad (N = 1, \ldots, 9)$ (2)
When Expression (2) is specifically written, Expression (3) is established.
When Expression (3) is rearranged, Expression (4) is obtained.
When the following expression is set, Expression (5) is obtained.
$y = Ax$ (5)
In Expression (5), the elements of the vector y are the coordinates of the points $Q_1$ to $Q_9$ displayed on the image display element 108 by the eyesight detection unit 230, and thus the elements are known. In addition, the elements of the matrix A are coordinates of the peak P of the cornea 302 of the user 300 acquired by the cornea coordinate acquisition unit 240. Therefore, the eyesight detection unit 230 can acquire the vector y and the matrix A. Meanwhile, the vector x, which is obtained by arranging the elements of the transformation matrix M, is unknown. Therefore, when the vector y and the matrix A are known, the problem of estimating the matrix M becomes the problem of obtaining the unknown vector x.
In Expression (5), when the number of equations (that is, the number of points Q presented to the user 300 when the eyesight detection unit 230 performs calibration) is larger than the number of unknowns (that is, the number of elements of the vector x, which is 6), the problem becomes overdetermined. In the example shown in Expression (5), the number of equations is nine, which leads to an overdetermined problem.
Let the vector e be the error vector between the vector y and the vector Ax. That is, e = y − Ax. In this case, the optimum vector $x_{\mathrm{opt}}$ in the sense of minimizing the sum of squares of the elements of the vector e is obtained by Expression (6).
$x_{\mathrm{opt}} = (A^{T} A)^{-1} A^{T} y$ (6), where “−1” indicates an inverse matrix and “T” indicates a transpose.
The eyesight detection unit 230 constitutes the matrix M of Expression (1) by using the elements of the obtained vector $x_{\mathrm{opt}}$. Thereby, using the matrix M and the coordinates of the peak P of the cornea 302 of the user 300 acquired by the cornea coordinate acquisition unit 240, the eyesight detection unit 230 can estimate, according to Expression (2), where on the moving image surface displayed by the image display element 108 the user 300 is gazing.
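As a brief check on Expression (6), the following derivation sketches why this vector minimizes the square sum of the elements of e. It assumes that $A^{T}A$ is invertible (that is, the columns of A are linearly independent), a condition that is not stated in the text above.

```latex
\begin{aligned}
\|e\|^2 &= (y - Ax)^{T}(y - Ax)
         = y^{T}y - 2\,x^{T}A^{T}y + x^{T}A^{T}A\,x, \\
\frac{\partial \|e\|^2}{\partial x} &= -2A^{T}y + 2A^{T}A\,x = 0
  \;\Longrightarrow\; A^{T}A\,x_{\mathrm{opt}} = A^{T}y
  \;\Longrightarrow\; x_{\mathrm{opt}} = (A^{T}A)^{-1}A^{T}y .
\end{aligned}
```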
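The calibration fit and the subsequent gaze estimate can be sketched in Python as follows. Because Expressions (1), (3), and (4) are not reproduced in this text, the layout of A and x used here is an assumption: each calibration point contributes two rows to A built from the cornea peak coordinates (px, py, pz), y holds the display coordinates (qx, qy), and the six unknowns are read as the two columns of a 3×2 matrix applied to P in row-vector form. All function names are hypothetical, and the plain Gaussian elimination is only one of several ways to solve the normal equations.

```python
def transpose(m):
    return [list(row) for row in zip(*m)]


def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with pivoting."""
    n = len(a)
    aug = [list(a[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("A^T A is singular; calibration points are degenerate")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def fit_projection(cornea_points, display_points):
    """Return x_opt = (A^T A)^-1 A^T y for the assumed layout of A and y."""
    a, y = [], []
    for (px, py, pz), (qx, qy) in zip(cornea_points, display_points):
        a.append([px, py, pz, 0.0, 0.0, 0.0])  # row predicting qx
        y.append(qx)
        a.append([0.0, 0.0, 0.0, px, py, pz])  # row predicting qy
        y.append(qy)
    at = transpose(a)
    ata = [[sum(u * v for u, v in zip(r1, r2)) for r2 in at] for r1 in at]
    aty = [sum(u * v for u, v in zip(row, y)) for row in at]
    return solve(ata, aty)


def project(x, p):
    """Map a cornea peak p = (px, py, pz) to display coordinates using x."""
    return (x[0] * p[0] + x[1] * p[1] + x[2] * p[2],
            x[3] * p[0] + x[4] * p[1] + x[5] * p[2])
```

In use, `fit_projection` would be called once with the nine calibration pairs, and `project` would then be called for each newly acquired cornea peak.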
It is also considered that, after the user 300 mounts the head mounted display 100 on the head and the eyesight detection unit 230 performs calibration, a relative positional relationship between the head of the user 300 and the head mounted display 100 changes. However, the position coordinates of the peak P of the cornea 302 constituting the matrix A described above are values estimated by the cornea coordinate acquisition unit 240 as position coordinates in the three-dimensional coordinate system using the camera 116 as an origin. Even when it is assumed that the relative positional relationship between the head of the user 300 and the head mounted display 100 changes, a coordinate system based on the position coordinates of the peak P of the cornea 302 does not change. Therefore, even when the relative positional relationship between the head of the user 300 and the head mounted display 100 changes slightly, coordinate transformation according to Expression (2) is considered to be effective. Consequently, eyesight detection executed by the eyesight detection unit 230 can improve robustness with respect to a shift of the head mounted display 100 during mounting.
As described above, according to the head mounted display 100, it is possible to detect geometric information of the cornea 302 of the user 300 wearing the head mounted display 100.
Particularly, since the head mounted display 100 can acquire the three-dimensional shape and the position coordinates of the cornea 302 of the user 300, it is possible to estimate the eyesight direction of the user 300 with good accuracy.
As stated above, our displays have been described on the basis of our examples. The examples have been described for exemplary purposes only, and it can be readily understood by those skilled in the art that various modifications may be made by a combination of each of these components or processes, which are also encompassed by the scope of this disclosure.
In the above, a description has been given of an example when the convex lens 114 is provided with the reflection regions 120 so that different dot patterns appear at different positions of the cornea 302 of the user 300. Dots having different blinking patterns may be caused to appear at different positions of the cornea 302 of the user 300, instead thereof or in addition thereto. This can be realized by forming, for example, the near-infrared light source 103 by a plurality of different light sources, and changing a blinking pattern in each light source.
Further, although the near-infrared light is radiated from the near-infrared light source 103 in the above-described embodiment, a light source that radiates near-infrared light may be included in each pixel constituting the image display element 108. That is, one pixel is generally constituted by RGB sub-pixels, and in this configuration a light emitting element that emits near-infrared light is provided in addition to the light emitting elements that emit red light, green light, and blue light. When a sub-pixel that emits near-infrared light is included in the image display element, the near-infrared light source 103 need not be provided in the head-mounted display 100.
The sub-pixel 900i is a sub-pixel that emits near-infrared light. Whether or not the sub-pixel 900i of each pixel 900 emits the near-infrared light is determined according to an instruction from the video output unit 224, and information indicating this is included in the display image data that the video output unit 224 outputs to the head-mounted display 100. Thus, an emission pattern of the near-infrared light desired by an operator can be formed. For example, depending on the content of the video being displayed at that time, a pattern can be formed in which the near-infrared light is not emitted in a pixel that strongly emits red light. The light emission of the sub-pixel 900i may be executed by a display unit included in the head-mounted display shown in the above embodiment, or may be executed by an irradiation unit that controls the sub-pixel 900i that emits near-infrared light.
A configuration for emitting near-infrared light in the image is effective regardless of the type of the display device, and can be applied to various display devices such as an LCD, a plasma display, and an organic EL display. Further, even when a sub-pixel for the near-infrared light is included in the pixel, the user does not feel uncomfortable when viewing the actually displayed image, provided that the wavelength of the radiated near-infrared light is set outside the range of wavelengths that a person can perceive.
Further, control of which pixels of the image display element 108 have their sub-pixel 900i emit light may be executed by the display unit or the irradiation unit of the head-mounted display 100 on its own, or may be executed by the display unit or the irradiation unit of the head-mounted display according to designation from the video generation unit 220. Thus, the structured light shown in the above embodiment can be realized. Further, at this time, which sub-pixels 900i are turned on may be changed as appropriate. In particular, when a moving image is displayed on the image display element 108, the sub-pixel 900i that emits the near-infrared light may be changed each time a predetermined time elapses. Here, the predetermined time may be defined in seconds. In the case of a moving image, the predetermined time may be defined by a number of frames, or may be defined for each blanking period. In this case, the frame number of the moving image and the coordinate position, on the image display element 108, of the sub-pixel 900i that emits near-infrared light at that time are stored in the head-mounted display or a line-of-sight detection device in association with each other, such that line-of-sight detection can be appropriately executed each time. Further, a blinking pattern of the sub-pixel 900i that emits the near-infrared light may be changed in a predetermined period.
Further, the timing at which the sub-pixel 900i is turned on and the timing at which the sub-pixel 900r, the sub-pixel 900g, and the sub-pixel 900b are turned on may be different. The camera 116 may be configured to execute imaging only at the timing at which the sub-pixel 900i is turned on. As a scheme for realizing this configuration, for example, the blanking period of the sub-pixel 900r, the sub-pixel 900g, and the sub-pixel 900b and the blanking period of the sub-pixel 900i may be set to different time periods. More specifically, it is preferable for the blanking period of the sub-pixel 900r, the sub-pixel 900g, and the sub-pixel 900b to be set as the turn-on period of the sub-pixel 900i and for the blanking period of the sub-pixel 900i to be set as the turn-on period of the sub-pixel 900r, the sub-pixel 900g, and the sub-pixel 900b.
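A minimal sketch of the content-dependent pattern described above, in which near-infrared emission is suppressed in pixels that strongly emit red light, might look as follows. The 8-bit RGB frame layout, the function name, and the threshold value of 200 are all hypothetical; the disclosure does not define what "strongly" means numerically.

```python
def ir_emission_mask(frame, pattern, red_threshold=200):
    """Return the set of (row, col) pixels whose IR sub-pixel is turned on.

    frame: 2D list of (r, g, b) tuples, assumed 8-bit values.
    pattern: set of (row, col) positions the desired emission pattern uses.
    red_threshold: red level at or above which IR emission is suppressed
                   (an assumed value for illustration).
    """
    return {(r, c) for (r, c) in pattern if frame[r][c][0] < red_threshold}


frame = [[(255, 0, 0), (10, 20, 30)],
         [(0, 0, 0), (250, 250, 250)]]
mask = ir_emission_mask(frame, pattern={(0, 0), (0, 1), (1, 1)})
```

In this example only the pixel at (0, 1) keeps its near-infrared sub-pixel on, because the other two candidate pixels have a red level at or above the threshold.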
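The association between frame numbers and the coordinates of the emitting sub-pixels can be sketched as a lookup table. In the example below, the emitting set is changed every `frames_per_pattern` frames and the patterns are cycled in order; the cycling rule, the function name, and the coordinate tuples are assumptions made for illustration.

```python
def build_ir_schedule(patterns, frames_per_pattern, total_frames):
    """Map each frame number to the IR sub-pixel coordinates lit in that frame.

    patterns: sequence of tuples of (x, y) coordinates on the display element.
    frames_per_pattern: the "predetermined time", expressed in frames.
    """
    schedule = {}
    for frame in range(total_frames):
        index = (frame // frames_per_pattern) % len(patterns)
        schedule[frame] = patterns[index]
    return schedule


patterns = [((0, 0), (4, 4)), ((2, 2),)]
schedule = build_ir_schedule(patterns, frames_per_pattern=3, total_frames=8)
```

A line-of-sight detection device holding the same table can look up `schedule[frame_number]` to know which dot positions to expect in the image captured for that frame.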
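The complementary timing described above can be illustrated with a simple slot-based timeline. The division of each frame into `display_slots` and `blanking_slots` is a hypothetical simplification; actual panel timing is not specified in the text.

```python
def subpixel_timeline(num_frames, display_slots, blanking_slots):
    """Build a per-slot timeline in which RGB and IR sub-pixels alternate.

    During the RGB turn-on slots the IR sub-pixel is blanked; during the RGB
    blanking slots the IR sub-pixel is lit and the camera captures.
    """
    timeline = []
    for frame in range(num_frames):
        for _ in range(display_slots):
            timeline.append({"frame": frame, "rgb_on": True,
                             "ir_on": False, "camera_capture": False})
        for _ in range(blanking_slots):
            timeline.append({"frame": frame, "rgb_on": False,
                             "ir_on": True, "camera_capture": True})
    return timeline


timeline = subpixel_timeline(num_frames=2, display_slots=3, blanking_slots=1)
```

Each slot has exactly one of `rgb_on` and `ir_on` set, and `camera_capture` follows `ir_on`, which matches the configuration in which the camera images only while the sub-pixel 900i is lit.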
Further, in the above embodiment, the image is displayed on the image display element 108 provided on the head-mounted display 100 and the video is provided to the user, but the present invention is not limited thereto.
A display system 1000 illustrated in
The above embodiment shows an example of line-of-sight detection in which a marker image is displayed, the user is made to gaze at the marker image, mapping information indicating a relationship between the cornea and the monitor obtained by this calibration is stored, and the line of sight of the user is then detected while the user views an actual video. However, it goes without saying that the line-of-sight detection scheme is not limited to the above algorithm. Line-of-sight detection using the following scheme is also included in the idea of the present invention.
When line-of-sight detection using an image reflected in the eyes is realized, a visible light camera is used as the camera 116. Accordingly, an image based on normal visible light can be imaged, and an image as illustrated in
Further, although the position of the corneal center of the user when the user is viewing the center of the image is stored in the above description, the line-of-sight detection can be performed without storing the position information. That is, the feature point is detected from a first frame of the moving image output by the video output unit 224 and from a second frame following the first frame (the second frame need not be the frame immediately after the first frame, but at least a part of the same object as an object displayed in the first frame must be displayed in it). Further, the position of the corneal center of the user gazing at the first frame and the position of the corneal center of the user gazing at the second frame are detected. The line-of-sight detection unit 230 may be configured to detect the point (line-of-sight direction) of the second frame at which the user is gazing on the basis of the movement distance and movement direction on the image display element 108 from the feature point in the first frame to the corresponding feature point in the second frame, and the movement distance and movement direction on the image display element 108 from the position of the corneal center of the user in the first frame to the position of the corneal center of the user in the second frame.
According to these schemes, it is not necessary to execute the calibration by displaying the marker image shown in the above embodiment. Therefore, prior preparation for performing the line-of-sight detection using the head-mounted display 100 need not be performed, and convenience for the user can be improved.
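The text does not specify how the two movements are combined. As one possible interpretation, labeled here as an assumption, the sketch below compares the on-screen displacement of each tracked feature point with the displacement of the corneal center between the two frames, and reports the feature whose direction of movement agrees best (by cosine similarity) as the gazed point. The function names and the use of cosine similarity are hypothetical and are not taken from the disclosure.

```python
import math


def displacement(a, b):
    """Vector from point a to point b."""
    return (b[0] - a[0], b[1] - a[1])


def cosine(u, v):
    """Cosine of the angle between u and v (0.0 if either has zero length)."""
    nu, nv = math.hypot(*u), math.hypot(*v)
    if nu == 0 or nv == 0:
        return 0.0
    return (u[0] * v[0] + u[1] * v[1]) / (nu * nv)


def gazed_feature(features_first, features_second, cornea_first, cornea_second):
    """Return the second-frame position of the feature the eye appears to follow.

    features_first[i] and features_second[i] are the same feature point in
    the first and second frames, in display coordinates.
    cornea_first / cornea_second are corneal center positions for each frame.
    """
    eye_move = displacement(cornea_first, cornea_second)
    best = max(range(len(features_first)),
               key=lambda i: cosine(
                   displacement(features_first[i], features_second[i]),
                   eye_move))
    return features_second[best]
```

A fuller version would also compare movement distances, as the text mentions both distance and direction; this sketch uses direction only to keep the example short.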
The present invention is applicable to a head mounted display.
| Number | Date | Country | Kind |
|---|---|---|---|
| PCT/JP2015/060398 | Apr 2015 | JP | national |

| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/JP2016/056078 | 2/29/2016 | WO | 00 |