The present invention relates to an information input device, and more particularly to an information input device that uses a projected image for information input.
Generally, an information input device such as a remote control device is used to input information for operating a television set, a video recorder, or the like. However, when it comes time to use the remote control device or like, the user may have trouble in locating the device because, for example, the user does not know where the device is placed, leading to the problem that the user is unable to use the device when the user desires to use it.
In view of the above, an information input device is known that projects, from an image projection device, an image of an operation unit having a plurality of input keys, and that determines on which input key an operation has been performed by detecting the motion of a finger on the projected image by image recognition (for example, refer to patent document 1). In the information input device disclosed in patent document 1, first the finger placed on the projected image is identified by edge detection from an image captured by an imaging unit, and then the downward motion of the finger, that is, the motion of the finger touching the surface on which the image is projected, is detected. This makes it possible to perform various input operations without operating the information input device itself.
A gestural interface as an wearable information input device is also known in which an image for input operation (pattern) such as a dial pad is projected on a wall, a table, or the palm of a user′ hand from a projector worn on the user and, when the projected image for input operation is pointed to by a device worn on the user's fingertip, an input operation corresponding to the image portion thus pointed to is implemented (for example, refer to patent document 2).
In the gestural interface disclosed in patent document 2, the image captured by a camera is analyzed by a computer, and the movement of the device worn on the user's fingertip is tracked to determine whether any corresponding input operation has been performed on the input operation image such as a dial pad. Further, since the image from the projector is projected after being reflected by a mirror, the user can change the projection position of the input operation image as desired by manually adjusting the orientation of the mirror.
Patent document 1: Japanese Unexamined Patent Publication No. H11-95895 (FIG. 1)
Patent document 2: U.S. Patent Publication No. 2010/0199232 (FIGS. 1, 2, and 12)
Such information input devices are also called virtual remote control devices, and are used to project an input operation image (pattern) on a suitable object in any desired environment so that anyone can easily perform an input operation. Generally, a visible laser light source is used as the light source for the projector projecting the input operation image. If the visible laser light is irradiated, for example, accidentally into the user's eye, the user's eye may be damaged.
In view of the above, it is an object of the present invention to provide an information input device whereby visible laser light which projects an information input image is prevented as much as possible from irradiating a body part to be protected such as the user's eye.
Provided is an information input device including a projection unit which projects an information input image by using visible laser light, a movable support unit which mounts the projection unit thereon in such a manner that a projection position on which the information input image is to be projected by the projection unit can be changed, a first sensing unit which captures an image of a sensing region within which the information input image can be projected, a second sensing unit which is mounted on the movable support unit, and which detects an object entering a predetermined region containing the projection position of the information input image and detects a distance to the object, an information input detection unit which detects information input by identifying, based on image data captured by the first sensing unit, an image of an input operation being performed on the information input image, and an identification control unit which identifies, based on information acquired by the second sensing unit, the presence or absence of a particular object entering the predetermined region and, if the entering of a particular object is detected, then causes the projection unit to stop projecting the information input image.
Preferably, in the above information input device, the information input detection unit detects information input by identifying, based on image data captured by the first sensing unit and information acquired by the second sensing unit, an image of an input operation being performed on the information input image.
Preferably, in the above information input device, the identification control unit identifies, based on image data captured by the first sensing unit and information acquired by the second sensing unit, the presence or absence of a particular object entering the predetermined region and, if the entering of a particular object is detected, then causes the projection unit to stop projecting the information input image.
Preferably, in the above information input device, the identification control unit identifies a human eye, nose, ear, mouth, face contour, or face as a particular object.
Preferably, in the above information input device, the second sensing unit includes an infrared light emitting unit, an infrared light sensing unit, and a scanning unit which scans the predetermined region in a two-dimensional fashion with an infrared beam that the infrared light emitting unit emits.
Preferably, in the above information input device, the second sensing unit detects the distance to the object entering the predetermined region by using a random dot pattern.
Preferably, in the above information input device, the second sensing unit detects the distance to the object entering the predetermined region by using a position sensitive device.
Preferably, in the above information input device, the first sensing unit includes an infrared light emitting unit and an infrared camera.
Preferably, in the above information input device, the first sensing unit and the second sensing unit respectively use mutually perpendicular linearly polarized infrared lights. This makes it possible to prevent interference between both of the sensing units.
Preferably, in the above information input device, the first sensing unit and the second sensing unit respectively use infrared lights of different wavelengths. This also makes it possible to prevent interference between both of the sensing units.
Preferably, in the above The information input device, the infrared light emitting unit in the first sensing unit and the infrared light emitting unit in the second sensing unit have respectively different emission timings. This also makes it possible to prevent interference between both of the sensing units.
Preferably, in the above information input device, the first sensing unit includes a camera module constructed from a combination of a camera for capturing a color image and an infrared camera for acquiring depth information.
Preferably, the above information input device further includes a projection position control unit which, based on image data captured by the first sensing unit, identifies a target object on which the information input image is to be projected, and controls the movable support unit so as to cause the projection unit to project the information input image by tracking the position of the target object.
According to the above information input device, it is possible to always monitor the sensing region containing the projection position on which the information input image is to be projected by the projection unit, and detect an object entering that region and a distance to the object, since the second sensing unit is mounted on the movable support unit together with the projection unit. Then, it is possible to substantially reduce the possibility of irradiating a body part to be protected such as a human eye for a long time with visible laser light, since the identification control unit identifies, based on information acquired by the second sensing unit, the presence or absence of a particular object such as a human eye or face and, if the entering of a particular object is detected, then causes the projection unit to stop projecting the information input image.
Hereinafter, with reference to the accompanying drawings, an information input device will be explained. However, it should be noted that the technical scope of the present invention is not limited to embodiments thereof, and includes the invention described in claims and equivalents thereof. In the explanation of the drawings, the same symbols are attached to the same or corresponding elements, and duplicated explanation is omitted. The scale of members is appropriately changed for explanation.
As shown in
The pan head 10 includes a base 11 fixed to a mounting frame 2 shown by dashed lines in
The first sensing unit 20 is fixed to the base 11 of the pan head 10, and includes a first infrared light emitting unit 21 and an infrared camera 22. The second sensing unit 25 is mounted to the second rotating part 13 of the pan head 10 together with the projection unit 30a of the projection device 30, and includes a second infrared light emitting unit 26 and an infrared light sensing unit 27.
The projection device 30 is constructed from an ultra-compact projector using visible laser light sources, one for each of the RGB colors, and the projection unit (projection head) 30a is mounted to the second rotating part 13 of the pan head 10. Based on the image data received from the control unit 50, the projection device 30 projects an information input image 70 onto a desired position on a table 40 which serves as the projection surface.
The projection device 30 includes, for example, visible laser light sources, a fiber pigtail module, an RGB fiber combiner, a visible single-mode fiber, and the projection unit 30a which is a projection head. The visible laser light sources are RGB light sources each constructed from a semiconductor laser (laser diode). The fiber pigtail module introduces the RGB laser lights from the respective laser light sources into R, G, and B laser light guiding fibers, respectively. The RGB fiber combiner combines the lights from the R, G, and B laser light guiding fibers. The visible single-mode fiber guides the combined light to the projection unit 30a. The projection unit 30a projects the information input image by using the thus guided visible laser light.
All the parts, except the visible single-mode fiber and the projection unit 30a, may be accommodated inside the base 11 of the pan head 10 together with the control unit 50, or a separate control box may be mounted on the mounting frame 2 to accommodate them. Since the projection unit 30a is mounted to the second rotating part 13 of the pan head 10 so that the projection direction can be changed as desired by rotating the first and second rotating parts 12 and 13, the projection position of the information input image 70 can be changed as desired.
The projection device 30 may be constructed from a projector using a monochromatic visible laser light source, etc., as long as the projector is designed to be able to project a predetermined information input image. Further, if the projection device 30 can be made ultra compact in size, the device in its entirety may be mounted to the second rotating part 13 of the pan head 10. In the example of
In operation of the first sensing unit 20, infrared light is emitted from the first infrared light emitting unit 21 to irradiate an entire sensing region 80 within which the information input image 70 can be projected, and a reflection of the infrared light reflected from an object located within the sensing region 80 is received by the infrared camera 22 for imaging. The first sensing unit 20 supplies to the control unit 50 position coordinate data and depth data (data pertaining to the distance between the infrared camera 22 and the captured object corresponding to the target pixel) for each pixel of the image captured by the infrared camera 22. In the example shown in
The first infrared light emitting unit 21 is constructed using an infrared light emitting semiconductor laser (laser diode). In the infrared wavelength range, near-infrared laser light of wavelength in the range of 1400 nm to 2600 nm is called “eye-safe laser” because it does not reach the retina of the human eye and is thus relatively harmless to the eye. It is therefore preferable to use laser light in this wavelength range. However, since using laser light in this wavelength range requires the use of, for example, an expensive InGaAs-based infrared camera to detect its reflection, a low-cost Si-based CMOS or CCD camera may be used in practice. In that case, it is preferable to use a semiconductor laser whose oscillation wavelength is longer than the visible region of the spectrum and falls within a range of 800 nm to 1100 nm to which the Si-based CMOS or CCD camera has sensitivity.
As shown in
In operation of the second sensing unit 25, infrared light emitted from the second infrared light emitting unit 26 is projected over a predetermined region containing the projection position of the information input image 70, and light reflected from an object entering that region is received and sensed by the infrared light sensing unit 27. Then, the second sensing unit 25 supplies the position coordinate data of the object and the depth data representing the distance to the object to the control unit 50.
The second infrared light emitting unit 26 is also constructed using an infrared light emitting semiconductor laser (laser diode), and it is preferable to use an eye-safe laser as in the case of the first infrared light emitting unit 21. However, since an expensive InGaAs-based infrared sensor, for example, has to be used in the case of the wavelength region longer than 1400 nm, a low-cost Si-based photodiode may be used in practice. In that case, it is preferable to use a semiconductor laser whose oscillation wavelength is longer than the visible region of the spectrum and falls within a range of 800 nm to 1100 nm to which the Si-based photodiode has sensitivity.
The infrared light sensing unit 27 includes a photodiode as a light receiving element. The infrared light sensing unit 27 further includes a calculating unit which calculates the position coordinate data of the object from such parameters as the signal sensed by the photodiode, the ratio between the intensity of the sensed signal and the intensity of the emitted infrared laser light, and the projection angle of the infrared laser, and calculates the depth data, i.e., the distance to the detected object, by using a TOF method. However, the function of this calculating unit may be incorporated in the control unit 50.
The TOF (time-of-flight) method is a distance measuring method by which the distance to a target object is calculated based on the time of flight of light (delay time) from the time the light emitted from a light source to the time the light reflected from the object reaches a sensor and on the speed of light (=3×108 m/s). In the example shown in
In the second sensing unit 25 also, a polarizer 28 is placed on the front of the second infrared light emitting unit 26, as shown in
Thus, the first sensing unit 20 and the second sensing unit 25 respectively use mutually perpendicular linearly polarized infrared lights, as described above. With this arrangement, when the irradiated object has the characteristic that the depolarization occurring on it is small, the S/N ratio can be improved by reducing the interference between the infrared light received by the infrared camera 22 and the infrared light received by the infrared light sensing unit 27.
More specifically, the second sensing unit 25 is preferably configured as shown, for example, in
Then, the polarizer 28, a beam splitter 250, and the MEMS mirror 251 as a scanning unit are arranged in this order along the optical axis of the infrared light emitted from the second infrared light emitting unit 26. The beam splitter 250 and the MEMS mirror 251 are arranged so that the half-reflecting face of the beam splitter 250 and the mirror face of the MEMS mirror 251 in its neutral position are each oriented at an angle of about 5° to 45° with respect to the optical axis of the emitted infrared light. The polarizer 29 is disposed between the infrared light sensing unit 27 and the beam splitter 250.
The MEMS mirror 251, one example of which is shown in the top plan view of
The MEMS mirror 251 is formed from a one-piece plate. The first and second supporting members 251d and 251e have elasticity and, when subjected to external forces, allow the mirror face 251a to rotate (vibrate) by resonating in two dimensions at its natural frequency of vibration within a range limited by the elasticity. The MEMS mirror 251 may employ a method in which the second supporting members 251e are driven in a resonant mode and the first supporting members 251d are forcefully driven without using resonance. Means for applying external forces include an electromagnetic coil, a piezoelectric element, etc.
The rotation directions indicated by arrows “a” and “b” in
Instead of the MEMS mirror 251 rotating or vibrating in two dimensions as described above, a combination of two vibrating mirrors, such as MEMS mirrors, each of which rotates or vibrates in one dimension, may be used as the scanning unit. If the beam splitter 250 is constructed from a polarizing beam splitter, the polarizers 28 and 29 can be omitted.
The control unit 50 includes a microcomputer including a CPU 51, RAM 52, ROM 53, and I/O 54. The CPU 51 is a central processing unit that performs various calculations and processing. The ROM 53 is a read-only memory that stores fixed data and operating programs to be executed by the CPU 51. The RAM 52 is a random-access memory that temporarily stores input data and other data being processed by the CPU 51. The I/O 54 is an input/output port for transmitting and receiving data to and from the pan head 10, the first sensing unit 20, the projection device 30, and a control target apparatus 60. The control unit 50 may further include a nonvolatile RAM (NVRAM) and a hard disk drive (HDD).
The control unit 50 functions as an information input detection unit which detects information input by identifying, based on the image data captured by the first sensing unit 20 or also based on the information acquired by the second sensing unit 25, an image of an input operation such as an operation performed by a fingertip, etc., on the information input image 70 projected from the projection unit 30a of the projection device 30. The control unit 50 supplies the detected information input data to the control target apparatus 60. The control unit 50 further functions as an identification control unit which identifies, based on the information acquired by the second sensing unit 25, the presence or absence of a particular object entering the predetermined region and, if the entering of a particular object is detected, then issues a projection control signal and thereby causes the projection unit 30a of the projection device 30 to stop projecting the information input image 70.
The control unit 50, which controls the driving of the first and second motors 15 and 16 of the pan head 10 in accordance with control data, can project the information input image 70 onto a desired position on the table 40 by rotating the first and second rotating parts 12 and 13 in
The control target apparatus 60 is, for example, an air-conditioner, a network access apparatus, a personal computer, a television receiver, a radio receiver, or a recording and playback apparatus of a recording medium such as a CD, DVD, or VTR, and performs various kinds of processing based on the information input data.
First, a display and an operation unit (keyboard and mouse) not shown are connected to the control unit 50 via the I/O 54. Then, an image based on the image data captured by the infrared camera 22 in the first sensing unit 20 is produced on the display under the control of the control unit 50; in this condition, the process waits until the user specifies the position of the projection surface by using the operation unit (S10). When the position of the projection surface is specified, the control unit 50 stores the position coordinate data indicating the range of the projection surface in the RAM 52, etc., (S11). Once the initialization is performed and initial data are stored at the time of installation, the above initialization steps S10 and S11 can be omitted in the next and subsequent power-up processes, as long as the installation place and conditions remain unchanged.
Next, the control unit 50 acquires the depth data of the projection surface 41 from the first sensing unit 20 (S12), and stores the depth data in the RAM 52 for each pixel contained in the region specified as the projection surface 41 (S13).
Next, the control unit 50 transmits predetermined image data to the projection device 30 to project a reference projection image 71 from the projection unit 30a onto the projection surface 41, and transmits predetermined control data to the pan head 10 to move the reference projection image 71 to a reference position by controlling the pan head 10 (S14). The reference projection image 71 is one that contains five black dots displayed within a circular frame, as indicated by each of reference numerals 71-1 to 71-7 in
Next, the control unit 50 acquires the position coordinate data from the first and second sensing units 20 and 25 (S15). Then, using the five black dots, the control unit 50 identifies the position of the reference projection image 71 (S16), and stores a mapping between the control data transmitted to the pan head 10 and the position coordinate data of the identified reference projection image 71 in a data table constructed within the RAM 52 (S17).
After that, the control unit 50 determines whether the reference projection image 71 has been moved to every possible region on the projection surface 41 (S18). If there is any remaining region (No in S18), the process returns to S14. In this way, the control unit 50 repeats the process from S14 to S17 by sequentially moving the reference projection image 71 from 71-2 through to 71-7 in
By repeating the process from S14 to S17 a certain number of times, the control unit 50 completes the construction of the data table that provides a mapping between the control data and the position coordinate data of the projected image for the entire area of the projection surface 41. Then, when it is determined by the control unit 50 that the reference projection image 71 has been moved to every possible region on the projection surface 41 (Yes in S18), the process of
Using the completed data table, the control unit 50 can control the pan head 10 so that the projected image from the projection unit 30a is moved to the desired position on the specified projection surface 41. Conversely, by using the data table, the control unit 50 can identify the position of the currently projected image on the projection surface 41.
Based on the image data to be transmitted to the projection device 30, the control unit 50 can identify the kinds of the input buttons contained in the information input image and the positions of the buttons on the information input image. Further, the control unit 50 can identify the position of the information input image on the projection surface 41, based on the data table constructed in S17 of
First, the control unit 50 acquires the image data to be transmitted to the projection device 30 and the control data transmitted to the pan head 10 (S20). Then, the control unit 50 acquires the position coordinate data and depth data from the first and second sensing units 20 and 25 (S21). The order of S20 and S21 may be interchanged.
Next, based on the position coordinate data acquired in S21, the control unit 50 identifies image contour regions (S22). More specifically, the control unit 50 identities the contour regions of an entering object (for example, a hand's contour region 90 such as shown in
Next, based on the depth data acquired in S21, the control unit 50 groups together the contour regions having substantially the same depth data from among the contour regions identified in S22 (S23).
Next, based on the contour regions grouped together in S23, the control unit 50 identifies the positions at which the entering object has entered the projection surface and the position of the fingertip (S24).
In the example of
Next, the control unit 50 determines whether the entering object is performing an information input operation (S25). Even if the entering object exists within the sensing region 80 shown in
The control unit 50 determines whether the point E3 of the fingertip position is located on the projection surface 41 or not, based on whether the difference between the depth data of the projection surface 41 acquired in advance in S12 of
The depth data of the point E3 of the fingertip position may fluctuate over a short period of time because of chattering, etc. Accordingly, in order to prevent an erroneous detection, the control unit 50 may determine that an information input has been done only when the difference between the depth data of the point E3 of the fingertip position and the depth data of the projection surface 41 at the position coordinates representing the point E3 has remained within the predetermined threshold continuously for a predetermined length of time (for example, one second or longer).
If it is determined by the control unit 50 that the fingertip at the detected position is intended for an information input operation (Yes in S25), the position on the projection surface 41 of each input button contained in the information input image 70, such as shown in
When the position of each input button on the projection surface 41 is identified in S26, the control unit 50 identifies the kind of the information input operation, based on the point E3 of the fingertip position identified in S24 and the position of each input button on the projection surface 41 identified in S26 (S27). For example, if the coordinates of the point E3 of the fingertip position lie within the range of the playback button 72 shown in
Next, the control unit 50 performs processing corresponding to the kind of the information input operation identified in S27 on the control target apparatus 60 shown in
The process flow of
Next, a description will be given of how to detect a particular object, such as a human face, eye, etc., entering the projection space through which the information input image 70 is projected from the projection unit 30a onto the table 40 in
First, the control unit 50 determines whether the projection device 30 is projecting an information input image (S30) and, if it is projecting an information input image (Yes in S30), then activates the second sensing unit 25 (S31). Alternatively, the control unit 50 may activate the second sensing unit 25 in S31 when an information input image is being projected and further an object is detected at a position spaced more than a predetermined distance away from the projection surface 41 (the table 40) within the sensing region 80 based on the sensing information (position coordinate data and depth data) acquired from the first sensing unit 20.
If it is determined in S30 that the projection device 30 is not projecting an information input image (No in S30), or if it is determined that the projection device 30 is not projecting an information input image and further no object is detected at any position spaced more than a predetermined distance away from the projection surface 41 based on the sensing information acquired from the first sensing unit 20, the process may wait until an information input image is projected and an object is detected, or the process of
When the second sensing unit 25 is activated, the control unit 50 acquires the position coordinate data and depth data of the object detected at each scan point within the predetermined region (S32).
Then, based on the acquired position coordinate data, the control unit 50 identifies the contour regions of the object (S33). Further, based on the depth data, the control unit 50 groups together the contour regions having substantially the same depth data (S34). After that, the control unit 50 determines whether any object has been detected by the first sensing unit 20 (S35). If no object has been detected (No in S35), the process is terminated. On the other hand, if any object has been detected (Yes in S35), the control unit 50 determines whether the detected object indicates the detection of the entering of a particular object, based on the grouping of contour region data by the second sensing unit 25 (S36). More specifically, the control unit 50 determines whether the entering of a particular object has been detected or not, for example, by checking whether or not a contour pattern having a depth within a predetermined range is approximate or similar to any one of the particular object patterns prestored in the ROM 53, etc.
For this purpose, pattern data representing the characteristic features of the body parts to be protected, for example, a human eye, nose, ear, mouth, face, face contour, etc., are prestored as detection target data of particular objects in the ROM 53, etc.
If it is determined that the detected object does not indicate the detection of the entering of a particular object (No in S36), the process of
In this way, when the entering of a particular object is detected, the emission of the RGB visible laser light from the projection unit 30a shown in
As described above, when the information input image is being projected, or when the information input image is being projected and further the presence of an object that is likely to be a particular object is detected within the sensing region 80 based on the sensing information acquired from the first sensing unit 20, the control unit 50 activates the second sensing unit 25 which can always scan at high speed across the predetermined region containing the projection region where the information input image is projected from the projection unit 30a. Then, when the entering of a particular object such as a human eye or face has entered the projection region, the second sensing unit 25 quickly and accurately detects it by using the TOF method based on the sensing information, and thus the projection device 30 can be caused to stop projecting the information input image 70. This serves to greatly improve the safety.
Since the refresh rate of the infrared camera 22 is about 30 frames per second, it is not possible to track quick movement of the human face, etc., by simply using the sensing information acquired from the first sensing unit 20. Therefore, by making use of the high-speed capability of the second sensing unit 25, the human face or eye entering the image projection area is quickly detected and the emission of the visible laser light is stopped. Furthermore, since the second sensing unit 25 is integrally mounted to the second rotating part 13, i.e., the movable supporting member of the pan head 10, together with the projection unit 30a of the projection device 30, even if the projection region of the information input image 70 projected from the projection unit 30a is moved, the second sensing unit 25 can always scan at high speed across the predetermined region containing the projection region of the information input image 70.
The projection device 30 such as a laser projector, shown in
The projection device 30, which projects the information input image 70 by using the visible laser light, has the characteristic of being able to always project the information input image 70 with a good focus on the projection surface irrespectively of the distance between the projection surface and the projection unit 30a (focus-free characteristic). It will be appreciated that any suitable projection device other than the projector using the RGB color lasers may be used, as long as it is designed to be able to project a predetermined information input image.
In the example of
The control unit 50 shown in
Based on the information acquired by the infrared camera 22 in the first sensing unit 20, the control unit 50 identifies the target body part, i.e., the palm of the user's left hand 180, on which the information input image 70 is to be projected. Then, the control unit 50 controls the first and second motors 15 and 16 in accordance with control data so as to track the position of the target body part, and thereby causes the projection unit 30a to project the information input image 70 on the palm of the user's left hand 180.
When the control unit 50 controls the first motor 15 of the pan head 10 so that the first rotating part 12 shown in
That is, the control unit 50 functions as a projection position control unit which tracks the position of the palm of the user's left hand 180 as the target body part and changes the projection position of the information input image 70 accordingly. The control unit 50 also functions as an information input detection unit which detects an information input operation performed on the information input image 70, based on the sensing information acquired from the first sensing unit 20 or the second sensing unit 25.
First, the control unit 50 acquires the position coordinate data and depth data from the first sensing unit 20 (S40). Next, based on the position coordinate data acquired in S40, the control unit 50 identifies the regions containing object contours (S41). Then, based on the depth data acquired in S40, the control unit 50 groups together the regions having substantially the same depth data from among the regions containing the contours (S42).
Next, the control unit 50 determines whether the object contour regions grouped together in S42 represent the target body part which is the body part forward of the wrist, by comparing their pattern against the patterns prestored in the ROM 53, etc., (S43). For example, when the user is sitting, a plurality of groups of contour regions (legs, face, shoulders, etc.) of the entering object may be detected, but only the target body part, which is the body part forward of the wrist, can be identified by pattern recognition.
If it is determined in S43 that the entering object is the user's left hand 180 which is the target body part, the control unit 50 detects the palm region 200 indicated by a dashed circle on the left hand 180 in
The palm region 200 is detected from the contour (outline) of the identified left hand 180, for example, in the following manner. In
Next, the control unit 50 derives the spatial coordinates (x,y,z) of the center point N6 from the position data (x,y) and depth data (r) of the center point N6 of the palm region 200. Then, using the data table constructed in S17 of
Therefore, the control unit 50 performs control so that the information input image 70 will always fit within the palm region 200 by increasing or reducing the size of the projected image based on the depth data of the center point N6 shown in
First, the control unit 50 determines whether the target body part (the part forward of the left wrist) has been identified or not (S50), and proceeds to carry out the following steps only when the target body part has been identified.
When the target body part has been identified in S50, the control unit 50 acquires the image data transmitted to the projection device 30 and the control data transmitted to the pan head 10 (S51). Next, the control unit 50 acquires the position coordinate data and depth data primarily from the second sensing unit 25 (S52). The order of S51 and S52 may be interchanged.
Next, the control unit 50 identifies the contour data of the detected object, based on the position coordinate data acquired in S52 (S53). Then, based on the depth data acquired in S52, the control unit 50 groups together the contour regions having substantially the same depth data (S54). Further, based on the contour regions thus grouped together, the control unit 50 identifies the entry positions through which the entering object has entered the palm region 200 and the position of the fingertip (S55). There may be more than one entering object on which grouping is done in S54, but the control unit 50 identifies only the object having position coordinates (x,y) within the range of the palm region 200 as being the entering object.
Alternatively, the contour region contained in the right hand 190 and located at the position farthest from the midpoint 03 between the entry positions 01 and 02 may be identified as the position of the fingertip. The above method of identifying the position of the fingertip based on the entry positions of the right hand 190 is only one example, and the position of the fingertip may be identified using some other suitable method.
Next, the control unit 50 determines whether the right hand 190 as the entering object is performing an information input operation (S56). Even if the right hand 190 exists within the palm region 200, the right hand 190 may have merely entered the palm region 200 without any intention of performing an information input operation. Therefore, if, for example, the point 05 of the fingertip position is located on the palm region 200, then the control unit 50 determines that the fingertip of the right hand 190 is performing an information input operation.
The control unit 50 determines whether the point 05 of the fingertip position is located on the palm region 200 or not, based on whether the difference between the depth data of the palm region 200 and the depth data of the point 05 of the fingertip position lies within a predetermined threshold (for example, within 10 mm).
The depth data of the point 05 of the fingertip position may fluctuate over a short period of time because of chattering, etc. Accordingly, in order to prevent an erroneous detection, the control unit 50 may determine that an information input has been done only when the difference between the depth data of the point 05 of the fingertip position and the depth data of the palm region 200 has remained within the predetermined threshold continuously for a predetermined length of time (for example, one second or longer).
If it is determined by the control unit 50 that the fingertip at the detected position is intended for an information input operation (Yes in S56), the position on the palm region 200 of each input button contained in the information input image 70 projected on the palm region 200 as shown in
Next, the control unit 50 identifies the kind of the information input operation, based on the point 05 of the fingertip position identified in S55 and the position of each input button on the palm region 200 identified in S57 (S58). For example, if the coordinates of the point 05 of the fingertip position lie within the range of the playback button 72 as shown in
After that, the control unit 50 performs processing corresponding to the kind of the information input operation identified in S58 on the control target apparatus 60 (S59), and terminates the sequence of operations. For example, if the operation indicated by the identified information input is “playback”, the control unit 50 sends a “playback” signal to the control target apparatus 60.
On the other hand, if it is determined by the control unit 50 that the fingertip at the detected position is not intended for an information input operation (No in S56), the process of
The process flow of
In the process flow of
In S50 of
When performing an information input operation on the information input image 70 projected by using the user's body part such as the palm of his/her hand as the projection surface, as described above, the user's face 100 or eye 101 tends to enter the projection region indicated by dashed lines in
To prevent the interference between the infrared light emitted from the first sensing unit 20 and the infrared light emitted from the second sensing unit 25 shown in
Alternatively, a time multiplexing method may be employed to prevent the occurrence of interference; in this case, the first infrared light emitting unit 21 in the first sensing unit 20 and the second infrared light emitting unit 26 in the second sensing unit 25 are controlled to emit the infrared lights at different emission timings, that is, staggered emission timings. It is also possible to prevent the occurrence of interference by suitably combining the above methods.
Further, the infrared camera 22 shown in
For example, when a color camera for capturing a color image is used, color data such as RGB can also be detected. As a result, even when a ring or a wrist watch or the like is worn on the hand, finger, or arm to be detected, such objects can be discriminated based on the color data, and only the skin-tone image region of the hand can be accurately identified.
Further, the projection device 30 may be configured to also serve as the second infrared light emitting unit 26 in the second sensing unit 25. In that case, the infrared beam as well as the visible laser light for projecting the information input image, for example, is projected from the projection unit 30a onto the projection surface, and the infrared light sensing unit such as a photodiode receives the light reflected from an object and passed through an infrared band-pass filter.
In the projection unit 31, the laser light emitted from the single-mode fiber 32 is directed toward a MEMS mirror 31b through an illumination optic 31a, and the light reflected from the MEMS mirror 31b is projected on the earlier described projection surface through a projection optic 31c. By vibrating the MEMS mirror 31b about mutually perpendicular two axes, the laser light being projected can be scanned at high speed in a two-dimensional fashion. In this way, the projection device 30 can be configured to also serve as the second infrared light emitting unit 26. Further, a beam splitter may be inserted in the path between the illumination optic 31a and the MEMS mirror 31b in
Instead of the earlier described TOF method, a random dot pattern method may be used by the second sensing unit to measure the distance to the detected object. In the TOF method, since the computation has to be performed at high speed at all times in order to obtain high resolution in real time, the CPU 51 is required to have a high computational capability. On the other hand, the random dot pattern method is a method that is based on the principle of triangulation, and that calculates the distance from the amount of horizontal displacement of the pattern by utilizing the autocorrelation properties of an m-sequence code or the like and detects as the autocorrelation value the lightness and darkness of the pattern overlapping caused by the bit shifting of the obtained image data. By repeatedly performing cross-correlation processing with the original pattern, the method can detect the position with the highest correlation value as representing the amount of displacement.
Further, in the random dot pattern method, the whole process from the generation of the random dot pattern to the comparison of the patterns can be electronically performed by storing the original m-sequence code pattern in an electronic memory and by successively comparing it with reflection patterns for distance measurement. In this method, since the dot density can be easily changed according to the distance desired to be detected, highly accurate depth information can be obtained, compared with a method that optically deploys a random dot pattern in space by a projection laser in combination with a fixed optical hologram pattern. Furthermore, if part of the function, such as the generation of the random dot pattern, is implemented using a hardware circuit such as a shift register, the computational burden can be easily reduced.
Let L denote the distance from the line image sensor 127 to the reference plane serving as the original pattern, and W denote the value representing the amount of horizontal displacement of a specific pattern generated by the scanning of the MEMS mirror 251 and converted to the amount of displacement on the reference plane located at the distance L; then, from the principle of triangulation, the distance Z to the object is obtained from the following equation.
Z=(d·L)/(d+W) (1)
For each line scan of the MEMS mirror 251, the line image sensor 127 integrates the random dot pattern reflected from the object, and acquires the result as one-dimensional image information. The control unit 50 in
Since the line image sensor 127 is one dimensional (rectilinear), only the depth data on a one-dimensional line can be obtained, unlike the case of the commonly used two-dimensional dot pattern. However, since the line image sensor 127 is synchronized to each line scan of the MEMS mirror 251, it is possible to determine the line position located in the direction perpendicular to the line scan direction and held within the frame generated by the MEMS mirror 251. As a result, it is possible to convert the acquired data to two-dimensional data. Furthermore, since the presence or absence of a particular object is determined by also using the image data captured by the first sensing unit, the deficiency that only the depth data on one-dimensional line can be obtained by the line image sensor 127 does not present any problem in practice.
Since the second sensing unit 125 can track the movement of the object and measure the distance to the object on a per line scan basis, as described above, it becomes possible, despite its simple configuration, to measure the distance at high speed even when the object is moving.
Another method for measuring the distance to the detected object is the PSD method. This method detects the light intensity centroid position of the infrared light reflected from the object by using a position sensitive device (PSD) as the infrared light sensing unit instead of the line image sensor 127. Similarly to the random dot pattern method, the PSD method measures a change in the distance to the object from the amount of horizontal positional displacement by using the principle of triangulation, and a change in the angle of reflection off of the object due to the positional change in the horizontal direction is detected as a change in the light intensity centroid position. In the case of the line image sensor, the control unit 50 needs to construct the entire image from the amount of received light measured on each cell of the sensor, but in the case of the PSD method, since information representing the light intensity centroid position is output from the position sensitive device itself, it becomes possible to detect any positional change in the horizontal direction by just monitoring this information, and thus the distance to the object can be measured. This offers the advantage of being able to further simplify the configuration of the control unit 50.
While various embodiments and modified examples of the information input device according to the present invention have been described above, the information input device is not limited to any particular example described herein, but it will be appreciated that various other changes, additions, omissions, combinations, etc., can be applied without departing from the scope defined in the appended claims.
The present invention can be used as an information input device for virtual remote control that remotely controls various kinds of control target apparatus such as, for example, an air-conditioner, a network access apparatus, a personal computer, a television receiver, a radio receiver, or a recording and playback apparatus of a recording medium such as a CD, DVD, or VTR.
Number | Date | Country | Kind |
---|---|---|---|
2012-186611 | Aug 2012 | JP | national |
2013-066186 | Mar 2013 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2013/072463 | 8/22/2013 | WO | 00 |