FIELD OF THE INVENTION
This invention generally relates to a 3-D path detection system, and more particularly to a 3-D path detection system that combines computer vision and Doppler radar technologies for hand gesture recognition applications.
BACKGROUND OF THE INVENTION
Recently, augmented reality (AR) has been gaining increasing attention for use in mobile phones, and hand gesture recognition (HGR) is considered an important interaction technique in mobile AR applications. By using HGR technology, users can directly see their interaction with the virtual image on the mobile phone's display. The mainstream technologies for HGR are based on 3-D cameras such as Microsoft Kinect, Intel RealSense and Leap Motion. In a 3-D camera, RGB and infrared (IR) cameras are used to produce images and their depth information, respectively, for HGR. However, a mobile phone equipped with an extra IR camera has the disadvantages of extra cost and greater power and resource consumption. Additionally, IR cameras function poorly outdoors or in bright light. These disadvantages of IR cameras restrict the evolution of HGR in mobile phones.
SUMMARY
The primary object of the present invention is to utilize an image capture device to produce dynamic images of a moving object for extracting the x- and y-direction (transverse) pixel-value displacements from a captured moving object image, utilize a radar device to detect the z-direction (longitudinal) displacement of the moving object, and utilize a computing module to construct a 3-D path of the moving object according to the obtained x- and y-direction pixel-value and z-direction displacements.
The 3-D path detection system of the present invention includes an image capture device, a radar device and a computing module. The image capture device is configured to produce dynamic images of a moving object, and is configured to extract an x-direction (transverse horizontal) pixel-value displacement and a y-direction (transverse vertical) pixel-value displacement from a captured moving object image. The radar device is configured to transmit a wireless signal input from a wireless communication device to the moving object and receive a reflection signal from the moving object, and is configured to detect a z-direction displacement of the moving object according to a Doppler shift in the reflection signal. The computing module is coupled to the image capture device and the radar device, and is configured to construct a 3-D path of the moving object using the results of the x- and y-direction pixel-value and z-direction displacements.
The 3-D path detection system of the present invention utilizes the image capture device to obtain the x- and y-direction pixel-value displacements of the moving object image and utilizes the radar device to obtain the z-direction displacement of the moving object, such that the computing module can construct the 3-D path of the moving object according to the obtained x- and y-direction pixel-value and z-direction displacements. The radar device does not have its own RF signal source, but uses the wireless signal input from the wireless communication device to detect the z-direction displacement of the moving object. Therefore, interference between the radar device and the wireless communication device can be avoided. The present invention can be used on mobile phones to carry out HGR without an extra IR camera or RF signal source, and therefore requires fewer computation resources and less power than 3-D camera technology. In addition, the radar device, which uses the wireless signal to detect the moving object, is not affected by outdoor use or bright light. For these reasons, the 3-D path detection system of the present invention can increase the popularity of HGR applications in mobile phones.
DESCRIPTION OF THE DRAWINGS
FIG. 1 is a functional block diagram illustrating a 3-D path detection system in accordance with one embodiment of the present invention.
FIG. 2 is a functional block diagram illustrating an image capture device in accordance with one embodiment of the present invention.
FIG. 3 is a schematic diagram illustrating a mobile phone producing a dynamic image in accordance with one embodiment of the present invention.
FIG. 4 is a schematic diagram illustrating a radar device detecting a moving object in accordance with one embodiment of the present invention.
FIG. 5 is a circuit diagram illustrating the radar device in accordance with one embodiment of the present invention.
FIG. 6 is a schematic diagram illustrating the moving object moving along a straight path in the xz plane at a predetermined angle relative to the x-axis in accordance with one embodiment of the present invention.
FIG. 7 is a schematic diagram illustrating the moving object moving along a straight path in the yz plane at a predetermined angle relative to the y-axis in accordance with one embodiment of the present invention.
FIG. 8 is a schematic diagram illustrating the 3-D path detection system constructing the 3-D path of one hand gesture in accordance with one embodiment of the present invention.
FIG. 9 is a schematic diagram illustrating the 3-D path detection system constructing the 3-D path of another hand gesture in accordance with one embodiment of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
FIG. 1 is a functional block diagram of a 3-D path detection system 100 in one embodiment of the present invention. The 3-D path detection system 100 includes an image capture device 110, a radar device 120 and a computing module 130. The image capture device 110 and the radar device 120 are coupled to the computing module 130, and the image capture device 110, the radar device 120 and the computing module 130 are separate, non-integrated electronic products in this embodiment. However, they can be integrated in a mobile phone in other embodiments.
With reference to FIGS. 2 and 3, the image capture device 110 includes an RGB camera 111 and an image processing module 112. The RGB camera 111 produces a dynamic image P, which is viewed as a moving object image OP with a background. In FIG. 3, the moving object image OP represents a user's hand moving along an arc path in this embodiment; it may represent different moving objects in other embodiments. Referring to FIG. 2, the image processing module 112 of the image capture device 110 receives the dynamic image P from the RGB camera 111 and then performs a background subtraction to compute an x-direction (transverse horizontal) pixel-value displacement ΔPx and a y-direction (transverse vertical) pixel-value displacement ΔPy according to the moving object image OP. In this embodiment, the image processing module 112 of the image capture device 110 establishes a background model related to the dynamic image P and compares the pixel differences between the dynamic image P and the background model to capture the moving object image OP. The background model is, but is not limited to, a Gaussian mixture model.
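The background-subtraction step can be illustrated with a minimal sketch. It is not the implementation of this embodiment: it assumes grayscale frames stored as nested lists, substitutes a fixed background frame and a simple threshold for the Gaussian mixture model, and takes the pixel-value displacements as the shift of the foreground centroid between two frames. The function names, the threshold value and the test frames are hypothetical.

```python
def foreground_mask(frame, background, threshold=30):
    """Mark pixels that differ from the background by more than the threshold."""
    return [
        [abs(p - b) > threshold for p, b in zip(frame_row, bg_row)]
        for frame_row, bg_row in zip(frame, background)
    ]


def centroid(mask):
    """Return the (x, y) centroid of the foreground pixels, or None if there are none."""
    sum_x, sum_y, count = 0, 0, 0
    for y, row in enumerate(mask):
        for x, is_foreground in enumerate(row):
            if is_foreground:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        return None
    return sum_x / count, sum_y / count


def pixel_displacement(prev_frame, curr_frame, background, threshold=30):
    """Return (dPx, dPy) between two frames, or None if no object is detected."""
    c0 = centroid(foreground_mask(prev_frame, background, threshold))
    c1 = centroid(foreground_mask(curr_frame, background, threshold))
    if c0 is None or c1 is None:
        return None
    return c1[0] - c0[0], c1[1] - c0[1]


def make_frame(left, top, size=6):
    """Hypothetical test frame: a bright 2x2 blob on a dark background."""
    return [
        [200 if left <= x < left + 2 and top <= y < top + 2 else 10
         for x in range(size)]
        for y in range(size)
    ]


# The blob moves 2 pixels along x and 1 pixel along y (image rows count downward).
background = [[10] * 6 for _ in range(6)]
d_px, d_py = pixel_displacement(make_frame(1, 1), make_frame(3, 2), background)
```

In this sketch the y displacement follows the image row index, so a positive value means downward motion in the frame; the sign convention used by the embodiment is not stated and is assumed here.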
With reference to FIGS. 1 and 4, the radar device 120 is a passive radar without its own RF signal source in this embodiment. However, it can be an active radar with its own RF signal source in other embodiments. In FIG. 4, the radar device 120 is configured to transmit an input wireless signal ST to a moving object O and receive a reflection signal SR from the moving object O. The moving object O is the same object that appears as the moving object image OP. The reflection signal SR contains the Doppler shift associated with the z-direction (longitudinal) displacement of the moving object O, so the radar device 120 can detect a z-direction displacement Δz of the moving object O from the reflection signal SR. Preferably, the radar device 120 is a passive radar when the 3-D path detection system 100 of the present invention is applied to a mobile phone, because the passive radar offers both lower power consumption and less interference.
With reference to FIG. 5, the radar device 120 includes a coupler 121, an antenna 122, an injection-locked oscillator (ILO) 123, an amplifier 124, a demodulation circuit 125, a signal processing unit 126 and a bandpass filter 127.
With reference to FIG. 5, the input wireless signal ST is fed to the coupler 121 of the radar device 120, and the input wireless signal ST comes from a wireless communication device. In this embodiment, the input wireless signal ST is a Wi-Fi signal from a Wi-Fi access point, and the coupler 121 is a branch-line coupler. The coupler 121 couples the input wireless signal ST to the ILO 123, allowing the ILO 123 to operate in an injection-locked state and output a lock-in signal SL. The input wireless signal ST is also coupled to the bandpass filter 127 for out-of-band rejection and then delivered to the antenna 122. The antenna 122 radiates the input wireless signal ST to the moving object O and receives the reflection signal SR from the moving object O with the Doppler shift associated with the z-direction displacement of the moving object O. The reflection signal SR received by the antenna 122 is delivered to the bandpass filter 127 and then coupled to the amplifier 124 via the coupler 121. The demodulation circuit 125 is coupled to the ILO 123 and the amplifier 124 and utilized to quadrature demodulate an amplified RF signal SO output from the amplifier 124 using the lock-in signal SL output from the ILO 123.
With reference to FIG. 5, the demodulation circuit 125 is a quadrature demodulator in this embodiment. The demodulation circuit 125 includes a quadrature power splitter 125a, a power splitter 125b, a first mixer 125c, a second mixer 125d, a first low-pass filter 125e and a second low-pass filter 125f. The quadrature power splitter 125a is coupled to the ILO 123 for receiving the lock-in signal SL, and the quadrature power splitter 125a outputs an in-phase local oscillation signal SLO_I via a 0° output port and outputs a quadrature local oscillation signal SLO_Q via a 90° output port. The power splitter 125b is coupled to the amplifier 124 for splitting the amplified RF signal SO into two paths. The first mixer 125c is utilized to receive and mix the in-phase local oscillation signal SLO_I and the amplified RF signal SO in one path for producing a first mixer output signal M1. The second mixer 125d is utilized to receive and mix the quadrature local oscillation signal SLO_Q and the amplified RF signal SO in the other path for producing a second mixer output signal M2. The first low-pass filter 125e is arranged to receive and filter the first mixer output signal M1 from the first mixer 125c to obtain an in-phase signal I, and the second low-pass filter 125f is arranged to receive and filter the second mixer output signal M2 from the second mixer 125d to obtain a quadrature signal Q. The signal processing unit 126 is coupled to the demodulation circuit 125 for receiving and processing the in-phase signal I and the quadrature signal Q to obtain the z-direction displacement Δz of the moving object O based on the detection of the Doppler shift using an arctangent calculation method.
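The arctangent calculation can be sketched as follows, under stated assumptions rather than as the circuit's actual processing. The sketch assumes that the baseband phase equals 4πz/λ plus a constant (the usual two-way relation for a continuous-wave Doppler radar), that the carrier is at 2.4 GHz as a stand-in for the Wi-Fi signal (the frequency is not stated in this text), and that consecutive samples differ by less than half a cycle so the phase can be unwrapped. The function name and the synthetic test signal are hypothetical.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def displacement_from_iq(i_samples, q_samples, carrier_hz=2.4e9):
    """Return displacement (metres) of each sample relative to the first sample."""
    wavelength = SPEED_OF_LIGHT / carrier_hz
    prev_phase = math.atan2(q_samples[0], i_samples[0])
    unwrapped = 0.0
    displacements = [0.0]
    for i_val, q_val in zip(i_samples[1:], q_samples[1:]):
        phase = math.atan2(q_val, i_val)
        # Wrap the phase step into [-pi, pi) so 2*pi jumps of atan2 are removed.
        step = (phase - prev_phase + math.pi) % (2 * math.pi) - math.pi
        unwrapped += step
        prev_phase = phase
        # Two-way path: a displacement of z changes the phase by 4*pi*z/wavelength.
        displacements.append(unwrapped * wavelength / (4 * math.pi))
    return displacements


# Synthetic check: the target recedes 0.10 m in 1 mm steps with a fixed phase offset.
wavelength = SPEED_OF_LIGHT / 2.4e9
true_z = [0.001 * k for k in range(101)]
i_sig = [math.cos(4 * math.pi * z / wavelength + 0.3) for z in true_z]
q_sig = [math.sin(4 * math.pi * z / wavelength + 0.3) for z in true_z]
recovered = displacement_from_iq(i_sig, q_sig)
```

The total phase excursion in the synthetic signal exceeds π, so the unwrapping step is what keeps the recovered displacement continuous; without it the result would fold back every half wavelength.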
With reference to FIG. 1, the computing module 130 is coupled to the image capture device 110 and the radar device 120, and is required to perform a calibration procedure prior to constructing a 3-D path T. In the calibration procedure, as FIGS. 6 and 7 show, the moving object O moves along a straight path in the xz plane at a first angle θx relative to the x-axis, and then moves along a straight path in the yz plane at a second angle θy relative to the y-axis. The first and second angles θx and θy are predetermined. According to the x- and y-direction pixel-value displacements ΔPx and ΔPy of the moving object image OP and the z-direction displacement Δz of the moving object O that are obtained by the image capture device 110 and the radar device 120, respectively, in the calibration procedure, the computing module 130 can calculate a first conversion coefficient mx from the x-direction pixel-value displacement ΔPx to an x-direction displacement Δx and a second conversion coefficient my from the y-direction pixel-value displacement ΔPy to a y-direction displacement Δy for use in constructing the 3-D path T of the moving object O. The calculation formulas of the first and second conversion coefficients mx and my are given as follows:
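The equations are not reproduced in this text. The following reconstruction from the calibration geometry is an assumption, not the original formulas: for a straight path in the xz plane at angle θx to the x-axis, tan θx = Δz/Δx, and the conversion coefficient is defined by Δx = mx·ΔPx; the yz-plane case is analogous.

```latex
m_x = \frac{\Delta x}{\Delta P_x} = \frac{\Delta z}{\Delta P_x \tan\theta_x},
\qquad
m_y = \frac{\Delta y}{\Delta P_y} = \frac{\Delta z}{\Delta P_y \tan\theta_y}
```

Here Δz, ΔPx and ΔPy are the values measured during the corresponding calibration pass.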
For an arbitrary movement of the moving object O, the 3-D path detection system 100 of the present invention utilizes the image capture device 110 to provide the x- and y-direction pixel-value displacements ΔPx and ΔPy of the moving object image OP, utilizes the radar device 120 to provide the z-direction displacement Δz of the moving object O, and utilizes the computing module 130 to convert the x- and y-direction pixel-value displacements ΔPx and ΔPy of the moving object image OP into the x- and y-direction displacements Δx and Δy of the moving object O using the first and second conversion coefficients mx and my established through the calibration procedure. Accordingly, the computing module 130 can further construct the 3-D path T of the moving object O according to the obtained x-, y- and z-direction displacements Δx, Δy and Δz.
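The conversion and path construction can be sketched in a few lines. This is an illustration under assumptions: the coefficient formula m = Δz / (ΔP·tan θ) is inferred from the calibration geometry (a straight path at a known angle to the transverse axis) and is not quoted from this text, and the function names and all numeric values are made up for the example rather than measured.

```python
import math


def conversion_coefficient(delta_z, delta_p, angle_deg):
    """Metres per pixel from one calibration pass at a known angle to the x- or y-axis."""
    return delta_z / (delta_p * math.tan(math.radians(angle_deg)))


def build_path(pixel_steps, z_steps, m_x, m_y, start=(0.0, 0.0, 0.0)):
    """Accumulate per-frame displacements into a list of (x, y, z) points."""
    x, y, z = start
    path = [start]
    for (d_px, d_py), d_z in zip(pixel_steps, z_steps):
        x += m_x * d_px  # pixel displacement converted to metres along x
        y += m_y * d_py  # pixel displacement converted to metres along y
        z += d_z         # radar displacement is already in metres
        path.append((x, y, z))
    return path


# Illustrative calibration: 45-degree passes in which 0.10 m of z travel
# corresponds to 200 px horizontally and 250 px vertically.
m_x = conversion_coefficient(0.10, 200, 45.0)
m_y = conversion_coefficient(0.10, 250, 45.0)

# Illustrative measurement: three frame-to-frame steps.
path = build_path([(20, 0), (20, 10), (0, 10)], [0.01, 0.01, 0.02], m_x, m_y)
```

The sketch treats mx and my as constants; in practice the metres-per-pixel scale of a camera varies with distance, so the coefficients would apply near the range at which calibration was performed.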
In this embodiment, the radar device 120 is a passive radar which does not have its own RF signal source but uses the Wi-Fi signal to detect the z-direction displacement of the moving object. Therefore, the radar device 120 has the advantage of low power consumption and will not interfere with wireless communications of the mobile phone. In addition, the radar device, which uses the Wi-Fi signal to detect the moving object, is not affected by ambient light. For these reasons, the 3-D path detection system 100 of the present invention can increase the popularity of HGR applications in mobile phones.
With reference to FIGS. 8 and 9, the 3-D path detection system 100 of the present invention is used to construct the 3-D paths of two hand gestures. In more detail, the image capture device 110 is a mobile phone, the radar device 120 is a passive radar (not shown), the antenna 122 of the radar device 120 is a planar antenna, the computing module 130 is a laptop, and the image capture device 110 and the radar device 120 are coupled to the computing module 130. Referring to FIG. 8, a user holds the image capture device 110 in his left hand and moves his right hand right up and then right down, in forward motion (gradually away from the image capture device 110). Referring to FIG. 9, the user holds the image capture device 110 in his left hand and moves his right hand right forward (gradually away from the image capture device 110), turns the hand back and then moves it left backward (gradually closer to the image capture device 110). In FIGS. 8 and 9, the 3-D paths constructed by the 3-D path detection system 100 are similar to the actual hand gestures of the user, demonstrating that the 3-D path detection system 100 of the present invention, which combines computer vision and Doppler radar technologies, can be utilized to construct the 3-D path of hand gestures.
While this invention has been particularly illustrated and described in detail with respect to the preferred embodiments thereof, it will be clearly understood by those skilled in the art that it is not limited to the specific features shown and described, and that various modifications and changes in form and details may be made without departing from the spirit and scope of this invention.