This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2007-224891, filed Aug. 30, 2007, the entire contents of which are incorporated herein by reference.
1. Field
One embodiment of the present invention relates to an information processing device, program, and method. It particularly relates to an information processing device that recognizes gestures of the user and performs control based on a recognized gesture, as well as a program and a method for such a device.
2. Description of the Related Art
Methods for operating an information processing device such as a television and a personal computer in accordance with a user's instruction by a gesture have been suggested. With such methods, the information processing device can be remotely controlled without any input device such as a mouse, keyboard, or remote controller.
Methods of capturing a gesture or any other movement include an algorithm technology of analyzing an image photographed by a camera and detecting and recognizing a circular orbit movement of the operator's hand (see Jpn. Pat. Appln. KOKAI Publication No. 2007-172577).
With the method set forth in Jpn. Pat. Appln. KOKAI Publication No. 2007-172577, however, in which the roundness of the circular movement is calculated from several coordinate points, the circular movement may not be properly detected depending on the speed of the movement. Furthermore, it is difficult to switch in a seamless manner between a normal mode and an operating mode without checking whether or not the system is in operation, based only on the determination as to whether or not the hand movement is circular.
A general architecture that implements the various feature of the invention will now be described with reference to the drawings. The drawings and the associated descriptions are provided to illustrate embodiments of the invention and not to limit the scope of the invention.
Various embodiments according to the invention will be described hereinafter with reference to the accompanying drawings. In general, according to one embodiment of the invention, an information processing device includes a position detecting section configured to detect a position of a hand from an input image of the hand; a memory section configured to store data of the position of the hand detected by the position detecting section; a rotation judging section configured to judge, assuming that records of the data of the position of the hand stored in the memory section show a rotary movement, that a latest position of the hand falls in an angle range predicted for the rotary movement; and an executing section configured to, when the rotation judging section judges that the latest position of the hand falls in the angle range, obtain a rotational angle at the latest position of the hand and also execute a process that corresponds to a predetermined rotary movement of the hand.
Exemplary embodiments of the present invention will be explained below with reference to the drawings.
First, an information processing device according to the first embodiment of the present invention will be discussed with reference to
As illustrated in
The CPU 111 is a processor that controls the operation of the personal computer 100. The CPU 111 executes the operating system (OS) and various application programs loaded from the HDD 117 onto the main memory 112. The CPU 111 also executes the Basic Input/Output System (BIOS) that is stored in the BIOS-ROM 119. The BIOS is a program for controlling peripheral devices. The BIOS is implemented immediately after the power of the personal computer 100 is turned on.
The north bridge 113 is a bridge device that connects the local bus of the CPU 111 and the south bridge 116. The north bridge 113 conducts communications with the graphics controller 114 by way of the Accelerated Graphics Port (AGP) bus.
The graphics controller 114 is a display controller that controls the display 115 of the personal computer 100. The graphics controller 114 generates a display signal that is to be output to the display 115, from the display data written in the VRAM (not shown in the drawings) by the OS or an application program. The display 115 is, for example, a liquid crystal display (LCD).
The HDD 117, the coprocessor 118, the BIOS-ROM 119, the camera 126, and the EC/KBC 120 are connected to the south bridge 116. In addition, the south bridge 116 includes an Integrated Drive Electronics (IDE) controller to control the HDD 117 and the coprocessor 118.
The EC/KBC 120 is a one-chip microcomputer in which an embedded controller (EC) configured to manage power consumption and a keyboard controller (KBC) configured to control the touchpad 124 and the keyboard (KB) 125 are integrated. When the power switch 21 is operated, the EC/KBC 120 turns on the power of the personal computer 100, together with the power supply circuit 121. When power is externally supplied to the personal computer 100 by way of the AC adapter 123, the personal computer 100 is driven by an external power supply. When power is not externally supplied, the personal computer 100 operates on the battery 124.
The camera 126 may be a USB camera. The USB connector of the camera 126 is connected to a USB port (not shown in the drawings) provided on the main body of the personal computer 100. An image taken by the camera 126 (such as hand movement) can be displayed on the display 115 of the personal computer 100. The frame rate of the images supplied by the camera 126 may be 15 frames/second. The camera 126 may be externally provided or built in the personal computer 100.
The coprocessor 118 processes moving images taken from the camera 126.
As indicated in
First, the control section 11 of the personal computer 100 obtains moving images from the camera 125 and processes these images. The detection section 13 determines whether the processed images are images of a hand movement. When it is determined as images of a hand movement, it is determined whether the hand movement is a rotary movement (Step S101). When the hand movement is determined as a rotary movement (YES in Step S101: rotation start detected), the control section 11 calculates the rotational angle (Step S102). In accordance with the calculated rotational angle, the display section 14 displays the process such as image scrolling. For example, when a right-handed (left-handed) rotation is detected, the screen is scrolled down (up).
When the rotation end movement is detected (YES in Step S103), the screen scrolling or the like is stopped.
Next, the aforementioned detection of the rotation start movement, rotary movement, and rotation end movement will be explained in detail with reference to
(1) Detection of Rotation Start Movement
An input moving image will be regarded as a moving image frame. The input moving image is put into the storage section 12, and moving image frames of the past can be used as a reference.
First, a flag s, which denotes the rotational direction, is initialized (Step S201 in
Next, whether or not the hand in the outputs 20 of the past k frames before the nth frame (see
The angle θnk(i) and distance dnk(i) of the hand position of i−k+1(0≦i<k) frames before the nth frame to with respect to the center coordinates, and the image taking time tnk(n,k,i) with respect to the image taking time of the n−(k−1)th frame are calculated by the following expressions.
An integer that satisfies the following condition is adopted for the value a (Step S202:
A linear regression analysis is performed on θnk(i) and tnk(n,k,i), and on dnk(i) and tnk(n,k,i) by the method of least squares. Correlation coefficients Aθnk, Bθnk, Adnk, and Bdnk are thereby obtained to find the predicted value Θnk(i) of θnk(i), and also coefficients of determination which indicate the degree of the correlation are obtained.
It is assumed that CDθnk is the coefficient of determination for the correlation between θnk(i) and tnk(n,k,i) and that CDdnk is the coefficient of determination for the correlation between dnk(i) and tnk(n,k,i).
By use of the correlation coefficients Aθnk, Bθnk, Adnk and Bdθnk, Θnk(i) and Dnk(i) can be expressed as follows (see
Θnk(i)=Aθnktnk(i)+Bθnk
D
nk(i)=Adnktnk(i)+Bdθnk
The start angle θSnk(i) and the end angle θEnk(i) are obtained in such a manner that the angle range between the start to the end is minimized, by comparing the measured value θnk(i) and the predicted value Θnk(i). (This process is not essential, and the operation can do without it.)
When 0<Aθnk,
θSnk=max(θ(0),Θ(0))
θEnk=min(θ(k−1),Θ(k−1))
Otherwise,
θSnk=min(θ(0),Θ(0))
θEnk=max(θ(k−1),Θ(k−1))
When the following conditions are satisfied, a rotational movement is regarded as being performed around the center at xc(n,k), yc(n,k) in the past k frames before the nth frame, and a rotational operation is initiated.
θSnk=min(θ(0),Θ(0))
θEnk=max(θ(k−1),Θ(k−1))
Angle_Threshold<abs(θEnk−θSnk)
CD_θ_Threshold<CDθnk
CD_d_Threshold<CDdnk
Angle_Threshold is a threshold value of the rotational angle. A rotational operation is regarded as being performed when the angle of the rotation made in the past k frames is equal to or greater than this threshold value. The threshold value is set to approximately between 2π and 4π.
CD_θ_Threshold is a threshold value that is employed to confirm that the angle and the time form a linear regression line. Because the angle and the time need to establish a strong correlation, the threshold is set close to 1 (for example, 0.9).
CD_d_Threshold is a threshold value that is employed to confirm that the distance from the center and the time form a linear regression line. Because the distance does not have to have as strong a correction with the time as the angle does, CD_d_Threshold is set to a smaller value than CD_θ_Threshold.
Every time a detection result of a certain frame is received, the aforementioned process is executed on different ranges of previous frames so that rotational operations at different speeds can be detected.
In addition, when a rotational operation is initiated, the center coordinates xc(n,k), yc(n,k) and the average distance from the center coordinates, dm(n,k), are stored in the storage section 12:
(2) Detection of Rotational Movement
After the rotational operation by hand is initiated by the detection of a rotational movement, the center of the rotation and the current hand position are displayed on the screen. A change in rotation amount is calculated in correspondence with the angle and distance between the center of the rotation and the current hand position, and an operation such as scrolling is thereby performed.
The center of the rotation during the rotational operation is recorded as xc, yc, and the average distance between the center of the rotation and the hand position is recorded as dm. At the beginning of the rotation, xc, yc is xc(n,k), yc(n,k), and dm is dm(n,k).
Further, a symbol s is recorded as the direction of the rotation. At the beginning of the rotation, s has the same sign as Aθnk.
It is assumed that the angle and the distance between the center of the rotation and the hand position in the nth frame are θc(n) and dc(n), respectively.
First, the change in angle from the previous frame, dθ(n), is calculated as follows:
dθ(n)=θc(n)−θc(n−1)
When the sign of dθ(n) is the same as that of s, the value θM that denotes the degree of the angle change from the previous frame is obtained in accordance with the following steps:
1) Let θM=dθ(n).
2) When dc(n)<dm, θM is multiplied by dc(n)/dm.
If the hand passes near the center of the rotation, the angle change increases even with a small hand movement. In order to prevent the operation from being made erroneously to a large degree, the operation amount is reduced (optional procedure: YES in Step S203, Step S204: see
When the sign of dθ(n) differs from that of s (YES in Step S205 in
3) When dθ_Threshold<abs(dθ(n)) (YES in Step S209 in
When dθ is close to π, it is difficult to determine in which direction the rotation is made. Thus, the operation amount is reduced (optional procedure).
The value θM obtained as described above is adopted as an operation amount (Step S211 in
For example, when θM is equal to or smaller than −π, scrolling is performed one screen upward. When θM is equal to or greater than π, scrolling is performed one screen downward.
After the initiation of the rotational operation, the rotation start detecting process is still carried out in the same manner as at the beginning of the rotation (1). When a rotation is detected, xc, yc and dm are updated so as to bring close to the center and average distance of the newly detected rotation (Step S212 in
(3) Detection of Rotation End Movement
During a normal rotational operation (see
The difference dx(n), dy(n) in the hand position between the current frame and the previous frame is calculated as follows:
dx(n)=x(n)−x(n−1)
dy(n)=y(n)−y(n−1)
The travel amount in the rotational operation in the circumferential direction is denoted as c(n), and the travel amount in the direction orthogonal to the circumference is denoted as r(n),
c(n)=√{square root over (dx(n)2+dy(n)2)}{square root over (dx(n)2+dy(n)2)} sin(θ(n))
r(n)=√{square root over (dx(n)2+dy(n)2)}{square root over (dx(n)2+dy(n)2)} cos(θ(n))
The travel amount o(n) in the outer circumferenceouter circumference direction is calculated as follows:
o(n)=r(n)−abs(c(n))
The cumulative amount of o(n) is denoted as oh(n).
o
h(n)=oh(n−1)+o(n)
o
h(n)=0 (if oh(n−1)+o(n)<0)
When either of the following conditions is satisfied, the rotation operation is terminated (Step S213; see
The first condition indicates that the travel amount in the outer circumference direction is too large. The second condition indicates that the rotary movement is too far away from the center.
d
m×Outward_Movement_Rate<oh(n)
d
m×Outward_Distance_Rate<dc(n)
The start of the rotation is judged only from a hand movement. In the normal mode, a cursor operation or other operations are conducted, but only when a hand rotational movement occurs, a rotational operation is conducted.
The center and the current hand position are displayed on the screen during the rotation, and an operation is performed in accordance with an angle between the center and the current hand position. This makes it easy for the user to adjust the rotation amount and reverse the direction of the rotation.
Because the end of the rotation is judged from the hand movement only, a normal cursor operation can be performed when the hand rotation is stopped in the middle of the rotational operation.
The purpose of the present invention is to offer an information processing device that can accurately detect circular movements and switch the operations in a seamless manner, as well as a program and method for such a device.
According to the present embodiment, a circular motion can be accurately detected with a simple structure, and the cursor can be moved in accordance with the detected circular motion. Furthermore, the circular motion can be easily terminated.
It should be noted that the present invention is not limited to the embodiment as discussed above. The present invention, when put into practice, may be realized by modifying the structural components without departing from the scope of the invention.
Furthermore, various inventions may be formed by suitably combining the structural components discussed above. For instance, some of the structural components of the embodiment may be omitted, or the structural components of different embodiments may be suitably combined.
While certain embodiments of the inventions have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel methods and systems described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the methods and systems described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2007-224891 | Aug 2007 | JP | national |