Gesture control device and method

Information

  • Patent Grant
  • 10289901
  • Patent Number
    10,289,901
  • Date Filed
    Thursday, May 25, 2017
    7 years ago
  • Date Issued
    Tuesday, May 14, 2019
    5 years ago
Abstract
A device for recognizing control gestures and determining which one device out of a plurality is the target of control acquires images of a gesture from each electronic device. A three dimensional coordinate system for each image is established, and coordinate of a central point of each electronic device determined. Extent of gesture to the left and to the right at different depths is determined and a regression plane equation is calculated. A distance between the regression plane and center points of each electronic device is determined and the electronic device with the closest (the shortest distance) center point is determined as the target device of the control gesture. A gesture control method is also provided.
Description
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to Taiwan Patent Application No. 105116709 filed on May 27, 2016.


FIELD

The subject matter herein generally relates to interface devices, and particularly to a gesture control device and method capable of determining an object to be controlled by gesture, among a plurality of electronic devices.


BACKGROUND

Electronic devices can be controlled by gestures. A gesture command usually controls one electronic device. However, a number of electronic devices may be close together, and it is difficult to determine which one of the electronic devices should be controlled by the gesture.





BRIEF DESCRIPTION OF THE DRAWINGS

Implementations of the present technology will now be described, by way of example only, with reference to the attached figures.



FIG. 1 is a block diagram illustrating an exemplary embodiment of an operating environment of a device with system for control by gesture.



FIG. 2 is a block diagram illustrating an exemplary embodiment of a gesture control system running in the device of FIG. 1.



FIG. 3 is a schematic diagram illustrating an exemplary embodiment of a working process of the device of FIG. 1.



FIG. 4 is a flowchart illustrating an exemplary embodiment of a gesture control method.





DETAILED DESCRIPTION

It will be appreciated that for simplicity and clarity of illustration, where appropriate, reference numerals have been repeated among the different figures to indicate corresponding or analogous elements. In addition, numerous specific details are set forth in order to provide a thorough understanding of the embodiments described herein. However, it will be understood by those of ordinary skill in the art that the embodiments described herein can be practiced without these specific details. In other instances, methods, procedures, and components have not been described in detail so as not to obscure the related relevant feature being described. The drawings are not necessarily to scale and the proportions of certain parts may be exaggerated to better illustrate details and features. The description is not to be considered as limiting the scope of the embodiments described herein.


The term “comprising” means “including, but not necessarily limited to”, it specifically indicates open-ended inclusion or membership in a so-described combination, group, series, and the like.



FIG. 1 illustrate an exemplary embodiment of an operating environment of a gesture control device 100. The gesture control device 100 can communicate with a number of electronic devices. The gesture control device 100 can determine which one of the electronic devices should be controlled by a gesture. In the exemplary embodiment, the electronic devices can be, but are not limited to, televisions, air conditioners, fridges, multimedia players, monitors, computers, and the like. The gesture control device 100 can communicate with the electronic devices wirelessly, for example by using WIFI, Global System for Mobile Communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMAW-CDMA), CDMA2000, IMT Single Carrier, Enhanced Data Rates for GSM Evolution (EDGE), Long-Term Evolution, (LTE), Time-Division LTE, (TD-LTE), High Performance Radio Local Area Network, (HiperLAN), High Performance Radio Wide Area Network, (HiperWAN), Local Multipoint Distribution Service, (LMDS), Worldwide Interoperability for Microwave Access, (WiMAX), ZIGBEE, BLUETOOTH, Flash Orthogonal Frequency-Division Multiplexing, (Flash-OFDM), High Capacity Spatial Division Multiple Access, (HC-SDMA), Universal Mobile Telecommunications System, (UMTS), UMTS Time-Division Duplexing, (UMTS-TDD), Evolved High Speed Packet Access, (HSPA+), Time Division Synchronous Code Division Multiple Access, (TD-SCDMA), Evolution-Data Optimized, (EV-DO), Digital Enhanced Cordless Telecommunications, (DECT), or the like. The gesture control device 100 also can communicate with electronic devices by wires.


In the exemplary embodiment, a first electronic device 200 and a second electronic device 300 are taken as examples of communicating with the gesture control device 100. In the exemplary embodiment, the first electronic device 200 includes a first image capturing device 20, the second electronic device 300 includes a second image capturing device 30. The first image capturing device 20 is a depth camera configured to capture images of gestures in a first effective range R1, the second image capturing device 30 is a depth camera configured to capture gesture images in a second effective range R2. The first electronic device 200 and the second electronic device 300 can execute functions according to gestures captured by the image capturing devices 20 and/or 30.


In the exemplary embodiment, the gesture control device 100 can be, but is not limited to, a server, a communication device such as a Set Top Box, or an integrated chip or programming modules embedded in the first electronic device 200 or in the second electronic device 300.


Referring to FIG. 2, in the exemplary embodiment, the gesture control device 100 can include, but is not limited to, a storage device 11 and a processor 12. The storage device 11 can be, but is not limited to, an internal storage system, such as a flash memory, a random access memory (RAM) for temporary storage of information, and/or a read-only memory (ROM) for permanent storage of information. The storage device 11 can also be a storage system, such as a hard disk, a storage card, or a data storage medium. The processor 12 can be, but is not limited to, a central processing unit, a digital signal processor, or a single chip, for example.


In the exemplary embodiment, the storage device 11 can store a gesture control system 10. The gesture control system 10 can include a number of modules, which are collections of software instructions stored in the storage device 11 and executed by the processor 12. In the exemplary embodiment, the gesture control system 10 can include an acquiring module 101, an establishing module 102, a calculating module 103, and a determining module 104.


The acquiring module 101 acquires an image of a gesture from each of the first image capturing device 20 and/or the second image capturing device 30 and acquires an orientation of a gesture in the gesture image. In the exemplary embodiment, the gesture image can include a depth information as to objects in the images. In the exemplary embodiment, the gesture image can include a number of pictures, thus the orientation and motion in the gesture can be acquired according to a position of the gesture in different pictures. The orientation of the gesture can indicate that the gesture of a user has a directivity. The orientation is determined as ended if the gesture is stopped for a preset time interval. In the exemplary embodiment, the first image capturing device 20 and the second image capturing device 30 capture the gesture image when a gesture is detected in the effective area of the first image capturing device 20 and the second image capturing device 30.


The establishing module 102 establishes a three dimensional coordinate system for the gesture image, and determines a coordinate of a central point of each of the electronic devices (the first electronic device 200 and the second electronic device 300). In the exemplary embodiment, plane coordinates of the gesture image are determined as an X axis and a Y axis of the coordinate system, and the depth direction of the gesture image is determined as a Z axis of the coordinate system. In the exemplary embodiment, the coordinate of the center points of the electronic devices are predetermined according to a position of the image capturing device. For example, if the first electronic device 200 is a computer, and the image capturing device 20 is located in the middle of an upper edge of a screen of the first electronic device 200, the coordinate of the center point of the first electronic device 200 will be predetermined as the coordinate of the center point of the screen of the first electronic device 200.


The acquiring module 101 further determines a coordinate of a most extreme left-side horizontal position (left end) of the gesture in different depths and a coordinate of a most extreme right-side horizontal position (right end) of the gesture in different depths when the gesture is ended. For example, as shown in FIG. 3, a coordinate of a left end A1 of the gesture in a depth Z1 is (x′1,y′1,z1), a coordinate of a right end B1 of the gesture in the depth Z1 is (x″1,y″1,z1), a coordinate of the left end A2 of the gesture in a depth Z2 is (x′2,y′2,z2), a coordinate of the right end B2 of the gesture depth Z2 is (x″2,y″2,z2). Thus a coordinate of the left end An of the gesture in a depth Zn is (x′n,y′n,zn), and a coordinate of the right end Bn of the gesture depth Zn is (x″n,y″n,zn).


The calculating module 103 calculates a coordinate of a center point between the left end and the right end of the gesture in each different depth. For example, a coordinate of a center point C1 in the depth Z1 is







(




x
1


+

x
1



2

,



y
1


+

y
1



2

,

z
1


)

,





a coordinate of a center point C2 in the depth Z2 is







(




x
2


+

x
2



2

,



y
2


+

y
2



2

,

z
2


)

,





and a coordinate of a center point Cn in the depth Zn is







(




x
n


+

x
n



2

,



y
n


+

y
n



2

,

z
n


)

.




The calculating module 103 calculates a regression plane equation according to the coordinate of the center points of the gesture image in different depths.


In the exemplary embodiment, the calculating module 103 calculates the regression plane equation by using a regression analysis method. By standardizing the coordinate of the center points, the calculating module 103 obtains the following formulas:








x
k


=



x
k

-

x
_



S
x



,


y
k


=



y
k

-

y
_



S
y



,


z
k


=



z
k

-

z
_



S
z



,
wherein
,






x
_

=



x
1

+

x
2

+

+

x
n


n


,


S
x

=






i
=
1

n




(


x
i

-

x
_


)

2



n
-
1




,


y
_

=



y
1

+

y
2

+

+

y
n


n


,






S
y

=






i
=
1

n




(


y
i

-

y
_


)

2



n
-
1




,


z
_

=



z
1

+

z
2

+

+

z
n


n


,


S
z

=







i
=
1

n




(


z
i

-

z
_


)

2



n
-
1



.







Setting a standardized plane equation as z′=ax′+by′+c, a residual error is ei=z′i−{circumflex over (z)}′i, and is the result. The calculating module 103 further calculates a value of “a”, “b”, and “c” in above formula when a value of









i
=
1

n



e
i
2






is minimal.













i
=
1

n



e
i
2


=







i
=
1

n




(


ax
i


+

by
i


+
c
-

z
i



)

2


=




i
=
1

n




[


(


ax
i


+

by
i


-

z
i



)

+
c

]

2









=







i
=
1

n




(


ax
i


+

by
i


-

z
i



)

2


+

2

c





i
=
1

n



(


ax
i


+

by
i


-

z
i



)



+




i
=
1

n



c
2









=







i
=
1

n




(


ax
i


+

by
i


-

z
i



)

2


+

2

ac





i
=
1

n



x
i




+

2

bc





i
=
1

n



y
i




-

2

c





i
=
1

n



z
i




+

nc
2









=







i
=
1

n




(


ax
i


+

by
i


-

z
i



)

2


+

nc
2



,









here
,



x


_

=


0





i
=
1

n



x
i




=
0


,



y


_

=


0





i
=
1

n



y
i




=
0


,



z


_

=


0





i
=
1

n



z
i




=
0.







If C=0, above formula becomes:










i
=
1

n




(


ax
i


+

by
i


-

z
i



)

2


=



(




i
=
1

n



x
i
′2


)



a
2


+


(

2





i
=
1

n




x
i




y
i





)


ab

+


(




i
=
1

n



y
i
′2


)



b
2


-

2


(




i
=
1

n




x
i




z
i




)


a

-

2


(




i
=
1

n




y
i




z
i




)


b

+




i
=
1

n




z
i
′2

.








By







f


(

a
,
b

)


=



(




i
=
1

n



x
i
′2


)



a
2


+


(

2





i
=
1

n




x
i




y
i





)


ab

+


(




i
=
1

n



y
i
′2


)



b
2


-

2


(




i
=
1

n




x
i




z
i




)


a

-

2


(




i
=
1

n




y
i




z
i




)


b







referring to a function







+




i
=
1

n



z
i
′2



,





that is: ƒ(a,b)=Aa2+2Bab+Cb2+Da+Eb+F.


By using a Cauchy inequality algorithm:









(




i
=
1

n



x
i
′2


)



(




i
=
1

n



y
i
′2


)





(




i
=
1

n




x
i




y
i




)

2






B
2

-
AC


0.





Generally speaking, B2−AC≠0.


According to a formula: if ƒ(x,y)=ax2+2bxy+cy2+dx+ey+ƒ, here a>0, and b2−ac≤0, ƒ(x,y) is minimal when (x,y)=(h,k). In this condition,







h
=








-
d




2

b






-
e




2

c












2

a




2

b






2

b




2

c







=



-
dc

+
de


2


(

ac
-

b
2


)





,

k
=








2

a




-
d






2

b




-
e












2

a




2

b






2

b




2

c







=




-
ae

+
bd


2


(

ac
-

b
2


)



.







Calculating the function ƒ(a,b)=Aa2+2Bab+Cb2+Da+Eb+F,







a
=








-
D




2

B






-
E




2

C












2

A




2

B






2

B




2

C







=



-
DC

+
DE


2


(

AC
-

B
2


)





,

b
=








2

A




-
D






2

B




-
E












2

A




2

B






2

B




2

C







=




-
AE

+
BD


2


(

AC
-

B
2


)



.




Here



,





A
=




i
=
1

n



x
i
′2



,

B
=




i
=
1

n




x
i




y
i





,

C
=




i
=
1

n



y
i
′2



,





D
=




i
=
1

n




x
i




z
i





,

E
=




i
=
1

n




y
i




z
i





,

F
=




i
=
1

n




z
i
′2

.







The parameters “a” and “b” in above formula further can be converted to







a
=




-
DC

+
DE


2


(

AC
-

B
2


)



=





-

(




i
=
1

n




x
i




z
i




)




(




i
=
1

n



y
i
′2


)


+


(




i
=
1

n




x
i




z
i




)



(




i
=
1

n




y
i




z
i




)




2


[



(




i
=
1

n



x
i
′2


)



(




i
=
1

n



y
i
′2


)


-


(




i
=
1

n




x
i




y
i




)

2


]



=




-

r
xz




r
yy


+


r
xz



r
yz




2


(



r
xx



r
yy


-

r
xy
2


)






,





b
=




-
AE

+
BD


2


(

AC
-

B
2


)



=





-

(




i
=
1

n



x
i
′2


)




(




i
=
1

n




y
i




z
i




)


+


(




i
=
1

n




x
i




y
i




)



(




i
=
1

n




x
i




z
i




)




2


[



(




i
=
1

n



x
i
′2


)



(




i
=
1

n



y
i
′2


)


-


(




i
=
1

n




x
i




y
i




)

2


]



=





-

r
xx




r
yz


+


r
xy



r
xz




2


(



r
xx



r
yy


-

r
xy
2


)



.








The calculated regression plane equation is:








z
-

z
_



S
z


=


a


(


x
-

x
_



S
x


)


+


b


(


y
-

y
_



S
y


)


.






The calculating module 103 further calculates a distance between the regression plane and the center points of each of the electronic devices (the first electronic device 200 and the second electronic device 300). For example, if the coordinate of the center point of the first electronic device 200 is (x0,y0,z0), and if the regression plane equation is expanded to form px+qy+rz+s=0, the calculating module 103 calculates the distance between the center point of the first electronic device 100 and the regression plane according to the formula:









px
0

+

qy
0

+

rz
0

+
s




p
2

+

q
2

+

r
2




.




The determining module 104 determines which one of the electronic devices (for example the first electronic device 200 or the second electronic device 300) is intended as the target of the gesture by determining which one of the distances between the regression plane and the center points of the electronic devices is less than a preset value. If the determining module 104 determines that a distance between the regression plane and the center point of one electronic device is less than the preset value, the determining module determines that such electronic device is the object to be controlled by the gesture. If the determining module 104 determines that a distance between the regression plane and the center point of the electronic device is equal to or greater than the preset value, the determining module determines that such electronic device is not the intended target object to be controlled.


A method for determining a target electronic device controlled by a gesture is illustrated in FIG. 4. The method is provided by way of example, as there are a variety of ways to carry out the method. Each block shown in FIG. 4 represent one or more processes, methods, or subroutines carried out in the example method. Additionally, the illustrated order of blocks is by example only and the order of the blocks can be changed. The example method can begin at block S40.


At block S40, an acquiring module of a gesture control device acquires an image of a gesture from an image capturing device of each electronic device communicated with the gesture control device, and acquires an orientation of a gesture in the image of the gesture. In the exemplary embodiment, the gesture image can include a depth information. The gesture image can include a number of pictures, the orientation and motion in the gesture is acquired according to a position of the gesture in different pictures. The orientation of the gesture can indicate that the gesture of a user has a directivity. The orientation is determined as ended if the gesture is stopped for a preset time interval.


At block S41, a establishing module establishes a three dimensional coordinate system for the gesture image, and determines a coordinate of a central point of each of the electronic devices communicated with the gesture control device. In the exemplary embodiment, the coordinate of the center point of the electronic devices are predetermined according to a position of the image capturing device.


At block S42, the acquiring module determines a coordinate of a most extreme left-side horizontal position (left end) of the gesture in different depths and a coordinate of a most extreme right-side horizontal position (right end) of the gesture in different depths when the gesture is ended. For example, as shown in FIG. 3, a coordinate of a left end A1 of the gesture in a depth Z1 is (x′1,y′1,z1), a coordinate of a right end B1 of the gesture in the depth Z1 is (x″1,y″1,z1), a coordinate of the left end A2 of the gesture in a depth Z2 is (x′2,y′2,z2), a coordinate of the right end B2 of the gesture depth Z2 is (x″2,y″2,z2). Thus, a coordinate of the left end An of the gesture in a depth Zn is (x′n,y′n,zn), and a coordinate of the right end Bn of the gesture depth Zn is (x″n,y″n,zn).


At block S43, the calculating module calculates a coordinate of a center point between the left end and the right end of the gesture in each different depth.


At block S44, the calculating module calculates a regression plane equation according to the coordinate of the center points of the gesture image in different depths. The method for calculating the regression plane equation is described previously.


At block S45, the calculating module calculates a distance between the regression plane and the center points of each of the electronic devices. The method for calculating the distance is described previously.


At block S46, the determining module determines whether the distance between the regression plane and the center point of one of the electronic device is less than a preset value. If the determining module determines that the distance between the regression plane and the center point of one of the electronic device is less than the preset value, the procedure goes to block S47. Otherwise, the procedure is ended.


At block S47, the determining module determines that the electronic device is a target controlled by the gesture if the distance between the regression plane and the center point of this electronic device is less than the preset value.


It is believed that the present embodiments and their advantages will be understood from the foregoing description, and it will be apparent that various changes may be made thereto without departing from the spirit and scope of the disclosure or sacrificing all of its material advantages, the examples hereinbefore described merely being exemplary embodiments of the present disclosure.

Claims
  • 1. A gesture control device to communicate with at least two electronic devices, each of the electronic devices comprising an image capturing device, the gesture control device comprising: at least one processor; andat least one storage device storing one or more programs, when executed by the at least one processor, the one or more programs cause the at least one processor to: acquire an image of a gesture from the image capturing device of each electronic device communicated with the gesture control device and acquire an orientation of a gesture in the image of the gesture;establish a three dimensional coordinate system for the gesture image, and determine a coordinate of a central point of each of the electronic devices communicated with the gesture control device;determine a coordinate of a left end of the gesture in different depths and a coordinate of a right end of the gesture in different depths when the gesture is ended;calculate a coordinate of a center point between the left end and the right end of the gesture in each different depths; calculate a regression plane equation according to the coordinate of the center points of the gesture image in different depths;calculate a distance between the regression plane and the central points of each of the electronic devices; anddetermine which one of the electronic devices is intended as a target controlled by the gesture by determining which one of the distances between the regression plane and the central point of the electronic device is less than a preset value.
  • 2. The gesture control device of claim 1, wherein the gesture image comprises a plurality of pictures, the orientation of the gesture is acquired according to a position of the gesture in different pictures.
  • 3. The gesture control device of claim 1, wherein the orientation is determined as ended if the gesture is stopped for a preset time interval.
  • 4. The gesture control device of claim 1, wherein a formula for calculating the distance between the regression plane and the center point of the electronic device is:
  • 5. A gesture control method applied in a gesture control device, the gesture control device configured to communicate with at least two electronic devices, each of the electronic devices comprising an image capturing device, the gesture control method comprising: acquiring a gesture image from the image capturing device of each electronic device communicated with the gesture control device and acquiring an orientation of a gesture in the gesture image;establishing a three dimensional coordinate system for the gesture image, and determining a coordinate of a central point of each of the electronic devices communicated with the gesture control device;determining a coordinate of a left end of the gesture in different depths and a coordinate of a right end of the gesture in different depths when the gesture is ended;calculating a regression plane equation according to the coordinate of the center points of the gesture image in different depths;calculating a distance between the regression plane and the central points of each of the electronic devices; anddetermining which one of the electronic devices is intended as a target controlled by the gesture by determining which one of the distances between the regression plane and the central point of the electronic device is less than a preset value.
  • 6. The gesture control method of claim 5, wherein the gesture image comprises a plurality of pictures, the orientation of the gesture is acquired according to a position of the gesture in different pictures.
  • 7. The gesture control method of claim 5, wherein the orientation is determined as ended if the gesture is stopped for a preset time interval.
  • 8. The gesture control method of claim 5, wherein a formula for calculating the distance between the regression plane and the center point of the electronic device is:
Priority Claims (1)
Number Date Country Kind
105116709 A May 2016 TW national
US Referenced Citations (8)
Number Name Date Kind
9477302 Kauffmann Oct 2016 B2
9784554 Gray Oct 2017 B2
20070259716 Mattice Nov 2007 A1
20120113241 Sairam et al. May 2012 A1
20150199122 Garmark Jul 2015 A1
20160224036 Baker Aug 2016 A1
20160321838 Barone Nov 2016 A1
20170277267 Liu Sep 2017 A1
Non-Patent Literature Citations (2)
Entry
Caon, Maurizio, Yong Yue, Julien Tscherrig, Elena Mugellini, and O. Abou Khaled. “Context-aware 3d gesture interaction based on multiple kinects.” In Proceedings of the first international conference on ambient computing, applications, services and technologies , AMBIENT, pp. 7-12. 2011. (Year: 2011).
Zhen-Zhang Li, Yuan-Xiang Zhang, Zhi-Heng Li; A Fingertip Detection and Interaction System Based on Stereo Vision; http://cvl.ice.cycu.edu.tw/publications/Li2011.pdf; 2011; pp. 2-9, Form 1; Taiwan.
Related Publications (1)
Number Date Country
20170344813 A1 Nov 2017 US