The present invention relates to a recognition method and a recognition system, and more particularly to a gesture recognition method, a gesture recognition system, and a performing device thereof.
Recognition systems generally receive sensing signals from a sensor to recognize the motion of the user. For example, the recognition system receives sensing signals from the sensor, processes the sensing signals using the recognition system, and utilizes the recognition system to implement a recognition method to determine whether a user being observed by the sensor is using portions of his or her body to make particular actions or form particular shapes or gestures. The recognition system classifies the motion of the user, and associates the motion of the user with executable commands or instructions.
However, the motion of the user may be a simple motion or a complex motion. When the motion of the user is the simple motion, the recognition system may quickly recognize the motion of the user. When the motion of the user is the complex motion, the recognition system may need more time to recognize the motion of the user. The recognition system cannot determine whether the motion of the user is the simple motion or the complex motion before the recognition system classifies the motion of the user.
Namely, the recognition system needs a predetermined time period to recognize the motion of the user. The predetermined time period needs to be associated with the complex motion to ensure the motion of the user can be clearly classified.
When the recognition system needs to classify a plurality of motions, the recognition system may need a total time period that is many times of the predetermined time period, which may waste time and negatively influence a performance of the recognition system.
Therefore, the recognition system needs to be further improved.
An objective of the present invention is to provide a gesture recognition method, a gesture recognition system, and a performing device of the gesture recognition system. The present invention may classify a motion of a user in a dynamical time period that is associated with the motion of the user. Therefore, the performance of the gesture recognition system can be improved.
The gesture recognition method includes a performing procedure;
wherein the performing procedure includes steps of:
Further, the gesture recognition system includes a performing device and a training device. The performing device includes a sensing unit, a memory unit, and a processing unit.
The sensing unit senses a sensing signal and a training signal. The memory unit stores a neural network.
The processing unit is electrically connected to the sensing unit and the memory unit. The processing unit executes a performing procedure.
The performing procedure includes steps of:
The training device is electrically connected to the performing device, and executes a training procedure. The training procedure includes steps of:
When the gesture recognition system classifies the gesture event to determine the motion of the user, the gesture recognition system can classify the gesture event. Therefore, the gesture recognition system does not need a predetermined time period to recognize the motion of the user. The time period for recognizing the motion of the user can be dynamical, and a total time period for classifying a plurality of motions can be decreased. Namely, the performance of the gesture recognition system can be improved.
With reference to
The performing procedure includes steps of:
Since the gesture event can be classified according to the soft label, the gesture recognition method does not need a predetermined time period to recognize the motion of the user. The time period for recognizing the motion of the user can be dynamical, and a total time period for classifying a plurality of motions can be decreased. Therefore, the performance of the gesture recognition method can be improved.
Moreover, the gesture recognition method further includes a training procedure, and the training procedure includes steps of:
In the embodiment, the soft labels of the training frames are encoded into the one-hot vectors to train the neural network.
Moreover, the function is a monotonic function, a first training frame of the training signal is mapped to the soft label of zero through the function, and a last training frame of the training signal is mapped to the soft label of one through the function.
With reference to
The sensing unit 101 senses the sensing signal and the training signal. The memory unit 102 stores the neural network. The processing unit 103 is electrically connected to the sensing unit 101 and the memory unit 102. The processing unit 103 executes the performing procedure as described above.
The training device 20 is electrically connected to the performing device 10, and executes the training procedure as described above. In the training procedure, the training signal is received from the sensing unit 101 of the performing device 10.
With reference to
For example, the training signal includes n frames, the soft label of the first training frame 11 is 0, the soft label of the second training frame 12 is 0.1, the soft label of the third training frame is 0.2, and the last training frame, such as the nth training frame 1n, is 1.
In another embodiment, the performing procedure further includes steps of:
determining whether the soft label of the selected sensing frame exceeds a confidence threshold before classifying the gesture event (S1031);
when the soft label of the selected sensing frame exceeds the confidence threshold, the soft label of the selected sensing frame is approved, and the gesture event is classified;
when the soft label of the selected sensing frame does not exceed the confidence threshold, selecting another one of the sensing frames (S1032), determining a soft label of the selected another one of the sensing frames according to the neural network (S1033), and determining again whether the soft label of said selected another sensing frame exceeds the confidence threshold (S1031).
The gesture recognition method classifies the gesture event when the soft label of the selected sensing frame exceeds the confidence threshold. Therefore, the gesture recognition method does not need to classify each time when the soft label is calculated, and computation for classifying the gesture event can be reduced. Moreover, the performance of the gesture recognition method can be further improved.
Further, the training procedure can be executed two times for receiving two said training signals. A percentage of a sequence of the training frames of one of the training signals equals to a percentage of a sequence of the training frames of another one of the training signals having the same soft label when the soft label equals to the confidence threshold.
For example, the confidence threshold is 0.8. When the amount of the training frames a first training signal is 20, the 80% of the sequence of the training frames of the first training signal is the 16th training frame of the first training signal, and the soft label of the 16th training frame of the first training signal is 0.8.
When the amount of the training frames of a second training signal is 10, the 80% of the sequence of the training frames of the second training signal is the 8th training frame of the second training signal, and the soft label of the 8th training frame of the first training signal is also 0.8.
Even though numerous characteristics and advantages of the present invention have been set forth in the foregoing description, together with details of the structure and function of the invention, the disclosure is illustrative only. Changes may be made in detail, especially in matters of shape, size, and arrangement of parts within the principles of the invention to the full extent indicated by the broad general meaning of the terms in which the appended claims are expressed.
Number | Date | Country | |
---|---|---|---|
62626147 | Feb 2018 | US |