The present invention relates to video games. More particularly, the present invention relates to systems for and methods of detecting and reproducing motions for video games.
In the United States, video gaming has become a staple in many households. Reports have shown that children spend an average of as much as 44.5 hours per week playing video games. With rising concerns regarding the correlation between child obesity and the time children spend playing video games (because of the sedentary lifestyle it promotes), efforts are being made to find ways to make children more active. One approach is to find physical activities to replace video gaming, therefore attempting to limit the amount of time a child remains sedentary as a result of playing video games all day long. An alternative approach is to accept the fact that children will not give up video gaming so easily and to therefore alter the interface of video games to ensure that they have more active gaming experiences.
The Nintendo® Wii® is one of the first successful home gaming consoles to appeal to a wide range of audiences: toddlers, children, teens, young adults, adults, parents, grandparents, etc. It can be argued that the success of the Nintendo® Wii® comes from the intuitive user interface it provides and the physical activity it encourages. Traditional gaming consoles use controller devices which elude the average person, sometimes requiring complex sequences of button presses that are not only difficult to remember but also difficult to execute. Nintendo's Wiimote™ overcomes the shortcomings of such control devices by providing users a way to control video game objects and characters with intuitive motion gestures. The ability to control game objects and characters with intuitive motion gestures give users the confidence to play the game, making it a more enjoyable and satisfying experience.
The popularity of the Nintendo® Wii® gaming system and of titles such as Nintendo's WiiFIT® is evidence that there is a rising trend of gamers seeking more active gaming experiences. At its core, users interface with the gaming system through the manipulation of handheld wireless devices equipped with an accelerometer and an infrared camera. The sensed physical motion is then processed and mapped into video game controls that manipulate one or more objects or characters within the game. The handheld devices also have expansion ports to which expansion devices (e.g., an extra accelerometer, a gyroscope, etc.) can be connected to further enhance the user interface. However, these devices, because of their handheld nature, still, on the average, limit the amount of physical activity users experience while playing games, typically limiting body movement to only the upper body.
To create an even more immersive and active gaming experience, motion capture devices for gaming (including Nintendo's Wiimote™) can be used in novel ways to facilitate users in participating in full-body activities. For example, one of WiiFIT's mini-games has the user place a Wiimote™ in her pocket and jog in place. As the user jogs in place, the mini-game maps the Wiimote's movements into velocity, and the character in the game will jog at a corresponding speed. Games like this, where sensors are used to detect full-body motion, will help promote more active gaming experiences.
However, although significant work has and is being done to incorporate motion into computing activities such as video games, prior art motion capture systems such as the Wiimote™, the Wii Balance Board® and the Dance Dance Revolution® pad to name a few, are able to detect rough physical motions moving in three different directions (x, y, and z). They utilize a simple algorithm that is not adapted well to properly detect movement. Algorithmically, no work has been done to significantly clean up the signal processing.
Another shortcoming of these motion capture systems is their lack of cheating prevention. For example, Nintendo's Wii Sports™ has a tennis mini-game. The idea is to use the Wiimote™ as the user would with a tennis racquet, immersing the user in a virtual tennis experience. However, by simply flicking one's wrist while holding the Wiimote™, the user can still successfully play the tennis game. There is no requirement or focus on getting the users to swing their virtual racquets properly. The main drawback of this type of “cheating” is that users can find ways around being as active as the game was originally trying to promote.
Another shortcoming of the prior art motion capture system is that they are simply indoor devices. For example, the WiiFIT® allows users to, among other things, perform exercise routines, track their weight and body mass index. The heart of the WiiFIT® is the Wii Balance Board®. During use, a user typically positions the Wii Balance Board® in front of a television set, stands on the Wii Balance Board®, and performs exercise routines on it. The Wii Balance Board® senses weight shifts and the Wii® console determines whether the user is in a proper alignment, while results and instructions are displayed on the television set. The WiiFIT® behaves like a personal trainer, tracking the user's progress and providing feedback. However, the Wii Balance Board® must be used in conjunction with a Wii® console and a display, such as a television set. As such, the WiiFIT® remains an indoor device, preventing the user to enjoy these activities outdoors.
The present invention addresses at least these limitations in the prior art.
Embodiments of the present invention serve as an instrument to gather and process data from one or more capture devices. The data can thereafter be processed using one or more classification techniques to properly detect and/or reproduce motions for an application, such as a game. The present invention can be used both outdoors and indoors.
In one aspect, a computer-readable medium stores instructions that, when executed by a computing device, cause the computing device to perform a method. The method includes obtaining one or more signals from at least one motion capture device. The one or more signals are obtained wirelessly from the at least one motion capture device. Alternatively, the one or more signals are obtained via a wired connection from the at least one motion capture device. The motion capture device is typically coupled a body part, such as a foot, a leg, an arm or a hand. In some embodiments, the one or more signals obtained from the at least one motion capture device are filtered.
In some embodiment, a probabilistic network, such as a Bayesian network, is used to classify a movement. First, the one or more signals are interpolated using previously collected data from a history record to thereby determine a movement class. In some embodiments, the one or more signals are interpolated by averaging a subset of values in a history record. In some embodiments, this determining step is performed by calculating a probability that the movement corresponds to that motion category based on the one or more signals and a history record for each motion category. Based on calculated probabilities, the type of the movement is determined and outputted. Alternatively, the signal information and probabilities can be given to a classifier such as a Support Network Machine for assistance. The movement class is then mapped to at least one input event recognized by a primary application. In some embodiments, the type of movement and corresponding information are stored in the history record for subsequent use.
In another aspect, a gaming kit includes at least one pad. Each pad is configured to be in contact with a foot and includes one or more sensors for capturing motion data and a transmitter for transmitting the data to a computing device, such as a mobile device. The gaming kit also includes a software application configured to be accessed by the computing device. The software application typically uses the data transmitted by the transmitter. In some embodiments, the software application is configured to retrieve information from an external source, such as the Internet and/or an external storage device coupled to the computing device.
In yet another aspect, a system maps physical motion data to an action within a primary application. The system includes a motion interpretation unit and an action mapping unit. In some embodiments, the motion interpretation unit and the action mapping unit are in communication with the primary application. The system also includes a motion sensing unit having one or more sensors.
In some embodiments, the motion interpretation unit includes a signal processor and at least one finite state machine. The signal processor is configured to encode motion data into at least one of one or more states and one or more state transitions, and to pass motion data to the action mapping unit to be directly mapped to one or more actions within the primary application. The at least one finite state machines is configured to interpret the motion data and to communicate with the action mapping unit a working knowledge of each of the at least one finite state machine. The motion interpretation unit is configured to periodically sample raw motion data from one or more motion capture devices.
In some embodiments, the action mapping unit includes an action dictionary configured to map at least one of one or more states and one or more state transitions to one or more input events recognized by the primary application. In some embodiments, the action mapping unit is at least partly integrated with the primary application.
Reference will now be made in detail to implementations of the present invention as illustrated in the accompanying drawings. The same reference indicators will be used throughout the drawings and the following detailed description to refer to the same or like parts.
In the following description, numerous details are set forth for purposes of explanation. However, one of ordinary skill in the art will realize that the invention may be practiced without the use of these specific details. Thus, the present invention is not intended to be limited to the embodiments shown but is to be accorded the widest scope consistent with the principles and features described herein.
Embodiments of the present invention serve as an instrument to gather and process data from one or more capture devices. The data can thereafter be processed using one or more classification techniques to properly detect and/or reproduce motions for an application, such as a game. The present invention can be used both outdoors and indoors.
A capture device of the present invention typically includes one or more sensors. The one or more sensors include accelerometers, gyroscopes, ECG, magnetometer, and/or the like. The sensors typically detect external conditions such as acceleration (linear and angular), velocity (linear and angular), pressure, EMG and other relevant data. The capture device also includes other components, such as a controller, a processor and a transmitter, which are coupled to the sensors, for gathering, processing and transmitting the data detected by the sensors. Data is typically transmitted to at least one networked computing device. In some embodiments, data is wirelessly transmitted to the computing device via a personal area network using technology such as Bluetooth, ZigBee or the like, or via a larger network. Alternatively, or in addition, the data is transmitted to the computing device via a wired connection.
In some embodiments, a capture device is in the form of a pad.
In some embodiments, a capture device is the form of a harness.
In some embodiments, a capture device is in the form of a wand.
A networked computing device communicatively coupled with one or more capture devices can be mobile or stationary.
In some embodiments, a package is sold to users which includes at least one capture device, such as the pad 100 illustrated in
To properly detect, interpret and/or reproduce a physical motion, the present invention uses a probabilistic algorithm, finite state machines or both to accurately, and even realistically, recognize motions. These classification techniques can be implemented in an application or a script (hereafter “secondary application”) that is completely or partly integrated with the primary application, or can be completely separate from the primary application. The secondary application containing one or more techniques is executed alongside the primary application. The secondary application can be executed on the same or different computing device as the one that primary application is executed on. Each of these techniques is discussed in detail below.
This technique of the present invention uses basic tools and information for processing signals and input the information to the logic of a primary application. This technique uses Bayesian probabilities networks and/or other statistical algorithms (e.g., Support Vector Machines) to accurately classify specific physical motions or other detectable activities of the body, such as punches, kicks, head-butts, etc. The detection of these activities can be done through the use of one or more sensors that detect characteristics of these activities such as acceleration, velocity, pressure and EMG.
For example, the algorithm for this technique uses motions that are mapped and implemented to properly control movement of a soccer player in a soccer game. Motions in the soccer game include, but are not limited to, a sideways passing movement, a forward kicking/shooting movement, an upward running movement. While the invention described hereafter in this section is relative to a soccer game, the invention can be applied to other types of games.
In some embodiments, calibration is done by first positioning the capture device 130 flat on, for example, a table. By knowing values of acceleration on the accelerometer 135 based upon reading the data when the capture device 130 is laid flat, an ideal configuration for the capture device 130 is known. In this position, a first axis (e.g., Y-axis) moves forward and back, a second axis (e.g., X-axis) moves side to side, and a third axis (e.g., Z axis) moves up and down. Since the raw values are known, the capture device 130 is instantly calibrated when the capture device 130 is switched on.
Based upon the initial reading of the capture device 130, as the user activates the accelerometer 135, a rotation matrix is applied to the values of acceleration. The rotation matrix is based upon the angle of inclination downward between the three point vector formed by the initial x, y, z accelerometer values and the currently read x, y, z values. For example, assume that the capture device 130 is strapped to the foot, as illustrated in
After obtaining the rotation, the rotation matrix is calculated. Any new reading of the accelerometer 135 is multiplied by the rotation matrix, causing a calibrated value, that accounts for the effects of gravity, without needing the user to actively participate in any calibration phase or do any special movements as this is done at the beginning of the algorithm when the capture device 130 is first turned on.
At a step 410, values are read from the accelerometer 135 to determine the direction of motion. This can be done via a wireless connection, a wired connection or by any other methods known to those of ordinary skill in the art. In some embodiments, this is done through a polling method using the Zigbee protocol. Preferably, the accelerometer 135 can be read at a frequency as high as 100 Hz or as low as 20 Hz through frequencies outside this range is contemplated. The reading of the accelerometer 135 is multiplied by the rotation matrix.
The value of the pressure sensor 140 can also be read. Based upon whether the value of the pressure sensor 140 should be read, it can help determine if a movement is occurring or not. The pressure sensor 140 is typically located at the bottom of the capture device 130. When the capture device 130 is worn around a foot, the pressure sensor 140 is located beneath the foot. In some embodiments, the pressure sensor 140 helps filter noise from accelerometer vibrations when steps are taken to determine if kicks and passes are occurring or if the user is simply running. If the pressure sensor 140 senses weight of the user, then the foot is likely not in the air moving for a kick or a pass, and the user is likely to be standing. The use of pressure sensor 140 can prevent false positives from accelerometer vibrations and can prevent cheating by ensuring that the user is indeed standing. By applying pressure and releasing it for durations of time, the rate at which the feet, for example, are running can be measured, and can be used to verify kicks are actually happening when the foot is in the air.
At a step 415, the signals read from the accelerometer 135 are filtered. This filtering reduces noise and gives a more accurate value representing the movement. This filtering also puts the values into a range that the rest of the probabilistic algorithm can work with. In some embodiments, the filtering is done on each axis independently by normalizing values to 0. The normalization allows positive to be forward movements for shots, left movements for passing (the idealized pass movement for a right-footed player) and upward for running. It should be understood that bigger and smaller ranges of movements are possible, depending on the filtering method used. Other normalizing methods are possible from using the absolute values to more complicated filtering (such as low-pass, high-pass, band-pass etc.) to remove noise and then normalize values.
At a step 420, values from the current reading are interpolated to further remove noise or determine a change in direction. Since any one reading may be erroneous for a number of reasons, the probabilistic algorithm interpolates values from previous history in order to more accurately determine what is happening. In some embodiments, the last three values are averaged to better adjust the value. This both corrects noise but makes changes more gradual by forcing the player to actively move in greater motions to prevent cheating or small waggle problems. Interpolation can be done in a more complicated fashion, weighing certain historical values versus the current reading differently, or taking a greater history of values, or smaller, and using these numbers as desired.
After interpolating the values determined, at a step 425, a movement class is determined based on the interpolated signals. A simple method would be to use threshold testing, but this does not lend itself to accurate movements and often can cause false positives. If thresholds are too low, running, kicking, and passing motions cannot be accurately distinguished. If thresholds are too high, a movement may not be detected because it is simply confused with noisy behavior. As a result, a probabilistic method is needed in order to more accurately and properly determine what is moving and in what fashion. This also resolves confusing scenarios when information from the accelerometer 135 is noisy and does not properly attribute to, for example, a perfect kick or perfect pass. As such, a history of values is read to calculate probabilities to determine a movement class.
A probabilistic method will allow the algorithm to learn the proper behavior and use past values and tests to determine if scenarios that are unfolding are more likely to be motion A or motion B. In other words, if one were to swing their leg forward and slightly to the left, the probabilistic algorithm is able to determine and learn from past behavior whether a kick was actually intended or if the kick should have been interpreted as a pass instead. The probabilistic algorithm is able to learn from a test set and store its results to use in later comparisons and predict what movements are happening, in order to better reduce false positives and accurately determine movement types and strengths.
When training and building the probabilities for the Bayesian Network, false positives are identified and probabilities of events adjusted so that the algorithm learns and becomes stronger over time. Because of this, later comparisons will be able to better predict what movements are actually occurring, and thus will reduce false positives, and may even serve to accurately determine total applied strength and speed of the motion itself.
Referring to
At a step 480a, it is determined whether the movement is a kick. If the movement is a kick, a kick class is outputted at a step 480b, and the method 450 ends. If the movement is not a kick, at a step 485a, it is determined whether the movement is a pass. If the movement is a pass, a pass class is outputted at a step 485b, and the method 450 ends. If the movement is not a pass, at a step 490a, it is determined whether the movement is a run. If the movement is a run, a run class is outputted at a step 490b, and the method 450 ends. If the movement is not a run, then a no move is outputted at a step 495, and the method 450 ends. The determination steps 480a-490a do not necessarily need to follow the sequence illustrated in
Other probabilistic and statistical methods can fall in this category as well and the algorithm will still work the same, since values are turned into classifications for movements and length of these movements. For example, in place of a Bayesian Network, one could use Support Vector Machines, that, instead of learning and adjusting probabilities while running, learn by taking a larger set and attempt to find the subset that more properly predicts movements is found to build the probabilities that the algorithm will then use to determine if test actions later are certain movements or not, for example, running, shooting, or passing. It is therefore contemplated that such classifications can be attained by, for example, calculating probabilities, applying Support Vector Machines for classification based on the calculated probabilities, signals, and history, and then outputting a movement class.
Referring back to
At a step 435, these set of values and actions are stored as another stage to then be used in further interpolation and predictions in the next iteration of the algorithm. This storage can be done in any fashion as necessary based upon how the interpolation and probabilities are done as mentioned earlier.
The algorithm loops and repeats as necessary until it is determined at a step 440 that the actions are no longer needed to be viewed and the user stops the algorithm.
This technique of the present invention uses one or more finite state machines (FSMs) for interpreting and reproducing realistic motion as video game actions. FSMs can control the behavior of a primary application by defining a finite set of application states, state transitions and actions. State diagrams provide easy to understand illustrations of such state machines, making it easy to communicate the logic flow of the primary application. FSMs are suitable for motion interpretation because they are deterministic, have low computational overhead, and allow signals to be described and analyzed within some context (as defined by the state machine). The low computational overhead of FSMs make them perfect candidates for video game applications and other applications that require real-time response to user input. By using finite state machines to interpret and reproduce realistic motion captured from body-wearable sensors as video game actions, gamers can be put into more immersive and active gaming experiences The advantages of using FSMs for interpreting and reproducing motion is that they are deterministic, they are easy to construct, and they have low computational overhead.
Specifically, the motion sensing unit 505 includes sensors 510, 515 of one or more capture devices for transmitting and receiving motion data. Physical motion data is captured by one or more sensors 515, which includes, but not limited to, an accelerometer, a gyroscope, a camera, an illuminated array, and/or an RF tag. Motion data is received by one or more sensors 510, which transfer the motion data to the motion interpretation unit 520. The data transfer can be done via direct wired connection, wireless data transmission or by any other methods known to those of ordinary skill in the art.
Raw motion data is periodically sampled from one or more motion sensors of the motion sensing unit 505. A signal processing unit 525 encodes the processed motion data using one or more states and/or one or more state transitions 530, or it passes the processed motion data directly to the action mapping unit 535 to be directly mapped to one or more actions within the primary application 545.
One or more finite state machines 530 are constructed from the identified states and state transitions and are used for interpreting the physical motion (using the received motion data). These finite state machines 530 preferably provide short term memory, providing signals a context at the particular time that it is sampled. This short term memory allows for a more reliably interpreted physical movement because certain motions can have different effects depending on the users previous state.
For example,
Continuing with the example,
Referring back to
The primary application 545 typically contains one or more virtual objects or characters that are to be controlled. The primary application 545 can include a control dictionary 550 that maps one or more input (e.g., mouse, keyboard, joystick, gamepad, etc.) events to one or more actions defined by the primary application 545; for example, the right button on the keyboard may map to the “walk right” action, etc. In some embodiments, the action mapping unit 535 is completely or partly integrated with the primary application 545, or it can be completely separate from the primary application 545.
Using motion data from sensor(s) of one or more body-wearable capture devices, such as those illustrated in
Exemplary Implementation.
As a proof-of-concept, this system was tested and implemented using two Nintendo Wiimotes as the capture devices in the motion sensing unit, GlovePIE was used as both the motion interpretation unit and action mapping unit, and the game “Super Maryo Chronicles” was used as the primary application.
Nintendo's Wiimotes have a built-in 3-axis accelerometer from which the raw motion data was captured. Theses devices used the bluetooth protocol to pair with a machine running Windows XP. The setup for the interface for this particular application required one Wiimote™ to be held by the user, while another Wiimote™ to be placed in a pant pocket of the user (or somehow otherwise attached to align with one of the user's thighs).
The Wiimote™ that is held senses two motions: a throwing motion and a twisting motion. The throwing motion maps to the game's throw fireball action, and the twisting motion maps to the games enter door action (the twisting is supposed to correspond to the motion of twisting a doorknob to open a door). The Wiimote™ that is aligned with the user's thigh also senses two motions: a jumping motion and a squatting/kneeling motion. The jumping motion maps to the game's jump action, and the squatting/kneeling motion maps to the game's crouching/ducking action.
Although this implementation has the ability to sense four different motions performed by the user, only the interpreting of the jumping motion makes use of a finite state machine. This is because the other three motions' raw data were enough to be able to interpret whether or not those motions were being performed. The jumping motion, on the other hand, required more information in order to be interpreted correctly.
To understand the mechanics of a jump motion, it is helpful to visualize the signal of the z-acceleration. As seen in
In preliminary tests for interpreting jump motions, simple thresholding was used to detect jumps. That is, if the z-acceleration crossed a certain threshold, a jump was interpreted. This initial implementation, however, lead to many false positives (e.g., jump actions being registered when the user did not jump). To overcome these false positives, the mechanics of a jump and how each sub-motion (e.g., energy build-up, the actual jump, landing, and recovering) is represented by the motion data must be understood. The challenge was that the jump sub-motion looks very similar to the recover sub-motion when sampling the signal instantaneously. From this, creating a finite state machine to give the signal context in order to properly interpret the motion became clear.
The finite state machine can inherently provide short-term memory, so to speak, which is useful in determining whether the sampled signal is a jump motion or a recover motion. If the user was in an idle state, and then a positive z-acceleration is encountered that crosses some threshold, the FSM can correctly interpret that as a jump motion. If, however, the user just landed and then a positive z-acceleration is encountered, the FSM can interpret that as a recover motion, and not signal a jump. The state machine implemented for this particular application is illustrated in
GlovePIE was used to define and implement the finite state machine. GlovePIE is an open-source solution for emulating joystick, keyboard, and mouse input using other external devices, such as Nintendo's Wiimotes. The way it works is that the developer writes a GlovePIE script that processes the signals from the external devices and creates the desired keyboard, mouse, and joystick mappings. Once the script is ready to go, it is executed alongside the application which will use these new input mappings. For example, for this application, the JUMP STATE is mapped to the keyboard button “S,” which in the game is the JUMP BUTTON.
“Secret Maryo Chronicles” is an open-source, 2D sidescrolling action game that is very similar to Nintendo's Super Maryo Brothers. Side-scrollers are essentially made up of many 2D virtual obstacle courses (e.g., levels), and the objective of the game is to reach the end of each level without dying. Each level is littered with enemies and traps that try to impede the user's progress. This game was chosen because of the amount of jumping involved. By making the user jump in order to make the Maryo character jump, it is hoped that a very active (and enjoyable) gaming experience was achieved.
While the invention has been described with reference to numerous specific details, one of ordinary skill in the art will recognize that the invention can be embodied in other specific forms without departing from the spirit of the invention. Thus, one of ordinary skill in the art will understand that the invention is not to be limited by the foregoing illustrative details, but rather is to be defined by the appended claims.
The application claims priority of U.S. provisional application Ser. No. 61/435,206, filed Jan. 21, 2011, entitled “PROBABILISTIC ALGORITHM FOR PROPER KICK DETECTION,” of U.S. provisional application Ser. No. 61/435,211, filed Jan. 21, 2011, entitled “MOBILE FOOT-BASED GAMING SYSTEM,” and of U.S. provisional application Ser. No. 61/435,220, filed Jan. 21, 2011, entitled “METHOD FOR INTERPRETING AND REPRODUCING REALISTIC MOVEMENT AS VIDEO GAME ACTIONS,” their entirety of which is herein incorporated by reference.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US12/21906 | 1/19/2012 | WO | 00 | 10/7/2013 |
Number | Date | Country | |
---|---|---|---|
61435206 | Jan 2011 | US | |
61435211 | Jan 2011 | US | |
61435220 | Jan 2011 | US |