The present invention relates to a method and a device for fall prevention and detection, specially for monitoring elderly people in order to emit an alarm signal in case of a risk for a fall or an actual fall being detected.
The problem of accidental falls among elderly people is a major health problem. More than 30 percent of people more than 80 years old fall at least once during a year and as many as 3,000 aged people die from fall injuries in Sweden each year. Preventive methods can be used but falls will still occur and with increased average lifetime, the share of population above 65 years old will be higher, thus resulting in more people suffering from falls.
Different fall detectors are available. One previously known detector comprises an alarm button worn around the wrist. Another detector, for example known from US 2001/0004234, measures acceleration and body direction and is attached to a belt of the person. But people refusing or forgetting to wear this kind of detectors, or being unable to press the alarm button due to unconsciousness or dementia, still need a way to get help if they are incapable of getting up after a fall.
Thus, there is a need for a fall detector that remedies the above-mentioned shortcomings of prior devices.
In certain instances, it might also be of interest to provide for fall prevention, i.e. a capability to detect an increased risk for a future fall condition, and issue a corresponding alarm.
Intelligent optical sensors are previously known, for example in the fields of monitoring and surveillance, and automatic door control, see for example WO 01/48719 and SE 0103226-7. Thus, such sensors may have an ability to determine a person's location and movement with respect to predetermined zones, but they currently lack the functionality of fall prevention and detection.
An object of the present invention therefore is to solve the above problems and thus provide algorithms for fall prevention and detection based on image analysis using image sequences from an intelligent optical sensor. Preferably, such algorithms should have a high degree of precision, to minimize both the number of false alarms and the number of missed alarm conditions.
This and other objects that will be apparent from the following description have now been achieved, completely or at least partially, by means of methods and devices according to the independent claims. Preferred embodiments are defined in the dependent claims.
The fall detection of the present invention may be divided into two main steps; finding the person on the floor and examining the way in which the person ended up on the floor. The first step may be further divided into algorithms investigating the percentage share of the body on the floor, the inclination of the body and the apparent length of the person. The second step may include algorithms examining the velocity and acceleration of the person. When the first step indicates that the person is on the floor, data for a time period before, and possibly also after, the indication may be analysed in the second step. If this analysis indicates a fall, a countdown state may be initiated in order to reduce the risk of false alarms, before sending an alarm.
The fall prevention of the present invention may also be divided into two main steps; identifying a person entering a bed, and identifying the person leaving the bed to end up standing beside it. The second step may be further divided into algorithms investigating the surface area of one or more objects in an image, the inclination of these objects, and the apparent length of these objects. When the second step indicates that a person is in an upright condition, a countdown state may be initiated in order to allow for the person to return to the bed.
Further objects, features and advantages of the invention will appear from the following detailed description of the invention with reference to the accompanying drawings, in which:
Sweden has one of the worlds highest shares of population older than 65 years. This share will increase further. The situation is similar in other Western countries. An older population puts larger demands on the medical care. One way to fulfil these high demands may be to provide good technical aids.
In the field of geriatrics, confusion, incontinence, immobilization and accidental falls are sometimes referred to as the “geriatric giants”. This denomination is used because these problems are both large health problems for elderly, and symptoms of serious underlying problems. The primary reasons for accidental falls can be of various kinds, though most of them have dizziness as a symptom. Other causes are heart failures, neurological diseases and poor vision.
As much as half of the older persons who contact the emergency care in Sweden do this for dizziness and fall-related problems. This makes the problem a serious health issue for the elderly.
Risk factors for falls are often divided into external and intrinsic risk factors. It is about the same risk that a fall is caused by an external risk factor as it is by an intrinsic risk factor. Sometimes the fall is a combination of both.
External risk factors include high thresholds, bad lighting, slippery floors and other circumstances in the home environment. Another common external risk is medicines, itself or in combination, causing e.g. dizziness for the aged. Another possible and not unusual external effect is inaccurate walking aids.
Intrinsic risk factors depend on the patient himself. Poor eyesight, reduced hearing or other factors making it harder for elderly to observe obstacles are some examples. Others are dementia, degeneration of the nervous system and muscles making it harder for the person to parry a fall and osteoporosis, which makes the skeleton more fragile.
In order to avoid elderly from falling, different preventive measures could be taken, e.g. removing thresholds and carpets and mounting handrails on the beds. In short, minimizing the external risk factors. This may also be combined with frequent physical exercise for the elderly. But whatever measures that is taken, falls will still occur, causing pain and anxiousness among the elderly.
When an elderly person falls, it often results in minor injuries such as bruises or small wounds. Other common consequences are soft-tissue injuries and fractures, including hip fractures. The elderly could also sustain pressure-wounds if he or she lies on the floor for a longer time without getting help.
In addition to physical effects, a fall has also psychological effects. Many elderly are afraid of falling again and choose to move to elderly care centres or to not walk around as they used to do. This makes them more immovable, which weaken the muscles and frails the skeleton. They enter a vicious circle.
It is important to make the elderly person who suffered from a fall accident to feel more secure. If he or she falls, a nurse should be notified and assist the person. Today a couple of methods are available. The most common is an alarm button worn around the wrist. In this way the person can easily call for help when needed. Another solution is a fall detector mounted e.g. on the person's belt, measuring high accelerations or changes in the direction of the body.
The present invention provides a visual sensor device that has the advantage that it is easy to install and is cheap and possible to modify for the person's own needs. Furthermore, it doesn't demand much effort from the person using it. It also provides for fall prevention or fall detection, or both.
The device may be used by and for elderly people who want an independent life without the fear of not getting help after a fall. It can be used in home environments as well as in elderly care centres and hospitals.
The device according to the invention comprises an intelligent optical sensor, as described in Applicant's PCT publications WO 01/48719, WO 01/49033 and WO 01/48696, the contents of which are incorporated in the present specification by reference.
The sensor is built on smart camera technology, which refers to a digital camera integrated with a small computer unit. The computer unit processes the images taken by the camera using different algorithms in order to arrive at a certain decision, in our case whether there is a risk for a future fall or not, or whether a fall has occurred or not.
The processor of the sensor is a 72 MHz ASIC, developed by C Technologies AB, Sweden and marketed under the trademark Argus CT-100. It handles both the image grabbing from the sensor chip and the image processing. Since these two processes share the same computing resource, considerations has to be taken between higher frame rate on the one hand, and more computational time on the other. The system has 8 MB SDRAM and 2 MB NOR Flash memory.
The camera covers 116 degrees in the horizontal direction and 85 degrees in the vertical direction. It has a focal length of 2.5 nm, and each image element (pixel) measures 30×30 μm2. The camera operates in the visual and near infrared wavelength range.
The images are 166 pixels wide and 126 pixels high with an 8 bit grey scale pixel value.
The sensor may be placed above a bed 1 overlooking the floor. As shown in
The fall detection according to the present invention is only one part of the complete system. Another feature is a bed presence algorithm, which checks if a person is going in or out of the bed. The fall detection may be activated only when the person has left the bed.
The system may be configured not to trigger the alarm if more than one person is in the room, since the other person not falling is considered capable of calling for help. Pressing a button attached to the sensor may deactivate the alarm. The alarm may be activated again automatically after a preset time period, such as 2 hours, or less, so that the alarm is not accidentally left deactivated.
The sensor may be placed above the short side of the bed at a height of about two meters looking downwards with an angle of about 35 degrees. This placement is a good position since no one can stand in front of the bed, thereby blocking the sensor and it is easy to get a hint of whether the person is standing, sitting or lying down. However, placing the sensor higher up, e.g. in the corner of the room would decrease the number of hidden spots and make it easier with shadow reduction on the walls, since the walls can be masked out. Of course, other arrangement are possible, e.g. overlooking one longitudinal side of the bed. The arrangement and installation of the sensor may be automated according to the method described in Applicant's PCT publication WO 03/091961, the contents of which is incorporated in the present specification by reference.
The floor area monitored by the sensor may coincide with the actual floor area or be smaller or larger. If the monitored floor area is larger than the actual floor area, some algorithms to be described below may work better. The monitored floor area may be defined by the above-mentioned remote control.
In order to make a system that could recognize a fall, the distinguishing features for a fall have to be found and analysed. The distinguishing features for a fall can be divided into three events:
1) The body moves towards the floor with a high velocity in an accelerating movement.
2) The body hits the floor and a retarding movement occurs.
3) The person lies fairly still on the floor, with no motion above a certain height, about one meter.
One can of course find other distinguishing features for a fall. However, many of them are not detectable by an optical sensor. A human being could detect a possible fall by hearing the slam that occurs when the body hits the floor. Of course, such features could be accounted for by connecting or integrating a microphone with the above sensor device.
There are different causes for a fall, and also different types of fall. Those connected to high velocities and much (heavy) motion are easy to detect while others happen more slowly or with smaller movement. It is therefore important to characterize a number of fall types.
Bed Fall
A person falls from the bed down onto the floor. Since the fall detection should not be used until the system has detected an “out of bed”-event, this fall is a special case. One way to solve this is to check if the person is lying on the floor for a certain time after he or she left the bed.
Collapse Fall
A person suffering from a sudden lowering in blood pressure or having a heart attack could collapse on the floor. Since the collapse can be of various kinds, fast or slow ones, with more or less motion, it could be difficult to detect those falls.
Chair Fall
A person falling off a chair could be difficult to detect, since the person is already close to the floor and therefore will not reach a high velocity.
Reaching and Missing Fall
Another type of falls is when a person reaches for example a chair, misses it and falls. This could be difficult to detect if the fall occurs slowly, but more often high velocities are connected to this type of fall.
Slip Fall
Wet floors, carpets etc. could make a person slip and fall. High velocities and accelerations are connected to this type of fall making it easy to separate from a non-fall situation, e.g. a person lying down on the floor.
Trip Fall
This type of fall has the same characteristics as the slip fall, making it easy to detect. Thresholds, carpets and other obstacles are common causes for trip falls.
Upper Level Fall
Upper level falls include falls from chairs, ladders, stairs and other upper levels. The high velocities and accelerations are present here.
The detection must be accurate. The elderly have to receive help when they fall but the system may not send too many false alarms, since it would cost a lot of money and decrease the trust of the product. Thus, it must be a good balance between false alarms and missed detections.
By finding the fact that a person is lying on the floor by a “floor algorithm” may be sufficient for sending an alarm. Here it is important to wait for a couple of minutes before alarming to avoid false alarms.
Another approach is to detect that a person is lying on the floor for a couple of seconds by the floor algorithm and then detect whether a fall had occurred by a “fall algorithm”. In this way the fall detection algorithm must not run all the time but rather at specific occasions.
Yet another approach is to detect that a person attains an upright position, by an “upright position algorithm”, and then sending a preventive alarm. The upright position may include the person sitting on the bed or standing beside it. Optionally, the upright position algorithm is only initiated upon the detection, by a bed presence algorithm, of a person leaving the bed. Such an algorithm may be used whenever the monitored person is known to have a high disposition to falling, e.g. due to poor eyesight, dizziness, heavy medication, disablement and other physical incapabilities, etc.
Both the floor algorithm and the upright position algorithm may use the length of the person and the direction of the body as well as the covering of the floor by the person.
The fall algorithm may detect heavy motion and short times between high positive and high negative accelerations.
A number of borderline cases for fall detection may occur. A person lying down quickly on the floor may fulfil all demands and thereby trigger the alarm. Likewise, if the floor area is large, a person sitting down in a sofa may also trigger the alarm. A coat falling down on the floor from a clothes hanger may also trigger the alarm.
There are also borderline cases that work in the opposite direction. A person having a heart attack may slowly sink down on the floor.
In order to obtain statistical data used for the following evaluation, several tests films have been performed with the following conditions.
The frame rate in the tests films is about 3 Hz under normal light conditions compared to about 10-15 Hz when the images are handled inside the sensor. All tests films were shot under good light conditions.
In order to see that the system worked properly not only in the test studio, the tests films were performed in six different home interiors. Important differences between the interiors were different illumination conditions, varying sunlight, varying room size, varying number of walls next to the bed, diverse objects on the floor, etc.
When the camera takes a picture, it may transform the room coordinates to image coordinates, pixels. This procedure may be divided into four parts: room to sensor, sensor to undistorted image coordinates, undistorted to distorted image coordinates, and distorted image coordinates to pixel coordinates, see
The room coordinate system has its origin on the floor right below the sensor 1, with the X axis along the sensor wall, the Y axis upwards and the Z axis out in the room parallel to the left and right wall, as shown in
In
The transformation from room coordinates to sensor coordinates is a translation in Y followed by a rotation around the X axis,
X′=X
Y′=(Y−h)·cos(α)+Z·sin(α)
Z′=−(Y−h)·sin(α)+Z·cos(α) [1]
where h is the height of the sensor and a is the angle between the Z and Z′ axis.
While the room has three axes, the image has only two. Thus, the sensor coordinates has to be transformed to two-dimensional image coordinates. The first step is perspective divide, which transforms the sensor coordinates to real image coordinates.
If the camera behaves as a pinhole camera:
where f is the focal length of the lens. Accordingly, the undistorted image coordinates xu and yu are given by:
Notice that when transforming back from image coordinates to room coordinates the system is underdetermined. Thus, one of the room coordinates should be given a value before transforming.
The sensor uses a fish-eye lens that distorts the image coordinates. The distortion model used in our embodiments is:
The image is discretely divided into m rows and n columns with origin (1,1) in the upper left corner. To obtain this, a simple transformation of the distorted coordinates (xd, yd) is done:
where xp and yp is the width and height, respectively, of a pixel, and xi and yi are the pixel coordinates.
The goal of the pre-treatment of the images is to create a model of the moving object in the images. The model has the knowledge of which pixels in the image that belongs to the object. These pixels are called foreground pixels and the image of the foreground pixels are called the foreground image.
How can you tell whether a certain object is part of the background or is moving in relation to the background? By just looking at one single image it may be difficult to decide, but with more than one image in a series of images it is more easily achievable. What then differs the background from the foreground? In this case, it is the movement of the objects. An object having different locations in space in a series of images is considered moving and an object having the same appearance for a certain period of time is considered as a background. This means that a foreground object will become a background object whenever it stops to move, and will once again become a foreground object when it starts moving again. The following algorithm calculates the background image.
Background Algorithm
The objective is to create an image of the background that does not contain moving objects according to what has been mentioned above. Assume a series of N grey scale images I0 . . . IN, consisting of m rows and n columns. Divide the images in blocks of 6×6 pixels and assign a timer to each block controlling when to update the block as background. Now, for each image Ii, i=x . . . N, subtract Ii with the image Ii−x to obtain a difference image DIi. For each block in DIi, reset the timer if there are more than y pixels with an absolute pixel value greater than z. Also reset the timers for the four nearest neighbours. If there are less than y pixels, the block is considered as motionless and the corresponding block in Ii is updated as background if its timer has ended. The parameter values used are x=10, y=5, and timer ending=2000 ms. The noise determines the value of z.
To determine the value of z, it is convenient to estimate the noise in the image. The model described below is quite simple but gives good results.
Assume a series of N images Ii . . . Li+N−1. The standard deviation of the noise for the pixel at row u and column v is then:
where p(u,v,j) is the pixel value at row u and column v in image j, and
is the mean of the pixels at row u and column v in the N images. The mean standard deviation of all pixels is then:
The estimation of the noise has to be done all the time since changes in light, e.g. opening a Venetian blind, will increase or decrease the noise. The estimation cannot be done on the entire image since a presence of a moving object will increase the noise significantly. Instead, this is done on just the four corners, in blocks of 40×40 pixels with the assumption that a moving object will not pass all four corners during the time elapsed from image Ii until image Ii+N−1. The value used is the minimum of the four mean standard deviations. In the present embodiments, z is chosen as
z=3·
Foreground
By subtracting the background image from the present image a difference image is obtained. This image now contains those areas in which motion has occurred. In an ideal image, it now suffices to select as foreground pixels those pixels that have an grey scale value above a certain threshold. However, shadows, noise, flickering screens and other disturbances also occur as motion in the image. Persons with clothes having the same colour as the background will also cause a problem, since they may not appear in the difference image.
Shadows
Objects moving in the scene cast shadows on the walls, on the floor and on other objects. Shadows vary in intensity depending on the light source, e.g. a shadow cast by a moving object on a white wall from a spotlight might have higher intensity than the object itself in the difference image. Thus, shadow reduction may be an important part of the pretreatment of the images.
To reduce the shadows, the pixels in the difference images with high grey scale values are kept as foreground pixels as well as areas with high variance. The variance is calculated as a point detection using a convolution, see Appendix A, between the difference image and a 3×3-matrix SE:
Noise and False Objects
The image is now a binary image consisting of pixels with values 1 for foreground pixels. It may be important to remove small noise areas and fill holes in the binary image to get more distinctive segments. This is done by a kind of morphing, see Appendix A, where all 1-pixels with less than three 1-pixel neighbours are removed, and all 0-pixels with more than three 1-pixel neighbours are set to 1.
If the moving person picks up another object and puts it away on some other place in the room, then two new “objects” will arise. Firstly, at the spot where object was standing, the now visible background will act as an object and secondly, the object itself will act as a new object when placed at the new spot since it will then hide the background.
Such false objects can be removed, e.g. if they are small enough compared to the moving person, in our case less than 10 pixels, or by identifying the area(s) where image movement occurs and by elimination objects distant from such area(s). This is done in the tracking algorithm.
Tracking Algorithm
Keeping track of the moving person can be useful. False objects can be removed and assumptions on where the person will be in the next frame can be made.
The tracking algorithm tracks several moving objects in a scene. For each tracked object, it calculates an area A in which the object is likely to appear in the next image:
The algorithm maintains knowledge of where each tracked object has been for the last five images, in room coordinates X0 . . . X4, Y0 . . . Y4=0 and Z0 . . . Z4. The new room or floor coordinates are calculated as
and respectively for Znew. Ynew=0.
The coordinates for a rectangle with corners in Xnew−0.5, −0.5, Znew), (Xnew−0.5, 2.0, Znew), (Xnew+0.5, 2.0, Znew) and (Xnew+0.5, −0.5, Znew) are transformed to pixel coordinates xi0 . . . xi3, and the area A is taken as the pixels inside the rectangle with corners at xi0 . . . xi3. This area corresponds to a rectangle of 1.0×2.5 meters, which should enclose a whole body.
The tracking is done as follows.
Assuming a binary noise-reduced image L the N different segments S0 to SN in I are found using a region-grow segmentation algorithm, see Appendix A.
The different segments are added to a tracked object if they consist of more than 10 pixels and have more than 10 percent of their pixels inside the area A of the object. In this way, several segments could form an object.
The segments that do not belong to an object become new objects themselves if they have more than 100 pixels. This is e.g. how the first object is created.
When all segments have been processed, new X and Z values for the tracked objects are calculated. If a new object is created, new X and Z values are calculated directly to be able to add more segments to that object.
With several objects being tracked, it may become important to identify the object that represents the person. One approach is to choose the largest object as the person. Another approach is to choose the object that moves the most as the person. Yet another approach is to use all objects as input for the fall detection algorithms.
Floor Algorithms
For the floor algorithm, the following algorithms may be used.
On Floor algorithm
The percentage share of foreground pixels on the floor is calculated by taking the amount of pixels that are both floor pixels and foreground pixels divided by the total amount of foreground pixels.
This algorithm has a small dependence of shadows. When the person is standing up, he or she will cast shadows on the floor and walls but not when lying down. Thus, the algorithm could give false alarms, but has an almost 100 percent accuracy in telling when a person is on the floor.
In big rooms, the floor area is large and a bending person or a person sitting in a sofa could fool the algorithm to believe that he or she is on the floor. The next two algorithms help to avoid such false alarms.
Angle Algorithm
One significant difference between a standing person and a person lying on the floor is the angle between the direction of the person's body and the Y-axis of the room. The smaller angle the higher probability that the person is standing up.
The most accurate way of calculating this would be to find the direction of the body in room coordinates. This is, however, not easily achievable, since transforming from 2D image coordinates to 3D room coordinates requires pre-setting one of the room coordinates, e.g. Y=0.
Instead, the Y-axis is transformed, or projected, onto the image in the following way:
1) Transform the coordinates of the person's feet (uf, vf) into room coordinates (Xf, Yf=0, Zf).
2) Add a length ΔY to Yf and transform this coordinate back to image coordinates (uh, vh).
3) The Y-axis is now the vector between (uf, vf) and (uh, vh).
This direction is compared with the direction of the body in the image, which could be calculated in a number of ways. One approach is to use the least-square method. Another approach is to randomly choose N pixels p0 . . . pN−1, calculate the vectors v0 . . . vN/2−1, vi=p2i+1−p2i between the pixels, and finally representing the direction of the body as the mean vector of the vectors v0 . . . vN/2−1.
Apparent Length Algorithm
A third way is to find the image coordinates for the “head” and the “feet” of the object and calculating the vector between them. Depending on whether the object has a longer height than width, or vice versa, the object is split up vertically or horizontally, respectively, into five parts. The mass centres of the extreme parts are calculated and the vector between them is taken as the direction of the body.
Since measuring of the angle is done in the image, some cases will give false alarms, e.g. if a person is lying on the floor in the direction of the Z-axis straight in front of the sensor. This would look like a very short person standing up and the calculated angle would become very small, indicating that the person is standing up. The next algorithm compensates for this.
Assume that a person is lying down on the floor in an image. Then it is easy to calculate the length of the body by transforming the “head” and “feet” image coordinates (uh, vh) and (uf, vf) into room coordinates (Xh, 0, Zh) and (Xf, 0, Zh), respectively. The distance between the two room points are then a good measurement of the length of the person. Now, what would happen if the he was standing up? The feet coordinates would be transformed correctly but the head coordinates would be inaccurate. They would be considered much further away from the sensor, see
Thus, the distance between the two room coordinates would be large and therefore large values of the length of the person, say more than two or three meters would be considered as the person standing up. And consequently small values of the person, less than two or three meters would assume the person to be lying down. The (uh, vh) and (uf, vf) coordinates may be calculated the same way as in the Angle algorithm.
Fall Algorithms
According to a study on elderly people, the velocity of a fall is 2-3 times higher than the velocity of normal activities such as walking, sitting, bending down, lying down etc. This result is the cornerstone of the following algorithm.
Mass Centre Algorithm
The velocity v of the person is calculated as the distance between the mass centres Mi and Mi+1 of the foreground pixels of two succeeding images Ii and Ii+1 divided by the time elapsed between the two images.
It may be desirable to calculate the mass centres in room coordinates, but once again this may be difficult to achieve. Instead, the mass centres may be calculated in image coordinates. By doing this, the result becomes dependent on where in the room the person is located. If the person is far away from the sensor, the distances measured will be very short, and the other way around if the person is close to the sensor. To compensate for this, dividing with the Z-coordinate of the person's feet normalizes the calculated distances.
Previous Image Algorithm
Another way to measure the velocity is used in the following algorithm. It is based on the fact that a fast moving object will result in more foreground pixels when using the previous image as the background than a slow one would.
In this algorithm the first step is to calculate a second foreground image FIp using the previous image as the background. Then this image is compared with the normal foreground image FIn. If an object moves slowly, the previous image would look similar to the present image, resulting in a foreground image FIp with few foreground pixels. On the other hand, a fast moving object could have as much as twice as many foreground pixels in FIp as in FIn.
Percentage Share Algorithm
When a person falls, he or she will eventually end up lying on the floor. Thus, no points of the body will be higher than say about half a meter. The idea here is to find a horizontal line in the image corresponding to a height of about one meter. Since this depends on the location of the person within the image, the algorithm starts by calculating the room coordinates for the person's feet. A length ΔY=1 m is added to Y, and the room coordinates are transformed back into image coordinates. The image coordinate yi now marks the horizontal line. The algorithm returns the number of foreground pixels below the horizontal line divided by the total number of foreground pixels.
The fall detection algorithms MassCentre and PreviousImage show a noisy pattern. They may return many false alarms if they were to be run all the time, since shadows, sudden light changes and false objects fool the algorithms. To reduce the number of false alarms, the Fall algorithms are not run continually, but rather at times when one or more of the Floor algorithms (On Floor, Angle and Apparent Length) indicates that the person is on the floor. Another feature reducing the number of false alarms is to wait a short time before sending an alarm after a fall has occurred. Thus, the fall detection may be postponed until one or more of the Floor algorithms has detected a person on the floor for more than 30 seconds. With this approach the number of false alarms are reduced significantly.
The first embodiment is divided into five states, “No Person state”, “Trigger state”, “detection state”, “Countdown state” and “Alarm state”. A state space model of the first embodiment is shown in
When the sensor is switched on, the embodiment starts in the No Person state. While in this state, the embodiment has only one task, to detect motion. If motion is detected, the embodiment switches to the Trigger state. The embodiment will return to the No Person state if it detects a person leaving the room while in the Trigger state, or if the alarm is deactivated.
Motion detection works by a simple algorithm that subtracts the present image by the previous image and counts those pixels in the resulting image with grey level values above a certain threshold. If the sum of the counted pixels is high enough, then motion has been detected.
As mentioned above, the Trigger state will be activated as soon as any motion has been detected in the No Person state. The steps of the Trigger state is further illustrated in
The Trigger state has a timer, which controls the amount of time passed since the person was first detected as on the floor. When the person is off the floor the timer is being reset. When a person has been on the floor for a number of seconds, e.g. 2 seconds, the sequence of data from standing position to lying position is saved for later fall detection, e.g. by the last 5 seconds being saved.
The embodiment switches to the Detection state when a person has been detected as being on the floor for more than 30 seconds.
This state is where the actual fall detection takes place. Based on the saved data from the Trigger state, an analysis is effected of whether a fall has occurred or not. If the detection state detects a fall, the embodiment switches to the Countdown state, otherwise it goes back to the Trigger state.
While in the Countdown state, the embodiment makes sure that the person is still lying on the floor. This is only to reduce the number of false alarms caused by e.g. persons vacuuming under the bed etc. When two minutes has passed and the person is still on the floor, the embodiment switches to the Alarm state. Should the person get off of the floor, embodiment switches back to the Trigger state.
In the Alarm state, an alarm is sent and the embodiment waits for the deactivation of the alarm.
As already stated above, it may be desirable to issue an alarm on detection of an upright condition, to thereby prevent a future possible fall. Below, the algorithm(s) used for such detection is referred to as a BedStand process.
Evidently, the above-identified Floor algorithms may also be use to identify an upright condition of an object, for example a person sitting up in the bed or leaving the bed to end up standing beside it. A person could be classified as standing if its apparent length exceeds a predetermined height value, e.g. 2 or 3 meters, and/or if the angle of the person with respect to the vertical room direction is less than a predetermined angle value, e.g. 10 or 20 degrees. The determination of an upright condition could also be conditioned upon the location of the person within the monitored floor area (see
It is also to be realized that the above-identified Percentage Share algorithm may be used, either by itself or in combination with any one of the above algorithms, to identify an upright condition, by the share of foreground pixels over a given height, e.g. 1 meter, exceeding a predetermined threshold value.
The combination of algorithms may be done in other ways, for example by forming a combined score value which is based on a resulting score value for each algorithm, and comparing the combined score value to a threshold score value for upright detection.
Fall prevention according to the second embodiment includes a state machine using the above BedStand process and a BedMotion process which checks for movement in the bed and detects a person entering the bed. Before illustrating the state machine, the BedMotion process will be briefly described.
The BedMotion process looks for movement in the bed caused by an object of a certain size, to avoid detection of movement from cats, minor dogs, shadows or lights, etc. The bed is represented as a bed zone in the image. The BedMotion process calculates the difference between the current image and the last image, and also the difference between the current image and an older image. The resulting difference images are then thresholded so that each pixel is either a positive difference, a negative difference or not a difference. The thresholded images are divided into blocks, each with a certain number of pixels. Each block that has enough positive and negative differences, and enough differences in total, are set as detection blocks. The detection blocks are active for some frames ahead. The percentage share of difference pixels in the bed zone compared to the area outside the bed is calculated from the thresholded difference images. The bed zone is then further split up in three parts: lower, middle and upper. A timer is started if there are detections in all three parts. The timer is reset every time one or more parts does not have detections. The requirements for an “in bed detection” is the combination of: the timer has run out; the number of detection blocks in each bed zone part exceeds a limit value; and the percentage share of the difference pixels is high enough. The BedMotion process may also signal that there is movement in the bed based on the total number of detection blocks in the bed zone.
The state machine of the second embodiment is shown in
Statistical Decision Process for Fall Detection
A person can end up on the floor in several ways. However, these can be divided into two main groups: fall or not fall. In order to make the decision process reliable, these two groups of data have to be as separated as possible.
It may also be important to find invariant variables. An invariant variable is a variable that is independent of changes in the environment, e.g. if the person is close or far away from the sensor or if the frame rate is high or low. If it is possible to find many uncorrelated invariant variables, the decision process will be more reliable.
The PreviousImage algorithm may be used to obtain an estimate of the velocity in the picture. As described above, one of the main characteristics of a fall is the retardation (negative acceleration) that occurs when the body hits the floor. An estimate of the acceleration may be obtained by taking the derivative of the results from the PreviousImage algorithm. The minimum value thereof is an estimate of the minimum acceleration or maximum retardation (Variable 1). This value is assumed to be the retardation that occurs when then person hits the floor.
The MassCentre algorithm also measures the velocity of the person. A fall is a big and fast movement, which imply a big return value. Taking the maximum value of the velocity estimate of the MassCentre algorithm (Variable 2), may give a good indication of whether a fall has occurred or not.
Alternatively or additionally, taking the derivative of the velocity estimation of the MassCentre algorithm, may give another estimate of the acceleration. As already concluded above, the minimum acceleration value may give information whether a fall has occurred or not (Variable 3).
Even with well-differentiated data it can be hard to set definite limits. One possible way to calculate the limits is with the help of statistics. In this way the spread of the data, or in a statistical term variance, is taken into account.
The distribution model for the variables is assumed to be the normal distribution. This is an easy distribution to use, and the data received from the algorithms has indicated that this is the distribution to use. The normal probability density function is defined as:
where d is the dimension of x, m is the expected value and Σ is the covariance matrix.
The expected values mfall and mno fall and the covariance matrices Σfall and Σno fall were calculated using test data from 29 falls and 18 non-falls.
The expectation value m is calculated as:
and the covariance matrix Σ as:
Given the values for m and Σ, it is possible to decide whether a fall has occurred or not. Assume data x from a possible fall. Equation 13 then returns two values ffall(x) and fno fall(x) for a fall and a non-fall, respectively. It may be easier to relate to the probability for a fall than for a non-fall.
When calculating the probability for a fall, the probability for a person ending up on the floor after a non-fall, p(not fall|on floor), and after a fall, p(fall|on floor), must be taken into account in order to be statistically correct. However, the current model assumes that these two are equal.
This implies that if ffall(x) is higher than fno fall(x) then the decision is that a fall has occurred, and vice versa if ffall(x) is lower than fno fall(x).
Assume two one-dimensional normal distributed variables, one with high variance and the other with low variance. The normal distribution functions for these variables could then look like in
Luckily the variances do not differ that much between the fall and non-fall cases, see
The tests were conducted on an embodiment developed in MATLAB™, for 58 falls and 24 non-falls. The algorithms returned the values shown in
The falls and no-falls used as input for the database were tested in order to decide whether the model worked or not. Out of the 29 falls, 28 were detected, and none of the 18 non-falls caused a false alarm. Thus, the model worked properly.
Among the other test data, 27 falls were detected out of 29 possible, and 2 of 6 non-falls returned a false alarm.
Hereinabove, several embodiments of the invention have been described with reference to the drawings. However, the different features or algorithms may be combined differently than described, still within the scope of the present invention.
For example, the different algorithms may run all in parallel, and the algorithms may be combined as defined above and in the claims at suitable time occasions. Specifically, the Fall algorithms may nm all the time but only be used when the Floor algorithms indicate that a person is lying on the floor.
The invention is only limited by the appended patent claims.
Image analysis is a wide field with numerous embodiments, from face recognition to image compression. This chapter will explain some basic image analysis features.
A.1. A Digital Image
A digital image is often represented as an m by n matrix, where m is the number of rows and n the number of columns. Each matrix element (u,v), where u=1 . . . m and v=1 . . . n, is called a pixel. The more pixels in a digital image the higher resolution.
Each pixel has a value, depending on which kind of image it is. If the image is a grey scale image with 256 grey scale levels every pixel has a value between 0 and 255, where 0 represent black and 255 white. However, if the image is a colour image one value isn't enough. In the RGB-model every pixel has three values between 0 and 255, if 256 levels are assumed. The first value is the amount of red, the second the amount of green and the last the amount of blue. In this way over 16 millions (256*256*256) different colour combinations can be achieved, which is enough for most embodiments.
A.2. Basic Operations
Since the digital image is represented as a matrix, standard matrix operations like addition, subtraction, multiplication and division can be used. Two different multiplications are available, common matrix multiplication and element wise multiplication:
respectively.
A.3. Convolution and Correlation
Another operation that is useful is the convolution or correlation between two images. Often one of the images, the kernel, is small, e.g. a 3×3 matrix. The correlation between the images B and C is defined as:
The convolution is defined as:
Correlation can be used to blur an image,
to find edges in the image,
or to find details, area with high variance, in an image,
A.4. Morphology
Morphing is a powerful processing tool based on mathematical set theory. With the help of a small kernel B a segment A can either be expanded or shrunk. The expansion process is called dilation and the shrinking process is called erosion. Mathematically these are described as:
A⊕B={x|[({circumflex over (B)})x∩A]⊂A [5.]
and
AΘB={x|(B)x⊂A} [6.]
respectively, where
(A)x={c|c=a+x, for aεA} [7.]
{circumflex over (B)}{x|x=−b, for bεB} [8.]
The erosion of A with B followed by the dilation of the result with B is called opening. This operation separates segments from each other.
A∘B=(AΘB)⊕B [9.]
Another operation is closing. It's a dilation of A with B followed by an erosion of the result with B. Closing an image will merge segments and fill holes.
A·B=(A⊕B)ΘB [10.]
A.5. Segmentation
It is often useful to subdivide the image into different segments, depending on e.g. shape, colour, variance and size. Segmentation can be done on colour images, grey level images and binary images. Only binary image segmentation is explained here.
One way to segment a binary image is by using the region-grow algorithm:
As seen above the region-grow algorithm is recursive and therefore uses a lot of memory. In systems with low memory, this could cause memory overflow. Because of this the following iterative method has been developed.
Number | Date | Country | Kind |
---|---|---|---|
0203483-3 | Nov 2002 | SE | national |
This application is a Divisional of co-pending application Ser. No. 10/536,016 filed on May 23, 2005 now U.S. Pat. No. 7,541,934, and for which priority is claimed under 35 U.S.C. §120; and this application claims priority of Application No. 0203483-3 filed in Sweden on Nov. 21, 2002 under 35 U.S.C. §119; the entire contents of all are hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
5099324 | Abe et al. | Mar 1992 | A |
6049281 | Osterweil | Apr 2000 | A |
6239706 | Yoshiike et al. | May 2001 | B1 |
6462663 | Wilson et al. | Oct 2002 | B1 |
6616607 | Hashimoto et al. | Sep 2003 | B2 |
20010004234 | Petelenz et al. | Jun 2001 | A1 |
20020196141 | Boone et al. | Dec 2002 | A1 |
Number | Date | Country |
---|---|---|
1 117 081 | Jul 2001 | EP |
1 195 139 | Apr 2002 | EP |
0 933 726 | Feb 2003 | EP |
523 547 | Apr 2004 | SE |
WO-0148696 | Jul 2001 | WO |
WO-0148719 | Jul 2001 | WO |
WO-0149033 | Jul 2001 | WO |
WO-0156471 | Aug 2001 | WO |
WO-03027977 | Apr 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20090121881 A1 | May 2009 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10536016 | May 2005 | US |
Child | 12240735 | US |