The present invention relates to a motion targeting system and method, and, in particular, to motion targeting of moving objects in a video system.
Systems for detecting and tracking moving objects through use of a video camera are known. Generally, such systems include algorithms for identifying and/or tracking motion in a video output of a camera or group of cameras. Tracking of the image may be achieved by controlling the pan-tilt-zoom (PTZ) of the camera. As used herein, PTZ refers to any imaging device associated with a camera, such as conventional video cameras, video surveillance domes, etc.
Several difficulties are associated with conventional motion detection and tracking systems. For example, tracking an object using a moving camera requires more powerful processors performing more extensive calculations. Also, when a camera is tracking an object, other objects can move into areas outside the current field of view without detection. The more zoomed in the camera is, the less surrounding area is covered and the easier it is to miss detecting significant events outside of the field of view or to loose track of an object because it slipped out of the field of view.
Fixed mount wide-angle cameras can track multiple objects simultaneously over large areas, but conventionally could not digitally zoom in on an object with enough resolution to facilitate positive identification. Wide-angle cameras with a high pixel resolution imager have been developed to provide improved digital zoom capability, but the digitally zoomed resolution of known wide-angle cameras remains much lower than current technology optical zoom cameras. Cameras with high pixel density imagers are also cost prohibitive compared to optical zoom cameras, and have slow frame rates because of the magnitude of pixels that must be processed during each frame.
A system approach using a stationary wide-angle video camera to track objects and command another camera is expensive. Very low cost wide-angle motion detectors, e.g. PIR sensors, etc., generally do not have sufficient resolution or intelligence to accurately control an associated camera. Covering a wide area with multiple, discrete, low cost motion detectors configured to target a camera requires a large number of sensors to obtain sufficient resolution.
In view of difficulties such as these, many currently installed systems do not include motion detection capability. Instead, a camera is operated in an automatic scanning mode with an output recorded on a time lapse or multiplexed recording device. These systems can cover a wide area with acceptable recording media requirements, but miss a significant amount of activity because they scan a wide space, with a single, relatively narrow field of view. A camera with a wider field of view can provide more continuous coverage, but requires a higher resolution, non-standard camera and expansive memory to provide sufficient resolution.
Accordingly, there is a need for a system and method for detecting and/or tracking moving objects in a video system in a cost efficient and reliable manner.
For a better understanding of the present invention, together with other objects, features and advantages, reference should be made to the following detailed description which should be read in conjunction with the following figures wherein like numerals represent like parts:
For simplicity and ease of explanation, the present invention will be described herein in connection with various exemplary embodiments thereof. Those skilled in the art will recognize, however, that the features and advantages of the present invention may be implemented in a variety of configurations. It is to be understood, therefore, that the embodiments described herein are presented by way of illustration, not of limitation.
Turning now to
The video camera(s) 104 may be any of a variety of cameras known in the art having analog or digital video output. Where multiple cameras 104 are coupled to the motion detector 102, mixtures of camera types and configurations may be provided. The camera(s) may have one or more camera operating characteristics including PTZ condition, focus, etc., that may be controlled by a user control interface 110 coupled thereto. The control interface 110 may provide user initiated control signals to the camera(s). In response to the control signals received at the camera, motors may be operated to change one or more of the camera's 104 operating characteristics.
Those skilled in the art will recognize a variety of configurations for the recording media 108 and video display 106. For example, the display 106 may be directly coupled to the camera, or may be coupled thereto through other devices, such as video matrix switches, video multiplexers, etc (not shown). The recording media 108 may be any fixed or removable machine-readable media configured for storing representations of the camera video output, and may be provided as a component of a video recorder, such as digital or analog tape recorders, write-once or re-writable video disk recorders, and/or DVD recorders. The recording media 108 may be coupled to the video display 106 for selective display of recorded or buffered video data.
Although the illustrated components are shown as separate components in the illustrated exemplary embodiment, those skilled in the art will recognize that one or more of the components may be combined into a single component. For example, the user control interface 110 may be presented as a graphical user interface on the video display 106. Also, in embodiments including multiple cameras 104, each camera may be associated with one or more motion detectors 102, video displays 106, recording media 108, and user interfaces 110, or the cameras may be configured to share one or more of these devices.
The devices 102, 104, 106, 108, 110 may be communicatively coupled by transmission media in a variety well known configurations. The transmission media may be any medium capable of transmitting signals between the particular devices, such as a coaxial cable, twisted pair wire, fiber optic cable, air, etc. Protocols for facilitating such communicative coupling are well known, and need not be further described herein.
In the illustrated exemplary embodiment, the video camera 104a is configured as a dome-type camera. Dome-type cameras are well known to those skilled in the art, and are often used in surveillance applications. A motion detector 102a consistent with the invention is fixedly mounted to the camera 104a. The motion detector 102a may include a lens 204, e.g. a wide-angle lens, and an associated imager and video processing logic. When changes associated with a moving object are detected, the detector 102a may provide an output via cable 206 to control the PTZ of the camera 104a to pan, tilt or zoom to capture the moving object with an optimum or desired resolution.
The video output of the camera 104a may be coupled via cable 208 to the display device 106a, e.g. a video monitor, for displaying the output. The recording device 202, e.g. a digital video recorder, may be coupled for receiving and recording the video output on a recording media via cable 210, e.g. in response to the detector output. The user control interface 110a may be coupled to the camera via cable 212 and may include a console including user input keys 214 and a display 216. A variety of user control interfaces are known. The user control interface may be configured for providing user-initiated control commands to the camera and/or the motion detector via cables 212 and 206. For example, a user may initiate control functions from the interface to manually control the PTZ of the camera, the on/off state of the camera 104a and/or motion detector 102a, and/or to download software updates to the camera and/or motion detector.
Turning now to
The lens 204a may be any of a variety of known lenses for directing an optical image onto the imager 300. In one embodiment, the lens 204a may be a conventional wide-angle lens to provide wide-angle viewing and detection of objects within a wide-angle field of view. As used herein, “wide-angle” when used in reference to a lens or detector shall refer to a lens or detector having a field of view greater than 50 degrees. This would include fisheye lenses that have a 180 degree field of view or greater.
In a manner well-known to those skilled in the art, the imager 300 converts the optical image from the lens 204a to an electrical representation of the image. The imager 300 may be any of a variety of imagers known in the art. However, since the resolution required for the imager to achieve acceptable motion detection is much less than the resolution required for object recognition, the imager 300 may be a low resolution, standard density, low-cost imager including, for example, a complimentary metal oxide semiconductor (CMOS) imager or a charge coupled device (CCD) imager. As used herein, “low resolution” when used in reference to an imager shall refer to an imager having a resolution of less than 380 vertical lines and “high resolution” when used in reference to an imager shall refer to an imager having a resolution of 480 vertical lines or greater.
The output of the imager 300 may be provided to the motion detect sequencer 302, which may include video processing logic for applying any of a number of well-known algorithms to continually monitor the video images for moving objects. Generally, the motion detect sequencer 302 buffers and monitors video frames for changes between successive frames. When, for example, the background is a fixed/motionless background, any changes from one video frame to the next represents a moving object.
From the location of the object within a video frame, the rate of change of the object in the frame, assumptions concerning object size, etc., the sequencer 300 may provide an output to the controller 306 representative of the location, speed and distance of the object relative to the detector 102. The controller may be configured or programmed for providing a PTZ control output on line 310 for controlling the PTZ of at least one associated camera in response to the output from the sequencer 302. For example, the controller may be configured to provide an output to the camera to cause the camera to pan, tilt, and/or zoom to capture the object with an optimum or desired resolution.
The controller 306 may be any type of electronic circuit capable of providing the speed and functionality required by the embodiments of the invention. For example, the controller may be configured as a microprocessor, field programmable gate array (FPGA), complex programmable logic device (CPLD), application specific integrated circuit (ASIC), or other similar device. In an embodiment where the controller is configured as a microprocessor, the processor could be a processor from the Pentium® family of processors made by Intel Corporation, or the family of processors made by Motorola. Software instructions for causing the controller/processor to provide an appropriate output may be stored on any machine-readable media capable of storing instructions adapted to be executed by the processor/controller. As used herein, the phrase “adapted to be executed by a processor” is meant to encompass instructions stored in a compressed and/or encrypted format, as well as instructions that have to be compiled or installed by an installer before being executed by the processor.
Although a variety of imagers 300 may be used in a detector consistent with the invention, use of a low resolution imager reduces image-related buffer memory sizes associated with the sequencer as well as processing speed required for image processing. These reductions in size and speed result in lower system cost. Lower cost lenses may also be used since some minor distortion does not significantly effect detection of most objects.
Also, the images processed by the detector 102 may not require viewing, e.g. on a video display. As such, motion detection in a system consistent with the invention may be performed on raw image data without the extensive processing required for human viewing. For example, a detector consistent with the invention may perform motion detection on the raw data without application of well-known visual perception algorithms conventionally applied to facilitate human visual perception on a display. As used herein “visual perception algorithms” shall refer to known algorithms for color space correction (Bayer to RGB to YUV, etc.), color purity correction, pixel to pixel sensitivity (gain and offset compensation), stuck pixel compensation, gamma correction and encoding to a standard such as CCIR-656, NTSC or PAL, etc. Omitting such algorithms allows for relatively simple detector electronics and lower system cost compared to the use of a common video camera with built-in motion detection or other known detector configurations. Although these advantages are most significantly achieved by omitting all of these algorithms, a system consistent with the invention may omit any one or more of these algorithms. Also, these advantages may also be achieved by applying such algorithms to only some limited portion of the raw image data.
Those skilled in the art will recognize that noise filtering algorithms, may still be required to prevent false motion detection in a system consistent with the invention, depending on system requirements and the lens and imager quality. Monitoring color space information from a color sensor may also be implemented in a detector consistent with the invention. However, a black and white imager may be used to achieve reasonable motion detection at very low cost.
Moreover, use of a detector and camera consistent with the invention provides significant advantages over use of high resolution imagers with built-in motion detection. The independent detector allows for un-interrupted motion detection coverage of an area of interest. The detector output can cause the camera to aim and zoom in on moving objects, while also commanding a recording device to capture segments of the camera video output, e.g. through a serial communication port or alarm inputs to the recording device. The detector may be configured to be compatible with most known PTZ cameras and recording devices, allowing system customization for diverse requirements of resolution, cost, zoom capabilities, etc. A system consistent with the invention also, for example, achieves better low light capability, better automatic gain control, full 30 frames per second (or more) update rate, and allows use of mature image enhancement algorithms for the video output. Moreover, in a system incorporating a camera with optical zoom, loss of resolution associated with digital zoom may be avoided.
As shown, the detector continually monitors 402 received images for changes indicative of a moving object. During this time the camera may be allowed to operate independently according to a default pattern or user-initiated scanning pattern, e.g. in a wide-angle scanning pattern. In a configuration where the detector is secured to a fixed location, the background of the detector's field of view may always be stationary. Running default patterns or jumping between any wide-angle or zoomed views with the video camera will not effect motion detection since the camera and detector operate independently.
When changes associated with a moving object are detected 404, the detector may provide an output to command 406 the camera to pan, tilt and/or zoom to capture moving object with an optimum or desired resolution. The detector output may also command 408 a recording device to capture frames or video clips of the moving object. In one embodiment, after targeting the object or area of activity for a predetermined amount of time, the detector may command the camera to move to another area of activity to capture another moving object. The detector may thus be configured to command the camera to independently track multiple moving objects by cycling between views of the targets, e.g. with optimized resolution, while simultaneously commanding a recording device to capture frames or video clips of each moving object. When no moving objects are detected 404, the camera may be left in its current operating mode or returned to a default mode 410, e.g., a wide-angle scanning pattern, to maximize value of the video content for live viewing or recording.
In one embodiment, the detector may be configured to command the recording device to record a varying number of images per second based on the nature of the video activity in terms of amount, frequency or other parametric measure. This may provide improved use of limited recording media for storage of the most desirable video for security or other applications. This spatial compression also allows the recording media to be optimized for use over a longer period of time, and can greatly increase the probability of recording the most important video content. In addition to a motion targeting application, a system consistent with the invention may be used for automatic tracking wherein the detector may lock on to a moving object and record the object as it moves around without regard to spatial compression.
Again, a system consistent with the invention may include a variety of detector and camera configurations. For example, a single detector may be used to target multiple cameras. In such an embodiment, different cameras may be commanded to track different moving objects and/or multiple moving objects while one or more recording devices are commanded to record video associated with the objects. Also, multiple detectors may be configured to coordinate with each other to control multiple cameras and to control the selection of video streams to recorders from the cameras and/or fixed cameras not controlled by the detector.
The field of view for each lens 510, 512, 514, 516 is identified by the angles FOV1, FOV2, FOV3 and FOV4, respectively As shown, the fields of view for the lenses may overlap, thus providing a continuous 360 degree view around the camera 510. Motion detection electronics 530, e.g. including a sequencer and controller as described above, may receive and time multiplex the respective outputs of the imagers 518, 520, 522, 524 and mask off overlapping fields of view areas. Dewarping compensation may be performed for each command to the camera 510, as opposed to on a real-time pixel-by-pixel basis, if desired to minimize cost by simplifying the electronics. Any of a variety of known dewarping algorithms may be used.
There is thus provided a system and method for monitoring moving objects in a video system. According to one aspect of the invention, the system includes at least one video camera, and at least one motion detector. The motion detector may include a lens having a field of view fixedly directed to an area of interest, and an imager for receiving an image through the lens and converting the image to video data. The motion detector may be configured to monitor the video data for movement of an object in the field of view and to provide a detector output in response to the movement of the object. The detector output may be configured to cause adjustment of at least one operating characteristic of the video camera to target the camera on the object. According to one embodiment, the lens may be a wide-angle lens and the detector output may control the pan, tilt and zoom of the camera to target the camera on the object.
According to another aspect of the invention there is provided a method of monitoring a moving object in a video system. The method includes providing at least one motion detector consistent with the invention, operating the motion detector to continually monitor video data to detect movement of the moving object; and providing an output from the motion detector in response to the movement to cause adjustment of at least one operating characteristic of a video camera to target the camera on the moving object.
According to yet another aspect of the invention there is provided a method of monitoring multiple moving objects in a video system. The method includes providing at least one motion detector consistent with the invention, operating the motion detector to continually monitor the video data to detect movement of the moving objects; providing a first output from the motion detector in response to the movement of a first one of the objects to cause adjustment of at least one operating characteristic of a video camera to target the camera on the first one of the moving objects; and providing a second output from the motion detector in response to the movement of a second one of the objects to cause adjustment of at least one operating characteristic of the video camera to target the camera on the second one of the moving objects. The detector may provide record commands to cause a recording media to record at least a portion of the video camera output while the camera is targeted on the first and second objects.
The embodiments that have been described herein, however, are but some of the several which utilize this invention and are set forth here by way of illustration but not of limitation. Many other embodiments, which will be readily apparent to those skilled in the art, may be made without departing materially from the spirit and scope of the invention.