This application claims priority from Korean Patent Application No. 10-2015-0007449, filed on Jan. 15, 2015, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
1. Field
Apparatuses and methods consistent with exemplary embodiments relate to simultaneously providing a multi-video summary about a certain object via images captured by a plurality of imaging devices.
2. Description of the Related Art
A network video surveillance system allows a user to monitor various environments such as roads, buildings, factories, etc. for certain purposes, such as security reasons, via video recording apparatuses such as network cameras.
The network video surveillance system may receive, via a transmission network, a video captured by a monitoring camera. A user may directly monitor security circumstances through a monitor or store the video in a digital video recorder (DVR), network video recorder (NVR), or VMS format and search through the stored video later.
However, when such a network video surveillance system is used, a user may not be able to directly monitor required areas and analyze captured videos due to use of an increased number of monitoring cameras. Also, a time for searching through stored videos may rapidly increase depending on the number of monitoring cameras. Accordingly, there is a need for a new search technology, and thus, a video synopsis/summary technology has been developed.
The video summary technology may be used to summarize a long video and provide a compressed video so that a user may efficiently search a video.
When it is difficult to recognize a person's face in a video provided by an imaging apparatus such as a monitoring camera, videos captured by nearby cameras have to be manually examined to identify the victim's face.
According to exemplary embodiments of the inventive concept, during a process of identifying the person's face in a video provided by a monitoring camera or the like, a video summary generated based on videos captured by nearby cameras in an identical time range are simultaneously used. Search correctness and efficiency may be improved by simultaneously providing a video summary in which a captured object is disclosed from various angles.
Additional aspects will be set forth in part in the description which follows and, in part, will be apparent from the description, or may be learned by practice of the presented exemplary embodiments.
According to one or more exemplary embodiments, there is provided a method of providing a multi-video summary which may include: setting up a plurality of imaging devices to monitor, in different field of views (FOVs), a plurality of areas including at least one overlapping area commonly monitored by the imaging devices, and capture a plurality of videos from the areas, respectively; storing overlapping information in which the imaging devices and the overlapping area are mapped to one another to indicate that objects respectively detected in the overlapping area by the imaging devices are identical to one another; and providing at least two video summaries which are generated from the videos and display at least two images of the identical object in the overlapping area captured by at least two imaging devices among the imaging devices in the different FOVs, based on the overlapping information.
The providing the two video summaries may include displaying, on a single screen of a display, the two video summaries which respectively show at least one image of the object detected in the overlapping area, the two video summaries being generated from at least two videos captured by the two imaging devices. When the two video summaries are displayed on the single screen, the two video summaries may be synchronized in time to indicate that an image of the object shown in a first video summary and an image of the object shown in a second video summary are captured at a same time.
The above method may further include: displaying, on a single screen of a display, a first video summary which shows a plurality of images of the object detected in the overlapping area by a first imaging device with a first FOV; and displaying at least one second video summary including at least one second image of the object detected in the overlapping area by at least one second imaging device with at least one second FOV, in response to selecting at least one image of the first images or indication of the image of the first images. The selected image of the first images and the second image may have been captured by the first imaging device and the second imaging device, respectively, at the same time.
The above method may further include indicating presence of the second image of the object captured by the second imaging device in a storage when the first video summary is displayed. The presence of the second image may be indicated by at least one of information about a number of the second imaging device that captures a video of the overlapping area and information about identifiers of the second imaging device that captures the video of the overlapping area.
The above method may also further include, in response to selecting an image included in a first video summary of the two video summaries, displaying an original video among the videos, corresponding to the image, captured by a first imaging device among the imaging devices.
According to one or more exemplary embodiments, there is provided a multi-video summary providing apparatus which may include: an FOV information storage configured to store information about at least one overlapping area commonly monitored by a plurality of imaging devices among a plurality of areas monitored by the imaging devices having different FOVs about the overlapping area; a mapping unit configured to store overlapping information in which the imaging devices and the overlapping area are mapped to one another to indicate that objects respectively detected in the overlapping area by the imaging devices are identical to one another; and a multi-video summary provider configured to provide at least two video summaries which are generated from the videos and display at least two images of the identical object in the overlapping area captured by at least two imaging devices among the imaging devices at the different FOVs, based on the overlapping information.
The multi-video summary provider may be configured to display, on a single screen of a display, a first video summary which shows a plurality of images of the object detected in the overlapping area by a first imaging device with a first FOV, and at least one second video summary including at least one second image of the object detected in the overlapping area by at least one second imaging device with at least one second FOV, in response to at least one image of the first images or indication of the image of the first images.
These and/or other aspects will become apparent and more readily appreciated from the following description of the exemplary embodiments, taken in conjunction with the accompanying drawings, in which:
As the inventive concept allows for various changes and numerous exemplary embodiments, particular exemplary embodiments will be illustrated in the drawings and described in the written description. The effect and features of the inventive concept and the method of realizing the effect and the features will be clear with reference to the exemplary embodiments described below with reference to the drawings. However, the inventive concept may be embodied in various forms and should not be construed as being limited to the exemplary embodiments presented herein. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items. Expressions such as “at least one of,” when preceding a list of elements, modify the entire list of elements and do not modify the individual elements of the list.
Reference will now be made in detail to the exemplary embodiments which are illustrated in the accompanying drawings, wherein like reference numerals refer to like elements throughout.
Referring to
The multi-video summary providing apparatus 200 may store input video received wirelessly or via wires from a network camera or an external imaging device, and may reproduce the stored input video. When reproducing the input video, the multi-video summary providing apparatus 200 may simultaneously provide a video summary of the reproduced input video according to a user's request.
The multi-video summary providing apparatus 200 may receive videos from a plurality of network cameras through a plurality of channels and display the videos on a single screen of a display device (not shown). The multi-video summary providing apparatus 200 may perform wireless and wired communication with the plurality of network cameras. Also, respective video summaries of the plurality of videos may be displayed on the screen.
According to an exemplary embodiment, the multi-video summary providing apparatus 200 may provide a plurality of video summaries based on videos with respect to a single object captured by using a plurality of imaging devices. To do so, the multi-video summary providing apparatus 200 may include a field of view (FOV) information storage 210, a mapping unit 220 and a multi-video summary provider 230.
When respective monitoring areas monitored by a plurality of imaging devices 110, 120, 130, 140 and 150 are partially overlapping, the FOV information storage 210 stores overlapping area information S110 and S120. The FOV information storage 210 may detect respective FOVs of the imaging devices 110, 120, 130, 140 and 150 to establish relationships between views of the same object in imaging devices 110, 120, 130, 140 and 150, and thus detect the respective monitoring areas of the imaging devices 110, 120, 130, 140 and 150. The relationships between the views of the same object are established by matching geometric and appearance features.
Referring to
Referring to
The mapping unit 220 may map information which indicates that an area captured by the second camera 120 partially overlaps an area captured by the third camera 130 in the first overlapping area S110. Also, the mapping unit 220 may map information which indicates that an area captured by the third camera 130 partially overlaps an area captured by the fourth camera 140 in the second overlapping area S120.
According to an exemplary embodiment, when the multi-video summary providing apparatus 200 detects movement of a person 100 in the second overlapping area S120, the multi-video summary providing apparatus 200 may use the information of the mapping unit 220 as shown in the exemplary embodiment of
Referring to
In this case, based on the information of the mapping unit 220, the multi-video summary providing apparatus 200 may recognize that the object ID1 detected by the third camera 130 at the calibration (10,100) at the time t1 is the same as the object ID27 detected by the fourth camera 140 at the calibration (20,100) at the time t1.
In this case, the multi-video summary providing apparatus 200 may map the object ID1 detected by the third camera 130 and the object ID27 detected by the fourth camera 140 as an identical object. The multi-video summary providing apparatus 200 may use mapping of objects as mapping information.
Also, the multi-video summary providing apparatus 200 may map the calibration (10,100) detected by the third camera 130 and the calibration (20,100) detected by the fourth camera 140 as an identical calibration. The multi-video summary providing apparatus 200 use mapping of calibration information as mapping information.
The multi-video summary providing apparatus 200 may also map and store calibration information in advance based on information of respective FOVs of the third and fourth cameras 130 and 140.
According to an exemplary embodiment, the mapping unit 220 may additionally map and store time stamp information. For example, when movement of a person ID1 (corresponds to the person 100 of
(t3, second overlapping area, third camera (10, 90), fourth camera (20, 90))
When movement of a person is detected by the third camera at coordinates (10,90) at time t3 based on the mapping information according to the above exemplary embodiment, the multi-video summary providing apparatus 200 may use information of a video of coordinates (20,90) captured by the fourth camera at the time t3 based on the mapping information.
According to an exemplary embodiment, the mapping unit 220 may map the person 100 that is set to an identification code ID_1 in the third camera 130 and the person 100 that is set to an identification code ID_27 in the fourth camera 140, and recognize that ID_1 and ID_27 indicate an identical object.
According to an exemplary embodiment, based on the mapping information of the mapping unit 220, the multi-video summary provider 230 may simultaneously display video summaries that are generated by using videos of an overlapping area captured by different imaging devices in different FOVs. Also, based on the mapping information, the multi-video summary provider 230 may further include a user interface (not shown) that allows the user to select a video summary of an imaging device (e.g., the fourth camera) from among the plurality of imaging devices that has captured the same object as another imaging device (e.g., the third camera) at the time t3.
In particular, based on the time stamp information and videos of the second overlapping area S120 captured by the third and fourth cameras 130 and 140, the multi-video summary provider 230 may be configured to simultaneously or sequentially reproduce a video summary of the second overlapping area S120 generated by the third camera 130 and a video summary of the second overlapping area S120 generated by the fourth camera 140. In this case, a user interface for selection may be provided to the user.
According to an exemplary embodiment, when providing a video summary, the multi-video summary provider 230 may be configured to simultaneously display videos of an overlapping area captured by a plurality of cameras, as in exemplary embodiments of
According to another exemplary embodiment, when providing a video summary, the multi-video summary provider 230 may be configured to display a video summary of a video captured by a predetermined camera, and display, on the video summary, notification messages S710, S711 and S712 which indicate that a video captured from an overlapping area by another camera exists, as shown in
According to another exemplary embodiment, when providing a video summary, the multi-video summary provider 230 may be configured to display only a video summary of a video captured by a predetermined camera, and display, on the video summary, notification messages S810, S811 and S812 which indicate that a video captured from the overlapping area by another camera exists, as shown in
In this case, the notification messages S710, S711, S712, S810, S811, and S812 may include at least one of information S710, S711, and S712 about the number of other cameras that capture videos of the overlapping area, and information S810, S811 and S812 about identifiers (IDs) of the other cameras that capture the videos of the overlapping area.
Exemplary embodiments of displaying a multi-video summary will be described as below with reference to
Referring to
According to exemplary embodiments of
According to exemplary embodiments of
Referring to
Referring to
In particular, the person 330 captured at the time t3 by the third camera 130 (of
A person 331 captured by the third camera 130 (of
Likewise, a person 332 captured by the third camera 130 (of
Video summaries of different imaging devices may be simultaneously displayed by using the person 332 captured by the third camera 130 (of
The notification messages S710, S711 and S712 may be shown in talk bubbles or numbers around the videos. The user may see the numbers in the talk bubbles, and understand that the videos have been captured by other cameras at a corresponding time.
For example, when the number “2” is in a talk bubble, two cameras may be in an overlapping area. When the number “2” is clicked as shown in
According to another exemplary embodiment, referring to
In this case, the three videos may be displayed by using various methods, for example, vertically displaying the three videos, horizontally displaying the three videos, displaying the three videos in a scroll view, 3-dimensionally displaying the three videos as illustrated in
As described above with reference to
A multi-video summary providing apparatus may receive a video from the third camera 130, and display the received video on a single screen. Also, a video summary of the received video may be displayed on the screen, separately or together with the original video.
In this case, the multi-video summary providing apparatus may display as in
Although
The first camera first captures a person 410 with identification code ‘ID#1’ and sequentially captures a person 420 with identification code ‘ID#2.’
A video summary S400 written at time t1 is an exemplary embodiment in which an object is not detected from an overlapping area S430. In this case, the video summary S400 written by the first camera simultaneously displays a first object (ID#1) 410 and a second object (ID#27) 420 on a single screen.
A video summary S410 written at time t3 is an exemplary embodiment in which an object is detected from the overlapping area S430. In this case, the video summary S410 captured by the first camera may simultaneously display the first object (ID#1) 410 and the second object (ID#27) 420 on a single screen. Also, a video S440 of the second object 420 in the overlapping area S430 captured by the second camera may be displayed.
According to a timeline, at time t3, the first object 410 captured by the first camera, the second object 420 (460) captured by the first camera, and the second object 420 (470) captured by the second camera may be simultaneously displayed on a single screen.
According to another exemplary embodiment, the video summary S410 written at the time t3 may display the first object (ID#1) 410, the second object (ID#27) 420, and a notification message 420 indicating that a video captured by another camera exists nearby the second object 420.
Referring to the multi-video summary of
Also, a video summary 1411 based on a video of the person 1410 captured by the third camera nearby the second camera in the first overlapping area S1410 in a different FOV from the second camera is displayed with a notification message 1412 indicating that the video is captured by the third camera.
Also, a video summary 1413 based on a video of the person 1410 captured by the seventh camera nearby the second camera in the first overlapping area S1410 in a different FOV from the second camera is displayed with a notification message 1415 indicating that the video is captured by the seventh camera.
In this case, the videos respectively captured by the third camera and the seventh camera may be sequentially displayed with a time interval. Alternatively, the videos may be shown as a panorama. Alternatively, a video, in which a feature to be identified is highlighted the most, may be enlarged. Examples of the feature to be identified may include fire, eyes, a nose, and a face.
According to another exemplary embodiment, from among a plurality of imaging devices that capture the first overlapping area S1410, at least one of the second, third, and seventh cameras that have the same FOV may generate a video summary at an identical time range and display the video summary together when the user requests. Also, at least one of the second, third, and seventh cameras may reproduce a corresponding part of an original video using synchronized time stamp information when an object in the generated video summary is clicked, simultaneously with another at least one video clip for the overlapping area at the identical time range. The other at least one video clip is recorded and derived from the imaging devices with the overlapping FOV, which have generated a video summary for the overlapping area.
A second overlapping area S1420 shows a video summary based on a video of a person 1420 captured by the second camera. In this case, the second overlapping area S1420 may display a video summary 1421 based on a video of the person 1420 captured by the first camera nearby the second camera in a different FOV from the second camera in the second overlapping area S1420 and/or a notification message 1422 indicating that the video is captured by the first camera.
The multi-video summary providing apparatus 500 may be provided along with at least one camera installed at necessary locations in a monitoring area, for example, front, back, left, right, or inside. Thereafter, calibration is performed on the plurality of cameras to obtain geometry information and calculate an overlapping area between videos of the cameras.
Thereafter, video metadata S510 is generated by analyzing video data (or video stream) S511 from the cameras by using a multi-video summary unit 520. In this case, the video metadata S510 may include information such as moving object detection and detected time, moving path of an object, color/shape of the object, and changes in the object.
When a video summary request is input, the multi-video summary providing apparatus 500 may generate a video summary script S512 to generate a video summary. The video summary script S512 includes, for example, a location of an object and time information.
According to an exemplary embodiment, the multi-video summary providing apparatus 500 may, first, generate a video summary script about a predetermined camera, and then add a video summary script so as to additionally use a portion of a video captured by another camera at a different angle. The portion of the video covers an overlapping area where an FOV of the other camera overlaps that of the predetermined camera.
The multi-video summary providing apparatus 500 may generate a video summary script file about the overlapping area by using time stamp information, and render a video. In the case of an object detected by the cameras, time stamp information is stored for simultaneous reproduction, and then a final video is output.
A multi-video summary providing apparatus may perform calibration on a plurality of cameras installed at a plurality of locations, respectively, in a monitoring area, and thus obtain geometry information and information about an overlapping area between videos of the cameras. Through this process, the multi-video summary providing apparatus may determine an area monitored by each of the cameras, and detect an overlapping area when the areas monitored by the cameras overlap one another (S1710).
Next, the detected overlapping areas may be mapped to information about at least two imaging devices that capture the overlapping areas (S1720). Thereafter, when providing a video summary of each of the overlapping areas, as shown in
A multi-video summary providing apparatus may support a user interface that allows a user to select a multi-video summary or a single video summary generated based on a video captured by a predetermined camera (S1810). In this case, the user interface may be provided to the user via a display screen.
When the user selects the multi-video summary (S1810), the multi-video summary providing apparatus enters a multi-video summary display mode, and additionally provides a mode selection interface to the user (S1820).
In this case, the mode selection interface may provide a video display mode S1832, a data display mode S1834, and a video and data display mode S1836.
In the video display mode S1832, as in the exemplary embodiments of
In the data display mode S1834, as in the exemplary embodiments of
In the video and data display mode S1834, as in the exemplary embodiment of
In the above-described exemplary embodiments, video summaries are generated and stored by respective cameras before being provided to a display device (not shown) by the multi-video summary providing apparatus. However, the inventive concept is not being limited thereto. The video summaries may be generated and stored by a separate device such as a DVR or NVR having a storage (not shown).
The operations or steps of the methods or algorithms described above can be embodied as computer readable codes on a computer readable recording medium, or to be transmitted through a transmission medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), compact disc (CD)-ROM, digital versatile disc (DVD), magnetic tape, floppy disk, and optical data storage device, not being limited thereto. The transmission medium can include carrier waves transmitted through the Internet or various types of communication channel. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
At least one of the components, elements, modules or units represented by a block as illustrated in
It should be understood that exemplary embodiments described herein should be considered in a descriptive sense only and not for purposes of limitation. Descriptions of features or aspects within each exemplary embodiment should typically be considered as available for other similar features or aspects in other exemplary embodiments.
While one or more exemplary embodiments have been described with reference to the figures, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the inventive concept as defined by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2015-0007449 | Jan 2015 | KR | national |