The invention described and claimed hereinbelow is also described in German Patent Application DE 102005053148.2 filed on Nov. 4, 2005. This German Patent Application, whose subject matter is incorporated here by reference, provides the basis for a claim of priority of invention under 35 U.S.C. 119(a)-(d).
The present invention relates to a method for handling content information, a device, a network, a computer program, and a computer program product.
In security electronics, devices are known in the field of video monitoring which analyze a video signal and deliver data regarding the scenes, i.e., content information, for the video signals. Devices of this type can be operated in networks. The MPEG-7 or MPEG-4 international standards used to describe metadata are extremely complex, however.
Accordingly, it is an object of the present invention to provide a method for handling content information, a device for handling content information, a network, a computer program, and a computer program product in accordance with the invention.
In keeping with these objects and with others which will become apparent hereinafter, one feature of the present invention resides, briefly stated, in a method for handling content information, comprising the steps of carrying out the method using at least one device in a network; carrying out a video content analysis to provide content information for video data; and accumulating the content information in different hierarchical levels.
Another feature of the present invention resides, briefly stated, in a device for handling content information in a network, comprising means for accumulating content information for video data in different hierarchical levels.
Also proposed are a network which includes the above-mentioned device in accordance with the present invention, a computer program with program code means to carry out all steps of the inventive method, and a computer program product with program code means which are stored on a computer-readable data storage device to carry out all steps of the inventive method.
The inventive method for handling content information is carried out using at least one device in a network. Video content analysis is carried out to provide content information for video data, and this content information is accumulated in different hierarchical levels.
The inventive device is designed to accumulate content information for video data in different hierarchical levels.
The inventive network includes at least one inventive device and is designed to carry out a method according to the present invention.
The present invention also relates to a computer program with program code means to carry out all steps of an inventive method when the computer program is run on a computer or a related arithmetic unit, in particular in an inventive device or in an inventive network.
The present invention also relates to a computer program product with program code means, which are stored on a computer-readable data storage device, to carry out all steps of an inventive method when the computer program is run on a computer or a related arithmetic unit, in particular in an inventive device or in an inventive network.
The proposed invention is preferably suitable for a network-like video monitoring system which includes at least one device for evaluating video data or signals. This video monitoring system can also include at least one unit for depicting and storing video data and content information. Several devices and units of this type can be interconnected using the network according to the present invention. The evaluation can take place in a device designed as a camera or, in particular, in a device designed as a multimedia device.
With the present invention it is now possible to combine the content information in a structured manner, via accumulation in the hierarchical levels, and, particularly advantageously, to prepare and/or code it, in order to transfer, display, and/or store it in a network in which the at least one device is connected with at least one unit. Inventive devices can also be interconnected within the network.
The content information is accumulated for transfer and/or storage in different hierarchical levels or layers which contain the various information, content information in particular, and it is stored in these hierarchical levels.
With block-based accumulation, information about regions within images or blocks of video data in which something has changed can be made available. In other hierarchical levels, information about objects which have been discovered in at least one image within the video data, and their trajectories, can be made available. This is object-based accumulation. When accumulation is carried out for event detection, information about events that occur in one scene can be made available. An event of this type is, e.g., an object which goes or has gone from a first region into a second region within a sequence of images. In addition, at least one hierarchical level can be provided for overview information, which combines, in a compressed manner, the data on events, objects, and image changes over large periods of time, i.e., content information.
With the present invention, data, such as video data, and content information can be coded for purposes of transfer and storage such that the data from the various hierarchical levels can be read independently of each other. It is therefore possible to provide devices and/or units designed as receivers, which can interpret only a subset of information. It is also possible that existing receivers can continue to read data they are aware of, even when new hierarchical levels are added.
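The independent readability of the hierarchical levels can be illustrated with a minimal sketch. It assumes, purely for illustration, that each packet carries a numeric level identifier; the names `Packet`, `level`, `payload`, and `read_known_levels` are hypothetical and are not taken from the video content description format itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    level: int    # hierarchical level of the packet (hypothetical field)
    payload: str  # content information, treated as opaque here


def read_known_levels(stream, known_levels):
    """Return the payloads of the levels a receiver understands.

    Packets from any other level are skipped without being interpreted.
    """
    return [p.payload for p in stream if p.level in known_levels]


stream = [
    Packet(0, "block changes"),
    Packet(1, "object track"),
    Packet(3, "level added later"),
    Packet(2, "event: region crossing"),
]

# An existing receiver that is only aware of levels 0 and 1.
legacy_view = read_known_levels(stream, known_levels={0, 1})
```

In this sketch, the existing receiver continues to read the two levels it is aware of, and the packet from the newly added level does not disturb it.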
Using coding provided by the present invention, it is also possible to send data to several receivers simultaneously, every receiver having the capability to connect into a data stream made available in this manner, at any point in time. As a result of the coding, data, content information in particular, from lower hierarchical levels can also be compressed into overview information in higher hierarchical levels and therefore also accumulated. This overview information can be advantageously used to search for relevant events in order to limit a review of image archives of video data to short, relevant periods of time.
When carried out, the inventive method can be supported with a suitable data format, a video content description format (VCD). By using a data format of this type, a related protocol is used to transfer data, i.e., video data and/or content information.
The inventive device is designed to make content information available for video data when video content analysis is carried out, and to accumulate this content information in different hierarchical levels. With this device, it is therefore possible to carry out any steps of an inventive method. In one embodiment, this device can be designed to accumulate content information received via a network in higher hierarchical levels and to forward it immediately and/or with time delay to another device or a multimedia unit. According to this embodiment, it is also possible with this device to make hierarchical levels available without coupling to video content analyses.
Further advantages and embodiments of the present invention result from the description and the attached drawing.
It is understood that the features mentioned above and to be described below can be used not only in the combination described, but also in other combinations or alone without leaving the framework of the present invention.
The present invention is depicted schematically with reference to an exemplary embodiment in the drawing, and it is described in detail below with reference to the drawing.
Units 4, 8 designed as cameras, and device 12, which includes a camera, are designed to record video data. Devices 6, 10, 12, which are designed as evaluation units or which include an evaluation unit, are designed to evaluate the recorded video data. It is provided that content information which provides information about changes that occurred in the images depicted by the video data can be extracted from the video data and thereby made available. This content information is accumulated in different hierarchical levels. Certain bits of content information are accumulated in hierarchical levels provided therefor.
Content information from lower hierarchical levels is accumulated in lower resolutions in higher hierarchical levels, thereby providing overview-type summaries of the content information.
The video data and content information, which has been combined in the hierarchical levels in an accumulated or structured manner, is forwarded via network 2 to units 14, 16, which are designed as user interfaces, and to unit 18, which is designed as a memory. The content information and video data are stored in unit 18, which is designed as a memory, for any period of time.
Using units 14, 16, which are designed as user interfaces, users can access the video data and content information at any time. To obtain an initial, rough overview of the events depicted in the video data, the users can first view higher hierarchical levels, in which the content information has been accumulated in low resolution as overview information and, if it is expected that special events are depicted, users can view high-resolution, accumulated content information from lower hierarchical levels, to obtain an exact picture of the events.
With the process depicted in
Block-based analytical component 48 is suited for dissecting individual images 42 into blocks, an image 42 being subdivided into rows and columns contained in the individual blocks. Images 42 can be divided into blocks as small as necessary, and these blocks can include only one or a few image pixels. Temporally successive images 42 can be compared using block-based analytical component 48, preferably automatically via a computer-assisted comparison.
A motion map of images 42 of video data 44 is made available via block-based analytical component 48, and related content information 54 is therefore accumulated.
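A block-wise comparison of two temporally successive images, resulting in a motion map, can be sketched as follows. The sketch assumes grayscale images given as lists of pixel rows, and it assumes a summed absolute pixel difference compared against a threshold as the change criterion; neither the criterion nor the names `motion_map`, `block_size`, and `threshold` are specified in the description.

```python
def motion_map(previous, current, block_size, threshold):
    """Mark each block whose summed absolute pixel difference exceeds threshold."""
    rows, cols = len(current), len(current[0])
    result = []
    for top in range(0, rows, block_size):
        flags = []
        for left in range(0, cols, block_size):
            # Sum the differences over the pixels of this block; blocks at the
            # image border may be smaller than block_size.
            difference = sum(
                abs(current[y][x] - previous[y][x])
                for y in range(top, min(top + block_size, rows))
                for x in range(left, min(left + block_size, cols))
            )
            flags.append(difference > threshold)
        result.append(flags)
    return result


previous_image = [[0] * 4 for _ in range(4)]
current_image = [row[:] for row in previous_image]
current_image[0][0] = 50  # a single changed pixel in the upper left block

changed = motion_map(previous_image, current_image, block_size=2, threshold=10)
# changed == [[True, False], [False, False]]
```

With a block size of one, the comparison is carried out per pixel, which corresponds to the smallest blocks mentioned above.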
Object-based analytical component 50 is connected with block-based analytical component 48 so that information can be exchanged. Object-based analytical component 50 is designed to analyze content information 54 made available by block-based analytical component 48 such that individual objects and the identities of these objects are detected and made available as content information 56, which is depicted here as character boxes. Character boxes of this type provide information about the position, speed, width, height, etc., of detected or identified objects.
It is provided that object-based analytical component 50 exchanges content information 54, 56 with event-detecting or event-displaying analytical component 52. Event-detecting analytical component 52 provides content information 58 that provides information about special events, which document, e.g., loitering persons, lost children, or acts of crime.
Content information 54, 56, 58 made available within the framework of video content analysis 46 is accumulated in hierarchical levels in the inventive devices via a video content description 60. Content information 54, 56, 58 accumulated in the hierarchical levels and video data 44 are made available via a network 62 with units 64, 66, which are designed as computers and memories. Using unit 64, which is designed as a computer, a user can analyze the video data and content information 54, 56, 58 accumulated in the hierarchical levels. The user can also access unit 66 designed as a memory at any time; unit 66 makes available, via a device 68 designed as an event data base, prepared images 42 based on content information 54, 56, 58 accumulated in the hierarchical levels.
The format provided by video content description 60 is suited to and useful for providing results, i.e., content information 54, 56, 58, of an algorithm carried out within the framework of video content analysis 46 accumulated over the hierarchical levels, for transfer and storage. Users are therefore provided with an overview-type summary of events contained in video data 44. Content information 54, 56, 58 and, therefore, the depicted results, can be easily decoded and analyzed in unit 64, which is designed as a computer. Due to the structure provided via the hierarchical levels, it is possible to perform a rapid, intelligent search for certain events. Video content description 60 is designed to be flexible, and can be expanded easily with the addition of new features. In addition, using the format to code video data 44 makes it possible to reduce the memory space required for video data 44.
Content information 54, 56, 58 and, therefore, results of the algorithm provided via video content description 60 are defined and coded using the format provided by video content description 60 for transfer and storage in network 62. Content information 54, 56, 58 can be transferred independently of coded video data streams, and can be linked with video data 44 via a real-time transport protocol (RTP) using timing marks. Depending on the configuration of device 68, which is designed as a data base, content information 54, 56, 58 can be stored in context with video data 44, also with consideration for the hierarchical levels.
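Linking separately transferred content information with video data by way of timing marks can be sketched as follows. The sketch assumes that a video frame and its content information carry an identical integer timing mark; the function name `link_by_timestamp` and the example values are hypothetical, and the handling of real RTP timestamps is not modeled.

```python
def link_by_timestamp(video_frames, content_info):
    """Attach to each video frame the content information with the same timing mark.

    video_frames: list of (timing_mark, frame)
    content_info: list of (timing_mark, info), received independently
    """
    by_time = {}
    for mark, info in content_info:
        by_time.setdefault(mark, []).append(info)
    return [(mark, frame, by_time.get(mark, [])) for mark, frame in video_frames]


video_frames = [(9000, "frame-1"), (12600, "frame-2")]
content_info = [
    (9000, "object A detected"),
    (9000, "motion in block (0, 0)"),
    (16200, "alarm"),  # no matching video frame has arrived yet
]

linked = link_by_timestamp(video_frames, content_info)
```

Because the two streams are only joined by their timing marks, either stream can be stored or transferred without the other.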
A grouping of a data stream in up to sixteen or more hierarchical levels is supported by including overview data or information that is accumulated in higher hierarchical levels from content information 54, 56, 58 from lower hierarchical levels. Content information 54, 56, 58 from lower hierarchical levels is combined in low resolution in higher hierarchical levels of this type.
An output of different abstraction levels of the algorithm which provides video content analysis 46 can be described via video content description 60. Video content description 60 also makes it possible to perform an elegant and effective search, in which different temporal resolutions of content information 54, 56, 58 can be selected in an overview, so that a large quantity of data, such as video data 44, can be viewed in summarized form in a short period of time.
All packets 88, 90, 92 are shown in
For example, in lowest hierarchical level 80, complete descriptions of objects or events are coded as content information within packets 88, while, in middle hierarchical level 82, content information about objects or events within a particular packet 90 is accumulated per second, and it is accumulated per minute in packets 92 of highest hierarchical level 84.
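The accumulation per second and per minute can be illustrated with a short sketch. It assumes, as one possible accumulation, that the number of distinct objects detected per time interval is counted; the actual accumulation is defined by the algorithm that provides the video content analysis, and the names `accumulate` and `detections` are hypothetical.

```python
def accumulate(detections, interval):
    """Count the distinct objects detected in each time interval.

    detections: iterable of (time_in_seconds, object_id) from the lowest level
    interval:   interval length in seconds (1 for the middle level,
                60 for the highest level in this sketch)
    """
    buckets = {}
    for time, object_id in detections:
        buckets.setdefault(int(time // interval), set()).add(object_id)
    return {index: len(ids) for index, ids in sorted(buckets.items())}


detections = [(0.2, "a"), (0.7, "a"), (0.9, "b"), (1.5, "a"), (61.0, "c")]

per_second = accumulate(detections, interval=1)   # {0: 2, 1: 1, 61: 1}
per_minute = accumulate(detections, interval=60)  # {0: 2, 1: 1}
```

A decoder that reads only `per_minute` learns how many objects were detected in each minute without touching the complete descriptions of the lowest level.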
Even if only the highest hierarchical level 84 is viewed, a decoding device can recognize how many objects have been detected within a given minute. Accumulations 93 described are defined in this case by an algorithm which provides a video content analysis and/or a video content description. A format provided by the video content description is therefore designed as a packet-based protocol.
Accordingly, a beginning of a packet 88, 90, 92 must be known in order to analyze the content information. Packet tags which contain the related information are provided for this purpose. These packet tags can be designed as field-describing “headers”. It is therefore possible to easily analyze a stream of video data based on packet tag types. Some of the packet tags can have special significance. For example, a frame information tag can signal the start of a new frame within a data stream, which means that all subsequent packet tags of a particular hierarchical level 80, 82, 84 belong to this frame until the next frame information tag arrives.
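The role of the frame information tag can be sketched for the packet tags of a single hierarchical level. The tag names and the representation of a tag as a `(tag_type, value)` pair are assumptions made for illustration only; the sketch also assumes that tags arriving before the first frame information tag are discarded, as could occur when a receiver connects in the middle of a data stream.

```python
FRAME_INFO = "frame_info"  # hypothetical name for the frame information tag


def group_by_frame(tags):
    """Assign each packet tag to the frame opened by the preceding frame information tag."""
    frames = []
    current = None
    for tag_type, value in tags:
        if tag_type == FRAME_INFO:
            # A frame information tag starts a new frame.
            current = {"frame": value, "tags": []}
            frames.append(current)
        elif current is not None:
            # All subsequent tags belong to the current frame.
            current["tags"].append((tag_type, value))
        # Tags seen before any frame information tag cannot be attributed
        # to a frame and are dropped in this sketch.
    return frames


tags = [
    ("object", "seen before the first frame start"),
    (FRAME_INFO, 1),
    ("object", "A"),
    ("alarm", "x"),
    (FRAME_INFO, 2),
    ("object", "B"),
]

frames = group_by_frame(tags)
```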
In addition, a level information tag can be provided that indicates order 86 and time 94 within the higher hierarchical levels 82, 84, so an overview of a certain time interval can be provided. A level information tag of this type can be used to start a hierarchical search in a first overview at an uppermost hierarchical level and, to obtain more detailed content information, it can scan ahead in a descending manner across the middle hierarchical level 82 into lowest hierarchical level 80.
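A hierarchical search that starts at the uppermost hierarchical level and descends into the lower levels can be sketched as follows. The sketch assumes that each level is available as a list of `(start, end, event_count)` entries in seconds, ordered from the highest to the lowest level; this representation and the name `hierarchical_search` are hypothetical.

```python
def hierarchical_search(levels):
    """Descend from the highest to the lowest level, following only intervals with events.

    levels: list of levels, highest first; each level is a list of
            (start, end, event_count) entries.
    Returns the entries of the lowest level that contain events and lie
    within an interval that was relevant on every level above.
    """
    candidates = [(float("-inf"), float("inf"))]
    hits = []
    for level in levels:
        hits = [
            (start, end, count)
            for start, end, count in level
            if count > 0
            and any(lo <= start and end <= hi for lo, hi in candidates)
        ]
        candidates = [(start, end) for start, end, _ in hits]
    return hits


per_minute = [(0, 60, 0), (60, 120, 2)]
per_second = [(5, 6, 0), (61, 62, 1), (90, 91, 1), (100, 101, 0)]
per_frame = [(61.0, 61.04, 1), (61.04, 61.08, 0), (90.5, 90.54, 1)]

relevant = hierarchical_search([per_minute, per_second, per_frame])
# relevant == [(61.0, 61.04, 1), (90.5, 90.54, 1)]
```

For simplicity the sketch filters each level in full; an actual decoder could use the time stated in the level information tags to jump directly to the relevant part of a lower level.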
The protocol provided by the present invention serves to transfer and control data, which are provided according to a video content description. These data are content information for video data, which have been accumulated in different hierarchical levels. It is provided that packets of content information have tags which describe the content and temporally structured sequence of the packets. Tags of this type can also provide information about objects or events. New tags can be assigned to existing packets at any time, thereby ensuring that an expansion can be carried out for future applications.
By using the level information tags, the content information within different hierarchical levels can be structured and made available in different levels of resolution for a quick overview when the user is performing an analysis. As such, it is possible to carry out a quick, intelligent search through a large amount of data provided by the video content description.
It is also feasible to provide tags for different alarms, a motion map, and object property descriptions. These object property descriptions can be expanded with the addition of additional tags. An object description can include alarm identifiers, standstill periods, motion vectors, structural statistics, and shape properties, e.g., character boxes and outlines or contours. For example, the shape and instant when an object was originally discovered can be transferred retroactively, e.g., if an alarm has been triggered.
It will be understood that each of the elements described above, or two or more together, may also find a useful application in other types of methods and constructions differing from the type described above.
While the invention has been illustrated and described as embodied in a method for handling content information, etc., it is not intended to be limited to the details shown, since various modifications and structural changes may be made without departing in any way from the spirit of the present invention.
Without further analysis, the foregoing will so fully reveal the gist of the present invention that others can, by applying current knowledge, readily adapt it for various applications without omitting features that, from the standpoint of prior art, fairly constitute essential characteristics of the generic or specific aspects of this invention.
Number | Date | Country | Kind |
---|---|---|---|
10 2005 053 148.2 | Nov 2005 | DE | national |