The invention relates to a method and an apparatus for visualizing data sets, and more specifically to a method and an apparatus for visualizing data sets that are subject to a temporal process, i.e. that partly uncompleted and/or continuously updated.
Digital motion pictures, also referred to as digital image sequences, often come along with metadata information. Preferably, metadata information is available for every single frame of the digital image sequence. Metadata is typically generated either by the camera itself or, more likely, by a real-time or non-real-time post processing algorithm. Metadata comprises a plurality of information, e.g. the noise level, image contrast or, in case of more sophisticated algorithms, a number of objects, such as faces or cars or the like, detected within the respective frame. In professional post production environments a human reviewer working on multiple data sets per movie or even with multiple movies having a plurality of data sets needs to inspect the metadata information for certain quality criteria, e.g. for reviewing defects that have been identified in a previous automatic detection process. Due to the huge amount of information the human reviewer has a vital interest in optimizing the time needed to inspect the metadata.
Common techniques for displaying such large amount of data in a single plot, e.g. in a graph or bar plot, typically use downsampling in order to match the amount of data that is desired to be displayed to the available pixels or dots of a display unit. A display unit within the meaning of the term is, for example, a monitor especially used for display of the metadata or a graphical user interface (typically referred to as a GUI), e.g. a window that is used for that purpose. The available resolution is defined by the monitor itself, i.e. by the hardware resolution of the respective monitor or by a number of pixels inside a GUI-window that is used for display of the metadata.
In a recent patent application EP11305111 it has been proposed to split the available metadata into a plurality of clusters and to determine a representative value for each cluster by applying a predetermined function to the metadata elements of the respective cluster. The number of clusters depends on the resolution of the display unit. For each cluster only the representative value is displayed. The predetermined functions are chosen such that outliers in the metadata are not omitted or diminished, e.g. due to averaging, but preserved and well visible for the reviewer.
The above described approach works on an existing metadata sequence, i.e. it is assumed that the complete sequence of metadata is available. However, especially for reviewing large sequences of metadata in the course of movie restoration, it would be desirable to start the reviewing process, and hence the clustering of metadata, already before the complete sequence of metadata is available. Of course, the same problem arises for visualization of any large data set that is partly uncompleted and/or continuously updated.
It is thus an object of the present invention to propose a solution for visualizing data sets that are subject to a temporal process, i.e. that are partly uncompleted and/or continuously updated.
According to the invention, a method for visualizing a data set on a display, the data set comprising a defined number of elements, comprises the steps of:
Advantageously, an apparatus for visualizing a data set on a display is adapted to perform the above method according to the invention. For this purpose the apparatus has an input for receiving the elements of the data set, a calculator for determining the number of displayable clusters, a processor for retrieving the elements of the data set, for clustering the retrieved elements into the determined number of clusters using placeholders for not yet retrieved elements of the data set, and for assigning a value to each cluster, a graphics block for generating a display signal from the data provided by the processor, and an output for supplying the display signal to a display.
The invention solves the problem of quickly reviewing large data sets of defined size, which are partly uncompleted and/or continuously updated, i.e. that are subject to a temporal process. The invention allows the data set to be inspected as early as possible, i.e. without waiting for the data set to be complete. For the case that the elements of the data set are generated exactly once within a temporal process of finite duration and do not change afterwards, the invention provides progress information. For the case that the elements of the data set are updated partly, i.e. the elements change continuously, the invention delivers a continuous view on the current data set.
Preferably, clusters that include placeholders are marked for visualization. Such marked clusters are then highlighted when they are visualized, e.g. by color, shape, texture, or symbols. In this way an operator is immediately aware that certain clusters do not yet necessarily have their final value and need to be considered with care.
Favorably, a value is assigned to a cluster by applying a function to the elements of the cluster and assigning a result of the applied function to the cluster. This allows to assign a representative value to each cluster without the need to display too many details of the data set.
Advantageously, the number of displayable clusters is performed by comparing a resolution of the display with a number of pixels needed per cluster. This allows to calculate the number of displayable clusters in a simple manner by dividing the resolution by the number of pixels per cluster.
Preferably, the visualization of the values of the cluster on the display is initiated only when a defined first minimum number of elements of the data set has been retrieved. This ensures that display of the data set starts with a meaningful number of clusters that have their final values. As is depends on the user perception which number is considered to be meaningful, the defined first minimum number is favorably settable by the user.
Advantageously, the updating of the clusters, the assigned values and the visualization on the display is initiated only when a defined second minimum number of elements of the data set has been modified or of further elements has been retrieved. Preferably, the defined second minimum number is settable by a user. This avoids too frequent changes of the display, which could otherwise disturb a review process performed by an operator.
For a better understanding the invention shall now be explained in more detail in the following description with reference to the figures. It is understood that the invention is not limited to this exemplary embodiment and that specified features can also expediently be combined and/or modified without departing from the scope of the present invention as defined in the appended claims. In the figures:
In the following the invention is explained with reference to metadata associated to a digital video. Of course, the invention is likewise applicable to other types of data sets.
Metadata information, i.e. a metadata vector M of a length S is input to the apparatus 6. The metadata vector M is assigned to a digital image sequence, e.g. a digital video. The length S of the metadata vector M means that the vector comprises a number of S metadata elements, e.g. a set of metadata comprising a number of S metadata elements. A metadata element may be a single value, e.g. a contrast of a frame, or a set of data, e.g. a contrast and a brightness value. The apparatus 6 is configured to process the metadata vector M for visualization on the output unit 2. The metadata vector M is visualized as a plurality of bars 8, each bar 8 having a four horizontal pixels (two dark pixels and two bright pixels).
The bars 8 displayed in
C=CEIL(S/B)=3. The calculation of the height of the bars is determined by the following operation on the metadata vector M: G=(MAX([122]), MAX([031]), MAX([701]),MAX ([1]))=[2371], where G is the resulting display vector.
The method described above with reference to
Coming back to the exemplary metadata vector M that is used for
An apparatus 6 according to the invention for visualizing a data set is schematically illustrated in some more detail in
Number | Date | Country | Kind |
---|---|---|---|
11306590.8 | Nov 2011 | EP | regional |