1. Field of the invention
This invention relates to automated or part automated quality control of audio-visual media.
2. Description of the Related Art
There is a need to ensure that the quality of audio-visual media meets appropriate criteria for its commercial sale and distribution. The criteria are usually expressed in the form of tests that comprise measurements resulting in value that have to fall within thresholds. There are many automated tools that are able to perform computations on the media in order to generate the value for these measurements. These automated measurements are usually calibrated by means of subjective testing or by correlating the results against known good and bad material.
For any given measurement, a particular operation will set its threshold on the value of that measurement to a level that is appropriate for that operation. Operations that make or distribute quality movies are likely to have thresholds that are significantly higher than those that are set—for example—by operations that distribute user generated content on the Internet. It is common for higher quality content to be re-purposed for delivery through lower quality delivery channels. For these and other reasons, it would be advantageous if methods and apparatus for quality assessment of audio-visual media could be provided which offer both the flexibility to meet different business needs and the consistency to set standards of quality acceptable across a range of—possibly conflicting—business interests.
There exist a variety of automated quality assessment tools that measure aspects of audio or video quality. However, despite extensive testing by the measurement tool vendors, there still remains significant margin for error in the values resulting from some measurements. These errors may result in:
While the disadvantages of any specific automated measurement tool can usually be overcome through operator involvement, it is not viable economically to involve operators at each stage of an audio-video delivery chain or within each involved business.
It is an object of embodiments of the present information to overcome or ameliorate some or all of these disadvantages.
Accordingly, the present invention includes in one aspect in apparatus for monitoring the quality of an audio-video process chain comprising a plurality of processes operating in stages on audio-video content, the apparatus comprising a plurality of measurement tool interfaces for connection with measurement tools to receive measurements; and a confidence threshold processor for converting each measurement to a threshold confidence; wherein the confidence threshold processor receives the measurement, a threshold value of acceptability; a latitude of acceptability relating to the threshold; and an error function relating to the reliability of the measurement; the confidence threshold processor being configured such that the threshold confidence extends linearly between a value denoting certainty of acceptability and a value denoting certainty of unacceptability. A graphical user interface may include a linear graphical representation of each threshold confidence.
In another aspect, the present invention includes in a method of controlling an audio-video process chain comprising a plurality of processes operating in stages on audio-video content, the method comprising the steps of: taking in respective measurement tools a plurality of measurements; in a processor, converting each measurement to a threshold confidence by normalizing where required and comparing with a threshold value of acceptability taking into account the latitude of acceptability associated with that threshold and an error function relating to the reliability of the measurement, such that the threshold confidence extends linearly between a value denoting certainty of acceptability and a value denoting certainty of unacceptability; defining pass and fail ranges of values of the threshold confidence; storing for delivery or further processing audio-video content having a threshold confidence within the pass range; and rejecting audio-video content having a threshold confidence within the fail range. A warn range of values of the threshold confidence may be defined between and contiguous with the pass and fail ranges and content having a threshold confidence within the warn range may be diverted for reworking.
In another aspect, the present invention includes a method of monitoring the quality of an audio-video process chain comprising a plurality of processes operating in stages on audio-video content, the method comprising the steps of taking a plurality of measurements Mg,n,s where #g denotes the measurement group, #n denotes the measurement tool employed in taking the measurement and #s denotes the processing stage at which the measurement is taken; converting each measurement Mg,n,s to a threshold confidence TCg,n,s by normalizing where required and comparing with a threshold value of acceptability taking into account the latitude Lg,s of acceptability and an error function EBg,n,s relating to the reliability of the measurement Mg,n,s such that the threshold confidence TCg,n,s extends linearly between a value denoting certainty of acceptability and a value denoting certainty of unacceptability.
Preferably the method further comprises providing at a first process stage for an operator viewing the audio-video content to input an Override Confidence OCg,n,s to override the threshold confidence TCg,n,s; and, at a second subsequent process stage, electing automatically whether to override the threshold confidence TCg,n,s with the previous Override Confidence OCg,n,s based on the second stage measurement Mg,n,s.
In the case of an operator inputting an Override Confidence OCg,n,s, the overridden threshold confidence TCg,n,s and its associated normalised measurement value Ng,n,s may be made available in subsequent process stages.
Pass and fail ranges of values of the threshold confidence TCg,n,s may be defined. The method may suitably comprise the steps of approving and/or storing for delivery or further processing audio-video content having a threshold confidence TCg,n,s within the pass range and rejecting audio-video content having a threshold confidence TCg,n,s within the fail range.
Pass, warn and fail ranges of values of the threshold confidence TCg,n,s may be defined. The method may suitably comprise the steps of approving and/or storing for delivery or further processing audio-video content having a threshold confidence TCg,n,s within the pass range; diverting for reworking audio-video content having a threshold confidence TCg,n,s within the warn range and rejecting audio-video content having a threshold confidence TCg,n,s within the fail range.
A collection of audio-video content may be ranked in accordance with the threshold confidence TCg,n,s and reworking resource is allocated to audio-video content within the collection in dependence upon the ranking.
TCg,n,s is represented in the embodiment of this invention by a number between 0 and some maximal value (for example between TCmin=0 and TCmax=100%). The value 0 represents a certainty that the measured value represents an error and TCmax represents certainty that the measured value represents a pass. Calculation of values of TC are system dependent but can be represented by the following representation:
TCg,n,s=Cg(Ng,n,s, EBg,n,s, TVg,s, Lg,s)
where Ng,n,s is a normalised measurement value, TVg,s is a threshold value denoting pass/fail, EBg,n,s is an error bar function that denotes the statistical uncertainty of that measurement type and Cg is a confidence mapping function that converts normalised measurement values of the given measurement type to a linearly, monotonic threshold confidence value based on that measurement's properties.
The latitude Lg,s denotes the tolerance of a measurement to the threshold and may vary between process stages. Threshold confidence TCg,n,s values for all tools in the same measurement group may be aggregated at each process stage. The result is an overall threshold confidence for that group. An example of this is the application of different loudness tools to a piece of music that has undergone processing. The answer the business question “Is the music too loud and has the music always been too loud?” can be answered by inspecting the threshold confidence for a group of loudness measurements spanning the processes the music underwent in a facility.
The present invention includes in another aspect in a method of monitoring the quality of an audio-video process chain comprising a plurality of processes operating in stages on audio-video content, the method comprising the steps of taking a plurality of measurements comprising at least a first measurement at a first process stage and at least a second measurement at a second process stage measurement; generating for each measure an error function which is monotonically related to the expected error of the measurement and combining the error function with the measure to form a measurement range; providing at a first process stage for an operator viewing the audio-video content to override the first measure with a first operator override; and, at a second subsequent process stage, automatically overriding the second measurement with the first operator override if the second measurement range overlaps the first measurement or overlaps the first operator override.
The present invention includes in still another aspect in apparatus for monitoring the quality of an audio-video process chain comprising a plurality of processes operating in stages on audio-video content, the apparatus comprising a plurality of measurement tool interfaces for connection with measurement tools to receive measurements Mg,n,s where #g denotes the measurement group, #n denotes the measurement tool employed in taking the measurement and #s denotes the processing stage at which the measurement is taken; and means for converting each measurement Mg,n,s to a threshold confidence TCg,n,s by normalizing where required and comparing with a threshold value of acceptability taking into account the latitude Lg,s of acceptability and an error function EBg,n,s relating to the reliability of the measurement Mg,n,s such that the threshold confidence TCg,n,s extends linearly between a value denoting certainty of acceptability and a value denoting certainty of unacceptability.
The apparatus may further comprising an operator override input enabling an operator viewing the audio-video content to input an Override Confidence OCg,n,s to override the threshold confidence TCg,n,s and a graphical user interface including a linear graphical representation of the measurement Mg,n,s and/or each threshold confidence TCg,n,s.
In the context of this invention the term audio-video content includes audio content; video content and audio-video content.
It will be seen that some embodiments of the invention provide a mechanism for storing and automatically propagating operator derived information in a way that enables the results of a media measurement tool to be trusted to a greater extent than hitherto.
Additional features and advantages of the invention will be set forth in the description that follows, and in part will be apparent from the description, or may be learned by practice of the invention. The advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are intended to provide further explanation of the invention as claimed.
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
In the drawings:
a-3c illustrate the lifecycle of a measurement;
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
Audio-video content 100 is delivered to a processing block 101 providing process PRp. The audio-video content or asset 102 is then passed for storing or further processing or failed, in decision block 120.
A measurement tool 103 takes a measurement on the audio-video content 102 and that measurement is processed in blocks 103 to 106 (the functionality of which will be described in detail below) to control the decision block 120.
It will be helpful to define some terms used in the diagrams and texts:
PRp Process #p which operates on audio-video material
MTn Measurement Tool #n that comprises a number of discrete measurements. e.g. MTA might be the Acme Testing Tool.
Mg,n,s Measurement performed by tool #n of measurement tool group #g at processing stage #s of a workflow, e.g. M3,A,1 and M3,A,2 and M3,A,3 are all the same type of measurement created with the same tool, but performed at different stages in the lifecycle of the material.
Ng,n,s Normalised Measurement in group #g from tool #n at stage #s.
TVg,s Threshold Value for group #g at stage #s. This is the critical value that a Normalised Measurement must meet for a measurement to have “passed”.
TCg,n,s Threshold Confidence of Measurement #g implemented by tool #n and used at stage #s
Lg,s Latitude in the Threshold Value for group #g at stage #s. This controls the range of Threshold Values for which a Normalised Measurement will generate non-zero Threshold Confidence values.
EBg,n,s Error Bar function. A factor that controls the size of the error bars of a measurement in group #g of tool #n at stage #s.
OCg,n,s Override Confidence of a Measurement in group #g from tool #n at stage #s. This is the Confidence value given to an Override whether introduce manually by a human at a GUI or automatically via stored knowledge in, for example, a database.
ACg,n,s Auto-assigned Confidence of measurement in group #g, from tool #n at stage #s.
An explanation of the lifecycle of a measurement may be helpful in clarifying the relationship between the different terms is clear.
Referring to
Define measurement MB,X,0 at stage 0 before Process PR1 with tool MTX from Group B.
Define measurement MB,Y,1 at stage 1 after Process PR1 with tool MTY from Group B.
These measurements cannot be directly compared unless their units are normalised by some tool specific function that ensures their units, range and linearity are similar.
N
B,X,0
=f
X(MB,X,0) and NB,Y,1=fY(MB,Y,1)
Where fX( ) and fY( ) are the tool specific normalisation functions for Tools MTX and MTY.
Now that there are comparable results, it is possible to compare the Normalised Value Ng,n,s to a Threshold Value TVg,s to see if the measurement was within tolerance at that stage of the media's lifecycle. For simplicity in
It is necessary to take into account the fact that many media measurements are inherently uncertain so there is a need to know the confidence that a result has exceeded a threshold. It should be noted that some Threshold Values are minimal in nature (e.g. to pass a test, all 8 bit pixel values should be greater than the decimal Threshold Value 16) and some Threshold Values are maximal in nature (e.g. all to pass a test, all 8 bit pixel values should be less than the decimal Threshold Value 235). The examples in this text will predominantly use the maximal threshold case. The same principles hold for the minimal threshold case.
There are two main elements of uncertainty that are taken into account in this method. The first is the Latitude Lg,s of a threshold in group #g at stage #s of a measurement. This property measures how much uncertainty can be tolerated at stage #s in the media lifecycle and is shown in
In a simple system with a maximal Threshold Value, a Normalised Measurement N must be below some threshold to pass. From the example in
In a media system, many of these absolute Threshold Values are in practice somewhat arbitrary, so the present invention employs the concept of Threshold Confidence TCg,n,s. The Normalised Measurement Values Ng,n,s all have the same units, range and linearity so that they can be compared with other normalised measurements in the same group #g. The Threshold Confidence TCg,n,s is a unitless value ranging from 0 to some maximum value TCmax that expresses the confidence that a measurement Ng,n,s passed a threshold value TVg,s. Because TCg,n,s has been generated as a linear value with a defined range and no units, it can be compared and manipulated with dissimilar measurements from different groups. Importantly, use of the instrument TCg,n,s allows more accurate QC results to be obtained in a QC process by propagation and aggregation of TCg,n,s from measurements that could otherwise not be compared.
A worked example could be used to calculate the TC for the whitest pixel that is allowed in a picture. In an 8 bit video system, black pixels have the value 16 and white pixels have the value 235. Checking the white value constitutes a Measurement Value with no intrinsic measurement error where the threshold Value TVg,s is a maximal threshold (235), one way to calculate the Threshold Confidence TCg,n,s would be to calculate the percentage distance of the Normalised Measurement in the Latitude window (values 233-237):
This is shown in
The Threshold Confidence, however, becomes more complicated due to the fact that many measurements (such as blockiness) are intrinsically inaccurate and subjective. Others (such as loudness) use Normalised Measurement units that are non-linear in nature. To compensate for this, an Error Bar function is introduced to allow for a window of possible values that might have been made by a given tool. In general, the Error Bar function will be a simple linear multiplier, but may be more complex for some measurement groups, such as blockiness, where other picture detail may be taken into account.
The Error Bar Function is shown pictorially in
For the remaining examples in this text, we will use the abstract function Cg to create a Threshold Confidence value TCg,n,s for measurement group #g. This function is different for each measurement group and takes into account differences in output between different measurement tools of the same measurement group as well as its intrinsic error properties when generating TC.
a-3c illustrate different levels of complexity in the function Cg.
TCn,g,s=Cg(Ng,n,s, EBg,n,s, TVg,s, Lg,s)
b shows a more realistic mapping function Cg that takes into account the fact that the TVg,s threshold is not symmetrical and that values of N that are less than the TVg,s are not 100% certain to be passes due to the fact that the TVg,s is a somewhat arbitrary number such as would be the case for a blockiness threshold.
c shows the effect of the EBg,n,s( ) function. It has the effect of adding a non-linear transfer function around the actual value of N.
This unitless measure of threshold confidence for a measurement may now be compared with other TCg,n,s values from different groups #g made with different tools #n to give an overall confidence that material has passed or failed at any stage #s in its lifecycle. The TCg,n,s value also acts as a parameter for allowing an override confidence to auto-propagate from an earlier stage of a Quality Control process to a latter stage of a Quality Control process.
An example of the process for using these values is given in
After processing the media in Process PR1 the output media is tested again, as shown in
After processing the media in Process PR2, the output media is tested again. In this case three tools are used. Tool MTA is used to create Report R3, Tool MTC is used to create Report R4 and Tool MTD is used to create Report R5. For the case of Tool MTA the same three measurements techniques are used to create new Measurement Values MX,A,3, MY,A,3 and MZ,A,3.
It is helpful here to refer to
To exemplify comparison between the same tool from the same group at difference stages, measurements from group X are first normalized with a measurement specific process as follows:
N
X,A,1
=f
X(MX,A,1)
N
X,A,2
=f
X(MX,A,2)
N
X,A,3
=f
X(MX,A,3)
The next step is to introduce Threshold Confidence using the measurement specific mapping function Cg.
TCX,A,1=Cg(NX,A,1, EBX,A,1, TVX,1, LX,1)
TCX,A,2=Cg(NX,A,2, EBX,A,2, TVX,2, LX,2)
TCX,A,3=Cg(NX,A,3, EBX,A,3, TVX,3, LX,3)
Typically the resulting values of TCg,n,s will correspond to some business indicator:
The present invention introduces the concept of threshold confidence where, for example, a continuous value from 0% to 100% is introduced that allows any measure to have a continuous scale of confidence that the measure has passed the test.
Threshold Confidence varies according to the measurement being carried out. Here are some examples:
1. “Sub-blacks” and “Supra Whites”. These measures correspond to
Luminance values of a TV signal that have exceeded the legal values from a Rec.601 video signal. The thresholds are 16 and 235 for 8 bit video. A value of 15 or 236 would be given a TC of 0% whereas a value of 17 or 234 would be given a TC of 100%. A value of 16 or 235 would be very, very close to failure without actually failing. Operationally, this would be given a TC of around 60% to indicate that it was very close to a failure threshold.
2. “Loudness” measurements are a single floating point value that indicates a modified mean loudness over a whole media clip. If this modified mean loudness exceeds a given threshold then it is a clear fail and would be allocated a TC of 0%. If the mean was less than the legal threshold mandated by either the government regulator or broadcaster. Then a TC value of 100% some_function(spread(loudness), safety_margin) would be allocated. The closer that the mean was to the threshold, then the lower the TC. If the spread of loudness values was very large, then this increases the uncertainty that the mean loudness is truly representative of the clip, reducing the confidence again.
3. “Blockiness” measurements are often a single floating point value that represent the perceived worst blockiness in a clip. The TC value can be set according to an overall blockiness measure and a ratio of how detailed the imagery in the clip was, when the blockiness measure was high. The more detailed the video content, the less visible that blockiness can be.
Threshold Confidence on its own is a useful, new concept that considerably strengthens the use in a system of automated QC measures.
It is however in the nature of many audio or video applications that the creation of interesting and entertaining content pushes the media in unexpected directions. These unexpected departures from what previously had been regarded as normal may be interpreted by automated QC measures as failures. It is also the case that, however well designed the automated QC measures, a skilled operator may reach a different and better conclusion. For example, the enormous variation in styles and genre of video content passing through a typical delivery channel, makes it very difficult for an automated QC measure to distinguish reliably between what an end user would perceive as acceptable and unacceptable departures from a perhaps arbitrary measurement threshold.
In a QC workflow according to the embodiment shown in
If an operator overrides the value of a measurement, then all user interaction (GUI, reports etc.) will display the Override Confidence rather than the Threshold Confidence. Override Confidence uses the same scale as TC and indicates how confident the operator was that the measurement was a pass. In many cases operational staff will only ascribe values of 0% or 100%.
For some measurements where human judgement is vital (such as a blockiness measure), the operator would be presented with a user interface to allow them to give a true OC value between 0% and 100%. This allows the operator to give a value that represents “how close to being fit for purpose” the media clip is. This threshold will vary based on the value of the material and the target to which the material is being sent. Operators are trained to make these judgements.
A rule for automatic propagation of Override Confidence OCg,n,s is based on the normalized measures and normalized confidences.
To allow Auto-propagation, it is desirable to define the word “consistent”. There is scope for tuning these rules for each specific measurement, but a useful basic rule is that if the TC is the same or better (higher or lower depending on the nature of the threshold) than the previous stage then flag it as a valid override. Whether or not the override is actually propagated may be determined by the product configuration to maximise flexibility.
As a worked example the False-Negative override case (OCg,n,s>TCg,n,s)—
If [((TCg,n,s>=TCg,n,s−1) or (Ng,n,s−1 falls within EBg,n,s(Ng,n,s)))=and (OCg,n,s−1>TCg,n,s−1)] then ACg,n,s=OCg,n,s−1 i.e. if the TC is the same or higher than the previous stage then flag it as a valid override.
False-Positive override case (OCg,n,s<TCg,n,s)—
If [((TCg,n,s<=TCg,n,s−1) or (Ng,n,s−1 falls within EBg,n,s(Ng,n,s))) and (OCg,n,s−1<TCg,n,s−1)] then ACg,n,s=OCg,n,s−1 i.e. if the TC is the same or lower then flag it as a valid override.
An enhancement to these basic sample rules is to look back over all previous stages and not just the most recent stage. This would create a control that would allow “auto-propagate overrides from any previous stage”.
A further enhancement is to provide that, if the first rule is not met, but there is an overlap between the TC error bars of the current stage and the TC error bars of the stage where the override was created, then this should be flagged as “likely to be a valid override”. This catches the case where it is difficult to ascertain that the error bars were either masking a bad value or degrading a good value.
When the auto-propagation rules are applied and there exists an Override-Confidence, there can be up to 3 confidence values. They are applied in this order of precedence:
1. OCg,n,s—this is used if it exists
2. ACg,n,s—this is used if it exists and the business rules allow
3. TCg,n,s—the default value of threshold confidence for this measure
Using this simple rule and this methodology, operator actions can be rapidly and automatically applied to many different tools taking into account their inherent differences and algorithms.
Using the same methodology as above, but looking across difference stages of a group of measurements, the same rules can be applied to propagate an upstream decision of an operator to a downstream measurements and gives the ability to propagate judgements in an environment where allowances are made for the uncertainty of the underlying measures.
The methodology can also be used in conjunction with propagated upstream measurements and decisions where an adjustment rework operation is performed to either improve on a failing measure, or to ensure acceptance of the result within a tighter specification. This continuation of the lifecycle has the added advantage of automatically verifying that repeated application of the original measurements produce results within the expected Latitude range, i.e. that fixing problem A has not created a problem B.
Overrides in this case may be created, not just by operators, but also from stored knowledge about the content. For example a low chroma detector can be automatically overridden if it is known that certain portions of the content are in black and white rather than in colour.
The rules and values can be applied without change to multiple tools at multiple stages.
The described methodology allows the threshold for individual QC measurements to remain independent whilst simultaneously being able to combine and weight dissimilar measurements types to give an overall confidence that media is of acceptable quality for an application. Typically the thresholds for a given measurement will be determined by one or all of the following drivers:
No single measurement may cause a piece of material to fail, but the described methods can be used to show that the overall confidence of a pass may be very low.
As illustrated in
Often, a platform will be provided at the point of first receipt of content by a commercial entity within a chain or group of entities working together in the creation; post-production; repurposing or other processing (where appropriate) and delivery of content. In that case, a threshold confidence value can determine whether or not content is accepted and/or whether content is stored or not.
Platform 390 comprises a unifier block 391 which communicates with the various measurement tools through respective measurement tool interfaces 392. There is also provided a graphical user interface. Inputs 394 and 395 communicate with upstream and downstream platforms. An operator override 396 is also provided. Operation of the platform will be understood from the foregoing description.
Separation of the measurement tool interfaces 392 from the unifier block 391 simplifies the extension of the platform to work with new proprietary measurement tools.
The GUI will preferably take advantage of the linear, dimensionless nature of the threshold confidence to provide absolute or relative indications of quality which can be interpreted instantly, even by non-specialists. These may take the form of simple bars, although the linear nature can still be maintained with curved lines. In the typical case, where the range of threshold confidence values is divided into pass and fail ranges or pass, warn and fail ranges, the simplicity of the interface can be maintained by changing the colour of the linear bar (curved of straight) to signify movement from one range to the next. The colours green, yellow and red can conveniently be employed for the pass, warn and fail ranges, respectively.
Content which has a threshold confidence value in the warn range can be treated in various ways. It may be appropriate automatically to divert any such content to reworking station where (usually) human operators can attempt to improve the quality. If the reworking resources are limited, the actual threshold confidence values can be used to rank the content to provide a priority order for reworking.
It will be understood that the process chain has been depicted symbolically and “adjacent” processes may actually be widely separated geographically and in time. It will be convenient to transport the reports which have been described above, in the form of metadata associated with the audio-video content.
The platform may take the form of software only, in a case where appropriate general purpose hardware already exists with the appropriate connectivity. Hardware/software hybrids or wholly hardware solutions are of course also possible.
Having thus described a preferred embodiment, it should be apparent to those skilled in the art that certain advantages of the described method and apparatus have been achieved.
It should also be appreciated that various modifications, adaptations and alternative embodiments thereof may be made within the scope and spirit of the present invention. The invention is further defined by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
1204221.4 | Mar 2012 | GB | national |
This application is a US National Phase of PCT/GB2013/050596, filed on Mar. 11, 2013, which claims priority to GB 1204221.4, filed on Mar. 9, 2012, incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2013/050596 | 3/11/2013 | WO | 00 |