In the following, method and device for clustering patches of a degraded picture in view of a restoration are disclosed.
This section is intended to introduce the reader to various aspects of art, which may be related to various aspects of the present principles that are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present invention. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
The present principles relate to the restoration of a degraded picture or of a degraded video. Restoration methods have been notably used for de-noising and for super-resolution application, but those restoration methods are also compatible with a broader scope of applications, such as deblurring or color enhancement. In the following, we refer to these methods as restoration method or upgrade method.
Clustering-based restoration methods mainly comprises two steps: firstly clustering the degraded image (being a still picture or an image from a video), and secondly processing in a same way each cluster in order to restore the degraded input image. State-of-the-art methods mainly focus on designing efficient methods for the second step. They use generic clustering methods such as k-means for the first step.
A method for clustering an image where the method would advantageously improve the overall restoration performance is therefore desirable. Clustering-based methods have been explored recently and shown to bring significant improvement in the fields of de-noising for additive white Gaussian noise (AWGN) as disclosed in “Sparsity-based Image Denoising via Dictionary Learning and Structural Clustering” by W. Dong, X. Li, L. Zhang and G. Shi, CVPR 2011 as well as signal-dependent noise (SDN) as disclosed in “Signal dependent noise removal from a single image” by X. Liu, M. Tanaka and M. Okutomi, IEEE ICIP 2014. Such clustering based method was also proved useful for super-resolution as disclosed for instance in “Learning multiple linear mappings for efficient single image super-resolution”, from Kaibing Zhang, Dacheng Tao, Xinbo Gao, Xuelong Li, and Zenggang Xiong, in IEEE Trans. On Image Processing, vol. 24, no. 3, march 2015.
While each one of the aforementioned documents discloses a specific method configured for their target application, all methods rely on k-means clustering. k-means is known for cluster analysis in data mining. K-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean. In other words, the degraded image is split into patches, seen as observations, and the patches are classified into clusters by a k-means method applied to a characteristic extracted from the patches of the degraded image. As such, patches, considered as homogenous, are identically processed. However such clustering is not always optimal, as its design is generic and does not take into account the following process.
Consequently, a method for clustering an image where the method would advantageously improve the overall restoration performance is therefore desirable.
Also, the article “Structured Overcomplete Sparsifying Transform Learning with Convergence Guarantees and Applications” by B. Wen, S. Ravishankar and Y. Bresler, International Journal of Computer Vision, vol. 114, no 2-3, 19 Oct. 2014, discloses a union of sparsifying transforms model, which is notably applied to image denoising.
That solution is relevant to cases in which a Gaussian noise having known properties is present. In other situations, such as for example in video compression, the noise may prove highly correlated and thus far from Gaussian, thereby leading to unsuited clustering and restoration in which parasitic structures of blocks induced by compression run the risk of being assimilated to the signal itself.
The purpose of the invention is to overcome at least one of the disadvantages of prior art by proposing a method for clustering patches of a degraded version of an image. A salient idea of the present principles is that in some applications, the original and full graded version of the image is known at some step in the image processing chain, naturally this step is usually not known at the restoration step itself. Therefore an optimal clustering is disclosed which relies on the minimization of the reconstruction error between the restored degraded version of an image and an original version of the image. Since the source image is not known from the restoration applications, a classification-based framework to estimate the optimal clusters according to the present principles is further disclosed.
In particular, by contrast with the article by B. Wen et al. above, no restrictive hypotheses need to be made on the nature of the noise.
The above applications can also be combined.
According to a first aspect, a method for clustering patches of a degraded version of an image into at least two clusters is disclosed wherein the clusters are based on errors computed between patches of the degraded version processed with upgrade functions associated respectively with the clusters and patches of a full graded version of the image corresponding respectively to the patches of the degraded version of the image.
In a first preferred implementation, the clustering is carried out by at least one processor, and the method comprises receiving data representative of the full graded version of the image in at least one input, and outputting data representative of the degraded version of the image and information on the clustering from at least one output.
That outputting is then preferably for at least one of transmission to a telecommunication network and recording on at least one storage unit.
According to a specific characteristic, the method comprises determining the patches of the degraded version belonging to a first of the clusters adapted to a first of the upgrade functions and the patches of the degraded version belonging to a second of the clusters adapted to a second of the upgrade functions, wherein one of the patches belongs to the first cluster in case where the error computed between that patch of the degraded version processed with the first upgrade function and the corresponding patch of the full graded version of the image is lower than the error computed between the same patch of the degraded version processed with the second upgrade function and the same patch of the full graded version of the image.
According to various embodiment, the method comprises:
In a second preferred implementation, the clustering is carried out by at least one processor, and the method comprises receiving data representative of the degraded version of the image and information on the clustering in at least one input, and deriving with the processor(s) the clustering from the data representative of the degraded version and from the information on the clustering.
That input is then preferably from at least one of a telecommunication network and at least one storage unit.
The method for clustering advantageously comprises receiving indications chosen among indications of the patches of the degraded version of the image belonging to the clusters adapted to the associated upgrade functions, indications of clustering of the patches of the degraded version in the clusters adapted to the associated upgrade functions, and a subset of the patches for clustering the patches of the degraded version in the clusters adapted to the associated upgrade functions.
The disclosure also pertains to a method for restoring a degraded version of an image, comprising clustering patches of the degraded version into at least two clusters, and processing those clusters of patches so as to produce a restored version of the image, the clustering being compliant with any embodiment of the above method for clustering.
Advantageously, the clustering and the processing of the clusters are carried out by the processor(s), as well as deriving the clustering from the data representative of the degraded version and from information on the clustering, and the method comprises receiving data representative of the degraded version of the image and information on the clustering in at least one input, and outputting the restored version of the image from at least one output.
According to a third aspect, a device is disclosed that comprises at least one processor configured to cluster patches of a degraded version of an image into at least two clusters, wherein the clusters are based on errors computed between patches of the degraded version processed with upgrade functions and patches of a full graded version of the image corresponding respectively to the patches of the degraded version of the image.
That device is preferably configured for executing any embodiment of a method for clustering or a method for restoring as recited above.
In a first preferred implementation, that device comprises at least one input configured for receiving data representative of the full graded version of the image, and at least one output configured for outputting data representative of the degraded version of the image and information on the clustering.
In a second preferred implementation, that device comprises at least one input configured for receiving data representative of the degraded version of the image and information on the clusters, and at least one output for outputting the restored version of the image, the processor(s) being adapted for clustering the patches from the data representative of the degraded version and from the information on the clusters.
In a variant, a device comprising means for clustering patches of a degraded version of an image into at least two clusters, wherein the clusters are based on errors computed between patches of the degraded version processed with upgrade functions associated respectively with the clusters and patches of a full graded version of the image corresponding respectively to the patches of the degraded version of the image.
According to a specific embodiment, the device belongs to a set comprising:
According to a fourth aspect, a computer program product comprising program code instructions to execute the steps of the clustering method or the restoring method when this program is executed on a computer is disclosed.
According to a fifth aspect, a processor readable medium is disclosed that has stored therein instructions for causing a processor to perform at least the clustering of patches of a degraded version of an image into at least two clusters, wherein the clusters are based on errors computed between patches of the degraded version processed with upgrade functions associated respectively with the clusters and patches of a full graded version of the image corresponding respectively to the patches of the degraded version of the image.
According to a sixth aspect, a non-transitory program storage device is disclosed that is readable by a computer, tangibly embodies a program of instructions executable by the computer to perform a method comprising clustering patches of a degraded version of an image into at least two clusters, wherein the clusters are based on errors computed between patches of the degraded version processed with upgrade functions associated respectively with the clusters and patches of a full graded version of the image corresponding respectively to the patches of the degraded version of the image.
While not explicitly described, the present embodiments may be employed in any combination or sub-combination. For example, the invention is not limited to the described color mapping operators and any adjustable mapper can be used.
Besides, any characteristic or embodiment described for the interpolating method is compatible with a device intended to process the disclosed method and with a computer-readable storage medium storing program instructions.
Other characteristics and advantages of the present principles will appear through the description of non-limiting embodiments, which will be illustrated, with the help of the enclosed drawings:
The input 10 is further configured to obtain clusters adapted to an update function.
According to different embodiments of the invention, the source belongs to a set comprising:
The input 10 is linked to a clustering module 12 configured to determine to which cluster a patch of degraded version Y of the image belongs to. Advantageously, the clustering module 12 further comprises a classifier and/or a learning module for the classifier as it will be described after with disclosure of various restoration framework. The output of the clustering module 12 is linked to an output 18. According to various restoration framework latter on detailed, the output 18 provides:
As an example, the clustering in any of its output embodiments are stored in a remote or in a local memory, e.g. a video memory or a RAM, a hard disk. Thus the clustering is advantageously performed off-line in device 1 and results are available for a future on-line process in a remote restoration device. In a variant, the clustering output are sent to a restoration device by means of a storage interface, e.g. an interface with a mass storage, a ROM, a flash memory, an optical disc or a magnetic support and/or transmitted over a communication interface, e.g. an interface to a point to point link, a communication bus, a point to multipoint link or a broadcast network.
According to different embodiments, the clustering of the degraded version of the picture may be sent to a destination, e.g. a restoration device or display device. As an example, the clustering of the degraded version of the picture is stored in a remote or in a local memory, e.g. a video memory or a RAM, a hard disk. In a variant, the clustering is sent to a storage interface, e.g. an interface with a mass storage, a ROM, a flash memory, an optical disc or a magnetic support and/or transmitted over a communication interface, e.g. an interface to a point to point link, a communication bus, a point to multipoint link or a broadcast network.
According to an exemplary and non-limiting embodiment, the clustering device 1 further comprises a computer program stored in the memory 120. The computer program comprises instructions which, when executed by the clustering device 1, in particular by the processor 110, enable the clustering device 1 to execute the method described with reference to
The clustering device 1 is advantageously part of a coder or of a TV set.
Where C={C1 . . . Ck} is the set of clusters, with k the number of clusters, and process is the restoration function F apply on each cluster Ci, with Ω the set of parameters for restoration process. Advantageously, k the number of clusters, as well as process the restoration function are known. A greedy algorithm is disclosed to solve the above equation taking as input X, Y, k, process and as output C={C1 . . . Ck} and the set Ω of parameters Ωi respectively applied for each cluster Ci.
Sub steps of the clustering 300 are now detailed.
In an initialization step 310, patches are clustered into C={C1 . . . Ck}. In a first variant the initialization is random where y are randomly assigned to a cluster. According to a second variant, a generic clustering algorithm such as k-means is applied to the set of patches y.
Then in an iterative update the following steps are performed until convergence is reached that is until the clusters do not change.
In an assignment step 320, patches y are reassigned to clusters:
∀i=1 . . . k,Ci={x,y:|x−process(y,Ωi)|2≤|x−process(y,Ωj)|2∀j=1 . . . k,x∈X,y∈Y}
Let's consider the variant limited to 2 clusters C1 and C2 for simplification of the description. Thus, in the step 320, a patch y of a degraded version Y is determined as belonging to a first cluster C1 adapted to a first upgrade function process with Ω1 or to a second cluster C2 adapted to a second upgrade function process with Ω2 in case where an error e=|x−process(y,Ω1)| computed between the patch y of degraded version processed with the first upgrade function process with Ω1 and a patch x of a full graded version of the image is lower than an error e=|x−process(y,Ω2)| computed between the patch y of the degraded version processed with the second upgrade function process with Ω2 and the patch x of the full graded version of the image:
C
1
={x,y:|x−process(y,Ω1)|2≤|x−process(y,Ω2)|2∀x∈X,y∈Y}
Then in a update step 330 the parameters values are adjusted and obtained from:
The assignment step is well defined and can be computed for any restoration function process. However, the update step can not be generally defined, as the set Ω of parameters is specific to each process algorithm.
Thus two embodiments of the update step, each corresponding to a specific restoration function process are detailed hereafter.
According to a first embodiment, the restoration process is a de-noising with soft or hard thresholding in the transform domain as disclosed in “De-noising by soft-thresholding” by Donoho, David L., IEEE Transactions on Information Theory 1995 or in “Improved wavelet denoising via empirical Wiener filtering” by Ghael, Sandeep P, Sayeed, Akbar M, Baraniuk, Richard G, Proceedings of SPIE 1997. The de-noising function comprises 3 steps described below. Here the only set D. of parameters is the threshold τ={T1 . . . Tk} thus a cluster Ci is adapted to the threshold value τi of the de-noising function process. Thus if y′ refers to the de-noised version of y, we have
y′=process(y,τ) with
In others words, a transform function is applied to a patch y, the obtained value Υ′ of y in the transform domain is then clipped with a threshold τ and the de-noised version of the patch y′ is obtained from the inverse transform function applied to the clipped value Υ′.
The assignment step is performed as described above, with Ω=τ.
∀i=1 . . . k,Ci={x,y:|x−process(y,τi)|2≤|x−process(y,τj)|2∀j=1 . . . k,x∈X,y∈Y}
The update step is described below:
∀i=1 . . . k,τi={τ*:|x−process(y,τ*)|2≤|x−process(y,τ)|2,∀τ=τmin . . . τmax,∀x,y∈Ci}
In a first variant, the values τmin and τmax are defined by the user. In another variant, the values τmin and τmax are defined as the minimum and maximum values for a patch y in the transform domain Υ.
According to another variant, not only one, but a set of transforms is considered for the de-noising function. transform thus becomes a parameter, and the update step finds for each cluster the optimal parameters couple (transformi, τi).
According to a second embodiment, the restoration process is a linear mappings for de-noising or super-resolution application. The de-noising algorithm is described below and is the results of a product of the patch with a linear mapping projection matrix P. Here the only set Ω of parameters is the linear mapping projection matrix P, and if y′ refers to the de-noised version of y we have:
y′=process(y,P)=P*y
The assignment step is performed as described above, with Ω=P.
∀i=1 . . . k,Ci={x,y:|x−Pi*y|2≤|x−Pj*y|2∀j=1 . . . k,x∈X,y∈Y}
The update step can be formulated in a matrix form as follow:
Where Xi and Yi contains in their columns the vectorized source and degraded patches respectively from cluster Ci.
Minimizing |Xi−YiP|2 is a linear regression problem which solution is known, for instance in “Solving Least Squares Problems” by Lawson, C. L.; Hanson, R. J., Englewood Cliffs, N.J.: Prentice-Hall 1964. The update step thus comes down to the following equation:
P
i
=X
i
Y
i
T(YiYiT)−1
In a nutshell, a training phase is performed on a classifier from a training dataset YT for which the source images XT are known. The learned classifier M is then used to predict the optimal clusters C′ for degraded images for which the corresponding source image is unknown. According to the various embodiments, the aside indications used to predict the optimal clusters C′ for degraded images, for which the corresponding source image is unknown, change.
In a first step 300, a clustering of patches of the dataset YT for which the source images XT are known is performed according to the present principles and results in optimal clusters C associated with optimal parameters D. of the restoration function.
In a second step 400, a classifier is trained from the optimal clusters C resulting in classifier model M. The skilled in the art will understand that optimal parameters Q of the restoration function are not modified in this step. They come with the optimal clusters C used to train the classifier. The trained classifier is thus configured to predict the optimal cluster labels from the degraded signal only:
M=trainClassifier(YT,C)
Where M is the classifier model.
Classification, also known as supervised learning, has been well studied. The design of a specific classification algorithm is out the scope of the present principles, and any existing algorithms is compatible with the present principles. Among non-limiting examples, any of the following models is considered for the supervised learning of the classifier:
In a third step 420, the trained classifier M is sent to a remote storage device for online processing or to the device processing the restoration method. In other words, an indication, corresponding to the classifier model, of clustering of patches of the degraded version in one of the clusters wherein a cluster is adapted to a specific upgrade function are sent to a remote device.
In a fourth step 500, the clusters C′ adapted to the restoration application are determined for a degraded signal Y for which a source signal X is not available (unlike the degraded signal YT of the dataset T). The clusters C′ are obtained using the classifier model as explained above:
C′=estimateCluster(Y,M)
Where C′ are the estimated clusters.
In a last step 600, the restoration algorithm is possibly applied based on the obtained clusters C′ to recover a restored version of the image Y′ by applying the restore function to patches of Y belonging to an estimated cluster with the parameters attached to this estimated cluster.
Depending on the application, different workflows are considered. In the case of generic restoration as shown on
Another application is the restoration of an image with quantification noise resulting from compression application. In this restoration application, the source signal X is known for all coded image Y, consequently, the training dataset T may be all or part of the sequence to be processed.
In a first embodiment of the restoration framework illustrated on
In a second embodiment of the restoration framework illustrated on
In a third embodiment of the restoration framework illustrated on
Thus
Thus in a compression context, the source signal is known at the encoder and the clustering method can be applied. The classifier model M based on optimal parameters Ω should then be sent 440 to the decoder as metadata or aside indications. Alternatively, a subset of the optimal clusters labels C can be sent 460 to the decoder, where the classifier can then be learned.
The implementations described herein may be implemented in, for example, a method or a process, an apparatus, a software program, a data stream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method or a device), the implementation of features discussed may also be implemented in other forms (for example a program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. The methods may be implemented in, for example, an apparatus such as, for example, a processor, which refers to processing devices in general, including, for example, a computer, a microprocessor, an integrated circuit, or a programmable logic device. Processors also include communication devices, such as, for example, computers, cell phones, portable/personal digital assistants (“PDAs”), and other devices that facilitate communication of information between end-users.
Implementations of the various processes and features described herein may be embodied in a variety of different equipment or applications, particularly, for example, equipment or applications. Examples of such equipment include an encoder, a decoder, a post-processor processing output from a decoder, a pre-processor providing input to an encoder, a video coder, a video decoder, a video codec, a web server, a set-top box, a laptop, a personal computer, a cell phone, a PDA, and other communication devices. As should be clear, the equipment may be mobile and even installed in a mobile vehicle.
Additionally, the methods may be implemented by instructions being performed by a processor, and such instructions (and/or data values produced by an implementation) may be stored on a processor-readable medium such as, for example, an integrated circuit, a software carrier or other storage device such as, for example, a hard disk, a compact diskette (“CD”), an optical disc (such as, for example, a DVD, often referred to as a digital versatile disc or a digital video disc), a random access memory (“RAM”), or a read-only memory (“ROM”). The instructions may form an application program tangibly embodied on a processor-readable medium. Instructions may be, for example, in hardware, firmware, software, or a combination. Instructions may be found in, for example, an operating system, a separate application, or a combination of the two. A processor may be characterized, therefore, as, for example, both a device configured to carry out a process and a device that includes a processor-readable medium (such as a storage device) having instructions for carrying out a process. Further, a processor-readable medium may store, in addition to or in lieu of instructions, data values produced by an implementation.
As will be evident to one of skill in the art, implementations may produce a variety of signals formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry as data the rules for writing or reading the syntax of a described embodiment, or to carry as data the actual syntax-values written by a described embodiment. Such a signal may be formatted, for example, as an electromagnetic wave (for example, using a radio frequency portion of spectrum) or as a baseband signal. The formatting may include, for example, encoding a data stream and modulating a carrier with the encoded data stream. The information that the signal carries may be, for example, analog or digital information. The signal may be transmitted over a variety of different wired or wireless links, as is known. The signal may be stored on a processor-readable medium.
A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different implementations may be combined, supplemented, modified, or removed to produce other implementations. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting implementations will perform at least substantially the same function(s), in at least substantially the same way(s), to achieve at least substantially the same result(s) as the implementations disclosed. Accordingly, these and other implementations are contemplated by this application.
Number | Date | Country | Kind |
---|---|---|---|
15306662.6 | Oct 2015 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2016/074362 | 10/11/2016 | WO | 00 |