The invention is related to an encoding method and device which allows for providing cartoonized video. The invention is further related to corresponding decoding methods and devices. The invention is also related to corresponding video signals.
Video cartoonization has attracted increasing attention in recent years. A source video, also called natural or photorealistic video, is unified, edge-processed, overlaid with edges and/or quantized to generate cartoon-like effects.
Cartoonization or cartoonizing is also known as non-photorealistic rendering. Cartoonizing smooths low contrast regions. At the same time it enhances high contrast regions.
Cartoonization keeps or fortifies the edges and lines of the source image, while unifying the color in almost continuous regions. This produces the cartoon-like effect. A cartoon may have sharp shadows and few highlights, and the contours of objects are overlaid with contour lines.
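The two effects just described, unifying color in nearly continuous regions and keeping strong edges, can be illustrated with a deliberately simple sketch. It is not one of the techniques cited in this application; the function name `cartoonize_row`, the parameters `levels` and `edge_threshold`, and the use of a single row of 8-bit luminance samples are all assumptions made for illustration.

```python
def cartoonize_row(row, levels=4, edge_threshold=40):
    """Toy cartoonizer for one row of 8-bit luminance samples (illustrative only)."""
    step = 256 // levels
    # Unify tones: snap each sample to the centre of its quantization bin,
    # which flattens low contrast regions.
    out = [(p // step) * step + step // 2 for p in row]
    # Keep edges: where the source jumps sharply between neighbours,
    # overlay a dark contour sample.
    for i in range(1, len(row)):
        if abs(row[i] - row[i - 1]) >= edge_threshold:
            out[i] = 0
    return out


row = [10, 12, 15, 11, 200, 205, 198, 202]
print(cartoonize_row(row))  # [32, 32, 32, 32, 0, 224, 224, 224]
```

The small variations within the dark and the bright region disappear, while the single large jump is marked by a contour sample.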
Detailed descriptions of techniques for cartoonization/non-photorealistic rendering are given, for instance, in European Patent Application EP07301342 or in H. Winnemoller, S. C. Olsen and B. Gooch, “Real-time Video Abstraction”, in ACM SIGGRAPH 2006. The terms cartoonizing and cartoonization as used within the current application refer to one or more of said described techniques.
Cartoonized movies are very vivid, which appeals to most audiences, especially children. Besides utilization for movies, cartoonized video is also suitable for video communication applications, such as internet protocol television (IPTV), mobile television (mobile TV) and video conferencing.
The special cartoon-like video will make all these applications more attractive. Furthermore, the cartoon-like effect is achieved by removing some detailed information. Thus, cartoonized video is likely to require less bandwidth than the corresponding source video.
However, the removed detailed information may be very important to some viewers, and viewers may wish to refer to the source video for some details while watching the cartoonized video.
Therefore, it is desirable to provide a cartoonized video and its corresponding source video at the same time.
This is achieved by the methods, devices and video signal of the independent claims.
An encoding method, which allows for providing a cartoonization of a source video and for recombining the source video from the cartoonization of the source video and a residual video, comprises the following steps:
Cartoonizing the source video, encoding a first video, reconstructing the first video, determining a first residual video between a second video and the reconstructed first video, encoding the first residual video and combining the encoded first video and the encoded first residual video wherein one of the first video and the second video is the source video and the other is the cartoonized source video. That is, either the second video is the source video and the first video is the cartoonized source video or the first video is the source video and the second video is the cartoonized source video.
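The steps listed above can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: `cartoonize` is a stand-in tone quantizer, `encode` and `reconstruct` model a lossy codec as plain uniform quantization, a frame is a list of 8-bit samples, and the dictionary keys and quantizer steps `q_base` and `q_res` are hypothetical names.

```python
def cartoonize(frame, step=64):
    # Hypothetical stand-in cartoonizer: coarse tone quantization.
    return [(p // step) * step + step // 2 for p in frame]


def encode(frame, q):
    # Hypothetical lossy codec: uniform quantization to the nearest multiple of q.
    # Floor division keeps this valid for negative residual samples too.
    return [(p + q // 2) // q for p in frame]


def reconstruct(codes, q):
    return [c * q for c in codes]


def encode_pair(source, first_is_cartoon=True, q_base=16, q_res=2):
    cartoon = cartoonize(source)                       # cartoonize the source video
    first, second = (cartoon, source) if first_is_cartoon else (source, cartoon)
    base = encode(first, q_base)                       # encode the first video
    first_rec = reconstruct(base, q_base)              # reconstruct the first video
    # Residual between the second video and the reconstructed first video.
    residual = [s - f for s, f in zip(second, first_rec)]
    enhancement = encode(residual, q_res)              # encode the residual
    # Combine both encoded parts into one signal.
    return {"base": base, "enhancement": enhancement,
            "q_base": q_base, "q_res": q_res,
            "first_is_cartoon": first_is_cartoon}


stream = encode_pair([10, 12, 15, 11, 200, 205, 198, 202])
print(stream["base"], stream["enhancement"])
```

Because the residual is taken against the reconstructed first video rather than the original first video, the coding error of the base layer is absorbed by the residual, so a decoder that adds both reconstructions recovers the second video up to the residual quantization error only.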
Said encoding method may result in a video signal comprising a base layer, in which the first video is encoded, and an enhancement layer comprising the encoded first residual video. If the first video is the source video, a cartoonization of the source video is reconstructible by combining a reconstruction of the first video and a reconstruction of the first residual video. If the first video is the cartoonization of the source video, the source video is reconstructible by combining a reconstruction of the first video and a reconstruction of the first residual video.
If the first video is the cartoonization of the source video, a photorealistic video can be extracted from said video signal by a decoding method comprising the following steps:
Separating an encoded first residual video and an encoded cartoonized video, reconstructing the cartoonized video, reconstructing the first residual video and forming the photorealistic video by combining the cartoonized video and the first residual video.
If the first video is the source video, a high quality cartoon can be extracted from said video signal by execution of the following steps:
Separating an encoded first residual video and an encoded photorealistic video, reconstructing the photorealistic video, reconstructing the first residual video and forming the cartoon by combining the photorealistic video and the first residual video.
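Both decoding methods share the same three operations: separate, reconstruct, combine. Only the role of the base layer differs. The following sketch shows this under assumptions: the signal is modelled as a dictionary with hypothetical keys `base`, `enhancement`, `q_base` and `q_res`, reconstruction is plain dequantization, and the sample values are made up for illustration.

```python
def reconstruct(codes, q):
    # Hypothetical reconstruction: dequantize with step q.
    return [c * q for c in codes]


def decode(stream, want_second=True):
    # Separate the encoded first video from the encoded residual.
    base_codes, res_codes = stream["base"], stream["enhancement"]
    first = reconstruct(base_codes, stream["q_base"])
    if not want_second:
        return first  # base layer only
    residual = reconstruct(res_codes, stream["q_res"])
    # Combine, clamping to the 8-bit sample range.
    return [max(0, min(255, f + r)) for f, r in zip(first, residual)]


# Made-up signal: the base layer carries a cartoon, the residual restores detail.
stream = {"base": [2, 2, 14, 14], "q_base": 16,
          "enhancement": [-11, -10, -12, -9], "q_res": 2}
print(decode(stream, want_second=False))  # [32, 32, 224, 224]
print(decode(stream))                     # [10, 12, 200, 206]
```

The same `decode` function serves either case; whether its combined output is the photorealistic video or the cartoon depends solely on which of the two the encoder placed in the base layer.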
Further embodiments of the methods and devices comprise features of one or more dependent claims.
Exemplary embodiments of the invention are illustrated in the drawings and are explained in more detail in the following description.
In the figures:
FIGS. 2a, 2b, 2c and 2d show exemplary embodiments of a first kind of inventive encoding devices,
FIGS. 3a, 3b, 3c and 3d show exemplary embodiments of a first kind of inventive decoding devices,
FIGS. 4a, 4b, 4c and 4d show exemplary embodiments of a second kind of inventive encoding devices,
FIGS. 5a, 5b, 5c and 5d show exemplary embodiments of a second kind of inventive decoding devices,
FIGS. 6a and 6b show exemplary embodiments of a third kind of inventive encoding devices and
FIGS. 7a and 7b show exemplary embodiments of a third kind of inventive decoding devices.
When cartoonizing a natural or photorealistic video commonly the steps depicted in
FIGS. 2a and 2b show two exemplary embodiments of a first kind of encoder for encoding natural video together with a cartoonization of the natural video.
The natural video SRC is fed into a cartoonizer CART. The resulting cartoon is forwarded to encoding means ENC. The encoding means ENC pass the encoded cartoon to a reconstructor RBL. The reconstructed cartoon is compared with the source SRC and a resulting residual is encoded by residual encoding means EN1. Said residual encoding means EN1 may be comprised in the encoding means ENC. The encoded residual is then combined with the encoded cartoon resulting in an output video signal SVC.
In
A third exemplary embodiment of the first kind of encoder for encoding natural video together with a cartoonization of the natural video is depicted in
A fourth exemplary embodiment of the first kind of encoder for encoding natural video together with a cartoonization of the natural video is depicted in
The encoded cartoon, the encoded residual and the encoded additional residual, if there is any, may be combined in the output stream SVC following a scalable video coding scheme. Then, the encoded cartoon may be comprised in a base layer while a first enhancement layer comprises the encoded residual. If the encoded additional residual exists, it may be comprised in a second enhancement layer.
Then, the cartoon may be encoded with a low bit rate so that low bandwidth devices can decode the base layer only, resulting in a base cartoon of low quality. At the same time, a high bandwidth device may decode a high quality cartoon and/or a high quality version of the natural video from which the high quality cartoon is generated. This is achieved through decoding of the residual and/or the additional residual comprised in the first and/or the second enhancement layer. The high bandwidth device may also allow a user to switch between the different cartoon qualities and/or between natural video and cartoon.
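The layered arrangement can be illustrated with a short sketch of how a device might pick a view from whatever layers it received. It rests on assumptions: the keys `base`, `el1`, `el2`, `q_base` and `q_enh` are hypothetical, the first enhancement layer is taken to refine the base cartoon into the natural video, and the second is assumed to refine the base cartoon into a higher quality cartoon. The exact role of the additional residual depends on the embodiment and may differ from this sketch.

```python
def reconstruct(codes, q):
    # Hypothetical reconstruction: dequantize with step q.
    return [c * q for c in codes]


def decode_view(stream, view):
    """Return the requested view, using only the layers actually received."""
    base = reconstruct(stream["base"], stream["q_base"])
    if view == "base_cartoon":
        return base  # always available, low quality
    # Assumed mapping of views to enhancement layers.
    layer = {"natural": "el1", "hq_cartoon": "el2"}[view]
    if layer not in stream:
        raise KeyError(f"{layer} not received; only the base cartoon is available")
    refinement = reconstruct(stream[layer], stream["q_enh"])
    return [b + r for b, r in zip(base, refinement)]


# Made-up example signals.
full = {"base": [2, 2, 14, 14], "q_base": 16,
        "el1": [-11, -10, -12, -9], "el2": [0, 0, -1, -1], "q_enh": 2}
base_only = {"base": full["base"], "q_base": 16}  # what a low bandwidth device gets

print(decode_view(base_only, "base_cartoon"))  # [32, 32, 224, 224]
print(decode_view(full, "natural"))            # [10, 12, 200, 206]
print(decode_view(full, "hq_cartoon"))         # [32, 32, 222, 222]
```

Switching between cartoon and natural video on a capable device then amounts to calling `decode_view` with a different `view` on the same received signal.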
Exemplary embodiments of a first kind of decoding devices which are suitable for extracting the source and/or a cartoon of the source video are shown in the
All decoding devices of the first kind comprise a separator SEP for separating the encoded base cartoon from the encoded residual and the further encoded residual, if there is any, from the received video signal SVC. Furthermore, all decoders of the first kind comprise means for reconstructing RBL the encoded base cartoon and means for reconstructing RE1 the encoded residual.
In
The reconstruction of the base cartoon TLO may be of low quality due to distortions and artefacts introduced by the encoding process. Therefore, the exemplary embodiments of
In
And in
In
The means for reconstructing RBL a base cartoon and for reconstructing RE1 the residual may be realised by the same hardware. Furthermore, in
FIGS. 4a and 4b show two exemplary embodiments of a second kind of encoder for encoding natural video together with a cartoonization of the natural video.
Within the embodiments depicted in
In
A third exemplary embodiment of the second kind of encoder for encoding natural video together with a cartoonization of the natural video is depicted in
A fourth exemplary embodiment of the second kind of encoder for encoding natural video together with a cartoonization of the natural video is depicted in
Again, the encoded source, the encoded remainder and the encoded additional remainder, if there is any, may be combined in the output stream SVC following a scalable video coding scheme. Then, the encoded natural video may be comprised in a base layer while a first enhancement layer comprises the encoded remainder and a second enhancement layer comprises the encoded additional remainder.
Then, the natural video may be encoded with a low bit rate so that low bandwidth devices can decode the base layer only, resulting in a base natural video of low quality. At the same time, a more capable device with higher bandwidth may decode a high quality cartoon and/or a high quality version of the natural video from which the high quality cartoon is generated. This is achieved through decoding of the remainder and/or the additional remainder comprised in the first and/or the second enhancement layer. The more capable device may also allow a user to switch between the different natural video qualities and/or between natural video and cartoon.
Exemplary embodiments of a second kind of decoding devices which are suitable for extracting the source and/or a cartoon of the source video are shown in the
All decoders of the second kind comprise a separator SEP for separating the encoded base natural video from the encoded remainder and the encoded additional remainder, if there is any, from the received video signal SVC. Furthermore, all decoders of the second kind comprise means for reconstructing RBL the encoded base natural video and means for reconstructing RE1 the encoded remainder. The exemplary embodiments of
The decoders depicted in
In
And in
In
The means for reconstructing RBL a base natural video and for reconstructing RE1 the remainder may be realised by the same hardware. Furthermore, in
FIGS. 6a and 6b show two exemplary embodiments of a third kind of encoder for encoding natural video together with a cartoonization of the natural video.
Within the embodiments depicted in
In
Again, the encoded natural video, the encoded cartoon remainder and the encoded natural video remainder, if there is any, may be combined in the output stream SVC following a scalable video coding scheme. Then, the encoded natural video may be comprised in a base layer while a first enhancement layer comprises the encoded cartoon remainder and a second enhancement layer comprises the encoded natural video remainder, if there is any.
Then, the natural video may be encoded with a low bit rate so that low bandwidth devices can decode the base layer only, resulting in a base natural video of low quality. At the same time, a more capable device with higher bandwidth may decode a high quality cartoon and/or a high quality version of the natural video from which the high quality cartoon is generated. This is achieved through decoding of the encoded cartoon remainder and/or the encoded natural video remainder comprised in the first and/or the second enhancement layer. The more capable device may also allow a user to switch between the different natural video qualities and/or between natural video and cartoon.
Exemplary embodiments of a third kind of decoding devices which are suitable for extracting the source and/or a cartoon of the source video are shown in the
Both decoders of the third kind comprise a separator SEP for separating the encoded base natural video from the encoded cartoon remainder and the encoded natural video remainder, if there is any, from the received video signal SVC. Furthermore, both decoders of the third kind comprise means for reconstructing RBL the encoded base natural video and means for reconstructing RE1 the encoded cartoon remainder. In addition, both decoders comprise means for cartoonization CART, which serve to cartoonize the reconstructed base natural video SLO. The exemplary embodiments of
The decoders depicted in
In
The means for reconstructing RBL a base natural video and for reconstructing RE1 the cartoon remainder may be realised by the same hardware. Furthermore, in
| Number | Date | Country | Kind |
|---|---|---|---|
| 07301459.9 | Oct 2007 | EP | regional |
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/EP2008/063676 | 10/10/2008 | WO | 00 | 4/12/2010 |