The present invention relates to an apparatus and a method for broadcasting a stereoscopic 3D video, and more specifically, to a technique to processing speed and time for acquiring a high-quality 3D video by lowering complexity of a reception symbol vector log likelihood ration (LLR) operation in an iterative channel decoding process for acquiring the high-quality 3D video using a hierarchical IL-FEC method.
In a wireless broadcasting system, reception quality of various receivers distributed in a wide range may be significantly various. For example, some applications tend to be received by mobile receivers and fixed receivers. Further, it is sometimes preferable to perform a local service insertion (LSI) inserting local content into some cells without influencing national content during broadcasting. A hierarchical modulation adopted as a broadcasting standard such as Digital Video Broadcasting-Terrestrial (DVB-T) and Digital Video Broadcasting-Next Generation Handheld (DVB-NGH) may be used in applications transmitting two bits as the same signal.
For example, the hierarchical modulation may be used for transmitting content having various definitions such as standard definition (SD), high definition (HD), or ultra-high definition (UHD). The high-definition content is transmitted by mapping high priority (HP) bits and low priority (LP) bits and modulating a hierarchical 16 QAM.
In this case, the broadcasting standard defines a plurality of hierarchical values α and a service provider may select a value suitable for the provided application from the hierarchical values α. For example, the DVB-T allows a service provider to select the value α of 1, 2, or 4, while the DVB-NGH allows the service provider to select the value α of 1, 2, 3, or 4. The value α affects an inter-stream interference level and affects a trade-off between throughput gain and degradation of performance. Typically, a set of hierarchical values α is provided to each application in use. The hierarchical value α is an important element for acquiring a high-quality 3D video and it is necessary to acquire the hierarchical value α suitable thereto.
In a process of de-mapping a symbol vector received from a destination node receiving the hierarchical-modulated symbol vector on the basis of the appropriate hierarchical value α for iterative channel decoding, there is a disadvantage in that the LLR operation complexity of each of the HP stream and the LP bit is large.
As a result, the applicant of the present invention proposes a method for reducing the processing time and speed of the iterative channel decoding at the rear end by reducing the LLR operation complexity of each of the HP stream and the LP bit.
In order to solve the problems in the related art, an object of the present invention is to provide an apparatus and a method for broadcasting a stereoscopic 3D video capable of shortening processing time and speed of iterative channel decoding performing iterative decoding to improve BER performance by reducing operation complexity of a HP bit LLR and a LP bit LLR of a reception symbol vector.
Another object of the present invention is to provide an apparatus and a method for broadcasting a stereoscopic 3D video capable of improving reliability of decoding by updating the HP bitstream during iterative channel decoding.
Yet another object of the present invention is to provide an apparatus and a method for broadcasting a stereoscopic 3D video capable of displaying a high-quality 3D video on a screen by acquiring an appropriate hierarchical value in an iterative channel decoding process in which hierarchical modulation is combined with hierarchical 16QAM.
Objects of the present invention are not limited to the aforementioned objects, and other objects and advantages of the present invention, which are not mentioned, can be appreciated by the following description and will be more apparently known by the exemplary embodiments of the present invention. Further, it can be easily understood that the objects and advantages of the present invention may be implemented by means illustrated in claims and combinations thereof.
According to an aspect of the present invention, there is provided an apparatus for broadcasting a stereoscopic 3D video including: a source node that transmits a left video and a right video collected by a plurality of cameras in a transmission symbol vector form by performing 16QAM modulation and hierarchical modulation based on hierarchical values set from a predetermined model after compressing the left video and the right video; and a destination node that acquires the left video and the right video by performing iterative channel decoding after performing hierarchical 16QAM de-mapping based on a predetermined hierarchical value and transmits the acquired left and right videos to a display device with respect to a reception symbol vector in which Gaussian noise is added to the transmission symbol vector at the source node side.
Preferably, the source node may include an IL-FEC channel encoding unit that generates a combined video bitstream b12 by combining compressed left video bitstream b1 and right video bitstream b2 of NAL units and performing an exclusive OR operation with respect to the left video V1 and the right video V2 collected through the plurality of cameras; an RSC encoding unit including a plurality of RSC encoders that output parity bitstreams P1 and P2 of the left video and the right video by performing RSC encoding for the compressed left video bitstream and right video bitstream; a concatenate vector generation module that generates a codeword bitstream c1 of the left video and a codeword bitstream c12 of the combined video by concatenating the left video bitstream b1, the right video bitstream b2, the parity bitstream P2 of the right video, and the combined video bitstream b12 and outputs the generated codeword bitstream c1 of the left video and codeword bitstream c12 of the combined video to a HP bitstream and a LP bitstream, respectively; and a hierarchical 16QAM mapping module that maps the received HP bitstream and LP bitstream after performing both 16 QAM and hierarchical modulation based on the hierarchical value set from the predetermined model to transmit the mapped HP bitstream and LP bitstream to the destination node in a transmission symbol vector form.
Preferably, the IL-FEC channel encoding unit may include an interleaver that combines the compressed left video bitstream and right video bitstream; and an exclusive OR operator that outputs a combined video bitstream by operating an exclusive OR with respect to the output bitstream and the right video bitstream of the interleaver.
Preferably, the destination node may include a hierarchical 16QAM de-mapper that acquires the hierarchical value and performs the hierarchical 16QAM de-mapping based on the acquired hierarchical value, and then outputs a left video bitstream LLR, a parity bitstream LLR, a combined video bitstream LLR and a parity bitstream LLR by operating the HP codeword bitstream LLR and the LP codeword bit stream LLR, with respect to the reception symbol vector in which Gaussian noise is added to the transmission symbol vector of the source node side, respectively; an iterative channel decoder that acquires a left video stream and a right video stream by performing a predetermined number of iterative channel decoding with respect to the generated left video bitstream LLR, parity bitstream LLR, combined video bitstream LLR and parity bitstream LLR; and an encoding unit including a plurality of encoders that transmits a left video and a right video to a display device after compressing and encoding the acquired left and right video bitstreams, respectively.
Preferably, in the hierarchical 16QAM de-mapper, each operation relation of a LP codeword bitstream LLR and a HP codeword bitstream LLR, drawn through logarization for the probability that the received reception symbol vector is to be transmitted, is provided by the four fundamental arithmetic operations, in order to reduce operation complexity of the LP codeword bitstream LLR and the HP codeword bitstream LLR.
Preferably, the iterative channel decoder may include a first SISO decoder that generates an additional bitstream LLR by performing soft decoding by inputting the left video bitstream LLR and the parity bitstream LLR and outputs the left video bitstream; an interleaver that outputs an interleaved additional bitstream LLR by rearranging a sequence of the additional bitstream LLR of the first SISO decoder; a first LLR operator that generates a prior bitstream of the right video by combining the interleaved additional bitstream and the combined video bitstream as the input based on the LLR; a second SISO decoder that generates an additional bitstream of the left video and outputs the right video bitstream by soft decoding the prior bitstream of the right video and the parity bitstream of the combined video of the first LLR operator as the input; a second LLR operator that generates an additional bitstream of the left video by combining the additional bitstream of the right video of the second SISIO decoder and the combined video bitstream based on the LLR; a deinterleaver that generates a prior bitstream of the left video through deinterleaving and exclusive OR operation with respect to the additional bitstream of the left video; and an exclusive OR operator that updates the left video bitstream by providing the prior bitstream of the left video of the deinterleaver to the first SISO decoder through the exclusive OR operation to improve reliability of the decoding for the left video.
Preferably, the hierarchical value may be provided to acquire bit error rate (BER) performance based on noise for each predetermined hierarchical value of left and right video bitstreams output from the first and second SISO decoders of the iterative channel decoder, set a hierarchical value having the acquired low BER performance, draw an average PSNR for each of the left video bitstream and the right video bitstream reconfigured based on the predetermined hierarchical value, and acquire, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR.
According to another aspect of the present invention, there is provided an apparatus for broadcasting a stereoscopic 3D video including: an IL-FEC channel encoding unit that generates a combined video bitstream b12 by combining compressed left video bitstream b1 and right video bitstream b2 of NAL units and performing an exclusive OR operation with respect to the left video V1 and the right video V2 collected through the plurality of cameras; an RSC encoding unit including a plurality of RSC encoders that output parity bitstreams P1 and P2 of the left video and the right video by performing RSC encoding for the compressed left video bitstream and right video bitstream, respectively; a concatenate vector generation module that generates a codeword bitstream c1 of the left video and a codeword bitstream c12 of the combined video by concatenating the left video bitstream b1, the right video bitstream b2, the parity bitstream P2 of the right video, and the combined video bitstream b12 and outputs the generated codeword bitstream c1 of the left video and codeword bitstream c12 of the combined video to a HP bitstream and a LP bitstream, respectively; and a hierarchical 16QAM mapping module that maps the received HP bitstream and LP bitstream after performing both 16 QAM and hierarchical modulation based on the hierarchical value set from the predetermined model to transmit the mapped HP bitstream and LP bitstream to the destination node in a transmission symbol vector form.
Preferably, the IL-FEC channel encoding unit may include an interleaver that combines the compressed left video bitstream and right video bitstream; and an exclusive OR operator that outputs a combined video bitstream by operating an exclusive OR with respect to the output bitstream and the right video bitstream of the interleaver.
According to yet another aspect of the present invention, there is provided an apparatus for broadcasting a stereoscopic 3D video including: a hierarchical 16QAM de-mapper that acquires an appropriate hierarchical value, performs the hierarchical 16QAM de-mapping based on the acquired hierarchical value, with respect to the reception symbol vector, and then outputs a left video bitstream LLR, a parity bitstream LLR, a combined video bitstream LLR and a parity bitstream LLR by operating the LP codeword bitstream LLR and the HP codeword bitstream LLR, respectively; an iterative channel decoder that acquires a left video stream and a right video stream by performing a predetermined number of iterative channel decoding with respect to the generated left video bitstream LLR, parity bitstream LLR, combined video bitstream LLR and parity bitstream LLR; and an encoding unit including a plurality of encoders that transmits a left video and a right video to a display device after compressing and encoding the acquired left and right video bitstreams, respectively.
Preferably, in the hierarchical 16QAM de-mapper, each operation relation of a LP codeword bitstream LLR and a HP codeword bitstream LLR drawn through logarization for the probability that the received reception symbol vector is to be transmitted is provided by the four fundamental arithmetic operations, in order to reduce operation complexity of the LP codeword bitstream LLR and the HP codeword bitstream LLR.
Preferably, the iterative channel decoder may include a first SISO decoder that generates an additional bitstream LLR by performing soft decoding by inputting the left video bitstream LLR and the parity bitstream LLR and outputs the left video bitstream; an interleaver that outputs an interleaved additional bitstream LLR by rearranging a sequence of the additional bitstream LLR of the first SISO decoder; a first LLR operator that generates a prior bitstream of the right video by combining the interleaved additional bitstream and the combined video bitstream as the input based on the LLR; a second SISO decoder that generates an additional bitstream of the left video and outputs the right video bitstream by soft decoding the prior bitstream of the right video and the parity bitstream of the combined video of the first LLR operator as the input; a second LLR operator that generates an additional bitstream of the left video by combining the additional bitstream of the right video of the second SISIO decoder and the combined video bitstream based on the LLR; a deinterleaver that generates a prior bitstream of the left video through deinterleaving and exclusive OR operation with respect to the additional bitstream of the left video; and an exclusive OR operator that updates the left video bitstream by providing the prior bitstream of the left video of the deinterleaver to the first SISO decoder through the exclusive OR operation to improve reliability of the decoding for the left video.
Preferably, the hierarchical value may be provided to acquire bit error rate (BER) performance based on noise for each predetermined hierarchical value of left and right video bitstreams output from the first and second SISO decoders of the iterative channel decoder, set a hierarchical value having the acquired low BER performance, draw an average PSNR for each of the left video bitstream and the right video bitstream reconfigured based on the predetermined hierarchical value, and acquire, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR.
According to still another aspect of the present invention, there is provided a method for broadcasting a stereoscopic 3D video including: transmitting a left video and a right video collected by a plurality of cameras in a transmission symbol vector form by performing 16QAM modulation and hierarchical modulation based on hierarchical values set from a predetermined model after compressing the left video and the right video; and acquiring the left video and the right video by performing iterative channel decoding after performing hierarchical 16QAM de-mapping based on an appropriate hierarchical value and transmitting the acquired left and right videos to a display device with respect to a reception symbol vector in which Gaussian noise is added to the transmission symbol vector at the source node side, in which the process of performing the hierarchical 16QAM de-mapping based on the predetermined hierarchical value further includes providing each operation relation of a LP codeword bitstream LLR and a HP codeword bitstream LLR drawn through logarization for the probability that the received reception symbol vector is to be transmitted as the four fundamental arithmetic operations, in order to reduce operation complexity of the LP codeword bitstream LLR and the HP codeword bitstream LLR.
Preferably, the iterative channel decoding process may include generating an additional bit stream by performing soft decoding by the first SISO decoder by inputting a left video bitstream and a parity bitstream; rearranging a sequence of the additional bitstream and then transmitting the rearranged sequence to the second SISO decoder as an input; performing soft decoding by inputting the additional bitstream rearranged in the second SISO decoder and the parity bit; generating a prior bitstream of the left video after inverse-arranging the sequence through deinterleaving for the additional bitstream; and updating the left video bitstream by providing the prior bitstream of the left video of the deinterleaver to the first SISO decoder through an exclusive OR operation to improve reliability of the decoding for the left video.
Preferably, the process of acquiring the appropriate hierarchical value may include: acquiring bit error rate (BER) performance based on noise for each predetermined hierarchical value of left and right video bitstreams output from the first and second SISO decoders; setting a hierarchical value having the acquired low BER performance; drawing an average PSNR for each of the left video bitstream and the right video bitstream reconfigured based on the predetermined hierarchical value; and acquiring, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR.
According to the present invention, the LLR operation of each of the HP bitstream and the LP bitstream is provided by the four fundamental arithmetic operations through the logarization for the probability that a video bit for the reception symbol vector is to be transmitted, thereby lowering the operation complexity of the HP bitstream LLR and the LP bitstream LLR and improving the de-mapping processing speed for the reception symbol vector y.
In addition, when performing the iterative channel decoding, the iterative channel decoding is performed by acquiring, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR, thereby displaying a high-quality stereoscopic video.
Further, the left video bitstream is updated by providing the prior bitstream of the left video to the first SISO decoder, thereby improving reliability of the decoding for the left video.
The above and other aspects, features and other advantages of the present invention will be more clearly understood from the following detailed description taken in conjunction with the accompanying drawings, in which:
Hereinafter, exemplary embodiments of the present invention will be described in more detail with reference to the accompanying drawings.
Various advantages and features of the present invention and methods accomplishing thereof will become apparent from the following description of exemplary embodiments with reference to the accompanying drawings. However, the present invention is not limited to the following exemplary embodiments but may be implemented in various different forms. The exemplary embodiments are provided only to complete disclosure of the present invention and to fully provide a person having ordinary skill in the art to which the present invention pertains with the category of the invention, and the present invention will be defined by the appended claims.
Terms used in the present specification will be briefly described and the present invention will be described in detail.
Terms used in the present invention adopt general terms which are currently widely used as possible by considering functions in the present invention, but the terms may be changed depending on an intention of those skilled in the art, a precedent, and emergence of new technology. Further, in a specific case, a term which an applicant arbitrarily selects is present and in this case, a meaning of the term will be disclosed in detail in a corresponding description part of the invention. Accordingly, a term used in the present invention should be defined based on not just a name of the term but a meaning of the term and contents throughout the present invention.
Throughout the specification and the claims, unless explicitly described to the contrary, the term “comprise” will be understood to imply the inclusion of stated elements but not the exclusion of any other elements. In addition, a term ‘unit’ used in the specification means software or a hardware component such as FPGA or ASIC and the ‘unit’ performs certain roles. However, the ‘unit’ is not a meaning limited to software or hardware. The “unit’ may be configured to be positioned in an addressable storage medium and configured to regenerate one or more processors.
Therefore, as one example, the ‘unit’ includes components such as software components, object oriented software components, class components, and task components, processes, functions, attributes, procedures, subroutines, segments of a program code, drivers, firmware, a microcode, a circuit, data, a database, data structures, tables, arrays, and variables. Functions provided in the components and ‘units’ may be joined as a smaller number of components and ‘units’ or further separated into additional components and ‘units’.
The exemplary embodiment of the present invention will be described more fully hereinafter with reference to the accompanying drawings so that those skilled in the art to which the present invention pertains can be easily implemented. In addition, a part irrelevant to the description will be omitted to clearly describe the exemplary embodiments of the present invention.
Hereinafter, a history according to an exemplary embodiment of the present invention will be described in detail with reference to the accompanying drawings.
Herein, the source node 10 may include cameras 110 and 111, H264/AVC encoders 120 and 121, an IL-FEC channel encoding unit 130, recursive systematic convolution (RSC) encoders 140 and 141, a concatenate vector generation module 150, and a hierarchical 16QAM mapping module 160.
A left video and a right video acquired from the respective cameras 110 and 111 are transmitted to the H264/AVC encoders 120 and 121, and in this case, the H264/AVC encoders 120 and 121 may compress the left video and the right video based on a predetermined video standard compression algorithm.
That is, with respect of a video signal Vk={Vk1, Vk2, . . . , Vki} included in an I frame, a compressed k-th video bitstream bk is defined as bk={bk1, bk2, . . . , bkn} Herein, k is a positive integer, 1≤i≤I, 1≤n≤N, and n is defined by a unit of an n-th network abstraction layer (hereinafter, abbreviated as an NAL). The video bitstream bk is divided into NAL units, and in this case, the total number N of NALs is equal to the number of compressed frames I. That is, N=I.
Accordingly, since the n-th NAL unit includes M bits (bkn={bkn
That is, the left video signal and the right video signal received by the cameras 100 and 101 are transmitted to the IL-FEC channel encoding unit 130 via the H264/AVC encoders 120 and 121, respectively.
In this case, the H264/AVC encoders 120 and 121 output left and right video bitstreams b1 and b2 by NAL units, respectively.
The IL-FEC channel encoding unit 130 may be provided by a universal mobile telecommunication system (UMPS) interleaver Π( ) (hereinafter, abbreviated as an interleaver) and an exclusive OR operator ⊕. As a result, the IL-FEC channel encoding unit 130 outputs a combined video bitstream b12 acquired by combining the left video bitstream b1 and the right video bitstream b2 and the combined video bitstream b12 is expressed by b12=Π(b1)⊕b2=b1′⊕b2. Herein, the combined video bitstream b12 is provided as a redundancy of the left video bitstream b1.
The left video bitstream b1 and the combined video bitstream b12 are transmitted to the RSC encoders 140 and 141, respectively, and as a result, the left video bitstream b1 and the combined video bitstream b12 are encoded by the RSC encoders 140 and 141 to output a video bitstream bk′ and a parity bitstream pk′. A series of processes of encoding the left video bitstream b1 and the combined video bitstream b12 by the RSC encoders 140 and 141 may be understood by those skilled in the art related with the exemplary embodiment of the present invention.
In addition, the encoded parity bitstream pk′ and the video bitstream bk′ output a codeword bitstream ck′ by the concatenate vector generation module 150 and in this case, the codeword bitstream ck′ is represented by ck′={ck′1, ck′2, . . . , ck′n} (k′=1, 12). That is, the codeword bitstream ck′ is defined as a concatenate vector between the parity bitstream pk′ and the video bitstream bk′, the codeword bitstream ck′ is defined as ck′=bk′pk′, and is defined as a serial concatenate for the video bitstream bk′={bk′1, bk′2, . . . , bk′n} and the parity bitstream pk′={pk′1, pk′2, . . . , pk′n}.
In addition, the encoded codeword bitstream ck′={ck′1, ck′2, . . . , ck′n} is transmitted to the hierarchical 16QAM mapping module 160 for strong error correction and the hierarchical 16QAM mapping module 160 performs the hierarchical 16QAM mapping with respect to the codeword bitstream c1 of the NAL unit. In this case, the codeword bitstreams c1 and c12 of the NAL unit are input to the hierarchical 16QAM mapping module 160 as the HP bitstream and the LP bitstream, respectively.
x
n
=K{f
(α)
16(c1(2n
Herein, xn
In addition, the parameter k is defined as a scaling factor with respect to an average value of average symbol energy. Herein, the video bitstream b1 of the NAL unit, which is the left video, is defined as the HP bitstream, and the video bitstream b2 of the NAL unit, which is the right video, is defined as the LP bitstream.
In this case, when the hierarchical value α for the HP codeword bitstream c1 mapped with the HP bitstream is increased, the LP codeword bitstream c12 mapped with the LP bitstream is decreased. Reversely, when the hierarchical value α for the LP codeword bitstream c12 is decreased, the HP codeword bitstream c1 is increased. As a result, a high-quality 3D video may not be acquired by the increased hierarchical value α of the codeword c1 due to the decrease of the LP codeword bitstream c1. As a result, it is very important that the hierarchical 16QAM mapping module 160 acquires an appropriate hierarchical value α in order to improve stereoscopic 3D video quality based on a predetermined model. The appropriate hierarchical value α is acquired based on the predetermined model for the PSNR.
In addition, a transmission symbol vector xn
As illustrated in
A reception symbol vector y arriving at the destination node 20 is transmitted to the hierarchical 16QAM de-mapper 210, the hierarchical 16QAM de-mapper 210 adds a predetermined Gaussian noise w to the received transmission symbol vector x, and the reception symbol vector y added with the Gaussian noise w is represented by Equation 2 below.
That is, when the reception symbol vector y is y={y1,y2, . . . , yn},
y=x+w (Equation 2) is satisfied.
Herein, x=∪n=1Nxn (Equation 3) is satisfied and
herein, xn={xn
In addition, the hierarchical 16QAM de-mapper 210 operates each codeword bitstream ĉkn, log-likelihood ratio (hereinafter, abbreviated as an LLR) as the reception symbol vector y of the received n-th NAL unit and the codeword bitstream ĉk
{circumflex over (f)}
(α)
16(yn)=L(ĉk′n) Equation 4
Herein, k′ a is 1 and 12, f(α)16( ) represents soft hierarchical 16QAM de-mapping function, and L( ) represents an LLR value.
In addition, when the Gaussian noise w is defined as a probability distribution function, Equation 5 below is satisfied.
Herein, μ=0 and a parameter is σ2=N0/2.
Hereinafter, a process of operating the LLR for each bit of the codeword bitstream ĉkn of the NAL unit will be described.
For example, a first video bit to a fourth video bit b1, b2, b3, and b4 in each constellation point are considered as two bits through independent 4 pulse amplitude modulation (PAM) for a real axis I and an imaginary axis Q and the mapping for the hierarchical modulation is shown in Table below.
Herein, b1=c1(2m
For hierarchical 16QAM de-mapping, a probability P(bm|y) to be transmitted with each video bit br (r=1, 2, 3, 4) for the given reception symbol vector y of the NAL unit satisfies Equation 6 below.
In this case, the maximum P(bm|y) is the same as the maximum P(y|bm) because all probabilities of the same constellation points are the same as each other.
Herein, in a period 1 of y<−d1, the HP bit b1 LLR has a low contribution ratio of a constellation point d1+d2 of a numerator and −d1+d2 of a denominator.
Accordingly, the LLR for the HP bit b1 is represented by Equation 8 below.
When both sides are summarized, the LLR for the HP bit b1 may be represented by Equation 9 below.
In addition, in Periods 2 and 3 of d1<y<+d1, since d1+d2 of the numerator and −d1−d2 of the denominator have small values enough to be negligible, the HP bit b1 LLR may be represented by Equation 10 below.
When both sides of Equation 18 are logarized and summarized, the HP bit b1 LLR is represented by Equation 11 below.
Meanwhile, in Period 4 of y≥+d1, since d1−d2 of the numerator and −d1−d2 of the denominator have small values enough to be negligible, the HP bit b1 LLR may be represented by Equation 12 below.
In addition, when both sides of the HP bit b1 LLR are logarized and summarized, the HP bit b1 LLR may be represented by Equation 13 below.
Accordingly, the HP codeword bit ĉkn, LLR L(ĉ1n)={L(c1n
Herein, 1≤n≤N, that is, n is a positive integer.
Referring to
Herein, in Periods 1 and 2 of y<0, since the constellation point d1+d2 of the numerator and di−d2 of the denominator of the LP bit b1 LLR have small values enough to be negligible, the LP bit b1 LLR is represented by Equation 16 below.
When both sides of the LP bit b2 LLR are logarized and summarized, the LP bit b2 LLR is represented by Equation 17 below.
Herein, in Periods 3 and 4 of y≥0, since the constellation point −d1+d2 of the numerator and −d1−d2 of the denominator of the LP bit b2 LLR have small values enough to be negligible, the LP bit b2 LLR is represented by Equation 18 below.
In addition, when both sides of the LP bit b2 LLR are logarized and summarized, the LP bit b2 LLR satisfies Equation 19 below.
Accordingly, the LP bit ĉkn, LLR L(ĉ12n)={L(c12n
As illustrated in Equations 14 and 20, it can be seen that the LLR operation complexity of each of the HP bit and the LP bit is low through logarization for the probability P(bm|y) that the bit br (r=1, 2, 3, 4) for the given reception symbol vector y is to be transmitted, and as a result, a de-mapping processing speed for the reception symbol vector y is improved.
In addition, since the codeword LLR L(ĉ12n) is L(ĉk′nr)={L({circumflex over (b)}k′n), L({circumflex over (p)}k′n)} and k′=1, 12, the hierarchical 16QAM de-mapper 210 outputs four de-mapped video bitstreams and four parity bitstream LLRs L({circumflex over (b)}1n), L({circumflex over (b)}12n), L({circumflex over (p)}1n) and L({circumflex over (p)}12n) and the four de-mapped video bitstreams and the parity bitstream LLRs L({circumflex over (b)}1n), L({circumflex over (b)}12n), L({circumflex over (p)}1n), and L({circumflex over (p)}12n) are transmitted to the iterative channel decoder 220.
That is, the first SISO decoder 221 outputs an additional bitstream LLR Le(1)({circumflex over (b)}1n) by inputting the video bitstream LLR L({circumflex over (b)}1n) and the parity bitstream LLR L({circumflex over (p)}1n). Herein, Le(j) is defined as extrinsic information (EI) during j-th half for an iteration.
In addition, the additional bitstream LLR Le(1)({circumflex over (b)}1n) of the first SISO decoder 221 is transmitted to the interleaver 222 and the interleaver 222 rearranges the sequence of the additional bitstream LLR Le(1)({circumflex over (b)}1n) and then outputs an interleaved additional bitstream LLR Le(1)({circumflex over (b)}1n), in which the interleaved additional bitstream LLR Le(1)({circumflex over (b)}1n) is defined as Π(Le(1)({circumflex over (b)}1n)).
As a result, the additional bitstream is generated by using the video bitstream and the parity bitstream by the first SISO decoder 221 and the interleaver 222 rearranges the sequence of the additional bitstream by using the generated additional bitstream to output the interleaved additional bitstream.
Further, the interleaved additional bitstream LLR Le(1)({circumflex over (b)}1n)′ and the combined bitstream LLR L({circumflex over (b)}12n) are transmitted to the first LLR operator 223, and the first LLR operator 223 combines the interleaved additional bitstream LLR Le(1)({circumflex over (b)}1n)′ and the combined bitstream LLR L({circumflex over (b)}12n) based on the LLR to generate a prior bitstream (Priori bit) LLR Lα(1)({circumflex over (b)}2n). Herein, with respect to the interleaved additional bitstream LLR Le(1)({circumflex over (b)}1n)′=Π(Le(1)({circumflex over (b)}1n)), an exclusive OR is defined as L({circumflex over (b)}1n)L({circumflex over (b)}12n)L({circumflex over (b)}1n⊕{circumflex over (b)}12n).
As a result, during the first half for an iteration decoding, the prior bitstream LLR Lα(1)({circumflex over (b)}2n) and the de-mapped parity bitstream LLR L({circumflex over (p)}12n) are transmitted to the second SISO decoder 224 and the second SISO decoder 224 performs soft-decoding of the received prior bitstream LLR Lα(1)({circumflex over (b)}2n) and the de-mapped parity bitstream LLR L({circumflex over (p)}12n).
Thereafter, during the second half for an iteration decoding, the SISO decoder 224 generates an additional bitstream LLR Le(2)({circumflex over (b)}2n) which is the EI for the prior bitstream LLR Lα(1)({circumflex over (b)}2n) and the additional bitstream LLR Le(2)({circumflex over (b)}2n) is transmitted to the first SISO decoder 221 as prior information via the second LLR operator 225 and the deinterleaver 226 in sequence. As a result, the prior bitstream LLR Lα(2)({circumflex over (b)}1n) transmitted to the first SISO decoder 221 is defined as Lα(2)({circumflex over (b)}1n)=Π−1(Le(2)({circumflex over (b)}2n)L({circumflex over (b)}12n)) and herein, Π−1( ) is an deinterleaver. The channel decoding process is repetitively performed predetermined times. As a result, the video bitstream LLR L({circumflex over (b)}1n) is updated by the prior bitstream LLR Lα(2)({circumflex over (b)}1n) and transmitted to the first SISO decoder 221, thereby improving reliability for the video bitstream LLR L({circumflex over (b)}1n).
In addition, video bitstreams {circumflex over (b)}1n and {circumflex over (b)}2n of the first SISO decoder 221 and the second SISO decoder 224 are output.
The prior information API of the additional bit LLR Le(1)({circumflex over (b)}1n) of the first SISO decoder 221 is transmitted to the next first SISO decoder 221 from the video bitstream LLR L({circumflex over (b)}1n) and the parity bitstream LLR L({circumflex over (p)}1n), thereby improving the reliability for the video bitstream LLR L({circumflex over (b)}1n).
As a result, in the present invention, it is required to set the appropriate hierarchical value α for improving the reliability for the video bit {circumflex over (b)}1n and the combined bit {circumflex over (b)}12n at the same time.
The video bits {circumflex over (b)}1n and {circumflex over (b)}2n of the iterative channel decoder 220 are transmitted to the H264/AVC decoders 230 and 231, respectively, and the H264/AVC decoders 231 and 232 decode the video bits {circumflex over (b)}1n and {circumflex over (b)}2n to reconfigure the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2. In this case, the decoded video bits {circumflex over (b)}1n and {circumflex over (b)}2n are defined as the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2, and the left video signal {circumflex over (V)}1 provides a legacy 2DTV (display device) service and is combined with the reconfigured right video signal {circumflex over (V)}2 to provide a 3DTV service.
Hereinafter, with respect to the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2 acquired through the iterative channel decoding for the symbol vector y received from the destination node 20, a series of processes of updating the hierarchical value α in the model constructed in the source node 10 by acquiring the appropriate hierarchical value α based on the PSNR performance in the low SNR environment will be described.
That is, the quality of the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2 reconfigured in the destination node 20 is measured by the peak signal to noise ratio (PSNR) and the stereoscopic 3D video quality is measured based on the average PSNR. The average PSNR performance is determined by bit error rate (BER) performance and the average PSNR performance is determined depending on the hierarchical value α.
Herein, the average PSNR for the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2 is represented by Equation 21 below.
Herein, k is 1 and 2, PSNR({circumflex over (V)}k,α) is defined as a k-th video PSNR of the given hierarchical value α, and PSNR({circumflex over (V)}k,α) is defined as Equation 22 below.
Herein, I represents a total number of frames and MAX represents a probable maximum pixel value in one frame. For example, in the case of 8 bits per one sample, a probable maximum pixel value is 255.
In addition, MSEk(i) is defined as a mean squared error (MSE) in an I-th frame of the k-th video and MSEk(i) satisfies Equation 23 below.
Herein, ori and rec represent an original frame and a reconfigured frame, and Wand H represent a height and a width of the frame.
If such an average PSNR exceeds the target PSNR and the PSNR of each video does not reach a minimum required value, a low-quality stereoscopic 3D video is viewed. For example, when the target average PSNR is set as 30 dB, a sum of the left and right PSNRs needs to exceed 60 dB to reach the target average PSNR. If the left and right PSNRs are 40 dB and 20 dB, respectively, the target average PSNR is satisfied, but the quality of the stereoscopic 3D video is gradually deteriorated because the right video has a low PSNR.
As a result, the left video and right video average PSNRs need to exceed the minimum required PSNR, and the sum of the left video and right video average PSNRs needs to acquire the hierarchical value α exceeding the target average PSNR from a predetermined model.
Accordingly, the average PSNR for the left and right video signals {circumflex over (V)}1 and {circumflex over (V)}2 needs to satisfy Equations 24 to 26 below.
Herein, NRAvg({circumflex over (V)}α)≥PSNRAvg+({circumflex over (V)}α),∀α Equation 25
PSNR({circumflex over (V)}k,α)≥PSNR−({circumflex over (V)}k,α),k=1,2,∀α Equation 26
Herein, PSNRAvg+ and PSNR− represent the target average PSNR of the left and right videos and the minimum required PSNR, respectively.
In this case, the left video and right video average PSNRs need to exceed the minimum required PSNR and the sum of the left video and right video average PSNRs a model for acquiring the hierarchical value α in which the sum of the left video and right video average PSNRs exceeds the target average PSNR is constructed as follows.
That is, when the hierarchical value α is between 0.1 to 0.2, the BER performance of the LP bit is excessively decreased and the LP bit LLR extracted by the network decoding causes the decrease in performance, and it may be represented by a Relation Equation 27 below.
L({circumflex over (b)}1n)({circumflex over (b)}12n)=L({circumflex over (b)}2n) Equation 27
As a result, when the minimum required PSNR is set to 28 dB and the target average PSNRs are set to 33 dB and 38 dB, respectively, the PSNR of 28 dB represents the low quality of the reconfigured video, the PSNR of 38 dB represents the high quality of the reconfigured video, and the PSNR of 33 dB is considered as intermediate quality of the reconfigured video.
However, it can be seen that even though the hierarchical value α is increased further than the minimum required PSNR of the left and right videos, the average PSNRs of the left and right videos do not reach the target average PSNR.
(b) is an exemplary diagram showing PSNR performance when Eb/N0=6.5 dB, and it can be seen that most of hierarchical values α except for 0.1 to 0.3 satisfy the minimum required PSNR, while the hierarchical values α of 0.2 to 0.6 with respect to the PSNR of the right video is satisfied. As illustrated in the average PSNR of
Accordingly, it can be seen that when Eb/N0=6 dB to 6.5 dB, the hierarchical value α is 0.5 to 0.6.
Meanwhile, referring to (b), when Eb/N0=6.4 dB, the hierarchical values α of 0.525 and 0.55 reach the target average PSNR and have the same performance even when the target average PSNR is 38 dB. Similarly, when the target average PSNR is set to 33 dB, as illustrated in
As a result, in order to acquire the high-quality stereoscopic video in a low SNR environment, the hierarchical values α of 0.525 and 0.55 are selected, and in order to acquire the intermediate-quality stereoscopic video in a low SNR environment, the hierarchical value α of 0.5 is selected.
Accordingly, it can be seen that the LLR operation complexity of each of the HP bitstream and the LP bitstream is low through logarization for the probability P(bm|y) that the video bit br (r=1, 2, 3, 4) for the given reception symbol vector y is to be transmitted, and as a result, a de-mapping processing speed for the reception symbol vector y is improved.
Further, the prior information API of the additional bit LLR Le(1)({circumflex over (b)}1n) of the first SISO decoder 221 is transmitted to the next first SISO decoder 221 from the video bitstream LLR L({circumflex over (b)}1n) and the parity bitstream LLR L({circumflex over (p)}1n), thereby improving the reliability for the video bitstream LLR L({circumflex over (b)}1n).
Meanwhile, according to another aspect of the present invention, a method for broadcasting a stereoscopic 3D video may include a transmission step of transmitting a left video and a right video collected by a plurality of cameras in a transmission symbol vector form by performing 16QAM modulation and hierarchical modulation based on hierarchical values set from a predetermined model after compressing the left video and the right video; and a reception step of acquiring the left video and the right video by performing iterative channel decoding after performing hierarchical 16QAM de-mapping based on an appropriate hierarchical value and transmitting the acquired left and right videos to a display device with respect to a reception symbol vector in which Gaussian noise is added to the transmission symbol vector at a source node side. The process of performing the hierarchical 16QAM de-mapping based on the predetermined hierarchical value may include providing each operation relation of a LP codeword bitstream LLR and a HP codeword bitstream LLR drawn through logarization for the probability that the received reception symbol vector is to be transmitted as the four fundamental arithmetic operations, in order to reduce operation complexity of the LP codeword bitstream LLR and the HP codeword bitstream LLR.
Further, the iterative channel decoding process may include generating an additional bit stream by performing soft decoding by the first SISO decoder by inputting a left video bitstream and a parity bitstream; rearranging a sequence of the additional bitstream and then transmitting the rearranged sequence to the second SISO decoder as an input; performing soft decoding by inputting the additional bitstream rearranged in the second SISO decoder and the parity bit; generating a prior bitstream of the left video after inverse-arranging the sequence through deinterleaving for the additional bitstream; and updating the left video bitstream by providing the prior bitstream of the left video of the deinterleaver to the first SISO decoder through an exclusive OR operation to improve reliability of the decoding for the left video.
In addition, the process of acquiring the appropriate hierarchical value may include acquiring bit error rate (BER) performance based on noise for each predetermined hierarchical value of left and right video bitstreams output from the first and second SISO decoders; setting a hierarchical value having the acquired low BER performance; drawing an average PSNR for each of the left video bitstream and the right video bitstream reconfigured based on the predetermined hierarchical value; and acquiring, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR. The detailed abstraction will be omitted as a function in which each step of the method for broadcasting the stereoscopic 3D video is performed in the source node 10 and the destination node 20 described above.
According to the apparatus and the method for broadcasting the stereoscopic 3D video, the LLR operation of each of the HP bitstream and the LP bitstream is provided by the four fundamental arithmetic operations through the logarization for the probability that a video bit for the reception symbol vector is to be transmitted, thereby lowering the operation complexity of the HP bitstream LLR and the LP bitstream LLR and improving the de-mapping processing speed for the reception symbol vector y. When performing the iterative channel decoding, the iterative channel decoding is performed by acquiring, as the appropriate hierarchical value, a hierarchical value in which a sum of the drawn left video bitstream and right video bitstream exceeds a predetermined target average PSNR and each average PSNR has a predetermined minimum required PSNR, thereby displaying a high-quality stereoscopic video. In addition, the left video bitstream is updated by providing the prior bitstream of the left video to the first SISO decoder, thereby improving reliability of the decoding for the left video. Therefore, the present invention is the industrially available invention in that it is possible to make a very great improvement in terms of performance efficiency as well as accuracy of the operation and reliability, a possibility that the broadcasting system is commercially available is sufficient and the present invention may be practically clearly implemented.
A number of exemplary embodiments have been described above. Nevertheless, it should be understood that various modifications may be made to these exemplary embodiments. For example, suitable results may be achieved if the described techniques are performed in a different order and/or if components in a described system, architecture, device, or circuit are combined in a different manner and/or replaced or supplemented by other components or their equivalents. Therefore, the scope of the present invention should not be limited to the exemplary embodiment and should be defined by the appended claims to be described below and equivalents to the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2017-0142410 | Oct 2017 | KR | national |