The present invention relates to a video image processing apparatus and a method for controlling a video image processing apparatus.
Currently, three-dimensional (3D) video images are broadcast (3D broadcasting). For example, the channel BS 11, which is a Japanese digital satellite broadcasting channel, performs 3D broadcasting by the “side-by-side method” in which a video image for a single screen is transmitted by dividing the image into a left eye video image and a right eye video image. In addition, the “polarization method” and the “active shutter method” are known as typical display methods for 3D video images. The active shutter method is also referred to as the time-division display method, the frame sequential method, or the like.
In the case of the polarization method, a receiver for 3D broadcasts includes a film having polarization properties on a display panel, and carries out an interlaced display of left eye video images and right eye video images. A viewer wears polarization glasses and views the display lines for the left eye video images with his/her left eye, and views the display lines for the right eye video images with his/her right eye, thereby enabling stereoscopic perception of video images.
In the case of the active shutter method, the 3D broadcasting receiver alternately displays left eye video images and right eye video images. A viewer wears glasses including a liquid-crystal shutter. The glasses control the liquid-crystal shutter such that the left eye video image enters his/her left eye only and the right eye video image enters his/her right eye only. In this manner, a viewer can perceive video images stereoscopically.
Incidentally, a phenomenon called burn-in may occur on a display panel. Burn-in is a phenomenon in which light-emitting devices degrade as a result of being lit for a long time, leading to a condition in which their light-emitting function is not performed properly. In particular, in the case where a still image or a moving image including a still region is displayed for a long time, such an image may remain on the display panel like an afterimage due to burn-in.
As described above, burn-in is caused by degradation of the light-emitting devices, and may occur both with two-dimensional (2D) video images and 3D video images. Several techniques for suppressing burn-in are currently known.
Japanese Patent Laid-Open No. 2000-338947 discloses a technique for mitigating burn-in by a method known as “pixel shifting”, which, when an input image does not change for a certain amount of time, changes the display size of the image to shift the display position thereof.
Japanese Patent Laid-Open No. 2004-264366 discloses a technique in which the input format of an image is detected and interpolation is carried out in accordance with the display size. Then, burn-in is mitigated by shifting, by one pixel per prescribed amount of time, the display position of the image in horizontal and vertical directions within a prescribed pixel range, while suppressing visual distractions caused by the shifting of the image.
Japanese Patent Laid-Open Nos. 2000-338947 and 2004-264366 aim at applying pixel shifting to 2D video images, but do not consider the case in which video images include left eye video images and right eye video images. Consequently, when processing such as conventional pixel shifting is applied to 3D video images, there is a possibility that the 3D video image is perceived with a stereoscopic effect different from that originally intended by its creator, for the following reasons.
For example, in the case of the polarization method, if the pixels for the left eye video image are shifted to the positions of the pixels for the right eye video image, and the pixels for the right eye video image are shifted to the positions of the pixels for the left eye video image, the parallax between the right and left eyes is inverted. Also in the case of the active shutter method, the parallax between both video images fluctuates when the shift amount of the pixels for the left eye video image differs from that for the right eye video image, or when the shift timings for the pixels of these video images differ from each other. With a 3D video image, viewers perceive video images stereoscopically because parallax (binocular parallax) is provided between the left eye video image and the right eye video image. However, if the parallax for which a video image was designed is changed by burn-in control, it is anticipated that the 3D video image becomes blurred or, in a worse case, fails to be perceived as a 3D video image. That is, conventional burn-in mitigation techniques have not given consideration to the issue of burn-in in 3D video images.
The present invention has been achieved in consideration of these circumstances, and provides a technique for mitigating burn-in for 3D video images.
According to the first aspect of the present invention, there is provided a video image processing apparatus that outputs a video image to a display device having a display region including a first display portion for displaying a left eye image and a second display portion for displaying a right eye image, comprising: video image processing means for cutting out a part of frame images from a three-dimensional video image configured as an aggregate of the frame images for each scene, a single frame image being formed by arranging a pair of a left eye image and a right eye image corresponding to the same scene in a prescribed pattern; and output means for outputting the partial image cut out by the video image processing means to the display device, wherein the video image processing means, when changing the cutout position of the partial image, cuts out the partial image from the frame images such that an arrangement pattern of the left eye image and the right eye image in the partial image after the cutout position is changed matches an arrangement pattern of the first display portion and the second display portion in the display region.
According to the second aspect of the present invention, there is provided a video image processing apparatus that outputs a video image to a display device, comprising: video image processing means for cutting out a part of a left eye frame image and a right eye frame image from a three-dimensional video image configured as an aggregate of left and right frame image sets for each scene, a single left and right frame image set being formed of a pair of a left eye frame image and a right eye frame image corresponding to the same scene, and output means for outputting the partial image cut out by the video image processing means to the display device, wherein the video image processing means executes processing for changing a cutout position of the partial image, and the video image processing means executes the processing for changing the cutout position at a timing in which, in the three-dimensional video image, a left and right frame image set corresponding to the same scene is switched to the next left and right frame image set.
According to the third aspect of the present invention, there is provided a method for controlling a video image processing apparatus that outputs a video image to a display device having a display region including a first display portion for displaying a left eye image and a second display portion for displaying a right eye image, comprising: a video image processing step of cutting out a part of frame images from a three-dimensional video image configured as an aggregate of the frame images for each scene, a single frame image being formed by arranging a pair of a left eye image and a right eye image corresponding to the same scene in a prescribed pattern; and an outputting step of outputting the partial image cut out in the video image processing step to the display device, wherein in the video image processing step, when changing the cutout position of the partial image, the partial image is cut out from the frame image such that an arrangement pattern of the left eye image and the right eye image in the partial image after the cutout position is changed matches an arrangement pattern of the first display portion and the second display portion in the display region.
According to the fourth aspect of the present invention, there is provided a method for controlling a video image processing apparatus that outputs a video image to a display device, comprising: a video image processing step of cutting out a part of a left eye frame image and a right eye frame image from a three-dimensional video image configured as an aggregate of left and right frame image sets for each scene, a single left and right frame image set being formed of a pair of a left eye frame image and a right eye frame image corresponding to the same scene, and an outputting step of outputting the partial image cut out in the video image processing step to the display device, wherein in the video image processing step, processing for changing a cutout position of the partial image is executed, and in the video image processing step, the processing for changing the cutout position is executed at a timing in which, in the three-dimensional video image, a left and right frame image set corresponding to the same scene is switched to the next left and right frame image set.
With the above configuration, the present invention can provide a technique for mitigating burn-in for 3D video images.
Further features of the present invention will become apparent from the following description of exemplary embodiments with reference to the attached drawings.
Embodiments of the present invention will now be described with reference to the attached drawings. It should be noted that the technical scope of the present invention is defined by the claims, and is not limited by the embodiments described below. In addition, not all combinations of the features described in the embodiments are necessarily required for realizing the present invention.
In the following embodiments, a case is described in which a video image processing apparatus of the present invention is applied to a broadcast receiver; however, application examples are not limited thereto. It is also possible to apply a video image processing apparatus of the present invention to a video image reproduction apparatus, such as a DVD player.
First Embodiment
In the first embodiment, a case is described in which the polarization method is used as the display method for three-dimensional video images (3D video images). Accordingly, the 3D video image is constituted as an aggregate of pairs of a left eye image and a right eye image corresponding to the same scene.
The display panel 101 receives and displays video images output by a broadcast receiver. The display panel 101 may be of any type, such as a liquid crystal display, a plasma display, or an SED (Surface-conduction Electron-emitter Display).
A polarization filter 102 is attached on the display panel 101 (display device). As shown in
A RAM 303 is a volatile memory, and is used as a work memory of the CPU 301. The RAM 303 is used also as a temporary storage area for various data. In particular, a 2D pixel shifting table is stored in a first memory area 304 and a 3D pixel shifting table is stored in a second memory area 305.
Note that a computation expression for determining the display start position, for example, may be used instead of the table. Although the number of occurrences of the trigger signal in
A tuner unit 306 demodulates broadcast waves received via an antenna (not shown) and outputs video and audio baseband data. A demultiplexer unit 307 performs frame disassembly on the data received from the tuner unit 306 so as to separate video data, audio data and program information data.
The video data and the audio data separated by the demultiplexer unit 307 are input to a decoder unit 308. The decoder unit 308 decodes MPEG-2 coded video data and inputs the decoded data to a 3D video image generation unit 310. The decoder unit 308 also decodes MPEG-2 AAC-coded audio data to convert the audio data into a linear PCM format, and inputs the converted data to an audio processing unit 315.
The program information data separated by the demultiplexer unit 307 is input to a video image determination unit 309. The program information data is transmitted in a data structure defined by the standard, “Service Information for Digital Broadcasting System”, by the Association of Radio Industries and Businesses (ARIB), for example. Main constituent data of such a structure includes an SDT, which carries information regarding program channels, and an EIT, which carries information regarding a program such as the name or broadcast time of the program, the type of components to be transmitted, or the like. Note that SDT is an abbreviation of Service Description Table, and EIT is an abbreviation of Event Information Table.
In the present embodiment, a 3D video image format is additionally defined in the currently standardized video image component type (component_type). The video image determination unit 309 then determines, based on the component type included in the EIT, whether the format of the video data being decoded by the decoder unit 308 corresponds to a two-dimensional video image (2D video image) or a 3D video image, and notifies the 3D video image generation unit 310 of the determination result.
The 3D video image generation unit 310 performs processing to convert the video image input by the decoder unit 308 into a 3D video image if the video image format notified by the video image determination unit 309 indicates a 3D video image. Specifically, the 3D video image generation unit 310 processes a frame image constituting the video image input from the decoder unit 308, such that the left eye image is displayed in the odd numbered lines and the right eye image is displayed in the even numbered lines, as shown in
The display position control unit 311 determines, based on the determination result by the video image determination unit 309, a display region (prescribed region) in the image whose resolution has been converted by the resolution conversion unit 313 (described later). Then, the display position control unit 311 inputs to a video image processing unit 314 an image (partial image) cut out from the determined display region. Specifically, the display start position of the video image is determined by using the 2D pixel shifting table stored in the first memory area 304 in the case of the 2D video image, while using the 3D pixel shifting table stored in the second memory area 305 in the case of the 3D video image. If the image size of the video image is 1920×1080 for example, an image whose size is 1900×1070 is cut out starting from the display start position and the cutout image is input to the video image processing unit 314. Accordingly, pixel shifting can be understood as processing for shifting the region of the image cut out and output to the display panel 101 in accordance with the display start position. Of course, processing of merely shifting the display position without cutting out the image also corresponds to pixel shifting processing.
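As an illustration only, the cutout described above can be sketched in Python as follows. The function name cut_out, the representation of a frame as a list of pixel rows, and the treatment of the display start position as a 1-based [x, y] pair are assumptions made for this sketch and are not part of the embodiment.

```python
def cut_out(frame, start, out_w=1900, out_h=1070):
    """Return the out_w x out_h partial image whose top-left pixel is `start`.

    `frame` is a list of pixel rows (assumed representation).
    `start` is a 1-based [x, y] display start position (assumed convention).
    """
    x0, y0 = start[0] - 1, start[1] - 1  # convert to 0-based indices
    if x0 < 0 or y0 < 0:
        raise ValueError("display start position must be at least [1, 1]")
    if y0 + out_h > len(frame) or x0 + out_w > len(frame[0]):
        raise ValueError("cutout region falls outside the frame")
    # Slice the rows first, then the columns of each selected row.
    return [row[x0:x0 + out_w] for row in frame[y0:y0 + out_h]]
```

Under these assumptions, a 1900×1070 cutout from a 1920×1080 frame admits display start positions from [1, 1] to [21, 11].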
A frame number measurement unit 312 counts the number of frames output to the display panel, and sends the burn-in mitigation trigger signal to the display position control unit 311 when the count number has reached a prescribed number. The display position control unit 311, upon receipt of the burn-in mitigation trigger signal, updates the display start position based on the pixel shifting table according to the video image format.
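The frame-count trigger can be pictured with the following Python sketch. The class name FrameCounter, the method on_frame, and the choice to reset the count after each trigger are hypothetical; the embodiment only states that a trigger signal is sent when the count reaches a prescribed number.

```python
class FrameCounter:
    """Counts output frames and reports when a trigger should be issued."""

    def __init__(self, frames_per_trigger):
        if frames_per_trigger < 1:
            raise ValueError("frames_per_trigger must be positive")
        self.frames_per_trigger = frames_per_trigger
        self.count = 0      # frames counted since the last trigger
        self.triggers = 0   # number of triggers issued so far

    def on_frame(self):
        """Register one output frame; return True when a trigger is issued."""
        self.count += 1
        if self.count >= self.frames_per_trigger:
            self.count = 0  # assumption: counting restarts after a trigger
            self.triggers += 1
            return True
        return False
```

The running value of triggers in this sketch corresponds to the number of occurrences of the trigger signal that is used to look up the pixel shifting table.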
Note that the condition for generating the burn-in mitigation trigger signal is not limited to the number of frames. For example, a trigger signal may be generated in the case where a change of channel is detected based on the above-described program information data, or in the case where the channel is changed in response to a user operation via a remote controller 317 or the like.
The resolution conversion unit 313 performs resolution conversion on the video image input from the 3D video image generation unit 310 so as to comply with the display panel 101. In this manner, a 2D video image or a 3D video image as a target for the pixel shifting processing is obtained. The resolution conversion unit 313 sends the video image that has undergone resolution conversion to the display position control unit 311.
The video image processing unit 314 performs γ processing or the like on the image (partial image) input from the display position control unit 311, thereafter performs conversion into video image data complying with the display panel 101, such as an LVDS signal for example, and outputs the video image data.
The audio processing unit 315 performs digital-to-analog (D/A) conversion processing or the like on audio data, and outputs the converted audio data to a speaker 316. The remote controller 317 sends a control signal as an infrared signal in response to a user operation. The CPU 301 generates various commands, control signals or the like for controlling the broadcast receiver 300 from the infrared signal received by an infrared light receiving unit 318, and outputs the commands, control signals or the like.
Note that in the present embodiment, for the purpose of description, the left eye image is displayed in the odd numbered lines of the display panel 101, while the right eye image is displayed in the even numbered lines of the display panel 101 (see
In step S503, the display position control unit 311 reads out the 2D pixel shifting table from the RAM 303. In step S504, the display position control unit 311 acquires a corresponding display start position from the 2D pixel shifting table, based on the number of occurrences of the trigger signal (specifically, how many trigger signals have occurred up to the current processing of the flowchart).
Meanwhile, in step S505, the display position control unit 311 reads out the 3D pixel shifting table from the RAM 303. Then in step S506, the display position control unit 311 acquires a corresponding display start position from the 3D pixel shifting table based on the number of occurrences of the trigger signal.
In step S507, the display position control unit 311 cuts out an image based on the display start position acquired in step S504 or S506, and inputs the cutout image to the video image processing unit 314.
Note that the format of the burn-in mitigation trigger signal may be defined such that the display position control unit 311 can identify the cause of the burn-in mitigation trigger signal. In this case, once the process for determining the format type of the video image (steps S501 and S502) has been performed, this process can be omitted thereafter unless the burn-in mitigation trigger signal is generated by a cause that may accompany a change in the video image format (for example, a change of channel).
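The table lookup of steps S503 to S506 can be sketched in Python as follows. The table contents, the names TABLE_2D, TABLE_3D and display_start, and the wrap-around when the number of occurrences exceeds the table length are all hypothetical placeholders chosen for illustration; the occurrence number n is taken to be 1-based.

```python
# Hypothetical pixel shifting tables: entry n-1 holds the [x, y] display
# start position for the n-th occurrence of the trigger signal.
TABLE_2D = [[1, 1], [2, 1], [2, 2], [1, 2]]
TABLE_3D = [[1, 1], [2, 1], [2, 3], [1, 3]]


def display_start(is_3d, n):
    """Return the display start position for the n-th trigger (n >= 1)."""
    if n < 1:
        raise ValueError("n counts trigger occurrences and starts at 1")
    table = TABLE_3D if is_3d else TABLE_2D  # S505 or S503
    # Assumption: the table is reused cyclically once it is exhausted.
    return table[(n - 1) % len(table)]       # S506 or S504
```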
As described above, with the present embodiment, the broadcast receiver 300 shifts the region of the video image output on the display panel 101 in order to mitigate burn-in on the display panel 101. In the case where the video image is a 3D video image, the broadcast receiver 300 shifts the region such that the pixels for the left eye image are displayed in the positions of the pixels for the left eye image on the display panel 101, and the pixels for the right eye image are displayed in the positions of the pixels for the right eye image on the display panel 101. In this manner, a technique for mitigating burn-in for the 3D video image is provided.
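The constraint on the 3D shift can be checked with a small Python sketch. It assumes, for illustration, that frame lines are numbered from 1, that odd lines carry the left eye image and even lines the right eye image, and that the cutout is displayed starting at the first panel line; the helper names are hypothetical.

```python
def interleaved_frame(height, width):
    """Build a frame whose odd lines (1-based) are 'L' and even lines 'R'."""
    return [["L" if y % 2 == 1 else "R"] * width for y in range(1, height + 1)]


def crop_rows(frame, start_y, out_h):
    """Cut out out_h lines beginning at the 1-based line start_y."""
    return frame[start_y - 1:start_y - 1 + out_h]


def matches_panel(partial):
    """True when odd panel lines show 'L' and even panel lines show 'R'."""
    return all(
        row[0] == ("L" if k % 2 == 1 else "R")
        for k, row in enumerate(partial, start=1)
    )
```

Under these assumptions, a cutout that starts on an odd line keeps each eye's pixels on that eye's panel lines, whereas a cutout that starts on an even line swaps them, which corresponds to the parallax inversion described earlier.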
Second Embodiment
In a second embodiment, a case is described in which the active shutter method is employed as the method for displaying 3D video images.
A decoder unit 701 includes a first decoder 702 that decodes left eye images and a second decoder 703 that decodes right eye images, and decodes MPEG-2 coded video data input from the demultiplexer unit 307 and inputs the decoded video data into a decoder switching unit 704. Also, the decoder unit 701 decodes MPEG-2 AAC-coded audio data to convert the audio data into a linear PCM format, and inputs the converted data to the audio processing unit 315.
The decoder switching unit 704 alternately inputs the left eye images and the right eye images input from the decoder unit 701 to the resolution conversion unit 313. Using two decoders in this manner enables display of video images at a higher frame rate.
A display position control unit 705 determines the display region of the image whose resolution has been converted by the resolution conversion unit 313, based on whether the image is the left eye image or the right eye image and on the determination result by the video image determination unit 309. The partial image cut out from the determined display region is then input to the video image processing unit 314. When determining the display region, the display position control unit 705 causes the display start positions for a pair of a left eye image and a right eye image to match.
If the video data is a 3D video image, a glasses control unit 706 controls the liquid crystal shutter of glasses 707, in synchronization with the timing at which the left eye images and the right eye images are alternately displayed, such that the left eye image enters only the left eye and the right eye image enters only the right eye.
A display panel 708 differs from the display panel 101 in the first embodiment, and does not include a polarization film. A first memory area 709 holds the 2D pixel shifting table and a second memory area 710 holds the 3D pixel shifting table. However, unlike the first embodiment, since the left eye image and the right eye image do not coexist in a single frame, design of the 3D pixel shifting table is not restricted by the arrangement of the polarization film.
In the second embodiment, it is assumed that the pixel shifting table is configured such that the display start position is shifted by two pixels or more for a single trigger signal, for example, N={1, 2, . . . }={[1,1], [7,1], . . . }. In this case, if the display start position is shifted by the entire shift amount at once, when one frame is changed to the next frame, such shifting is noticeable to viewers, which may be perceived as a disruption in the video image. For this reason, with the present embodiment, the display position control unit 705 gradually shifts the display start position over several frames, from the display start position before the trigger signal is generated (e.g., [1, 1]) to the display start position after the trigger signal is generated (e.g., [7, 1]).
In step S801, the display position control unit 705 obtains a display start position corresponding to the number of occurrences of the trigger signal from the 2D pixel shifting table, and sets the position as a target shift position. In step S802, the display position control unit 705 shifts the display start position by a prescribed unit amount (e.g., one pixel). For example, if the display start position before shifting is [1, 1] and the target shift position is [7, 1], the display start position after shifting is [2, 1].
In step S803, the display position control unit 705 cuts out an image based on the display start position obtained after shifting in step S802, and inputs the cutout image to the video image processing unit 314. In step S804, the display position control unit 705 determines whether or not the display start position has reached the target shift position. If the target shift position has not been reached, the procedure returns to step S802, and if the target shift position has been reached, the processing ends.
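The gradual shift of steps S802 to S804 can be sketched in Python as follows. The function names are hypothetical, and two details are assumptions of this sketch: both coordinates move toward the target in the same frame, and no shift is performed when the display start position already equals the target.

```python
def step_toward(current, target, unit=1):
    """Move each coordinate of `current` toward `target` by at most `unit`."""
    def move(c, t):
        if c < t:
            return min(c + unit, t)
        if c > t:
            return max(c - unit, t)
        return c
    return [move(current[0], target[0]), move(current[1], target[1])]


def shift_path(start, target, unit=1):
    """List the display start position used for each successive frame."""
    pos, goal = list(start), list(target)
    path = []
    while pos != goal:                      # S804: target reached?
        pos = step_toward(pos, goal, unit)  # S802: shift by the unit amount
        path.append(pos)                    # S803: cut out at this position
    return path
```

For the example in the text, shifting from [1, 1] to [7, 1] by one pixel per frame takes six frames, the first of which uses [2, 1].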
Meanwhile, in step S805, the display position control unit 705 obtains the display start position corresponding to the number of occurrences of the trigger signal from the 3D pixel shifting table, and sets the position as the target shift position. In step S806, the display position control unit 705 determines whether or not a video image as the processing target is the left eye image. If the video image is the left eye image, the procedure proceeds to step S807, and if it is not the left eye image, the procedure proceeds to step S808.
In step S807, the display position control unit 705 shifts the display start position by a prescribed unit amount (e.g., one pixel). For example, if the display start position before shifting is [1, 1] and the target shift position is [5, 1], the display start position after shifting is [2, 1].
In step S808, the display position control unit 705 cuts out an image based on the display start position obtained after shifting in step S807, and inputs the image to the video image processing unit 314. In step S809, the display position control unit 705 determines whether or not the display start position has reached the target shift position. If the target shift position is not reached, the procedure returns to step S806, and if the target shift position is reached, processing ends.
As understood from steps S806 to S809, when the processing target video image is the right eye image, the display start position is not shifted, so the display start positions of a pair of the left eye image and the right eye image that correspond to the same scene match. As a result, the parallax does not change between the left eye image and right eye image that correspond to the same scene.
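The behavior of steps S806 to S809 can be sketched in Python as follows: the display start position advances only when a left eye image is processed, and the following right eye image reuses it. The function names and the encoding of the frame sequence as "L" and "R" labels are assumptions made for illustration.

```python
def advance(value, goal, unit=1):
    """Move one coordinate toward its goal by at most `unit` pixels."""
    if value < goal:
        return min(value + unit, goal)
    if value > goal:
        return max(value - unit, goal)
    return value


def start_positions(eyes, start, target, unit=1):
    """Return (eye, display start position) for each frame in `eyes`."""
    pos, goal = list(start), list(target)
    result = []
    for eye in eyes:
        # S806/S807: shift only on left eye frames, until the target is reached.
        if eye == "L" and pos != goal:
            pos = [advance(pos[0], goal[0], unit), advance(pos[1], goal[1], unit)]
        # S808: the cutout for this frame uses the current position.
        result.append((eye, tuple(pos)))
    return result
```

In this sketch every left eye frame and the right eye frame that follows it share one display start position, so the pair is cut out from the same region.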
Note that setting the shift amount to 0 with respect to the coordinate values for the 3D pixel shifting table means stopping pixel shifting processing. If a motion vector between frames is determined to be large by the decoder unit 701, the motion of an object in the video image can be considered to be large. If pixel shifting is performed in this case, a significant disruption in the video image appears and the 3D effect is reduced. Accordingly, the decoder unit 701 divides a frame into a plurality of regions, references the motion vector for each region, and notifies the display position control unit 705 of the average motion amount of the object in the frame.
If the received average motion amount exceeds a prescribed threshold, the display position control unit 705 operates with the shift amount replaced by “0” with respect to the coordinate values in the 3D pixel shifting table, even in the case where the burn-in mitigation trigger signal is generated. As a result, a reduction of the 3D effect can be avoided. Note that in this case, since the motion of the object in the video image itself is large, burn-in that may bring about an afterimage does not readily occur.
As described above, according to the present embodiment, the broadcast receiver 700 shifts the region of the video image to be output to the display panel 708 so as to mitigate burn-in on the display panel 708. In the case where the video image is a 3D video image, the broadcast receiver 700 shifts the region such that the display start positions of the left eye image and the right eye image corresponding to the same scene match. Accordingly, fluctuation in the parallax between the left eye image and the right eye image corresponding to the same scene is prevented. In this manner, a burn-in mitigation technique for 3D video images is provided.
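The motion-based suppression can be sketched in Python as follows. Measuring the motion amount of a region as the magnitude of its motion vector, and the names average_motion and effective_target, are assumptions of this sketch; the embodiment does not specify how the average is computed.

```python
import math


def average_motion(vectors):
    """Average magnitude of the per-region motion vectors (dx, dy)."""
    if not vectors:
        raise ValueError("at least one region is required")
    return sum(math.hypot(dx, dy) for dx, dy in vectors) / len(vectors)


def effective_target(current, table_target, avg_motion, threshold):
    """Return the target shift position, or `current` when shifting is suppressed.

    Keeping the current position is equivalent to a shift amount of 0.
    """
    if avg_motion > threshold:
        return list(current)
    return list(table_target)
```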
Other Embodiments
Aspects of the present invention can also be realized by a computer of a system or apparatus (or devices such as a CPU or MPU) that reads out and executes a program recorded on a memory device to perform the functions of the above-described embodiment(s), and by a method, the steps of which are performed by a computer of a system or apparatus by, for example, reading out and executing a program recorded on a memory device to perform the functions of the above-described embodiment(s). For this purpose, the program is provided to the computer for example via a network or from a recording medium of various types serving as the memory device (e.g., computer-readable medium).
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.
This application claims the benefit of Japanese Patent Application No. 2009-134298, filed on Jun. 3, 2009, which is hereby incorporated by reference herein in its entirety.
Number | Date | Country | Kind |
---|---|---|---|
2009-134298 | Jun 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2010/056894 | 4/13/2010 | WO | 00 | 11/28/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/140430 | 12/9/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5541642 | Ashbey | Jul 1996 | A |
5717415 | Iue et al. | Feb 1998 | A |
5786848 | Isobe et al. | Jul 1998 | A |
6075556 | Urano et al. | Jun 2000 | A |
6549295 | Fantone et al. | Apr 2003 | B1 |
20020009137 | Nelson et al. | Jan 2002 | A1 |
20060001601 | Ono | Jan 2006 | A1 |
20060209054 | Someya et al. | Sep 2006 | A1 |
20070096767 | Tsai | May 2007 | A1 |
20100039428 | Kim et al. | Feb 2010 | A1 |
20100171817 | Tourapis et al. | Jul 2010 | A1 |
20100328428 | Booth et al. | Dec 2010 | A1 |
Number | Date | Country |
---|---|---|
1774735 | May 2006 | CN |
1801269 | Jul 2006 | CN |
1555814 | Jul 2005 | EP |
1679683 | Jul 2006 | EP |
2000-338947 | Dec 2000 | JP |
2004-264366 | Sep 2004 | JP |
2004-267442 | Sep 2004 | JP |
2006-013913 | Jan 2006 | JP |
2008150111 | Dec 2008 | WO |
Entry |
---|
Chinese Office Action issued in corresponding application No. 201080024839.9 on Nov. 4, 2014. |
Japanese Office Action dated Sep. 28, 2012 for corresponding application No. 2009-134298. |
Number | Date | Country | |
---|---|---|---|
20120075438 A1 | Mar 2012 | US |