Method and apparatus for frame coding in vertical raster scan order for HEVC

Information

  • Patent Grant
  • 10165276
  • Patent Number
    10,165,276
  • Date Filed
    Monday, May 1, 2017
    7 years ago
  • Date Issued
    Tuesday, December 25, 2018
    6 years ago
Abstract
A method and apparatus for frame coding in adaptive raster scan order. The method includes encoding at least one of image or video utilizing input frames and at least one of a data related to the input frame to produce bitstream with raster scan order information and displacement information for producing compressed video bitstream, at decoding time, decoding at least one of the encoded bitstream with raster scan order information and displacement information for producing compressed video bitstream.
Description
BACKGROUND OF THE INVENTION

Field of the Invention


Embodiments of the present invention generally relate to a method and apparatus for frame coding in vertical raster scan for HEVC.


Description of the Related Art


HEVC is a video coding standard being standardized to improve coding efficiency by 50% over H264/AVC. To achieve this goal, lots of new coding tools have been proposed for HEVC. However, all the proposed coding tools assume the horizontal raster scan of micro blocks, which has two problems: 1) Horizontal raster scan is not always effective in prediction especially for intra coding and 2) with sliding window memory for motion estimation (ME) and motion compensation (MC), large on-chip memory is required to compensate for large vertical motion. This is especially a challenging for UHD (ultra high definition) videos, such as, 4k×2k and 8k×4k. While UHD video coding is one of the application areas of HEVC, the frame coding in horizontal scan order is not cost effective at all for UHD videos with high vertical motion.


Therefore, there is a need for a method and/or apparatus for frame coding for HEVC.


SUMMARY OF THE INVENTION

Embodiments of the present invention relate to a method and apparatus for frame coding in adaptive raster scan order. The method includes encoding at least one of image or video utilizing input frames and at least one of a data related to the input frame to produce bitstream with raster scan order information and displacement information for producing compressed video bitstream, at decoding time, decoding at least one of the encoded bitstream with raster scan order information and displacement information for producing compressed video bitstream.





BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above recited features of the present invention can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.



FIG. 1 is an embodiment of a frame coding in horizontal raster scan order;



FIG. 2 is an embodiment of a frame coding in vertical raster scan order;



FIG. 3 is an embodiment of a vertical sliding window scheme for horizontal raster coding order;



FIG. 4 is an embodiment of a horizontal sliding window scheme for vertical raster coding order;



FIG. 5 is an embodiment depicting horizontal bottom-right to left;



FIG. 6 is an embodiment of a vertical right-bottom to top;



FIG. 7 is an embodiment of a portable device with video camera and motion sensor;



FIG. 8 is an embodiment of a video stabilizer in accordance with the prior art; and



FIG. 9 is an embodiment of an improved video stabilizer.





DETAILED DESCRIPTION

The proposed method and apparatus add a vertical raster scan order to the HEVC standard and adaptively select the best coding scan order between the horizontal and vertical raster scan orders based on any criteria, such as, size of horizontal/vertical motion, coding efficiency, on-chip memory saving and etc.


The vertical raster scan is effective for the frames which have lots of vertical discontinuities. FIG. 1 is an embodiment of a frame coding in horizontal raster scan order. FIG. 2 is an embodiment of a frame coding in vertical raster scan order. In FIG. 1 and FIG. 2, the frame with vertical discontinuities with horizontal and vertical raster scan orders are shown, respectively. When we apply the horizontal raster scan order, as shown in FIG. 1, more bits are used to encode micro block mode. This is due to incorrect mode prediction caused by discontinuity. However, the number of bits is reduced when we apply the vertical raster scan order, as shown in FIG. 2. Hence, the coding gain will be more significant for intra frames. The result indicates that the vertical raster scan can reduce bitrates up to 4% for some sequences, even without adaptive horizontal/vertical raster scan order decision. Thus, adaptive decision will result in even higher bit-rate saving.



FIG. 3 is an embodiment of a vertical sliding window scheme for horizontal raster coding order. If a video contains large vertical motion, the vertical sliding window scheme may not cover the large vertical motion. In this scenario, inter mode may not be used because required reference area is not available. This eventually causes encoding efficiency degradation. However, this problem could be solved by employing a horizontal sliding window scheme with vertical raster coding order, as illustrated in FIG. 4. FIG. 4 is an embodiment of a horizontal sliding window scheme for vertical raster coding order. Utilizing adaptively choosing raster scan order improves the coding efficiency of inter frames at the same on-chip memory requirement. On the other hand, the required on-chip memory size without coding gain loss is reduced.


For horizontal and vertical search range srX*srY, on-chip memory size (byte) for sliding window for 8-bit luma can be calculated as follows:

MemSizeHorOrder=picWidth*(2*srY+N),
MemSizeVertOrder=picHeight*(2*srX+N).


MemSizeHorOrder and MemSizeVertOrder are on-chip memory sizes for vertical sliding window (horizontal raster coding order) and horizontal sliding window (vertical raster coding order), respectively. N*N is the largest coding unit, and picWidth and picHeight are the horizontal and vertical size of the picture. Based on the equations, Table 1 lists the search ranges for different available on-chip memory sizes for 4k×2k (3840×2160) videos. For a given on-chip memory size, srY is always limited in vertical sliding window, which potentially causes encoding efficiency degradation for large vertical motion videos as illustrated in FIG. 3. By introducing vertical coding order with horizontal sliding window, we can remove the limitation on srY and avoid the degradation for large vertical motion videos.









TABLE 1







Available on-chip memory sizes vs. search ranges











500 KBytes
750 Kbytes
1,000 KBytes














srX
srY
srX
srY
srX
srY





Vertical
unlim-
+/−33
unlim-
+/−65
Unlim-
+/−98


sliding
ited

ited

ited



window








Horizontal
+/−83
unlim-
+/−141
Unlim-
+/−199
unlim-


sliding

ited

ited

ited


window









The idea of vertical raster scan can be realized with frame rotation. Hence, similar effect may be seen by rotating input frames with horizontal raster scan. This information can be simply added as SEI (Supplemental Enhancement information) or VUI (Video Usability Information) in bitstreams. Moreover, we can extend this idea to any arbitrary raster scan order. We can apply one of 8 different raster can orders for each frames; horizontal top-left to right, as shown in FIG. 1, horizontal top-right to left, horizontal bottom-left to right, horizontal bottom-right to left, as shown in FIG. 5, vertical left-top to bottom FIG. 2, vertical left-bottom to bottom, vertical right-top to bottom and vertical right-bottom to top FIG. 6. It also can be realized by modifying input frames, i.e., e.g. rotating and/or flipping using SEI or VUI.



FIG. 7 is an embodiment of a portable device with video camera and motion sensor. The motion sensor consists one of the following sensors or a combination of them: accelerometer, gyroscope, magnetosensor etc. The motion sensor provided the camera orientation or displacement information. The displacement can be specified in terms of a rotation matrix, quarternion, euler angle etc.



FIG. 8 and FIG. 9 show a video stabilization where camera displacement information can be used. Video stabilization compensates for jitter in video due to changing camera position in order to provide a steady video. FIG. 8 is an embodiment of a video stabilizer in accordance with the prior art. In FIG. 8, the video stabilization is carried out in decoder. The camera displacement parameters are deduced from decoded frames and are used to translates/rotate/warp the decoded frame to generate stabilized video frames. Calculation of camera displacement parameters from decoded frames is very computationally intensive and prone to incorrect estimation.


On the other hand, FIG. 9 is an embodiment of an improved video stabilizer. The camera displacement information is calculated in the transmitter by using motion sensors. The displacement information is encapsulated in SEI/VUI and transmitted inside compressed bitstreams to receiver. The decoder at the receiver decodes the compressed bitstream with camera displacement SEI/VUI and generates decoded frames and displacement information which is then passed on to the video stabilization module which carries out video stabilization.


While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.

Claims
  • 1. A method comprising: decoding an encoded bitstream that includes displacement information for producing decoded video frames, wherein the displacement information is embedded in at least one of: a picture header; a supplemental enhancement information (SEI); or a video usability information (VUI) message.
  • 2. The method of claim 1, wherein the encoded bitstream further includes raster scan order information.
  • 3. The method of claim 2, wherein the raster scan order information is embedded in at least one of the SEI and VUI message.
  • 4. The method of claim 1, further comprising modifying the decoded video frames using the displacement information to generate stabilized video frames.
  • 5. The method of claim 4, wherein the displacement information is calculated in a transmitter using motion sensors.
  • 6. An apparatus for frame coding, comprising: a decoder to decode an encoded bitstream that includes displacement information for producing decoded video frames, wherein the displacement information is embedded in at least one of: a picture header; a supplemental enhancement information (SEI); or a video usability information (VUI) message.
  • 7. The apparatus of claim 6 further comprising a stabilizer for stabilizing the decoded frames based on the displacement information.
  • 8. The apparatus of claim 6, wherein the encoded bitstream further includes raster scan order information.
  • 9. The apparatus of claim 8, wherein the raster scan order information is embedded in at least one of the SEI and VUI message.
  • 10. The apparatus of claim 6, wherein the decoder further modifies the decoded video frames using the displacement information to generate stabilized video frames.
  • 11. The apparatus of claim 10, wherein the displacement information is calculated in a transmitter using motion sensors.
  • 12. A non-transitory computer readable medium comprising instructions that, when executed by a decoder, cause the decoder to: decode an encoded bitstream with displacement information for producing decoded video frames, wherein the displacement information is embedded in at least one of: a picture header; a supplemental enhancement information (SEI); or a video usability information (VUI) message.
  • 13. The non-transitory computer readable medium of claim 12, wherein the encoded bitstream further includes raster scan order information.
  • 14. The non-transitory computer readable medium of claim 13, wherein the raster scan order information is embedded in at least one of the SEI and VUI message.
  • 15. The non-transitory computer readable medium of claim 13, wherein the instructions that, when executed by the decoder, further cause the decoder to modify the decoded video frames using the displacement information to generate stabilized video frames.
  • 16. The non-transitory computer readable medium of claim 13, wherein the displacement information is calculated in a transmitter using motion sensors.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of U.S. patent application Ser. No. 14/561,816, filed Dec. 5, 2014, which is a continuation of U.S. patent application Ser. No. 13/250,806, filed Sep. 30, 2011, now U.S. Pat. No. 8,934,729, which claims the benefit of U.S. Provisional Patent Application Ser. No. 61/388,478 filed Sep. 30, 2010, and which claims the benefit of U.S. Provisional Patent Application Ser. No. 61/474,435, filed Apr. 12, 2011, all of which are incorporated herein by reference in their entirety.

US Referenced Citations (9)
Number Name Date Kind
3571700 Paine et al. Mar 1971 A
5687095 Haskell Nov 1997 A
6278272 Scarzello et al. Aug 2001 B1
7298141 Bartington Nov 2007 B2
7391210 Zhang et al. Jun 2008 B2
7391211 Cripe Jun 2008 B2
8339133 Teppan Dec 2012 B2
20060209206 Wang et al. Sep 2006 A1
20100254617 Hwang et al. Oct 2010 A1
Non-Patent Literature Citations (9)
Entry
Bazzocchi, et al. “Interference rejection algorithm for current measurement using magnetic sensor arrays”, Sensors and Actuators 85, 2000, pp. 38-41 (4 pages).
Dezuari, et al. “Printed circuit board integrated fluxgate sensor”, Sensors and Actuators 81, 2000, pp. 200-203 (4 pages).
Tang, et al. “Excitation circuit for fluxgate sensor using saturable inductor”, Sensors and Actuators A 113, 2004, pp. 156-165 (10 pages).
Ripka, et al. “Pulse Excitation of Micro-Fluxgate Sensors”, IEEE Transactions on Magnetics, vol. 37, No. 4, Jul. 2001, pp. 1998-2000 (3 pages).
Drljaca et al., “Low-Power 2-D Fully Integrated CMOS Fluxgate Magnetometer”, IEEE Sensors Journal, vol. 5, No. 5, Oct. 2005, pp. 909-915 (7 pages).
Choi et al., “The Microfluxgate Magnetic Sensor Having Closed Magnetic Path”, IEEE Sensors Journal, vol. 4, No. 6, Dec. 2004, pp. 768-771 (4 pages).
Minhua Zhou “Sub-Picture Based Raster Scan Coding Order for HEVC UHD Video Coding,” JCTVC-B062, Jul. 21-28, 2010, Geneva, Switzerland (5 pages).
Kwon et al. “Frame Coding in Vertical Scan Order,” JCTVC-C224, Oct. 7, 2010, Guangzhou, China (4 pages).
Prosecution History in U.S. Appl. No. 14/561,816, filed Dec. 5, 2014, dated from Dec. 5, 2014 to Apr. 13, 2017 (136 pages).
Related Publications (1)
Number Date Country
20170237984 A1 Aug 2017 US
Provisional Applications (2)
Number Date Country
61474435 Apr 2011 US
61388478 Sep 2010 US
Continuations (2)
Number Date Country
Parent 14561816 Dec 2014 US
Child 15583460 US
Parent 13250806 Sep 2011 US
Child 14561816 US