The present invention relates generally to the field of video processing, and particularly relates to an anti-shake method for a panoramic video and a portable terminal.
CMOS and CCD sensors are two types of image sensors that are currently commonly used. Both use photodiodes for photoelectric conversion to convert images into digital data. The main difference is the way in which digital data is transmitted.
The charge data of each pixel in each line of the CCD sensor will be transferred to the next pixel in turn, and be outputted from the bottom thereof, then be amplified and outputted by the amplifier at the edge of the sensor; while in the CMOS sensor, each pixel will be adjacent to one amplifier and an A/D conversion circuit, and data is outputted in a way similar to a memory circuit.
Generally, CMOS cameras use rolling shutter mode to obtain fisheye images through progressive exposure.
CMOS chips share the workload through many parallel A/Ds, but the entire array in the sensor must be converted one line at a time, which results in a small time delay between the readings of each line. Each individual line is usually able to start the exposure of the next frame when the readout of the previous frame is completed. Although the speed is very fast, the time delay between the readings of each line will be converted into the delay between the start of the exposure of each line, exposure of each line will not happen at the same time. The result is that each line in the frame will be exposed for the same time length, but the exposure will start at a different time point. Two frames are allowed to overlap exposure, and the final frame rate depends on the speed at which the scrolling readout process can be completed.
Such exposure mode causes a time difference between different lines of the same frame of image. If a handheld panoramic camera is used to shoot, when moving at high speeds, a jelly effect will occur due to the progressive exposure of the CMOS camera. The jelly effect is a problem that has not been well solved, especially for a VR panoramic video, and the anti-shake effect of video is poor.
The problem solved by the present invention is to provide an anti-shake method for a panoramic video, a computer-readable storage medium and a portable terminal, which aims to solve the jelly effect caused by the shaking of the picture due to the rolling shutter mode of the CMOS chip, and the poor anti-shake effect of the camera.
According to a first aspect, the present invention provides an anti-shake method for a panoramic video, comprises steps of:
obtaining, in real time, an output video frame, a fisheye image corresponding to the output video frame, a timestamp of the pixel in the video frame, and a corresponding camera gyroscope timestamp;
synchronizing the timestamp of the pixel in the video frame with the corresponding camera gyroscope timestamp, and calculating a rotation matrix of the camera movement in the camera gyroscope timestamp;
smoothing the camera movement and establishing a coordinate system of a smooth trajectory;
correcting the fisheye image distortion; and
rendering the fisheye image by means of forward rendering to generate a stable video.
According to a second aspect, the present invention provides an anti-shake method for a panoramic video, comprises steps of:
S201, obtaining, in real time, an output video frame, a fisheye image corresponding to the output video frame, a timestamp of the pixel in the video frame, and a corresponding camera gyroscope timestamp;
S202, synchronizing the timestamp in the video frame to the camera gyroscope acquisition system, and calculating a rotation matrix of the camera movement in the timestamp of the pixel in the current video frame as the approximate rotation matrix of the camera when collecting the current pixel;
S203, projecting the output video frame pixels to the corresponding grid points of the spherical model, and using the approximate rotation matrix to rotate the corresponding grid points of the spherical model to the grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the grid points in the sensor coordinate system and the pixels in the fisheye image to obtain approximate reverse-mapping pixels;
S204, calculating the camera gyroscope timestamp of the approximate reverse-mapping pixels in the fisheye image, obtaining the accurate rotation matrix of the camera when collecting the current pixel, and using the accurate rotation matrix to rotate the corresponding grid points of spherical model again to obtain the second grid points in the sensor coordinate system;
S205, using the fisheye image distortion correction method to establish the relationship between the second grid points in the sensor coordinate system and the pixels in the fisheye image to obtain accurate pixels in the fisheye image, and using the mapping relationship between the output video frame pixels and the accurate pixels in the fisheye image, rendering the fisheye image in reverse rendering to generate a stable video.
According to a third aspect, the present invention provides a computer-readable medium that stores a computer program or computer programs, which when being executed by a processor or processors, cause the processor or processors to perform steps of the above-mentioned anti-shake method for a panoramic video.
According to a fourth aspect, the present invention provides a portable terminal, comprising:
one or more processors;
a memory; and
one or more computer programs where the one or more computer programs are stored in the memory and are configured to be executed by the one or more processors, and when executed by the one or more processors, cause the one or more processors to perform the above-mentioned anti-shake method for a panoramic video.
In the present invention, based on the timestamp of the pixel in the video frame, the fisheye image is pixel by pixel converted into the coordinate system of the smooth trajectory using the rotation matrix of the camera motion under the timestamp of the current video frame pixel in real time, and then rendered and output; thereby the distortion of the rolling shutter of the panoramic video sequence can be corrected. The invention can correct the image distortion caused by the CMOS rolling shutter and eliminate the jelly effect, and achieve better video image anti-shake effect;
Further, in the present invention, according to the timestamp of the video frame pixel, calculating the rotation matrix of the camera movement under the timestamp of the current video frame pixel as the approximate rotation matrix of the camera when collecting the current pixel, projecting the output video frame pixels to the corresponding grid points of the spherical model, and using the approximate rotation matrix to rotate the corresponding grid points of the spherical model to the grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the grid points in the sensor coordinate system and the pixels in the fisheye image to obtain approximate reverse mapping pixels, calculating the camera gyroscope timestamp of the approximate reverse-mapping pixels in the fisheye image, obtaining the accurate rotation matrix of the camera when collecting the current pixel, and using the accurate rotation matrix to rotate the corresponding grid points of spherical model again to obtain the second grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the second grid points in the sensor coordinate system and the pixels in the fisheye image to obtain accurate pixels in the fisheye image, and using the mapping relationship between the output video frame pixels and the accurate pixels in the fisheye image, rendering the fisheye image in reverse rendering to generate a stable video. Thereby, the distortion of the rolling shutter of the panoramic video sequence can be corrected. The invention can correct the image distortion caused by the CMOS rolling shutter and eliminate the jelly effect, thereby achieve a better video image anti-shake effect.
The foregoing objects, technical solutions and advantages of the invention will be much clearer from the following detail description taken with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention and are not intended to limit the present invention.
In order to explain the technical solutions of the present invention, the following will be described by specific embodiments.
Referring to
S101, obtaining, in real time, an output video frame, a fisheye image corresponding to the output video frame, a timestamp of the pixel in the video frame, and a corresponding camera gyroscope timestamp;
in the first embodiment of the present invention, S101 may specifically comprise the following steps:
S1011, obtaining the output video frame and the fisheye image corresponding to the output video frame;
S1012, obtaining the timestamp tframek
S1013, obtaining the gyroscope timestamp tgyrok
S102, synchronizing the timestamp of the pixel in the video frame with the corresponding camera gyroscope timestamp, and calculating a rotation matrix of the camera movement in the camera gyroscope timestamp;
referring to
S1021, calculating the timestamp tframek
in formula (1), tframek
S1022, synchronizing the timestamp of the pixel in the k video frame with the camera gyroscope timestamp, a conversion relationship between the two as formula (2):
t
gyro
k
=ΔT
0
+t
frame
k
(2)
S1023, calculating a rotation matrix of the camera movement in the camera gyroscope timestamp at time tgyrok
R
w2s(tgyrok
in formula (3), Rw2g(tgyrok
S103, smoothing the camera movement and establishing a coordinate system of a smooth trajectory;
in the first embodiment of the present invention, S103 may specifically comprise the following steps:
S1031, where Ps representing the 3D coordinates in the camera coordinate system, let t=tgyrok
P
t
s
=R
w2s(t)·Ptw+{right arrow over (t0)} (4)
in the formula (4), Rw2s(t) is a matrix of the camera coordinate system relative to the world coordinate system at time t, and {right arrow over (t0)} is a translation amount of the two frames before and after of the video frame of the camera;
S1032, smoothing the camera movement, where setting {right arrow over (t0)}=0 and then obtaining
S1033, establishing the coordinate system Pt′ of the smooth trajectory, performing a 3D grid rendering processing on the coordinate system Pt′ of the camera's smooth trajectory by OpenGl Rendering formula: Ptgl=K□Rnvp□Pt′; where K is a perspective matrix, Rnvp is a matrix of the movement direction manually controlled, and Ptgl is the 3D coordinates rendered by OpenGl in the coordinate system of the smooth trajectory; and
S1034, in the coordinate system of the smooth trajectory, obtaining the minimum square difference of any pixel in Pt′ in the two frames before and after by a formula: min(Pt+1′−Pt′)2+(Pt′−Ptw)2, where Pt′ and Pt+1′ are the coordinate systems of the smooth trajectory of the two frames before and after, respectively; setting Pt′=Pt+1′=Ptw, and obtaining the rendering formula as formula (5):
P
t
gl
=K□R
nvp
□R
w2s(t)
−1
□P
t
s (5)
calculating the 3D coordinates of the camera rendered to the coordinate system of the smooth trajectory by formula (5).
S104, correcting the fisheye image distortion.
In the first embodiment of the present invention, S104 may specifically comprise:
where let
obtaining a panoramic expanded image of Ps in the camera coordinate system:
in formula (6), u and v are the abscissa and ordinate of the panoramic expanded image in the camera coordinate system, respectively;
a fisheye distortion model ρ=f(θ)=θ(1+k0θ2+k1θ4+k2θ6+k3θ8), where θ is an incident angle of the light, k0, k1, k2, and k3 are the camera calibration coefficients, obtaining the corrected coordinates (x, y) of the fisheye image:
in formula (7), r=√{square root over (Pxs2+Pys2)}, θ=atan(r, Pzs); the point (xc, yc) is a projection center of the sensor, x and y are the 2D coordinates of the projection position of the camera sensor respectively.
S105, rendering the fisheye image by means of forward rendering to generate a stable video.
In the first embodiment of the present invention, S105 may specifically comprise the following steps of:
establishing a correspondence between the fisheye image and the grid points of the spherical model in the camera coordinate system by means of the fisheye image distortion correction method;
generating the grid points of the spherical model in the coordinate system of the smooth trajectory by rendering; and
performing panoramic expansion of the grid points of the spherical model in the coordinate system of the smooth trajectory to generate a stable video. Where, specifically, color information can be input by interpolation, and projection can generate perspective images or asteroid images.
Referring to
A1 corresponds to A2, where A1 is the coordinate in the fisheye image, A2 corresponds to A1, is the coordinate of the grid point of the spherical model in the camera coordinate system; A3 corresponds to A2, is the coordinate of the grid point of the spherical model in the coordinate system of the smooth trajectory generated by rendering; and A4 is the coordinate point generated by the panoramic expansion projection of A3;
Similarly, B1 corresponds to B2, where B1 is the coordinate in the fisheye image, B2 corresponds to B1, is the coordinate of the grid point of the spherical model in the camera coordinate system; B3 corresponds to B2, is the coordinate of the grid point of the spherical model in the coordinate system of the smooth trajectory generated by rendering; and B4 is the coordinate point generated by the panoramic expansion projection of B3.
In the first embodiment of the present invention, based on the timestamp of the pixel in the video frame, the fisheye image is pixel by pixel converted into the coordinate system of the smooth trajectory using the rotation matrix of the camera motion under the timestamp of the current video frame pixel in real time, and then rendered and output; thereby the distortion of the rolling shutter of the panoramic video sequence can be corrected. The invention can correct the image distortion caused by the CMOS rolling shutter and eliminate the jelly effect, and achieve better video image anti-shake effect.
Referring to
S201, obtaining, in real time, an output video frame, a fisheye image corresponding to the output video frame, a timestamp of the pixel in the video frame, and a corresponding camera gyroscope timestamp;
in the second embodiment of the present invention, S201 may specifically comprise steps of:
S2011, obtaining the output video frame and the fisheye image corresponding to the output video frame;
S2012, obtaining the timestamp tframek
S2013, obtaining the gyroscope timestamp tgyrok
S202, synchronizing the timestamp of the pixel in the video frame to the camera gyroscope acquisition system, and calculating the rotation matrix of the camera movement in the timestamp of the pixel in the current video frame as the approximate rotation matrix of the camera when collecting the current pixel;
in the second embodiment of the present invention, S202 may specifically include the following steps:
S2021, synchronizing the timestamp of the pixel in the video frame to the camera gyroscope acquisition system, the conversion relationship between the two as shown in formula (8):
t
gyro
k
=ΔT
0
+t
frame
k (8)
in formula (8), ΔT0=tgyrok−tframek, tgyrok is the camera gyroscope timestamp at the beginning of the k video frame, and tframek is the timestamp of the pixel at the beginning of the k video frame;
S2022, calculating a rotation matrix of the camera movement under the timestamp of the pixel in the current video frame at time tgyrok by formula (9):
R
w2s(tgyrok)=Rw2g(tgyrok)·Rg2s(tgyrok) (9)
in formula (9), Rw2g(tgyrok) is the rotation matrix measured by the gyroscope sensor at time tgyrok, Rw2g(tgyrok) is a 3*3 matrix; and Rg2s(tgyrok) is the rotation matrix from the calibrated gyroscope coordinate system to the camera.
S203, projecting the output video frame pixels to the corresponding grid points of the spherical model, and using the approximate rotation matrix to rotate the corresponding grid points of the spherical model to the grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the grid points in the sensor coordinate system and the pixels in the fisheye image to obtain approximate reverse-mapping pixels;
in the second embodiment of the present invention, S203 may specifically be:
where let
obtaining a panoramic expanded image of Ps in the camera coordinate system:
where u and v are the abscissa and ordinate of the panoramic expanded image in the camera coordinate system, respectively;
a fisheye distortion model ρ=f(θ)=θ(1+k0θ2+k1θ4+k2θ6+k3θ8), where θ is an incident angle of the light, k0, k1, k2, and k3 are the camera calibration coefficients, obtaining the corrected coordinates (x, y) of the fisheye image:
where r=√{square root over (Pxs2+Pys2)} and θ=atan(r, Pzs); the point (xc, yc) is a projection center of the sensor, x and y are the 2D coordinates of the projection position of the camera sensor respectively.
S204, calculating the camera gyroscope timestamp of the approximate reverse-mapping pixels in the fisheye image, obtaining the accurate rotation matrix of the camera when collecting the current pixel, and using the accurate is rotation matrix to rotate the corresponding grid points of spherical model again to obtain the second grid points in the sensor coordinate system;
in the second embodiment of the present invention, the step of calculating the camera gyroscope timestamp of the approximate reverse-mapping pixels in the fisheye image, obtaining the accurate rotation matrix of the camera motion when collecting the current pixel, specifically is:
using the approximate reverse-mapping pixels in the fisheye image to calculate the camera gyroscope timestamp of the pixel by the formula:
in formula (10), Δt is a sampling time interval of the progressive scan of the video frame, and H is the number of lines of the image;
further calculating the accurate rotation matrix Rw2s(tgyrok
R
w2s(tgyrok
S205, using the fisheye image distortion correction method to establish the relationship between the second grid points in the sensor coordinate system and the pixels in the fisheye image to obtain accurate pixels in the fisheye image, and using the mapping relationship between the output video frame pixels and the accurate pixels in the fisheye image, rendering the fisheye image in reverse rendering to generate a stable video.
Referring to
projecting the output video frame pixel A4 to the corresponding grid point A3 of the spherical model, using the approximate rotation matrix to rotate A3 to obtain A2 in the sensor coordinate system, using the fisheye image distortion correction method to project A2 onto the fisheye image to obtain the pixel A1, using the accurate rotation matrix to rotate A1 to obtain A2 in the sensor coordinate system, and then using the fisheye image distortion correction method to project A2 to the fisheye image to obtain the accurate pixel A′1; using the mapping relationship between the output video frame pixel A4 and the accurate pixel A′1 in the fisheye image to reverse map and render;
Similarly, projecting the output video frame pixel B4 to the corresponding grid point B3 of the spherical model, using the approximate rotation matrix to rotate B3 to obtain B2 in the sensor coordinate system, using the fisheye image distortion correction method to project B2 onto the fisheye image to obtain pixel B1, using the accurate rotation matrix to rotate B1 to obtain B′2 in the sensor coordinate system, and then using the fisheye image distortion correction method to project B2 to the fisheye image to obtain the accurate pixel B′1; using the mapping between the output video frame pixel B4 and the accurate pixel B1 in the fisheye image to reverse map and render;
It should also be noted that the perspective view generated by the projection or the color information of the asteroid image is input by interpolation, and finally a stable video can be generated.
In the second embodiment of the present invention, according to the timestamp of the video frame pixel, calculating the rotation matrix of the camera movement under the timestamp of the current video frame pixel as the approximate rotation matrix of the camera when collecting the current pixel, projecting the output video frame pixels to the corresponding grid points of the spherical model, and using the approximate rotation matrix to rotate the corresponding grid points of the is spherical model to the grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the grid points in the sensor coordinate system and the pixels in the fisheye image to obtain approximate reverse-mapping pixels, calculating the camera gyroscope timestamp of the approximate reverse-mapping pixels in the fisheye image, obtaining the accurate rotation matrix of the camera when collecting the current pixel, and using the accurate rotation matrix to rotate the corresponding grid points of spherical model again to obtain the second grid points in the sensor coordinate system; using the fisheye image distortion correction method to establish the relationship between the second grid points in the sensor coordinate system and the pixels in the fisheye image to obtain accurate pixels in the fisheye image, and using the mapping relationship between the output video frame pixels and the accurate pixels in the fisheye image, rendering the fisheye image in reverse rendering to generate a stable video. Thereby, the distortion of the rolling shutter of the panoramic video sequence can be corrected.
The invention can correct the image distortion caused by the CMOS rolling shutter and eliminate the jelly effect, thereby achieve a better video image anti-shake effect.
The third embodiment of the present invention provides a computer-readable storage medium that stores a computer program or computer programs, which when being executed by a processor or processors, cause the processor or processors to perform steps of the anti-shake method for a panoramic video provided in the first or second embodiment of the present invention.
A person of ordinary skill in the art may understand that all or part of the steps in the method of the above-mentioned embodiments can be implemented by a program or programs instructing relevant hardware. The program or programs can be stored in a computer-readable storage medium, and the storage media can be, such as ROM/RAM, magnetic disk, optical disk, etc.
The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modification, equivalent replacements and improvement made within the spirit and principle of the present invention shall be included in the protection of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
201810217087.6 | Mar 2018 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2019/078329 | 3/15/2019 | WO | 00 |