PANORAMIC VIDEO ANTI-SHAKE METHOD AND PORTABLE TERMINAL

Information

  • Patent Application
  • 20220060626
  • Publication Number
    20220060626
  • Date Filed
    September 30, 2019
    4 years ago
  • Date Published
    February 24, 2022
    2 years ago
  • Inventors
  • Original Assignees
    • ARASHI VISION INC.
Abstract
The present invention is applicable to the field of videos, and provides a panoramic video anti-shake method and a portable terminal. The method comprises: obtaining world coordinates of any reference point in a world coordinate system in real time, and simultaneously obtaining camera coordinates corresponding to the reference point in the portable terminal and an angular velocity value of a current state of a gyroscope in the portable terminal; smoothing the motion of a camera using an extended Kalman filter; decomposing the smoothed motion to synthesize virtual lens motion of a free lens mode, and calculating the rotation amount of the virtual lens; and re-projecting an original video to generate a stable video.
Description
TECHNICAL FIELD

The present disclosure relates to the field of panoramic video, and particularly to a panoramic video anti-shake method and a portable terminal.


BACKGROUND

At present, when a panoramic video is shot, it is usually to hold a panoramic shooting device in hand to shoot. When a mobile shooting is taken, a phenomenon of jitter occurs in the panoramic shooting video due to unstable hands. When a panoramic view angle is acquired, the focus of the original lens is often lost due to the movement or shaking of the camera, which affects the viewing experience of the panoramic video. One of the current solutions is to use a pan-tilt to stabilize the panoramic shooting device to stabilize the pictures taken. However, the disadvantage is that the pan-tilt is more expensive and the volume is generally larger, and the pan-tilt does not completely solve the problem of picture jitter when shooting the video with a handheld panoramic shooting device.


When a panoramic video viewer wants to watch from a viewing angle of an original motion direction of the video, and the impact of artificial shaking on the video can be avoided, it is necessary to enable the video to keep the viewing angle changing in the original shooting direction and keep the video stable. Accordingly, it is necessary to study a panoramic video anti-shake method that can retain the change state of the original shooting direction of the shooting device.


SUMMARY
Technical Problem

The purpose of the present disclosure is to provide a panoramic video anti-shake method, a computer-readable storage medium and a portable terminal, and is intended to solve the problem of the loss of the original lens focus when the viewing angle of the panoramic video is acquired. The method can, through decomposing the motion of the camera, retain the original shooting angle of the camera, synthesize the motion of the virtual lens and generate a stable video.


Technical Solution

In the first aspect, the present disclosure provides a panoramic video anti-shake method, which includes:


acquiring a world coordinate of any one reference point in a world coordinate system in real time, and simultaneously acquiring a camera coordinate corresponding to the reference point in a portable terminal and an angular velocity value of a gyroscope in the portable terminal in a current state;


smoothing a motion of the camera by using an extended Kalman filter;


decomposing the smoothed motion, synthesizing a motion of a virtual lens in a free lens mode, and calculating a rotation quantity of the virtual lens;


re-projecting an original video according to the rotation quantity of the virtual lens and a rotation matrix by which the camera coordinate is transformed to the world coordinate, to generate a stable video.


In the second aspect, the present disclosure provides a computer-readable storage medium, on which a computer program is stored, the computer program, when executed by a processor, implements the steps of the above-mentioned panoramic video anti-shake method.


In the third aspect, the present disclosure provides a portable terminal, which includes:


one or more processors;


a memory; and


one or more computer programs, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, wherein the processor, when executing the computer programs, implement the steps of the above-mentioned panoramic video anti-shake method.


Advantages


In the present disclosure, by decomposing the motion of the camera and retaining the synthesized motion of the virtual lens at the original shooting angle of the camera, the stable video can be generated. Therefore, this method can keep smooth motion of the rendering lens, generate the stable video, and retain the original shooting angle of the camera, which has strong robustness to large noise scenes and most sports scenes.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a flow chart showing a panoramic video anti-shake method according to an embodiment I of the present disclosure.



FIG. 2 is a schematic structure diagram of a portable terminal according to an embodiment III of the present disclosure.





DETAILED DESCRIPTION

In order to make the objectives, technical solution, and advantages of the present disclosure clearer, the present disclosure will be described in detail with reference to the accompanying drawings and embodiments. It should be appreciated that the specific embodiments described here are only used for explaining the present disclosure, rather than limiting the present disclosure.


In order to illustrate the technical solution of the present disclosure, specific embodiments are used for description below.


Embodiment I

Referring to FIG. 1, the panoramic video anti-shake method provided by the Embodiment 1 of the present disclosure includes the following steps.


S101: a world coordinate of any one reference point in a world coordinate system is acquired in real time, and a camera coordinate corresponding to the reference point in a portable terminal and an angular velocity value of a gyroscope in the portable terminal in a current state are simultaneously acquired.


In the embodiment I of the present disclosure, S101 can specifically be as follows.


The world coordinate of the reference point is Pw, and the camera coordinate is Pc, the following relationship is specifically included:






P
w
R
w2c
P
c  (1)


In the formula (1),







R

w

2

c


=

[




r

1

1





r

1

2





r

1

3







r

2

1





r

2

2





r

2

3







r

3

1





r

3

2





r

3

3





]





is a rotation matrix by which the camera coordinate is transformed to the world coordinate; the elements r11 to r33 are elements of the rotation matrix Rw2c,













[




r
11




r
12




r
13






r
21




r
22




r
23






r
31




r
32




r
33




]

T

·





[




r
11




r
12




r
13






r
21




r
22




r
23






r
31




r
32




r
33




]

=







[




r
11




r
12




r
13






r
21




r
22




r
23






r
31




r
32




r
33




]

·


[




r
11




r
12




r
13






r
21




r
22




r
23






r
31




r
32




r
33




]

T


=
I


,







I is the unit matrix.


The step of acquiring the angular velocity value of the gyroscope in the portable terminal in real time specifically includes: an angular velocity sensor is adopted to read a three-axis angular velocity value as wk.


S102: an extended Kalman filter is utilized to smooth a motion of the camera.


The extended Kalman filter algorithm linearizes the nonlinear system, and then performs Kalman filter. The Kalman filter is a high-efficiency recursive filter which can estimate a state of a dynamic system from a series of measurements that do not completely contain noises.


In the embodiment I of the present disclosure, S102 can specifically be as follows.


The extended Kalman filter algorithm is utilized to establish a state model and an observation model of a motion state of the camera; specifically:


the state model is:









{







q
~

k

=


Φ


(


w
~


k
-
1


)


·


q
~


k
-
1











w
~

k

=


w
~


k
-
1






;





(
2
)







the observation model is:










{





q
k

=


q
˜

k








w
k

=


w
˜

k






.




(
3
)







In the formulas (2) and (3), k is time, wk is an obtained angular velocity, and qk is an obtained observation vectors of the rotation quantity; {tilde over (w)}k and {tilde over (q)}k are state values of the angular velocity and rotation quantity, {tilde over (w)}k−1 are {tilde over (q)}k−1 state values of the angular velocity and rotation quantity at time k−1; qk is a quaternion representation of Rw2c−1; wk is the angular velocity value of the gyroscope; wk=lg(qk+1·qk−1), Φ({tilde over (w)}k−1) is a state transition matrix at the time k−1; Φ({tilde over (w)}k−1)=exp([{tilde over (w)}k−1]x), {tilde over (q)}k is the quaternion representation of the estimated smoothed motion of the camera, {tilde over (q)}k is the state value estimated by the value of {tilde over (q)}k−1 at the previous time.


The specific process of updating the prediction includes: at time k, {tilde over (q)}k−1 estimated at the previous time and the observation value qk the current time are utilized to update the estimation of the state variable {tilde over (q)}k to obtain an estimated value at the current time. The predicted value {tilde over (q)}k is the rotation quantity of the virtual lens at the k-th time.


S103: the smoothed motion is decomposed, a motion of the virtual lens is synthesized in a free lens mode, and a rotation quantity of the virtual lens is calculated.


In the embodiment I of the present disclosure, S103 can specifically be as follows.


The coordinate of the reference point in the virtual lens is Pc, and the following relationship is specifically included:






P

c

=R

c
2w

P
w;  (4)


In formula (4), Rc2w is a 3*3 matrix, which is the rotation quantity of the virtual lens.


The free lens mode is a mode in which motions in the original shooting changing directions of the shooting device are retained. For the synthesized motion of the virtual lens in the free lens mode, the rotation quantity is set as Rc2w={tilde over (R)}, {tilde over (R)} is the original motion trajectory of the shooting device.


the step of decomposing the smoothed motion, synthesizing the motion of the virtual lens in the free lens mode and calculating the rotation quantity of the virtual lens specifically includes:


For the synthesized motion of the virtual lens, the shaking is prevented in the free lens mode, then Rc2w={tilde over (R)}k, where {tilde over (R)}k is the rotation quantity of {tilde over (q)}k.


It should be noted that the Rodrigues formula can be utilized to obtain the rotation matrix R from the quaternion after the unit vector is rotated by an angle θ. Specifically, the quaternion is set as q(θ,x,y,z)T, then calculation formula of the rotation matrix R is:






R
=






[





cos





θ

+


x
2



(

1
-

cos





θ


)








-
z






sin





θ

+

xy


(

1
-

cos





θ


)










y





sin





θ

+






xz


(

1
-

cos





θ


)











z





sin





θ

+

xy


(

1
-

cos





θ


)







cos





θ

+


y
2



(

1
-

cos





θ


)











-
x






sin





θ

+






yz


(

1
-

cos





θ


)












-
y






sin





θ

+

xz


(

1
-

cos





θ


)







x





sin





θ

+

yz


(

1
-

cos





θ


)










cos





θ

+







z
2



(

1
-

cos





θ


)








]

.







S104: the original video is re-projected according to the rotation quantity of the virtual lens and the rotation matrix by which the camera coordinate is transformed to the world coordinate, to generate a stable video.


In the embodiment I of the present disclosure, S104 can specifically be as follows.


A corresponding relationship between a pixel in the original video frame and a pixel in an output video frame is calculated, and then interpolation resampling is performed on the original video frame according to the corresponding relationship to generate the output video frame, and finally a stable video is generated.


The pixel in the original video frame is set as Ps, and the pixel in the corresponding output video frame is Pd, then the corresponding relationship is: Ps=Dc(KcRw2c−1Rc2w−1Kc−1Pd) where Ps=[xs,ys]T, Pd=[xd,yd]T, xs and ys are the coordinate values of the abscissa and ordinate of the pixel Ps in the original video frame; xd and yd are the coordinate values of the abscissa and ordinate of the pixel Pd in the output video frame respectively; Kc and Dc are respectively an internal parameter and a distortion model of the camera, Kcis a projection internal parameter of the virtual lens.


Then the step of performing the interpolation resampling on the original video frame Is according to the corresponding relationship and generating the output video frame Id specifically includes:











I
d



(

P
d

)


=





w
i




I
s



(

P
s
i

)







w
i







(
5
)







In formula (5), wi is the interpolation weight, and Psi∈U(Ps,δ) is a neighborhood coordinate of Ps.


Embodiment II

In the embodiment II of the present disclosure, a computer-readable storage medium is provided, which stores a computer program that, when executed by a processor, implements the steps of the panoramic video anti-shake method provided in the embodiment I of the present disclosure.


The computer-readable storage medium can be a non-transitory computer-readable storage medium.


Embodiment III


FIG. 2 shows a specific structure block diagram of a portable terminal provided in the Embodiment III of the present disclosure. The portable terminal 100 includes: one or more processors 101, a memory 102, and one or more computer programs; the processor 101 is connected to the memory 102 by a bus; the one or more computer programs are stored in the memory 102, and are configured to be executed by the one or more processors 101; and the processor 101, when executing the computer program, implements the steps of the panoramic video anti-shake method provided in the embodiment I of the present disclosure.


In the present invention, by decomposing the motion of the camera, retaining the original shooting viewing angle of the camera and synthesizing the motion of the virtual lens, and the stable video can be generated. Therefore, this method can maintain smooth motion of the rendering lens, generate the stable video, and retain the original shooting angle of the camera. Accordingly, when the user is watching the panoramic video, this method can avoid artificial video jitters and retain changes in the viewing angle in the original direction of the panoramic video. Therefore, this method retains the original shooting angle of the camera, which can keep the smooth motion of the rendering lens and generate the stable video, and has strong robustness to large noise scenes and most sports scenes.


Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by a program instructing relevant hardware. The program can be stored in a computer-readable storage medium, and the storage medium can include: a Read Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, etc.


The above are merely the preferred embodiments of the present disclosure and are not intended to limit the present disclosure. Any modification, equivalent replacement and improvement made within the spirit and principle of the present disclosure shall be regarded as the protection scope of the present disclosure.

Claims
  • 1. A panoramic video anti-shake method, comprising: acquiring a world coordinate of any one reference point in a world coordinate system in real time, and simultaneously acquiring a camera coordinate corresponding to the reference point in a portable terminal and an angular velocity value of a gyroscope in the portable terminal in a current state;smoothing a motion of the camera by using an extended Kalman filter;decomposing the smoothed motion, synthesizing a motion of a virtual lens in a free lens mode, and calculating a rotation quantity of the virtual lens;re-projecting an original video according to the rotation quantity of the virtual lens and a rotation matrix by which the camera coordinate is transformed to the world coordinate, to generate a stable video.
  • 2. The method according to claim 1, wherein the world coordinate of the reference point is Pw, the camera coordinate is Pc, and an angular velocity value is wk, the following relationship is specifically comprised: Pw=Rw2cPc wherein,
  • 3. The method according to claim 1, wherein a coordinate of the reference point in the virtual lens is Pc, the following relationship is specifically comprised: Pc=Rc2wPw wherein Rc2w is a 3*3 matrix and is the rotation quantity of the virtual lens.
  • 4. The method according to claim 1, wherein the smoothing the motion of the camera by using the extended Kalman filter specifically comprises: using a state model
  • 5. The method according to claim 4, wherein the decomposing the smoothed motion, synthesizing the motion of the virtual lens in the free lens mode and calculating the rotation quantity of the virtual lens specifically comprises: for the synthesized motion of the virtual lens, preventing shaking in the free lens mode, so that Rc2w={tilde over (R)}k, wherein {tilde over (R)}k is the rotation quantity of {tilde over (q)}k.
  • 6. The method according to claim 1, wherein the re-projecting the original video to generate the stable video specifically comprises: calculating a corresponding relationship between a pixel in an original video frame and a pixel in an output video frame, and then performing interpolation sampling on the original video frame according to the corresponding relationship, generating the output video frame and finally generating the stable video;wherein, the pixel in the original video frame is set as Ps, and the pixel in the corresponding output video frame is Pd, accordingly the corresponding relationship is: Ps=Dc(KcRw2c−1Rc2w−1Kc−1Pd) where Ps=[xs,ys]T, Pd=[xd,yd]T, xs and ys are respectively coordinate values of an abscissa and an ordinate of the pixel Ps in the original video frame; xd and yd are respectively coordinate values of an abscissa and an ordinate of the pixel Pd in the output video frame; Kc and Dc are respectively an internal parameter and a distortion model of the camera, Kcis a projection internal parameter of the virtual lens;the performing the interpolation sampling on the original video frame Is according to the corresponding relationship, and generating the output video frame Id specifically comprises:
  • 7. A computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed by a processor, implements the steps of the panoramic video anti-shake method according to claim 1.
  • 8. A portable terminal, comprising: one or more processors;a memory; andone or more computer programs, wherein the one or more computer programs are stored in the memory and configured to be executed by the one or more processors, wherein the processor, when executing the computer programs, implement following steps of a panoramic video anti-shake method:acquiring a world coordinate of any one reference point in a world coordinate system in real time, and simultaneously acquiring a camera coordinate corresponding to the reference point in a portable terminal and an angular velocity value of a gyroscope in the portable terminal in a current state;smoothing a motion of the camera by using an extended Kalman filter;decomposing the smoothed motion, synthesizing a motion of a virtual lens in a free lens mode, and calculating a rotation quantity of the virtual lens;re-projecting an original video according to the rotation quantity of the virtual lens and a rotation matrix by which the camera coordinate is transformed to the world coordinate, to generate a stable video.
  • 9. The portable terminal according to claim 8, wherein the world coordinate of the reference point is Pw, the camera coordinate is Pc, and an angular velocity value is wk, the following relationship is specifically comprised: Pw=Rw2cPc wherein,
  • 10. The portable terminal according to claim 8, wherein a coordinate of the reference point in the virtual lens is Pc, the following relationship is specifically comprised: Pc=Rc2wPw wherein Rc2w is a 3*3 matrix and is the rotation quantity of the virtual lens.
  • 11. The portable terminal according to claim 8, wherein the smoothing the motion of the camera by using the extended Kalman filter specifically comprises: using a state model
  • 12. The portable terminal of claim 11, wherein the decomposing the smoothed motion, synthesizing the motion of the virtual lens in the free lens mode and calculating the rotation quantity of the virtual lens specifically comprises: for the synthesized motion of the virtual lens, preventing shaking in the free lens mode, so that Rc2w={tilde over (R)}k, wherein {tilde over (R)}k is the rotation quantity of {tilde over (q)}k.
  • 13. The portable terminal according to claim 8, wherein the re-projecting the original video according to the rotation quantity of the virtual lens and the rotation matrix by which the camera coordinate is transformed to the world coordinate to generate the stable video specifically comprises: calculating a corresponding relationship between a pixel in an original video frame and a pixel in an output video frame, and then performing interpolation sampling on the original video frame according to the corresponding relationship, generating the output video frame and finally generating the stable video;wherein, the pixel in the original video frame is set as Ps, and the pixel in the corresponding output video frame is Pd, accordingly the corresponding relationship is: Ps=Dc(KcRw2c−1Rc2w−1Kc−1Pd) where Ps=[xs,ys]T, Pd=[xd,yd]T, xs and ys are respectively coordinate values of an abscissa and an ordinate of the pixel Ps in the original video frame; xd and yd are respectively coordinate values of an abscissa and an ordinate of the pixel Pd in the output video frame; Kc and Dc are respectively an internal parameter and a distortion model of the camera, Kcis a projection internal parameter of the virtual lens;the performing the interpolation sampling on the original video frame Is according to the corresponding relationship, and generating the output video frame Id specifically comprises:
Priority Claims (1)
Number Date Country Kind
201811550596.7 Dec 2018 CN national
CROSS REFERENCE TO RELATED APPLICATION

The present application is a National Phase of International Application No. PCT/CN2019/109482, filed on Sep. 30, 2019 claims priority to Chinese Patent Application No. 201811550596.7, filed on Dec. 18, 2018 and entitled “Panoramic Video Anti-Shake Method and Portable Terminal”, and the content of which is herein incorporated by reference in their entireties.

PCT Information
Filing Document Filing Date Country Kind
PCT/CN2019/109482 9/30/2019 WO 00