(a) Field of the Invention
The present invention relates to a video generation system and method, and more particularly, to a system and method that execute synthesis of an original video and a medium object at a remote server.
(b) Description of the Prior Art
As digital cameras, network video, and cell phone cameras become more popular, consumers demand more from multimedia video. Consumers are no longer satisfied with taking photos and recording images with a portable imaging device; they further expect to combine those images or videos with special effects like those seen on TV or in video games to make them more entertaining. However, the data processing speed of a portable video device is slower than that of a desktop or notebook computer, so it requires a longer time to process video and audio data. A user may run into serious problems when attempting to process information involving heavy multimedia computation on the portable video device.
With the widespread use of the Internet and wireless local area networks (WLAN) today, the portable video device is usually provided with a function for transmitting data through the network. This allows the user to take advantage of a remote server with greater computing capability to perform complicated data processing and then transmit the processed data back to the portable video device.
The primary purpose of the present invention is to provide a video generation system and method in which a remote server executes synthesis of an original video and a medium object, to solve the problem of the poor multimedia data processing capability of the portable video device.
To achieve the purpose, the video generation system of the present invention includes a network, a camera device, and a server. The camera device captures an original video and then transmits it to the server through the network. The server is provided with a feature recognition unit, a medium object modification unit, and a video synthesis unit. The feature recognition unit recognizes and positions feature information of the original video; the medium object modification unit modifies a medium object based on the feature information to generate a modified medium object; and the video synthesis unit synthesizes the original video and the modified medium object to generate a synthesized video based on the feature information.
Furthermore, the present invention discloses a method to generate video involving the following steps. First, an original video is provided and transmitted to a remote server through a network for the remote server to recognize and position feature information of the original video; based on the feature information, the remote server modifies a medium object to generate a modified medium object; and finally, the remote server synthesizes the original video and the modified medium object to produce a synthesized video based on the feature information.
Referring to
The original video 12 may be AV (audio and video) data or a still image, and preferably contains portrait image data. The feature information 16 is any combination of the positions of facial features, the features and locations of hair, the features and locations of fingers, and the postures of the trunk and extremities in the portrait image data. The medium object 17 is preferably a 2D (two-dimensional) model, a 3D model, audio data, or any combination thereof, e.g., a virtual portrait, a virtual idol of a god, a cartoon figure, or situation music. Each medium object may contain multiple modification parameters. For example, 2D and 3D models may be provided with any combination of parameters of facial expressions, ratios of facial features, hair ratios, figure motions, finger ratios, motions of the human trunk and extremities, and ratios of the human trunk and extremities, while the audio data contains parameters of human expressions corresponding to situation music. Accordingly, the medium-object modification unit 15 changes the motions of the portrait or the situation music depending on the human face or body features in the original video 12. When transmitting the original video 12 to the server 13, a user may manually preset a topic for the medium object 17, e.g., happy birthday, wedding ceremony, or graduation ceremony, so as to use a virtual portrait model and music readily available in the database.
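The relationship between feature information and modification parameters described above can be illustrated with a minimal Python sketch. The class names, field names, parameter keys, and the threshold rule are all hypothetical and chosen only for illustration; the specification does not define a concrete data layout.

```python
from dataclasses import dataclass, field, replace


@dataclass(frozen=True)
class FeatureInfo:
    """Hypothetical container for features recognized in a portrait."""
    eye_distance: float       # distance between eye centers, in pixels
    face_width: float         # width of the face, in pixels
    mouth_corner_drop: float  # > 0 when the mouth corners droop


@dataclass(frozen=True)
class MediumObject:
    """Hypothetical model described only by its modification parameters."""
    name: str
    params: dict = field(default_factory=dict)


def modify_medium_object(obj: MediumObject, info: FeatureInfo) -> MediumObject:
    """Return a modified copy of obj; the input object is left unchanged."""
    new_params = dict(obj.params)
    # Ratio parameter: match the model's eye spacing to the portrait's.
    new_params["eye_spacing_ratio"] = info.eye_distance / info.face_width
    # Expression parameter: a simple threshold assumed for illustration.
    new_params["expression"] = "sad" if info.mouth_corner_drop > 0 else "neutral"
    return replace(obj, params=new_params)
```

Returning a copy rather than mutating the stored object keeps the database version of the medium object reusable for other videos, which is one reasonable design choice among several.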
The user may decide a topic for the medium object 17 based on the feature information 16. For example, if the facial image in the original video 12 is determined to have a sad expression (with drooping corners of the mouth) according to the feature information 16, the virtual portrait is modified to look sad and the situation music is changed to a sad piece. Alternatively, if the trunk and extremities of the figure in the original video 12 are determined to be imitating Superman according to the feature information 16, the music is changed to the Superman background music; or if the fingers in the image are determined to be making a V sign for victory according to the feature information 16, the music is changed to one that expresses joy.
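The three examples above amount to a small rule table that maps recognized features to situation music. The following Python sketch shows one way such rules might be expressed. The dictionary keys, the label strings, and the order in which the rules are checked are assumptions; the specification does not say which rule takes precedence when several apply.

```python
def choose_music(features: dict) -> str:
    """Pick situation music from recognized features (labels are hypothetical)."""
    # Finger gesture: a V sign selects joyful music.
    if features.get("finger_sign") == "V":
        return "joyful"
    # Posture of trunk and extremities: imitating Superman selects his theme.
    if features.get("posture") == "superman":
        return "superman_theme"
    # Facial expression: drooping mouth corners select sad music.
    if features.get("expression") == "sad":
        return "sad"
    # No rule matched: keep whatever music the preset topic supplies.
    return "default"
```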
The camera device 11 is adapted to a portable electronic device, e.g., a cell phone, a notebook, a PDA (personal digital assistant), or a digital camera. The network 10 is preferably the Internet or a wireless LAN. The original video 12 may be transmitted in a file format or a streaming format. The medium object 17 is stored in a database built into the server 13 or in a remote database.
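The practical difference between the file and streaming formats mentioned above is whether the server receives the video as one complete payload or as a sequence of pieces it can begin processing early. A minimal Python sketch of the streaming side follows; the function name and chunk size are hypothetical, and a real system would use an actual streaming protocol rather than an in-memory generator.

```python
from typing import Iterator


def iter_chunks(data: bytes, chunk_size: int) -> Iterator[bytes]:
    """Yield successive chunks of data, as a stand-in for streaming transmission."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(data), chunk_size):
        yield data[start:start + chunk_size]
```

Sending `data` in a single call corresponds to the file format; iterating over `iter_chunks(data, n)` and sending each piece corresponds to the streaming format.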
Now referring to
Furthermore, the synthetic video 27 may be stored in the network server 21. If the network server 21 is also associated with a cell phone server, and a friend of the user calls the user's cell phone and connects with it through that server, the cell phone server may transmit the synthetic video 27 to the calling cell phone for display, so that the caller is prompted with a video.
As illustrated in
Step 31: an original video is provided;
Step 32: the original video is transmitted through a network to a remote server;
Step 33: feature information of the original video is recognized and positioned at the remote server;
Step 34: based on the feature information, a medium object is modified at the remote server to generate a modified medium object; and
Step 35: at the remote server, the original video and the modified medium object are synthesized based on the feature information to generate a synthetic video.
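Steps 31 to 35 can be sketched as a short Python orchestration. This is only an outline under stated assumptions: the `RemoteServer` class and `generate_video` function are hypothetical names, the three callables are placeholders for the recognition, modification, and synthesis logic, and a direct method call stands in for transmission over the network.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class RemoteServer:
    """Stand-in for the remote server; the three callables are placeholders."""
    recognize: Callable[[Any], Any]              # Step 33
    modify: Callable[[Any, Any], Any]            # Step 34
    synthesize: Callable[[Any, Any, Any], Any]   # Step 35

    def handle(self, original_video: Any, medium_object: Any) -> Any:
        # Step 33: recognize and position feature information.
        features = self.recognize(original_video)
        # Step 34: modify the medium object based on the feature information.
        modified = self.modify(medium_object, features)
        # Step 35: synthesize the original video and the modified medium object.
        return self.synthesize(original_video, modified, features)


def generate_video(capture: Callable[[], Any], server: RemoteServer,
                   medium_object: Any) -> Any:
    original_video = capture()  # Step 31: provide the original video
    # Step 32: a direct call stands in for transmission through a network.
    return server.handle(original_video, medium_object)
```

Note that the feature information is passed to both the modification and the synthesis placeholders, mirroring Steps 34 and 35, which each state that they operate based on the feature information.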
In a preferred embodiment of the video generation method of the present invention as illustrated in
Step 41: a human face video is recorded using a camera module of a cell phone;
Step 42: the human face video is transmitted using a streaming format to a network server through a wireless network and a medium object to be synthesized is preset;
Step 43: locations of facial features on the human face video are recognized at the network server;
Step 44: at the network server, the preset medium object is modified to generate a modified medium object according to those locations of facial features recognized; and
Step 45: at the network server, the human face video and the modified medium object are synthesized to produce a synthesized video according to the feature information.
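For a single frame, the synthesis in Step 45 could be as simple as pasting the modified medium object onto the frame at a position derived from the recognized facial feature locations. The Python sketch below uses nested lists as a toy image representation. The function name, the `transparent` marker, and the clipping behavior are assumptions for illustration, not the synthesis technique the specification prescribes.

```python
def overlay(frame, sprite, top, left, transparent=None):
    """Return a copy of frame with sprite pasted at row `top`, column `left`.

    Sprite pixels equal to `transparent` are skipped, and pixels that would
    fall outside the frame are clipped.
    """
    out = [row[:] for row in frame]  # copy so the original frame is untouched
    for r, sprite_row in enumerate(sprite):
        for c, pixel in enumerate(sprite_row):
            y, x = top + r, left + c
            inside = 0 <= y < len(out) and 0 <= x < len(out[y])
            if inside and pixel != transparent:
                out[y][x] = pixel
    return out
```

In this sketch, `top` and `left` would come from the facial feature locations recognized in Step 43, for example the coordinates at which a virtual accessory should be drawn.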
It is to be noted that the preferred embodiments disclosed in the specification and the accompanying drawings do not limit the present invention, and that any construction, installation, or characteristic that is the same as or similar to that of the present invention should fall within the scope of the purposes and claims of the present invention.
| Number | Date | Country | Kind |
|---|---|---|---|
| 095124995 | Jul 2006 | TW | national |