This Application claims priority of Taiwan Application No. 103139704, filed on Nov. 17, 2014, the entirety of which is incorporated by reference herein.
1. Field of the Invention
The invention generally relates to image capture apparatuses and related image processing methods, and more particularly, to capture apparatuses of video images and image processing methods thereof capable of simultaneously capturing multiple video images.
2. Description of the Related Art
Due to the popularity of Internet applications in recent years, using video conferencing for communication has become a trend. Video conferencing allows two or more people to instantly transfer text messages, files, audio messages and videos through the Internet. With video conferencing, multiple users can transmit video and/or audio messages to each other through a webcam and a microphone device.
As video call applications have matured, user demand has shifted from one-to-one video calls to one-to-many group video calls. In order to allow remote users to clearly understand what a near-end speaker wants to express, there is often a need to share electronic documents, the use of a certain program, the demonstrated operating steps of specific instruments and equipment, the on-site situation observed by the presenter, or a combination of the above with the group of users engaged in the call. Therefore, instantly sending multiple videos and display screens to each caller is a pressing issue.
However, in a conference room where a video conference with multiple users is taking place, a large amount of network bandwidth may be consumed when each user uses their own computer device for video conferencing. Additionally, the camera on each user's computer device can only provide a single viewing angle. Thus, remote users may not be able to see the entire scene and its status during the video conference.
Accordingly, embodiments of the invention provide capture apparatuses of video images and image processing methods thereof capable of simultaneously capturing multiple video images.
In one aspect of the invention, a capture apparatus of video images is provided with a plurality of image sensors and an image composer. Each image sensor continually captures a plurality of sets of video image data, wherein each set of video image data includes odd frame image data and even frame image data, and the image sensors include a first set of image sensors and a second set of image sensors. The image composer is coupled to the image sensors and is configured to filter the odd frame image data from the video image data of the first set of image sensors and the even frame image data from the video image data of the second set of image sensors, and to compose the odd frame image data and/or the even frame image data to generate an output image with a fixed output resolution according to an input resolution of the video image data and the fixed output resolution.
In another aspect of the invention, a processing method of video images is provided. The method comprises the following steps. First, a plurality of image sensors are configured to continually capture a plurality of sets of video image data, wherein each set of video image data includes odd frame image data and even frame image data, and the image sensors include a first set of image sensors and a second set of image sensors. Then, an image composer is configured to filter the odd frame image data from the video image data of the first set of image sensors and the even frame image data from the video image data of the second set of image sensors. Thereafter, the odd frame image data and/or the even frame image data are composed by the image composer to generate an output image with a fixed output resolution according to an input resolution of the video image data and the fixed output resolution.
Image processing methods may be practiced by the disclosed apparatuses or systems which are suitable firmware or hardware components capable of performing specific functions. Image processing methods may also take the form of a program code embodied in a tangible media. When the program code is loaded into and executed by a machine, the machine becomes an apparatus for practicing the disclosed method.
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense.
Embodiments of the invention provide capture apparatuses of video images and related image processing methods thereof, which can use multiple image sensors arranged and pointed in different directions to capture images. Multiple images captured by respective image sensors can be composed into one large output image through a unique data selection circuit and one line storage unit, without requiring extra storage memory for storing the image data captured. The composed large output image can then be compressed through video compression and connected to a host device through the connection interface, such as a universal serial bus (USB) interface, so as to achieve the image composition online, thus effectively reducing costs and maintaining high quality.
In some embodiments, the image sensors 110 are configured in different directions to capture video image data in the corresponding directions. For example, assuming that the capture apparatus of video images 100 has four image sensors, the image sensors can be set up in four different directions to capture video images of meeting attendants in four directions, as shown in
Each image sensor 110 can be used to continuously capture multiple sets of video image data, wherein each set of video image data contains odd frame image data and even frame image data.
The image composer 120, which is coupled to all of the image sensors 110, can perform the image processing method of the present invention, which will be discussed further in the following paragraphs. To be more specific, the image composer 120 can be used to separately filter odd frame image data from the video image data of some of the image sensors and even frame image data from the video image data of the remaining image sensors, and then, based on the input resolution of the video image data and the output resolution, compose the odd frame image data and the even frame image data into an output image with the output resolution in response to a mode selection signal. In particular, the mode selection signal is used to indicate the output mode of the output image. In some embodiments, a user at the host device end may input a selected output mode through a user interface or an input unit to generate the mode selection signal. In this embodiment, the output mode of the output image can include at least a first mode, a second mode, and a third mode, wherein the output images generated in different output modes also vary. To be more specific, when the first mode is selected, the output image is composed from the odd frame image data and/or even frame image data of all the image sensors. When the second mode is selected, the output image is composed from the odd frame image data and/or even frame image data of a selected subset of the image sensors. When the third mode is selected, the output image is the odd frame image data or even frame image data selected directly from one image sensor.
For example, assuming that the capture apparatus of video images 100 has four image sensors, when the output mode indicated by the mode selection signal is the first mode, the output image can be generated by composing odd frame image data and/or even frame image data from the four image sensors; when the output mode indicated by the mode selection signal is the second mode, the output image can be generated by composing any two of the odd frame image data and/or even frame image data from the four image sensors; and when the output mode indicated by the mode selection signal is the third mode, the output image can be generated by directly selecting odd frame image data or even frame image data generated by one of the four image sensors. Thereafter, the image composer 120 may transmit the generated output image to the video compressor 130.
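By way of illustration only, the relationship between the mode selection signal and the image sensors that contribute to the output image in the four-sensor example may be sketched as follows. The names `OutputMode` and `sensors_for_mode`, the zero-based sensor indices, and the way the selected sensors are passed in are hypothetical and do not appear in the disclosure.

```python
from enum import Enum


class OutputMode(Enum):
    """Hypothetical encoding of the mode selection signal."""
    FIRST = 1   # compose image data from all image sensors
    SECOND = 2  # compose image data from any two image sensors
    THIRD = 3   # pass through image data from one image sensor


def sensors_for_mode(mode, num_sensors=4, selected=None):
    """Return the indices of the sensors that contribute to the output image."""
    selected = list(selected or [])
    if any(s < 0 or s >= num_sensors for s in selected):
        raise ValueError("selected sensor index out of range")
    if mode is OutputMode.FIRST:
        return list(range(num_sensors))
    if mode is OutputMode.SECOND:
        if len(selected) != 2:
            raise ValueError("second mode expects two selected sensors")
        return selected
    if mode is OutputMode.THIRD:
        if len(selected) != 1:
            raise ValueError("third mode expects one selected sensor")
        return selected
    raise ValueError("unknown output mode")
```

For instance, `sensors_for_mode(OutputMode.SECOND, selected=[0, 2])` would indicate that the output image is composed from the first and third image sensors only.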
The video compressor 130 which is coupled to the image composer 120 can compress the output images into compressed video signals based on one video compression standard (such as H.264, MPEG2 etc.), which are delivered and displayed on the host device. Specifically, the video compressor 130 may further include a USB interface 132, through which compressed video signals are transmitted and displayed on the host device.
The image composer 120 may further include a first filter 121 and a second filter 122, which are used to filter odd frame image data of the first set of image sensors and even frame image data from the second set of image sensors, respectively. The image composer 120 may further include a down-sampler 123, which is coupled to the first filter 121 and the second filter 122 and is used for selectively downsampling the odd frame image data and/or even frame image data based on the input resolution of the video images, the output resolution of the output image, and the output mode indicated by the mode selection signal and generating the output images subsequently based on the down-sampled even frame image data and/or odd frame image data.
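As a minimal sketch of the filtering performed by the first filter 121 and the second filter 122, and assuming that the frames of each image sensor are numbered from 1 so that the odd frame image data are frames 1, 3, 5, and so on, the filtering may be modeled as follows. The function name `filter_sensor_streams`, the dictionary layout, and the assignment of sensors 0 and 1 to the first set are illustrative assumptions only.

```python
def filter_sensor_streams(streams, first_set, second_set):
    """Keep odd frames for the first set of sensors and even frames for the second set.

    `streams` maps a sensor index to its list of frames in capture order,
    with the first list element treated as frame number 1 (an assumption).
    """
    filtered = {}
    for sensor in first_set:
        filtered[sensor] = streams[sensor][0::2]  # frames 1, 3, 5, ...
    for sensor in second_set:
        filtered[sensor] = streams[sensor][1::2]  # frames 2, 4, 6, ...
    return filtered


# Four simulated sensors, each producing four labeled frames.
streams = {s: [f"s{s}f{n}" for n in range(1, 5)] for s in range(4)}
filtered = filter_sensor_streams(streams, first_set=(0, 1), second_set=(2, 3))
```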
In particular, the down-sampler 123, in response to the mode selection signal, downsamples the odd frame image data and/or even frame image data when the input resolution is equivalent to the output resolution. For example, when the input resolution is equivalent to the output resolution and the mode selection signal indicates the aforementioned first mode or second mode, the output image is made up of the video images captured by a number of the image sensors 110. Thus, the captured video images can only be composed into the output image after downsampling.
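The condition described above may be sketched as follows, purely as an illustration. The sketch covers only the stated case in which several source images must fit into an output image of the same resolution. The use of simple decimation (keeping every second pixel and every second line), the factor of 2 for a four-sensor layout, and the names `needs_downsampling` and `downsample` are assumptions and are not taken from the disclosure.

```python
def needs_downsampling(input_res, output_res, num_sources):
    """True when several source images must share one output image of equal resolution."""
    return num_sources > 1 and input_res == output_res


def downsample(image, factor_x, factor_y):
    """Decimate a row-major image by keeping every factor-th pixel and line."""
    return [row[::factor_x] for row in image[::factor_y]]


# A 4x4 test frame whose pixel value encodes its position as 10 * y + x.
frame = [[10 * y + x for x in range(4)] for y in range(4)]
if needs_downsampling((4, 4), (4, 4), num_sources=4):
    frame = downsample(frame, 2, 2)
```

A practical down-sampler would likely apply low-pass filtering before decimation to limit aliasing; this sketch omits that step for brevity.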
The image composer 120 may further include a line buffer 124 used to temporarily store the odd frame image data and/or even frame image data required for composing the output image. In particular, the line buffer 124 can store image data whose length equals the width of one line of the screen.
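To illustrate how a single line buffer can suffice for composition, the following sketch assembles the output image one line at a time from already down-sampled source images arranged in a grid. The two-by-two arrangement, the generator name `compose_lines`, and the list-based pixel representation are hypothetical choices made for this example only.

```python
def compose_lines(tiles, out_width):
    """Yield output lines one at a time using a single buffer of one line width.

    `tiles` is a grid (list of rows) of down-sampled images; the images in one
    grid row share the same height and their widths sum to `out_width`.
    """
    line_buffer = [0] * out_width  # the only storage held between lines
    for tile_row in tiles:
        height = len(tile_row[0])
        for y in range(height):
            x = 0
            for tile in tile_row:
                line = tile[y]
                line_buffer[x:x + len(line)] = line  # place this sensor's segment
                x += len(line)
            yield list(line_buffer)  # copy out so the buffer can be reused


# Four 2x2 tiles filled with the values 1 to 4, composed into a 4x4 image.
tl, tr = [[1, 1], [1, 1]], [[2, 2], [2, 2]]
bl, br = [[3, 3], [3, 3]], [[4, 4], [4, 4]]
output_image = list(compose_lines([[tl, tr], [bl, br]], out_width=4))
```

Because each output line is emitted as soon as it is filled, no full-frame memory is held by the composer in this sketch.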
The image composer 120 may further include a pixel selector 125 used to select one of the multiple output modes, based on the mode indicated by the mode selection signal, and to compose the filtered odd frame image data and/or even frame image data into the output image with the output resolution accordingly.
First, in step S502, multiple sets of video image data are continuously captured by the image sensors 110. In particular, each set of video image data includes odd frame image data and even frame image data. Moreover, the image sensors 110 can include a first set of image sensors and a second set of image sensors. Then, in step S504, the odd frame image data from the video image data of the first set of image sensors and the even frame image data from the video image data of the second set of image sensors are filtered through the image composer 120. Thereafter, in step S506, through the image composer 120 and based on the input resolution of the video image data and the output resolution of the output image, the odd frame image data and even frame image data are composed into an output image with the output resolution in response to a mode selection signal. Specifically, the mode selection signal is used to indicate the output mode of the output image.
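Steps S502 to S506 may be sketched end to end as follows for the first mode with four image sensors. This is an illustration under stated assumptions rather than the disclosed implementation: capture is simulated with constant-valued frames, frames are numbered from 1, sensors 0 and 1 form the first set, downsampling is simple decimation by 2, and all function names are hypothetical.

```python
def capture(sensor_id, num_frames, width, height):
    """S502 (simulated): every pixel of frame n holds sensor_id * 10 + n."""
    return [[[sensor_id * 10 + n] * width for _ in range(height)]
            for n in range(1, num_frames + 1)]


def filter_frames(frames, keep_odd):
    """S504: keep odd-numbered frames (1, 3, ...) or even-numbered frames (2, 4, ...)."""
    return frames[0::2] if keep_odd else frames[1::2]


def compose_quad(frames, out_height):
    """S506: decimate four frames by 2 and tile them into a 2x2 output image."""
    small = [[row[::2] for row in frame[::2]] for frame in frames]
    half = out_height // 2
    top = [small[0][y] + small[1][y] for y in range(half)]
    bottom = [small[2][y] + small[3][y] for y in range(half)]
    return top + bottom


def process(width=4, height=4):
    first_set = (0, 1)  # sensors whose odd frames are kept; the rest keep even frames
    streams = [capture(i, 2, width, height) for i in range(4)]
    filtered = [filter_frames(streams[i], keep_odd=(i in first_set))
                for i in range(4)]
    return compose_quad([f[0] for f in filtered], height)


output_image = process()
```

In the resulting image, the upper half carries odd frame image data from sensors 0 and 1, and the lower half carries even frame image data from sensors 2 and 3, while the output resolution equals the input resolution.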
For explanation, image processing methods are illustrated as examples in the following embodiments, and those skilled in the art will understand that the present invention is not limited thereto. In the following embodiments, it is assumed that the capture apparatus of video images 100 has four image sensors directed in four different directions, allowing users in the four directions to engage in video conferencing with a computer device at the remote end at the same time, and that the capture apparatus is connected to the computer device through a connection interface, such as the USB interface, so as to be regarded as an image capture device of the computer device.
Therefore, the capture apparatus of video images and related image processing methods of the invention can utilize the output capability of the image composer and the synchronization capability between all of the image sensors to design a unique data selection circuit and one line storage unit so as to achieve online composition without requiring extra storage spaces to store image data, thereby effectively reducing costs and maintaining the high quality of transmitted images.
Systems and methods thereof, or certain aspects or portions thereof, may take the form of a program code (i.e., executable instructions) embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine thereby becomes an apparatus for practicing the methods. The methods may also be embodied in the form of a program code transmitted over some transmission medium, such as electrical wiring or cabling, through fiber optics, or via any other form of transmission, wherein, when the program code is received and loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the disclosed methods. When implemented on a general-purpose processor, the program code combines with the processor to provide a unique apparatus that operates analogously to application specific logic circuits.
While the invention has been described by way of example and in terms of preferred embodiment, it is to be understood that the invention is not limited thereto. Those who are skilled in this technology can still make various alterations and modifications without departing from the scope and spirit of this invention. Therefore, the scope of the present invention shall be defined and protected by the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
103139704 | Nov 2014 | TW | national |