The present invention relates to a teleconferencing device which transmits and receives images captured by cameras and displays images to perform communication with a person at a remote location, and to an image display processing method.
In recent years, with the organization of the infrastructure of an IP network, the introduction of a teleconferencing device in which data, such as images or sound, is transmitted to a remote base through the IP network and displayed is advancing. With the propagation of a large-screen television, such as a plasma display, a teleconferencing system in which the zoom magnification of a camera is adjusted to set such that an image of a subject is viewed on the screen of a corresponding base in life-size is also considered. According to this teleconferencing system, realistic sensation is obtained as if any other party of a teleconference is right in front of his/her eyes.
In a teleconferencing system described in PTL 1, a projector, a camera, and the seating position of a person serving as a subject are fixed, and the zoom magnification of the camera and the display magnification of the projector are set. Thus, in the teleconferencing system described in PTL 1, the person serving as a subject can be displayed on a screen to be irradiated by a projector on the corresponding base side at the time of a teleconference in life-size.
In the bases which execute a teleconferencing system, display apparatuses which are different in the screen size, such as displays, may be used. In this case, in order that an image of a subject on a host base (first base) side is displayed on the screen of a display apparatus provided in a corresponding base (second base) in life-size, the following operation should be performed. That is, a user on the second base side should set the zoom magnification of the camera provided in the first base by a remote operation. Alternatively, the user on the first base side should adjust the zoom magnification of the camera provided in the first base in accordance with an instruction from the second base side through a remote communication unit, such as a teleconference or a telephone.
In order to realize the teleconferencing system described in PTL 1, it is necessary to provide a system in which all the bases involving in a teleconference are constituted by the same apparatus.
An object of the invention is to provide a teleconferencing device capable of displaying a subject captured by each camera of a corresponding base on each display provided in a host base in life-size without depending on the screen size of the display provided in each base, and an image display processing method.
The invention provides a teleconferencing device for a teleconferencing system which transmits and receives an image captured by a camera between a host base and at least one corresponding base, and displays the images on a display. The teleconferencing device includes an image receiving section that receives an image transmitted from the corresponding base; a zoom magnification setting receiving section that receives zoom magnification setting information of each camera provided in the corresponding base; an image enlargement and reduction ratio derivation section that derives an enlargement or reduction ratio, at which each subject in the image captured by each camera of the corresponding base is displayed on the display provided in the host base in life-size, for each corresponding base on the basis of the zoom magnification setting information received by the zoom magnification setting receiving section and screen size information of the display provided in the host base; an image enlargement and reduction section that enlarges or reduces the image transmitted from the corresponding base on the basis of the enlargement or reduction ratio; and an image display control section that conducts a control to display the image from each corresponding base enlarged or reduced by the image enlargement and reduction section on the display of the host base.
The invention also provides an image display processing method which is executed by a teleconferencing device for a teleconferencing system which transmits and receives an image captured by a camera between a host base and at least one corresponding base and displays the image on a display. The method includes receiving an image transmitted from the corresponding base; receiving zoom magnification setting information of each camera provided in the corresponding base; deriving an enlargement or reduction ratio, at which each subject in the image captured by each camera of the corresponding base is displayed on the display provided in the host base in life-size, for each corresponding base on the basis of the zoom magnification setting information and screen size information of the display provided in the host base; enlarging or reducing the image transmitted from the corresponding base on the basis of the enlargement or reduction ratio; and conducting a control to display the enlarged or reduced image from each corresponding base on the display of the host base.
According to the teleconferencing device and the image display processing method of the invention, even when the display provided in each base is different in the screen size, the image of the subject captured by the camera of the corresponding base can be displayed on the display provided in the host base in life-size. That is, the subject captured by each camera provided in the corresponding base can be displayed on the display provided in the host base in life-size. Therefore, the users of the teleconferencing device can have a teleconference with realistic sensation as if all the users are present on the host base. The number of bases is not limited to two, and even when the number of bases is three or more, the same effects can be obtained.
a) is a diagram showing the size relationship between an enlarged image and the screen of a display, and
a) to 4(c) are diagrams showing an example of the relationship between the position of a face of a subject in an enlarged image and a truncated region of image data.
a) is a diagram showing the size relationship between a reduced image and the screen of a display, and
a) to 6(c) are diagrams showing an example of the relationship between the position of a face of a subject in a reduced image and an added region of image data.
a) is a diagram showing an example where an image processing section 131 displays an image, in which black image data is added on the circumference of a reduced image, on a display 130, and
a) is a diagram showing an example of an image, and
Hereinafter, an embodiment of the invention will be described with reference to the drawings.
A camera 110, a display 130, and an input device 140 are connected to the teleconferencing device 100 of each base. The camera 110 captures an image of a person who is in each base. The camera 110 stores zoom magnification setting information. The teleconferencing device 100 transmits data of an image captured by the camera 110 to the teleconferencing device of the corresponding base through the network 120. The teleconferencing device 100 receives data transmitted from the teleconferencing devices of the corresponding base through the network 120. The display 130 displays images of data received by the teleconferencing device 100. The input device 140 is an input interface, such as a mouse or a remote controller, which is used when the user inputs the conditions or the like to be set in the teleconferencing device 100.
The image acquisition section 111 acquires data of an image of a subject in the host base captured by the camera 110. The image encoding section 113 encodes image data acquired by the image acquisition section 111 in a format to be transmitted to a network. The image encoding section 113 may change the resolution of the image depending on the transmission band situation of the network 120 and may perform encoding. For example, when the transmission band of the network 120 is narrow, the image encoding section 113 converts the image captured by the camera 110 to an image having low resolution and then performs encoding.
The image transmitting section 115 transmits image data (encoded image data) encoded by the image encoding section 113 to the teleconferencing device of the corresponding base through the network 120. Encoded image data to be transmitted by the image transmitting section 115 may include information (image resolution information) representing the resolution of the image. In this case, when encoding image data, the image encoding section 113 includes the image resolution information in encoded image data.
The image receiving section 117 receives encoded image data transmitted from the teleconferencing device of another base through the network 120. The image decoding section 119 decodes encoded image data and sends image data in a format to be displayed on the display 130 to the image enlargement and reduction section 129. When the image resolution information is included in encoded image data received by the image receiving section 117, the image decoding section 119 sends the image resolution information to the image enlargement and reduction ratio derivation section 127.
The zoom magnification setting acquisition section 121 acquires the zoom magnification setting information of the camera 110. Although in this embodiment, the camera 110 stores the zoom magnification setting information, the teleconferencing device 100 may store the zoom magnification setting information in a memory (not shown). In this case, when the user of each base installs the teleconferencing device 100 and the camera 110 or when the user sets the zoom magnification of the camera 110, the user sets the zoom magnification by the input device 140.
The zoom magnification setting information is information representing the size of a subject with respect to the size of a display, unlike a zoom magnification expression of a general camera, for example, 50 mm in 35 mm equivalent, or the like. For example, the zoom magnification setting information is expressed as “life-size in a 50-inch display”, “half life-size in a 42-inch display”, or the like.
The size of the subject represented in the zoom magnification setting information may be represented by the size of a specific body site, not the ratio with respect to life-size. For example, the zoom magnification setting information may be expressed as “the size of a face in a vertical direction is 10 cm in a 50-inch display”, “a shoulder-width is 30 cm in a 42-inch display”, or the like. In this case, the image enlargement and reduction ratio derivation section 127 calculates the ratio of life-size on the basis of average size data of a body site represented by the zoom magnification setting information.
The zoom magnification setting transmitting section 123 sends the zoom magnification setting information acquired by the zoom magnification setting acquisition section 121 to the teleconferencing device of the corresponding base through the network 120. For example, at the time of call control in which the teleconferencing device 100 which starts a teleconference establishes the connection to the teleconference terminal of the corresponding base, the zoom magnification setting transmitting section 123 sends the zoom magnification setting information together with connection information including an image data compression format, transmission rate, and the like.
The zoom magnification setting receiving section 125 receives the zoom magnification setting information transmitted from the teleconferencing device of another base through the network 120. The zoom magnification setting receiving section 125 sends the zoom magnification setting information to the image enlargement and reduction ratio derivation section 127 without delay.
The image enlargement and reduction ratio derivation section 127 derives a ratio (enlargement or reduction ratio), in which the image enlargement and reduction section 129 enlarges or reduces an image, on the basis of the zoom magnification setting information received by the zoom magnification setting receiving section 125 and screen size information of the display 130. The image enlargement and reduction ratio derivation section 127 derives the enlargement or reduction ratio such that a subject captured by the camera 110 of the corresponding base can be displayed on the display 130 of the host base in life-size. The details of the method of deriving the enlargement or reduction ratio will be described below.
With regard to the screen size information of the display 130, the image enlargement and reduction ratio derivation section 127 acquires the screen size information from the display 130, or the user inputs the screen size information to the image enlargement and reduction ratio derivation section 127 by the input device 140. The screen size information of the display 130 includes information regarding “inch” representing the size of a screen 132 of the display 130 and resolution information regarding the number of pixels in each of the vertical and horizontal directions of the screen 132 (the number of vertical pixels×the number of horizontal pixels).
The image enlargement and reduction section 129 performs data process for enlarging or reducing the size of an image of image data sent from the image decoding section 119 on the basis of the enlargement or reduction ratio derived by the image enlargement and reduction ratio derivation section 127. The image enlargement and reduction section 129 sends data of the enlarged or reduced image to the image processing section 131.
The image processing section 131 performs image data processing which is required when the image enlargement and reduction section 129 enlarges or reduces an image. The details of image processing which is performed by the image processing section 131 will be described below. The image display control section 133 performs control such that an image processed by the image processing section 131 is displayed on the display 130.
Hereinafter, the method of deriving the enlargement or reduction ratio by the image enlargement and reduction ratio derivation section 127 will be described in detail. In the following description, it is assumed that the resolution (image resolution) of an image received by the image receiving section 117 is the same as the resolution (display resolution) of the display 130.
When the screen size information of the display 130 represents “x-inch”, and the zoom magnification setting information represents “life-size in a y-inch display”, the image enlargement and reduction ratio derivation section 127 derives an enlargement or reduction ratio p by Expression (1).
Thus, when the screen size of the display 130 is “50-inch”, and the zoom magnification setting information is “life-size in a 42-inch display”, the image enlargement and reduction ratio derivation section 127 derives the enlargement or reduction ratio p of 0.84 (=42/50) times. In this case, since the enlargement or reduction ratio p is smaller than 1, the image enlargement and reduction section 129 reduces an image. When the enlargement or reduction ratio p is greater than 1, the image enlargement and reduction section 129 enlarges an image.
A function of enlarging or reducing and displaying an image having resolution different from the set resolution may be set in the display 130. In this case, even when an image obtained by enlarging or reducing the image on the basis of the enlargement or reduction ratio p is displayed on the display 130, a subject on the corresponding base side cannot be displayed in life-size. Accordingly, when the resolution (image resolution) of an image received by the image receiving section 117 is different from the resolution (display resolution) of the display 130, the enlargement or reduction ratio is derived with reference to the resolution too. Specifically, the image enlargement and reduction ratio derivation section 127 derives the enlargement or reduction ratio with reference to the image resolution and the display resolution in addition to the zoom magnification setting information and the screen size information of the display 130.
The image enlargement and reduction ratio derivation section 127 derives an enlargement or reduction ratio p′ by Expression (2) on the basis of the screen sizes x and y. In Expression (2), it is assumed that the image resolution and the display resolution have the same aspect ratio (the aspect ratio of the screen), the resolution in the vertical direction of the image resolution is m, and the resolution in the vertical direction of the display resolution is n. For example, if x=50, y=42, m=1080, and n=720, the enlargement or reduction ratio p′becomes 0.56.
Hereinafter, image processing which is performed by the image processing section 131 will be described in detail. In the following description, the resolution (image resolution) of an image received by the image receiving section 117 is the same as the resolution (display resolution) of the display 130.
First, image processing when the image enlargement and reduction section 129 enlarges an image will be described.
An image enlarged by the image enlargement and reduction section 129 cannot be displayed on the display 130 as it is. That is, as shown in
When the size of the screen 132 of the display 130 is “vertical H pixels×horizontal L pixels”, he is expressed by Expression (3), and le is expressed by Expression (4). p is the above-described enlargement or reduction ratio.
A subject is not limited as being in the center of the image. For this reason, as described above, if image data is truncated evenly from the upper and lower sides or from the left and right sides, there is a situation where the face of the subject is not displayed on the display 130. Accordingly, the image processing section 131 may determine a region where image data will be truncated in accordance with the position of the face in the enlarged image detected by a face detection function.
a) to 4(c) are diagrams showing an example of the relationship between the position of the face of the subject in the enlarged image 301 and a truncated region of image data. As shown in
As shown in
As shown in
As described above, the image processing section 131 determines a region where image data will be truncated such that the face of a subject detected by the face detection function becomes close to the center of the screen 132 of the display 130. Thus, the image processing section 131 can display the face of the subject close to the center of the display 130.
Next, image processing when the image enlargement and reduction section 129 reduces an image will be described.
If an image reduced by the image enlargement and reduction section 129 is displayed on the display 130, as shown in
When the size of the screen 132 of the display 130 is “vertical H pixels×horizontal L pixels”, hr is expressed by Expression (5), and Ir is expressed by Expression (6). p is the above-described enlargement or reduction ratio.
A subject is not limited as being in the center of the image. As described above, if image data is truncated evenly from the upper and lower sides or from the left and right sides, there is a case where the subject is displayed to be deviated from the center of the display 130. Thus, the image processing section 131 may determine a region where image data will be added to the reduced image 302 in accordance with the position of the face in the reduced image detected by the face detection function.
a) to 6(c) are diagrams showing an example of the relationship between the position of the face of a subject in the reduced image 302 and an added region of image data. As shown in
As shown in
As shown in
As described above, the image processing section 131 determines an added region of image data such that the face of the subject detected by the face detection function becomes close to the center of the screen 132 of the display 130. Thus, the image processing section 131 can display the face of the subject close to the center of the display 130.
The image enlargement and reduction section 129 compares the enlargement or reduction ratio derived in Step S103 with 1 to determines whether to enlarge or reduce the image (S105). When the enlargement or reduction ratio is greater than 1, the image enlargement and reduction section 129 progresses to Step S107, and enlarges the image of image data sent from the image decoding section 119 on the basis of the enlargement or reduction ratio (S107). Next, the image processing section 131 truncates at least a part of the circumference of the enlarged image to adjust the image to the size of the screen 132 of the display 130 (S109).
When the enlargement or reduction ratio is smaller than 1, the image enlargement and reduction section 129 progresses to Step S111, and reduces the image of image data sent from the image decoding section 119 on the basis of the enlargement or reduction ratio (S111). Next, the image processing section 131 adds image data to at least a part of the circumference of the reduced image to adjust the image to the size of the screen 132 of the display 130 (S113).
As described above, even when the displays installed in the bases constituting the teleconferencing system of this embodiment are different in size, the image of the subject captured by the camera of the corresponding base can be displayed on the display of the host base in life-size using the zoom magnification setting information sent from the corresponding base and the screen size information in the host base. That is, if the zoom magnification setting information of the corresponding base can be received, the teleconferencing device of the host base can display the subject captured by the camera of the corresponding base on the display of the host base in life-size. Therefore, the users can have a teleconference with realistic sensation such that all the users are present on the host base side.
In recent years, a large-screen display, such as 103-inch or 150-inch, is available in the market. Thus, it is anticipated that such a display or a larger-screen display is used in a teleconferencing system. Even when such a large-screen display is used, if zoom magnification setting information sent from another base is “life-size in a 42-inch display”, as shown in
In general, the viewing angle of a person is 100 degrees, and if an image is filled over the viewing angle of 100 degrees, it becomes possible to obtain realistic sensation as if something in the screen is in front of his/her eyes. Thus, the image processing section 131 may process image data to be added to the circumference of the image such that an image shown in
a) is a diagram showing an example of an image.
As shown in
Finally, the image processing section 131 adds image data including texture information of the background portion 921 to the extended background portion 1001, and adds image data including texture information of the desk portion 924 to the extended desk portion 1002.
The image processing section 131 performs the above-described process when a reduced image is displayed on a large-screen display 130, making is possible for a user to have a teleconference with higher realistic sensation without causing a sense of discomfort with respect to the viewing angle of a person.
Although the invention has been described in detail or with reference to a specific embodiment, it is obvious to those skilled in the art that various changes or modifications can be made without departing from the spirit and scope of the invention.
This application is based on Japanese Patent Application No. 2009-165922, filed Jul. 14, 2009, the content of which is incorporated herein by reference.
The teleconferencing device according to the invention is useful as a teleconferencing device or the like which displays a subject captured by the camera of the corresponding base on the display of the host base in life-size.
Number | Date | Country | Kind |
---|---|---|---|
2009-165922 | Jul 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2010/003436 | 5/21/2010 | WO | 00 | 1/26/2012 |