This patent application is based on and claims priority pursuant to 35 U.S.C. § 119(a) to Japanese Patent Application No. 2018-068440, filed on Mar. 30, 2018, in the Japan Patent Office, the entire disclosure of which is hereby incorporated by reference herein.
Embodiments of the present disclosure relate to a communication terminal, an image communication system, and a display control method.
Remote conference systems, such as videoconference systems, are now in widespread use, enabling users to remotely attend a conference (meeting) that is held among different sites via a communication network such as the Internet. In such videoconference systems, a communication terminal for a remote conference system is provided in a conference room (meeting room) where attendees of one party in a remote conference are present. This communication terminal collects an image or video of the conference room including the attendees and sounds such as a speech made by an attendee, and transmits digital data converted from the collected image (video) and/or sounds to the other party's terminal provided at a different conference room. Based on the transmitted digital data, the other party's terminal displays images on a display or outputs audio from a speaker in the different conference room to establish video communication (video call). This enables the attendees to carry out the conference among remote sites as if they were together in an actual conference.
In addition, an image capturing device that is capable of capturing a spherical image in real time is connectable to such communication terminals described above to transmit the spherical image acquired by the image capturing device to each communication terminal of the other party. Each communication terminal sequentially converts the received spherical image to a planar image representing a predetermined area, which is a part of the spherical image, and displays the planar image on a display. This enables a user in each of the remote sites to determine, on his or her own, a predetermined area image to be displayed, representing an image of a predetermined area that the user is interested in, from a whole image of the spherical image.
In addition, there is a known technique in which an image relay server, which relays a video image between or among two or more video image communication terminals, superimposes a predetermined figure on an object in the video image indicated by a video image communication terminal. This provides a video image in which the figure remains combined with the object even when the object in the video image moves.
An exemplary embodiment of the present disclosure includes an image communication system including a first communication terminal and a second communication terminal. The first communication terminal includes first circuitry and a second communication terminal includes second circuitry. The first circuitry of the first communication terminal transmits, to the second communication terminal, first image data representing a first image and second image data representing a second image. The first circuitry of the first communication terminal transmits, to the second communication terminal, position information indicating a predetermined position on the first image. The second circuitry of the second communication terminal combines, based on the position information, the second image with the first image at the predetermined position on the first image to generate a combined image. The second circuitry of the second communication terminal displays, on a display, the combined image.
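The combining step performed by the second circuitry can be sketched as follows. This is a minimal illustration only, not the disclosed implementation: it assumes that both images are plain two-dimensional lists of pixel values and that the position information is the top-left pixel coordinate (x, y) at which the second image is placed on the first image. The function name `combine_images` is hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed representation: rows of pixel values


def combine_images(first: Image, second: Image, position: Tuple[int, int]) -> Image:
    """Return a copy of `first` with `second` pasted at `position` (x, y).

    Assumption: `position` is the top-left corner of the second image in
    pixel coordinates of the first image. Pixels that would fall outside
    the first image are ignored.
    """
    x0, y0 = position
    combined = [row[:] for row in first]  # leave the received first image intact
    for dy, row in enumerate(second):
        for dx, pixel in enumerate(row):
            y, x = y0 + dy, x0 + dx
            if 0 <= y < len(combined) and 0 <= x < len(combined[y]):
                combined[y][x] = pixel
    return combined


# Example: a 4x3 first image and a 2x2 second image placed at (1, 1).
first_image = [[0, 0, 0, 0] for _ in range(3)]
second_image = [[1, 1], [1, 1]]
combined_image = combine_images(first_image, second_image, (1, 1))
```

Because the combination is performed on the receiving side in this sketch, the first image data and the second image data can be kept separate in transit and combined only when displayed.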
A more complete appreciation of the disclosure and many of the attendant advantages and features thereof can be readily obtained and understood from the following detailed description with reference to the accompanying drawings, wherein:
The accompanying drawings are intended to depict example embodiments of the present disclosure and should not be interpreted to limit the scope thereof. The accompanying drawings are not to be considered as drawn to scale unless explicitly noted.
The terminology used herein is for describing particular embodiments only and is not intended to be limiting of the present disclosure. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “includes” and/or “including”, when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. In describing preferred embodiments illustrated in the drawings, specific terminology is employed for the sake of clarity. However, the disclosure of this patent specification is not intended to be limited to the specific terminology so selected, and it is to be understood that each specific element includes all technical equivalents that have the same function, operate in a similar manner, and achieve a similar result.
Hereinafter, a description is given of one of the embodiments of the present disclosure, with reference to the attached drawings.
Method of Generating Spherical Image
A method of generating a spherical image is described below, with reference to
A description is now given of an external view of an image capturing device 1, with reference to
As illustrated in
A description is now given of an example of how the image capturing device 1 is used, with reference to
A description is now given of an overview of a process of generating a spherical image from the images captured by the image capturing device 1, with reference to
As illustrated in
The Mercator image is attached so as to cover the sphere surface using Open Graphics Library for Embedded Systems (OpenGL ES) as illustrated in
Because the spherical image is an image attached to the sphere surface, a part of the image may look distorted when viewed by the user, and this may give a feeling of strangeness to the user. To cope with this, a part of the spherical image is displayed as a planar image having fewer curves. The part of the spherical image displayed as a planar image is referred to as a predetermined area. In addition, the predetermined area may also be selectable, or settable according to a user instruction. The displayed planar image corresponding to the predetermined area is, hereinafter, referred to as a “predetermined area image”. A description is now given of displaying the predetermined area image, with reference to
The predetermined area image Q, which is an image of the predetermined area T illustrated in
A description is now given of a relation between the predetermined area information and the predetermined area T, with reference to
L/f = tan(α/2) (Equation 1)
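Equation 1 can be evaluated numerically as in the following sketch. The meanings of the symbols are assumptions made for illustration, because the figure that defines them is not reproduced here: L is taken as the distance from the center point of the predetermined area T to one of its vertices, f as the distance from the virtual viewpoint to that center point, and α as the angle of view. The function name `distance_to_center` is hypothetical.

```python
import math


def distance_to_center(half_diagonal: float, angle_of_view_deg: float) -> float:
    """Solve Equation 1, L / f = tan(alpha / 2), for f.

    half_diagonal     -- L (assumed: center of the predetermined area to a vertex)
    angle_of_view_deg -- alpha in degrees (assumed: angle of view)
    """
    alpha = math.radians(angle_of_view_deg)
    if not 0.0 < alpha < math.pi:
        raise ValueError("angle of view must lie strictly between 0 and 180 degrees")
    return half_diagonal / math.tan(alpha / 2.0)


# With L = 1 and a 90-degree angle of view, tan(45 degrees) is 1, so f is about 1.
f_wide = distance_to_center(half_diagonal=1.0, angle_of_view_deg=90.0)
# A narrower angle of view gives a larger f for the same L.
f_narrow = distance_to_center(half_diagonal=1.0, angle_of_view_deg=60.0)
```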
Overview of Image Communication System
A description is now given of an overview of a configuration of an image communication system according to the present embodiment of the disclosure, with reference to
As illustrated in
Each of the image capturing device 1a and the image capturing device 1b is a special digital camera that captures an image including an object or a view (surroundings) to obtain two hemispherical images from which a spherical image is generated, as described above. The image capturing device 8 is a general-purpose digital camera that captures an image of an object or a view (surroundings) to obtain a general planar image.
Each of the videoconference terminal 3a and the videoconference terminal 3d is a terminal dedicated to videoconferencing. The videoconference terminal 3a and the videoconference terminal 3d display, on the display 4a and the display 4d, respectively, a video image obtained by performing a video communication (video call) via a wired cable such as a universal serial bus (USB) cable. The videoconference terminal 3a usually captures an image by a camera 312, as illustrated in
The communication management system 5 manages and controls communication among the videoconference terminal 3a, the videoconference terminal 3d, the PC 7 and the smartphone 9. In addition, the communication management system 5 manages types of image data (a general image type and a special image type) to be transmitted or received in the communication among the videoconference terminal 3a, the videoconference terminal 3d, the PC 7 and the smartphone 9. In other words, the communication management system 5 is a communication control system. In the description of the present embodiment, a spherical image is used as a special image, and a planar image is used as a general image. The communication management system 5 is installed in, for example, an office of a service provider that provides a video communication service. The communication management system 5 may be configured as a single computer. Alternatively, the communication management system 5 may be configured as a plurality of computers, and one or more units (functions, means, or storages) are arbitrarily assigned to each of the plurality of computers. That is, the communication management system 5 may be implemented by a plurality of servers that operate in cooperation with one another.
The PC 6 generates material image data that is image data of a material image to be displayed in the videoconference. In this disclosure, the material image is any image to be presented to participants during the videoconference. Examples of the material image include an image displayed, created, or edited by a general-purpose application being executed on the PC 6, and an image photographed by a general-purpose digital camera and reproduced on the PC 6. However, these examples are not intended to limit the embodiment.
The PC 7 can perform a video communication by connecting with the image capturing device 8. In the present embodiment, the PC 7 and the image capturing device 8 are provided in the same site that is a site C. There is one user, a user C, participating in the video communication in the site C.
The smartphone 9 includes a display 917, which is described later, and displays an image of the video communication on the display 917. The smartphone 9 includes a complementary metal oxide semiconductor (CMOS) sensor 905, and usually captures an image using the CMOS sensor 905. In addition, the smartphone 9 is capable of obtaining data of two hemispherical images, which are the original image data of a spherical image, captured by the image capturing device 1b using a wireless communication such as Wireless Fidelity (Wi-Fi) or Bluetooth (registered trademark). When such a wireless communication is used, a cradle 2b supplies power to the image capturing device 1b and holds the image capturing device 1b, but does not establish a communication. In the present embodiment, the image capturing device 1b, the cradle 2b, and the smartphone 9 are provided in the same site that is a site B. In addition, two users, a user B1 and a user B2, are participating in the video communication in the site B.
Each of the videoconference terminal 3a, the videoconference terminal 3d, the PC 7 and the smartphone 9 is an example of a communication terminal. OpenGL ES is installed on each of the communication terminals to enable each of the communication terminals to generate predetermined area information that indicates a partial area of the spherical image, or to generate a predetermined area image from a spherical image that is transmitted from a different one of the communication terminals.
The arrangement of the terminals (i.e., the communication terminals, the displays, the image capturing devices), the apparatuses and the users illustrated in
Hardware Configuration
A description is now given of hardware configurations of the image capturing device 1, the videoconference terminal 3, the communication management system 5, the PC 6, the PC 7, and the smartphone 9 according to the present embodiment, with reference to
Hardware Configuration of Image Capturing Device
A description is now given of a hardware configuration of the image capturing device 1 according to the present embodiment, with reference to
As illustrated in
The image sensor converts an optical image formed by the fisheye lenses 102a and 102b into electric signals to output image data. The timing generation circuit generates horizontal or vertical synchronization signals, pixel clocks and the like for the image sensor. Various commands, parameters, and the like for operations of the imaging elements 103a and 103b are set in the group of registers.
Each of the imaging elements 103a and 103b of the imaging unit 101 is connected to the image processor 104 through a parallel I/F bus. In addition, each of the imaging elements 103a and 103b of the imaging unit 101 is connected to the image controller 105 through a serial I/F bus such as an inter-integrated circuit (I2C) bus. Each of the image processor 104 and the image controller 105 is connected to the CPU 111 through a bus 110. In addition, the ROM 112, the SRAM 113, the DRAM 114, the operation device 115, the network I/F 116, the communication device 117, and an electronic compass 118 are also connected to the bus 110.
The image processor 104 obtains image data from each of the imaging elements 103a and 103b through the parallel I/F bus and performs predetermined processing on the image data obtained from each of the imaging elements 103a and 103b separately and combines the processed image data to generate data representing a Mercator image as illustrated in
The image controller 105 usually functions as a master device while each of the imaging elements 103a and 103b usually functions as a slave device, and the image controller 105 sets commands in the group of registers of each of the imaging elements 103a and 103b through the I2C bus. The image controller 105 receives necessary commands from the CPU 111. In addition, the image controller 105 obtains status data of the group of registers of each of the imaging elements 103a and 103b through the I2C bus and transmits the status data to the CPU 111.
The image controller 105 instructs the imaging elements 103a and 103b to output the image data at a time when the shutter button of the operation device 115 is pressed. The image capturing device 1 can support a preview display function (e.g., displaying a preview on a display such as a display of the videoconference terminal 3a) or a movie display function. When a movie is displayed, the image data is continuously output from the imaging elements 103a and 103b at a predetermined frame rate (frames per second).
Furthermore, the image controller 105 operates in conjunction with the CPU 111 to synchronize times when the imaging elements 103a and 103b output the image data. In the present embodiment, the image capturing device 1 does not include a display unit (display). However, in some embodiments, the image capturing device 1 may include a display. The microphone 108 converts sound into audio data (signals). The audio processor 109 obtains the audio data from the microphone 108 through an I/F bus and performs predetermined processing on the audio data.
The CPU 111 controls the image capturing device 1 and performs necessary processing. The ROM 112 stores various programs to be executed by the CPU 111. Each of the SRAM 113 and the DRAM 114 operates as a work memory to store programs loaded from the ROM 112 to be executed by the CPU 111 or data being currently processed. More specifically, in one example, the DRAM 114 stores image data currently processed by the image processor 104 and data of the Mercator image on which processing has been performed.
The operation device 115 collectively refers to various operation keys, a power switch, a shutter button, and a touch panel having functions of both displaying information and receiving input from a user, which may be used in combination. The user operates the operation keys to input various image capturing modes or image capturing conditions.
The network I/F 116 collectively refers to an interface circuit such as a USB I/F that enables the image capturing device 1 to communicate with an external medium such as a secure digital (SD) card or an external personal computer. The network I/F 116 supports at least one of a wired communication and a wireless communication. The data representing the Mercator image, which is stored in the DRAM 114, can be stored in the external medium through the network I/F 116 or transmitted to an external device such as the videoconference terminal 3a via the network I/F 116, as needed.
The communication device 117 communicates with an external device such as the videoconference terminal 3a via the antenna 117a of the image capturing device 1 by a short range wireless communication such as Wi-Fi or Near Field Communication (NFC). The communication device 117 may transmit the data representing the Mercator image to an external device such as the videoconference terminal 3a.
The electronic compass 118 computes an orientation and a tilt (roll angle) of the image capturing device 1 based on the Earth's magnetism to output orientation and tilt information. The orientation and tilt information is an example of related information, which is metadata described in compliance with Exif. In addition, the orientation and tilt information is used for performing image processing, such as image correction, on captured image data. The related information also includes data indicating a time (date) when an image is captured by the image capturing device 1, and data indicating a size of image data (an amount of image data), for example.
Hardware Configuration of Videoconference Terminal
A description is now given of a hardware configuration of the videoconference terminal 3 according to the present embodiment of the disclosure, with reference to
The CPU 301 controls the entire operation of the videoconference terminal 3. The ROM 302 stores a control program such as an Initial Program Loader (IPL) used for operating the CPU 301. The RAM 303 is used as a work area for the CPU 301. The flash memory 304 stores various data such as a communication control program, image data, and audio data. The SSD 305 controls reading and/or writing of various data to and/or from the flash memory 304 under control of the CPU 301. As an alternative to the SSD, a hard disk drive (HDD) may be used. The medium I/F 307 reads and/or writes (stores) data from and/or to a recording medium 306 such as a flash memory. The operation key 308 is operated according to a user input indicating an instruction in selecting a destination of a communication from the videoconference terminal 3, for example. The power switch 309 is a switch that turns on or off the power of the videoconference terminal 3.
The network I/F 311 enables the videoconference terminal 3 to establish a data communication with an external device via the communication network 100 such as the Internet. The camera 312 is an example of a built-in imaging device capable of capturing an object under control of the CPU 301 to obtain image data. The imaging element I/F 313 is a circuit that controls driving of the camera 312. The microphone 314 is an example of a built-in sound collecting device capable of inputting sounds. The audio input/output interface 316 is a circuit for controlling input and output of audio signals between the microphone 314 and the speaker 315 under control of the CPU 301. The display I/F 317 is a circuit for transmitting image data to an external display 4 under control of the CPU 301. The external device connection I/F 318 is an interface that connects the videoconference terminal 3 to various external devices. The short-range communication circuit 319 is a communication circuit that communicates in compliance with a standard such as NFC or Bluetooth (registered trademark).
The bus line 310, which includes an address bus and a data bus, electrically connects to various elements, including the CPU 301 illustrated in
The display 4 is an example of a display unit, such as a liquid crystal or organic electroluminescence (EL) display that displays an image of an object, an operation icon, and the like. The display 4 is connected to the display I/F 317 by a cable 4c. The cable 4c may be an analog red green blue (RGB) (video graphic array (VGA)) signal cable, a component video cable, a high-definition multimedia interface (HDMI (registered trademark)) signal cable, or a digital visual interface (DVI) signal cable.
The camera 312 includes a lens and a solid-state imaging element that converts an image (video image) of an object to electronic data by photoelectric conversion. Examples of the solid-state imaging element to be used include a CMOS sensor and a CCD sensor. The external device connection I/F 318 is capable of connecting the videoconference terminal 3 to an external device such as an external camera, an external microphone, or an external speaker through a USB cable, for example. When an external camera is connected, the external camera is driven in preference to the built-in camera 312 under control of the CPU 301. In a similar manner, when an external microphone is connected, or an external speaker is connected, the external microphone or the external speaker is driven in preference to the built-in microphone 314 or the built-in speaker 315 under control of the CPU 301.
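The preference rule described above can be sketched as follows. This is an illustrative model only: the device labels are plain strings, and the function name `select_active_device` is hypothetical rather than part of the disclosed terminal.

```python
from typing import Optional


def select_active_device(built_in: str, external: Optional[str]) -> str:
    """Return the device to drive: the external one when connected, else the built-in one."""
    return external if external is not None else built_in


# No external camera is connected, so the built-in camera 312 is driven.
active_camera = select_active_device("built-in camera 312", None)
# An external microphone is connected, so it takes precedence over the microphone 314.
active_microphone = select_active_device("built-in microphone 314", "external microphone")
```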
The recording medium 306 is removable from the videoconference terminal 3. The flash memory 304 is replaceable with any suitable memory, such as an electrically erasable and programmable ROM (EEPROM), as long as the memory is a non-volatile memory that reads or writes data under control of CPU 301.
Hardware Configurations of Communication Management System and PC
A description is now given of a hardware configuration of each of the communication management system 5, PC 6, and the PC 7 according to the present embodiment, with reference to
The communication management system 5 includes a CPU 501, a ROM 502, a RAM 503, a hard disk (HD) 504, a hard disk drive (HDD) 505, a media drive 507, a display 508, a network I/F 509, a keyboard 511, a mouse 512, a compact-disc rewritable (CD-RW) drive 514, and a bus line 510. The CPU 501 controls the entire operation of the communication management system 5. The ROM 502 stores programs such as an IPL to boot the CPU 501. The RAM 503 is used as a work area for the CPU 501. The HD 504 stores various data such as programs for the communication management system 5. The HDD 505 controls reading or writing of data from and to the HD 504 under control of the CPU 501. The media drive 507 controls reading or writing (storing) of data from or to a recording medium 506 such as a flash memory. The display 508 displays various information such as a cursor, menus, windows, characters, and images. The network I/F 509 enables the communication management system 5 to establish a communication with an external device via the communication network 100. The keyboard 511 includes a plurality of keys to allow a user to input characters, numbers, and various instructions. The mouse 512 allows a user to input an instruction for selecting and executing various functions, selecting an item to be processed, or moving the cursor. The CD-RW drive 514 controls reading of data from a CD-RW 513, which is an example of a removable recording medium. The bus line 510 electrically connects those parts or devices of the communication management system 5 to one another as illustrated in
Hardware Configuration of Smartphone
A description is now given of a hardware configuration of the smartphone 9 according to the present embodiment, with reference to
The CPU 901 controls the entire operation of the smartphone 9. The ROM 902 stores a program, such as an IPL, used for controlling the CPU 901. The RAM 903 is used as a work area for the CPU 901. The EEPROM 904 reads or writes various data such as a control program for the smartphone 9 under control of the CPU 901. The CMOS sensor 905 captures an object (mainly, a self-image of a user operating the smartphone 9) under control of the CPU 901 to obtain image data. The acceleration and orientation sensor 906 includes various sensors such as an electromagnetic compass for detecting geomagnetism, a gyrocompass, and an acceleration sensor. The medium I/F 908 controls reading and/or writing data from and/or to a recording medium 907, such as a flash memory. The GPS receiver 909 receives a GPS signal from a GPS satellite.
The smartphone 9 further includes a long-range communication circuit 911, a camera 912, an imaging element I/F 913, a microphone 914, a speaker 915, an audio input/output (I/O) I/F 916, a display 917, an external device connection I/F 918, a short-range communication circuit 919, an antenna 919a for the short-range communication circuit 919, and a touch panel 921.
The long-range communication circuit 911 is a circuit that enables the smartphone 9 to establish a communication with another device via the communication network 100. The camera 912 is an example of a built-in imaging device capable of capturing an object under control of the CPU 901 to obtain image data. The imaging element I/F 913 is a circuit that controls driving of the camera 912. The microphone 914 is an example of a built-in audio collecting device configured to input audio. The audio input/output interface 916 is a circuit for controlling input and output of audio signals between the microphone 914 and the speaker 915 under control of the CPU 901. The display 917 is an example of a display unit, such as a liquid crystal or organic electroluminescence (EL) display that displays an image of an object, and/or an operation icon, for example. The external device connection I/F 918 is an interface that connects the smartphone 9 to various external devices. The short-range communication circuit 919 is a communication circuit that communicates in compliance with a standard such as NFC or Bluetooth (registered trademark). The touch panel 921 is an example of an input device to operate the smartphone 9 according to a user operation of touching a surface of the display 917.
The smartphone 9 further includes a bus line 910. Examples of the bus line 910 include an address bus and a data bus. The bus line 910 electrically connects the elements, including the CPU 901, to one another.
In addition, a storage medium such as a compact-disc read only memory (CD-ROM) storing any of the above-described programs or an HD storing any of the above-described programs can be distributed domestically or overseas as a program product.
Functional Configuration
A description is now given of a functional configuration of an image communication system according to the present embodiment, with reference to
Functional Configuration of Image Capturing Device
As illustrated in
The image capturing device 1a further includes a memory 1000a, which is implemented by the ROM 112, the SRAM 113, and the DRAM 114 illustrated in
The image capturing device 1b includes a receiving unit 12b, an image capturing unit 13b, a sound collecting unit 14b, a communication unit 18b, a writing and reading unit 19b, and a memory 1000b. Each of the above-mentioned functional units of the image capturing device 1b implements substantially the same function as the corresponding one of the receiving unit 12a, the image capturing unit 13a, the sound collecting unit 14a, the communication unit 18a, the writing and reading unit 19a, and the memory 1000a of the image capturing device 1a, and the redundant description is omitted here.
Functional Units of Image Capturing Device
A detailed description is now given of each functional unit of the image capturing device 1a according to the present embodiment, with reference to
The receiving unit 12a of the image capturing device 1a is mainly implemented by operation of the operation device 115 illustrated in
The image capturing unit 13a is mainly implemented by operation of the imaging unit 101, the image processor 104, and the image controller 105 illustrated in
The sound collecting unit 14a is implemented by operation of the microphone 108 and the audio processor 109 illustrated in
The communication unit 18a is mainly implemented by operation of the CPU 111 and communicates with a communication unit 38a of the videoconference terminal 3a using a short range wireless communication technology in compliance with a standard such as NFC, Bluetooth (registered trademark), or Wi-Fi.
The writing and reading unit 19a is mainly implemented by operation of the CPU 111 illustrated in
Functional Configuration of Videoconference Terminal
As illustrated in
The videoconference terminal 3a further includes a memory 3000a that is implemented by the ROM 302, the RAM 303, and the flash memory 304 illustrated in
The videoconference terminal 3d includes a transmission and reception unit 31d, a receiving unit 32d, an image and audio processing unit 33d, a display control unit 34d, a determination unit 35d, a generating unit 36d, a computing unit 37d, a communication unit 38d, a writing and reading unit 39d, and a memory 3000d. Each of the above-mentioned functional units of the videoconference terminal 3d implements substantially the same function as the corresponding one of the transmission and reception unit 31a, the receiving unit 32a, the image and audio processing unit 33a, the display control unit 34a, the determination unit 35a, the generating unit 36a, the computing unit 37a, the communication unit 38a, the writing and reading unit 39a, and the memory 3000a of the videoconference terminal 3a, and the redundant description is omitted here. In addition, the memory 3000d of the videoconference terminal 3d includes an image type management DB 3001d, an image capturing device management DB 3002d, a predetermined area management DB 3003d, and a combined position management DB 3004d. These DBs 3001d, 3002d, 3003d and 3004d have substantially the same or similar data structure as or to the image type management DB 3001a, the image capturing device management DB 3002a, the predetermined area management DB 3003a, and the combined position management DB 3004a of the videoconference terminal 3a, respectively.
Image Type Management Table
The example of the image type management table illustrated in
In addition, data other than the image data may be stored in the image type management table in association with the image data ID. Examples of the data other than the image data include audio data.
Image Capturing Device Management Table
Predetermined Area Management Table
In the example of
When the transmission and reception unit 31a newly receives predetermined area information including the same set of IP addresses of the communication terminal of transmission source and the communication terminal of transmission destination that is already managed in the table, the writing and reading unit 39a overwrites the currently managed predetermined area information with the newly received predetermined area information.
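The overwrite behavior described above can be sketched with a dictionary keyed by the pair of IP addresses. This is a minimal sketch under stated assumptions: the table is modeled as an in-memory mapping, the field names in the predetermined area information are placeholders, the IP addresses are documentation-range examples, and the function name `store_area_info` is hypothetical.

```python
from typing import Dict, Tuple

# Key: (IP address of transmission source, IP address of transmission destination).
AreaKey = Tuple[str, str]
# Value: predetermined area information; the field names are placeholders.
AreaInfo = Dict[str, float]


def store_area_info(table: Dict[AreaKey, AreaInfo],
                    source_ip: str,
                    destination_ip: str,
                    area_info: AreaInfo) -> None:
    """Store area information, overwriting any entry for the same pair of IP addresses."""
    table[(source_ip, destination_ip)] = dict(area_info)


area_table: Dict[AreaKey, AreaInfo] = {}
store_area_info(area_table, "192.0.2.1", "192.0.2.2", {"pan": 10.0, "tilt": 20.0})
# A second record for the same source and destination replaces the first one.
store_area_info(area_table, "192.0.2.1", "192.0.2.2", {"pan": 15.0, "tilt": 20.0})
# A different pair of addresses is managed as a separate entry.
store_area_info(area_table, "192.0.2.3", "192.0.2.2", {"pan": 0.0, "tilt": 0.0})
```

Keying on the address pair keeps exactly one current record per combination of transmission source and transmission destination, which matches the overwrite rule stated above.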
Combined Position Management Table
A description is now given of coordinates for combining, which are examples of the combined position information, with reference to
Functional Units of Videoconference Terminal
A detailed description is now given of each functional unit of the videoconference terminal 3a according to the present embodiment, with reference to
The transmission and reception unit 31a of the videoconference terminal 3a is implemented by the network I/F 311 illustrated in
The receiving unit 32a is implemented by the operation key 308, when operating under control of the CPU 301. The receiving unit 32a receives selections or inputs from a user. In another example, an input device such as a touch panel is used in addition to or in place of the operation key 308.
The image and audio processing unit 33a is implemented by instructions from the CPU 301 illustrated in
Further, the image and audio processing unit 33a processes image data received from another communication terminal based on the image type information, such as a source name, to enable the display control unit 34a to cause the display 4 to display an image based on the processed image data. More specifically, when the image type information indicates “special image”, the image and audio processing unit 33a converts the image data such as hemispherical image data as illustrated in
The display control unit 34a is implemented by the display I/F 317, when operating under control of the CPU 301. The display control unit 34a causes the display 4 to display images or characters.
The determination unit 35a, which is mainly implemented by instructions of the CPU 301, determines an image type corresponding to image data received from, for example, the image capturing device 1a. This determination is just one example performed by the determination unit 35a, and the determination unit 35a performs other various determinations regarding image data.
The generating unit 36a is implemented by instructions of the CPU 301. The generating unit 36a generates a source name, which is one example of the image type information, according to the above-described naming rule, based on a determination result generated by the determination unit 35a indicating a general image or a special image (that is, a spherical image in the present embodiment). For example, when the determination unit 35a determines that an image type is “general image”, the generating unit 36a generates a source name of “Video” that indicates a “general image” type. On the other hand, when the determination unit 35a determines that an image type is “special image”, the generating unit 36a generates a source name of “Video Theta” that indicates a “special image” type.
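The mapping from the determination result to the source name can be sketched as follows. The function name generate_source_name and the constants are hypothetical names introduced for illustration; only the two image types and the two source names are taken from the description above.

```python
GENERAL_IMAGE = "general image"
SPECIAL_IMAGE = "special image"


def generate_source_name(image_type):
    """Return the source name for a determined image type (illustrative only)."""
    if image_type == SPECIAL_IMAGE:
        # A special image is a spherical image in the present embodiment.
        return "Video Theta"
    if image_type == GENERAL_IMAGE:
        return "Video"
    # Handling of any other value is an assumption of this sketch.
    raise ValueError(f"unknown image type: {image_type!r}")
```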
The computing unit 37a, which is mainly implemented by instructions of the CPU 301, calculates the combined position information.
The communication unit 38a is mainly implemented by the short-range communication circuit 319 and the antenna 319a, each of which operates under control of the CPU 301. The communication unit 38a communicates with the communication unit 18a of the image capturing device 1a using a short range wireless communication network in compliance with an NFC standard, Bluetooth (registered trademark), or Wi-Fi, for example. In the above description, the communication unit 38a and the transmission and reception unit 31a individually have a communication unit. In another example, the communication unit 38a and the transmission and reception unit 31a share a single communication unit.
The writing and reading unit 39a is mainly implemented by instructions from the CPU 301 illustrated in
Functional Configuration of Communication Management System
A detailed description is now given of each functional unit of the communication management system 5 according to the present embodiment, with reference to
The communication management system 5 further includes a memory 5000 that is implemented by the RAM 503 and the HD 504 illustrated in
Session Management Table
Image Type Management Table
Predetermined Area Management Table
Functional Units of Communication Management System
A detailed description is now given of each functional unit of the communication management system 5 according to the present embodiment, with reference to
The transmission and reception unit 51 of the communication management system 5 is implemented by the network I/F 509 illustrated in
The determination unit 55 is mainly implemented by operation of the CPU 501 and performs various determinations.
The generating unit 56 is mainly implemented by operation of the CPU 501 and generates an image data ID.
The writing and reading unit 59 is implemented by the HDD 505 illustrated in
Functional Configuration of PC
A detailed description is now given of a functional configuration of the PC 6 according to the present embodiment, with reference to
As illustrated in
The PC 6 further includes a memory 6000, which is implemented by the ROM 502, the RAM 503, and the HD 504 illustrated in
Functional Units of PC
The receiving unit 62 of the PC 6 is mainly implemented by operation of the keyboard 511 or the mouse 512 under control of the CPU 501 and implements substantially the same function as the receiving unit 32a. The display control unit 64 is implemented by the CPU 501, when executing according to the program, to control the display 508 to display images or characters. The communication unit 68 is mainly implemented by operation of the CPU 501 and communicates with the communication unit 38a of the videoconference terminal 3a using a short range wireless communication network in compliance with a standard such as NFC, Bluetooth (registered trademark), or Wi-Fi. The writing and reading unit 69, which is mainly implemented by instructions of the CPU 501, stores various data or information in the memory 6000 or reads out various data or information from the memory 6000.
Functional Configuration of PC as Communication Terminal
A detailed description is now given of a functional configuration of the PC 7 according to the present embodiment, with reference to
The PC 7 further includes a memory 7000, which is implemented by the ROM 502, the RAM 503 and the HD 504 illustrated in
Functional Units of PC as Communication Terminal
The transmission and reception unit 71 of the PC 7 is mainly implemented by operation of the network I/F 509 illustrated in
The receiving unit 72 is mainly implemented by operation of the keyboard 511 or the mouse 512 under control of the CPU 501 and implements substantially the same function as the receiving unit 32a. The image and audio processing unit 73 is mainly implemented by instructions from the CPU 501 and implements substantially the same function as the image and audio processing unit 33a. The display control unit 74 is mainly implemented by operation of the CPU 501 and implements substantially the same function as the display control unit 34a. The determination unit 75 is mainly implemented by operation of the CPU 501 and implements substantially the same function as the determination unit 35a. The generating unit 76 is mainly implemented by operation of the CPU 501 and implements substantially the same function as the generating unit 36a. The computing unit 77 is mainly implemented by operation of the CPU 501 and implements substantially the same function as the computing unit 37a. The communication unit 78 is mainly implemented by operation of the CPU 501 and implements substantially the same function as the communication unit 38a. The writing and reading unit 79 is implemented by operation of the CPU 501 and stores data or information in the memory 7000 or reads data or information from the memory 7000.
Functional Configuration of Smartphone
A detailed description is now given of a functional configuration of the smartphone 9 according to the present embodiment, with reference to
The smartphone 9 further includes a memory 9000, which is implemented by the ROM 902, the RAM 903, and the EEPROM 904 illustrated in
Functional Units of Smartphone
The transmission and reception unit 91 of the smartphone 9 is mainly implemented by operation of the long-range communication circuit 911 illustrated in
The receiving unit 92 is mainly implemented by the touch panel 921 under control of the CPU 901 and implements substantially the same function as the receiving unit 32a.
The image and audio processing unit 93 is mainly implemented by instructions from the CPU 901 and implements substantially the same function as the image and audio processing unit 33a. The display control unit 94 is mainly implemented by operation of the CPU 901 and implements substantially the same function as the display control unit 34a. The determination unit 95 is mainly implemented by operation of the CPU 901 and implements substantially the same function as the determination unit 35a. The generating unit 96 is mainly implemented by operation of the CPU 901 and implements substantially the same function as the generating unit 36a. The computing unit 97 is mainly implemented by operation of the CPU 901 and implements substantially the same function as the computing unit 37a. The communication unit 98 is mainly implemented by operation of the CPU 901 and implements substantially the same function as the communication unit 38a. The writing and reading unit 99 is implemented by operation of the CPU 901 and stores data or information in the memory 9000 or reads data or information from the memory 9000.
Operation or Process
Referring now to
Participation Process
Referring now to
When a user in the site A (e.g., user A1) operates the videoconference terminal 3a to display the session selection screen for selecting a communication session (virtual conference room), the receiving unit 32a receives the operation to display the session selection screen, and the display control unit 34a causes the display 4a to display the session selection screen as illustrated in
When the user A1 selects a desired selection button (in this example, the selection button b1) on the session selection screen, the receiving unit 32a receives selection of a corresponding communication session (step S22). Then, the transmission and reception unit 31a transmits, to the communication management system 5, a request to participate in the communication session, namely to enter the corresponding virtual conference room (step S23). Hereinafter, the request is also referred to as a participation request. The participation request includes a session ID identifying the communication session, which is selected and received at step S22, and the IP address of the videoconference terminal 3a, which is a request transmission source terminal. The transmission and reception unit 51 of the communication management system 5 receives the participation request.
Subsequently, the writing and reading unit 59 performs a process for causing the videoconference terminal 3a to participate in the communication session (step S24). More specifically, the writing and reading unit 59 adds, in the session management DB 5001 (see
Process of Managing Image Type Information
A description is now given of a process of managing the image type information according to the present embodiment with reference to
When a user in the site A (e.g., the user A1) connects the cradle 2a, on which the image capturing device 1a is mounted, to the videoconference terminal 3a, using a wired cable such as a USB cable, the writing and reading unit 19a of the image capturing device 1a reads the GUID of the own device (e.g., the image capturing device 1a) from the memory 1000a. Then, the communication unit 18a transmits the own device's GUID to the communication unit 38a of the videoconference terminal 3a (step S51). The communication unit 38a of the videoconference terminal 3a receives the GUID of the image capturing device 1a.
Subsequently, the determination unit 35a of the videoconference terminal 3a determines whether a vendor ID and a product ID that are same as those in the GUID received at step S51 are stored in the image capturing device management DB 3002a (see
Subsequently, the writing and reading unit 39a stores, in the image type management DB 3001a (see
Then, the transmission and reception unit 31a transmits a request for addition of the image type information to the communication management system 5 (step S54). The request for addition of the image type information includes the IP address of the own terminal (i.e., the videoconference terminal 3a) as a transmission source terminal and the image type information, both of which are stored at step S53 in association with one another. The transmission and reception unit 51 of the communication management system 5 receives the request for addition of the image type information.
Subsequently, the writing and reading unit 59 of the communication management system 5 refers to the session management DB 5001 (see
Subsequently, the generating unit 56 generates a unique image data ID (step S56). Then, the writing and reading unit 59 stores, in the image type management DB 5002 (see
Subsequently, the writing and reading unit 39a of the videoconference terminal 3a stores, in the image type management DB 3001a (see
Further, the transmission and reception unit 51 of the communication management system 5 transmits a notification of addition of the image type information to another communication terminal (i.e., the videoconference terminal 3d in the present embodiment) (step S60). The notification of addition of the image type information includes the image data ID generated at step S56, and the IP address of the own terminal (i.e., the videoconference terminal 3a) as the transmission source terminal and the image type information that are stored at step S53. The transmission and reception unit 31d of the videoconference terminal 3d receives the notification of addition of the image type information. The destination of the notification transmitted by the transmission and reception unit 51 is indicated by an IP address associated with the session ID with which the IP address of the videoconference terminal 3a is associated in the session management DB 5001 (see
Then, the writing and reading unit 39d of the videoconference terminal 3d stores, in the image type management DB 3001d (see
Process of Establishing Communication to Transmit and Receive Captured Image Data
A description is now given of a process of communicating captured image data and material image data in video communication according to the present embodiment, with reference to
As illustrated in
A description is now given of a process of transmitting captured image data, audio data, and material image data obtained in the site A illustrated in
The communication unit 18a of the image capturing device 1a transmits captured image data obtained by capturing an object or surrounding and audio data obtained by collecting sounds to the communication unit 38a of the videoconference terminal 3a (step S101). Because the image capturing device 1a is a device that is capable of obtaining two hemispherical images, from which a spherical image is generated, the captured image data is configured by data of the two hemispherical images as illustrated in
Subsequently, the communication unit 68 of the PC 6 transmits the material image data displayed by the display control unit 64 to the communication unit 38a of the videoconference terminal 3a (step S102).
Subsequently, the transmission and reception unit 31a of the videoconference terminal 3a transmits, to the communication management system 5, the captured image data and the audio data received from the image capturing device 1a, and the material image data received from the PC 6 (step S103). The transmission and reception unit 51 of the communication management system 5 receives the captured image data, the audio data, and the material image data. In step S103, along with the captured image data, an image data ID identifying the captured image data, which is a transmission target, is also transmitted.
Subsequently, the transmission and reception unit 51 of the communication management system 5 transmits the captured image data, the audio data, and the material image data to each of other participant communication terminals (i.e., the smartphone 9, the PC 7, and the videoconference terminal 3d) participating in the same video communication in which the videoconference terminal 3a is participating (steps S104, S105, S106). At each of these steps, along with the captured image data, the image data ID identifying the captured image data, which is a transmission target, is also transmitted. Accordingly, each of the transmission and reception unit 91 of the smartphone 9, the transmission and reception unit 71 of the PC 7 and the transmission and reception unit 31d of the videoconference terminal 3d receives the captured image data and the image data ID, and further receives the audio data and the material image data.
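The selection of the other participant communication terminals can be sketched as follows, assuming that the participants of each video communication are managed as a mapping from a session ID to the IP addresses of the participant terminals. The function name relay_targets, the session ID, and the IP addresses are hypothetical and are used only for illustration.

```python
def relay_targets(session_table, sender_ip):
    """Return the IP addresses of the other terminals in the sender's session.

    session_table maps a session ID to a list of participant IP addresses
    (an assumed representation of the managed sessions).
    """
    for participant_ips in session_table.values():
        if sender_ip in participant_ips:
            # Relay to every participant except the transmission source.
            return [ip for ip in participant_ips if ip != sender_ip]
    # The sender is not participating in any managed session.
    return []


sessions = {"se101": ["192.0.2.1", "192.0.2.2", "192.0.2.3", "192.0.2.4"]}
targets = relay_targets(sessions, "192.0.2.1")
```

In this sketch, the captured image data, the image data ID, the audio data, and the material image data would then be transmitted to each address in targets.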
A description is now given of examples of a screen of the display 917 in the site B, according to the present embodiment with reference to
When images of the captured image data transmitted from the image capturing device 1a and the image capturing device 1b, each of which captures a spherical image, are displayed as they are, the images of the site A and the site B are displayed as illustrated in
On the other hand, when the image and audio processing unit 93 generates a spherical image based on the captured image data output from the image capturing device 1a and the image capturing device 1b, each of which obtains two hemispherical images from which a spherical image is generated, and further generates a predetermined area image, the predetermined area image, which is a planar image, is displayed as illustrated in
Furthermore, a user is able to change the predetermined area corresponding to the predetermined area image in the same spherical image. For example, when the user B1 operates the touch panel 921, the receiving unit 92 receives a user operation to shift the predetermined area image, and the display control unit 94 shifts, rotates, reduces, or enlarges the predetermined area image. Thereby, a default predetermined area image in which the user A1 and the user A2 are displayed as illustrated in
Note that celestial sphere icons 191 and 192 illustrated in
A description is now given of a process performed by the image communication system, when a predetermined area image as illustrated in
First, when the user D1, D2 or D3 operates the videoconference terminal 3d in the site D to display the predetermined area image of the site A as illustrated in
The writing and reading unit 59 of the communication management system 5 stores, in the predetermined area management DB 5003, the predetermined area information and the IP address of the transmission source terminal and the IP address of the transmission destination terminal, which are received at step S111, in association with one another (step S112). The processing of step S111 and step S112 is performed each time when the predetermined area image is changed in the videoconference terminal 3d, for example, from the one as illustrated in
The writing and reading unit 59 of the communication management system 5 reads out, from a plurality of sets of the predetermined area information and the IP addresses of the transmission source terminal and the transmission destination terminal stored in the predetermined area management DB 5003, the latest (the most recently stored) set of predetermined area information and the IP addresses of the transmission source terminal and the transmission destination terminal, at preset intervals such as every thirty seconds (step S113). Next, the transmission and reception unit 51 distributes (transmits) the predetermined area information including the IP addresses read at step S113, to other communication terminals (i.e., the videoconference terminal 3a, the smartphone 9, and the PC 7) participating in the same video communication in which the videoconference terminal 3d, which is the transmission source terminal of the predetermined area information, is participating (steps S114, S116, S118). The videoconference terminal 3a receives the predetermined area information and the IP addresses at the transmission and reception unit 31a. The writing and reading unit 39a stores the predetermined area information and the IP addresses received at step S114 in association with one another in the predetermined area management DB 3003a (
A description is now given of another process of sharing predetermined area information according to the present embodiment, with reference to
In the process described above with reference to
The operation illustrated in
In the operation illustrated in
Next, the transmission and reception unit 51 of the communication management system 5 transmits the predetermined area information including the IP addresses received at step S211 to the videoconference terminal 3a, which is a transmission source terminal of the captured image data (step S212). The videoconference terminal 3a receives the predetermined area information and the IP addresses at the transmission and reception unit 31a.
Next, the writing and reading unit 39a of the videoconference terminal 3a stores, in the predetermined area management DB 3003a, the predetermined area information, the IP address of the transmission source terminal and the IP address of the transmission destination terminal, which are received at step S212, in association with one another (step S213). This processing of S213 is a process of managing how the captured image data transmitted from the own terminal (i.e., the videoconference terminal 3a, in this example) is displayed in each of the other communication terminals. The processing of S211 to S213 is performed each time the predetermined area image is changed in the videoconference terminal 3d.
The writing and reading unit 39a of the videoconference terminal 3a reads out, from a plurality of sets of the predetermined area information and the IP address of each of the transmission source terminal and the transmission destination terminal stored in the predetermined area management DB 3003a, the latest (the most recently stored) set of predetermined area information and the IP address of each of the transmission source terminal and the transmission destination terminal, at preset intervals such as every thirty seconds (step S214). Then, the transmission and reception unit 31a transmits the predetermined area information including the IP addresses read out at step S214 to the communication management system 5 (step S215). The transmission and reception unit 51 of the communication management system 5 receives the predetermined area information.
Next, the transmission and reception unit 51 of the communication management system 5 transmits (distributes) the predetermined area information including the IP addresses received at step S215 to each of the communication terminals (i.e., the videoconference terminal 3d, the smartphone 9, the PC 7) (steps S216, S218, S220). The videoconference terminal 3d receives the predetermined area information including the IP addresses at the transmission and reception unit 31d. The writing and reading unit 39d stores, in the predetermined area management DB 3003d, the predetermined area information received at step S216 in association with the IP addresses that are also received at step S216 (step S217). In substantially the same manner, the transmission and reception unit 91 of the smartphone 9 receives the predetermined area information and the IP addresses. Then, the writing and reading unit 99 stores, in the predetermined area management DB 9003, the predetermined area information received at step S218 in association with the IP addresses that are also received at step S218 (step S219). Further, the transmission and reception unit 71 of the PC 7 receives the predetermined area information and the IP addresses. The writing and reading unit 79 stores, in the predetermined area management DB 7003, the predetermined area information received at step S220 in association with the IP addresses that are also received at step S220 (step S221).
Thus, the predetermined area information indicating the predetermined area image changed in the site A is transmitted to each of the communication terminals in the other sites B, C and D participating in the same video communication. As a result, the predetermined area information indicating the predetermined area image being displayed in the site A is shared by the other communication terminals in the other sites B, C and D. This operation is performed in substantially the same manner, when the predetermined area image being displayed at any one of the communication terminals in the sites B, C, and D is changed. Accordingly, the predetermined area information indicating the predetermined area image being displayed by the communication terminal in any one of the sites is shared by the other communication terminals in the other sites which are participating in the same video communication.
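Reading out the most recently stored set at preset intervals and distributing it can be sketched as follows. This is a sketch under stated assumptions, not the implementation of the embodiment: the names AreaStore, read_latest, and distribute_once, the sequence counter used to identify the latest set, and the send callback are all introduced for illustration. The caller is assumed to invoke distribute_once at the preset interval (thirty seconds in the example above).

```python
import itertools


class AreaStore:
    """Hypothetical store of predetermined area information per address pair."""

    def __init__(self):
        self._rows = {}
        self._sequence = itertools.count()

    def store(self, source_ip, destination_ip, area_info):
        # Each stored set is tagged with an increasing sequence number so
        # that the most recently stored set can be identified later.
        self._rows[(source_ip, destination_ip)] = (next(self._sequence), area_info)

    def read_latest(self):
        # Returns (source_ip, destination_ip, area_info) of the latest set,
        # or None when nothing has been stored yet.
        if not self._rows:
            return None
        (source_ip, destination_ip), (_, area_info) = max(
            self._rows.items(), key=lambda item: item[1][0]
        )
        return source_ip, destination_ip, area_info


def distribute_once(store, recipient_ips, send):
    """Send the latest set to every recipient; return the number of sends."""
    latest = store.read_latest()
    if latest is None:
        return 0
    for recipient_ip in recipient_ips:
        send(recipient_ip, latest)
    return len(recipient_ips)
```

Distributing only the latest set at each interval, rather than every change, limits the amount of transmitted data when the predetermined area image is changed frequently.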
Referring now to
The receiving unit 32a receives selection of a material image according to an operation by the user A1, A2, A3 or A4 in the site A (step S71). For example, the user A1, A2, A3 or A4 selects the material image by right clicking, double clicking, or pressing a corresponding key on a portion of the material image displayed on a preview screen of the spherical image being distributed from the videoconference terminal 3a. In a case where a terminal including a touch panel as an input device is used, the user can select the material image by long tapping or double tapping the material image displayed in the preview screen. Subsequently, the receiving unit 32a receives a change of a combined position of the material image according to an operation by the user A1, A2, A3 or A4 (step S72).
Then, the receiving unit 32a receives determination of the combined position of the material image according to an operation by the user A1, A2, A3 or A4, and the computing unit 37a generates combined position information (step S73). For example, the receiving unit 32a accepts that the combined position is determined automatically after the operation at step S72. In another example, the receiving unit 32a accepts the determination of the combined position according to the user operation of an “Enter” button on the screen or pressing a preset key.
A description is now given of an example of a screen displayed when the user A1, A2, A3 or A4 changes the combined position of the material image, with reference to
Referring to
A description is now given of an operation, in correspondence to the above-described steps S71 to S73, of the user A1, A2, A3 or A4, with reference to
The method of changing the combined position is not limited to the “drag and drop” described with reference to
Referring to
Next, the transmission and reception unit 51 distributes (transmits) the combined position information to other communication terminals (the videoconference terminal 3d, the smartphone 9, the PC 7) participating in the same video communication in which the videoconference terminal 3a, which is the transmission source terminal of the combined position information, is participating (steps S75, S77, S79). The videoconference terminal 3d receives the combined position information including the IP addresses at the transmission and reception unit 31d. Then, the writing and reading unit 39d stores the combined position information received at step S75 in the combined position management DB 3004d in association with the transmission source address (step S76). In substantially the same manner, the transmission and reception unit 91 of the smartphone 9 receives the combined position information and the IP addresses. Then, the writing and reading unit 99 stores the combined position information received at step S77 in the combined position management DB 9004 in association with the transmission source address (step S78). Further, the transmission and reception unit 71 of the PC 7 receives the combined position information and the IP addresses. The writing and reading unit 79 stores, in the combined position management DB 7004, the combined position information received at step S79 in association with the IP addresses that are also received at step S79 (step S80).
First, the writing and reading unit 99 of the smartphone 9 searches the image type management DB 9001 (see
Next, the determination unit 95 determines whether the image type information read at step S131 indicates “special image” (step S132). When the image type information does not indicate a special image (NO at step S132), the process proceeds to step S134, at which a determination is made as to whether there is image type information that has not been read yet. When there is no image type information that has not been read (NO at step S134), the process ends. When there is image type information that has not been read (YES at step S134), the process returns to step S131. On the other hand, when the image type information indicates a special image (YES at step S132), the determination unit 95 determines whether material image data is received (step S133). In step S133, for example, when there is image data that is not managed by an image data ID, the image data is determined to be the material image data. When the material image data is not received (NO at step S133), the image and audio processing unit 93 displays the spherical image (step S137), and the process ends. When the material image data is received (YES at step S133), the determination unit 95 determines whether the transmission source IP address of the image information is stored in the combined position management DB 9004 (step S135). When the transmission source IP address is not stored (NO at step S135), the image and audio processing unit 93 combines the material image data with the spherical image at an initial combined position (default combined position) that is a predetermined combined position on the spherical image (step S136). Then, the spherical image is displayed (step S137).
When the determination result of step S135 indicates that the transmission source IP address is stored, that is, if Yes at step S135, the writing and reading unit 99 acquires the combined position from the combined position management DB 9004 (step S138). Subsequently, the image and audio processing unit 93 combines the material image data with the spherical image at the acquired combined position (step S139) and displays the spherical image (step S137).
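The branching of steps S132 to S139 described above can be sketched as follows. The function name plan_display, the returned action labels, and the tuple used as a combined position are hypothetical and serve only to illustrate the order of the determinations. The combined position management DB is represented here as a dictionary keyed by the transmission source IP address, which is an assumption of this sketch.

```python
def plan_display(image_type, material_received, source_ip,
                 combined_positions, default_position):
    """Return (action, combined_position) for one set of received image data."""
    # Step S132: only a special image (spherical image) is processed here.
    if image_type != "special image":
        return ("check_next_image_type", None)  # proceed to step S134

    # Step S133: without material image data, display the spherical image as is.
    if not material_received:
        return ("display_spherical_image", None)  # step S137

    # Step S135: is a combined position managed for this transmission source?
    position = combined_positions.get(source_ip)
    if position is None:
        # Step S136: combine at the initial (default) combined position.
        return ("combine_and_display", default_position)

    # Steps S138 and S139: combine at the managed combined position.
    return ("combine_and_display", position)


# Placeholder coordinates; the actual form of the combined position
# information is defined elsewhere in the disclosure.
managed = {"192.0.2.1": (10, 20)}
result = plan_display("special image", True, "192.0.2.1", managed, (0, 0))
```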
Thus, the combined position information changed in the site A is transmitted to each of the communication terminals in the other sites B, C and D participating in the same video communication. As a result, the material image being displayed in the site A is also displayed by the other communication terminals in the other sites B, C and D to be shared.
As described heretofore, the communication terminal, such as the videoconference terminal 3a, according to one or more of the embodiments, generates a spherical image and a predetermined area image based on image type information associated with the image data ID transmitted with image data.
This prevents the front-side hemispherical image and the back-side hemispherical image from being displayed as illustrated in
Further, according to the present embodiment, in a conference system or the like in which a transmission destination terminal combines another image (combination source image) with a part of a certain image (combination destination image) and displays the combined image, a transmission source terminal can change the position at which the combination source image is displayed according to a user operation. Therefore, by changing the combined position, the embodiment described above can provide an image that is easy to view and easy to understand in a case where the combination source image is combined at a position that is not intended by a user of the transmission source terminal, or in a case where the user desires to combine the image at a different position during a conference.
In the above embodiments, a captured image (whole image) is a three-dimensional spherical image, as an example of a panoramic image, which is a destination to be combined. In another example, the captured image is a two-dimensional panoramic image. In addition, in this disclosure, the spherical image does not have to be a full-view spherical image. For example, the spherical image can be a wide-angle view image having an angle of about 180 to any amount less than 360 degrees in the horizontal direction. It is desirable that the spherical image is image data having at least a part that is not entirely displayed in the predetermined area.
Further, the spherical image or any other image being captured, if desired, can be made up of multiple pieces of image data which have been captured through different lenses, or using different image sensors, or at different times.
Further, in the above-described embodiments, the communication management system 5 transfers the predetermined area information transmitted from each communication terminal. In another example, each communication terminal can directly transmit the predetermined area information to, or receive the predetermined area information from, any one or more of the other communication terminals.
Each of the functions of the above-described embodiments may be implemented by one or more processing circuits or circuitry. The processing circuitry includes a programmed processor, as a processor includes circuitry. A processing circuit also includes devices such as an application specific integrated circuit (ASIC), a digital signal processor (DSP), a field programmable gate array (FPGA), a system on a chip (SOC), a graphics processing unit (GPU), and conventional circuit components arranged to perform the recited functions.
Any one of the above-described operations may be performed in various other ways, for example, in an order different from the one described above.
Although the embodiments of the disclosure have been described and illustrated above, such description is not intended to limit the disclosure to the illustrated embodiments. Numerous additional modifications and variations are possible in light of the above teachings. It is therefore to be understood that within the scope of the appended claims, the embodiments may be practiced otherwise than as specifically described herein. For example, elements and/or features of different illustrative embodiments may be combined with each other and/or substituted for each other within the scope of this disclosure and appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2018-068440 | Mar 2018 | JP | national |