The present disclosure relates to an information processing system, an information processing method, and a program.
Conventionally, various technologies for realizing communication between remote places, such as a video conference system, have been developed.
For example, Patent Document 1 describes a technology for enabling a plurality of information processing apparatuses to share a common display screen, and presenting information of a real object present in one work space to the other work space as display information.
However, in the technology described in Patent Document 1, generating an image corresponding to an operation body adaptively to an operation of the operation body in a certain place and displaying the image in another place has not been considered.
Therefore, in the present disclosure, proposed are new and improved information processing system, information processing method, and program capable of displaying an image corresponding to an operation body in a certain place adaptively generated for an operation of the operation body in another place.
According to the present disclosure, provided is an information processing system including: an information acquisition unit configured to acquire first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and an output control unit configured to display, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
Furthermore, according to the present disclosure, provided is an information processing method including: acquiring first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and displaying, by a processor, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
Furthermore, according to the present disclosure, provided is a program for causing a computer to function as: an information acquisition unit configured to acquire first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and an output control unit configured to display, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
As described above, according to the present disclosure, an image corresponding to an operating body adaptively generated for an operation of the operation body at a certain place can be displayed at another place. Note that effects described here are not necessarily limited, and any of effects described in the present disclosure may be exhibited.
Favorable embodiments of the present disclosure will be described in detail with reference to the appended drawings. Note that, in the present specification and drawings, redundant description of configuration elements having substantially the same functional configuration is omitted by providing the same sign.
Furthermore, in the present specification and the drawings, a plurality of configuration elements having substantially the same functional configuration may be distinguished by providing different alphabets to the same reference numeral. For example, a plurality of configuration elements having substantially the same functional configuration is distinguished such as a table 90a and a table 90b as needed. However, in a case where there is no need to distinguish the plurality of configuration elements having substantially the same functional configuration, only the same reference numeral will be provided. For example, in the case where there is no need to distinguish the table 90a and the table 90b, these tables are simply referred to as table(s) 90.
Furthermore, “modes for carrying out the invention” will be described according to the order of items illustrated below.
1. Configuration of Information Processing System
2. First Embodiment
3. Second Embodiment
4. Third Embodiment
5. Fourth Embodiment
6. Fifth Embodiment
7. Sixth Embodiment
8. Seventh Embodiment
9. Eighth Embodiment
10. Ninth Embodiment
11. Tenth Embodiment
12. Eleventh Embodiment
13. Twelfth Embodiment
14. Thirteenth Embodiment
15. Fourteenth Embodiment
16. Fifteenth Embodiment
17. Sixteenth Embodiment
18. Seventeenth Embodiment
19. Eighteenth Embodiment
20. Nineteenth Embodiment
21. Twentieth Embodiment
22. Hardware Configuration
23. Modification
First, a configuration example of an information processing system common to embodiments of the present disclosure will be described with reference to
In the present specification, a system may mean a configuration for executing predetermined processing. The system may include one device or may include a plurality of devices. Furthermore, the information processing system 10 according to each embodiment is only required to be able to execute the predetermined processing as the entire information processing system 10, and which configuration in the information processing system 10 being regarded as one device is arbitrary.
Note that, in the present specification and drawings, the information processing systems 10 at the respective points may be distinguished by adding different numbers to after the “information processing systems 10”. For example, the information processing system 10 at a “point A” may be described as information processing system 10-1, and the information processing system 10 at a “point B” may be described as information processing system 10-2 or the like.
The communication network 22 is a wired or wireless transmission path of information transmitted from a device connected to the communication network 22. For example, the communication network 22 may include a public network such as a telephone network, the Internet, and a satellite network, various local area networks including Ethernet (registered trademark), a wide area network (WAN), and the like. Furthermore, the communication network 22 may include a leased line network such as an internet protocol-virtual private network (IP-VPN).
The output unit 124a displays various types of information on the table 90a. The output unit 124a can include a projection unit (projector). For example, as illustrated in
For example, the output unit 124a displays a virtual display object according to the control of a display processing unit 122 to be described below. The display object is, for example, a window, a UI object, or the like. The UI object is a predetermined image (a still image or a moving image) that receives various operations (such as selection and input) by a user. For example, the UI object is an image including a graphical user interface (GUI) component (for example, a button, a slider, a check box, a text box, a software keyboard, or the like). Furthermore, the UI object can be arranged in the window.
Moreover, the output unit 124a can include a sound output device such as a speaker, for example. For example, the output unit 124a outputs a sound according to the control of the display processing unit 122 and the like.
The input unit 120a includes, for example, a camera that images an image of the table 90a with one lens. Alternatively, the input unit 120a can include a stereo camera capable of recording information in a depth direction by imaging the table 90a with two lenses. As the stereo camera, for example, a visible light camera, an invisible light camera capable of detecting invisible light such as infrared light, or the like can be used. Furthermore, the input unit 120a can further include a sound input device such as a microphone that collects a voice uttered by the user and an environmental sound of a surrounding environment.
In the case where the above camera is used as the input unit 120a, the information processing system 10a analyzes the image (captured image) imaged by the camera to determine the position of an object (for example, a hand of the user, or the like) located on the screen 20. Furthermore, in the case where the stereo camera is used as the input unit 120a, the information processing system 10a analyzes the image imaged by the stereo camera to be able to acquire depth information of the object in addition to position information of the object located on the screen 20. The information processing system 10a can detect contact or proximity of the hand of the user to the screen 20, and detachment of the hand from the screen 20 on the basis of the depth information. Note that the input unit 120a may include a depth sensor (for example, a time of flight sensor, a structured light sensor, or the like) instead of the stereo camera. In this case, the depth sensor can obtain the depth information of the object located on the screen 20.
In each embodiment, the position of an operation body (various operation members and the like such as a hand of the user and a stylus, for example) on the screen 20 is detected on the basis of the image imaged by the input unit 120a, and various types of information can be input on the basis of the detected position of the operation body. That is, the user can perform various operation inputs by moving the operation body on the screen 20. For example, when a contact of the hand of the user with respect to the window or the UI object is detected, an operation input with respect to the window or the UI object is performed.
Furthermore, the camera included in the input unit 120a may capture not only the top surface of the table 90a but also the user present around the table 90a. In this case, the information processing system 10a can detect the position of the user around the table 90a on the basis of the captured image by the input unit 120a. Furthermore, the information processing system 10a may perform personal recognition of the user by extracting physical characteristics (such as the size of the face and the body) with which the individual user is specifiable on the basis of the captured image.
The present invention is not limited to the above-described example, and the operation input by the user may be executed by another method. For example, the input unit 120a may be installed as a touch panel on the top surface (screen 20a) of the table 90a. Then, the operation input of the user may be detected by a contact of a finger or the like of the user with respect to the touch panel. Furthermore, the operation input of the user may be detected by a gesture with respect to the camera included in the input unit 120a.
The configuration of the information processing system 10a according to each embodiment has been described above. Note that the configuration of the information processing system 10 according to each embodiment is not limited to the example illustrated in
Furthermore, in the example illustrated in
Alternatively, the output unit 124d may be configured as a non-transmissive display device. For example, the output unit 124d can include a liquid crystal display (LCD), an organic light emitting diode (OLED), and the like. In this case, the camera included in the input unit 120d may capture a video in front of the user wearing the head mounted device, and the output unit 124d may sequentially display the captured video. Thereby, the user can view the scenery in front of the user through the video displayed on the output unit 124d.
The configuration of the information processing system 10 according to each embodiment has been described above. By the way, communication is desired between users located at different points on the basis of information acquired by the input units 120 at the respective points.
In this case, if the video itself acquired by the input unit 120 at each point is shared, performing image processing is required in real time. Therefore, a calculation load may be large and a transmission cost may be large.
Furthermore, in the case where the video itself is transmitted, information unnecessary for the communication (for a discussion or the like, for example) may be transmitted. Therefore, such transmission of information may adversely affect the communication. For example, when a certain point is a home of the user and the user is wearing at-home clothes as illustrated in
Note that a method of transmitting only a pointing position of the user among the information acquired by the input unit 120 at each point to another point is conceivable. For example, as illustrated in
Therefore, the information processing system 10 according to each embodiment has been made in consideration of the above circumstances. The information processing system 10 according to each embodiment can acquire first transmission information according to an operation of an operation body related to a first user located at a first point, the first transmission information being for communication with a second user located at a second point, and then display an image corresponding to the operation body generated on the basis of a pointing position of the operation body at the second point and a first virtual position of the first user at the second point, the pointing position and the first virtual position being specified on the basis of the first transmission information, on a display surface in the second point. Thereby, the amount of information transmitted between different points can be reduced while realizing smooth communication between the different points.
Here, the operation body related to the user may be a hand of the user or may be an operation device (for example, a stylus, a mouse, a keyboard, a touch panel, or the like) being used by the user.
For example, in a case where the operation body is the hand of the user, the first transmission information can include information of the pointing position of the hand, position information of the hand, and position information of the arm (on the same side as the hand). Moreover, the first transmission information may include the shape of the hand (for example, whether or not the appropriate hand is open, or the like). Here, the position of the hand (hereinafter sometimes referred to as hand position) is an example of a “second virtual position” according to the present disclosure. Furthermore, the position of the arm (hereinafter sometimes referred to as arm position) is an example of the “first virtual position” according to the present disclosure. Alternatively, in a case where the operation body is the operation device, the first transmission information can include information of the pointing position corresponding to the operation device. Hereinafter, an example in which the operation body is the hand of the user will be mainly described.
Note that, in the present specification and drawings, the description “hand” is mainly used as an expression meaning the hand itself. However, the present invention is not limited to such an example, and the description “hand” may be used as an expression meaning the finger, hand, arm, or two or more of the aforementioned parts (for example, upper limb or the like). Hereinafter, the content of each embodiment will be sequentially described in detail.
First, a functional configuration according to a first embodiment will be described in detail.
The display processing unit 122 may to include one or a plurality of processing circuits (for example, a central processing unit (CPU), a graphics processing unit (GPU), and the like). The display processing unit 122 performs processing regarding graphics to be displayed by the output unit 124 on the basis of input information acquired by the input unit 120. For example, the display processing unit 122 performs drawing control of a display object such as a window or performs drawing processing for the individual display object according to an operation of the user.
Moreover, the display processing unit 122 can output the input information (for example, a captured image or the like) acquired by the input unit 120 to the processing unit 100 to be described below. Furthermore, the display processing unit 122 receives information processed by the processing unit 100 from the processing unit 100, and performs the drawing processing on the basis of the information.
The processing unit 100 can include one or a plurality of processing circuits (for example, a CPU 150 to be described below and the like). The processing unit 100 performs various types of processing on the basis of the input information acquired by the input unit 120.
Furthermore, as illustrated in
The information generation unit 102 generates transmission information for communication with the other user located at another point on the basis of the various types of information acquired by the input unit 120.
For example, the information generation unit 102 first acquires information regarding the operation body (for example, the hand of the user or the like) being used by an individual user at a principal point on the basis of the various types of information acquired by the input unit 120. Then, the information generation unit 102 generates the transmission information according to the acquired information. For example, the transmission information has a smaller amount of information than the information regarding the operation body. As an example, the transmission information is abstracted information of the information regarding the operation body. Here, the principal point is an example of a second point according to the present disclosure. The principal point can be a point (current point) where the appropriate information processing system 10 is located.
For example, in the case where the operation body is the hand of the user, the information generation unit 102 generates the transmission information on the basis of the input information (the captured image and the like) acquired by the input unit 120 and an information generation and reproduction rule DB 130 to be described below. Here, the information generation and reproduction rule DB 130 is a database in which a rule at the time of generation of the transmission information and a rule at the time of reproduction of the transmission information when the transmission information is received from another point (for example, display of a video according to the transmission information) are registered.
In the example illustrated in
Moreover, the information generation unit 102 may specify information (for example, the position, shape, texture, and the like) regarding individual object arranged on the screen 20-1 on the basis of the input information. Moreover, the information generation unit 102 may specify sound information (for example, a voice uttered by the appropriate user, an environmental sound, and the like) at the principal point on the basis of the input information. Then, the information generation unit 102 can generate transmission information to include all the pieces of information.
Alternatively, in the case where the operation body is the operation device, the information generation unit 102 generates the transmission information on the basis of operation information acquired from the operation device and the information generation and reproduction rule DB 130. For example, the information generation unit 102 generates the transmission information to include at least the pointing position of the operation device.
The transmission control unit 104 controls transmission of the transmission information generated by the information generation unit 102 to another point. For example, the transmission control unit 104 causes the communication unit 126 to be described below to transmit the generated transmission information to the information processing systems 10 at individual other points currently in communication.
The information acquisition unit 106 performs reception or read processing to acquire the transmission information from the communication unit 126, which has been received from another point by the communication unit 126.
The output control unit 108 controls the output of various types of information to the output unit 124 on the basis of the transmission information acquired by the information acquisition unit 106. For example, the output control unit 108 displays an image corresponding to the operation body generated on the basis of the pointing position of the operation body corresponding to the transmission information (for example, the hand of the user or the like (in the point) of a transmission source of the transmission information) at the principal point (in other words, the point where the appropriate information processing system 10 is located), and the arm position of the user of the transmission source of the transmission information at the principal point, the pointing position and the arm position being specified on the basis of the acquired transmission information, on the screen 20 (display surface) in the principal point.
As an example, the output control unit 108 first specifies the pointing position of the hand of the user of the transmission source, a virtual hand position of the user of the transmission source, and a virtual arm position of the user of the transmission source at the principal point on the basis of the position information of the finger, hand, and arm of the user at the point of the transmission source included in the acquired transmission information. Next, the output control unit 108 generates (reconstructs) an image corresponding to the hand of the user of the transmission source in full size, as illustrated in
According to this display example, in the scene where the hand of the user is used as the operation body at the point of the transmission source, and the pointing position is moved by the user moving the hand, as illustrated in
The communication unit 126 transmits and receives various types of information to and from the information processing system at another point according to the control of the transmission control unit 104. For example, the communication unit 126 transmits the transmission information generated by the information generation unit 102 to the information processing systems 10 at other points currently in communication according to the control of the transmission control unit 104. Furthermore, the communication unit 126 receives the transmission information generated by the information processing system 10 at another point from the another point.
For example, in a case of communication with the information processing system 10 within the same point, the communication unit 126 performs general inter-process communication. Furthermore, in a case of communication between different points, the communication unit 126 performs communication via the communication network 22.
The storage unit 128 stores various data and various types of software. For example, the storage unit 128 stores an information generation and reproduction rule DB 130 and an information conformity rule DB 132. The information conformity rule DB 132 is a database in which a rule for determining appropriateness of presenting transmission information received from another point to the user are registered. The information conformity rule DB 132 can be realized, for example, in the form of a relational database or a look-up table.
The functional configuration according to the first embodiment has been described above. Next, a flow of processing according to the first embodiment will be described in “2-2-1. Flow of Processing at the Time of Generating Transmission Information” and “2-2-2. Flow of Processing At the Time of Reproducing Transmission Information”. Note that the information processing system 10 at each point can basically execute the “flow of processing at the time of generating transmission information” and the “flow of processing at the time of reproducing transmission information” in parallel.
First, the “flow of processing at the time of generating transmission information” will be described with reference to
As illustrated in
Next, the information generation unit 102 performs “transmission information generation processing” to be described below (S103).
Next, the communication unit 126 transmits the transmission information generated in S103 to each information processing system 10 at another point being in communication, for example, according to the control of the transmission control unit 104 (S105).
Next, the processing unit 100 determines whether or not a termination operation has been performed by the user (S107). In a case where the termination operation has not been performed (S107: No), the processing unit 100 repeats the processing of S101 and the subsequent steps again. On the other hand, in a case where the termination operation has been performed (S107: Yes), the flow of the processing is terminated.
Here, content of S103 (“transmission information generation processing”) will be described in detail with reference to
Then, the information generation unit 102 performs the following processing of S157 to S165 as long as I is equal to or less than N (S155: Yes). Note that in a case where I becomes larger than N (S155: No), the “transmission information generation processing” is terminated.
Specifically, first, the information generation unit 102 determines whether or not information of all the information types defined in the I-th generation rule stored in the information generation and reproduction rule DB 130 is included in the input information acquired in S101. For example, in a case where the finger position information is defined as the information type in the I-th generation rule, the information generation unit 102 determines whether or not the finger position information is included in the input information acquired in S101. Furthermore, in a case where a sound is defined as the information type in the I-th generation rule, the information generation unit 102 determines whether or not a sound collection result with a volume of a predetermined threshold or larger is included in the input information acquired in S101 (S157). In a case where the information of at least one information type is not included in the input information (S157: No), the information generation unit 102 performs processing of S165 to be described below.
On the other hand, in a case where the information of all the information types is included in the input information (S157: Yes), the information generation unit 102 next determines whether or not the condition at generation associated with the I-th generation rule is established in the information generation and reproduction rule DB 130 (S159). In a case where the information generation unit 102 determines that the condition at generation is not established (S159: No), the information generation unit 102 performs processing of S165 to be described below.
On the other hand, in a case where the information generation unit 102 determines that the condition at generation is established (S159: Yes), the information generation unit 102 performs the processing at generation associated with the I-th generation rule in the information generation and reproduction rule DB 130 (S161).
Then, the information generation unit 102 stores the processing result of S161 in the transmission information to be generated. For example, the information generation unit 102 stores the position information of the finger, hand, and arm of the user in the transmission information to be generated (S163).
Next, the information generation unit 102 adds “1” to I (S165). Thereafter, the information generation unit 102 repeats the processing of S155 and the subsequent steps again.
Next, the “flow of processing at the time of reproducing transmission information” will be described with reference to
As illustrated in
Next, the output control unit 108 performs “transmission information check processing” to be described below (S203).
Next, the output control unit 108 performs “output processing based on transmission information” to be described below (S205).
Next, the processing unit 100 determines whether or not the termination operation has been performed by the user (S207). In the case where the termination operation has not been performed (S207: No), the processing unit 100 repeats the processing of S201 and the subsequent steps again. On the other hand, in the case where the termination operation has been performed (S207: Yes), the flow of the processing is terminated.
Here, content of S203 (“transmission information check processing”) will be described in detail with reference to
Then, the output control unit 108 performs the following processing of S227 to S235 as long as I is equal to or less than N (S225: Yes). Note that in a case where I becomes larger than N (S225: No), the “transmission information check processing” is terminated.
Specifically, first, the output control unit 108 determines whether or not information of all the information types defined in the I-th conformity rule stored in the information conformity rule DB 132 is included in the transmission information acquired in S201 (S227). In a case where the information of at least one information type is not included in the transmission information (S227: No), the output control unit 108 performs processing of S235 to be described below.
On the other hand, in a case where the information of all the information types is included in the transmission information (S227: Yes), the output control unit 108 specifies a check target associated with the I-th conformity rule in the information generation and reproduction rule DB 130. Then, the output control unit 108 specifies a value of the check target on the basis of the transmission information and the like, for example (S229).
Next, the output control unit 108 determines whether or not the value specified in S229 satisfies the conformity condition associated with the I-th conformity rule in the information generation and reproduction rule DB 130 (S231). In a case where the value specified in S229 satisfies the conformity condition (S231: Yes), the output control unit 108 performs processing of S235 to be described below.
On the other hand, in a case where the value specified in S229 does not satisfy the conformity condition (S231: No), the output control unit 108 performs processing at condition unestablished associated with the I-th conformity rule in the information generation and reproduction rule DB 130 (S233).
Next, the output control unit 108 adds “1” to I (S235). Thereafter, the output control unit 108 repeats the processing of S225 and the subsequent steps again.
Here, content of S205 (“output processing based on transmission information”) will be described in detail with reference to
Then, the output control unit 108 performs the following processing of S257 to S261 as long as I is equal to or less than N (S255: Yes). Note that in a case where I becomes larger than N (S255: No), the “output processing based on transmission information” is terminated.
Specifically, first, the output control unit 108 determines whether or not information of all the information types defined in the I-th reproduction rule stored in the information generation and reproduction rule DB 130 is included in the transmission information acquired in S201 (S257). In a case where the information of at least one information type is not included in the transmission information (S257: No), the output control unit 108 performs processing of S261 to be described below.
On the other hand, in a case where the information of all the information types is included in the transmission information (S257: Yes), the output control unit 108 performs the processing at reproduction associated with the I-th reproduction rule in the information generation and reproduction rule DB 130 (S259).
Next, the output control unit 108 adds “1” to I (S261). Thereafter, the output control unit 108 repeats the processing of S255 and the subsequent steps again.
As described above, the information processing system 10 according to the first embodiment acquires the first transmission information according to the operation of the operation body related to the first user located at another point, the first transmission information being for communication with the second user located at the principal point, and then displays the image corresponding to the operation body generated on the basis of the pointing position of the operation body at the principal point and the arm position of the first user at the principal point, the pointing position and the arm position being specified on the basis of the first transmission information, on the display surface in the principal point. For this reason, the amount of information transmitted between different points can be reduced while realizing smooth communication between the different points.
For example, the information processing system 10 shares only the information necessary for communication generated on the basis of the input information with another point as the transmission information, instead of sharing the input information (the video and the like) itself acquired by the input unit 120. Therefore, components that inhibit concentration on the communication can be cut, and thus the users at different points are expected to be more able to concentrate on the communication. For example, the users at different points can participate in the discussion without hesitation regardless of his/her skin, clothes, and the like. Furthermore, for similar reasons, the users at different points are expected to be able to discuss more smoothly and to be able to easily understand the intention of the other party.
Furthermore, the information processing system 10 can specify the position information at the principal point of the finger, hand, and arm of the user at another point on the basis of the position information of the finger, hand, and arm of the user at the another point, generate the image indicating the hand of the user at the another point on the basis of the specified position information, and then display the image on the display surface in the principal point. Therefore, the movement of the hand of the user at another point can be naturally reproduced at the principal point. As a result, for example, the user at the principal point can understand the process in which the user at another point operates the information.
The first embodiment has been described above. Next, a second embodiment according to the present disclosure will be described. In the second embodiment, a scene where brainstorming is performed between users located at different points is assumed. As will be described below, according to the second embodiment, when the users operate objects and discuss at different points, the users can clearly understand who is operating which object.
Configuration elements included in the information processing system 10 according to the second embodiment may be the same as those in the first embodiment (illustrated in
In an information generation and reproduction rule DB 130 according to the second embodiment, registered content illustrated in
In a case where an operation to slide a hand in a region on a screen 20 where no object is arranged, for example (hereinafter, the operation may be referred to as a write operation) having been performed is detected, an information generation unit 102 according to the second embodiment further stores a sequence of pointing positions of the hand of the user corresponding to the write operation in transmission information to be transmitted as a point group.
In a case where an operation to move at least one object is specified at another point on the basis of the transmission information received from another point, an output control unit 108 according to the second embodiment moves a display position on the screen 20, of an image corresponding to the at least one object, according to the specified operation. For example, in a case where the hand of the user at another point having been slid while being in contact with at least one object at another point is specified, the output control unit 108 moves the display position of the image corresponding to the at least one object according to the specified slide. Here, the object may be a real object or, for example, a virtual object displayed on the screen 20.
Alternatively, in a case where the write operation having been performed on the display surface at another point is specified on the basis of the transmission information, the output control unit 108 can further display a locus of pointing positions of the hand of the user at another point corresponding to the write operation on a display surface at a principal point.
Here, the function of the output control unit 108 will be described in more detail with reference to
Then, it is assumed that an operation to drag one of the plurality of sticky notes with one finger of the user at another point has been detected at the another point. In this case, the output control unit 108 can first specify the drag operation of the finger on the basis of the transmission information acquired from the another point. Then, as illustrated in
Alternatively, it is assumed that an operation to drag a group of a plurality of sticky notes, of the plurality of sticky notes, with the palm of the hand of the user at another point has been detected at the another point. In this case, the output control unit 108 can first specify the drag operation of the palm of the hand on the basis of the transmission information acquired from the another point. Then, as illustrated in
Alternatively, it is assumed that a drag operation (write operation) by the user at another point in a region where no sticky note exists has been detected on the screen 20 at the another point. In this case, the output control unit 108 can first specify information (point group) written up to the present at the another point on the basis of the transmission information acquired from the another point. Then, as illustrated in
As described below, according to the second embodiment, when the users operate the sticky notes and discuss at different points, the users can clearly understand who is operating which sticky note.
Moreover, according to the second embodiment, the users at different points can naturally communicate, similarly to a case where the users are facing each other, on the basis of the image of the hand of the user at another point displayed on the screen 20 at the principal point and a sound of the user at the another point output at the principal point. For example, even if the user at another point utters a word whose meaning is ambiguous, such as a pronoun (“this”, “it”, or the like), the user at the principal point can easily grasp what the user at the another point is discussing.
The second embodiment has been described above. Next, a third embodiment according to the present disclosure will be described.
Although a user at a principal point can understand a pointing position Pf of a user at another point even with an image 30b illustrated in
As will be described below, according to the third embodiment, an image of the hand with a more natural shape can be displayed on the basis of the transmission information received from another point.
In a case where a relationship among a pointing position of the hand of the user at another point, a hand position at the principal point, of the user at the another point, and an arm position at the principal point, of the user at the another point, which are specified on the basis of the transmission information obtained from the another point, becomes a predetermined relationship, an output control unit 108 according to the third embodiment can correct the arm position and/or the hand position of the user at the another point on the basis of the pointing position of the hand of the user at the another point.
For example, in
Here, the above correction example will be described in more detail with reference to
Moreover, after the above correction, the output control unit 108 generates an image corresponding to the hand of the user at the another point on the basis of the pointing position of the hand of the user at the appropriate another point, the hand position of the user after correction, and the arm position of the user after correction. Then, the output control unit 108 displays the generated image on the screen 20 in the principal point, as illustrated in
As described above, according to the third embodiment, the image of the hand with a more natural shape can be displayed on the basis of the transmission information received from another point. For example, the hand position and the arm position of the user can be corrected such that the shape of the hand (the wrist or the like) of the user becomes more natural without changing the pointing position of the user corresponding to the transmission information received from another point.
The third embodiment has been described above. Next, a fourth embodiment according to the present disclosure will be described. As will be described below, according to the fourth embodiment, a further simplified or anonymized image of a hand can be displayed on the basis of transmission information received from another point.
In an information generation and reproduction rule DB 130 according to the fourth embodiment, registered content illustrated in
An information generation unit 102 according to the fourth embodiment further stores, regarding an individual user at a principal point, attribute information (for example, a type of job, a gender, and the like) of the user in transmission information corresponding to the user.
An output control unit 108 according to the fourth embodiment changes a value of a display parameter of the image of the hand of the user corresponding to the transmission information, the value being generated on the basis of the transmission information obtained from another point, according to the attribute information of the user included in the transmission information. The display parameter may be, for example, a display color, a thickness of a line to be drawn, a line type to be drawn, or the like.
Here, the above function will be described in more detail with reference to
As described above, according to the fourth embodiment, the display color of the image of the hand of the user corresponding to the transmission information acquired from another point is changed according to the type of job of the user included in the transmission information. For this reason, the image of the hand of the user can be more simply or more anonymously displayed. For example, the displayed image can be simplified and anonymized as compared to a case where an image is color-coded for each user or for each point. As a result, the users at different points are expected to be able to discuss more smoothly and to be able to easily understand mutual intentions.
The fourth embodiment has been described above. Next, a fifth embodiment according to the present disclosure will be described. First, the background on which the fifth embodiment has been made will be described with reference to
In the examples illustrated in
Note that, to improve this phenomenon, a method of storing the information of the thickness of the arm of the user in the transmission information is conceivable. However, by this method, the area of a hand image (for example, the area of an arm region) becomes larger. As a result, the user may feel obstructive during communication (such as discussion).
As will be described below, according to the fifth embodiment, an image of a hand with a more natural shape can be displayed on the basis of the transmission information at the time of pointing even in a case where the user points at a position distant from the torso of the user at another point.
In an information conformity rule DB 132 according to the fifth embodiment, a conformity rule illustrated in
In a case where the distance between the hand position of the user and the arm position of the user at another point, which are specified on the basis of the transmission information obtained from the another point, is larger than a predetermined distance, an output control unit 108 according to the fifth embodiment can correct a finger position of the user and the hand position of the user on the basis of the arm position of the user at a principal point.
In the example illustrated in
Here, the above correction example will be described in more detail with reference to
Moreover, after the above correction, the output control unit 108 generates an image corresponding to the hand of the user on the basis of the newly set finger position Pvf, the hand position Pvh of the user after correction, and the arm position Pa of the user, as illustrated in the right diagram in
As described above, according to the fifth embodiment, the image of the hand with a more natural shape can be displayed on the basis of the transmission information at the time of pointing even in a case where the user points at a position distant from the torso of the user at another point. Moreover, since the size of the image of the hand can be suppressed, it is possible to prevent interference with communication.
The fifth embodiment has been described above. Next, a sixth embodiment according to the present disclosure will be described. First, the background on which the sixth embodiment has been made will be described with reference to
Note that a method of displaying the image 30 of the hand only when the z coordinate included in transmission information acquired from another point is a fixed value or larger is conceivable. However, by this method, the image 30 may be suddenly displayed at the pointing position of the user, and thus the user who is viewing the image may feel unnatural.
As will be described below, according to the sixth embodiment, an image of the hand of the user can be displayed such that whether or not the user is pointing at some object is distinguishable, in the middle of the user at another point extending the hand.
In an information generation and reproduction rule DB 130 according to the sixth embodiment, registered content illustrated in
An information generation unit 102 according to the sixth embodiment further stores a sensing result of the height of the hand of the user with reference to a screen 20 at a principal point, which is specified from input information by an input unit 120, in the transmission information corresponding to the user.
An output control unit 108 according to the sixth embodiment changes a value of a display parameter of the image of the hand of the user corresponding to the transmission information, the value being generated on the basis of the transmission information obtained from another point, according to height information of the user included in the transmission information. The display parameter may be, for example, the transparency, a display color, the degree of blurring of an outline of the hand image, or the like.
For example, as illustrated in
Alternatively, the output control unit 108 may change the transparency of the image of the hand of the user on the basis of a first threshold, a second threshold larger than the first threshold, and the value indicated by the height information of the hand. For example, in a case where the value indicated by the height information of the hand is less than the first threshold, the output control unit 108 sets the transparency of the image of the hand of the user to be low (for example, “0%” or the like). Furthermore, in a case where the value indicated by the height information of the hand is the first threshold or larger, and less than the second threshold, the output control unit 108 sets the transparency of the image of the hand of the user to be high (for example, “50V” or the like). Furthermore, in a case where the value indicated by the height information of the hand is the second threshold or larger, the output control unit 108 may hide the image of the hand of the user.
As described above, according to the sixth embodiment, the image of the hand of the user can be displayed such that whether or not the user is pointing at some object is distinguishable, in the middle of the user at another point extending the hand. Therefore, the intention of the user at another point can be more easily understood.
The sixth embodiment has been described above. Next, a seventh embodiment according to the present disclosure will be described. First, the background on which the seventh embodiment has been made will be described with reference to
In the seventh embodiment, a scene in which a meeting is performed connecting a plurality of points is assumed.
As described above, basically, the output control unit 108 according to the first embodiment can display the image of the hand of the user on a screen 20-4 at the principal point while maintaining the positional relationship between the screen 20 at another point and the arm position of the user at the another point. Therefore, in the situations illustrated in
As will be described below, according to the seventh embodiment, the image of the hand of each user can be displayed with high visibility while maintaining a pointing position of each user at each point.
In an information conformity rule DB 132 according to the seventh embodiment, a conformity rule illustrated in
An output control unit 108 according to the seventh embodiment specifies the pointing position of the hand at the principal point of the user corresponding to transmission information, a hand position of the user, and an arm position of the user on the basis of the individual transmission information acquired from each of a plurality of other points. Moreover, in a case where a relationship of the specified arm positions of the individual users becomes a predetermined relationship, the output control unit 108 changes display positions of the images corresponding to the hands of the individual users according to the points where the users are located.
Here, the above function will be described in more detail with reference to
Moreover, the output control unit 108 corrects, regarding each user of which the arm position has been moved, the hand position of the user on the basis of the arm position of the user after movement and the pointing position of the user.
Moreover, as illustrated in
As described above, according to the seventh embodiment, the image of the hand of each user can be displayed with high visibility while maintaining the pointing position of each user at each point. Then, the usability can be improved.
The seventh embodiment has been described above. Next, an eighth embodiment according to the present disclosure will be described. First, the background on which the eighth embodiment was made will be described with reference to
In the eighth embodiment, a scene in which a teacher gives an instruction to a student from a remote place is assumed. For example, as illustrated in
As described above, basically, the output control unit 108 according to the first embodiment can display the image of the hand of the user on the screen 20-4 at the principal point while maintaining the positional relationship between the screen 20 at another point and the arm position of the user. Therefore, in the situation illustrated in
As will be described below, according to the eighth embodiment, the image of the hand of the teacher can be displayed with high visibility while maintaining a pointing position of the teacher at the remote place.
In an information conformity rule DB 132 according to the eighth embodiment, a conformity rule illustrated in
In a case where a relationship between an arm position at the principal point, of the user at another point, and an arm position of the user at the principal point, which are specified on the basis of transmission information obtained from the another point, becomes a predetermined relationship, an output control unit 108 according to the eighth embodiment corrects the arm position of the user at the another point on the basis of the arm position of the user at the principal point and the pointing position of the hand of the user at the another point.
Here, the above function will be described in more detail with reference to
Moreover, the output control unit 108 corrects the hand position of the user of which the arm position has been moved on the basis of the arm position of the user after movement and the pointing position of the user.
Moreover, after the above correction, the output control unit 108 generates an image corresponding to the hand of the user on the basis of the pointing position of the hand of the user (the “teacher” in the example illustrated in
As described above, according to the eighth embodiment, the image of the hand of the teacher can be displayed with high visibility while maintaining the pointing position of the teacher at the remote place. For example, a video as if the teacher is facing and teaching the student while both the student and the teacher are actually looking at the teaching material from the same direction.
The eighth embodiment has been described above. Next, a ninth embodiment according to the present disclosure will be described. In the ninth embodiment, a scene in which a user at at least one point communicates with a user at another point by using an operation device such as a mouse. In such a case, a hand position and an arm position of the user are acquirable at a point where a hand is used as an operation body, whereas at a point where a mouse is used the an operation body, only information of a cursor position of the mouse can be usually obtained.
As will be described below, according to the ninth embodiment, an image in which a hand corresponding to a user at another point naturally moves can be displayed on a screen 20 according to transmission information acquired from the another point even in a case where the user at the another point uses an operation device.
In an information generation and reproduction rule DB 130 according to the ninth embodiment, registered content illustrated in
In a case where the user uses the operation device, an information generation unit 102 according to the ninth embodiment further stores information (for example, an x coordinate, a y coordinate, and the like) of a pointing position of the operation device, which is specified from input information acquired by an input unit 120, in the transmission information corresponding to the user. Note that the input unit 120 may include the operation device. Furthermore, the input unit 120 and the operation device may be configured in a communicative manner by wired or wireless communication.
In a case where information of the pointing position of the operation device is included in the transmission information acquired from another point, the output control unit 108 according to the ninth embodiment determines a virtual hand position at a principal position, of the user at the another point, and a virtual arm position at the principal point, of the user at the another point, on the basis of a positional relationship between a pointing position at the principal point specified from the information of the pointing position and each side of a screen at the principal point. Then, the output control unit 108 generates an image of a hand of the user at the another point on the basis of the pointing position at the principal point, the determined virtual hand position, and the determined virtual arm position.
Here, the above function will be described in more detail with reference to
Moreover, the output control unit 108 displays the generated image of the hand on the screen 20.
Note that, when the virtual hand position and the virtual arm position of the user are simply moved in parallel in accordance with the movement of the cursor position when the pointing position of the operation device is changed (for example, when the mouse cursor is moved), a viewer may feel that the movement is unnatural movement. For example, the viewer notices that a user at a remote location is operating the mouse or the like.
Therefore, when the pointing position of the operation device is changed at another point, the output control unit 108 can change the virtual hand position and the virtual arm position of the user at the another point, following the change of the pointing position, by an amount smaller than a change amount of the pointing position of the operation device. For example, as illustrated in
Thereby, when movement of the cursor position (that is, the pointing position) of the house has been detected at another point, for example, natural movement of the hand according to the change of the pointing position can be presented at the principal point.
As described above, according to the ninth embodiment, the image in which the hand corresponding to the user at another point naturally moves can be displayed on the screen 20 according to the transmission information acquired from the another point even in the case where the user at the another point uses an operation device. As a result, a video as if all the users are performing operation by hand can be presented at each point, which leads to smooth communication.
The ninth embodiment has been described above. Next, a tenth embodiment according to the present disclosure will be described. In general, in a scene where users at different points conduct a meeting for the first time, individual users located at the different points may feel reluctant to disclose all of information of their own to other users.
As will be described below, according to the tenth embodiment, an amount of information (type of information or the like) to be disclosed to a user at another point can be increased or decreased according to familiarity with the user at the another point.
An information generation unit 102 according to the tenth embodiment further stores the gender information of the user, the size of the hand, the thickness of the arm, and the like, which are specified from input information by an input unit 120, in transmission information corresponding to the user.
Note that the gender information of the user may be stored in advance in a storage unit 128 or may be input by the user at the time of using an information processing system 10. Furthermore, the size of the hand and the thickness of the arm can be specified on the basis of, for example, recognition of an image captured by the input unit 120.
An information acquisition unit 106 according to the tenth embodiment can further acquire information indicating the familiarity between an individual user at another point and the user at the principal point. Here, the information indicating the familiarity may be specified according to a predetermined action performed between users at different points, for example. For example, in a case where self-introduction, a simple game aiming at ice breaking, or the like, having been performed among the users is detected, the familiarity between the users having reached a predetermined threshold (“a familiarity level of 2” in the example illustrated in
Note that, when the operation of handshake is detected, an output control unit 108 may further superimpose and display a predetermined effect 44 (such as filling a region with a predetermined color) on only a region where an image 30 of the hand of the user at the another point and a hand 2 of the user at the principal point overlap, as illustrated in
Alternatively, the information indicating the familiarity may be specified on the basis of, for example, registration information or the like of the individual user at another predetermined and the user at the principal point in a predetermined social networking service (SNS).
The output control unit 108 according to the tenth embodiment changes a display mode of the image of the hand of each user at another point according to the familiarity between the each user and the user at the principal point. For example, while the familiarity between the user at another point and the user at the principal point is less than the predetermined threshold, the output control unit 108 displays the image of the hand of the user at the another point without reflecting physical characteristics and gender of the user at the another point. In this case, for example, as illustrated in
Furthermore, after the familiarity between the user at another point and the user at the principal point becomes the predetermined threshold or larger, the output control unit 108 may display the image of the hand of the user at the another point in a display mode in which the physical characteristics (for example, the size of the hand, the thickness of the arm, and the like) and the gender of the user at the another point are further reflected. In this case, for example, as illustrated in
Note that, as a modification, for example, the output control unit 108 at a point of a role performing facilitation may display, regarding each of individual users at all the other points currently in communication, the image of the hand of the user in a display mode in which all of information of the user are reflected in a constant manner.
As described above, according to the tenth embodiment, the amount of information (type of information or the like) to be disclosed to the user at another point can be increased or decreased according to the familiarity with the user at the another point. As a result, the users are expected to more casually use the system.
The tenth embodiment has been described above. Next, an eleventh embodiment of the present disclosure will be described.
In the eleventh embodiment, at each point, an input unit 120 including only one microphone (sound input device) and an output unit 124 including only one speaker (sound output device) is assumed. In this case, if simultaneous utterance of users located at a plurality of points in communication is detected, the output unit 124 can simultaneously output the uttered sounds. As a result, a user at a principal point cannot know which user at which point is currently speaking.
As will be described below, according to the eleventh embodiment, a display mode of an image of a hand of a user at another point can be changed according to a sound collection result of the utterance of the user at the another point.
In an information generation and reproduction rule DB 130 according to the eleventh embodiment, registered content illustrated in
An information generation unit 102 according to the eleventh embodiment further stores the sound collection result of the utterance of the user acquired by the input unit 120 in transmission information corresponding to the user.
An output control unit 108 according to the eleventh embodiment changes the display mode of the image of the hand of the user according to the sound collection result of the utterance of the user included in the transmission information acquired from another point. For example, as illustrated in
Alternatively, as illustrated in
As described above, according to the eleventh embodiment, the display mode of the image of the hand of the user at another point can be changed according to the sound collection result of the utterance of the user at the another point. For this reason, even in a case where the user at another point utters using a pronoun such as “please see this”, for example, the user at the principal point can easily understand which object (item or the like) the user is pointing at.
Furthermore, the display of the appropriate image can be changed reflecting the strength of the sound and the timing of the utterance. Therefore, for example, as illustrated in
The eleventh embodiment has been described above. Next, a twelfth embodiment according to the present disclosure will be described. In the twelfth embodiment, at each point, an input unit 120 including a plurality microphones (sound input devices) is assumed. For example, the plurality of microphones is configured as a microphone array. Furthermore, a plurality of users being mainly located at each point is assumed.
As will be described below, according to the twelfth embodiment, a display mode of an image of a hand of a user at another point can be changed according to spatial distribution of volume sensed at the another point.
In an information generation and reproduction rule DB 130 according to the twelfth embodiment, registered content illustrated in
An information generation unit 102 according to the twelfth embodiment further stores the spatial distribution of the volume (at a principal point) sensed by an input unit 120 in transmission information.
An output control unit 108 according to the twelfth embodiment can further displays, on a screen 20, an image indicating an utterance situation of a user according to information indicating the spatial distribution of the volume sensed at another point included in the transmission information acquired from the another point, and an arm position of the user corresponding to the transmission information specified on the basis of the transmission information.
Here, the above function will be described in more detail with reference to
In this case, for example, the output control unit 108 first specifies a direction (and a position) of a speaker on the basis of the distribution 46 of the volume illustrated in
As a modification, the output control unit 108 may change transparency or a display color of an image of a hand of the speaker or blink the image of the hand of the speaker, in place of (or in addition to) the display of the effect 48.
As described above, according to the twelfth embodiment, the display mode of the image of the hand of the user at another point can be changed according to the spatial distribution of the volume sensed at the another point.
The twelfth embodiment has been described above. Next, a thirteenth embodiment according to the present disclosure will be described. In the thirteenth embodiment, at each point, an input unit 120 including a plurality microphones (sound input devices) (similarly to the twelfth embodiment) is assumed.
As will be described below, according to the thirteenth embodiment, a sound recognition result of an utterance of a user at another point can be displayed on a screen 20 together with an image of a hand of the user.
In an information generation and reproduction rule DB 130 according to the thirteenth embodiment, registered content illustrated in
An information generation unit 102 according to the thirteenth embodiment can recognize a sound collection result of the utterance of the user acquired by the input unit 120. Then, the information generation unit 102 further stores spatial distribution of volume (at a principal point) sensed by the input unit 120, a character string of the sound recognition result of the utterance, and collected sound data in transmission information.
An output control unit 108 according to the thirteenth embodiment displays, on a screen 20, the character string of the sound recognition result of the utterance included in the transmission information acquired from another point in association with the image of the hand of the user corresponding to the transmission information. For example, the output control unit 108 displays, on the screen 20, the character string of the sound recognition result of the utterance in association with the image of the hand of the user at the time of utterance of the user, which is specified on the basis of the acquired transmission information.
As an example, the output control unit 108 first determines whether or not the user corresponding to the transmission information is uttering (or has previously uttered) on the basis of the information indicating the spatial distribution of the sensed volume included in the acquired transmission information. Then, when the user is determined to be uttering, the output control unit 108 displays a character string 50 of the sound recognition result of the utterance included in the transmission information near an arm position of the user at the principal point, as illustrated in
Alternatively, the output control unit 108 may display the character string of the sound recognition result of the utterance in association with an object corresponding to a pointing position of the user. For example, the output control unit 108 may display the character string of the sound recognition result of the utterance near the object pointed at by the user.
Moreover, the output control unit 108 may further display, on the screen 20, one or more keywords (for example, nouns, or the like) obtained from a result of a morphological analysis for the character string of the sound recognition result, as tag information. For example, the output control unit 108 first performs the morphological analysis for the character string of the sound recognition result. Then, as illustrated in
Moreover, the output control unit 108 may further store, in a storage unit 128, the one or more keywords (tag information) obtained from the result of the morphological analysis for the character string of the sound recognition result in association with other various types of information. The other various types of information may include, for example, the sound recognition result, identification information of the user who has uttered, identification information of the object pointed at by the user at the time of utterance, time and data information of the utterance, sound binary data at the time of appropriate communication, or the like.
Thereby, these pieces of information can be used as meeting minutes. Furthermore, in a scene where recorded video data (or recorded sound data) of the meeting is reproduced to look back on the meeting, the tag information can be displayed together with the recorded video data. Therefore, the realistic feeling can be improved. Moreover, since a search can be conducted using the tag information or the other types of information as a key. Therefore, the recorded video data of a desired meeting can be searched and desired reproduction timing of certain recorded video data can be searched.
As described above, according to the thirteenth embodiment, the sound recognition result of the utterance of the user at another point can be displayed together with the image of the hand of the user at the another point.
The thirteenth embodiment has been described above. Next, a fourteenth embodiment according to the present disclosure will be described. In the fourteenth embodiment, at each point, an input unit 120 including a plurality of microphones (sound input devices) and an output unit 124 including a plurality speakers (sound output devices) is assumed.
As will be described below, according to the fourteenth embodiment, a sound image of an utterance of a user collected at another point can be localized near an arm position of the user at a principal point.
In an information generation and reproduction rule DB 130 according to the fourteenth embodiment, registered content illustrated in
An information generation unit 102 according to the fourteenth embodiment further stores information indicating spatial distribution of volume (at a principal point) sensed by the input unit 120 (similarly to the thirteenth embodiment) and collected sound data in transmission information.
An output control unit 108 according to the fourteenth embodiment causes the plurality of microphones included in the output unit 124 to output the sound data included in the transmission information on the basis of the information indicating the spatial distribution of the sensed volume included in the acquired transmission information. For example, as illustrated in
As described above, according to the fourteenth embodiment, the sound image of the utterance of the user collected at another point can be localized near an arm position of the user at the principal point.
The fourteenth embodiment has been described above. Next, a fifteenth embodiment according to the present disclosure will be described. As will be described below, according to the fifteenth embodiment, when a user at any of other points having uttered is detected, an object arranged at the another point can be emphasized and displayed. For example, the object can be focused and displayed.
In the fifteenth embodiment, for each point, an object previously arranged by a user at the point being stored is assumed. For example, as illustrated in
In an information generation and reproduction rule DB 130 according to the fifteenth embodiment, registered content illustrated in
An information generation unit 102 according to the fifteenth embodiment further stores a sound collection result of an utterance of a user acquired by an input unit 120 in transmission information corresponding to the user (similarly to the eleventh embodiment).
When a user corresponding to transmission information having uttered is specified on the basis of the transmission information acquired from each of a plurality of other points currently in communication, for example, an output control unit 108 according to the fifteenth embodiment emphasizes and displays an object (and/or an image of a hand of the user at the another point) arranged in advance at a point corresponding to the transmission information stored in the storage unit 128. For example, when the user at the “point A” having uttered is detected, the output control unit 108 may display an image of the sticky note 32a arranged in advance at the “point A” as it is, and display images of other objects (the sticky note 32b and the sticky note 32c in the example illustrated in
As described above, according to the fifteenth embodiment, when the user at any of other points having uttered is detected, the object arranged at the another point can be emphasized and displayed.
For example, even if insufficient words such as “that mentioned earlier” are uttered in the middle of a discussion, the object arranged at the point of the speaker can be emphasized and displayed. Therefore, the intention of the utterance of the user can be easily understood. For example, users at other points can easily understand what purpose the user at the point has arranged the appropriate object in the past discussion. Thus, for example, the user can discuss while compensating for the context as compared to a case where all objects are uniformly displayed.
The fifteenth embodiment has been described above. Next, a sixteenth embodiment of the present disclosure will be described. As will be described below, according to the sixteenth embodiment, a display mode of an image corresponding to an object can be changed according to the number of times of a user pointing at the object.
A processing unit 100 according to the sixteenth embodiment records, for each object arranged (displayed) on a screen, the number of times of at least one user pointing at the object, and stores the number of times in a storage unit 128, for example.
An output control unit 108 according to the sixteenth embodiment changes a display mode of an image corresponding to an object according to a cumulative number of times of pointing at the object, of each object. For example, the output control unit 108 changes a display mode of an image corresponding to an individual object to make an object with a larger number of pointed times stand out.
Here, the above function will be described in more detail with reference to
Alternatively, as illustrated in the right diagram in
Alternatively, as illustrated in the right diagram in
As described above, according to the sixteenth embodiment, the display mode of the image corresponding to the object can be changed according to the number of times of the user pointing at the object. Thereby, the user can grasp the importance of the individual object by simply looking at the image corresponding to the individual object.
The sixteenth embodiment has been described above. Next, a seventeenth embodiment according to the present disclosure will be described. As will be described below, according to the seventeenth embodiment, when an image corresponding to an object arranged (displayed) on a screen at another point is displayed on a screen at a principal point, a display mode of the image can be appropriately changed according to attribute information of a user at the principal point.
Here, the attribute information of the user includes, for example, age, gender, nationality, use language, or the like, of the user. Note that, the attribute information of an individual user can be registered in advance at the time of starting use of an information processing system 10 according to each embodiment. Alternatively, a processing unit 100 may estimate the age and nationality of the individual user on the basis of a recognition result of utterance content, voice quality, and the like, of the individual user. Alternatively, the processing unit 100 may estimate the age and gender of the individual user on the basis of a recognition result of the sizes of a hand, an arm, and the like, of the individual user.
An output control unit 108 according to the seventeenth embodiment changes, when displaying, on the screen at the principal point, an image corresponding to an object arranged (displayed) on the screen at another point, the display mode of the image according to the attribute information of the user at the principal point.
Here, the above function will be described in more detail with reference to
Alternatively, the output control unit 108 may convert a character string described on the object according to a knowledge level or a recognition level of the user at the principal point, and then display an image after conversion on the screen at the principal point. For example, in a case where the user at the principal point (“point C”) is a child, the output control unit 108 may convert the character string described on the sticky note 32 into a character string for children of the same language, and display an image including the character string after conversion on the screen at the principal point, regarding each of the individual sticky notes 32 displayed on the screen 20 at the “point A”, as illustrated in
Alternatively, in a case where the user at the principal point (“point C”) is an elderly person, the output control unit 108 may display an image with thick or enlarged individual characters in the sticky note 32 on the screen at the principal point, regarding each of the individual sticky notes 32 displayed on the screen 20 at the “point A”, as illustrated in
Alternatively, as illustrated in
As described above, according to the seventeenth embodiment, when the image corresponding to the object arranged (displayed) on the screen at another point is displayed on the screen at the principal point, the display mode of the image can be appropriately changed according to the attribute information of the user at the principal point. Thereby, for example, at the time of a meeting between different points, the above processing is performed in real time, whereby each user can discuss without stress, even in a case where the knowledge level or the recognition level of the user at each point is different or the use language is different.
The seventeenth embodiment has been described above. Next, an eighteenth embodiment of the present disclosure will be described. As will be described below, according to the eighteenth embodiment, recorded data of a discussion between different points can be divided or combined in a time direction or a space direction.
In the eighteenth embodiment, when a discussion is performed between different points, a processing unit 100 periodically records information of movement of fingers of the users in discussion, and information of individual objects arranged (displayed) on a screen 20 to a storage unit 128.
Here, an example of dividing and combining recorded data of a discussion between different points in a time direction according to the eighteenth embodiment will be described with reference to
Thereafter, the user can re-do the discussion by going back in time (for example, playing back a recorded video, or the like). Then, as a result of the re-discussion, the state of the screen 20 at the timing in
Alternatively, a user who has not participated in the discussion can also refer to the content of the discussion later, and can relive the discussion with highly realistic feeling. At that time, the playback speed can also be adjusted. Furthermore, according to the eighteenth embodiment, it is only necessary to store the information of the movement of the finger of each user and the information of the individual objects arranged (or displayed) on the screen 20, and there is no need to store the content of the discussion as a video, for example. Therefore, the amount of data to be recorded can be suppressed.
Next, an example of dividing and combining recorded data of a discussion between different points in a space direction according to the eighteenth embodiment will be described with reference to
Thereafter, the users can be divided for each point or into predetermined groups and have a discussion using the recorded data acquired during the discussion. For example, the users are divided into a “group A” and a “group B”, and have a discussion in each group. As a result, in the “group A”, the state of the screen 20 at the timing in
In this case, the user can obtain composite data as illustrated in
Moreover, regarding the composited recording data, the output control unit 108 may change the display mode of the image corresponding to each object according to the group that has operated each object, the importance for each object, or the like. For example, the output control unit 108 may display the image of an object determined to be important in many groups more emphatically than the images of other objects.
As described above, according to the eighteenth embodiment, the recorded data of a discussion among different points can be divided or combined in the time direction or the space direction.
The eighteenth embodiment has been described above. Next, a nineteenth embodiment of the present disclosure will be described. As will be described below, according to the nineteenth embodiment, in a scene where communication is performed between different points, a user can control information at a principal point to be shared with another point only at desired timing or not to be shared with another point only at desired timing.
A transmission control unit 104 according to the nineteenth embodiment causes a communication unit 126 to transmit only information of an object specified by the user, of one or more objects arranged (displayed) on a screen 20 at a principal point, to an information processing system 10 at another point.
Here, the above function will be described in more detail with reference to
As illustrated in the right diagram in
According to this control example, for example, information of all of objects (for example, a coffee cup 32 illustrated in
Moreover, the transmission control unit 104 can cause the communication unit 126 to transmit display instruction information regarding a screen shared with another point to the another point on the basis of an instruction by the user. The screen shared with the another point can be a screen corresponding to a captured image of the screen 20 at the principal point.
For example, as illustrated in
According to the control example, the user can explicitly instruct the timing to share the video of the screen 20 at the principal point with another point. For example, in the middle of a meeting, the user can perform an instruction not to share only a material that the user does not want the user at another point to see or only discussion that the user does not want the user at another point to hear.
As described above, according to the nineteenth embodiment, in the scene where communication is performed between different points, a user can control information at the principal point to be shared with another point only at desired timing or not to be shared with another point only at desired timing.
The nineteenth embodiment has been described above. By the way, when a user located in a certain place operates an operation body (for example, a hand of the user, or the like), if a captured video of the user at the time of operation can be displayed in another place together with an image corresponding to the operation body, the realistic feeling can be further improved.
Next, a twentieth embodiment of the present disclosure will be described. As will be described below, according to the twentieth embodiment, when the user located in a certain place operates the operation body, the image corresponding to the operation body and a captured video of a space including the user can be simultaneously displayed in a different place.
First, a configuration example of a system according to the twentieth embodiment will be described with reference to
Then, a video that captures the inside of the point B in real time, which is received from the point B, is displayed (projected, for example) by an output unit 824a to be described below on a screen 24 (a wall surface or the like) behind the output unit 124, for example. The video can be a video capturing one or more users at the point B (for example, a front of the users) and a space around the one or more users. Note that the screen 24 is an example of a second display surface according to the present disclosure.
Moreover, as illustrated in
According to the above display example, the user at the point A can simultaneously view images corresponding to the users at the point B and the point C displayed on the screen 20, and the videos capturing the users at the point B and the point C in real time. For this reason, the users at different points can communicate with higher realistic feeling.
The configuration example of the system according to the twentieth embodiment has been described above. Next, a functional configuration according to the twentieth embodiment will be described in detail.
The functional configuration of the information processing system 10 according to the twentieth embodiment can be similar to one or more of the first to nineteenth embodiments described above. For example, the information processing system 10 according to the twentieth embodiment may be able to execute all the functions and processing described with reference to
For example, similarly to the seventh embodiment described with reference to
For example, the information processing system 10 may display the image corresponding to the hand of the user in the point B displayed on the screen 24, on the screen 20 with reference to the side of the screen 24 in
As illustrated in
The processing unit 800 (also referred to as an output control unit) can include one or more processing circuits (for example, a CPU, a GPU, and the like). The processing unit 800 centrally controls the telepresence system 80.
For example, the processing unit 800 causes the output unit 824 to output the captured video and sound of one or more other points received from the other points. As an example, as illustrated in
Furthermore, the processing unit 800 causes the communication unit 826 to sequentially transmit the video captured in real time by the imaging unit 820, and the sound data collected in real time by the sound collection unit 822 to all the other points currently in communication, for example.
The imaging unit 820 causes an imaging element such as a charge coupled device (CCD) or a complementary metal oxide semiconductor (CMOS) to image external light through a lens to capture an image (moving image, or the like). For example, the imaging unit 820 captures an appearance of one or more users located at the principal point (that is, a point where the appropriate telepresence system 80 is installed) and a space around the one or more users.
Moreover, the imaging unit 820 may include, for example, a stereo camera capable of recording information in the depth direction by capturing an image using two lenses.
The sound collection unit 822 includes, for example, a sound input device such as a microphone. The sound collection unit 822 collects a sound (for example, an utterance of the user at the principal point, or the like) emitted at the principal point.
The output unit 824 may include, for example, a display device such as a projector, an LCD, or an OLED. Moreover, the output unit 824 may include a sound output device such as, for example, a speaker. The output unit 824 displays (for example, projects) a video or outputs sound according to the control of the processing unit 800.
The communication unit 826 transmits and receives various types of information to and from the telepresence system 80 at another point. For example, the communication unit 826 transmits the video captured by the imaging unit 820, the sound data collected by the sound collection unit 822 and the like to the telepresence system 80 at another point according to the control of the processing unit 800. Furthermore, the communication unit 826 receives a video captured at another point and the like from the telepresence system 80 at the another point.
Note that the functional configurations according to the twentieth embodiment are not limited to the above-described example. For example, the information processing system 10 and the telepresence system 80 may be integrally configured instead of being separately provided. As an example, all the functions of the processing unit 800 may be included in the processing unit 100, instead of providing the processing unit 800. Similarly, the input unit 120, the imaging unit 820, and the sound collection unit 822 may be integrally configured. Furthermore, the output unit 124 and the output unit 824 may be integrally configured. Furthermore, the communication unit 126 and the communication unit 826 may be integrally configured.
As described above, according to the twentieth embodiment, when the user located in a certain place operates the operation body, the image corresponding to the operation body and the captured video of a space including the user can be simultaneously displayed in a different place. For this reason, the users at different points can communicate with higher realistic feeling.
The twentieth embodiment has been described above. Next, a hardware configuration of the information processing system 10 common to the embodiments will be described with reference to
The CPU 150 functions as an arithmetic processing device and a control device, and controls the overall operation in the information processing system 10 according to various programs. Furthermore, the CPU 150 can also realize the functions of the processing unit 100 and the display processing unit 122 in the information processing system 10. The CPU 150 can be configured by a processor such as a microprocessor.
The ROM 152 stores a program and control data such as calculation parameters used by the CPU 150, and the like.
The RAM 154 temporarily stores the program to be executed by the CPU 150, or the like, for example.
The bus 156 is configured by a CPU bus and the like. The bus 156 connects the CPU 150, the ROM 152, and the RAM 154 to one another.
The interface 158 connects the input device 160, the output device 162, the storage device 164, and the communication device 166 to the bus 156.
The input device 160 includes input means (for example, a touch panel, a button, a switch, a dial, a lever, a microphone, or the like) for the user to input information. Moreover, the input device 160 includes an input control circuit or the like, that generates an input signal on the basis of an input by the user and outputs the input signal to the CPU 150. The input device 160 can realize the function of the input unit 120 in the information processing system 10.
The output device 162 includes, for example, a display device such as a projector, a liquid crystal display device, an organic light emitting diode (OLED) device, or a lamp. Furthermore, the output device 162 includes a sound output device such as a speaker. The output device 162 can realize the function of the output unit 124 in the information processing system 10.
The storage device 164 is a device for storing data. The storage device 164 includes a storage medium, a recording device that records data in the storage medium, a reading device that reads data from the storage medium, a deletion device that deletes data recorded in the storage medium, or the like. The storage device 164 can realize the function of the storage unit 128 in the information processing system 10.
The communication device 166 is a communication interface configured by a communication device for being connected to the communication network 22. Furthermore, the communication device 166 may be a wireless LAN compatible communication device, a long term evolution (LTE) compatible communication device, or a wire communication device performing communication by wire. The communication device 166 can realize the function of the communication unit 126 in the information processing system 10.
The favorable embodiments of the present disclosure have been described in detail with reference to the accompanying drawings, but the present disclosure is not limited to such examples. It is obvious that persons having ordinary knowledge in the technical field to which the present disclosure belongs can conceive various modifications or alterations within the scope of the technical idea described in the claims, and the modifications and alterations are naturally understood to belong to the technical scope of the present disclosure.
The configuration of the information processing system 10 according to each embodiment is not limited to the example described above. For example, although
Furthermore, although
An apparatus (information processing apparatus) including the processing unit 100 according to each embodiment may include one or more of the input unit 120, the display processing unit 122, or the output unit 124. For example, the information processing apparatus may be a projector including the processing unit 100, the display processing unit 122, and the output unit 124.
Alternatively, the information processing apparatus may be integrally configured with the table 90. Alternatively, the information processing apparatus may be connected to at least one of the input unit 120, the display processing unit 122, and the output unit 124 via the communication network 22. For example, the information processing apparatus may be a server, a general-purpose personal computer (PC), a tablet terminal, a game machine, a mobile phone such as a smartphone, a portable music player, a television receiver, for example, a head mounted display (HMD), augmented reality (AR) glasses, a wearable device such as a smart watch, a robot, or the like.
The steps in the flow of the processing described above may not necessarily be processed in the described order. For example, the steps may be processed in an appropriately changed order. Furthermore, the steps may be partially processed in parallel or individually instead of being chronologically processed. Furthermore, some of the described steps may be omitted or another step may be added.
Furthermore, according to each embodiment described above, a computer program for causing the hardware such as the CPU 150, the ROM 152, and the RAM 154 to exhibit functions equivalent to the functions of the configurations of the information processing system 10 (particularly the processing unit 100) according to each embodiment described above. Furthermore, a storage medium that has stored the computer program can be provided.
Furthermore, the effects described in the present specification are merely illustrative or exemplary and are not restrictive. That is, the technology according to the present disclosure can exhibit other effects obvious to those skilled in the art from the description of the present specification together with or in place of the above-described effects.
Note that following configurations also belong to the technical scope of the present disclosure.
(1)
An information processing system including:
an information acquisition unit configured to acquire first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and
an output control unit configured to display, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
(2)
The information processing system according to (1), in which
a second virtual position of the first user between the pointing position of the operation body in the second place and the first virtual position of the first user in the second place is further specified on the basis of the first transmission information, and
the image corresponding to the operation body is generated further on the basis of the second virtual position of the first user.
(3)
The information processing system according to (2), in which
the information acquisition unit further acquires a sensing result of at least one object located on a display surface in the first place, and
the output control unit further displays an image corresponding the at least one object on the display surface in the second place.
(4)
The information processing system according to (3), in which
the operation body related to the first user is a hand of the first user, and
in a case where an operation to move the at least one object is specified on the basis of the first transmission information, the output control unit moves a display position of the image corresponding to the at least one object according to the specified operation.
(5)
The information processing system according to (2) or (3), in which
the operation body related to the first user is a hand of the first user,
the pointing position of the operation body is a pointing position of the hand of the first user, and
in a case where the hand of the first user having been slid on the display surface in the first place is specified on the basis of the first transmission information, the output control unit further displays an image indicating a locus of the pointing position of the hand of the first user being slid on the display surface in the second place.
(6)
The information processing system according to any one of (2) to (5), in which
the operation body related to the first user is a hand of the first user,
the pointing position of the operation body is a pointing position of the hand of the first user,
in a case where a relationship among the pointing position of the hand of the first user, the second virtual position of the first user, and the first virtual position of the first user becomes a predetermined relationship, the first virtual position of the first user and/or the second virtual position of the first user are corrected on the basis of the pointing position of the hand of the first user, and
the output control unit displays an image corresponding to the operation body generated on the basis of the pointing position of the hand of the first user, the second virtual position of the first user after the correction, and the first virtual position of the first user after the correction on the display surface in the second place.
(7)
The information processing system according to any one of (2) to (6), in which
the operation body related to the first user is a hand of the first user,
the pointing position of the operation body is a pointing position of the hand of the first user,
in a case where a distance between the second virtual position of the first user and the first virtual position of the first user becomes larger than a predetermined distance, the pointing position of the hand of the first user and the second virtual position of the first user are each corrected on the basis of the first virtual position of the first user, and
the output control unit displays an image corresponding to the operation body generated on the basis of the pointing position of the hand of the first user after the correction, the second virtual position of the first user after the correction, and the first virtual position of the first user on the display surface in the second place, and
further displays an image indicating the pointing position of the hand of the first user before correction in association with the image corresponding to the operation body on the display surface in the second place.
(8)
The information processing system according to any one of (2) to (7), in which
the first transmission information further includes attribute information of the first user, and
the output control unit changes a value of a display parameter of the image corresponding to the operation body according to the attribute information of the first user.
(9)
The information processing system according to any one of (2) to (8), in which
the information acquisition unit further acquires information indicating familiarity between the first user and the second user, and
the output control unit changes a display mode of the image corresponding to the operation body according to the familiarity between the first user and the second user.
(10)
The information processing system according to any one of (2) to (9), in which
the operation body related to the first user is a hand of the first user,
the first transmission information further includes a sensing result of a height of the hand of the first user with reference to a display surface in the first place, and
the output control unit changes a value of a display parameter of the image corresponding to the operation body according to the sensing result of the height of the hand of the first user.
(11)
The information processing system according to any one of (1) to (10), in which
the operation body related to the first user is a hand of the first user,
the pointing position of the operation body is a pointing position of the hand of the first user,
the information acquisition unit further acquires, regarding each of a plurality of users located in a plurality of places including the first place and other than the second place, transmission information corresponding to the user according to an operation of a hand of the user, the transmission information being for communication with the second user,
regarding the each of a plurality of users, a pointing position of the hand of the user in the second place and a first virtual position of the user in the second place are specified on the basis of the transmission information corresponding to the user, and
regarding the each of a plurality of users, the output control unit displays an image corresponding to the hand of the user generated on the basis of the pointing position of the hand of the user and the first virtual position of the user in the second place, on a display surface in the second place, and
in a case where a relationship among first virtual positions of the plurality of users in the second place becomes a predetermined relationship, the output control unit changes, regarding the each of a plurality of users, the display position of the image corresponding to the hand of the user according to a place where the user is located.
(12)
The information processing system according to any one of (2) to (11), in which
the operation body related to the first user is a hand of the first user,
the pointing position of the operation body is a pointing position of the hand of the first user,
in a case where a relationship between the first virtual position of the first user and a position of the second user in the second place becomes a predetermined relationship, the first virtual position of the first user is corrected on the basis of the position of the second user and the pointing position of the hand of the first user, and
the output control unit displays an image corresponding to the operation body generated on the basis of the pointing position of the hand of the first user and the first virtual position of the first user after the correction on the display surface in the second place.
(13)
The information processing system according to any one of (2) to (12), in which
the operation body related to the first user is an operation device used by the first user,
the first virtual position of the first user in the second place is determined on the basis of a pointing position of the operation body in the second place,
in a case where the pointing position of the operation body is changed, the first virtual position of the first user is changed following the change by an amount smaller than a change amount of the pointing position of the operation body, and
the output control unit displays an image corresponding to the operation body generated on the basis of the pointing position of the operation body after the change and the first virtual position of the first user after the change, on the display surface in the second place.
(14)
The information processing system according to any one of (2) to (13), in which
the operation body related to the first user is a hand of the first user,
the image corresponding to the operation body is an image indicating the hand of the first user,
the first transmission information includes a sound collection result of an utterance of the first user, and
the output control unit changes a display mode of the image indicating the hand of the first user according to the sound collection result of an utterance of the first user.
(15)
The information processing system according to any one of (2) to (14), in which
the operation body related to the first user is a hand of the first user,
the image corresponding to the operation body is an image indicating the hand of the first user,
the first transmission information further includes information indicating spatial distribution of a volume sensed in the first place, and
the output control unit further displays an image indicating an utterance situation of the first user according to the spatial distribution of a volume sensed in the first place and the first virtual position of the first user, on the display surface in the second place.
(16)
The information processing system according to any one of (2) to (15), in which
the first transmission information includes a sound collection result of an utterance of the first user, and
the output control unit further displays a sound recognition result for the sound collection result of an utterance of the first user in association with the image corresponding to the operation body or the pointing position of the operation body in the second place on the display surface in the second place.
(17)
The information processing system according to any one of (2) to (16), in which
the first transmission information further includes a sound collection result of an utterance of the first user and information indicating spatial distribution of a volume sensed in the first place,
a plurality of sound output control units is arranged in the second place, and
the output control unit further causes the plurality of sound output units to output the sound collection result of an utterance of the first user according to the spatial distribution of a volume sensed in the first place.
(18)
The information processing system according to any one of (2) to (17), in which
the first transmission information includes a sound collection result of an utterance of the first user, and
when the first user having uttered is detected, the output control unit displays an image corresponding to a first object arranged in the first place and an image corresponding to a second object arranged in a place other than the first place on the display surface in the second place such that the image corresponding to a first object has higher visibility than the image corresponding to a second object.
(19)
An information processing method including:
acquiring first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and
displaying, by a processor, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
(20)
A program for causing a computer to function as:
an information acquisition unit configured to acquire first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place; and
an output control unit configured to display, on a display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information.
(21)
An information processing system including:
an information acquisition unit configured to acquire first transmission information according to an operation of an operation body related to a first user located in a first place, the first transmission information being for communication with a second user located in a second place, and a imaged video of a space including the first user imaged in a period including an operation time of the operation body, and
an output control unit configured to display, on a first display surface in the second place, an image corresponding to the operation body generated on the basis of a pointing position of the operation body in the second place and a first virtual position of the first user in the second place, the pointing position and the first virtual position being specified on the basis of the first transmission information, and
displays the imaged video on a second surface in the second place.
Number | Date | Country | Kind |
---|---|---|---|
2017-115265 | Jun 2017 | JP | national |
This application is a continuation of U.S. patent application Ser. No. 16/616,683 (filed on Nov. 25, 2019), which is a National Stage Patent Application of PCT International Patent Application No. PCT/JP2018/016509 (filed on Apr. 24, 2018) under 35 U.S.C. § 371, which claims priority to Japanese Patent Application No. 2017-115265 (filed on Jun. 12, 2017), which are all hereby incorporated by reference in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
20160055675 | Kasahara et al. | Feb 2016 | A1 |
20170228981 | Van Asdale | Aug 2017 | A1 |
Number | Date | Country |
---|---|---|
102301317 | Dec 2011 | CN |
102460373 | May 2012 | CN |
102693047 | Sep 2012 | CN |
1883238 | Jan 2008 | EP |
H11-224152 | Aug 1999 | JP |
2000-231644 | Aug 2000 | JP |
2010-170354 | Aug 2010 | JP |
2011-145794 | Jul 2011 | JP |
2015-228054 | Dec 2015 | JP |
2015-228256 | Dec 2015 | JP |
2017-034658 | Feb 2017 | JP |
WO 2012081194 | Jun 2012 | WO |
WO 2016002445 | Jan 2016 | WO |
WO 2017033544 | Mar 2017 | WO |
Entry |
---|
Mar. 29, 2023, Chinese Office Action issued for related CN Application No. 201880037496.6. |
Number | Date | Country | |
---|---|---|---|
20210318749 A1 | Oct 2021 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16616683 | US | |
Child | 17355561 | US |