The present disclosure relates generally to providing information for a conference at one or more locations. More specifically, the present disclosure relates to systems and methods for providing information for a conference to mobile devices by detecting one or more starting requirements of the conference in the mobile devices.
In personal and business communications, meetings or conferences are often necessary. Specifically, a teleconference is widely used because of distance and inconvenience in traveling to a remote location where a meeting is held. For example, in the work setting, conferences involving two or more geographically distinct locations are often necessary for discussions and sharing opinions between people at geographically remote locations on a real-time basis.
Unfortunately, as conferences often require attendance of many unfamiliar people, conventional conferences are often inconvenient or confusing to the attendees because of lack of sufficient information on the attendees, such as the names, current speakers, arrangement of the attendees, etc. For example, when a person attends a business meeting with unfamiliar people, it may be difficult to identify or remember the names of other attendees during the meeting. In a teleconference setting involving two or more geographically remote locations, in particular, attendees may find it confusing and inconvenient to participate in the conference or remember the details of the conference without sufficient visual information. That is, in a teleconference scenario, since the attendees at one location cannot see the other remote attendees at other locations, they may not be able to identify or remember the other attendees at the other locations, or recognize a current speaker among the other attendees at a particular time. In addition, the attendees may not have access to information on activities of the other attendees at the other locations, e.g., the sitting arrangement of the other attendees, whether a particular attendee remains attending the conference or has quit the conference, or the like.
To address the above problems, visual sensors such as cameras and display devices such as televisions may be installed in each of the locations so that the images of the attendees at one location can be transmitted and displayed to the other attendees at the other location, and vice versa. However, such a solution generally requires additional hardware and costs. Further, the cameras and display devices may not be a complete solution to the above-described problems, especially when the attendees are not provided in advance with identification or profile information on other remote attendees. Furthermore, such an arrangement generally requires costly equipments, and often requires lengthy and complicated initial set up, which may not be convenient to ordinary users.
The present disclosure provides systems and methods for sharing a variety of information between attendees of a conference at one or more locations based on similarity of their surrounding sounds. Further, the systems and methods of the present disclosure provide information for a conference to one or more mobile devices by automatically generating the information upon detecting one or more starting requirements of the conference in each of the mobile devices.
According to one aspect of the present disclosure, a method for providing conference information in a mobile device is disclosed. The method includes monitoring, in a mobile device, one or more starting requirements of a conference at one or more locations. Input sound information is transmitted from the mobile device to a server when the one or more starting requirements of the conference is detected. Conference information is received from the server and the conference information is displayed on the mobile device. This disclosure also describes an apparatus, a combination of means, and a computer-readable medium relating to this method.
According to another aspect of the present disclosure, a mobile device for providing conference information is provided. The mobile device includes an initiating unit, a transmitting unit, a receiving unit, and a display unit. The initiating unit is adapted to monitor one or more starting requirements of a conference at one or more locations. The transmitting unit is configured to transmit input sound information to a server when the one or more starting requirements of the conference is detected. Further, the receiving unit is configured to receive conference information from the server, and the display unit is adapted to display the conference information.
According to yet another aspect of the present disclosure, a method for providing conference information in a system having a server and a plurality of mobile devices is disclosed. In this method, one or more mobile devices monitor one or more starting requirements of a conference at one or more locations, and transmit input sound information to the server when the one or more starting requirements of the conference is detected. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information is displayed on each mobile device. This disclosure also describes an apparatus, a combination of means, and a computer-readable medium relating to this method.
Various embodiments are now described with reference to the drawings, wherein like reference numerals are used to refer to like elements throughout. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of one or more embodiments. It may be evident, however, that such embodiment(s) may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate describing one or more embodiments.
In the illustrated embodiment, the mobile devices 160, 162, 164, 166, and 168 are presented only by way of examples, and thus the number of mobile device(s) located in each location or the number of location(s) may be changed depending on individual conference settings. The mobile devices may be any suitable device such as a cellular phone, smartphone, laptop computer, or tablet personal computer equipped with a sound capturing capability, e.g., a microphone, and communication capability through a data and/or communication network.
The system 100 is configured to generate a variety of information associated with a conference based on input sounds received by the mobile devices 160, 162, 164, 166, and 168 and to provide the information to attendees of the conference, e.g., at least one of the mobile device users. In one conference scenario, only the users of the mobile devices 160 and 162 both located at the location 110 attend a conference without involving other users at remote locations such as the locations 120 and 130. In another conference scenario, the users of the mobile devices 160 and 162 located in the location 110 attend a teleconference with the users of the mobile devices 164 and 166 located in a remote location such as the location 120. In such a scenario, the users of the mobile devices 160, 162, 164, and 166 attend the teleconference using a teleconference system (not shown) which are implemented with conventional teleconference phones and a teleconference equipment capable of exchanging sound between the teleconference phones at the remote locations. The teleconference phones and equipment may be operated separately from the mobile devices 160, 162, 164, 166, and 168, the network 140, and the server 150 of the system 100. Further, in yet another conference scenario, the users of the mobile devices 160 and 162 may start a local conference for internal or preliminary discussion between them at the location 110, prior to joining a teleconference with the users of the mobile devices 164 and 166 at the remote location 120. Meanwhile, the user of the mobile device 168 located in the location 130 geographically separate and distant from the locations 110 and 120, e.g., a street, is not involved in any conferences between the users of the mobile devices 160, 162, 164, and 166.
Although the two locations 110 and 120 are geographically remote from each other, if the users at the two locations are in communication with each other through the teleconference system, surrounding ambient sounds and voices generated in each location and respectively inputted to the mobile devices 160, 162, 164, and 166 may be similar to each other. Specifically, a sound generated within the location 110 is transmitted into the location 120 through the teleconference phones (not shown). Similarly, another sound generated within the location 120 is transmitted into the location 110 through the teleconference phones. As a result, in the location 110, the sound generated therein and the sound transmitted from the location 120 are inputted into the mobile devices 160 and 162. Similarly, in the location 120, the sound generated therein and the sound transmitted from the location 110 are inputted into the mobile devices 164 and 166. As a result, the input sounds of the mobile device 160, 162, 164, and 166 may be similar to each other.
Meanwhile, the user of the mobile device 168 located in the location 130 is not involved in any teleconferences. Thus, the mobile device 168 does not receive any voices input to the mobile devices 160, 162, 164, and 166 or ambient sounds emanating from the location 110 or 120 during the teleconference. Accordingly, the input sound of the mobile device 168 may not be similar to those of the mobile devices 160, 162, 164, and 166.
In one embodiment, each of the mobile devices 160, 162, 164, 166, and 168 transmits its input sound information to the server 150 through the network 140. The input sound information may include, but not limited to, any suitable representation of the input sound of each mobile device, sound signature extracted from the input sound, sound level, voice activity information, etc. Based on the input sound information from the mobile devices, the server 150 generates and provides conference information to the mobile devices 160, 162, 164, and 166, and if necessary, to the mobile device 168. The conference information includes information on attendees of the conference at one or more locations such as identification and location of the attendees, an arrangement of the attendees, and/or a meeting log of the conference including attendee participation information, which will be described in detail below.
As one exemplary setting where the server 150 is operated to generate the conference information above, it is assumed that the mobile devices 160, 162, 164, 166, and 168 are carried by their respective users or located near the users. It is also assumed that a mobile device is placed closer to its user than the users of other mobile devices. For example, the mobile device 160 is placed closer to its user than the user of the mobile device 162 in the first conference room. Similarly, the mobile device 164 is placed closer to its user than the user of the mobile device 166 in the second conference room.
The initiating unit 210 monitors one or more starting requirements of a particular conference and determines whether the one or more starting requirements is detected. The sound sensor 220 (e.g., microphone) is configured to receive and sense the sound around the mobile device 160. The sound signature extraction unit 230 extracts a sound signature, i.e., a unique or distinguishable characteristic, from the sound. The clock unit 270 monitors the current time of the mobile device 160, and the positioning unit 280 estimates the current location of the mobile device 160 using, e.g., Global Positioning System (GPS). The transmitting unit 240 transmits information, e.g., input sound information, to the server 150 through the network 140, and the receiving unit 250 receives conference information from the server 150 through the network 140. The display unit 290 displays various information, e.g., the conference information received from the server 150. The storage unit 260 stores various information needed to process the input sound, input sound information, location, time, conference information, etc.
The sound sensor 220 may include, e.g., one or more microphones or any other type of sound sensors used to capture, measure, record, and/or convey any aspect of the captured input sound of the mobile device 160. Some embodiments may take advantage of sensors already used in the daily operation of the mobile device 160 such as microphones used to convey a user's voice during a telephone call. That is, the sound sensor 220 may be practiced without requiring any modification of the mobile device 160. Also, the sound sensor 220 may employ additional software and/or hardware to perform its functions in the mobile device 160.
Further, the sound signature extraction unit 230 may use any suitable signal processing scheme, including speech compression, enhancement, recognition, and synthesis methods to extract the sound signature of input sound. For example, such signal processing scheme may employ MFCC (Mel-frequency cepstral coefficients), LPC (linear predictive coding), and/or LSP (line spectral pair) techniques, which are well-known methods for speech recognition or speech codec.
In one embodiment, a sound signature may include multiple components, which are represented as a vector having n-dimensional values. Under the MFCC method, for example, a sound signature may include 13 dimensions with each dimension represented as a 16 bit value. In this case, the sound signature is 26 bytes long. In another embodiment, the sound signature may be binarized so that each dimension is represented as a 1 bit binary value. In such a case, the binarized sound signature may be 13 bits long.
A sound signature may be extracted from an input sound under the MFCC method as follows. A frame of an input sound in the time domain (e.g., raw sound signal) is multiplied by a windowing function, e.g., hamming window. Thereafter, the sound signal is Fourier transformed to the frequency domain, and then a power is calculated for each band in the spectrum of the transformed signal in the frequency domain. A logarithm operation and a discrete cosine transform (DCT) operation are performed on each calculated power to obtain DCT coefficients. A mean value over a period of a predetermined time in the past is subtracted from each DCT coefficient for binarization and a set of the binarization results constitutes the sound signature.
The receiving unit 340 is configured to receive information, e.g., input sound information, from each of the mobile devices 160, 162, 164, 166, and 168. The similarity determining unit 310 determines degrees of similarity between input sound information from the mobile devices 160, 162, 164, 166, and 168. The attendee determining unit 320 determines attendees at the conference based on the degrees of similarity. The log generating unit 360 generates a meeting log of the conference including attendee participation information. Further, the attendee arrangement calculating unit 370 calculates the arrangement of the attendees at each location of the conference. The speaker determining unit 380 determines a current speaker among the attendees at a particular time. The transmitting unit 330 is configured to transmit conference information including the above information to each of the mobile devices 160, 162, 164, and 166, and if necessary, to the mobile device 168. The information database 350 may be configured to store various information including the above information and any other information needed for processing the above information.
The transmitting unit 240 in the mobile device 160 transmits input sound information associated with the input sound through the network 140 to the server 150, at 420. A transmitting unit in each of the other mobile devices 162, 164, 166, and 168 also transmits input sound information associated with input sound captured by the respective sound sensors through the network 140 to the server 150.
The transmitting unit 240 may also transmit information relating to the user and the mobile device 160 including, but not limited to, identification information, time information, and location information. For example, the identification information may include a product number, serial number, ID of the mobile device 160, user name, user profile, etc. The time information may include a current time or a time when the input sound is captured, which may be monitored by the clock unit 270. The location information may include a geographic location of the mobile device 160, which may be estimated by the positioning unit 280, when the input sound is captured. Some of the above information may be pre-stored in the storage unit 260 of the mobile device 160.
The receiving unit 250 in the mobile device 160 receives conference information from the server 150, at 430. The display unit 290 displays the conference information according to a desired display format, at 440.
The server 150 generates conference information for a conference involving at least some of the mobile devices 160, 162, 164, 166, and 168 based on the received information, at 520. For example, at least one of the similarity determining unit 310, the attendee determining unit 320, the information database 350, the log generating unit 360, the attendee arrangement calculating unit 370, the speaker determining unit 380 may be used in generating the conference information.
When conference information is generated, the server 150 transmits, via the transmitting unit 330, the conference information to each of the mobile devices 160, 162, 164, and 166, and if necessary, to the mobile device 168, at 530. If a subset of the mobile devices is in the conference, the server 150 may transmit the conference information to those mobile devices. For example, the server 150 may not send the conference information to the mobile device 168, whose user is not participating in the conference.
The detailed operations of the server 150 and the mobile devices 160, 162, 164, 166, and 168 according to embodiments of the present disclosure will be described below with reference to
In one embodiment of the present disclosure, a degree of similarity between the input sounds of two mobile devices, e.g., an m-th mobile device and an n-th mobile device, may be determined based on a Euclidean distance between vectors respectively representing the sound signatures of the input sounds of the two mobile devices, e.g., according to the following equation:
where a[i] indicates an i-th dimension value of a vector a representing the sound signature of the m-th mobile device, and b[i] indicates an i-th dimension value of a vector b representing the sound signature of the n-th mobile device.
The degree of similarity between the input sounds of the two mobile devices may be determined based on a Euclidean distance between a pair of sound signature sequences that are extracted over a period of time at predetermined time intervals. If a sequence of sound signatures is extracted at time intervals of 10 ms over a period of 1 sec in each of the m-th and n-th mobile devices, the server 150 will receive one hundred pairs of sound signatures from the mobile devices. In this case, a Euclidean distance for each pair of sound signatures from the m-th and n-th mobile devices is calculated and the degree of similarity is determined based on a mean value of the Euclidean distances. For example, the degree of similarity may be a reciprocal of the mean value or a log-scaled value of the reciprocal.
Based on the degrees of similarity, the attendee determining unit 320 in the server 150 determines a subset of mobile devices whose users are attending the same conference among all of the plurality of mobile devices, which transmitted the input sound information to the server 150, at 630. For example, a mobile device of a user attending a particular conference can be considered to have a greater degree of similarity with another mobile device in the same conference than with another mobile device not in the same conference. Once the mobile devices which are in the conference have been determined, the attendee determining unit 320 identifies the users of the determined mobile devices based on the information relating to the mobile devices and the associated users, and determines them to be the attendees at the conference.
The server 150 generates conference information including information on the attendees, which may include at least one of identification information, location information of each attendee, etc. Then, the transmitting unit 330 of the server 150 transmits the conference information to the subset of mobile devices which have been determined to be in the conference, at 640.
In some embodiments, mobile devices having degrees of similarity greater than a predetermined similarity threshold may be determined to belong to the conference group, while other mobile devices having degrees of similarity less than or equal to the similarity threshold may be determined not to belong to the conference group. The predetermined similarity threshold may be configured according to the needs of the system 100 and pre-stored in the information database 350 of the server 150.
The following is a more detailed procedure of determining the degrees of similarity and determining the attendees at a conference based on the degrees of similarity according to one embodiment.
Referring back to
In a first conference scenario in
In a second conference scenario, it is assumed that the users of the mobile devices 160 and 162 at the location 110 and the users of the mobile devices 164 and 166 located at the location 120 attend a same conference from their respective locations. The user of the mobile device 168 remains in location 130 and does not attend the conference. Such a conference may be a main conference after the preliminary one such as the first scenario above, and may be a telephone conference, video conference, etc.
As describe above, the degrees of similarity of the input sound information for the mobile device 160 with respect to that of each of the other mobile devices 162, 164, 166, and 168 are determined. Since the mobile devices 160, 162, 164, and 166 are in the same conference with similar input sounds, the degree of similarity of the input sound information between each pair of the mobile devices 160, 162, 164, and 166, which are in the conference, will be greater than the degree of similarity of the input sound information between the mobile device 168 and each of the mobile devices 160, 162, 164, and 166. In the case where a similarity threshold is used, the degree of similarity of the input sound information between each pair of the mobile devices 160, 162, 164, and 166 may be greater than the similarity threshold, while the other degrees of similarity may not be greater than the similarity threshold. As a result, the attendee determining unit 320 determines that the users of the mobile devices 160, 162, 164, and 166 attend the same conference. In this case, the users of the mobile devices 160, 162, 164, and 166 may be displayed on the display unit of each of the mobile devices with the locations and names of the attendees, as shown in
According to one embodiment of the present disclosure, the operation of transmitting the input sound information by the mobile device may be automatically initiated if one or more starting requirements of a conference is detected. In general, one or more starting requirements for a conference may be deter mined prior to the conference, such as an attendee list, a starting time for the conference, a conference location (e.g., a plurality of conference rooms when the conference is a teleconference), and the like. Each user of a mobile device may input and store the conference starting requirements. Additionally or alternatively, a conference scheduling application according to the present disclosure may obtain conference starting requirement information from another application, e.g., a calendar application, a schedule management application such as MS Outlook™ program, or the like, running on the mobile device or an external device such as a personal computer.
The starting requirement may specify a condition to initiate transmission of input sound information. For example, the starting requirement may be a starting time, one or more conference locations, acoustic characteristics of a conference environment, or the like. The starting requirement may be stored in each mobile device by the user to be automatically operational when the mobile device detects one or more starting requirements. For example, the starting requirement may be met when the current time of the mobile device 160, which may be monitored by the clock unit 270, reaches the starting time of a conference. Similarly, the starting requirement may be met when the current location of the mobile device 160, which may be estimated by the positioning unit 280, is determined to be a location for a conference, e.g., a conference room. In some embodiments, the location requirement may be satisfied when the current location of the mobile device 160 is determined to be within a predetermined range, e.g., twenty meters, from a specified conference location.
Further, a sound representative of a conference environment may also be used as a starting requirement. According to one embodiment, a conference environment is distinguished based on acoustic characteristics. For example, the conference environment can be characterized by voices of conference attendees that can be included in the sound inputted to mobile devices present in the conference. The maximum number of conference attendees, i.e., mobile device users, whose voices are input to the mobile devices may be set to a predetermined threshold. Also, the level of allowable background sound, which may refer to noise, included in the input sound may be set to a predetermined sound level threshold. If either the maximum number of conference attendees exceeds the predetermined threshold or the level of background sound exceeds the sound level threshold, the starting requirement will not be detected. Further, the allowable reverberation time of the input sound may be set to a predetermined time period (e.g., 200 to 500 ms), which falls into a range of reverberation time measurable in a conference room of a suitable size.
According to another embodiment, an acoustic model of a conference environment may be used as a starting requirement. In this case, a variety of conference environments are trained through a modeling methodology such as GMM (Gaussian Mixture Model) method or HMM (Hidden Markov Model) method to obtain the acoustic model representative of the conference environment. Using such an acoustic model, the starting requirement is detected when the input sound of the mobile device corresponds to the acoustic model. For example, the starting requirement may be detected when a degree of similarity between the input sound and the acoustic model is greater than a predetermined similarity threshold.
Although
In another embodiment of the present disclosure, the server 150 determines a current speaker among attendees at a conference at a particular time based on sound levels or voice activity information of the input sounds from the mobile devices of the attendees.
According to one embodiment, the input sound information associated with an input sound captured at each mobile device includes a sound level of the input sound. The sound level indicates the energy or loudness of the sound and may be represented by amplitude, intensity, or the like, for example, measured in decibels. Each mobile device transmits the input sound information including the sound level to the server 150.
With reference to
The current speaker may be determined periodically at predetermined time intervals.
Based on the sound levels of the mobile devices, the server 150 generates conference information including information on the current speaker and transmits the conference information to the mobile devices of the attendees. Each mobile device that has received the conference information from the server 150 may display the information on the current speaker on its display unit.
In this embodiment, the input sound information associated with an input sound captured at each mobile device includes the voice activity information of the input sound. The voice activity information of each mobile device is determined from a ratio of a current input sound level to an average input sound level over a predetermined period of time. The ratio indicates the loudness of a current input sound at a given time in comparison with an average input sound over a predetermined period of time. The average input sound may represent a background sound or an ambient sound around a mobile device that has been continuously emanating from the surroundings of the mobile device and, therefore, the ratio may curb or get rid of the effect of the background sound in determining the current speaker. Each mobile device transmits the input sound information including voice activity information to the server 150.
With reference to
The current speaker may be determined periodically at predetermined time intervals.
Based on the sound level ratios of the mobile devices, the server 150 generates conference information including information on the current speaker and transmits the conference information to the mobile devices of the attendees. Each mobile device that has received the conference information from the server 150 may display the information on the current speaker on its display unit.
In this embodiment, the input sound information associated with an input sound captured at each mobile device includes the voice activity information of the input sound. The voice activity information of each mobile device is determined from a probability that an input sound of the mobile device matches acoustic characteristics of a voice of a user of the mobile device. The acoustic characteristics may be pre-stored in each mobile device. For example, a message displayed on a display unit of the mobile device prompts the user to read a predetermined phrase so that the voice of the user is stored in the mobile device and is processed to analyze and store the acoustic characteristics thereof. In one embodiment, an acoustic model representing the acoustic characteristics of the user's voice may be used. Specifically, a probability that the input sound corresponds to the acoustic model may be determined based on a degree of similarity between the input sound and the acoustic model. For example, the degree of similarity may be estimated based on a Euclidean distance between a vector representing the input sound and another vector representing the acoustic model. Each mobile device transmits the input sound information including voice activity information to the server 150.
With reference to
The current speaker may be determined periodically at predetermined time intervals.
Based on the matching probabilities of the mobile devices, the server 150 generates conference information including information on the current speaker and transmits the conference information to the mobile devices of the attendees. Each mobile device that has received the conference information from the server 150 may display the information on the current speaker on its display unit.
In one embodiment of the present disclosure, the server 150 calculates an arrangement of attendees at a conference based on a degree of similarity between the input sound information of each pair of the mobile devices of the attendees. It is assumed that N attendees with their mobile devices such as the mobile devices 160 and 162 participate in a conference at one specified location such as the location 110. The server 150 identifies the N attendees based on degrees of similarity between input sound information from the mobile devices. Further, the server 150 identifies the location of the N mobile devices based on location information transmitted from the N mobile devices. Each of the N mobile devices also transmits its input sound information to the server, and the attendee arrangement calculating unit 370 of the server 150 calculates an N×N matrix based on the input sound information from the N mobile devices. The input sound information from each mobile device includes the input sound of the mobile device and/or the sound signature of the input sound. The entry of the i-th row and the j-th column of the N×N matrix, which may be referred to be as ai,j, may be calculated based on a degree of similarity between the input sound from the i-th mobile device and the input sound from the j-th mobile device of the N mobile devices. Although the above embodiment employs a degree of similarity, it should be appreciated that a degree of dissimilarity between the input sound information of each pair of the mobile devices of the attendees may be used interchangeably.
In some embodiments, the degree of similarity may be calculated based on a Euclidean distance between a vector representing the sound signature from the i-th mobile device and another vector representing the sound signature from the j-th mobile device. For example, the degree of similarity may be a value determined to be inversely proportional to the Euclidean distance, e.g., a reciprocal number of the Euclidean distance or a value of taking logarithm of the reciprocal number, whereas the degree of dissimilarity may be a value proportional to the Euclidean distance.
In one embodiment, each entry of the N×N matrix may be calculated based on a difference in a sound level between the input sounds of each pair of the N mobile devices. For example, the entry of the i-th row in the j-th column may be determined based on a difference or a ratio of the input sound level of the i-th mobile device with respect to that of the j-th mobile device.
After every entry of the N×N matrix is determined, the attendee arrangement calculating unit 370 transforms the N×N matrix to a 2×N matrix through a dimension reduction methodology such as PCA (principal component analysis), MDS (multidimensional scaling), or the like. Since the N×N matrix is, in general, a symmetric matrix, an Eigen decomposition process may be performed on the N×N matrix so that two largest eigenvectors constitute the 2×N matrix. Then, the two entries in each column of the 2×N matrix may be regarded as the x and y coordinates of a specified mobile device on a two-dimensional plane. For example, the two entries a1,j and a2,j in the j-th column of the 2×N matrix may be the x and y coordinates of the j-th mobile device on a two-dimensional plane.
After every entry is determined, the attendee arrangement calculating unit 370 transforms the 4×4 matrix to a 2×4 matrix, for example, using the above described methodology such as PCA or MDS. The entries in each column of the 2×4 matrix indicate the x and y coordinates of each mobile device on a two-dimensional plane. For example, the entries a1,1 and a2,1 may respectively indicate the x and y coordinates of the mobile device 1201, i.e., (x1, y1). The locations of the mobile devices are regarded as the locations of the attendees and thus the arrangement of the attendees can be represented on a two-dimensional plane as shown in
The arrangement on the two-dimensional plane shows relative positional relationships between the attendees. Thus, the actual arrangement of the attendees may be obtained through certain processes such as rotating, scaling, or flipping the arrangement represented on the two-dimensional plane with the x and y coordinates.
The server 150 generates conference information including information on the arrangement of the attendees calculated as above and transmits the conference information to each of the mobile devices of the attendees. The display unit of each mobile device may visually display the arrangement of the attendees as shown in
In one embodiment of the present disclosure, the log generating unit 360 of the server 150 generates a meeting log of a conference including attendee participation information. The attendee participation information includes a variety of activities of the attendees at the conference, e.g., when which attendee joins the conference, when which attendee is a current speaker at a particular time, when which attendee quits the conference, or the like.
Specifically, the attendee determining unit 320 of the server 150 determines that a new attendee has joined the conference based on the degree of similarity between the input sound from the mobile device of the new attendee and the input sound from each of the other mobile device of the other attendees. Then, the log generating unit 360 updates the log information, e.g., with the time when the new attendee has joined, identification of the new attendee, etc. Similarly, the attendee determining unit 320 of the server 150 also determines that one of the attendees at the conference has quit the conference based on the degree of similarity between the input sound from the mobile device of the quitting attendee and the input sound from each of the other mobile device of the other attendees. Then, the log generating unit 360 updates the log information, e.g., with the time when the attendee has quit, identification of the quitting attendee, etc. The log generating unit 360 further updates the log information, e.g., with identification of a current speaker at a given time.
The log information may be generated in a form capable of representing a diagram as shown in
In some embodiments, the log information may include the total time that each attendee is determined as the current speaker. Further, the log information may further include the ratio of the total time as the current speaker to the entire conference time for each attendee.
The server 150 generates conference information including the log information generated in the manner as described above and transmits the conference information to each of the mobile devices of the attendees. The display unit of each of the mobile devices may display the log information.
The mobile device 1400 is capable of providing bidirectional communication via a receive path and a transmit path. On the receive path, signals transmitted by base stations are received by an antenna 1412 and are provided to a receiver (RCVR) 1414. The receiver 1414 conditions and digitizes the received signal and provides samples such as the conditioned and digitized digital signal to a digital section for further processing. On the transmit path, a transmitter (TMTR) 1416 receives data to be transmitted from a digital section 1420, processes and conditions the data, and generates a modulated signal, which is transmitted via the antenna 1412 to the base stations. The receiver 1414 and the transmitter 1416 may be part of a transceiver that may support CDMA, GSM, LTE, LTE Advanced, etc.
The digital section 1420 includes various processing, interface, and memory units such as, for example, a modem processor 1422, a reduced instruction set computer/digital signal processor (RISC/DSP) 1424, a controller/processor 1426, an internal memory 1428, a generalized audio encoder 1432, a generalized audio decoder 1434, a graphics/display processor 1436, and an external bus interface (EBI) 1438. The modem processor 1422 may perform processing for data transmission and reception, e.g., encoding, modulation, demodulation, and decoding. The RISC/DSP 1424 may perform general and specialized processing for the mobile device 1400. The controller/processor 1426 may perform the operation of various processing and interface units within the digital section 1420. The internal memory 1428 may store data and/or instructions for various units within the digital section 1420.
The generalized audio encoder 1432 may perform encoding for input signals from an audio source 1442, a microphone 1443, etc. The generalized audio decoder 1434 may perform decoding for coded audio data and may provide output signals to a speaker/headset 1444. The graphics/display processor 1436 may perform processing for graphics, videos, images, and texts, which may be presented to a display unit 1446. The EBI 1438 may facilitate transfer of data between the digital section 1420 and a main memory 1448.
The digital section 1420 may be implemented with one or more processors, DSPs, microprocessors, RISCs, etc. The digital section 1420 may also be fabricated on one or more application specific integrated circuits (ASICs) and/or some other type of integrated circuits (ICs).
In general, any device described herein may represent various types of devices, such as a wireless phone, a cellular phone, a laptop computer, a wireless multimedia device, a wireless communication personal computer (PC) card, a PDA, an external or internal modem, a device that communicates through a wireless channel, etc. A device may have various names, such as access terminal (AT), access unit, subscriber unit, mobile station, mobile device, mobile unit, mobile phone, mobile, remote station, remote terminal, remote unit, user device, user equipment, handheld device, etc. Any device described herein may have a memory for storing instructions and data, as well as hardware, software, firmware, or combinations thereof.
The techniques described herein may be implemented by various means. For example, these techniques may be implemented in hardware, firmware, software, or a combination thereof. Those ordinary skill in the art would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, the various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
For a hardware implementation, the processing units used to perform the techniques may be implemented within one or more ASICs, DSPs, digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, a computer, or a combination thereof.
Thus, the various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with a general-purpose processor, a DSP, an ASIC, a FPGA or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
For a firmware and/or software implementation, the techniques may be embodied as instructions stored on a computer-readable medium, such as random access memory (RAM), read-only memory (ROM), non-volatile random access memory (NVRAM), programmable read-only memory (PROM), electrically erasable PROM (EEPROM), FLASH memory, compact disc (CD), magnetic or optical data storage device, or the like. The instructions may be executable by one or more processors and may cause the processor(s) to perform certain aspects of the functionality described herein.
If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, a server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, the fiber optic cable, the twisted pair, the DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes CD, laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. Alternatively, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal. Alternatively, the processor and the storage medium may reside as discrete components in a user terminal.
The previous description of the disclosure is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the spirit or scope of the disclosure. Thus, the disclosure is not intended to be limited to the examples described herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Although exemplary implementations may refer to utilizing aspects of the presently disclosed subject matter in the context of one or more stand-alone computer systems, the subject matter is not so limited, but rather may be implemented in connection with any computing environment, such as a network or distributed computing environment. Still further, aspects of the presently disclosed subject matter may be implemented in or across a plurality of processing chips or devices, and storage may similarly be effected across a plurality of devices. Such devices may include PCs, network servers, and handheld devices.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
The present. Application for patent claims priority to Provisional Application No. 61/419,683 filed on Dec. 3, 2010, and assigned to the assignee hereof and hereby expressly incorporated by reference herein.
Number | Date | Country | |
---|---|---|---|
61419683 | Dec 2010 | US |