The invention relates to a method for the automatic comparison of information characterizing reference values and information characterizing current values of sound-reproducing systems of a system of microphones and speakers for the control of the sound-reproducing system.
The field of the invention is that of the automatic control of the gains, functioning and position of several microphones and several speakers in the context of a system of videoconferencing between participants located at distinct sites that are generally remote sites. The invention can also be applied to the control of microphones and speakers installed in the same room such as a theatre stage, concert hall or cinema hall. It can be used to control the spatialized sound rendition of the scene which provides concordance between visual images and sound. In the videoconferencing context, the invention makes it possible to approach a natural communications situation: when a participant changes position in a remote room during a meeting, the sound follows him in the room in which he is being listened to, with a passage, for example, from one speaker to another as he moves. The microphones and speakers are designated, without distinction, by the term transducers.
The problem is to detect the changes that occur at the transducers between their installation and the times at which the checks are made.
An object of the present invention therefore is a method of comparison between pieces of information characterizing reference values and pieces of information characterizing current values of sound-reproducing systems of a system of (n) microphones mi and (p) speakers hpj for the control of said sound-reproducing systems characterized in that:
A: for each speaker hpj,
B: a reference matrix Qr is saved, this reference matrix being constituted by all the pieces of reference information hpjmi obtained following the sending of the sound signal S,
C: as soon as a comparison is to be made, the step A is run with a sound signal S′ to obtain current information on a matrix Q,
D: the matrices Q and Qr are compared.
An object of the invention is also a device for comparing pieces of information characterizing reference values and pieces of information characterizing current values of sound-reproducing systems of a system of n microphones mi and p speakers hpj for the control of the sound-reproducing system, characterized in that the control system comprises means for the measurement of the pieces of information hpjmi characterizing the sound-reproducing systems comprising a microphone mi and a speaker hpj, digital processing means to compare said pieces of information hpjmi and, connected to these digital processing means, means for saving the matrix Qr constituted by all the pieces of information hpjmi.
An object of the invention is also a system for the control of sound-reproducing systems comprising several devices such as those mentioned here above, characterized in that the devices are distributed among several rooms and in that the control system comprises a high bit-rate telecommunications network connecting said rooms and means to centralize the management of the devices.
Other special features and advantages of the invention shall appear more clearly from the following description given by way of a non-restrictive example, with reference to the appended drawings, of which:
a) is a diagrammatic view of a videoconferencing room according to the invention,
b) is a diagrammatic view of the direct paths between speakers and microphones,
a) and 2b) are views of sound-reproducing systems respectively in the case of local processing and when the processing is done in the network,
a) and 3b) respectively show examples of curves representing white noise and USASI noise on the one hand and pink noise and pseudo-random binary sequences on the other hand,
A videoconference is set up between participants distributed among several rooms, a high-bit-rate communications network such as an ATM network being used to convey visual and sound information. A videoconferencing room shown
The sound-reproducing systems between the microphones mi and the speakers hpj of a local processing system (shown in
According to another embodiment, the sound-reproducing systems between the microphones mi and the speakers hpj of a remote processing system shown in
A routing system A obtained by a multiplexer/demultiplexer also called a switching matrix, which is commercially available, may be inserted if necessary into the sound-reproducing systems between, firstly, the analog-digital converters ADCi and the encoders Ci and, secondly, the decoders Dj and the analog-digital converters ADCj. A remotely controllable system A of this kind makes it possible, at this level of the sound-reproducing system, to route the information characterizing a transducer from one transducer to another.
Each element of these sound-reproducing systems must be adjusted so as to provide for efficient sound transmission. During the installation of these elements, which is also known as an alignment, the gains, wirings and positions of the transducers of each room are set, and these parameters are stored in a file of a digital processing card of the signal.
To simplify the matter, the word “transducer” (speaker or microphone respectively) will designate the transducer (the speaker or microphone respectively) and the elements of the sound-reproducing system between the digital processing card and the transducer (speaker or microphone respectively).
Thereafter, when the videoconference room is used, a week or a month later for example, checks may be made on any modifications that will have occurred in these parameters in order to make the necessary corrections The transducers may have been moved and in certain cases may have become defective; the room configuration may have been changed; the amplifiers also may have been subjected to high variations over time, possibly caused by the heating of the electronic components. It may be preferred sometimes to act on the transducers in order to compensate for a defect in another element of the sound-reproducing system.
The term “sound signal” refers to a signal that can be sent by the speakers and detected by the microphones. As indicated in
All these hpjmi pieces of information constitute a matrix with a size n*p, a line of the matrix corresponding to a speaker and a column corresponding to a microphone.
The first time this matrix is constituted after the alignment, or at another preferred time, it is saved in memory: it is called the reference matrix Qr, the elements hpjmi of this matrix being reference values. Thereafter, when a check has to be made on the parameters of these transducers, these steps are reiterated with a signal S′ to obtain current values hpjmi and set up a matrix Q that is compared with the matrix Qr.
In certain cases, it is simpler to choose a signal S′ identical to the signal S, especially when it is sought to compare gains corresponding to the ratio between the energy of the signal sent and the energy of the signal received. In other cases, S is different from S′ and the elements of the matrices Qr and Q to be compared are different in nature. By saving S and S′ and by applying an adequate processing operation to the elements of Q, it is possible to deduce elements comparable to those of Qr. With S being known, it is possible to choose a signal S′ that enables, for example, the measurement of the impulse response or the transfer function hpjmi between the transmission point hpj and the reception point mi; given S and the characteristics of hpjmi, it is possible, from the elements hpjmi of Q, to deduce elements comparable to those of Qr by applying an adequate processing operation (Fourier transform, . . . ).
It is also possible to set up several matrices Qr by considering several types of signals S and then set up several corresponding matrices Q. If the signal S is, for example, a white noise filtered in different octaves, it is possible to set up a matrix Qr for each octave.
In general, the elements hpjmi are set up from signals S and S′ considered in the time domain, but it is possible to base the operation on the frequency domain and set up the matrices Q and/or Qr from the spectral responses hpjmi of the microphones mi at a frequency band sent by the speakers hpj: whatever the width of the frequency band of the signals S and S′ sent by the speakers hpj, only a determined frequency band will be received by the microphones mi It could be a frequency band with a width of about 200 Hz, an octave band or a one-third-octave band. This frequency band will then be made in order to slide to sweep through a spectrum of 0 Hz to 1000 Hz for example.
During the alignment, the flatness of the spectrum of each transducer is verified, i.e. it is verified that all the frequencies pass through each transducer. If one of them has irregularities, the necessary corrections are made. The microphones sometimes have irregularities related to the table or room effect (to the reflections from the table or room), where the wave reflected by the table or room may be in phase opposition with the direct wave, then giving rise to black regions in the spectral response: the gain of the microphone will then be increased in the corresponding frequency band.
During subsequent checks, the spectral responses of the transducers by frequency band will be verified. The comparison between the matrices Q and Qr makes it possible, especially, to obtain a piece of information on any movement undergone by the transducers, these transducers being directional and their directivity depending on the frequency Depending on the results of the comparisons, it is also possible to make a spectral correction to the transducers in order to reduce the coupling between speakers and microphones and cause less deformation in the sound signals sent out by the participants. The exploitation of the results is sometimes more complex than it is when the operation is situated in the time domain.
The sound signals S and S′ are generally recorded in the internal memory of the signal digital processing card. They may possibly be computed (generated) in this card.
These sound signals may, for example, be a white noise, a pink noise, an USASI noise, a pseudo-random binary sequence respectively shown in
The method according to the invention has been carried out with a pink noise sent successively to each of the speakers for one second. Between two sending operations on two consecutive speakers, there is a wait for a certain time (a period of silence) for the next sound signal to start in a state of the sound-reproducing system that is, in principle, a stable state. The invention has been achieved with a two-second period of silence. The elements hpjmi are determined for each hpj at the same instant t of the sound signal. If, for example, hp1m1, hp1m2, . . . , hp1mn are determined at t=start of the sound signal+0.9 second, then hp2m1, . . . , hp2mn will be determined at t+3 seconds, hp3m1, . . . , hp3mn at t+6 seconds, etc.
In adding up and averaging each line and each column of the matrices Qr and Q, possibly after the processing of the elements of a matrix to obtain elements directly comparables to those of the other matrix, a mean value HPjQr, HPjQ respectively for each speaker hpj is calculated by the formula:
and a mean value MiQr, MiQ respectively for each microphone mi is calculated by the formula:
By computing HPjQ/HPjQr, we obtain the divergence between the speaker considered and its reference value. Similarly, by computing MiQ/MiQr, we obtain the divergence between the microphone itself and its reference value. If, for the speakers as well as the microphones, this divergence is contained in a predetermined range referenced FHP for the speakers and FM for the microphones, then no correction is applied as the difference is tolerable. A threshold of 3 dB is, for example, commonly accepted for a visioconference room. For divergence values outside the predetermined range, a corresponding divergence is applied as a corrective value to the transducer, at the signal digital processing card. As the case may be, the correction could be applied to the gain of the transducer itself. In certain cases, the correction will consist in repositioning the transducer; in other cases, it will not be possible to apply the correction because of a transducer malfunction, and the defective transducer will then be changed.
The characteristics of the pseudo-random binary sequences make them a preferred signal for the high-precision measurement of the impulse response of a system according to the invention. The use of a pseudo-random binary sequence as a sound signal sent to the speakers hpj therefore enables the measurement of the impulse responses, as a function of time Rji, of all the microphones mi. Depending on the instant at which the impulse response is considered, each impulse response Rji gives information on the delay, namely, the propagation time between a speaker hpj and a microphone mi, the direct wave corresponding to the direct paths between a speaker hpj and microphone mi, or again the room effect corresponding to the paths with one or more reflections.
In
It is thus possible to evaluate the direct wave resulting from the direct path between the speaker hpj and the microphone mi. Each element hpjmi of the matrices Q and Qr then represents the first spike of the impulse response.
When the evaluation to be made relates to the room effect due to the indirect paths between the speaker hpj and the microphone mi, namely the paths of the signals that have undergone various reflections on the walls of the room, on the furniture or on any other obstacle, each element hpjmi of the matrices Q and Qr will represent the part of the impulse response that succeeds the first spike and starts at t2ji.
In one application of the invention, the signal-to-noise ratio of the microphones mi is evaluated by comparing the mean values of the microphones computed from the matrix Qr, set up in considering a sound signal S, with the mean values of the microphones computed from the matrix Q set up in considering a signal S′ of silence.
The signal S may be, especially, a white, rose or USASI noise, or a pseudo-random binary sequence. If the signal S is interspersed with silences, in practice, the signal-to-noise ratio will be measured during a phase of silence.
It is also possible to remotely process the information characterizing the signals coming from a local room, as a telecommunications or computer network connects the rooms to each other. The information processing comprises especially the measurements, computations, saving operations and corrections to be made. Remote processing can be done by a computer remotely controlling another computer, located in a local room, through the network.
It is also possible, in the local room, to deal with the case of the remote room or rooms by sending the signals S and S′ through the telecommunications network and retrieving, in the local room, through the network, information characterizing the result of these signals in the remote room or rooms. The same method as described here above is used and, at the level of the signal digital processing card, coefficients are applied to the pieces of information characterizing the transmitted and retrieved signals to have a balanced system.
An echo phenomenon sometimes occurs: when a participant speaks in a room A, the corresponding sound signal is transmitted to the participants located in a room B by the speakers of this room B, the microphones of this room B taking up the signal coming from these speakers and sending them on to the room A. The speaker of the room A hears himself again with the echo. This echo can be evaluated by measuring the level of the return signal with respect to the level of the signal sent. The control parameters of the echo cancellation or transducer gain variation algorithms are then adjusted.
It is also possible to comprehensively process the pieces of information hpjmi in the telecommunications network, for example at the level of a multipoint bridge PMP interconnecting several remote rooms Sa, shown in
The device according to the invention comprises a signal digital processing card CTN, shown in
Number | Date | Country | Kind |
---|---|---|---|
00/01976 | Feb 2000 | FR | national |
The present application is a continuation of U.S. application Ser. No. 10/203,856 entitled “METHOD AND DEVICE FOR COMPARING SIGNALS TO CONTROL TRANSDUCERS AND TRANSDUCER CONTROL SYSTEM,” filed Dec. 23, 2002, which claims priority to the PCT Application No. FR01/00457 filed Feb. 15, 2001, which claim priority to French Application No. 00 01976 filed Feb. 17, 2000, which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 10203856 | Dec 2002 | US |
Child | 11755563 | May 2007 | US |