The disclosure relates to a microphone assembly of a loud speaking conference endpoint, more specifically, a microphone assembly is provided for minimizing acoustic feedback from a loudspeaker.
A conventional video conferencing endpoint includes a codec, a camera, a video display, a loudspeaker and a microphone, integrated in a chassis or a rack. In larger endpoints for use in meetings and boardrooms, the audio equipment is installed separately. The microphone is often placed on the meeting table to bring the audio recorder closer to the audio source in an acoustic sense.
However, personal video conferencing endpoints, also referred to as desktop terminals, are now becoming more common in offices as a substitute or supplement to larger endpoints or to traditional telephony. Personal equipment is more portable, and is likely to be placed close to the user on a table. Thus, all the equipment belonging to one endpoint, including the microphone is integrated in one device.
The microphone in a communication system should pick up voice from the user (called the near end user) with maximum quality and a suitable sensitivity. However, because a desktop system is relatively small, and all parts (including microphone and speaker) are integrated in one device, the microphone is positioned relatively close to the loudspeaker. This results in audio problems, as described below.
Desktop telecommunication terminals (video conferencing systems, IP-phones, or any loud speaking integrated communication system) with integrated loudspeaker(s) and microphone(s), for handsfree operation (loud speaking mode) experience an effect referred to as feedback. Feedback occurs when the sound from the loudspeaker of a terminal is received by the microphone of the same terminal. Feedback is highly unwanted in communication systems, for a number of reasons, as discussed below.
Feedback causes an echo in the communication (a loop back of sound) where the user hears a delayed version of his/her own voice. Echo in a communication system can be very disturbing, especially if the communication system includes large delays. The subjective degradation in communication quality caused by the echo depends on several factors, including the level of the echo, and the delay.
Furthermore, feedback also restrains the maximum allowable output level on the loudspeaker, which may result in the near end user having difficulties hearing the far end user. As mentioned, desktop systems are often compact in size, and the loudspeaker is placed close to the microphone. Often the microphone is closer to the loudspeaker than the near end user. Hence, the sound level from the loudspeaker is often more powerful than the sound level (speech) from the near end user. If the sound level from the loudspeaker is too high, it may overload the microphone (acoustical overload) or the circuits (electrical overload), which leads to distortion of the microphone signal. Thus, the sound levels from the loudspeaker picked up by the microphone constrains the design of audio circuits, audio signal processing, and the allowed maximum level from the loudspeaker.
The loudspeaker signal can include far end talk and sounds generated by the near end system, e.g. key tones, ringing tones and so on. The loudspeaker signal is picked up by the microphone and transmitted to the far end. In general, the loudspeaker signal is unwanted in the microphone signal sent to the far end. The captured loudspeaker signal (referred to as echo) must be removed, or suppressed, from the microphone signal if the level and/or delay of the echo is large enough to cause significant disturbance in the communication. This is a well developed technology, and acoustic echo cancellation and/or echo suppression algorithms are incorporated in most digital IP based communication systems.
Therefore, the goal of the microphone and loudspeaker design of an integrated communication system with loud speaking hands-free mode is to allow for the best possible near end sound pick up (sound from near end user, e.g. speech), while simultaneously minimizing the acoustical feedback level from the loudspeaker(s) to the microphone(s). This allows for the best possible quality in the signal sent to the far end, and the level of the near end loudspeaker can also be maximized, to the benefit of the near end user. Echo cancellation and suppression algorithms will also benefit from minimal acoustical feedback from the loudspeaker to the microphone, and the risk of overloading the microphone and the audio circuitry is reduced. Digital signal processing is often used to ensure that the microphone and audio circuits are not overloaded by limiting the maximum loudspeaker signal.
Acoustical feedback can be reduced by increasing the distance from a loudspeaker to a microphone. However, the physical dimensions of the integrated system dictate the maximum distance. In addition, other considerations might require placing the microphone closer to the loudspeaker than the maximal possible distance. For example, to avoid comb filter effects caused by a table reflection of speech, the microphone needs to be placed very close to the table surface. This might not be the optimal placement with regards to acoustical feedback in an integrated desktop system.
Directional microphones can also be utilized to maximize microphone sensitivity in one or more directions, and minimize or reduce the sensitivity towards the loudspeaker, and as such, are commonly used in telephony and conferencing equipment. For example, the Polycom Soundstation™ series uses such microphones. However, the physical properties of directional microphone elements require that sound waves reach both the front and the rear of the microphone. Hence, the microphones are typically mounted in an open acoustical space of the product, typically beneath a perforated area or grill. This allows free flow of air past the microphone, but also requires a fragile mounting, and does not allow adjustments or optimization of the directional behavior of the microphone.
Further, directional microphones only effectively suppress sound when the sound source is directly behind the microphone. This is seldom the case in a desktop system.
The requirements for sound quality are increasing as communication systems are using higher bandwidth audio. Increasingly, acoustic echo and feedback controls are becoming critical issues for desktop systems. Microphone design, placement and assembly are therefore critical factors for the optimization of sound quality.
The present disclosure employs a directional microphone element in a communication system in a way that maximizes microphone sensitivity in the direction of a near end user, while simultaneously minimizing the sensitivity in the direction of the integrated loudspeaker, to minimize feedback. The utilization of a directional microphone also reduces the ambient noise and reverberation pick-up.
More specifically, a desktop telecommunications terminal includes a loudspeaker and a directional microphone that has a front acoustical input port and a rear acoustical input port. The directional microphone is encapsulated in a housing. An acoustic waveguide is also disposed within the housing, and extends from the rear acoustical input port of the directional microphone to a waveguide inlet located on an upper surface of the desktop telecommunication terminal. The length and direction of the waive guide is tuned to simultaneously reduce an acoustic distance between the loudspeaker and the rear acoustical input port of the directional microphone, and to increase the acoustic distance between the user and the front acoustical input port of the microphone. A facing surface of the desktop telecommunication terminal includes at least one hole, and admits sound to the front acoustical input port of the microphone.
A more complete appreciation of the inventions and many of the attendant advantages thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings. However, the accompanying drawings and their exemplary depictions do not in any way limit the scope of the inventions embraced by this specification. The scope of the inventions embraced by the specification and drawings are defined by the words of the accompanying claims.
In the following, the present advancement will be discussed by describing a preferred embodiment, and by referring to the accompanying drawings. However, those skilled in the art will realize other applications and modifications within the scope of the invention as defined in the enclosed claims.
A microphone assembly for desktop telecommunication terminals is described herein. The exemplary assembly utilizes a directional electret condenser microphone element with a cardioid directivity pattern. The directional microphone has acoustical input ports at both a front and a rear of the element that, together with its internal design, gives the microphone a directional behavior. The directional behavior of the microphone is enhanced by guiding sound to the front and the rear sides of the microphone in a controlled way to maximize sensitivity in the direction of the near end user and minimize sensitivity in the direction of the integrated loudspeaker of a product. This is achieved by positioning the microphone at a front location of a base of a video conferencing terminal, in a mechanically controlled and robust way, using a tuned acoustical waveguide. The tuned acoustical waveguide is used to control the time delay between sound received at the front and the rear of the directional microphone, optimizing sound quality.
Directional microphones have acoustical input ports at both their front and the rear sides. The acoustical input ports are separated by an effective distance “d” which represents the distance that a sound wave must travel around the directional microphone in going from one acoustical input port to the other. Movements of a diaphragm inside the microphone are converted into voltages at the output of the microphone. The magnitude of the voltage output of the directional microphone is a function of the instantaneous difference in sound pressure on the opposite sides of diaphragm. As distance “d” becomes smaller and smaller, so too does the output voltage from the directional microphone. Velocity of sound in air at room temperature is 1128 feet per second, so that a f=2250 Hz audible signal has a wavelength of about 15 cm. Thus, even small separation distances provide sufficient phase difference between the acoustical input ports so that the directional microphone has a polar response pattern 202 as shown in
An exemplary embodiment of the present disclosure provides a microphone assembly which changes the acoustical distance of sound waves traveling to the rear acoustical input port of the microphone from one or more point sources, relative to a free field response in order to modify the directivity pattern of the microphone. The microphone assembly simultaneously optimizes the microphone response for maximum sensitivity in one direction, and minimizes the sensitivity in another direction, even if these directions are not 180 degrees apart. (In the case of the unmodified cardioid microphone free field response, the directions of the maximum and minimum sensitivity are separated by 180 degrees.)
As indicated in
The acoustical waveguide 602 extends from a top surface 606 of the housing 601 to a back surface 703 of the cavity 603. In another exemplary embodiment of the present disclosure, the channel is at an oblique angle both in azimuth and elevation relative to the central axis of the cavity 603 (said axis being parallel with the normal vector of the back surface). The acoustic waveguide 602 is angled towards the loudspeaker situated behind the microphone on the opposite side of the terminal. The length and direction of the acoustical waveguide 602 depend on the position of the loudspeaker relative to the microphone, and on a typical near end user 203 position relative to the microphone 201. As discussed below, the waveguide serves as a sound guide for sound from one or more sound sources to the rear acoustical input port of the microphone 201.
Though the acoustical waveguide 602 of
As shown in
When the housing 601 with the microphone 201 is mounted in a desktop system 401 the front acoustical input port of the microphone 201 faces away from the system. According to one exemplary embodiment of the disclosure, the front acoustical input port faces forward, in the general direction of the near end user. However, the microphone may also be tilted slightly towards the desktop (or table surface). The acoustical waveguide 602 for guiding sound to the rear acoustical input port is designed to simultaneously minimize the microphone sensitivity in the direction of the internal loudspeaker, and maximize the microphone sensitivity in the direction of the user. This is achieved by making the acoustical waveguide's 602 length dimension much larger than its diameter, and slightly angling the waveguide in the direction of the loudspeaker 204 to approximate a free field response. Thus, sound from the loudspeaker 204 arrives at the rear input port of the microphone before arriving at the front input port of the microphone, reducing the microphone's sensitivity to sound from the loudspeaker. Further, the additional distance the sound from the loudspeaker needs to travel to traverse the corners of the microphone housing and the protective cover increases the relative delay between the loudspeaker sound reaching the rear and the front acoustical input ports of the directional microphone, further decreasing sensitivity to loudspeaker sound.
Sensitivity is, however, enhanced with respect to the near end user. The acoustical waveguide 602 is angled in the direction of the loudspeaker, and simultaneously angled away from the near end user. The length and direction of the acoustical waveguide increase the acoustic distance between the near end user and the rear acoustical input port, relative to a free field acoustical distance. Sound from the near end user arrives at the front input port of the microphone without delay, while arriving at the rear input port of the microphone with delay, due to the configuration of the acoustical waveguide. The length and direction of the acoustical waveguide 602 increases the relative delay between sound reaching the rear of the unidirectional microphone and sound reaching the front of the unidirectional microphone, increasing the sensitivity of the microphone for sound coming from the user (speech). In other words, the increased delay experienced by the microphone “moves” the direction of sound closer to 0° as illustrated by arrow 503 in
The length of the channel guiding sound to the rear acoustical input port of the microphone causes the frequency response and directional properties to differ from the free field case. The long channel causes a narrower frequency range of directional behavior.
In another exemplary embodiment, mechanical protection of the microphone element is secured in a sturdy, rugged housing made out of a relatively hard rubber material.
The cavity 603 for housing the microphone element should encapsulate the microphone element. A gap between the rear of the microphone 201 and the back surface 703 of the cavity 603, together with the acoustical waveguide, may create a resonant system with a resonance peak at a resonance frequency within the frequency response. To control the resonance of the cavity, the distance between the microphone and the back surface should be minimized to move the resonance frequency as to a high frequency outside the voice frequency band 803. The distance between the back surface of the microphone housing and the microphone may be minimized by controlling the dimensions of the microphone housing, or by inserting an insert into the cavity between the rear surface and the microphone, and the like. Further, the diameter of the sound guide should be wide enough to minimize the low resonance peak. This will ensure a proper frequency response and directional behavior.
Alternatively, the resonance peak may also be attenuated using a filter, such as a digital filter or an analog filter. Further the filter may also be used to equalize the frequency response of the system to a predetermined response characteristic. For example, the filter may be designed to produce a maximally flat frequency response in the range of 300 Hz to 3400 Hz.
Structure-borne noise and vibrations from, for example, the tabletop surface on which the terminal is placed, can result from bumping or knocking the table. To minimize pickup of such sounds and vibrations from the terminal assembly or the table surface, the microphone housing 601 is preferably made of a vibration damping material. The material of the housing 601 should be quite hard for rigidity and protection, yet somewhat elastic to withstand varying stresses from the terminal 401 above it. Further, the material should also hold the microphone 201 in a fixed position, as described above. The housing 601 should be able to temporarily carry the weight of the whole terminal 401 without damage or deformation to acoustic waveguide 602. The material should be nonporous to minimize sound absorption. Suitable materials include, for example, an elastomer cast with hardness of at least shore 35.
The microphone housing 601 can be designed to be used as a base on which the desktop system rests. This significantly reduces the degree of integration, thereby making an independent microphone module that can easily be reused in new systems. In this context a “base” is a portion of the video conferencing terminal that is in contact with the surface upon which the terminal rests, such as a table, and may be integrally formed with the terminal or may be detachable from the terminal.
When the above aspects are considered, the following practical dimensions could be used according to one exemplary embodiment of the present disclosure: A acoustical waveguide width in the range of 1-4 mm, which matches sound entry holes in a typical unidirectional electret microphone element, a waveguide length in the range of 10-20 mm, and a protective cover thickness in the range of 0.5-5 mm.
Further, when used as a base for a system, the housing 601 also includes a cable guiding structure to position and thread signal cable from the microphone to the rest of the electronics in the system.
Any microphone element requiring sound wave entry from two directions could be used. A typical choice is a unidirectional cardioid electret condenser microphone of any size.
A benefit of the present disclosure is that the housing minimizes feedback from loudspeaker to microphone, while simultaneously maximizing microphone sensitivity to the user for a unidirectional microphone element, while keeping the microphone protected. The present disclosure also increases sound quality for full audio band sound pickup with only one acoustic waveguide tuned to optimize the directivity pattern of the microphone element and simultaneously minimize feedback.
Obviously, numerous modifications and variations of the present invention are possible in light of the above teachings. It is therefore to be understood that within the scope of the appended claims, the invention may be practiced otherwise than as specifically described herein.
Number | Date | Country | Kind |
---|---|---|---|
20076609 | Dec 2007 | NO | national |