The present disclosure relates to the field of acoustics, and in particular to acoustic input-output devices.
A loudspeaker assembly transmits sound by generating mechanical vibrations. A microphone receives voice signals of a user by picking up vibrations of, e.g., the skin, when the user speaks. When the loudspeaker assembly and the microphone work at the same time, the mechanical vibrations of the loudspeaker assembly would be transmitted to the microphone, so that the microphone receives the vibration signals of the loudspeaker assembly and generates echoes, which reduces the quality of the sound signals generated by the microphone and affects the usage experience of the user.
The present disclosure provides an acoustic input-output device that may reduce the effect of the loudspeaker assembly on the microphone, reduce the intensity of the echo signals generated by the microphone, and improve the quality of the voice signals collected by the microphone.
In the present disclosure, an acoustic input-output device is provided for a purpose of reducing the effect of a loudspeaker assembly on the vibration of a bone conduction microphone, reducing the intensity of echo signals generated by the bone conduction microphone, and improving the quality of sound signals picked up by the bone conduction microphone.
To achieve the above-mentioned purpose, the present disclosure provides the following technical solutions.
An acoustic input-output device a loudspeaker assembly and a microphone.
The loudspeaker assembly is configured to transmit sound waves by generating a first mechanical vibration. The microphone is configured to receive a second mechanical vibration of a voice signal source that is generated when the voice signal source provides a voice signal. The microphone generates a first signal and a second signal in response to the first mechanical vibration and the second mechanical vibration, respectively. In a specific frequency range, a ratio of an intensity of the first mechanical vibration to an intensity of the first signal is greater than a ratio of an intensity of the second mechanical vibration to an intensity of the second signal.
In some embodiments, the loudspeaker assembly is a bone conduction loudspeaker assembly. The bone conduction loudspeaker assembly includes a housing and a vibration component that is connected to the housing and configured to generate the first mechanical vibration. The microphone is directly or indirectly connected to the housing.
In some embodiments, when the user wears the acoustic input-output device, the clamping force formed between the acoustic input-output device and a contact portion of the user is within a range of 0.1 N to 0.5N.
In some embodiments, the acoustic input-output device further includes a damping structure. The microphone is connected to the loudspeaker assembly through the damping structure.
In some embodiments, the damping structure includes a damping material with an elastic modulus less than a first threshold.
In some embodiments, the elastic modulus of the damping material is within a range of 0.01 Mpa to 1000 Mpa.
In some embodiments, a thickness of the damping structure is within a range of 0.5 mm to 5 mm.
In some embodiments, a first portion of a surface of the microphone is configured to conduct the second mechanical vibration. A second portion of the surface of the microphone is provided with the damping structure and connected to the loudspeaker assembly through the damping structure.
In some embodiments, the first portion of the surface of the microphone is provided with a vibration transmission layer.
In some embodiments, an elastic modulus of a material of the vibration transmission layer is greater than a second threshold.
In some embodiments, the loudspeaker assembly includes a housing and a vibration component. There is a first connection between the housing and the vibration component. There is a second connection between the microphone and the housing. The first connection includes a first damping structure.
In some embodiments, the second connection includes a second damping structure.
In some embodiments, a mass of the vibration component is within a range of 0.005 g to 0.3 g.
In some embodiments, when a user wears the acoustic input-output device, a clamping force formed between the acoustic input-output device and a contact portion of the user is within a range of 0.01 N to 0.05N.
In some embodiments, the loudspeaker assembly includes a first diaphragm and a second diaphragm. Vibration directions of the first diaphragm and the second diaphragm are opposite.
In some embodiments, the loudspeaker assembly includes a housing. The housing includes a first cavity and a second cavity. The first diaphragm and the second diaphragm are located in the first cavity and the second cavity, respectively. A side wall of the first cavity is set with a first sound transmission hole and a second sound transmission hole. A side wall of the second cavity is opened with a third sound transmission hole and a fourth sound transmission hole. A phase of sound transmitted by the first sound transmission hole is the same as a phase of sound transmitted by the third sound transmission hole. A phase of sound transmitted by the second sound transmission hole is the same as a phase of sound transmitted by the fourth sound transmission hole.
In some embodiments, the first sound transmission hole and the third sound transmission hole are provided on a same side wall of the housing. The second sound transmission hole and the fourth sound transmission hole are provided on another same side wall of the housing. The first sound transmission hole and the second sound transmission hole are provided on non-adjacent side walls of the housing. The third sound transmission hole and the fourth sound transmission hole are provided on non-adjacent side walls of the housing.
In some embodiments, the loudspeaker assembly further includes a first magnetic circuit assembly and a second magnetic circuit assembly configured to form a magnetic field. The first magnetic circuit assembly is configured to cause the first diaphragm to vibrate. The second magnetic circuit assembly is configured to cause the second diaphragm to vibrate. The first cavity and the second cavity are spatially connected. The first magnetic circuit assembly and the second magnetic circuit assembly are connected directly or indirectly.
In some embodiments, the voice signal source is a vibration portion of a user providing the voice signal. When the user wears the acoustic input-output device, a distance between the vibration portion of the user and the microphone is less than a third threshold.
In some embodiments, the microphone is located close to at least one of the vocal cords, the larynx, the mouth, or the nasal cavity of the user.
In some embodiments, the acoustic input-output device further includes a fixing assembly configured to maintain stable contact between the acoustic input-output device and a user. The fixing assembly is fixedly connected to the loudspeaker assembly.
In some embodiments, the acoustic input-output device is a headset. The fixing assembly includes a headband and two earmuffs ear cups connected to both sides of the headband. The headband is configured to fix the acoustic input-output device to the skull of the user and fix the two earmuffs to both sides of the skull of the user. The microphone and the loudspeaker assembly are arranged in the two earmuffs, respectively.
In some embodiments, the acoustic input-output device is a binaural headset. One side of each earmuff in contact with the user is provided with a sponge sleeve. The microphone is accommodated in the sponge sleeve.
In some embodiments, a ratio of the intensity of the second signal to the intensity of the first signal is greater than a threshold.
An acoustic input-output device is provided in one or more embodiments of the present disclosure. The acoustic input-output device includes a loudspeaker assembly and a microphone. The loudspeaker assembly is configured to transmit sound waves by generating a first mechanical vibration. The microphone is configured to receive a second mechanical vibration of a voice signal source that is generated when the voice signal source provides a voice signal. The microphone generates a first signal and a second signal in response to the first mechanical vibration and the second mechanical vibration, respectively. A first angle formed by a vibration direction of the microphone and a direction of the first mechanical vibration is within a set angle range so that in a specific frequency range, a ratio of an intensity of the first mechanical vibration to an intensity of the first signal is greater than a ratio of an intensity of the second mechanical vibration to an intensity of the second signal.
In some embodiments, the first angle is within an angle range of 20 degrees to 90 degrees.
In some embodiments, the first angle includes 90 degrees.
In some embodiments, a second angle formed by the vibration direction of the microphone and a direction of the second mechanical vibration is within a set angle range so that the ratio of the intensity of the first mechanical vibration to the intensity of the first signal is greater than the ratio of the intensity of the second mechanical vibration to the intensity of the second signal.
In some embodiments, the second angle is within an angel range of 0 degrees to 85 degrees.
The present disclosure is further illustrated in terms of exemplary embodiments, and these exemplary embodiments are described in detail with reference to the drawings. These embodiments are not limiting. In these embodiments, the same number indicates the same structure, wherein:
To illustrate the technical solutions related to the embodiments of the present disclosure, a brief introduction of the drawings referred to in the description of the embodiments is provided below. Obviously, the drawings described below are only some examples or embodiments of the present disclosure. Those skilled in the art, without further creative efforts, may apply the present disclosure to other similar scenarios according to these drawings. It should be understood that these exemplary embodiments are merely provided for those skilled in the art to better comprehend thereby realizing the present disclosure, but not limit the scope of the present disclosure in any way. Unless apparent from the locale or otherwise stated, like reference numerals represent similar structures or operations throughout the several views of the drawings.
As shown in the present disclosure and claims, unless the context clearly suggests an exception, the words “one”, “a”, “an” and/or “the” are not specific to the singular form, but may also include the plural form. In general, the terms “includes” and “comprises” suggest only the inclusion of clearly identified steps and elements that do not constitute an exclusive list, and the method or apparatus may also contain other steps or elements. The term “based on” is “based, at least in part, on”. The term “an embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”. Definitions of other terms will be given in the description below. In the following, without loss of generality, the description of “bone conduction microphone,” “bone conduction microphone assembly,” “bone conduction loudspeaker”, “bone conduction loudspeaker,” or “bone conduction headset” will be used when describing the bone conduction related technology in the present disclosure. The description of “air conduction microphone,” “air conduction microphone assembly,” “air conduction loudspeaker”, “air conduction loudspeaker,” or “air conduction headset” will be used when describing the air conduction related technology in the present disclosure. This description is only a form of bone conduction application, for the ordinary skilled person in the field, “speaker” or “headset” can also be replaced by other similar words, such as “player”, “hearing aid”, etc. In fact, the various implementations of the invention can be easily applied to other non-speaker-based hearing devices. For example, for professionals in the field, after understanding the basic principle of the bone conduction speaker, it is possible to make various modifications and changes in the form and details of the specific ways and steps of implementing the bone conduction speaker without departing from this principle, in particular, adding the function of environmental sound pickup and processing to the bone conduction speaker so that the speaker can realize the function of a hearing aid. For example, a sensor, e.g., a microphone can pick up the sound of the user/wearer's surroundings and, under a certain algorithm, transmit the sound processed (or the electrical signal generated) to the bone conduction speaker. That is, the bone conduction speaker can be modified in a certain way to include the function of picking up environmental sound and transmitting the sound to the user/wearer through the bone conduction speaker after certain signal processing, thereby realizing the function of a bone conduction hearing aid. By way of example, the algorithm described herein may include one or a combination of a noise cancellation, an auto gain control, an acoustic feedback suppression, a wide dynamic range compression, an active environment identification, an active anti-noise, a directional processing, a tinnitus processing, a multi-channel wide dynamic range compression, an active whistle suppression, a volume control, etc.
The loudspeaker assembly 110 may be configured to convert a signal containing sound information into a sound signal (also referred to as a voice signal). For example, the loudspeaker assembly 110 may generate a mechanical vibration to transmit sound waves (i.e., sound signals) in response to receiving the signal containing sound information. For ease of description, the mechanical vibration generated by the loudspeaker assembly 110 may be referred to as a first mechanical vibration. In some embodiments, the loudspeaker assembly may include a vibration component and/or a vibration transmission component (e.g., at least a portion of a housing of the acoustic input-output device 100, a vibration transmission sheet) that is connected to the vibration component. Energy conversion occurs when the loudspeaker assembly 110 generates the first mechanical vibration, so that the loudspeaker assembly 110 may achieve a conversion of the signal containing sound information to the mechanical vibration. The process of energy conversion may include a coexistence and a conversion of many different types of energy. For example, an electrical signal (i.e., the signal containing sound information) may be directly converted into the first mechanical vibration by a transducer in the vibration component of the loudspeaker assembly 110, and the first mechanical vibration is transmitted by the vibration transmission component of the loudspeaker assembly 110 to transmit the sound waves. As another example, the sound information may be contained in an optical signal, a specific transducer may achieve the conversion of the optical signal to the vibration signal. Other types of energy that may coexist and be converted during the operation of the transducer include thermal energy, magnetic energy, etc. An energy conversion manner of the transducer may include moving-coil, electrostatic, piezoelectric, moving-iron, pneumatic, electromagnetic, etc.
The loudspeaker assembly 110 may include an air conduction loudspeaker assembly and/or a bone conduction loudspeaker assembly. In some embodiments, the loudspeaker assembly 110 may include a vibration component and a housing. In some embodiments, when the loudspeaker assembly 110 is a bone conduction loudspeaker assembly, the housing of the loudspeaker assembly 110 may be in contact with a certain portion (e.g., face) of a user's body and transmit the first mechanical vibration generated by the vibration component to the auditory nerve through the bone to enable the user to hear the sound, and used as at least a portion of the housing of the acoustic input-output device 100 to accommodate the vibration component and the microphone assembly 120. In some embodiments, when the loudspeaker assembly 110 is an air conduction loudspeaker assembly, the vibration component may cause the air to vibrate to change the density of the air to enable the user to hear the sound, and the housing may be used as at least a portion of the housing of the acoustic input-output device 100 to accommodate the vibration component and the microphone assembly 120. In some embodiments, the loudspeaker assembly 110 and the microphone assembly 120 may be located in different housings.
The vibration component may convert a sound signal into a mechanical vibration signal, thereby generating the first mechanical vibration. In some embodiments, the vibration component (i.e., the transducer device) may include a magnetic circuit assembly. The magnetic circuit assembly may provide a magnetic field. The magnetic field may be configured to convert the signal containing sound information into the mechanical vibration signal. In some embodiments, the sound information may include video and audio files with a particular data format or data or files that may be converted to the sound through a particular way. The signal containing sound information may come from a storage component of the acoustic input-output device 100 or from an information generation, storage, or transmission system other than the acoustic input-output device 100. The signal containing the sound information may include an electrical signal, an optical signal, a magnetic signal, a mechanical signal, or the like, or any combination thereof. The signal containing sound information may come from one or multiple signal sources. The multiple signal sources may be correlated or uncorrelated. In some embodiments, the acoustic input-output device 100 may obtain the signal containing sound information through different ways, and the ways may be wired or wireless, real-time or time-delayed. For example, the acoustic input-output device 100 may receive an electrical signal containing sound information through a wired way or a wireless way or may obtain data directly from a storage medium to generate the sound signal. As another example, the acoustic input-output device 100 may include a component (e.g., an air conduction microphone assembly) with a sound collection capability that picks up the sound in the environment, converts the mechanical vibration of the sound into the electrical signal, which is processed by an amplifier to obtain an electrical signal that meets specific requirements. In some embodiments, the wired connection may include one or a combination of a metallic cable, an optical cable, or a hybrid metallic and optical cable, such as, for example, a coaxial cable, a communication cable, a flexible cable, a spiral cable, a non-metallic sheathed cable, a metal sheathed cable, a multi-core cable, a twisted pair cable, a ribbon cable, a shielded cable, a telecommunication cable, a double stranded cable, a parallel two-core conductor, a twisted pair cable, etc. The examples described above are for convenience of illustration only. The medium of the wired connection may also be of other types, for example, other carriers for the transmission of the electrical or the optical signals, etc.
The wireless connection may include a radio communication, a free-space optical communication, an acoustic communication, and an electromagnetic induction, etc. The radio communication may include IEEE 802.11 series of standards, IEEE 802.15 series of standards (e.g., Bluetooth technology and cellular technology, etc.), a first generation mobile communication technology, a second generation mobile communication technology (e.g., FDMA, TDMA, SDMA, CDMA, and SSMA, etc.), a general packet radio service technology, a third generation mobile communication technology (e.g. CDMA2000, WCDMA, TD-SCDMA, and WiMAX, etc.), a fourth generation mobile communication technology (e.g., TD-LTE and FDD-LTE, etc.), a satellite communication (e.g., GPS technology, etc.), a near field communication (NFC), and other technologies operating in the ISM band (e.g., 2.4 GHz, etc.); the free-space optical communication may include a visible light, an infrared signal, etc.; the acoustic communication may include an acoustic wave, an ultrasonic signal, etc.; the electromagnetic induction may include a near-field communication technology, etc. The examples described above are for convenience of illustration only, and the medium for the wireless connection may also be of other types, e.g., a Z-wave technology, other tolled civilian radio bands, and military radio bands, etc. For example, as some application scenarios of the present disclosure, the bone conduction speaker 100 may obtain the signal containing the sound information from other devices via a Bluetooth™ technology.
The microphone assembly 120 may be configured to pick up the sound signal (also be referred to as the voice signal) and convert the sound signal into the signal containing the sound information (e.g., the electrical signal). For example, the microphone assembly 120 picks up the mechanical vibration of a voice signal source that is generated when the voice signal source provides the voice signal and converts the mechanical vibration into the electrical signal. For ease of description, the mechanical vibration generated when the user provides the voice signal may be referred to as the second mechanical vibration. In some embodiments, the microphone assembly 120 may include one or more microphones. In some embodiments, the microphones may be classified into bone conduction microphones and/or air conduction microphones based on their working principles. For ease of description, in one or more embodiments of the present disclosure, the bone conduction microphone will be used as an example for illustration. It is noted that the bone conduction microphone in the one or more embodiments of the present disclosure may be replaced with the air conduction microphone.
The bone conduction microphone may be configured to collect any mechanical vibration (e.g., the first mechanical vibration and the second mechanical vibration) that is conducted by the user's bones, skin, and other tissues and can be precepted by the bone conduction microphone. The collected mechanical vibration causes internal components (e.g., a diaphragm) of the bone conduction microphone 120 to generate the corresponding mechanical vibration (e.g., a third mechanical vibration and a fourth mechanical vibration), and the mechanical vibration is converted into an electrical signal containing sound information (e.g., a first signal and a second signal). The first signal may be understood as an echo signal generated by the bone conduction microphone. The second signal may be understood as a voice signal generated by the bone conduction microphone. The air conduction microphone may collect an air conduction mechanical vibration (i.e., the sound waves) and convert the mechanical vibration into the signal containing sound information (e.g., an electrical signal). For example, if the loudspeaker assembly 110 includes an air conduction loudspeaker, the air conduction microphone may receive the echo signal transmitted (by air conduction) by the air conduction loudspeaker. As another example, if the loudspeaker assembly 110 includes a bone conduction loudspeaker, the air conduction microphone may receive both the mechanical vibrations transmitted by the bone conduction loudspeaker and the echo signal transmitted by the bone conduction loudspeaker through air conduction. In some embodiments, the microphone assembly 120 may include the microphone diaphragm and other electronic components. After being transmitted to the microphone diaphragm, the mechanical vibration of the voice signal source may cause the corresponding mechanical vibration of the microphone diaphragm, and the electronic components may convert the mechanical vibration into the signal containing sound information (e.g., the electrical signal). In some embodiments, the microphone assembly 120 may include, but is not limited to, a ribbon microphone, a microelectromechanical systems (MEMS) microphone, a dynamic microphone, a piezoelectric microphone, a condenser microphone, a carbon microphone, an analog microphone, a digital microphone, or the like, or any combination thereof. For example, the bone conduction microphone may include an omnidirectional microphone, a unidirectional microphone, a bidirectional microphone, a cardioid microphone, or the like, or any combination thereof.
In some embodiments, when the loudspeaker assembly 110 and the microphone assembly 120 operate simultaneously, the microphone assembly 120 may percept the first mechanical vibration generated by the loudspeaker assembly 110 and the second mechanical vibration that is generated by the voice signal source. In response to the first mechanical vibration, the microphone assembly 120 may generate the third mechanical vibration and convert the third mechanical vibration into the first signal. In response to the second mechanical vibration, the microphone assembly 120 may generate the fourth mechanical vibration and convert the fourth mechanical vibration into the second signal. In some embodiments, the loudspeaker assembly 110 may be referred to as an echo signal source. In some embodiments, when the loudspeaker assembly 110 and the microphone assembly 120 operate simultaneously, in a specific frequency range, a ratio of an intensity of the first mechanical vibration to an intensity of the first signal is greater than a ratio of an intensity of the second mechanical vibration to an intensity of the second signal. The frequency range may include 200 Hz to 10 kHz, 200 Hz to 5000 Hz, 200 Hz to 2000 Hz, or 200 Hz to 1000 Hz, etc.
The fixing assembly 130 may support the loudspeaker assembly 110 and the microphone assembly 120. In some embodiments, the fixing assembly 130 may include an arc-shaped elastic member capable of forming a force of rebounding toward the middle of the arc so as to be in stable contact with the human skull. In some embodiments, the fixing assembly 130 may include one or more connectors. The one or more connectors may connect the loudspeaker assembly 110 and/or the microphone assembly 120. In some embodiments, the fixing assembly 130 may be worn binaurally. For example, both ends of the fixing assembly 130 may be fixedly connected to two sets of the loudspeaker assemblies 110, respectively. When the user wears the acoustic input-output device 100, the fixing assembly 130 may hold the two sets of the loudspeaker assemblies 110 near the user's left and right ears, respectively. In some embodiments, the fixing assembly 130 may be worn monaurally. For example, the fixing assembly 130 may be fixedly connected to only one set of the loudspeaker assemblies 110. When the user wears the acoustic input-output device 100, the fixing assembly 130 may hold the loudspeaker assembly 110 near the user's ear on one side. In some embodiments, the fixing assembly 130 may be glasses (e.g., sunglasses, augmented reality glasses, virtual reality glasses), a helmet, a hairband, or the like, or any combination thereof, which is not limited herein.
The above description of the structure of the acoustic input-output device is merely a specific example and should not be considered as the only feasible embodiment. Obviously, for those skilled in the art, after understanding the basic principle of the acoustic input-output device 100, various amendments and variations in form and detail may be made to the specific manner and steps for implementing the acoustic input-output device 100 without departing from the principle, but these amendments and variations remain within the scope of the above description. For example, the acoustic input-output device 100 may include one or more processors, and the processors may perform one or more sound signal processing algorithms. The sound signal processing algorithms may correct or enhance the sound signal. For example, the sound signal is subjected to a noise reduction, an acoustic feedback suppression, a wide dynamic range compression, an automatic gain control, an active ambient recognition, ab active anti-noise, a directional processing, a tinnitus processing, a multi-channel wide dynamic range compression, an active whistle suppression, a volume control, or other similar, or any combination thereof, and these amendments and variations remain within the scope of protection of the claims of the present disclosure. As another example, the acoustic input-output device 100 may include one or more sensors, such as a temperature sensor, a humidity sensor, a speed sensor, a displacement sensor, etc. The sensor may collect user information or environmental information.
In some embodiments, the ear-hook 231 may include an elastic metal wire. The elastic metal wire may be configured to keep the ear-hook 231 in a shape that matches the user's ear, and the elastic metal wire has a certain degree of elasticity, so that when the user wears the ear-clip headset, the elastic metal wire may be elastically deformed according to the user's ear shape and head shape to adapt to users with different ear shapes and head shapes. In some embodiments, the elastic metal wire may be made of memory alloy with good deformation recovery ability. Even if the ear-hook 231 is deformed by an external force, when the external force is removed, the ear-hook 231 may return to its original shape, thereby extending a service life of the ear-clip headset. In some embodiments, the elastic metal wire may be made of non-memory alloy. Wires may be provided in the elastic metal wire to establish electrical connections between the headset core 210 and other components (e.g., the control circuitry 260, the battery 270, etc.), so as to provide power and data transmission for the headset core 210. In some embodiments, the ear-hook 231 may include a protective sleeve 236 and a housing protector 237 integrally formed with the protective sleeve 236.
In some embodiments, the headset housing 232 may be configured to accommodate the headset core 210. The headset core 210 may include one or more loudspeaker assemblies and/or one or more microphone assemblies. The one or more loudspeaker assemblies may include a bone conduction loudspeaker assembly, an air conduction loudspeaker assembly, etc. The one or more microphone assemblies may include a bone conduction microphone assembly, an air conduction microphone assembly, etc. The structure and arrangement of the one or more loudspeaker assemblies and the one or more microphone assemblies may be found elsewhere in the present disclosure, e.g.,
In some embodiments, the ear-hook 231 and the headset housing 232 may be molded separately and further assembled together, rather than directly molding the two together.
In some embodiments, the headset housing 232 may be provided with a contact surface 2321. The contact surface 2321 may be in contact with the skin of the user. When the ear-clip headset is used, the sound waves generated by one or more bone conduction loudspeakers of the headset core 210 may be transmitted outside (e.g., the eardrum of the user) the headset housing 232 through the contact surface 2321. In some embodiments, a material and thickness of the contact surface 2321 may affect the transmission of the bone conduction sound waves to the user, thereby affecting the sound quality. For example, if an elasticity of the material of the contact surface 2321 is relatively large, the transmission of bone conduction sound waves in a low frequency range may be better than the transmission of bone conduction sound waves in a high frequency range. If the elasticity of the material of the contact surface 2321 is relatively small, the transmission of the bone conduction sound waves in the high frequency range may be better than the transmission of bone conduction sound waves in the low frequency range. It should be noted that the headset housing 232 in the embodiments and the housing in other embodiments of the present disclosure refer to the component of the acoustic input-output device 200 in contact with the user.
The loudspeaker assembly 310 may transmit the sound waves by generating the first mechanical vibration to cause the user to hear the sound. A sound wave conduction way of the loudspeaker assembly 310 includes air conduction and bone conduction. The air conduction corresponds to the air conduction loudspeaker assembly. The air conduction loudspeaker assembly transmits the sound waves in a form of waves through the air, and the sound waves are transmitted to the auditory nerve through the user's eardrum-auditory ossicles-cochlea, so that the user can hear the sound. The bone conduction corresponds to the bone conduction loudspeaker assembly. The bone conduction loudspeaker assembly transmits the mechanical vibration to the skins and bones of the user's face 340 by contacting the user's face 340 (for example, the housing 350 of the bone conduction loudspeaker assembly is in contact with the user's face 340), and further transmits the mechanical vibration to the auditory nerve through the skins and bones, so that the user can hear the sound. Whether the bone conduction loudspeaker assembly or the air conduction loudspeaker assembly, the bone conduction microphone 320 is directly or indirectly connected to the loudspeaker assembly 310. Specifically, when the loudspeaker assembly 310 is a bone conduction loudspeaker assembly, the housing 350 is one of the vibration transmission components of the bone conduction loudspeaker assembly, the vibration component in the bone conduction loudspeaker assembly needs to be directly or indirectly connected to the housing 350 to transmit the vibration to the skin and the bone of the user. The bone conduction microphone 320 needs to be directly or indirectly connected to the housing 350 to collect the vibration that is generated when the user is speaking. The sound waves transmitted by the bone conduction loudspeaker cause the mechanical vibration of the housing 350, the housing 350 transmits the mechanical vibration to the bone conduction microphone 320. After receiving the mechanical vibration, the bone conduction microphone 320 generates the corresponding third mechanical vibration and generates the first signal containing sound information based on the third mechanical vibration. When the loudspeaker assembly 310 is an air conduction loudspeaker assembly, the housing 350 is configured to accommodate the air conduction loudspeaker assembly and the bone conduction microphone 320, which is equivalent to the housing of the acoustic input-output device 300, and the vibration component in the air conduction loudspeaker assembly may be directly or indirectly connected to the housing 350 to fix the air conduction loudspeaker assembly. In summary, the bone conduction microphone 320 needs to be directly or indirectly connected to the housing 350 to collect the vibration that is generated when the user is speaking. The sound waves transmitted by the air conduction loudspeaker cause the mechanical vibration of the housing 350, the housing 350 transmits the mechanical vibration to the bone conduction microphone 320. After receiving the mechanical vibration, The bone conduction microphone 320 generates the corresponding third mechanical vibration and generates the first signal containing sound information based on the third mechanical vibration.
Therefore, at least a portion of the first mechanical vibration that is generated by the loudspeaker assembly 310 is transmitted to the bone conduction microphone 320 to cause the bone conduction microphone 320 to generate the third mechanical vibration. In addition to the first mechanical vibration transmitted by the loudspeaker assembly 310, the bone conduction microphone 320 may be in contact with the skin of the user's face 340 to receive the second mechanical vibration (e.g., the vibration of the skin and bone) that is generated when the user speaks to cause the bone conduction microphone 320 to generate the fourth mechanical vibration.
When the bone conduction microphone 320 and the loudspeaker assembly 310 operate simultaneously, for example, the bone conduction microphone 320 receives a voice signal (e.g., by picking up the vibration of the skin and other positions when the user speaks) while the loudspeaker assembly 310 transmits a voice signal (e.g., music) through vibration, the bone conduction microphone 320 receives both the first mechanical vibration and the second mechanical vibration. A microphone diaphragm of the bone conduction microphone 320 (not shown) generates the third mechanical vibration and the fourth mechanical vibration corresponding to the first mechanical vibration and the second mechanical vibration, respectively, and converts the third mechanical vibration and the fourth mechanical vibration into the first signal and the second signal, respectively. When the microphone diaphragm generates the third mechanical vibration in response to the picked up first mechanical vibration, the bone conduction microphone 320 receives the voice information transmitted by the first mechanical vibration other than the voice information transmitted by the second mechanical vibration, thereby affecting the quality of the sound signal picked up by the microphone. For ease of description, the signal transmitted by the first mechanical vibration may be referred to as an echo signal (or a sub-voice signal), and the component (e.g., the loudspeaker assembly 310, the housing 350) that generates and transmits the first mechanical vibration may be referred to as an echo signal source (or a sub-voice signal source). The second mechanical vibration may be referred to as a voice signal (or a primary voice signal), and the component (e.g., the user's vocal cord, nasal cavity, mouth, etc.) that generates and transmits the second mechanical vibration may be referred to as a voice signal source (or a primary voice signal source). Vibration directions of the voice signal source, the echo signal source, and the bone conduction microphone are illustrated in
For the above reasons, an intensity of the echo signal (i.e., the intensity of the first signal) generated by the bone conduction microphone 320 may be reduced by performing some design on the acoustic input-output device 300. Further, while reducing the intensity of the echo signal generated by the bone conduction microphone 320, the intensity of the voice signal generated by the bone conduction microphone 320 (i.e., the intensity of the second signal) may be increased to achieve a purpose of reducing the intensity of the first signal and increasing the intensity of the second signal, such that the ratio of the intensity of the first mechanical vibration to the intensity of the first signal is greater than the ratio of the intensity of the second mechanical vibration to the intensity of the second signal, thereby improving the quality of the sound signal generated by the bone conduction microphone.
According to
where L1 refers to the intensity of the second mechanical vibration received by the bone conduction microphone 320, L2 refers to the intensity of the received first mechanical vibration, and m refers to the mass of the bone conduction microphone 320. ω refers to an angle frequency of the voice signal and/or the echo signal.
may indicate an effect of L1 (i.e., the second mechanical vibration) on L.
may indicate an effect of L2 (i.e., the first mechanical vibration) on L.
Therefore, the greater the elasticity coefficient k1 of the first elastic connection 370, the greater the effect of the intensity L1 of the vibration of the voice signal source on the intensity L of the mechanical vibration received by the bone conduction microphone 320. The smaller the elasticity coefficient k2 of the second elastic connection 390, the smaller the elasticity coefficient of the second elastic connection 390, the smaller the effect of the intensity L2 of the mechanical vibration of the echo signal source on the intensity L of the mechanical vibration received by the bone conduction microphone 320, the smaller the echo signal received by the bone conduction microphone 320.
According to equation (1), it may be learned that to reduce the echo signal received by the bone conduction microphone 320, the acoustic input-output device may be designed in a plurality of ways, for example, by increasing as much as possible L1 and/or k1, and minimizing L2 and/or k2 to increase the effect of L1 on L and reduce the effect of L2 on L, thereby improving the quality of the sound signal generated by the bone conduction microphone.
According to the above principle, in some embodiments, an installation location of the bone conduction microphone 520 may be designed, such that the vibration direction of the bone conduction microphone 520 and a vibration direction (i.e., the direction of the first mechanical vibration) of an echo signal source 580 (e.g., the loudspeaker assembly 310 shown in
In some embodiments, according to equation (1), it may be known that the greater the effect of an intensity L1 of the vibration of a voice signal source 660 on an intensity L of the mechanical vibration received by the bone conduction microphone 620, i.e., the greater the intensity of the vibration of the voice signal source 660 received by the bone conduction microphone 620, which is equivalent to reducing an effect of an intensity L2 of the vibration of the voice signal source 680 on a voice signal L generated by the bone conduction microphone 620. In some embodiments, to increase the effect of the intensity L1 of the mechanical vibration of the voice signal source 660 on the voice signal L generated by the bone conduction microphone 620, an angle between a vibration direction of the bone conduction microphone 620 and a vibration direction of the voice signal source 660 may be designed to be within a specific range. An angle between the vibration direction of the bone conduction microphone 620 and the vibration direction of the voice signal source 660 may be a second angle β. In some embodiments, the second angle β may be within an angle range of 0 degrees to 85 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 75 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 60 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 45 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 30 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 15 degrees. In some embodiments, the second angle β may be within an angle range of 0 degrees to 5 degrees. In some embodiments, the second angle β may be 0 degrees, i.e., the vibration direction of the bone conduction microphone 620 is parallel to the vibration direction of the voice signal source 660. In the embodiments, within the range of 0 degrees to 90 degrees, the smaller the second angle β, the closer the vibration direction of the microphone diaphragm is to be parallel to the vibration direction of the voice signal source 660, and the greater the intensity of the second signal converted by the microphone diaphragm. When the second angle β is 0 degrees, the intensity of the first signal converted by the microphone diaphragm is the greatest, and at this time the intensity of the second signal generated by the bone conduction microphone 620 is the greatest, i.e., the intensity of the generated voice signal is the greatest. As used herein, an angle between two directions refers to a smallest positive angle formed by the intersection of straight lines where the two directions are located.
It is noted that a scheme of controlling the first angle α in a set angle range and a scheme of controlling the second angle β in a set angle range may be combined. In some embodiments, the first angle α may be set to 90 degrees, and the second angle β may be set to 30 degrees. In some embodiments, the first angle α may be set to 90 degrees and the second angle β to 45 degrees. In some embodiments, the first angle α may be set to 90 degrees, and the second angle β may be set to 60 degrees. In some embodiments, the first angle α may be set to 45 degrees, and the second angle β may be set to 30 degrees. In some embodiments, the first angle α may be set to 90 degrees, and the second angle β is set to 15 degrees. As shown in
For example,
In some embodiments, by designing the locations of the bone conduction microphone 620 and the echo signal source 680 (e.g., the loudspeaker assembly 310), the intensity reduction of the second signal is significantly less than the intensity reduction of the first signal, accordingly, a ratio of the intensity of the second signal to the intensity of the first signal to be greater than a threshold, thereby increasing a proportion of the voice signal in the sound signal generated by the bone conduction microphone and making the voice signal cleaner and the experience of the user better. In some embodiments, the ratio of the intensity of the second signal to the intensity of the first signal may be greater than ¼. In some embodiments, the ratio of the intensity of the second signal to the intensity of the first signal may be greater than ⅓. In some embodiments, the ratio of the intensity of the second signal to the intensity of the first signal may be greater than ½. In some embodiments, the ratio of the intensity of the second signal to the intensity of the first signal may be greater than ⅔.
It should be noted that the scheme, in one or more above-mentioned embodiments, of increasing the intensity of the voice signal received by the microphone assembly (e.g., the microphone assembly 320 shown in
In some embodiments, the single-axis bone conduction microphone is merely an example, and the bone conduction microphone (e.g., the bone conduction microphone 320 shown in
Continuously, according to
x(t)=e(t)cos(α€)+s(t)cos(β(s)) (2).
A vibration component of the echo signal source 380 and the voice signal source 360 on the Y-axis of the bone conduction microphone is:
y(t)=e(t)sin(α(e))+s(t)sin(β(s)) (3).
The echo signal of the bone conduction microphone 320 may be eliminated by weighting the vibration component x(t) of the echo source 380 and the voice source 360 on the X-axis of the bone conduction microphone and the vibration component y(t) of the echo signal source 380 and the voice signal source 360 on the Y-axis of the bone conduction microphone, and accordingly, a total sound signal of the bone conduction microphone 320 is:
out(t)=x(t)sin(α(e))−y(t)cos(α(e))=s(t)sin(α(e)−β(s)) (4),
where a weighting factor corresponding to the vibration component x(t) of the echo signal source 380 and the voice signal source 360 on the X-axis of the bone conduction microphone is sin(α(e)), and a weighting factor corresponding to the vibration component y(t) of the echo source 380 and the voice source 360 on the Y-axis of the bone conduction microphone is −cos(α(e)). In some embodiments, the angle α(e) between the echo signal source 380 and the X-axis of the bone conduction microphone may be obtained when the acoustic input-output device is installed. In some embodiments, a(e) may be obtained through the following operations including determining whether a current signal of the bone conduction microphone 320 has a voice signal s(t); when the current signal does not have the voice signal s(t), the α(e) may be obtained through the following equations (5)-(7).
x(t)=e(t)cos(α(e)) (5),
y(t)=e(t)sin(α(e)) (6).
According to equation (5) and equation (6), the following equation (7) is obtained.
In some embodiments, the x(t) and y(t) may be weighted, and then α(e) may be obtained based on weighted x(t) and y(t) according to the equation (7). In some embodiments, when α(e) is obtained according to equation (9), a more stable α(e) estimate may be obtained by smoothing α(e) in time.
In some embodiments, the bone conduction microphone 320 may be a three-axis microphone. For example, the microphone may have the X-axis, the Y-axis, and a Z-axis, and the sound signal generated by the three-axis microphone may be obtained by weighting vibration components of the voice signal s(t) and the echo signal e(t) on the X, Y, and Z axes of the bone conduction microphone. The principle of obtaining the sound signal generated by the three-axis microphone is similar to the principle of obtaining the sound signal generated by the two-axis microphone, which is not repeated herein.
In some embodiments, the vibration direction of the echo signal source 380 may not be a single direction, for example, the vibration direction of the echo signal source 380 may be diffused along an arc track. In such cases, the vibration in the vibrations generated by the echo signal source 380 that is not perpendicular to the vibration direction of the bone conduction microphone 320 may be received by the bone conduction microphone 320 and converted into the first signal, i.e., an echo signal is generated. Therefore, in some embodiments, the loudspeaker assembly 310 and the bone conduction microphone 320 may be designed, such that a relative location between the bone conduction microphone 320 and the loudspeaker assembly 310 (e.g., the housing 350) is fixed to reduce the vibration transmitted by the echo signal source 380 received by the bone conduction microphone 320.
In some embodiments, in addition to designing the first angle α and the second angle β, a purpose of reducing the echo may be achieved by changing the elasticity coefficient k1 of the first elastic connection 370 and the elastic coefficient k2 of the second elastic connection 390.
In some embodiments, the intensity of the first mechanical vibration received by the bone conduction microphone 320 may be reduced by reducing the elasticity coefficient k2 of the second elastic connection 390 between the bone conduction microphone 320 and the echo signal source 380.
The damping structure 1100 may refer to a structure with certain elasticity, and an intensity of the mechanical vibration transmitted by the echo signal source 1010 is reduced through the elasticity of the damping structure. In some embodiments, the damping structure 1100 may be an elastic component to reduce the intensity of the transmitted mechanical vibration. The elasticity of the damping structure 1100 may be determined by various aspects such as a material of the damping structure, a thickness of the damping structure, a structure of the damping structure, or the like.
In some embodiments, the damping structure 1100 may be made of a damping material with an elastic modulus less than a first threshold. In some embodiments, the first threshold may be 5000 MPa. In some embodiments, the first threshold may be 4000 MPa. In some embodiments, the first threshold may be 3000 MPa. In some embodiments, the elastic modulus of the damping material may be within a range of 0.01 MPa to 1000 MPa. In some embodiments, the elastic modulus of the damping material may be within a range of 0.015 MPa to 2500 MPa. In some embodiments, an elastic modulus of the damping material may be within a range of 0.02 MPa to 2000 MPa. In some embodiments, the elastic modulus of the damping material may be within a range of 0.025 MPa to 1500 MPa. In some embodiments, an elastic modulus of the damping material may be within a range of 0.03 MPa to 1000 MPa. In some embodiments, the damping material may include, but is not limited to, foam, plastic (e.g., but not limited to, polymer polyethylene, blow-molded nylon, engineering plastic, etc.), rubber, silicone, etc. In some embodiments, the damping material may be the foam.
In some embodiments, the damping structure 1100 may have a certain thickness. Referring to
In some embodiments, the elasticity of the damping structure 1100 may be provided by its structural design. For example, the damping structure 1100 may be an elastic structure, even if the material used to make the damping structure 1100 has high stiffness, its structure can provide elasticity. In some embodiments, the damping structure 1100 may include, but is not limited to, a spring-like structure, a ring or ring-like structure, etc.
In some embodiments, a surface of the bone conduction microphone 1020 may include a first portion 1021 and a second portion 1022. The first portion 1021 may be in contact with the user's face 1040 to conduct the second mechanical vibration provided by the voice signal source, the second portion 1022 may be connected to other components (e.g., the loudspeaker assembly 1010) of the acoustic input-output device 1000. The second portion 1022 may be provided with the damping structure 1100. The second portion 1022 is connected to the loudspeaker assembly 1010 through the damping structure 1100. In the embodiments, the damping structure 1100 arranged between the loudspeaker assembly 1010 and the bone conduction microphone 1020 has a certain elasticity, which may reduce the first mechanical vibration transmitted by the loudspeaker assembly 1010 and reduce the intensity of the first mechanical vibration received by the bone conduction microphone 1020, which makes the echo signal generated by the bone conduction microphone 1020 to be small. Further, since the first portion 1021 of the surface of the bone conduction microphone 1020 is in contact with the user's face 1040 to transmit the second mechanical vibration, the damping structure 1100 is not arranged in the first portion 1021. For example, the first portion 1021 may be close to one side of the microphone diaphragm. Since the second mechanical vibration represents the voice signal provided by the voice signal source, it should be ensured the second mechanical vibration is not weakened as much as possible. Specifically, as shown in
In some embodiments, the damping structure 1100 may be connected to the second portion 1022 of the surface of the bone conduction microphone by adhesive. In some embodiments, the damping structure 1100 may be connected to the bone conduction microphone 1020 by welding, clamping, riveting, a threaded connection (e.g., by screws, screws rods, bolts, and other components), a clamp connection, a pin connection, a wedge key connection, and integral molding.
In some embodiments, the first portion 1021 of the surface of the bone conduction microphone 1020 may be provided with a vibration transmission layer 1023. The stiffness of the bone conduction microphone 1020 is relatively great, if the first portion 1021 is in direct contact with the user's face 1040, the user may feel uncomfortable, which reduces the user experience. After the first portion 1021 is provided with vibration transmission layer 1023, when the user wears the bone conduction microphone 1020, the tactile feeling may be better, which may effectively improve the user experience.
In some embodiments, the vibration transmission layer 1023 needs to maintain a certain elasticity, which reduces a loss of the second mechanical vibration during conduction, and ensures that the tactile feeling when the user wears the acoustic input-output device 1000. In some embodiments, the elastic modulus of the material of the vibration transmission layer 1023 is too small, which means that the elasticity of the material of the vibration transmission layer 1023 is relatively small, which weakens the intensity of the second mechanical vibration. Therefore, in some embodiments, the elastic modulus of the material of the vibration transmission layer 1023 may be greater than a second threshold. In some embodiments, the second threshold may be 0.01 Mpa. In some embodiments, the second threshold may be 0.015 Mpa. In some embodiments, the second threshold may be 0.02 Mpa. In some embodiments, the second threshold may be 0.025 Mpa. In some embodiments, the second threshold may be 0.03 Mpa. In some embodiments, the elastic modulus of the vibration transmission layer 1023 may be within a range of 0.03 MPa to 3000 MPa. In some embodiments, the elastic modulus of the vibration transmission layer 1023 may be within a range of 5 MPa to 2000 MPa. In some embodiments, the elastic modulus of the vibration transmission layer 1023 may be within a range of 10 MPa to 1500 MPa. In some embodiments, the elastic modulus of the vibration transmission layer 1023 may be within a range of 10 MPa to 1000 MPa. In some embodiments, the material of the vibration transmission layer 1023 may be silicone (the elastic modulus of the silicone is 10 MPa), rubber, or plastic (the elastic modulus of the plastic is 1000 MPa).
In some embodiments, the loss of the second mechanical vibration during conduction may be reduced by reducing the thickness of the vibration transmission layer 1023. When the thickness of the vibration transmission layer 1023 is relatively thin, even if the elastic modulus of the material of the vibration transmission layer 1023 is relatively small, the intensity of the second mechanical vibration is not greatly lost. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 30 mm. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 25 mm. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 20 mm. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 15 mm. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 10 mm. In some embodiments, the thickness of the vibration transmission layer 1023 may be less than 5 mm. In some embodiments, the rubber or silicone with a thickness of 5 mm may be used to produce the vibration transmission layer 1023 to ensure a good tactile feeling and the intensity of the second mechanical vibration received by the bone conduction microphone 1020.
It is noted that the above-mentioned embodiments of the acoustic input-output device 1000 are applicable to both the bone conduction loudspeaker assembly and the air conduction loudspeaker assembly. For example, when the acoustic input-output device 1000 is a bone conduction loudspeaker assembly, a housing 1050 may be a portion of the bone conduction loudspeaker assembly, and the bone conduction microphone 1020 may be connected to the housing of the bone conduction loudspeaker assembly through the damping structure 1010. When the acoustic input-output device 1000 is an air conduction loudspeaker assembly, the air conduction loudspeaker assembly and the bone conduction microphone 1020 may both be connected to the housing (e.g., a diaphragm is connected to the housing and the bone conduction microphone 1020 is connected to the housing), and the damping structure is provided between the bone conduction microphone 1020 and the housing.
In some embodiments, the intensity of the second mechanical vibration received by the bone conduction microphone may be increased by increasing a clamping force formed between the acoustic input-output device 1000 and a contact portion of the user. It is understood that the closer the contact between the acoustic input-output device 1000 and the contact portion (e.g., the user's face 1040) of the user, the less the loss of the second mechanical vibration during transmission, but if the clamping force formed between the acoustic input-output device 1000 and the contact portion of the user is great, the user will feel pain and have a poorer service experience. Therefore, the clamping force needs to be controlled within a specific range. In some embodiments, when the loudspeaker assembly 1010 is the air conduction loudspeaker assembly, i.e., the acoustic input-output device 1000 transmits the sound signal to the user through the air conduction loudspeaker assembly and receives the voice signal of the user through the bone conduction microphone 1020, the clamping force may be within a range of 0.001 N to 0.3N. In some embodiments, the clamping force may be within a range of 0.0025N to 0.25N. In some embodiments, the clamping force may be within a range of 0.005N to 0.15N. In some embodiments, the clamping force may be within a range of 0.0075N to 0.1 N. In some embodiments, the clamping force may be within a range of 0.01N to 0.05N. In some embodiments, because the bone conduction loudspeaker assembly transmits the mechanical vibration generated by the vibration component to the user's face through the housing to make the user hear the sound, the clamping force is different when the loudspeaker assembly 1010 is the bone conduction loudspeaker assembly. For example, when the loudspeaker assembly 1010 of the acoustic input-output device 1000 includes the bone conduction loudspeaker assembly, if the clamping force is too small, the intensity of the mechanical vibration transmitted to the user by the bone conduction loudspeaker assembly will also be too small, i.e., a volume of the sound transmitted to the user through the acoustic input-output device 1000 is small. Therefore, to ensure the intensity of the mechanical vibration received by the user, in some embodiments, when the loudspeaker assembly 1010 of the acoustic input-output device 1000 includes the bone conduction loudspeaker assembly, the clamping force needs to be set within a specific range. In some embodiments, the clamping force may be within a range of 0.01N to 2.5N. In some embodiments, the clamping force may be within a range of 0.025N to 2N. In some embodiments, the clamping force may be within a range of 0.05N to 1.5N. In some embodiments, the clamping force may be within a range of 0.075N to 1N. In some embodiments, the clamping force may be within a range of 0.1 N to 0.5N.
In some embodiments, the loudspeaker assembly 1010 may be directly connected to the bone conduction microphone 1020, for example, the bone conduction microphone 1020 is directly connected to and accommodated within the housing 1050 of the loudspeaker assembly 1010 (the housing of the bone conduction loudspeaker assembly). In some embodiments, the bone conduction microphone may be indirectly connected to the loudspeaker assembly.
In some embodiments, the bone conduction microphone 1220 may be arranged at any location of an inner wall of the housing 1250. For example, as shown in
When the air conduction loudspeaker assembly operates, the diaphragm 1313 generates a mechanical vibration. Since the diaphragm 1313 and the housing 1350 are directly connected (as shown in
In some embodiments, the arrangement manner of the first damping structure may be the same or similar to the arrangement manner the damping structure 1100 in the above embodiments. For example, the first damping structure may be made of the same thickness, the same material, and the same structure as the damping structure 1100. In some embodiments, the first damping structure may be different from the damping structure 1100. For example, the first damping structure may be a strip or sheet member with a certain elasticity. Both ends of the strip or sheet member are respectively connected to the diaphragm 1313 and the housing 1350 to reduce the intensity of the mechanical vibration transmitted by the diaphragm 1313 to the housing 1350. The first damping structure may also be a ring member. The middle of the ring member is connected to the diaphragm, and an outer side of the ring member is connected to the housing 1350, which may reduce the intensity of the mechanical vibration transmitted by the diaphragm 1313 to the housing 1350.
With continued reference to
In some embodiments, the bone conduction microphone 1320 and the loudspeaker assembly 1310 may be respectively arranged in different regions of the acoustic input-output device, and then the second damping structure is arranged between the bone conduction microphone 1320 and the housing 1350 of the loudspeaker assembly 1310. In some embodiments, the bone conduction microphone 1320 may be separately arranged in other regions of the acoustic input-output device and then connected to the housing 1350 through the second damping structure. For example, in the embodiments shown in
The above embodiments of the second damping structure are applicable not only to the air conduction loudspeaker assembly, but also to the bone conduction loudspeaker assembly. For example, the loudspeaker assembly in the embodiments shown in
It should be noted that when the bone conduction microphone is arranged inside the housing as shown in
According to
Similarly, if a mass of the housing 1350 is much greater than the mass of the diaphragm 1313, the impact of the mechanical vibration of the diaphragm 1313 on the housing 1350 is relatively small. Therefore, in some embodiments, the intensity of the mechanical vibration of the housing 1350 may be reduced by increasing the mass of the housing 1350. In some embodiments, the mass of the housing 1350 may be controlled to be within a range of 2 g to 20 g. In some embodiments, the mass of the housing 1350 may be controlled to be within a range of 3 g to 15 g. In some embodiments, the mass of the housing 1350 may be controlled to be within a range of 4 g to 10 g. In some embodiments, a ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled, so that the mass of the housing 1350 is much larger than the mass of the diaphragm 1313, thereby reducing the impact of the mechanical vibration of the diaphragm 1313 on the housing 1350. In some embodiments, the ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled to be within a range of 10 to 100. In some embodiments, the ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled to be within a range of 15 to 80. In some embodiments, the ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled to be within a range of 20 to 60. In some embodiments, the ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled to be within a range of 25 to 50. In some embodiments, the ratio of the mass of the housing 1350 to the mass of the diaphragm 1313 may be controlled to be within a range of 30 to 50.
In some embodiments, the housing 1450 may include a first cavity 1455 and a second cavity 1456, and the first diaphragm 1413 and the second diaphragm 1414 may be respectively located in the first cavity 1455 and the second cavity 1456. The housing 1450 may include a first portion corresponding to the first cavity 1455 and a second portion corresponding to the second cavity 1456. A side wall of the first cavity 1455 (i.e., a side wall of the first portion of the housing 1450) may be set with a first sound transmission hole 1451 and a second sound transmission hole 1452. In some embodiments, the first sound transmission hole 1451 and the second sound transmission hole 1452 may be arranged on different side walls of the first portion of the housing 1450. In some embodiments, the first sound transmission hole 1451 and the second sound transmission hole 1452 may be arranged on non-adjacent side walls of the first portion of the housing 1450, i.e., the first sound transmission hole 1451 and the second sound transmission hole 1452 may be arranged on opposite side walls of the first portion of the housing 1450 (as shown in
A side wall of the second cavity 1456 (i.e., a side wall of the second portion of the housing 1450) may be provided with a third sound transmission hole 1453 and a fourth sound transmission hole 1454. In some embodiments, the third sound transmission hole 1453 and the fourth sound transmission hole 1454 may be arranged on different side walls of the second portion of the housing 1450. In some embodiments, the third sound transmission hole 1453 and the fourth sound transmission hole 1454 may be arranged on non-adjacent side walls of the second portion of the housing 1450, i.e., the third sound transmission hole 1453 and the fourth sound transmission hole 1454 may be provided on opposite walls of the second portion of the housing 1450 (as shown in
As shown in
In some embodiments, when the loudspeaker assembly 1410 has multiple diaphragms, a structure of the loudspeaker assembly 1410 may be adjusted to reduce an overall size.
As shown in
In some embodiments, the first air conduction loudspeaker assembly and the second air conduction loudspeaker assembly may be two same loudspeakers. In some embodiments, the first air conduction loudspeaker assembly and the second air conduction loudspeaker assembly may be two different loudspeakers. For example, the acoustic input-output device 1500 includes a first air conduction loudspeaker assembly and a second air conduction loudspeaker assembly. The first air conduction loudspeaker assembly may act as a primary loudspeaker configured to generate the sound signal heard by the user. The second air conduction loudspeaker assembly may act as an auxiliary loudspeaker. The auxiliary loudspeaker produces a force opposite to that of the primary loudspeaker on the housing 1550 by adjusting the intensity of the mechanical vibration of the auxiliary loudspeaker, thereby reducing the intensity of vibration of the housing 1550. In some embodiments, the loudspeaker assembly 1510 may include the primary loudspeaker and an auxiliary device configured to generate the vibration on the housing 1550 in a direction opposite to that of the primary loudspeaker. In some embodiments, the auxiliary device may be a vibration motor that may generate the vibration on the housing 1550 in a direction opposite to that of the primary loudspeaker, thereby reducing the intensity of the vibration of the housing 1550. In some embodiments, the intensity of the mechanical vibration generated by the auxiliary loudspeaker may be adjusted. Specifically, the loudspeaker assembly 1510 may include an auxiliary loudspeaker control device. The auxiliary loudspeaker control device may obtain the intensity and direction of the mechanical vibration of the primary loudspeaker, and adjust the intensity and direction of the mechanical vibration of the auxiliary loudspeaker based on the intensity and direction of the mechanical vibration of the primary loudspeaker. Therefore, a force of the auxiliary loudspeaker on the housing and a force of the primary loudspeaker on the housing 1550 may cancel each other to reduce the vibration of the housing 1550, and accordingly, the vibration transmitted by the housing 1550 to the bone conduction microphone 1520 may be reduced, thereby reducing an intensity of the echo signal generated by the bone conduction microphone (not shown in
It is noted that the embodiments in which the vibration directions of the two diaphragms are set in opposite directions may be combined with one or more of the above-mentioned embodiments. For example, in the embodiment in which the vibration directions of the two diaphragms are set in opposite directions, the second damping structure may be provided both between the first diaphragm (e.g., the first diaphragm 1413) and the housing (e.g., the housing 1450) and between the second diaphragm (e.g., the second diaphragm 1414) and the housing 1450, which reduces the mechanical vibration received by the housing 1450, thereby reducing the intensity of the first mechanical vibration received by the bone conduction microphone.
In some embodiments, the voice signal source may be a vibration part of the user when providing the voice signal. For example, when the user speaks, the intensity of vibration of parts of the user such as the vocal cords, the mouth, the nasal cavity, the larynx is significantly greater than the ears, the eyes of the user, and therefore these parts may act as the voice signal source. In some embodiments, when designing the bone conduction microphone 1920, the bone conduction microphone 1920 may be located close to the mouth, the nasal cavity, or the vocal cords of the user. For example, when the acoustic input-output device 1900 is glasses shown in
In some embodiments, the bone conduction microphone 1720 or the loudspeaker assembly 1710 may not be arranged in the earmuff 1731. For example, the bone conduction microphone may be arranged at a position D on the headband shown in
It should be noted that the glasses described in the above embodiments may be various types of glasses, for example, sunglasses, glasses for near-sightedness, and glasses for long-sightedness. In some embodiments, the glasses may o be glasses with a virtual reality (VR) function or an augmented reality (AR) function.
The potential beneficial effects of the embodiments of the present disclosure may include, but are not limited to: (1) The first angle formed between the vibration direction of the bone conduction microphone and the vibration direction of the echo signal source is set within a specific angle range, thereby reducing the intensity of the vibration of the echo signal source received by the bone conduction microphone and the intensity of the generated echo signal (i.e., the first signal). (2) The second angle formed between the vibration direction of the bone conduction microphone and the vibration direction of the voice signal source is set within a specific angle range, thereby improving the intensity of the vibration of the voice signal source received by the bone conduction microphone and the intensity of the generated voice signal (i.e. the second signal). (3) The clamping force formed between the acoustic input-output device and the contact portion of the user is controlled within a specific range, so that the bone conduction microphone is in closer contact with the user, and the intensity of the vibration (i.e. the fourth mechanical vibration) of the voice signal source received by the bone conduction microphone is higher. (4) The damping structure is arranged between the bone conduction microphone and the housing of the loudspeaker assembly to reduce the intensity of the vibration (i.e., the third mechanical vibration) of the loudspeaker assembly. (5) The damping structure is arranged between the vibration component of the loudspeaker assembly and the housing to reduce the impact of the vibration of the vibration component on the housing through the damping structure, thereby reducing the intensity of the mechanical vibration generated by the housing, and accordingly reducing the intensity of the vibration of the speaker component received by the bone conduction microphone. (6) The bone conduction microphone is arranged close to the vibration part when the user provides the voice signal, thereby improving the intensity of the vibration of the voice signal source received by the bone conduction microphone. It should be noted that different embodiments may produce different beneficial effects, and in different embodiments, the possible beneficial effects may be any one or a combination of the above beneficial effects, or any other beneficial effects.
Having thus described the basic concepts, it may be rather apparent to those skilled in the art after reading this detailed disclosure that the foregoing detailed disclosure is intended to be presented by way of example only and is not limiting. Various alterations, improvements, and modifications may occur and are intended for those skilled in the art, though not expressly stated herein. These alterations, improvements, and modifications are intended to be suggested by this disclosure, and are within the spirit and scope of the exemplary embodiments of this disclosure.
Moreover, certain terminology has been used to describe embodiments of the present disclosure. For example, the terms “one embodiment,” “an embodiment,” and/or “some embodiments” mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Therefore, it is emphasized and should be appreciated that two or more references to “an embodiment” or “one embodiment” or “an alternative embodiment” in various portions of this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined as suitable in one or more embodiments of the present disclosure.
Furthermore, the recited order of processing elements or sequences, or the use of numbers, letters, or other designations thereof, are not intended to limit the claimed processes and methods to any order except as may be specified in the claims. Although the above disclosure discusses through various examples what is currently considered to be a variety of useful embodiments of the disclosure, it is to be understood that such detail is solely for that purpose, and that the appended claims are not limited to the disclosed embodiments, but, on the contrary, are intended to cover modifications and equivalent arrangements that are within the spirit and scope of the disclosed embodiments. For example, although the implementation of various components described above may be embodied in a hardware device, it may also be implemented as a software-only solution, e.g., an installation on an existing server or mobile device.
Similarly, it should be appreciated that in the foregoing description of embodiments of the present disclosure, various features are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure aiding in the understanding of one or more of the various embodiments. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed subject matter requires more features than are expressly recited in each claim. Rather, claimed subject matter may lie in less than all features of a single foregoing disclosed embodiment.
In some embodiments, the numbers expressing quantities or properties used to describe and claim certain embodiments of the application are to be understood as being modified in some instances by the term “about,” “approximate,” or “substantially.” For example, “about,” “approximate,” or “substantially” may indicate ±20% variation of the value it describes, unless otherwise stated. Accordingly, in some embodiments, the numerical parameters set forth in the written description and attached claims are approximations that may vary depending upon the desired properties sought to be obtained by a particular embodiment. In some embodiments, the numerical parameters should be construed in light of the count of reported significant digits and by applying ordinary rounding techniques. Notwithstanding that the numerical ranges and parameters setting forth the broad scope of some embodiments of the application are approximations, the numerical values set forth in the specific examples are reported as precisely as practicable. In closing, it is to be understood that the embodiments of the application disclosed herein are illustrative of the principles of the embodiments of the application. Other modifications that may be employed may be within the scope of the application. Therefore, by way of example, but not of limitation, alternative configurations of the embodiments of the application may be utilized in accordance with the teachings herein. Accordingly, embodiments of the present application are not limited to that precisely as shown and described.
This specification is a Continuation of International Application No. PCT/CN2021/090298 filed on Apr. 27, 2021, the entire contents of which are hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/090298 | Apr 2021 | US |
Child | 18327873 | US |