This disclosure generally relates to wearable audio devices. More particularly, the disclosure relates to wearable audio devices configured to enhance detection of voice signals in noisy environments.
Wearable audio devices can significantly improve communication between users in noisy environments, e.g., in industrial use applications, open-air environments, or other areas with high levels of background noise. Conventionally, these devices employ a “boom” microphone (e.g., microphone placed on a boom or arm) that is placed next to the user's mouth to aid in voice pickup and noise cancellation. While boom microphones can be useful for communication purposes, these microphones are not practical in all instances. For example, the user must actively position the boom to enhance effectiveness. Additionally, the boom and microphone can reduce the user's field of vision, creating challenges in a dynamic and/or dangerous environment.
All examples and features mentioned below can be combined in any technically possible way.
Various implementations include wearable audio devices. The wearable audio devices are configured to enhance the acoustic response proximate a user, e.g., in the direction of the user's mouth.
In some particular aspects, the wearable audio device includes: a head mount having: a crown portion for resting on a head of a user, and a brim extending from the crown portion in a forward-oriented direction; and a plurality of microphones coupled to the brim of the head mount.
Implementations may include one of the following features, or any combination thereof.
In certain aspects, the wearable audio device further includes: a controller coupled with the plurality of microphones and configured to combine a plurality of signals from the plurality of microphones to provide an output signal having an enhanced acoustic response in a selected direction.
In some implementations, the selected direction is a direction of a mouth of the user.
In certain aspects, the selected direction is a forward-oriented direction.
In particular cases, the wearable audio device further includes a voice activity detection (VAD) system coupled to the head mount and the controller.
In some aspects, the wearable audio device further includes: an additional microphone located proximate a rear of the crown; and an accelerometer located proximate the additional microphone, where the VAD system is configured to use a noise pickup signal from the additional microphone to filter out acoustic noise in a signal from the accelerometer.
In some aspects, the VAD system includes at least one microphone selected from the plurality of microphones coupled to the brim of the head mount.
In certain implementations, the VAD system includes a vibration sensor.
In particular aspects, the wearable audio device further comprises a suspension system coupled with the head mount, where the vibration sensor is mounted to a back strap of the suspension system.
In certain cases, the vibration sensor is mounted to the head mount in a manner configured to detect vibration of the temple of the user, or in a manner configured to detect jaw vibration of the user.
In some implementations, the vibration sensor is mounted to an inside surface of the crown portion.
In particular aspects, the vibration sensor is an accelerometer for detecting vibration of bones of the user.
In certain cases, the wearable audio device further includes a transducer coupled to the head mount and the controller, the transducer configured to provide an audio output.
In some implementations, the transducer is an earbud.
In particular cases, the plurality of microphones comprises at least two microphones.
In certain aspects, each of the plurality of microphones is coupled to a lower surface of the brim.
In some implementations, an upper surface of the brim is shaped to shield the plurality of microphones from wind in the ambient environment.
In particular aspects, the head mount further includes a dome portion extending from the crown portion to cover a top of the head of the user.
In certain implementations, the head mount includes a rigid protective helmet or a hat.
In particular aspects, the brim extends from the crown portion by a distance that locates the plurality of microphones at a relative angle to the mouth of the user such that the plurality of microphones are positioned to enhance an acoustic response from user voice signals.
In certain cases, the plurality of microphones is positioned on the brim to enhance voice detection while ambient sound pressure level (SPL) exceeds approximately 75 decibels (dB).
In particular aspects, the wearable audio device further includes an additional microphone assembly coupled with the head mount, the additional microphone assembly including: an arm in a fixed position relative to the head mount; and at least one additional microphone coupled with the arm.
Two or more features described in this disclosure, including those described in this summary section, may be combined to form implementations not specifically described herein.
The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features, objects and benefits will be apparent from the description and drawings, and from the claims.
It is noted that the drawings of the various implementations are not necessarily to scale. The drawings are intended to depict only typical aspects of the disclosure, and therefore should not be considered as limiting the scope of the implementations. In the drawings, like numbering represents like elements between the drawings.
This disclosure is based, at least in part, on the realization that a wearable audio device with brim-mounted microphones can effectively enhance voice pickup in noisy environments. For example, wearable audio devices disclosed according to implementations can provide a user with an effective, hands-free approach for communicating in noisy environments. The systems disclosed according to various implementations can improve communications in such environments.
Commonly labeled components in the FIGURES are considered to be substantially equivalent components for the purposes of illustration, and redundant discussion of those components is omitted for clarity.
Aspects and implementations disclosed herein may be applicable to a wide variety of speaker systems, including wearable audio devices in various form factors, such as head-worn devices (e.g., helmets, hats, visors, headsets, headphones, eyeglasses), neck-worn speakers, shoulder-worn speakers, body-worn speakers (e.g., watches), etc. Some particular aspects disclosed may be applicable to personal (wearable) audio devices such as head-mounted audio devices, including helmets, hats, visors, eyeglasses, etc. It should be noted that although specific implementations of speaker systems primarily serving the purpose of acoustically outputting audio are presented with some degree of detail, such presentations of specific implementations are intended to facilitate understanding through provision of examples and should not be taken as limiting either the scope of disclosure or the scope of claim coverage.
In the particular example of a head-mounted audio device 10 depicted in
As noted herein, the audio device 10 can also include an additional suspension system for directly coupling the crown 30 to the user's head in some implementations. For example, as depicted in the simplified perspective view of an audio device in
With continuing reference to
In various implementations, the brim 40 has an upper surface 70 and a lower surface 80 opposing the upper surface 70. In a forward-oriented position, the lower surface 80 faces generally downward toward the floor or the user's feet. In various implementations, as shown in
The audio device 10 can also include a transducer 90 (e.g., electroacoustic transducer or bone conduction transducer) for providing an audio output to a user. In certain cases, as depicted in the example in
In certain cases, the audio device 10 also includes electronics 100, which are shown in the example depictions in
In additional implementations, one or more components depicted in the electronics 100 are located in a separate, connected device 115. For example, processing and/or control components can be located in a separate connected device 115 that is in communication with the electronics 100 physically located at the head mount 20. In some cases, the device 115 includes a smart device such as a smart phone, tablet, wearable communication device, controller, etc., that is configured to communicate with one or more electronic components in the audio device 10.
As shown in
Electronics 100 can include other components not specifically depicted herein, such as one or more power sources, motion detection systems (e.g., an inertial measurement unit, or IMU), communications components (e.g., a wireless transceiver (WT)) configured to communicate with one or more other electronic devices connected via one or more wireless networks (e.g., a local WiFi network, Bluetooth/Bluetooth Low Energy connection, or radio frequency (RF) connection), and amplification and signal processing components (e.g., one or more digital signal processors (DSPs)). It is understood that these components or functional equivalents of these components can be connected with, or form part of, the controller 120.
In certain implementations, the electronics 100 can include a voice enhancement system (or voice pick-up system) which may be part of the controller 120 and/or part of any hardware and/or software construct described herein. The voice enhancement system is configured to enhance user voice signals in the presence of noise.
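One common way to combine signals from a plurality of microphones so that the output has an enhanced acoustic response in a selected direction is delay-and-sum beamforming. The following is a minimal sketch only, and is not presented as the technique used by the controller 120 or the voice enhancement system. The function name `delay_and_sum`, the use of integer sample delays, and the equal weighting of channels are assumptions made for illustration.

```python
def delay_and_sum(channels, delays):
    """Average microphone channels after delaying each by a whole number of samples.

    channels: list of equal-length sample lists, one per microphone (hypothetical layout).
    delays:   non-negative integer delay, in samples, applied to each channel so that
              sound arriving from the selected direction lines up across channels.
    """
    if len(channels) != len(delays):
        raise ValueError("need exactly one delay per channel")
    n = len(channels[0])
    out = [0.0] * n
    for samples, delay in zip(channels, delays):
        # Shift the channel later in time by `delay` samples and accumulate.
        for i in range(delay, n):
            out[i] += samples[i - delay]
    # Equal weighting: divide by the number of microphones.
    return [value / len(channels) for value in out]
```

In this sketch, sound from the selected direction adds coherently once the channels are aligned, while sound from other directions adds with mismatched timing and is attenuated on average. A practical system would typically need fractional delays and per-channel weights, which are omitted here.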
In various optional implementations, the audio device 10 further includes a voice activity detection system (or simply, “VAD system”) that is configured to detect voice activity, e.g., from the user of the audio device 10, and indicate a presence of that voice activity for enhancing the acoustic response from the microphones 50. In certain implementations, the VAD system is implemented as hardware and/or software in the electronics 100 (at the head mount 20 and/or at the connected device 115), and in some cases, can execute functions as part of, or in cooperation with, the voice enhancement system. Portions of the VAD system can be located in the controller 120; however, in other implementations, functions of the VAD system can be performed by another hardware and/or software system coupled with the controller 120 or otherwise contained in electronics 100. In particular cases, functions of the VAD system are used in the voice pick-up (enhancement) system that is configured to aid in enhancing the user's voice signals in the presence of noise, e.g., by freezing the adaptation of filter coefficients in an adaptive filter when voice activity is present. Additional details of processes performed by the voice enhancement system and the VAD system are described in co-pending U.S. patent application Ser. No. ______ (“Audio Processing for Wearables in High-Noise Environment”, attorney docket number RS-19-315-US), filed herewith on ______, which is herein incorporated by reference in its entirety.
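The idea of freezing the adaptation of filter coefficients while voice activity is present can be illustrated with a least-mean-squares (LMS) noise canceller. This is a sketch under stated assumptions and is not the processing described in the co-pending application: the choice of LMS, the function name `lms_with_vad_freeze`, and the parameters `num_taps` and `step` are all hypothetical.

```python
def lms_with_vad_freeze(primary, reference, vad_flags, num_taps=4, step=0.05):
    """Subtract an adaptive estimate of the noise from `primary`.

    primary:   samples containing voice plus noise (hypothetical input).
    reference: samples correlated with the noise only (hypothetical input).
    vad_flags: per-sample booleans; True means voice activity is present.
    Returns (output_samples, final_weights).
    """
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # most recent reference samples, newest first
    output = []
    for sample, ref, voice_present in zip(primary, reference, vad_flags):
        history = [ref] + history[:-1]
        noise_estimate = sum(w * h for w, h in zip(weights, history))
        error = sample - noise_estimate
        if not voice_present:
            # Adapt only when no voice is flagged, so the filter learns the
            # noise path and does not try to cancel the user's voice.
            weights = [w + step * error * h for w, h in zip(weights, history)]
        output.append(error)
    return output, weights
```

When every flag is True the weights never change, so the filter applies whatever noise estimate it learned during earlier noise-only intervals. The step size in this sketch is chosen for unit-scale signals and would need tuning for real data.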
In particular cases, the VAD system includes or otherwise utilizes inputs from physical sensors at the audio device 10. For example, in some implementations, the VAD system includes a vibration detection system, for example, at least one vibration sensor 150 located at one or more locations on the audio device 10. In some cases, the vibration sensor 150 includes an accelerometer (e.g., one or more multi-axis accelerometer(s)) or a bone conduction microphone. In some cases, the vibration sensor 150 is mounted to the crown 30 or the suspension system 52 (
In additional cases, the VAD system includes or otherwise receives signals from one or more microphones to validate voice detection. For example, in some cases, the VAD system is configured to use signals detected by one or more microphones 50 to validate voice detection. In these cases, the VAD system includes or is otherwise connected with at least one microphone 50 selected from the plurality of microphones 50 located on the brim 40, or an additional microphone 50A mounted elsewhere on the audio device 10 (e.g., a microphone 50A mounted to an inside surface 160 of the crown 30 or to a back strap of the head mount 20) for validating detected voice activity (e.g., detected via bone conduction at the vibration sensor 150). Several example locations for the additional microphone 50A are depicted in
In various implementations, signals from the vibration sensor 150 and the additional microphone 50A can be used to enhance accuracy of voice detection. That is, in a head-worn system such as the audio device 10, a vibration sensor 150 such as an accelerometer can be located such that it makes contact with the user's head in order to effectively sense bone-conducted vibration from the user's speech. In certain cases, the audio device 10 can further enhance adaptive acoustic response functions using input(s) from one or more additional microphones 50A. That is, the microphone-based voice activity approach described according to various implementations can enhance the robustness of the audio device 10 in situations where reliable contact between the accelerometer and the user's skin is not feasible.
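As a rough illustration of using a microphone signal to validate voice activity detected at a vibration sensor, the sketch below flags voice only when both signals carry enough energy in the same frame. The energy-threshold test, the function names, and the threshold parameters are assumptions for illustration; the disclosure does not specify how the validation is computed.

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(sample * sample for sample in frame) / len(frame)


def detect_voice(accel_frame, mic_frame, accel_threshold, mic_threshold):
    """Return True only if the accelerometer detects vibration AND the microphone agrees.

    accel_threshold and mic_threshold are hypothetical tuning values.
    """
    candidate = frame_energy(accel_frame) > accel_threshold  # bone-conducted vibration
    confirmed = frame_energy(mic_frame) > mic_threshold      # acoustic validation
    return candidate and confirmed
```

Requiring agreement between the two sensors trades some sensitivity for fewer false detections. A system that must also work when the accelerometer has poor contact would need a different combination rule, which is not shown here.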
While certain accelerometers provide reliable bone conduction voice pickup, some of these accelerometers can be sensitive to acoustic noise. In particular cases, this sensitivity to acoustic noise can make it difficult to define universal bone-conducted voice activity thresholds. In addressing this issue, in various particular implementations, the audio device 10 includes a vibration sensor 150 (e.g., accelerometer) and a microphone (e.g., additional microphone 50A) located proximate one another but separated from the user's mouth, e.g., proximate the rear 140 of the crown 30 or on the back strap 54 (
In still further implementations, as noted herein, the vibration sensor 150 can be mounted in the head mount 20 in a manner configured to detect vibration of one or more portions of the user's head. For example, vibration sensor 150A is configured to detect vibration of the user's temple region. Vibration sensor 150B can be configured to detect vibration from the user's jaw. In additional implementations, one or more vibration sensors 150 and/or additional microphones 50A are located along straps or other mounting equipment within or coupled to the head mount 20, e.g., to detect bone conduction (and verify such detection) from other regions of the user's head.
In still further implementations, as noted herein, the VAD system can include or otherwise be coupled with additional sensors that are capable of detecting voice activity of the user. For example, the VAD system can include (or otherwise be coupled with) one or more optical sensors (e.g., a camera) or infra-red (IR) sensors for detecting movement of the user's mouth and thus flagging voice activity.
Returning to
In some cases, the audio device 10 is particularly well suited to detect voice signals from the user in noisy ambient conditions, for example, in industrial use cases, outdoor use cases, etc. In particular cases, the microphones 50 are positioned on the brim 40 to detect voice signals from the user in such noisy ambient conditions. In some examples, the noisy ambient conditions are defined by conditions where the ambient sound pressure level (SPL) exceeds approximately 75 decibels (dB).
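For reference, sound pressure level in decibels is conventionally computed from the root-mean-square (RMS) sound pressure relative to a reference pressure of 20 micropascals in air. The sketch below applies that standard formula to check whether a block of samples exceeds the approximately 75 dB figure mentioned above. It assumes the samples are already calibrated in pascals, and the function names are hypothetical.

```python
import math

REFERENCE_PRESSURE_PA = 20e-6  # standard reference pressure in air (20 micropascals)


def sound_pressure_level_db(pressure_samples_pa):
    """SPL in dB re 20 uPa for a block of pressure samples given in pascals."""
    mean_square = sum(p * p for p in pressure_samples_pa) / len(pressure_samples_pa)
    rms = math.sqrt(mean_square)
    if rms == 0.0:
        return float("-inf")  # complete silence has no finite level
    return 20.0 * math.log10(rms / REFERENCE_PRESSURE_PA)


def is_noisy(pressure_samples_pa, threshold_db=75.0):
    """True when the block's SPL exceeds the threshold (75 dB by default)."""
    return sound_pressure_level_db(pressure_samples_pa) > threshold_db
```

For example, an RMS pressure of 1 Pa corresponds to roughly 94 dB SPL, which this sketch would classify as noisy, while 0.02 Pa corresponds to 60 dB SPL, which it would not.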
In some additional implementations, as shown in
With continuing reference to
In contrast to conventional systems for communicating in noisy environments, the audio devices described according to various implementations are configured to enhance communication while keeping the user immersed in the environment. The user can remain heads up and hands free in performing one or more tasks while still effectively communicating with others. That is, these audio devices can effectively enhance the user's voice in noisy environments without the need for a boom or other externally adjustable microphone.
The functionality described herein, or portions thereof, and its various modifications (hereinafter “the functions”) can be implemented, at least in part, via a computer program product, e.g., a computer program tangibly embodied in an information carrier, such as one or more non-transitory machine-readable media, for execution by, or to control the operation of, one or more data processing apparatus, e.g., a programmable processor, a computer, multiple computers, and/or programmable logic components.
A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a network.
Actions associated with implementing all or part of the functions can be performed by one or more programmable processors executing one or more computer programs to perform the functions. All or part of the functions can be implemented as special purpose logic circuitry, e.g., an FPGA and/or an ASIC (application-specific integrated circuit). Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Components of a computer include a processor for executing instructions and one or more memory devices for storing instructions and data.
Additionally, actions associated with implementing all or part of the functions described herein can be performed by one or more networked computing devices. Networked computing devices can be connected over a network, e.g., one or more wired and/or wireless networks such as a local area network (LAN), wide area network (WAN), personal area network (PAN), Internet-connected devices and/or networks, and/or cloud-based computing resources (e.g., cloud-based servers).
In various implementations, components described as being “coupled” to one another can be joined along one or more interfaces. In some implementations, these interfaces can include junctions between distinct components, and in other cases, these interfaces can include a solidly and/or integrally formed interconnection. That is, in some cases, components that are “coupled” to one another can be simultaneously formed to define a single continuous member. However, in other implementations, these coupled components can be formed as separate members and be subsequently joined through known processes (e.g., soldering, fastening, ultrasonic welding, bonding). In various implementations, electronic components described as being “coupled” can be linked via conventional hard-wired and/or wireless means such that these electronic components can communicate data with one another. Additionally, sub-components within a given component can be considered to be linked via conventional pathways, which may not necessarily be illustrated.
The term “approximately” as used with respect to values denoted herein can allow for a nominal variation from absolute values, e.g., of several percent or less.
A number of implementations have been described. Nevertheless, it will be understood that additional modifications may be made without departing from the scope of the inventive concepts described herein, and, accordingly, other implementations are within the scope of the following claims.