Certain types of devices may use audio processing techniques to determine positional information about sound sources and/or to produce audio signals that emphasize sound from certain positions or directions. When developing and implementing such techniques, it is often desired to test their effectiveness and accuracy.
The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different figures indicates similar or identical components or features.
This disclosure relates to systems and techniques for testing the directional capabilities of audio devices. Directional capabilities may include beamforming and sound source localization, for example. Beamforming and sound source localization rely on differences in arrival times of sound at different microphones. Beamforming shifts and combines signals produced by the microphones in such a way as to reinforce sounds coming from one direction while deemphasizing sounds coming from other directions. Sound source localization analyzes phase differences in the signals to determine the directions or positions from which sounds originate.
In order to test the performance of audio devices that utilize position-dependent audio processing techniques, a mechanism is provided for positioning an audio device and a sound source at different relative positions, for emitting sound from the sound source at each of the relative positions, and for recording data produced by the audio device at each of the relative positions in response to the emitted sound.
The mechanism has a horizontal linear actuator configured to move the audio device horizontally. A rotary actuator is configured to rotate the audio device about a vertical axis. A vertical linear actuator configured to move the sound source vertically. The horizontal linear actuator is controlled to establish the horizontal distance of the sound source relative to the audio device. The rotary actuator is controlled to establish the azimuth of the sound source relative to the audio device. The vertical linear actuator is controlled to establish the altitude or elevation of the sound source relative to the audio device.
A controller is configured to select and establish the relative positions of the sound source and the audio device through control of the actuators, to emit a test sound at each of the relative positions through control of the sound source, and to record data produced by the audio device in response to the sound emitted at each of the relative positions. This results in test data having multiple records. Each record comprises (a) coordinates specifying the actual relative position of the sound source and audio device and (b) data produced by the audio device in response to the sound emitted at the relative position. The test data records can be analyzed during or after their acquisition to determine accuracy or validity of the provided data. The data records may also be analyzed to calibrate or determine characteristics of the audio device.
In some embodiments, the data produced by the audio device may comprise audio signals, such as the audio signals generated by each of the multiple microphones of the audio device or directional audio signals generated by beamforming. In other embodiments, the data may comprise positional data corresponding to each relative position, such as coordinates of the relative position of the sound source as calculated by the audio device.
The audio device 102 has multiple microphones 110 that are spaced from each other for use in conjunction with beamforming and/or sound localization. In the illustrated example, the microphones 110 are positioned within a single, horizontal plane. However, different types of audio devices may utilize microphone arrays in which individual microphones or microphone elements are arranged linearly or in three dimensions. In some embodiments, the audio device may have only a single microphone 110.
In the embodiment of
The horizontal linear actuator 112 rests on or is mounted to the floor 106 and configured to move along a horizontal axis 120 parallel to the floor and perpendicular to the wall 108(a). The vertical linear actuator 114 is mounted to the wall 108(a) and configured to move the sound source 118 along a vertical axis 122 that is parallel to the wall 108(a) and perpendicular to the floor 106. The rotary actuator 116 is supported by and above the horizontal linear actuator 112. The rotary actuator 116 is configured to rotate the audio device 102 about a vertical axis 124.
Each of the horizontal and vertical linear actuators 112 and 114 may comprise a linear slide or other type of motorized or actuated linear motion mechanism. Each linear actuator may comprise a single slide rail as shown or may comprise a set of parallel slide rails. Various other types and configurations of linear motion mechanisms may be used.
The rotary actuator 116 may comprise a turntable or other type of rotary motion mechanism. The rotary actuator 116 may be supported by and above the horizontal linear actuator 112 by a support stand or frame 126.
The audio device 102 may be affixed to the rotary actuator 116 by means of a test fixture 128. The test fixture 128 may be configured to establish a fixed position and rotational alignment between the audio device 102 and the rotary actuator 116. The test fixture 128 may have fasteners, clamps, or other mechanisms for fixing the audio device to the rotary actuator 116. The test fixture 128 may be replaceable or interchangeable with different test fixtures to accommodate different types of audio devices 102.
The sound source 118 may be supported horizontally outwardly from the vertical linear actuator 114 by a support arm 130. The support arm 130 may in some embodiments have a pivot or pivot actuator 132 that allows the sound source 118 to be pivoted up and down about a horizontal axis 134 that is perpendicular to the vertical axis 122 of the vertical linear actuator 114. In some embodiments, the sound source 118 may comprise a sound transducer such as a conventional loudspeaker element. In other embodiments, the sound source 118 may comprise any other mechanism capable of producing sound, such as a mechanical clicker.
In the illustrated embodiment, the axes 120, 122, and 124 are in a common plane. The vertical axis 122 of the vertical linear actuator 114 intersects and is perpendicular to the horizontal axis 120 of the horizontal linear actuator 112. The vertical axis 124 of the rotary actuator 116 intersects and is perpendicular to the horizontal axis 120 of the horizontal linear actuator 112. The vertical axis 124 of the rotary actuator 116 is parallel to the vertical axis 122 of the vertical linear actuator 114.
In the illustrated embodiment, the vertical linear actuator 114 is configured and positioned so that the sound source 118 can be moved both above and below the audio device 102. The support stand 126 is used to elevate the audio device 102 above the floor 106 so that the sound source 118 can be moved below the horizontal plane of the audio device 102.
The controller 202 is configured to communicate with the audio device 102 to coordinate data capture by the audio device 102. Specifically, the controller 202 may signal the audio device 102 to capture, produce, and/or provide test data 204 during times when the test sound is being emitted from the sound source 118. The controller 202 is also configured to receive the test data 204 from the audio device 102 and to record the test data 204. The test data may comprise multiple data records. Each data record of the test data 204 may comprise (a) coordinates indicating the actual position of the sound source relative to the audio device 102 and (b) device data produced and provided by the audio device 102 in response to the test sound being emitted at that position. In some embodiments, the test data 204 may include an audio signal corresponding to the test sound, such as a digital audio file that is rendered at the sound source 118 to produce the test sound. In some embodiments, the test data 204 may indicate a more general nature of the test sound, such as a number and timing of clicks or impulse sounds that are produced by the sound source 118.
The device data may comprise one or more audio signals, calculated coordinates, and/or other data relating to the functions of the audio device 102 that depend upon the position of the sound source 118 relative to the audio device 102. For example, the device data may comprise audio signals produced by or derived from the microphones 110 of the audio device 102 during the times that the test sounds are emitted by the sound source 118. As another example, the device data may comprise directional or beamformed audio signals produced by the audio device 102 during the times that the test sounds are emitted.
Alternatively, the device data may comprise parametric data calculated or obtained by the audio device 102 in response to the test sound emitted at each of the relative locations. For example, the device data may comprise coordinates of the sound source 118 as calculated by the audio device 102 using its sound source localization functionality. As another example, the device data may comprise calculated differences in arrival times of the emitted test sound at each of multiple microphones 110 of the audio device 102.
An action 302 comprises selecting multiple relative positions of the audio device 102 and the sound source 118. Each relative position may be defined by coordinates representing (a) an azimuth of the sound source 118 relative to the audio device 102, (b) an altitude or elevation of the sound source 118 relative to the audio device 102, and (c) a horizontal distance between the sound source 118 and the audio device 102. Other types of coordinates may also be used to specify relative positions, such as XYZ Cartesian coordinates.
The positions may be selected in various ways, including by manual selection and automatic selection. In some embodiments, the positions may be selected at regular spacings. For example, the positions may be selected to form a regular grid in the three-dimensional space surrounding the audio device. In some cases, the positions may form an irregular grid having a higher position densities in certain regions or areas that or of relatively higher interest.
In some embodiments, the positions may be selected during testing based on earlier test results. For example, initial testing may find that certain regions are more critical than others to position changes or that results are relatively unreliable or inaccurate in certain regions. Position density may be increased in these areas. Similarly, testing at or around certain positions may be repeated when results are not as expected or are not uniform.
A set of actions 304, 306, and 308 are performed for every selected relative position. The action 304 comprises positioning the sound source 118 and the audio device 102 at the selected relative position by (a) moving the audio device along the horizontal axis 120, (b) rotating the audio device about the vertical axis 124, and (c) moving the sound source along the vertical axis 122. More specifically, a desired horizontal distance is established by moving the audio device 102 with the horizontal linear actuator 112 along the horizontal axis 120. A desired azimuth is established by rotating the rotary actuator 116 about the vertical axis 124. A desired elevation of the sound source 118 relative to the audio device 102 is established by moving the sound source 118 with the vertical linear actuator 114 along the vertical axis 122.
In some embodiments, the action 304 may also comprise pivoting the sound source 118 up or down about the horizontal axis 134 using the pivot actuator 132. This action may be performed in embodiments in which the sound source 118 is directional, so that the output of the sound source 118 is pointing directly toward the position of the audio device 102.
The action 306 comprises emitting the test sound at the selected relative position. In some embodiments, the sound source may comprise a loudspeaker and the controller 202 may generate a test sound and play the test sound on the loudspeaker. The test sound may in some cases comprise an impulse sound that the audio device 102 may attempt to localize. Other test sounds may comprise recorded voice, simulated voice, music, white noise, and so forth. In some cases, multiple noises or sounds may be played at each position. Furthermore, the sound source 118 may in some cases comprise multiple transducers, such as a loudspeaker and a clicking device. The action 306 may include emitting multiple test sounds at each relative position, such as audio from a loudspeaker and an impulse sound generated by a mechanical device.
The action 308 comprises receiving and recording test data from the audio device 102. The test data may comprise audio signals, audio waveforms, sound source localization data, position coordinates, directional parameters, filter parameters, time-difference-of-arrival (TDOA) data, intermediate parameters calculated by the audio device 102 when performing sound source localization or beamforming, and so forth. The test data may be recorded for each of the selected positions and saved for later analysis. Alternatively, the test data may be analyzed as it is received by the controller 202.
The memory 404 may contain applications and programs in the form of computer-executable instructions 406 that are executed by the processor 402 to perform acts or actions that implement desired functionality of the controller 202, including the methods and functionality described above. The memory 404 may be a type of non-transitory computer-readable storage media and may include volatile and nonvolatile memory. Thus, the memory 404 may include, but is not limited to, RAM, ROM, EEPROM, flash memory, or other memory technology.
Although the subject matter has been described in language specific to structural features, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features described. Rather, the specific features are disclosed as illustrative forms of implementing the claims.
Number | Name | Date | Kind |
---|---|---|---|
5729612 | Abel | Mar 1998 | A |
7418392 | Mozer et al. | Aug 2008 | B1 |
7720683 | Vermeulen et al. | May 2010 | B1 |
7774204 | Mozer et al. | Aug 2010 | B2 |
20120223885 | Perez | Sep 2012 | A1 |
20120243697 | Frye | Sep 2012 | A1 |
Number | Date | Country |
---|---|---|
WO2011088053 | Jul 2011 | WO |
Entry |
---|
Pinhanez, “The Everywhere Displays Projector: A Device to Create Ubiquitous Graphical Interfaces”, IBM Thomas Watson Research Center, Ubicomp 2001, Sep. 30-Oct. 2, 2001, 18 pages. |