This application claims the priority benefit of Taiwan application serial no. 112114734, filed on Apr. 20, 2023. The entirety of the above-mentioned patent application is hereby incorporated by reference herein and made a part of this specification.
The disclosure relates to an audio processing technology, and in particular relates to a pairing method of dual channel and a mobile device.
Smart speakers may be wirelessly connected to electronic devices and play music accordingly. After connecting a mobile device to a smart speaker system, audio signals of dual channel may be output to the smart speaker system respectively, allowing users to experience realistic stereo sound effects.
However, the users are usually not sure about which smart speaker outputs audio signals for the left channel or the right channel.
An embodiment of the disclosure provides a pairing method of dual channel and a mobile device, which can solve the aforementioned problems.
A pairing method of dual channel according to an embodiment of the disclosure is suitable for two audio playback devices having speakers and microphone arrays and a mobile device. The pairing method includes (but is not limited to) the following. An available area is defined according to a first position relation between the two audio playback devices and a reference target. The first position relation is whether the reference target is positioned between the two audio playback devices, and the available area is an area extending from a position between the two audio playback devices to two sides of the reference target. A second position relation of the mobile device and the available area is determined. The second position relation includes the mobile device being positioned within the available area and the mobile device not being positioned within the available area. The second position relation is determined based on a third position relation between the two audio playback devices and the mobile device. The third position relation is a relative position between any two of the two audio playback devices and the mobile device, and the third position relation is determined based on the power obtained by one of the two audio playback devices and the mobile device receiving a test audio signal played by another one of the two audio playback devices and the mobile device through the beamforming technology. A first corresponding relation between the two audio playback devices and audio signals of a left channel and a right channel is determined according to the second position relation. The first corresponding relation includes one of the two audio playback devices corresponding to the audio signal of the left channel and another one of the two audio playback devices corresponding to the audio signal of the right channel. A second corresponding relation of two sides of the available area with respect to the left channel and the right channel has been defined. The second corresponding relation includes one side of the two sides of the available area corresponding to the audio signal of the left channel and another side of the two sides of the available area corresponding to the audio signal of the right channel. The audio signals of the left channel and the right channel are played by the two audio playback devices respectively according to the first corresponding relation.
A mobile device according to an embodiment of the disclosure includes (but is not limited to) a communication transceiver, a storage, and a processor. The storage is used to store program codes. The processor is coupled to the communication transceiver and the storage. The processor is configured to execute the program codes to: define an available area according to a first position relation between two audio playback devices and a reference target, determine a second position relation of the mobile device and the available area, determine a first corresponding relation between the two audio playback devices and audio signals of a left channel and a right channel according to the second position relation, and play the audio signals of the left channel and the right channel by the two audio playback devices respectively according to the first corresponding relation. The first position relation is whether the reference target is positioned between the two audio playback devices, and the available area is an area extending from a position between the two audio playback devices to two sides of the reference target. The second position relation includes the mobile device being positioned within the available area and the mobile device not being positioned within the available area. The second position relation is determined based on a third position relation between the two audio playback devices and the mobile device. The third position relation is a relative position between any two of the two audio playback devices and the mobile device, and the third position relation is determined based on the power obtained by one of the two audio playback devices and the mobile device receiving a test audio signal played by another one of the two audio playback devices and the mobile device through the beamforming technology. The first corresponding relation includes one of the two audio playback devices corresponding to the audio signal of the left channel and another one of the two audio playback devices corresponding to the audio signal of the right channel. A second corresponding relation of two sides of the available area with respect to the left channel and the right channel has been defined. The second corresponding relation includes one side of the two sides of the available area corresponding to the audio signal of the left channel and another side of the two sides of the available area corresponding to the audio signal of the right channel.
Based on the above, the pairing method of dual channel and the mobile device according to the embodiments of the disclosure determines the relative positions between the two audio playback devices and the mobile device through a sound source localization technology, whether the mobile device is positioned within the available area available for forming an effect of dual channel is determined, and the audio signals of the dual channels and the two audio playback devices are paired accordingly. In this way, the dual channels may be paired automatically.
In order to make the above-mentioned features and advantages of the disclosure more comprehensible, the following embodiments are provided for a detailed description along with accompanying drawings.
The mobile device 10 may be a smartphone, a tablet computer, a laptop, an intelligent assistant device, a wearable device, or other electronic devices.
The mobile device 10 includes (but is not limited to) a microphone array 11, a speaker 12, a communication transceiver 13, a storage 14, and a processor 15.
The microphone array 11 includes multiple microphones. The multiple microphones may be dynamic, condenser, electret condenser, or other types of microphones, the microphones may also be other electronic components, analog-to-digital converters, filters, and audio processors or combinations thereof that may receive sound waves (e.g., human voice, ambient noise, machine operation noise) (that is, receive sounds or record sounds) and convert into audio signals. In an embodiment, the microphone array 11 is used to receive or record sounds.
The speaker 12 may be various types of speakers or amplifiers. In an embodiment, the speaker 12 is used to play sounds.
The communication transceiver 13 may support Bluetooth, Wi-Fi, or other wireless communication receiving and transmitting circuits. The communication transceiver 13 may include digital-to-analog converters, analog-to-digital converters, amplifiers, filters, and/or mixers. In an embodiment, the communication transceiver 13 is used to receive signals/data/information from external devices (for example, the audio playback devices 20, 30).
The storage 14 may be any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory, hard disk drive (HDD), solid-state drive (SSD), or similar components. In an embodiment, the storage 14 is used to store program codes, software modules, configuration settings, data (such as audio signals, algorithm parameters), or files, and the implementation details will be described later.
The processor 15 is coupled to the microphone array 11, the speaker 12, the communication transceiver 13, and the storage 14. The processor 15 may be a central processing unit (CPU), a graphic processing unit (GPU), or other programmable general-purpose or special-purpose microprocessors, digital signal processors (DSP), programmable controllers, field programmable gate arrays (FPGA), application-specific integrated circuits (ASIC), neural network accelerators, or other similar components or combinations of the aforementioned components. In an embodiment, the processor 15 is used to execute all or part of operations of the mobile device 10 and may load and execute various program codes, software modules, files, and data stored in the storage 14. In some embodiments, functions of the processor 15 may be implemented through software or chips.
The audio playback device 20 may be a wireless speaker, a smart speaker, or an intelligent assistant device.
The audio playback device 20 includes (but is not limited to) a microphone array 21, a speaker 22, a communication transceiver 23, a storage 24, and a processor 25.
Functions and implementation modes of the microphone array 21, the speaker 22, the communication transceiver 23, the storage 24, and the processor 25 may be respectively referred to the descriptions of the microphone array 11, the speaker 12, the communication transceiver 13, the storage 14, and the processor 15, and will not be repeated here.
In an embodiment, the processor 25 is used to execute all or part of operations of the audio playback device 20 and may load and execute various program codes, software modules, files, and data stored in the storage 24. In some embodiments, functions of the processor 25 may be implemented through software or chips.
In an embodiment, the processor 25 may integrate functions such as analog-to-digital converters, digital-to-analog converters, amplifiers, filters, or other audio processing components. In some embodiments, the functions may be implemented through one or more audio processing components respectively.
The audio playback device 30 may be a wireless speaker, a smart speaker, or an intelligent assistant device.
The audio playback device 30 includes (but is not limited to) a microphone array 31, a speaker 32, a communication transceiver 33, a storage 34, and a processor 35.
Functions and implementation modes of the microphone array 31, the speaker 32, the communication transceiver 33, the storage 34, and the processor 35 may be respectively referred to the descriptions of the microphone array 11, the speaker 12, the communication transceiver 13, the storage 14, and the processor 15, and will not be repeated here.
In an embodiment, the processor 35 is used to execute all or part of operations of the audio playback device 30 and may load and execute various program codes, software modules, files, and data stored in the storage 34. In some embodiments, functions of the processor 35 may be implemented through software or chips.
In an embodiment, the processor 35 may integrate functions such as analog-to-digital converters, digital-to-analog converters, amplifiers, filters, or other audio processing components. In some embodiments, the functions may be implemented through one or more audio processing components respectively.
In an embodiment, the mobile device 10 may be loaded with an application program for controlling the audio playback devices 20, 30. Functions of the application program include, for example, EQ settings, activation/deactivation, or volume adjustment.
In an embodiment, the mobile device 10 may transmit audio signals to the audio playback devices 20, 30 via a wireless network. The processors 25, 35 of the audio playback devices 20, 30 may convert digital signals into analog signals, increase the audio signals to an appropriate volume, and finally play the audio signals through the speakers 22, 32.
In an embodiment, received audio signals obtained by the microphone arrays 21, 31 from receiving sounds or recording sounds may be converted from analog signals to digital signals and transmitted to the mobile device 10 via the wireless network.
In the following description, the method of the disclosure will be described in conjunction with components and modules in the mobile device 10 and the audio playback devices 20, 30. Various processes of the method may be adjusted according to implementation situations and are not limited thereto.
The processor 15 may obtain received audio signals (i.e., the received audio signals obtained by respectively receiving or recording the test audio signals) from the audio playback devices 20, 30 through the communication transceiver 13. The processor 15 may determine the spacing between the two audio playback devices 20, 30 according to the power of the received audio signals. If the signal power is stronger, the spacing between the two audio playback devices 20, 30 is shorter; if the signal power is weaker, the spacing between the two audio playback devices 20, 30 is longer. For example, the signal power is inversely proportional to the square of the spacing, but it may still be affected by factors such as environment or receiver sensitivity. The storage 14 may store multiple corresponding relations or formulas between signal power and the spacing thereof in advance for the spacing decision-making.
Regarding a relative angle (i.e., an angle θ12 of the audio playback device 20 relative to the audio playback device 30 or an angle θ21 of the audio playback device 30 relative to the audio playback device 20), the microphone arrays 21, 31 may form beams with multiple receiving directions (or pointing angles). The microphone arrays 21, 31 may form beams according to the beamforming technology. Beamforming may be achieved by adjusting the parameters (e.g., phase and amplitude) of the basic units of the phase array so that signals at certain angles obtain constructive interference, while signals at some other angles obtain destructive interference. Therefore, different parameters form different beam patterns, and the receiving direction of the main beam may vary. The processor 15 may generate multiple receiving directions by predefining or based on operations input by users. For example, every 10° interval from −90° to 90° may be used as a receiving direction.
During a process of playing test audio signals, the microphone arrays 21, 31 switch to specific pointing angles, and the processor 15 measures the signal power obtained from receiving the beam at current pointing angles through the microphone arrays 21, 31. The processor 15 may determine a relative angle according to the signal power obtained from receiving the beam at the pointing angles, and the relative angle is related to the pointing angle with a high signal power. For example, the processor 15 may define a power threshold value and determine whether the signal power corresponding to each pointing angle is greater than the power threshold value. If the signal power corresponding to the pointing angle is greater than the signal threshold value, then the processor 15 may determine that there is a sound source (i.e., other audio playback devices 20, 30) on the pointing angle and take the pointing angle as a relative angle relative to the audio playback devices 30, 20. If the signal power corresponding to the pointing angle is not greater than the signal threshold value, then the processor 15 may determine that there is no sound source (i.e., other audio playback devices 20, 30) on the pointing angle. For another example, the processor 15 selects one or a specific number of pointing angles with a high signal power as relative angles. It should be noted that the signal threshold value may be determined in advance according to experiments or preset information and may vary according to actual needs.
In some embodiments, the processor 15 may improve the accuracy of relative angle prediction through the AI-beamforming technology. For example, a machine learning model may be trained according to characteristics and reception strength of the microphone arrays 31, 21, as well as the actual position of the sound source, so that the machine learning model may infer a corresponding sound source position for the data to be evaluated (for example, the reception strength of the microphone arrays 31, 21). In this way, interference can be effectively avoided.
In another embodiment, the processor 15 may estimate the relative angle of the audio playback devices 20, 30 relative to other audio playback devices 30, 20 based on the angle of arrival (AOA, or degree of arrival, DOA) positioning technology. For example, the processor 15 may determine the relative angle based on the time difference between the two sound waves arriving at microphone arrays 31, 21 of other audio playback devices 30, 20 after the test audio signal is reflected through the audio playback devices 20, 30 and a distance between two adjacent microphones in the microphone arrays 31, 21.
The processor 15 determines whether the spacing between the two audio playback devices 20, 30 is less than a length limit (Step S220). Specifically, the length limit is one of the limits of the available area for forming an effect of dual channel. The length limit may be, for example, 50 centimeters, 1 meter, or 2 meters, and may change according to specifications or capabilities of the speak lers 22, 32 of the two audio playback devices 20, 30. The available area will be described in detail in subsequent embodiments.
In response to the spacing between the two audio playback devices 20, 30 being less than the length limit, the processor 15 prompts that the spacing is less than the length limit or prompts that the spacing should be greater than the length limit (Step S230). For example, voice commands, music, or warning tones are played through the speakers 12, 22, 32. Alternatively, the prompt content is displayed on a display (not shown in the drawing). In response to the spacing between the two audio playback devices 20, 30 is not less than the length limit, the processor 15 executes subsequent steps of dual channel pairing (for example, entering Step S410). That is to say, through prompting, the user is guided to separate the two audio playback devices 20, 30 until the spacing thereof is greater than the length limit.
On the other hand, the available area is an area extending from a position between the two audio playback devices 20, 30 to two sides of the reference target. The position may be the midpoint between the two audio playback devices 20, 30 or any point on the connection line of the two audio playback devices 20, 30. The available area may be considered as the area within which users may perceive an experience of dual channel through their hearing.
In other embodiments, the shape, size, and/or position of the available area may also change.
Please refer to
Since the available area is defined based on the relative positions of the two audio playback devices 20, 30, whether the mobile device 10 is within the available area may be determined according to the third position relation between the two audio playback devices 20, 30 and the mobile device 10.
For another example,
Please refer to
Please refer to
Taking
Taking
It is assumed that another application scenario is the head of the user facing upward of the drawing. Therefore, in response to the second position relation being the mobile device 10 positioned within the available area, the processor 15 may determine the first corresponding relation to be that the audio playback device 20 near the side E1 corresponds to the audio signal of the left channel, and that the audio playback device 30 near the side E2 corresponds to the audio signal of the right channel.
Please refer to
In summary, in the pairing method of dual channel and mobile device of the disclosure, the position of the sound source (for example, the relative positions of the two audio playback devices and the mobile device) is determined based on beamforming, whether the mobile device is positioned within the available area is determined, and the audio signals of the dual channels are paired accordingly. In this way, the convenience of pairing can be improved, and the existing problem that the channels could not be distinguished can be solved.
Although the disclosure has been disclosed in the above embodiments, the embodiments are not intended to limit the disclosure. Persons with general knowledge in the relevant technical field may make modifications and refinements without departing from the spirit and scope of the disclosure. Therefore, the scope of protection of the disclosure shall be defined by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
112114734 | Apr 2023 | TW | national |