The invention relates to stereo enhancement; in particular, to a stereo enhancement system and a stereo enhancement method.
In general, as shown in
Therefore, the invention provides a stereo enhancement system and a stereo enhancement method to solve the above-mentioned problems of the prior art.
A preferred embodiment of the invention is a stereo enhancement system. In this embodiment, the stereo enhancement system includes a beamforming unit and a signal processing unit. The beamforming unit is configured to receive a plurality of input sound signals and generate a plurality of beamforming sound signals corresponding to a plurality of direction intervals respectively. The signal processing unit is coupled to the beamforming unit and configured to receive the plurality of beamforming sound signals corresponding to the plurality of direction intervals respectively and generate a first synthesized output sound signal and a second synthesized output sound signal accordingly.
In an embodiment, the signal processing unit includes: a plurality of head-related transfer function (HRTF) units, coupled to the beamforming unit and corresponding to the plurality of direction intervals respectively, and each HRTF unit in the plurality of HRTF units receiving a corresponding beamforming sound signal in the plurality of beamforming sound signals and calculating the beamforming sound signal to generate a first output sound signal and a second output sound signal; a first synthesis unit, coupled to the plurality of HRTF units, configured to synthesize a plurality of first output sound signals generated by the plurality of HRTF units into the first synthesized output sound signal; and a second synthesis unit, coupled to the plurality of HRTF units, configured to synthesize a plurality of second output sound signals generated by the plurality of HRTF units into the second synthesized output sound signal.
In an embodiment, there is an overlap between the angle ranges included in the plurality of direction intervals.
In an embodiment, the plurality of input sound signals is from a recording device, and all or part of recording range of the recording device is divided into the plurality of direction intervals, so that the beamforming unit generates the plurality of beamforming sound signals relative to all direction intervals of the recording device.
In an embodiment, the first output sound signal and the second output sound signal generated by each HRTF unit correspond to a left ear and a right ear respectively.
In an embodiment, the first synthesis unit and the second synthesis unit output the first synthesized output sound signal and the second synthesized output sound signal to a left ear and a right ear respectively.
In an embodiment, sound fields of the first synthesized output sound signal and the second synthesized output sound signal are wider than sound fields of the plurality of input sound signals.
In an embodiment, the plurality of HRTF units is operated in a real recording mode.
In an embodiment, the plurality of HRTF units is operated in a simulation mode and includes at least one of the following: a filtering unit, configured to simulate a level difference and a time difference between two ears; a delay unit, configured to simulate the time difference between the two ears; and a gain unit, configured to simulate the level difference between the two ears.
In an embodiment, the signal processing unit further includes: a sound detection unit, coupled between the beamforming unit and the plurality of HRTF units, configured to detect whether the plurality of beamforming sound signals corresponding to the plurality of direction intervals includes effective sounds and output beamforming sound signals including the effective sounds to the plurality of HRTF units respectively.
In an embodiment, the signal processing unit adjusts a width of a sound field by modifying a delay and a gain of the plurality of HRTF units.
Another preferred embodiment of the invention is a stereo enhancement method. In this embodiment, the stereo enhancement method includes the following steps: (a) generating a plurality of beamforming sound signals corresponding to a plurality of direction intervals according to a plurality of input sound signals respectively; (b) calculating each of the plurality of beamforming sound signals according to an algorithm to generate a first output sound signal and a second output sound signal corresponding to each of the plurality of direction intervals; and (c) synthesizing a plurality of first output sound signals into a first synthesized output sound signal and synthesizing a plurality of second output sound signals into a second synthesized output sound signal.
In an embodiment, the algorithm is a head-related transfer function (HRTF) or a technology simulating a channel response of a sound source to a left ear and a right ear.
In an embodiment, the step (a) further includes detecting whether the plurality of beamforming sound signals corresponding to the plurality of direction intervals include effective sounds, so that the plurality of beamforming sound signals generated in the step (a) are those including the effective sounds.
In an embodiment, the stereo enhancement method further includes the following step: adjusting a width of a sound field by modifying a gain and a delay of the HRTF or of another technique simulating the channel response of the sound source to the left ear and the right ear.
In an embodiment, there is an overlap between the angle ranges included in the plurality of direction intervals.
In an embodiment, the plurality of input sound signals is from a recording device, and all or part of recording range of the recording device is divided into the plurality of direction intervals, so that the step (a) generates the plurality of beamforming sound signals relative to all or part of direction intervals of the recording device.
In an embodiment, sound fields of the first synthesized output sound signal and the second synthesized output sound signal are wider than sound fields of the plurality of input sound signals.
In an embodiment, the step (b) is operated in a real recording mode, which uses at least one of filter, delay and gain generated from real recording.
In an embodiment, the step (b) is operated in a simulation mode, which uses at least one of filter, delay and gain generated from simulation and the stereo enhancement method further includes at least one of the following: simulating a time difference between two ears; and simulating a level difference between the two ears.
Compared to the prior art, the stereo enhancement system and the stereo enhancement method of the invention separate the plurality of sound signals recorded by the microphone array into different channels corresponding to different sound direction intervals through the beamforming method, and apply head-related transfer function (HRTF) processing in each channel to enhance the spatial sense of the sound signal, so that the sound signal presents a better stereo effect, making the sound heard by the left ear and the right ear wider.
The advantage and spirit of the invention may be understood by the following detailed descriptions together with the appended drawings.
A preferred embodiment of the invention is a stereo enhancement system. In this embodiment, the stereo enhancement system retains all the input sound signals recorded by the microphone array of the recording device and separates them, through the beamforming method, into different channels corresponding to different sound direction intervals. Head-related transfer function (HRTF) processing is then applied in each channel to enhance the spatial sense of the sound signal, so that the stereo effect of the sound signal is effectively enhanced and the sound heard by the left ear and the right ear is more spacious.
Please refer to
As shown in
As shown in
It should be noted that the invention does not use the recording device (e.g., a microphone array) to detect a specific target direction interval. Instead, the invention divides all or part of the sound collection range of the recording device into a plurality of direction intervals. The number of direction intervals is not limited to that of the above embodiment, and the angle ranges of the direction intervals can be the same or different, without specific limitation.
In addition, the angle ranges respectively included in the plurality of direction intervals may overlap. For example, assuming that an angle range of a direction interval DI1 is 0˜30 degrees and an angle range of a direction interval DI2 is 15˜45 degrees, the angle ranges respectively included in the direction intervals DI1 and DI2 overlap by 15 degrees, so as to ensure that when an object moves from the direction interval DI1 to the direction interval DI2, the sound can remain smooth.
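The overlapping division described above can be sketched as follows. This is a minimal illustration only: the function names `make_direction_intervals` and `intervals_containing`, and the choice of equal-width intervals with a uniform overlap, are assumptions made for the example and are not taken from the embodiment, which allows the angle ranges to differ.

```python
def make_direction_intervals(start_deg, end_deg, n, overlap_deg):
    """Split [start_deg, end_deg] into n equal-width direction intervals
    whose neighbours overlap by overlap_deg degrees (hypothetical helper)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    span = end_deg - start_deg
    # n intervals of width w overlapping by o cover n*w - (n-1)*o degrees.
    width = (span + (n - 1) * overlap_deg) / n
    if overlap_deg < 0 or (n > 1 and overlap_deg >= width):
        raise ValueError("overlap must be non-negative and smaller than the interval width")
    step = width - overlap_deg
    return [(start_deg + i * step, start_deg + i * step + width) for i in range(n)]


def intervals_containing(angle_deg, intervals):
    """Return the indices of every direction interval that contains angle_deg."""
    return [i for i, (lo, hi) in enumerate(intervals) if lo <= angle_deg <= hi]


# The DI1 / DI2 example from the text: 0-30 degrees and 15-45 degrees.
intervals = make_direction_intervals(0, 45, 2, 15)
print(intervals)
print(intervals_containing(20, intervals))  # a source at 20 degrees lies in both
```

A source located in the shared 15 to 30 degree region contributes to both channels, which is what lets the sound remain smooth as the source crosses from one interval to the next.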
As shown in
Please refer to
It should be noted that the first synthesized output sound signal SY1 and the second synthesized output sound signal SY2 generated by the signal processing unit 52 are transmitted to the left ear LE and the right ear RE respectively, and the sound fields of the first synthesized output sound signal SY1 and the second synthesized output sound signal SY2 are wider than the sound field of the M input sound signals SIN1˜SINM. As a result, when the left ear LE and the right ear RE hear the first synthesized output sound signal SY1 and the second synthesized output sound signal SY2 respectively, a better stereo effect is obtained.
In practical applications, the M input sound signals SIN1˜SINM received by the beamforming unit 50 can come from a recording device (such as a microphone array), and the sound collection range of the recording device can be divided into N direction intervals DI1˜DIN, so that the beamforming unit 50 generates N beamforming sound signals BF1˜BFN corresponding to all N direction intervals DI1˜DIN of the recording device.
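As a rough illustration of how M input sound signals could be turned into N beamforming sound signals, the sketch below uses delay-and-sum beamforming steered to the centre of each direction interval. The text does not specify which beamforming algorithm the beamforming unit 50 uses, so delay-and-sum, the uniform linear array geometry, the whole-sample delays, and all names (`compensating_delays`, `delay_and_sum`, `beamform`) are assumptions made for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air (assumed value)


def compensating_delays(num_mics, spacing_m, angle_deg, sample_rate):
    """Whole-sample delays that time-align a plane wave arriving from
    angle_deg (0 = broadside) on a uniform linear array (assumed geometry)."""
    tau = spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND
    arrivals = [m * tau * sample_rate for m in range(num_mics)]
    latest = max(arrivals)
    # Delay each microphone so that it lines up with the latest arrival.
    return [round(latest - a) for a in arrivals]


def delay_and_sum(inputs, delays):
    """Delay each equal-length input signal and average the results."""
    n = len(inputs[0])
    out = [0.0] * n
    for signal, d in zip(inputs, delays):
        for t in range(d, n):
            out[t] += signal[t - d]
    return [v / len(inputs) for v in out]


def beamform(inputs, interval_centres_deg, spacing_m, sample_rate):
    """Produce one beamforming signal per direction interval (BF1..BFN)."""
    return [
        delay_and_sum(
            inputs,
            compensating_delays(len(inputs), spacing_m, centre, sample_rate),
        )
        for centre in interval_centres_deg
    ]


# Three microphones hear an impulse one sample apart (a source at 90 degrees
# for this spacing and sample rate).
mics = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
beams = beamform(mics, [0, 90], spacing_m=0.0343, sample_rate=10000)
print(beams[0])  # steered to 0 degrees: the impulse is smeared
print(beams[1])  # steered to 90 degrees: the impulse adds coherently
```

A practical implementation would more likely use fractional delays or frequency-domain weights; whole-sample delays keep the sketch short.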
In addition, the stereo enhancement system 5 and the recording device of the invention may be designed as different devices separated from each other or integrated into the same device according to actual needs. For example, the microphone array can be disposed on a motion camera to perform sound collection and the stereo enhancement process, and the resulting sound can then be stored or listened to by the user through headphones, but not limited to this.
In this embodiment, the signal processing unit 52 can include N HRTF units HR1˜HRN, a first synthesis unit 521 and a second synthesis unit 522. The N HRTF units HR1˜HRN are coupled to the beamforming unit 50 and correspond to the N direction intervals DI1˜DIN respectively. Each of the N HRTF units HR1˜HRN receives and processes a corresponding beamforming sound signal among the N beamforming sound signals BF1˜BFN, so that the N HRTF units HR1˜HRN generate N first output sound signals SO11˜SO1N and N second output sound signals SO21˜SO2N.
The first synthesis unit 521 is coupled to the N HRTF units HR1˜HRN and synthesizes the N first output sound signals SO11˜SO1N generated by the N HRTF units HR1˜HRN into a first synthesized output sound signal SY1, which is then transmitted to the left ear LE. The second synthesis unit 522 is coupled to the N HRTF units HR1˜HRN and synthesizes the N second output sound signals SO21˜SO2N generated by the N HRTF units HR1˜HRN into a second synthesized output sound signal SY2, which is then transmitted to the right ear RE.
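The signal flow from the beamforming sound signals through the HRTF units to the two synthesis units can be sketched as below. This is only a sketch under stated assumptions: each HRTF unit is modelled as convolution with a pair of left and right impulse responses, and synthesis is modelled as plain summation. Neither choice is stated in the text, and the function names and the toy impulse responses are hypothetical.

```python
def convolve(signal, impulse_response):
    """Direct-form convolution of two sequences."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def hrtf_unit(beam, hrir_left, hrir_right):
    """One HRTF unit: turn a beamforming signal into a first (left) and a
    second (right) output signal using an assumed impulse-response pair."""
    return convolve(beam, hrir_left), convolve(beam, hrir_right)


def synthesize(outputs):
    """Synthesis unit modelled as a sample-wise sum (an assumption)."""
    total = [0.0] * max(len(o) for o in outputs)
    for o in outputs:
        for t, v in enumerate(o):
            total[t] += v
    return total


def render(beams, hrirs):
    """Run every beam through its HRTF unit and synthesize SY1 and SY2."""
    firsts, seconds = [], []
    for beam, (hrir_left, hrir_right) in zip(beams, hrirs):
        so1, so2 = hrtf_unit(beam, hrir_left, hrir_right)
        firsts.append(so1)
        seconds.append(so2)
    return synthesize(firsts), synthesize(seconds)


# Two direction intervals with toy one-tap impulse responses: the first beam
# is louder on the left, the second beam is louder on the right.
beams = [[1.0, 0.0], [0.0, 2.0]]
hrirs = [([1.0], [0.5]), ([0.5], [1.0])]
sy1, sy2 = render(beams, hrirs)
print(sy1, sy2)
```

Because each beam keeps its own left and right response, sources in different direction intervals end up with different interaural cues in SY1 and SY2, which is the mechanism the text credits for the wider sound field.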
In practical applications, the first synthesized output sound signal SY1 and the second synthesized output sound signal SY2 can be output through an earphone to the left ear LE and the right ear RE respectively, but not limited to this.
In another embodiment, as shown in
It should be noted that the way that the sound detection unit 520 detects whether the N beamforming sound signals BF1˜BFN include the effective sounds can include but not be limited to the following two:
Next, each HRTF unit in the K HRTF units HR1˜HRK receives and processes the corresponding beamforming sound signal among the K beamforming sound signals BF1˜BFK, so that the K HRTF units HR1˜HRK generate K first output sound signals SO11˜SO1K and K second output sound signals SO21˜SO2K. The first synthesis unit 521 synthesizes the K first output sound signals SO11˜SO1K into a first synthesized output sound signal SY1 and transmits it to the left ear LE. The second synthesis unit 522 synthesizes the K second output sound signals SO21˜SO2K into a second synthesized output sound signal SY2 and transmits it to the right ear RE.
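The selection of K effective beamforming sound signals out of N can be illustrated with a simple gate. The detection criterion used here, a mean-power threshold, is purely an assumption chosen for the example and should not be read as the detection approach of the sound detection unit 520; the function name `select_effective` and the threshold value are likewise hypothetical.

```python
def select_effective(beams, threshold):
    """Return (index, signal) pairs for the beams whose mean power exceeds
    threshold. Mean power is an assumed stand-in for 'effective sound'."""
    selected = []
    for index, beam in enumerate(beams):
        power = sum(v * v for v in beam) / len(beam) if beam else 0.0
        if power > threshold:
            selected.append((index, beam))
    return selected


# Three beams: silence, a clear signal, and low-level noise.
beams = [[0.0, 0.0], [1.0, -1.0], [0.01, 0.0]]
effective = select_effective(beams, threshold=0.1)
print(effective)  # only the second beam is passed on
```

Keeping the original index alongside each selected signal lets the downstream stage pick the HRTF unit that matches the beam's direction interval.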
In practical applications, the N HRTF units HR1˜HRN can adopt a real recording mode using at least one of filter, delay and gain generated from real recording or a simulation mode using at least one of filter, delay and gain generated from simulation. When the N HRTF units HR1˜HRN adopt the simulation mode, each HRTF unit can include a filtering unit for simulating the level difference and the time difference between two ears, a delay unit for simulating the time difference between the two ears and/or a gain unit for simulating the level difference between the ears, but not limited to this. The signal processing unit 52 can adjust the width of the sound field of the sound signal by modifying the delays and gains of the N HRTF units HR1˜HRN, but not limited to this.
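A simulation-mode HRTF unit built only from a delay unit and a gain unit might look like the sketch below. It is a sketch under several assumptions that are not in the text: the time difference is taken from Woodworth's spherical-head approximation, ITD = (r/c)(θ + sin θ), which is valid for azimuths between -90 and 90 degrees; the level difference follows a crude sine law with an assumed maximum of 6 dB; and the `width` parameter is a hypothetical way of scaling both cues to adjust the width of the sound field.

```python
import math


def interaural_cues(azimuth_deg, head_radius_m=0.0875, speed_of_sound=343.0,
                    max_ild_db=6.0):
    """Return (itd_seconds, ild_db) for a source at azimuth_deg, where 0 is
    straight ahead and positive angles are towards the right ear.
    All default values are assumptions."""
    theta = math.radians(azimuth_deg)
    itd = head_radius_m / speed_of_sound * (theta + math.sin(theta))
    ild_db = max_ild_db * math.sin(theta)
    return itd, ild_db


def simulated_hrtf_unit(beam, azimuth_deg, sample_rate, width=1.0):
    """Delay and attenuate the far-ear copy of beam; return (left, right).
    width scales both cues (hypothetical sound-field width control)."""
    itd, ild_db = interaural_cues(azimuth_deg)
    itd *= width
    ild_db *= width
    delay = round(abs(itd) * sample_rate)       # delay unit, whole samples
    far_gain = 10 ** (-abs(ild_db) / 20)        # gain unit
    near = list(beam) + [0.0] * delay           # pad so both ears match in length
    far = [0.0] * delay + [far_gain * v for v in beam]
    if itd >= 0:                                # source on the right side
        return far, near
    return near, far


left, right = simulated_hrtf_unit([1.0], azimuth_deg=90, sample_rate=48000)
print(len(left) - 1, "samples of delay at the left ear")
wide_left, _ = simulated_hrtf_unit([1.0], 90, 48000, width=2.0)
print(len(wide_left) - 1, "samples of delay with width=2.0")
```

Setting `width` to 0 removes both cues so the two ears receive identical signals, while values above 1 exaggerate them; a real system would bound the scaled delay and gain to plausible ranges.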
For example, as shown in
Another preferred embodiment of the invention is a stereo enhancement method. In this embodiment, the stereo enhancement method can be applied to the stereo enhancement systems in the foregoing embodiments, but not limited to this.
Please refer to
In practical applications, the plurality of input sound signals in the step S10 can come from a recording device, and all or part of the sound collection range of the recording device is divided into the plurality of direction intervals, so that the step S10 can generate the plurality of beamforming sound signals relative to all direction intervals of the recording device, wherein the angle ranges respectively included in the plurality of direction intervals may overlap, but not limited to this.
In addition, the step S10 can also include detecting whether the plurality of beamforming sound signals corresponding to the plurality of direction intervals include effective sounds, so that the plurality of beamforming sound signals generated in the step S10 are those including the effective sounds.
In another embodiment, the stereo enhancement method can further include the following step: adjusting the width of the sound field by modifying the gain and delay of the HRTF or of another technique for simulating the channel response of the sound source to the left ear and the right ear, but not limited to this.
In another embodiment, the algorithm in the step S12 can be a head-related transfer function (HRTF) or any other technique capable of simulating the channel response of the sound source to the left ear and the right ear. In addition, the step S12 can adopt a real recording mode using at least one of filter, delay and gain generated from real recording or a simulation mode using at least one of filter, delay and gain generated from simulation. When the step S12 adopts the simulation mode, the stereo enhancement method can further include at least one of the following steps: simulating a time difference between two ears; and simulating a level difference between the two ears, but not limited to this.
Compared to the prior art, the stereo enhancement system and the stereo enhancement method of the invention separate the plurality of sound signals recorded by the microphone array into different channels corresponding to different sound direction intervals through the beamforming method, and apply head-related transfer function (HRTF) processing in each channel to enhance the spatial sense of the sound signal, so that the sound signal presents a better stereo effect, making the sound heard by the left ear and the right ear wider.
With the examples and explanations above, the features and spirit of the invention are hopefully well described. Those skilled in the art will readily observe that numerous modifications and alterations of the device may be made while retaining the teaching of the invention. Accordingly, the above disclosure should be construed as limited only by the metes and bounds of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
111126730 | Jul 2022 | TW | national |