This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2018-079900, filed on Apr. 18, 2018, the entire contents of which are incorporated herein by reference.
The present disclosure relates to a noise reduction apparatus and a noise suppressing method.
As a method of removing noise superimposed on audio data, a spectrum subtraction method has been known. In this spectrum subtraction method, a subtraction spectrum signal is generated by subtracting what is obtained by multiplying a spectrum signal of noise by a subtraction coefficient from an input spectrum signal based on input audio data, and an output spectrum signal and audio data in which noise is reduced are generated by using this subtraction spectrum signal. However, in the spectrum subtraction method, the subtraction coefficient takes a large value, and depending on a difference in level of voice and noise included in the input audio data, a subtraction resultant is to be zero or smaller. As a result, a large distortion is generated in an output spectrum, and this distortion causes unusual sound, so-called musical noise.
As a method of preventing occurrence of musical noise described above, a technique of preventing occurrence of musical noise by generating audio data by using a subtraction spectrum signal and background noise obtained in advance has been known. The subtraction spectrum signal is generated by subtracting a signal obtained by multiplying a spectrum signal of noise by a predetermined coefficient while sequentially changing the value from an input spectrum signal based on input audio data (for example, JP-A-2014-44313).
A noise reduction apparatus according to one aspect of the present disclosure includes: a first converting circuit configured to generate first data by subjecting externally input first audio data to Fourier transform to convert into amplitude data per frequency; a noise-data recording circuit configured to record noise data that has been detected in advance; a subtracting circuit configured to generate second data by subtracting the noise data from the first data; an arithmetic circuit configured to generate third data by synthesizing the first data and the second data; and a second converting circuit configured to generate second audio data by subjecting the third data to inverse Fourier transform.
The above and other features, advantages and technical and industrial significance of this invention will be better understood by reading the following detailed description of presently preferred embodiments of the disclosure, when considered in connection with the accompanying drawings.
Forms to implement the present disclosure (hereinafter, “embodiments”) are described in detail below with reference to the drawings. Note that the following embodiments are not intended to limit the present disclosure. Moreover, the respective drawings referred to in the following description only schematically illustrate shapes, sizes, and positional relationships to enable understanding of contents of the present disclosure. That is, the present disclosure is not limited to the shapes, the sizes, and the positional relationships illustrated in the respective drawings.
The noise reduction apparatus 1 illustrated in
The first converting unit 2 generates first data that is obtained by converting first audio data externally input into amplitude data per frequency by Fourier transform under control of the control unit 7, and outputs this first data to the subtracting unit 3 and the arithmetic unit 5. For example, the first converting unit 2 performs discrete Fourier transform, fast Fourier transform, or the like as the Fourier transform. The first converting unit 2 is constituted of, for example, a discrete Fourier transform circuit, or a fast Fourier transform circuit. In the first embodiment, a case in which the discrete Fourier transform is performed as the Fourier transform is described.
The subtracting unit 3 calculates second data by subtracting noise data recorded in the recording unit 4 from the first data that is input from the first converting unit 2 under control of the control unit 7, and outputs the calculated second data to the arithmetic unit 5. The noise data is statistical data that is acquired by statistic calculation of a result acquired by subjecting audio data that has been acquired in an anechoic condition in advance to the Fourier transform. Specifically, for the noise data, a large value, such as a statistical value (for example, ave+2σ, ave+3σ, ave×1.5, ave×2, max value) that is acquired by performing the Fourier transform on plural pieces of audio data, or the like is chosen so that noise is not left without being subtracted.
The recording unit 4 is constituted of a flash memory, a synchronous dynamic random-access memory (SDRAM), or the like. The recording unit 4 includes a noise-data recording unit 41 that records noise data used for subtraction by the subtracting unit 3, and a program recording unit 42 that records various kinds of programs executed by the noise reduction apparatus 1.
The arithmetic unit 5 performs arithmetic to generate third data by synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3, and outputs this third data to the second converting unit 6. For example, the arithmetic unit 5 generates the third data by performing arithmetic of synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 at a predetermined ratio (for example, 1:1).
The second converting unit 6 generates second audio data by subjecting the third data input from the arithmetic unit 5 to inverse Fourier transform, and outputs it. The second converting unit 6 performs inverse discrete Fourier transform or inverse fast Fourier transform as the inverse Fourier transform. The second converting unit 6 is constituted of, for example, an inverse discrete Fourier transform (IDFT) circuit, or an inverse fast Fourier transform (IFFT) circuit. In the first embodiment, a case in which the inverse discrete Fourier transform is performed as the inverse Fourier transform is described. Note that the first converting unit 2, the subtracting unit 3, the arithmetic unit 5, and the second converting unit 6 may be configured respectively by using a digital signal processing (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), or the like, and to exert functions described above by reading the various kinds of programs recorded in the recording unit 4.
The control unit 7 controls respective parts constituting the noise reduction apparatus 1. The control unit 7 is constituted of, for example a DSP, an FPGA, or a central processing unit (CPU).
Next, processing performed by the noise reduction apparatus 1 is described.
As illustrated in
Subsequently, the subtracting unit 3 generates the second data by subtracting noise data that is recorded in the recording unit 4 from the first data input from the first converting unit 2 (step S2). When a subtraction result takes a negative value, the subtracting unit 3 handles the amplitude as 0.
Thereafter, the arithmetic unit 5 generates the third data by synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 (step S3).
Subsequently, the second converting unit 6 generates the second audio data by subjecting the third data input from the arithmetic unit 5 to the inverse Fourier transform, and outputs it (step S4). After step S4, the noise reduction apparatus 1 ends the processing.
According to the first embodiment described above, the arithmetic unit 5 generates the third data by synthesizing the first data input from the first converting unit 2 in which no musical noise has been generated and the second data input from the subtracting unit 3 in which no musical noise has been generated, or includes only a small amount of musical noise even if it has been generated and, therefore, it is possible to suppress generation of musical noise while suppressing deterioration of audio data.
Moreover, according to the first embodiment, as for the noise removal, for example, as shown by the curve L1 and the curve L2 in
Next, a second embodiment is described. The second embodiment is an electronic device that is equipped with the noise reduction apparatus 1 according to the first embodiment described above. Accordingly, in the following, a configuration of an electronic device according to the second embodiment is described, and then processing performed by the electronic device according to the second embodiment is described. like reference symbols are given to like components with the noise reduction apparatus 1 according to the first embodiment described above, and detailed description is omitted.
The audio input unit 101 receives input of voice to convert into an analog audio signal (electrical signal), and outputs this audio signal to the amplifier unit 102. The audio input unit 101 is constituted of a directive microphone, a stereo microphone, or the like.
The amplifier unit 102 amplifies the analog audio signal input from the audio input unit 101 with a predetermined amplification factor to output to the AD converter 103 under control of the control unit 113. The amplifier unit 102 is constituted of an amplification amplifier, or the like.
The AD converter 103 subjects the analog audio signal input from the amplifier unit 102 to AD conversion processing, to generate audio data with a predetermined bit number, for example, first audio data (quantized data) of 16 bits or 24 bits, under control of the control unit 113. The AD converter 103 is constituted of an AD converter circuit, or the like.
The gain adjusting unit 104 adjusts a gain of the first audio data under control of the control unit 113, to output to the noise reduction unit 105. The gain adjusting unit 104 is constituted of a gain adjuster circuit, or the like.
The noise reduction unit 105 subjects the first audio data input from the gain adjusting unit 104 to noise reduction processing of reducing noise, to output to the record processing unit 106 under control of the control unit 113. The noise reduction unit 105 includes the first converting unit 2, the second subtracting unit 3, the arithmetic unit 5, and the second converting unit 6 of the noise reduction apparatus 1 according to the first embodiment described above.
The record processing unit 106 stores the second audio data input from the noise reduction unit 105 in an audio file of a predetermined format to record it in the recording medium 107, under control of the control unit 113. The record processing unit 106 stores the second audio data in an audio file of either one of audio formats of, for example, MPs, WAV, AIFF, FLAC, MPEG4, and the like, to record in the recording medium 107. The record processing unit 106 is constituted of, for example, an audio codec, or the like.
The recording medium 107 is externally attachable to the electronic device 100, and stores audio file input from the record processing unit 106, or the like. The recording medium 107 is constituted of, for example, a memory card, or the like.
The volume detecting unit 108 detects volume of voice based on a voltage value of the analog audio signal input from the amplifier unit 102, and outputs this detection result to the control unit 113. The volume detecting unit 108 is constituted of, for example, a voltmeter, a voltage detection circuit, or the like.
The temperature detecting unit 109 detects temperature around the electronic device 100, and outputs this detection result to the control unit 113. The temperature detecting unit 109 is constituted of a temperature sensor, or the like.
The operating unit 110 accepts input of an instruction signal to instruct various kinds of operations relating to the electronic device 100, and outputs this accepted instruction signal to the control unit 113. The operating unit 110 accepts a start signal to instruct start of recording to the electronic device 100, an end signal to instruct end of recording, a switch signal to switch any one of plural modes (for example, record mode A, record mode B, and the like) in which the electronic device 100 may operate, an adjustment signal to adjust a gain of audio data, and the like. The operating unit 110 is constituted of a button, a switch, a toggle switch, a touch panel, and the like.
The recording unit 111 is constituted of a flash memory, a synchronous dynamic random-access memory (SDRAM), or the like. The recording unit 4 includes a noise-data recording unit 41A in which noise data used for subtraction by the subtracting unit 3 is recorded, and a program recording unit 42 in which various kinds of programs executed by the noise reduction apparatus 1 are recorded.
Referring back to
The display unit 112 displays various kinds of information relating to the electronic device 100 under control of the control unit 113. For example, the display unit 112 displays various kinds of modes, gain value, and the like relating to the electronic device 100 under control of the control unit 113. The display unit 112 is constituted of a liquid crystal or an organic electroluminescence (EL) display panels, or the like.
The control unit 113 performs overall control of the respective components of the electronic device 100. The control unit 113 is constituted of, for example, a CPU, an FPGA, an ASIC, or the like. The control unit 113 includes a determining unit 113a and a setting unit 113b.
The determining unit 113a determines (selects) a noise level that is used by the subtracting unit 3 from noise data recorded in the noise-data recording unit 41A. Specifically, the determining unit 113a determines a noise level when the gain adjusting unit 104 subjects the first data to gain adjustment based on the noise data recorded by the noise-data recording unit 41A and the gain adjusted for the first audio data by the gain adjusting unit 104. (refer to
The setting unit 113b sets a coefficient by which each of the first data and the second data is to be multiplied by the arithmetic unit 5 based on volume of the first audio data. Specifically, the setting unit 113b sets a coefficient by which each of the first data and the second data is multiplied by the arithmetic unit 5 based on a voltage value of the audio signal.
Next, processing performed by the electronic device 100 is described.
As illustrated in
Subsequently, when the operating unit 110 is operated and a start signal is input, and recording of a voice is started (step S102: YES), the determining unit 113a determines the mode of the electronic device 100 (step S103), and determines a recording level (gain) of the electronic device set according to operations of the operating unit 110 (step S104), and determines temperature based on a result of detection by the temperature detecting unit 109 (step S105). At this time, the audio input unit 101 captures recorded data (step S106), and the setting unit 113b sets a coefficient according to volume detected by the volume detecting unit 108 (step S107). Specifically, as illustrated in
As illustrated in
Referring back to
At step S108, when a noise level to be used by the subtracting unit 3 is changed (step S108: YES), the electronic device 100 shifts to step S116 described later. On the other hand, when a noise level to be used by the subtracting unit 3 is not changed (step S108: NO), the electronic device 100 shifts to step S109 described later.
At step S109, the first converting unit 2 generates the first data by subjecting the first audio data that is input from the gain adjusting unit 104 to the Fourier transform to convert into amplitude data per frequency, and outputs this first data to the subtracting unit 3 and the arithmetic unit 5. Specifically, as illustrated in
Thereafter, the subtracting unit 3 generates the second data by subtracting noise data according to a result of determination by the determining unit 113a from the first data (the first data a) input from the first converting unit 2 (step S110). The arithmetic unit 5 generates the third data by synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 by multiplying by the coefficients set by the setting unit 113b, respectively (step S111). Specifically, as illustrated in
γ=(first data×α+second data×β)/(α+β) (1)
After step S111, the second converting unit 6 converts the third data into audio data by subjecting the third data input from the arithmetic unit 5 to the inverse Fourier transform (step S112). Specifically, as illustrated in
Subsequently, the control unit 113 causes the record processing unit 106 to record the audio data generated by the second converting unit 6 in the recording medium 107 (step S113).
Thereafter, when an amount of audio data captured by the audio input unit 101 reaches a predetermined amount (step S114: YES), the electronic device 100 shifts to step S115 described later. On the other hand, when an amount of audio data captured by the audio input unit 101 has not reached the predetermined amount (step S114: NO), the electronic device 100 returns to step S109 described above.
At step S115, when the operating unit 110 is operated and an end signal to end recording is input (step S115: YES), the electronic device 100 ends this processing. On the other hand, when the operating unit 110 is operated, and an end signal to end recording has not been input (step S115: NO), the electronic device 100 returns to step S113 described above.
At step S116, when volume detected by the volume detecting unit 108 varies, the setting unit 113b updates the coefficient used when synthetizing the first data and the second data by the arithmetic unit 5 according to the volume. After step S116, the electronic device 100 shifts to step S109. In this case, as illustrated in
According to the second embodiment described above, the arithmetic unit 5 generates third data by synthesizing first data that is input from the first converting unit 2 and second data that is input from the subtracting unit 3 by multiplying by coefficients of predetermined ratio, respectively, to assign weights and, therefore, occurrence of noise and musical noise may be suppressed without deteriorating audio data.
Moreover, according to the second embodiment, the arithmetic unit 5 assigns weights to the first data that is input from the first converting unit 2 and the second data that is input from the subtracting unit 3 by multiplying by coefficients set by the setting unit, respectively. Accordingly, third data in which changes are smooth may be generated without awkwardness, and noise may be suppressed.
Furthermore, according to the second embodiment, the subtracting unit 3 removes noise data from first audio data by using a noise level determined by the determining unit 113a. Accordingly, noise that occurs according to environments, such as temperature, mode, and gain, may be suppressed.
In the second embodiment, the volume detecting unit 108 detects volume based on a voltage value of an audio signal that is output from the amplifier unit 102, but not limited thereto, for example, the volume detecting unit 108 may detect volume based on a digital value of digital audio data that is output by the AD converter 103 as illustrated in
Next, a third embodiment is described. The third embodiment has the same configuration as the electronic device 100 according to the second embodiment described above, but processing performed therein is different. Specifically, in the third embodiment, while capturing audio data in real time, the first data and the second data are synthesized, updating the coefficients by which the first data and the second data are multiplied in real time. In the following, processing performed by an electronic device according to the third embodiment is described. Like reference symbols are given to like components with the electronic device 100 according to the second embodiment described above, and detailed description is omitted.
In
At step S208, the audio input unit 101 captures audio data. Specifically, as illustrated in
Step S209 to step S213 correspond to step S109 to step S113 in
According to the third embodiment described above, the arithmetic unit 5 may generate the third data in which changes are smooth because the arithmetic unit 5 assigns weights by multiplying the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 by coefficients sequentially set by the setting unit 113b, respectively, and noise may be suppressed without awkwardness.
Furthermore, according to the third embodiment, the arithmetic unit 5 generates the third data by synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 by multiplying coefficients of a predetermined ratio, respectively, to assign weights and, therefore, occurrence of noise and musical noise may be suppressed without deteriorating audio data.
Next, a fourth embodiment is described. The fourth embodiment has the same configuration as the electronic device according to the third embodiment, but processing performed therein is different. Specifically, the electronic device 100 according to the third embodiment described above sequentially changes coefficients by which the first data and the second data are respectively multiplied in real time, but in the fourth embodiment, coefficients by which the first data and the second data are respectively multiplied are changed every predetermined time. In the following, processing performed by an electronic device according to the fourth embodiment is described. Like reference symbols are given to like components with the electronic device 100 according to the third embodiment described above, and detailed description is omitted.
In
At step S316, when predetermined time has elapsed since recording is started (step S316: YES), the electronic device 100 returns to step S305 described above. On the other hand, when the predetermine time has not elapsed since the recording is started (step S316: NO), the electronic device 100 returns to step S308 described above. In this case, as illustrated in
According to the fourth embodiment described above, the setting unit 113b determines coefficients every predetermined time, and the arithmetic unit 5 assigns weights to the first data that is input from the first converting unit 2 and the second data that is input from the subtracting unit 3 by multiplying by the coefficients set by the setting unit 113b, respectively. Accordingly, the third data in which changes are smooth may be generated, and noise may be suppressed without awkwardness.
Moreover, according to the fourth embodiment, the arithmetic unit 5 generates the third data by synthesizing the first data input from the first converting unit 2 and the second data input from the subtracting unit 3 by multiplying by coefficients of a predetermined ratio, respectively, to assign weights and, therefore, occurrence of noise and musical noise may be suppressed without deteriorating audio data.
By combining the components disclosed in the first to the fourth embodiments described above as appropriate, various embodiments may be formed. For example, some components out of all of the components described in the first to the fourth embodiments described above may be excluded. Furthermore, the components described in the first to the fourth embodiments described above may be combined as appropriate.
Moreover, in the first to the fourth embodiments, “unit” may be read as “means” or “circuit”. For example, the control unit may be read as control means or control circuit.
Furthermore, the programs that cause the noise reduction apparatus or the electronic device according to the first to the fourth embodiments may be recorded in a computer-readable recording medium, such as a compact-disk read-only memory (CD-ROM), a flexible disk (FD), a compact-disk rewritable (CD-R), a digital versatile disk (DVD), a universal serial bus (USB) medium, and a flash memory, in a form of file data of an installable format or an executable format, to be provided.
Moreover, the programs that cause the noise reduction apparatus or the electronic device according to the first to the fourth embodiments may be configured to be stored in a computer connected to a network, such as the Internet, and to be provided by being downloaded through the network. Furthermore, the programs that are executed by the noise reduction apparatus or the electronic device according to the first to the fourth embodiments may be provided or distributed through a network, such as the Internet.
In the description of the flowcharts in the present disclosure, a sequence of processing among steps has been expressed by using expressions, such as “first”, “thereafter”, and “subsequently”, but the sequence of processing necessary for implementing the present invention is not determined uniquely by those expressions. That is, the sequence of processing in the flowcharts described in the present disclosure may be changed within a range not causing a contradiction. Moreover, not limited to such a program configured with simple branch processing, branching may be performed by generally determining more determination items.
As above, the embodiments have been described in detail based on the drawings, but these are only examples, and the present invention may be implemented by other embodiments in which various modifications and improvements are made based on knowledge of those skilled in the art, including forms described in a section of disclosure of the present invention.
As described, the present disclosure may include various embodiments not described herein, and various alterations in design and the like may be made within a range of specific technical ideas.
Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the disclosure in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2018-079900 | Apr 2018 | JP | national |